Dynamic models as tools for forecasting and planning

2 downloads 0 Views 2MB Size Report
pointed out by Howrey and Keleijan [17]. I will retum to this topic in connection with the evaluation of stochastic dynamic models. With the aid of a computer the ...
PETER GXRDENFORS

D Y N A M I C M O D E L S AS T O O L S F O R F O R E C A S T I N G AND PLANNING:

A PRESENTATION

METHODOLOGICAL

AND SOME

ASPECTS*

I. INTRODUCTION Many of the mathematical models which have proved to be most useful as tools for forecasting and planning can be described as dynamic models. A dynamic model consists essentially of a set of equations where the variables included in the equations can be divided into input variables (or exogenous variables) and state variables (or endogenous variables). Dynamic models can be used to combine, in a strictly mathematical manner, an analysis of causes and effects, with or without time-lags, and various feed-back relationships. The abstract definition of a dynamic model has grown out from various applications within control theory. But also within other areas dynamic models have been used to model processes. Above all, many macroeconomic models are of this kind, but applications can be found also in other social and biological sciences. As regards the classical applications within control theory, the equfitions which describe the connections between the variables in the model can be obtained from some well-developed theory, viz. some branch of physics. However, when the applications within the 'soft' social sciences are considered, there is no such generally accepted theory from which the mathematical formalism can be acquired. When a dynamic model is used to produce forecasts within an area of this kind, the forecasting process is not the methodologically most difficult part of the work, but it is rather the model construction, i.e., the description of the dynamic model, that requires the greatest caution and the soundest judgments. There is always some portion of fortuitousness in the choice of equations in, for instance, macroeconomic models. A central problem, when judging a forecast obtained from such a model, is thus how much the uncertainty about the form of the equations influences the accuracy of the forecast. It would be an exaggeration to maintain that the dynamic models which Theory and Decision 14 (1982) 237-273. 0040-5833/82/0143-0237503.70. Copyright 9 1982 by D. ReideI Publishing Co., Dordrecht, Holland, and Boston, U.S.A.

238

PETER G.~RDENFORS

are used within the social sciences are applications of the mathematical control theory and that the models can be compared to steam engine governors, autopilots etc. The reason is that the methodological problems connected with forecasting models are quite different from the problems connected with physical applications of control theory. Much of the mathematical apparatus which belongs to the engineer's tool kit is simply irrelevant to applications within the 'soft' sciences. On the other hand, problems of equation choice and parameter estimation do not bother the engineer to the same extent as when constructing forecasting models. This argument does not prevent that some of the conceptual framework which has developed within control theory may be useful also when nonclassical applications are considered. For instance, in connection with the evaluation of forecasting models I will emphasize the methodological importance of the concepts of stability and sensitivity of a model. I believe that, when social science applications are considered, these aspects have been swept under the carpet in favour of statistical concepts of 'goodness of fit', etc. One of the main reasons why dynamic models have become so frequently used as tools for forecasting and planning is that much of the heavy work in connection with model construction and model analysis can be done by a computer. The computer can be used when estimating equation parameters, when analysing the dynamic properties of the model by simulation, when forecasting and when planning with the aid of the model. A number of user directed programs (and even programming languages) have been specially constructed for simulating and analysing dynamic models. When studying the use of dynamic models in forecasting and planning it is easy to be swamped with mathematical analysis and statistical hair-splitting. Here I will try to keep my head above the surface, but still look for what is going on in the depth. The paper will mainly be a surveyal presentation of dynamic models and their use in forecasting and planning. This will be complemented with some methodological remarks and recommendations in connection with model construction and model evaluation.

TOOLS FOR F O R E C A S T I N G AND P L A N N I N G

239

II. DYNAMIC MODELS

1. Definitions of Deterministic Dynamic Models The abstract definition of a dynamic model has developed from various ways of analysing and synthesising control systems. The methods used when constructing centrifugal governors for steam engines, electronic amplifiers, autopilots, etc. have resulted in an abstract mathematical theory where the basic 'objects' of study are models of dynamic systems. In this section I will introduce some basic concepts from control theory and give a definition of a discrete deterministic dynamic modelJ Some examples of dynamic models will also be outlined.

A. Basic Concepts A fundamental postulate is that the system or process which is to be modelled can be described by a set of state variables taking real numbers as values. These variables are functions of time and they will be denoted by xl(t), x2(t) . . . . . The state of a system at a time to is identified with the set of values xl(to), x2(to) , . . . . In this paper only models with finitely many state variables will be discussed. The state of a model can then be described by a vector 2(t) = (xl(t), x2(t) . . . . . Xn(t)). The set of all possible states of a dynamic model is called the state space. The first state during the time interval which is considered is called the initial state. Normally, a system can be controlled by external factors. It is assumed that the control can be characterised by a set of real-valued input variables (or exogenous variables). These variables are also functions of time and will be denoted ul(t), u2(t) . . . . . Here, only finitely many input variables will be considered, which then can be described by a vector ti(t) = (ul(t), u 2 ( t ) , . . . , urn(t)). The set of all possible input vectors is called the input space.~ In a deterministic model the state of the model is uniquely determined by the input vector together with earlier and present values of the state variables. If the earlier and present values of the state variables have no influence on the state of the model, but this is determined from the present values of the input variables only, then the model is said to be static. In a stochastic model the input vector and the earlier and present states of the model determine a probability distribution over the state space. The time variable is either characterised by a continuous line or by a

240

PETER G)~RDENFORS

discrete set o f points o f time. In the first case, the system is said to be con-

tinuous, and in the second, discrete. The central part in the definition of a dynamic model is to describe how the state o f the model depends on the time variable and the input and state variables. In this paper I shall concentrate on the case when the state variables are differentiable functions of time. The change of the state of the model can then be defined as a function of the previous state 2 ( t -- 1), the present state ~(t), the present input vector fi(t) and the time t. For continuous models the changes can be characterised by first order differential equations and for discrete models first order difference equations can be used.

B. Discrete dynamic models Next, a more systematic definition of a discrete dynamic model will be given. DEFINITION 1. A discrete deterministic dynamic model consists of: (i) a set T which is a set of consecutive natural numbers to, tl, t2, 9 9 9 representing the points of time; (ii) a set X which is a continuous subset of R n, representing states of the system. The number of state variables is n and an element in X, a state, is written Yc = (xl, x2 . . . . . Xn); (iii) a set U which is a subset of R m, representing the input space. The number of input variables is m and an element in U, an input vector is written 17 = ( u l . . . .

,Urn);

(iv) a set Y which is a subset o f R k, representing observation vectors. The number of observation variables is k and an element in Y, an observation vector, is written)7 = (Yl . . . . , Yk); (v) for each state variable xi(t), a function fi from X x U x T to R which is continuously differentiable with respect to time such that

xi(t)

= k(~(t-

1),~(t), •t), t). []

A special case o f this definition is when xi(t ) can be described as a function of 2 ( t -- 1) and t~(t) only. This case is of interest in many respects, and if the equations of a model are written in this form one says that the model is given a state-space description or that the model is given in reduced form. A definition of a continuous dynamic model would be quite similar except that the points of time in (i) would be replaced by a time interval and

TOOLS FOR FORECASTING AND PLANNING

241

that the difference equations in (v) would be replaced with differential equations. The rather abstract definition of a discrete deterministic dynamic model which has been given here will now be illustrated by two examples.

Example 1. A very simple macroeconomic model will be outlined and it wiP be shown how it can be given a state-space description. The model contains four variables: aggregate consumption C(t), gross investment I(t), gross national product Y(t), and government spending G(t). These variables are related by the following three equations: (1)

C(t) = al + a2Y(t -- 1);

(2)

I(t)

(3)

r ( t ) = C(t) + I(t) + G(t).

=

b~ + b 2 ( r ( t

-

1) -- Y ( t - - 2));

In these equations as, a2, bl and b2 are parameters. 2 A good starting point when reformulating these equations as a state-space description is to sort up the variables. According to economic terminology C, I and Y are endogenous variables while G is an exogenous variable. Exogenous variables correspond to input variables in a dynamic model, and G is thus the only input variable in the desired dynamic model. The Equations (2) and (3) do not quite fit the pattern of a state-space description since they do not only contain values of state variables at t - 1, but also at t and t - - 2. We can get away from this deviation by some manipulations of the Equations (1)-(3). If we put in (1) and (2) in (3), we obtain (4)

Y(t) = a 1 + a 2 Y ( t - - 1 ) + b l + b 2 Y ( t - - 1 ) - - b 2 Y ( t - - 2 ) + G ( t ) .

From (1) it follows that (5)

Y(t--2) = 1C(t_ a2

1)

al a2"

If we now put (5) in (2) and (4) we get finally

(6)

I(t) = bl + b2ala2 + b 2 Y ( t - - 1) --~22 C(t-- 1);

and

242 (7)

PETER G~.RDENFORS

Y(t) = a i + b l + b 2 a l +(a2+b2)Y(t--1)--b-~2C(t--1) a2 +

a2

G(t).

The form of the Equations (1), (6) and (7) now accords with the requirements of a state-space descdption. []

Example 2. The 'world models' which have been constructed by, among others, Forrester [11] and Meadows et al. [23] can easily be rewritten as discrete dynamic models. These models do not explicitly contain any input variables, but the value of a state variable at t is determined solely from the state of the model at t - 1. On the other hand, some of the parameters that appear in the model equations can be interpreted as input variables and with the aid of this it is possible to perform simple policy experiments. In Forrester's model the state of the model is described by five state variables, viz., population, natural resources, pollution, capital investment, and capital-investment-in-agriculture fraction. Besides these state variables the model contains a number of auxiliary variables which are introduced by static equations. The model constructed by Meadows et al. contains eleven state variables and also here a large number of auxiliary variables. All state variables are assumed to be measurable, but in many cases the high level of aggregation,, makes this assumption seem very artificial. For example, Forrester measures natural resources in 'natural resource units' and pollution in 'pollution units'. The equations of Forrester's model are well documented in [ 11 ]. In Limits to Growth, the equations which have been used when simulating the model are not explicitly stated, but these have been presented in a technical report ([24]) which has been published only afterwards. []

C. Classifying Dynamic Systems A model with input variables is said to be autonomous (or closed). The world models presented in Example 2 are autonomous models. A model is said to be time invariant, if the functions f l . . . . , fn and gl . . . . . gh do not depend on t. In this paper only time invariant models will be considered. A model is said to be linear, if the functions f l . . . . , fn and g~. . . . . g~ are linear in the state variables, the control variables and t. The model in Example 1 is linear, while both Forrester's and Meadows's world models contain non-linear equations.

TOOLS

FOR FORECASTING

AND PLANNING

243

2. Some Comments on the Definitions o f Dynamic Models A. Classifying Variables Although some variable may be classified as an input variable in a dynamic model it is not always possible for human agents to have any influence to 'control' the values of the variable; or sometimes the 'control' may be very restricted. For example, in a dynamic model which is intended to model agricultural production, meteorological variables as temperature and precipitation may occur as input variables. For this reason, we can divide the input variables of a model used for forecasting and planning into policy variables (or control variables), which are controlled by those who use the model when forecasting and planning, and external variables, which are not controlled. The borderline between these kinds of variables is not sharp since some variables may be controlled only to a limited degree. In econometric models a distinction is made between endogenous and exogenous variables. Exogenous variables correspond to input variables in the definition of a dynamic model. Endogenous variables correspond to state variables and other auxiliary variables which may be introduced by static equations or identities. B. Higher Order Equations The equations used in (v) of Definition 1 are first order difference equations. It is now easy to give examples from models in physics or econometrics where higher order difference (or differential) equations are exploited. For this reason, the requirement that when formulating dynamic models only first order equations may be used seems, a priori, as a serious restriction. It is, however, easy to show that this requirement does not involve any real restriction. By introducing an appropriate number of new state variables it is possible to reduce a higher order equation to a set of first order equations. Suppose, for example, that we are given a third order difference equation of the form x ( t ) = f ( x ( t -- 1), x ( t -- 2), x ( t -- 3), t~, t).

If we now introduce two variablesy(t) and z(t) which represent x ( t - 1) and x ( t -- 2) respectively, then the equation above can be replaced by the following set of first order equations: x(t) = f(x(t--

1 ) , y ( t - - 1 ) , z ( t - - 1),K, t);

244

PETER G ~ R D E N F O R S y ( t ) = x ( t - - 1); z(t) = y ( t - - 1).

If all equations are linear (and in many other cases), then it is, in this way, possible to present all equations in accordance with (v) of Definition 1. C. Stochastic Models A generalisation of Definition 1 is to relax the assumption that the model e q~Ja~.ms are deterministic and introduce stochastic equations which will then give us stochastic dynamic models. In continuous stochastic models the equations are so-called stochastic differential equations. It would take me too far to present here this kind of equations and their analysis. The interested reader is referred to Astr6m [4]. The equations used in discrete stochastic systems are called stochastic difference equations. These equations have the following form: xi(t) = f i ( x ( t -- 1), x(t), u(t), t) + v i ( x ( t - 1), x(t), u(t), t). Here v i is supposed to be a random variable with fixed mean. Sometimes it is assumed to have certain additional statistical properties as being normally distributed, having some fixed standard deviation, being serially uncorrelated, etc. Another form of stochastic equations is obtained if the input variables are regarded as stochastic variables. For the solution of stochastic difference equations and further analysis of stochastic models the reader is referred to Astr6m [4]. I will return to stochastic dynamic models in connection with simulation and when analysing forecasts made with the aid of dynamic models. 3. Analysing Dynamic Models There are two main ways of studying the properties of a dynamic model. The first way is to search for analytic solutions to the set of equations given by the model. An analytic solution consists of l~mding, for each state variable, an equation which expresses the state variable as a function of the time and of the input variables (and with the initial value as a parameter), but which is independent of all state variables, including earlier values of the variable itself. From these equations the interesting properties of the model can then be derived.

TOOLS FOR F O R E C A S T I N G AND P L A N N I N G

245

The second way to study a dynamic model is simulation, By providing initial values to the state variables and values to the input variables during some subsequent time interval, one can use the equations in a deterministic model to compute the values of the state variables at all points of time in the interval. By varying the initial values, the values for the input variables and the equation parameters one can then obtain some information about the dynamic properties of the model. I will now turn to a more detailed discussion of these two methods of analysing dynamic models.

A. Analytic Solutions The goal, when looking for the analytic solution to a set of model equations, is to 'solve' these equations so that the references to the state variables in the equations are eliminated. In deterministic models the equations are ordinary difference (or differential) equations. When solving these kinds of equations we can rely on standard classical techniques.

Example 3. Let us assume that the following difference equation is one of the equations in a discrete dynamic model:

x(t) = 2 + 0 . 9 " x ( t - - 1). This equation is a purely autoregressive equation since the new value of the variable depends only on an earlier value of the same variable and neither on time nor on any input variable. If we put X(to) = Xo, the solution to this equation is given by

x(t) = (Xo-- 20)" 0.9(t -to) + 20. If xo > 20, x(t) will describe a curve which decreases exponentially towards the limit value 2 0 ; i f x o < 20, the curve will increase towards 20. [] If the model is linear and if the input variables are ignored in the sense that their values during the time interval are assumed to be constantly zero, then there is a general method of solving a differential or difference equation which essentially involves finding the roots of the so-called characteristic equation of the model equation. With the aid of these roots the transient part of the solution of the model equations is obtained. If now each input variable varies according to some simple function u(t), it may be possible to find some particular solution to the model equation. The complete solution

246

PETER G ~ R D E N F O R S

is then obtained by adding the transient part to the particular solution (and fixing the arbitrary constants so that the initial conditions are satisfied). For many simple equations particular solutions are known and in these cases the analytical solutions are easy to obtain. However, as soon as the course of the input variables becomes more complex, it is in general extremely difficult, not to say impossible, to fred particular solutions, a For non-linear models there is no such general method but the equations in such a model have to be attacked by more or less ad hoc methods. A deterministic difference or differential equation for a state variable x(t) is solved if we know the values of x(t) for all t. The solution of a stochastic equation will not produce determinate values but probability distributions over the values of the state variable. Thus a stochastic equation for a state variable x(t) is solved if we know the probability distribution o f x ( t ) for all t. The solution of a stochastic difference equation will be a Markov process. It would take us too far to present here any solution methods for stochastic equations. The interested reader is referred to Astr6m [4].

B. Simulation It does not take many variables and complicated functions for the input variables before it becomes overwhelmingly difficult to find the analytic solution even for a linear deterministic model; and as soon as non-linearities appear in the equations or stochastic equations are considered, the search for an analytic solution is, from a practical point of view, hopeless. In these situations other means for analysing dynamic models must be exploited; and here computer simulation techniques offer great possibilities. A simulation of a dynamic model during a time interval T consists of supplying initial values for the state variables and values for the input variables during T and then using the model equations to compute the values of the state variables for a number of points of time during T. As regards discrete deterministic models, the equations of which are given in reduced form, this process is computationally simple. From the initial values of the state variables, the values for the next period can be obtained immediately from the system equations. These new values can then be put back into the system equations and the values for the subsequent period can be computed, and so on. If the equations cannot be put in reduced form, then the computational

TOOLS FOR FORECASTING AND PLANNING

247

process may be more complicated. In this case one may have to rely on algorithms for solutions of equation systems where an approximative solution is computed by an iterative procedure. Among the most common are algorithms based on the Gauss-Seidel, Newton-Raphson and Runge-Kutta procedures. There is a danger in using approximate solutions. During a simulation, computed values for the state variables, which contain some approximation errors, are' used to compute new values for the state variables at a later point of time. Although the error at each step may be small, it may happen that these errors concur and magnify each other so that the simulated value of a state variable soon deviates considerably from the analytic value. For this reason some form of error analysis of the approximations used in a simulation is necessary. When stochastic models are considered the advantages of using simulation are even greater since analytic solutions are difficult to obtain and use for this kind of models. The inclusion of random variables in the model equations involves no computational problems. There are, however, some methodological problems in connection with simulation of stochastic models as is pointed out by Howrey and Keleijan [17]. I will retum to this topic in connection with the evaluation of stochastic dynamic models. With the aid of a computer the simulation methods can be programmed and the simulations performed rapidly and efficiently. A number of special programming languages have been developed with the particular purpose of simulating dynamic models. 4 As input to such a program one gives a model description, essentially consisting of edited versions of the difference (or differential) equations, together with initial values for the state variables and values for the input variables during the simulation interval. The program then performs the simulation and produces as output either tables of values of the state variables during the simulation interval, or plotted curves describing how the state variables vary with time.

Example 4. The curves which can be found in the books by Forrester and Meadows et al. are the results of a number of computer simulations of their world models. Since these models are autonomous only initial values for the state variables are necessary as input when the system equations are described.

248

PETER G A R D E N F O R S

The programming language which has been used for the simulation process is DYNAMO. s [] If one is able to work with an interactive simulation program, it is possible to produce the state variable curves directly on a screen. One can then comfortably see the effects on the dynamic behaviour of the model of changes in initial values, parameters in the equations, and the values for the input variables. In this way one may, without knowing anything about the analytic solution of the model, work out a 'Fingerspitzengefuhl' for the dynamic properties of the studied model, even if it is quite complex.

C. Stability and Sensitivity A central problem within control theory is whether a system is stable or not. In an unstable system a small disturbance may, for example, lead to selfoscillations which reinforce and the system collapses. The disturbances may originate from different sources. The initial conditions for a system may not be set accurately, which will lead to a different course of the system than the desired one. Another type of disturbance occurs if the equations in the model used to represent the system are not exact because of error in the equation parameters or because some aspect of the modelled system is neglected or simplified when the equation is formulated. A third type occurs if the control variables cannot be perfectly controlled but are subject to some error. An important question is now how these kinds of disturbances affect the course of the state variables in general and the output variables in particular. For some of the possible kinds of disturbances of a dynamic model criteria of stability have been formulated. The first definition of stability is connected with disturbances in the initial values of a dynamic model. The basic idea is that small disturbances in the initial values will only lead to small changes in the course of the state variables, if the model is stable. DEFINITION 2. A dynamic model with given initial values at to for the state variables is said to be Lyapunov stable if and only if for every 8 > 0 there is some e > 0 such that whenever an initial value is changed at to by an amount the absolute value of which is smaller than 8, then the changes of the values of the state variables for all points of time t after to never exceed e. []

TOOLS FOR FORECASTING AND PLANNING

249

Example 5. I return to the simple macroeconometric model which was presented in Example 1. If the equation for Y(t) is solved as a function of lagged values and the control variable G(t), one obtains the following equation: Y ( t ) - (a2 + b~)" Y ( t -- l) + b2" Y ( t -- 2) = al + bl + G(t).

Now suppose the initial values for this equation are Y(0) and Y(1). For simplicity, it will be assumed that G(t) is kept constant and equal to (al + bl) during the relevant time interval. 6 Thus we wilt be studying the behaviour of the homogenous equation

-

-

Y ( t ) -- (a2 + b2)" Y ( t -- 1) + b2" Y ( t -- 2) = 0

under disturbances of the initial conditions. The analytic solution to this second order difference equation, under the given initial conditions, is (unless as = 2x/b2 -- b2) Y ( t ) = Ax" 3`I + A2" 3`t2

where Aa = (3`2Y(0) -- Y(1))/(3`2 -- 3`1), A2 = (Y(1) -- 3`aY(0))/(3,2 -- 3`a) and where 3`1 and 3`2 are the roots of the characteristic polynom 3`2_(a 2 + b 2 ) . 3 ` + b 2 = 0 of the homogenous equation. (If a2 = 2x/b2 -- b2, then 3`1 = 3`2 and the solution is then Y ( t ) = Y(O)" 3`t, which entails that Y(1) = 3,1" Y(0).) The dynamic behaviour of Y(t) depends on the values of 3`1 and 3`2 and thereby on the values of a2 and b2. For some values of 3`1 and X2, Y(t) will converge towards 0 without oscillations (independently of the values of Y(0) and Y(1)) and the model described by the equation above will be Lyapunov stable. For other values, Y ( t ) will converge towards 0 with damped oscillations and also be Lyapunov stable. For still other values of 3`I and X2, Y(O will grow exponentially, with or without oscillations, and in these cases the model is not Lyapunov stable] [] It can be shown that if a linear time invariant dynamic model is Lyapunov stable for some initial values of the state variables, then it is stable for all initial values. The stability of a linear time invariant dynamic model is thus a property of the model, and not only a property referring to a particular set of initial conditions.

250

PETER G)kRDENFORS

Another concept of stability is introduced in the following definition: DEFINITION 3. A dynamic model is input-output stable if and only if limited values for the input variables result in limited values for the output variables. [] It can be shown that if a linear time invariant dynamic model where all output variables are state variables is Lyapunov stable, then it is also inputoutput stable. Input-output stability covers disturbances in the input variables, since such will only have a limited effect if the system is input-ouput stable. In a sense, this criterion also covers disturbances in the equation specifications since these can be seen as changes in the values of the input parameters. As regards linear time invariant models the result mentioned above shows that if a model is stable under disturbances of initial values, then it is stable under other kinds of disturbances as well. When investigating the dynamic properties of a linear economic model, the method most frequently used is that of examining the multipliers associated with the model's exogenous variables. If the model is stable then a one-period increase in the value for one of the input variables would result in limited changes in the following values of the state variables. These changes are called the dynamic multipliers of the input variable. The first-period change of a state variable is called the impact multiplier of the variable. The long-run multiplier of a state variable is the sum of all the dynamic multipliers over time. The multipliers of a model often give a good picture of the dynamic properties of the model, but it should be noted that as regards non-linear models all questions concerning the stability and sensitivity of such a model are not settled by multiplier analysis. Another important part of the study of a dynamic model is the sensitivity analysis where the goal is to find out how sensitive the model is to different kinds of changes. This kind of analysis is, of course, closely related to the analysis of the stability of the model. The most important aspects are to analyse the model with respect to (i) changes in initial values for the state variables, (ii)changes in the parameters in the model equations, and 0ii) changes in the values of the input variables. The sensitivity of a factor in a model is a measure of how much a change in the factor is magnified when

TOOLS FOR F O R E C A S T I N G AND P L A N N I N G

251

comparing the behaviour of the state variables before and after the change. If it is possible to compute an analytic solution to the equations of a dynamic model, then it is possible to answer all questions concerning sensitivity from this solution. If simulation is used, the sensitivity of a model can be determined by running through various combinations of changes in initial values, equation parameters, and input variables. I will return to this kind of analysis in the sequel since the sensitivity of a dynamic model is an important feature when evaluating the adequacy of using the model as a tool for forecasting. III. F O R E C A S T I N G AND P L A N N I N G WITH DYNAMIC MODELS

1. Model Construction When one wishes to use a dynamic model as a tool for forecasting and planning the most difficult part, from a methodological point of view, is not how to do the forecasting or planning with the model, but how to construct the model. In this section I will dis.cuss some methodological issues connected with the choice of variables in the model, the choice of the form of the model equations, and the estimation of the equation parameters.

A. Choice of Variables The 'normal' procedure when constructing a dynamic model as a model of a real process consists roughly of the following main steps: (i) choose the set of variables to be included in the model and divide them into input and state variables; (ii) formulate the system equations (at least with respect to 'form' and which variables should be included in each equation); and (iii) use available data for the state and input variables to estimate the parameters in the equations. I do not intend to say that one of these steps must be finished off before the next may be commenced. In practice one often starts with a rough work on step (i) and formulates some preliminary equation in step (ii) to obtain some first estimation results and then returns to step (i) or (ii) to search for less approximate solutions. It is hardly possible to formulate any general rules for which variables should be included in a dynamic model as state variables or as input variables. For this problem any form of theory of the process one wishes to construct a model for may be useful. Theoretical considerations may result in anything

252

PETER G ~ R D E N F O R S

from well corroborated mathematical formulas to loose speculations about causal connections. A variable, for which one intends to obtain a forecast, is never an isolated phenomenon. It is part of some causal structure and is influenced by other variables and the variable also influences other factors itself. It may be a part of causal cycles, i.e., feed-back loops. A first delimitation when constructing a model is to find some causal network, which includes the variables to be forecasted, but which is relatively isolated in the sense that variables 'within' the network, i.e., those which are potential state variables, do not influence the variables 'outside' the network, i.e., potential input variables, in any essential way. This causal network can be chosen more or less inclusive. To make the demarcation one has to weigh the completeness and richness of detail against perspicuity and manageability. Another problem when selecting the set of variables for a dynamic model is the level of aggregation. For the construction of a macroeconometric model we may have available sufficient data for a large number of variables as individual consumers, individual entrepreneurs etc. It might be possible to write out various kinds of relationships between these magnitudes as model equations and thus obtain an extremely detailed model. Such a dynamic model, however, would have a rather limited practical interest. In order to explain and predict the behaviour of macroeconometric variables, it is necessary to deliberately disregard a considerable amount of details. Here, again, completeness and manageability must be weighed against each other. The advantage of a model which contains a large number of variables is that it may be possible to reach a closer representation of the 'real' process than with a model containing fewer variables. The main disadvantage is that things get complicated and much more work is needed; more model equations have to be specified, more equation parameters have to be estimated, and it may be difficult to collect sufficient data for all variables. Another, more fundamental, disadvantage is that if the theory for the process to be modelled is 'weak', i.e., not articulated in an exact way or not so well corroborated, then it is difficult to motivate the choice of model equations; and the more equations one has to construct, the more uncertain will thermodel as a whole be. Even if some variable has been chosen to be included in the model, one has to decide how the variable is to be represented in the model equations.

TOOLS FOR F O R E C A S T I N G AND P L A N N I N G

253

Sometimes the derivative of a variable may seem more appropriate than the variable itself or some functional transformation of the values of a variable may yield a representation which is easier to include in the equations. Even if the resulting models are, in a sense, equivalent, the choice of the representation of a variable may have devastating consequences for the outcomes of parameter estimation procedures. When choosing the control variables to be included in the model one is faced with the same problems as when choosing state variables. Since a dynamic model is often used as a tool for planning, the choice of control variables may even have political consequences. A dynamic model can be used to give recommendations of which values of the control variables to aim at, if a certain goal is to be attained. Since different control variables may be governed by different political institutions, the mere choice of control variables may reflect or influence a political power structure. When choosing input variables which are not policy variables, i.e., what was earlier called external variables, the accessibility of actual and future data for these variables is an important aspect. The theory of the process to be modelled may prescribe an input variable which, because of measurement problems or other data collecting problems, may be useless when the model is to be used for forecasting or planning. Actual and past data of input variables are necessary for the estimation of equation parameters and forecasted" future data are necesary when using the model to forecast future values of the state variables. If the forecast for an external variable is very unreliable, then this usually affects the reliability of the forecast for the state variables of the model; and it may then be preferable to replace the external variable with another for which there exists a more reliable forecast.

B. Choice of Equations Within areas where precisely formulated and well corroborated theories are available, as e.g. the classical theories of physics, the choice of model equations does not normally present any fundamental problems, since the equations are, on the whole, provided by the theory. On the other hand, within the 'soft' sciences, where the theoretical foundations are more shaky and less informative, the choice of model equations often is the most difficult part when constructing dynamic models. Even if an incomplete theory does not provide us with the exact model

254

PETER G ~ R D E N F O R S

equation for a state variable, it may nevertheless give some guidance as to which state and input variables are causally connected with the state variable under consideration and thus determine, at least, which variables ought to be essentially included in the equation. The theory may also indicate the 'form' of the equation, e.g., whether it should be linear, exponential or logarithmic in the variables. But frequently the information provided by the theory is scanty and one will then be forced to experiment with different types of equations. Within economic theory it is often assumed that all equations are linear in the variables. The main motivations for this assumption seem to be practical: Having linear equations allows you to, more or less mechanically, solve the system of equations analytically and thereby you will be able to study the dynamic properties of the model in a careful way; and, furthermore, the statistical techniques available for estimating linear equations are more thoroughly investigated, better developed and have more appealing properties than the corresponding methods for non-linear equations. It is also argued that the restriction to linear equations is not a serious shortcoming since linear equations (the linear terms of a Taylor expansion) are good approximations of the 'real' equations as long as the variations in the variables are relatively small. This argument is, naturally, of limited value. For many processes stochastic models seem more appropriate than deterministic ones. When formulating the equations of a stochastic model the choice of the error term involves another source of problems. In most applications the error term is simply added to the rest of the model equation. This error term is then furnished with various statistical properties (normally distributed, zero mean, no serial correlation with earlier errors, homoscedastic, etc.) depending on what the theory of the process says about the error and what data show. An important problem is how the error term in an equation is to be interpreted. One way is that the model equation for a state variable is not perfectly specified but, due to some idealisations or some unknown features of the modelled process, there will be a small residual difference between the true value of the state variable at a certain point of time and the value obtained from the equation at the same point of time. The residuals can be seen as the unexplained part of the value of the variable. Another way of interpreting the error term, which can be combined with

TOOLS FOR FORECASTING

AND P L A N N I N G

255

the first, is that some or all of the variables used in the equations are measured with error. F o r this kind of error, assumptions o f normal distribution, zero mean and no serial correlation between different error terms seem quite natural. If the stochastic parts o f the model equations consist only o f measurement errors, then the underlying process is essentially deterministic and the stochastic dynamic model may, if correctly specified, be replaced by a deterministic one. If, on the other hand, the error terms are principally residuals, then there is no reason, a priori, why they should have the statistical properties mentioned above. In many cases the theory of the process to be modelled does not give much guidance as to which form of the model equations is the appropriate one. In these cases one may acquire some information by studying the available data for the exponential trend information can be It is dangerous,

state variables. The curves may suggest a linear or an or there may be some cyclical movements; and this helpful when formulating the model equations. however, to let the apparent form o f the curves for the

state variables dictate the choice of the model equations. This danger was pointed out b y Haavelmo [15] so long ago that much of his argument needs be repeated: The degree of conformity between.., theoretical solution and the corresponding observed time series is used as a test of the validity of the model. In particular, since most economic time series show cyclical movements, one is led to consider only mathematical models the solutions of which are cycles corresponding approximately to those appearing in the data. This means that one restricts the class of admissible hypotheses by inspecting the apparent form of the observed time series. This condition.., may not even be a necessary condition, and its application may result in a dangerous and misleading discrimination between theories. (p. 312) The reason why inspection of data series may lead to prejudiced conclusions about the form of the model equation is, according to Haavelmo, that the whole question is connected with the type of errors we have to"lntroduce as a bridge between pure theory and actual observations. (p. 312) Haavelmo then discusses two kinds o f errors corresponding to the two interpretations of stochastic equations, i.e., measurement error and unexplained residuals, mentioned above.

256

PETER G X R D E N F O R S

The conclusion to be drawn from this discussion is that the form of a model equation should not be chosen only because the data series shows a certain pattern - there should be some theoretical reason, even if it is weak, for making the choice. This ends my discussion of the choice of model equations. The results of the search for a set of appropriate equations may not be univocal but one is often left with a number of equation forms which all are reasonable candidates for being included in the final model. The final choice among these equations is postponed until their ability to 'explain' data and their dynamic properties are investigated. C. Estimation o f Equation Parameters

Apart from state and input variables the model equations contain a number of parameters. Even if the mathematical form of the equation can be fully inferred from the theory of the process to be modelled, the actual value of the parameters are not derivable from theory alone. In very particular cases only it is possible to obtain the 'correct' values from data in combination with theory - in general one must be content with estimates. The theory of parameter estimation in multi-equation models is complex. There are a great number of statistical procedures available for the estimation of the parameters in a system of equations. Some of the more well-known methods are ordinary least squares, two-stage least squares, and fullinformation maximum likelihood. These methods are constructed to be applicable under certain assumptions concerning the error terms in the equations. It would, however, take us too far to present any of these methods here. There are no general rules for determining the choice of estimation procedure for a particular system of equations. Partly the choice depends on what is known about the statistical properties of the errors in the equations, partly it depends on the purpose for which the estimated system of equations is to be used. For example, the choice of estimation technique can have substantial effects on the dynamic properties of the estimated model. Example 6. Pindyck and Rubinfeld [29], pp. 329-330, present a simple

macroeconomic dynamic model. The equation for the consumption function C(t) is estimated as a linear function of C(t -- 1), G(t -- 1), Y ( t -- 2) and

T O O L S F O R F O R E C A S T I N G AND P L A N N I N G

257

R ( t -- 4), where G is government spending, Y is GNP, and R is a short-term interest rate. Using quarterly data during the period 1955-1970, the

equation was estimated by two-stage least squares combined with an autoregressive transformation to correct for serial correlations. The result was C(t) = 2.097 + 0.966C(t -- 1) + O.187G(t -- 1) --

-- 0.009Y(t -- 2) -- 0.609R(t -- 4). If the equation is estimated using ordinary least squares instead of the procedure above, then the result is: C(t) = 5 . 8 4 2 + 0 . 9 2 6 C ( t - 1)+ 0 . 4 3 7 G ( t - 1 ) -

O.lOOr(t

-

2) -

1.687R(t - 4).

The dominating part of both of these equations is the autoregressive term C ( t -- 1). The behaviour of the variable C is very sensitive to changes in the parameter for this term. So even if there seems to be little difference between the two values 0.966 and 0.926, the dynamic properties of the two equations are quite different. Which equation is 'best' cannot be determined by statistical techniques only, but the dynamic behaviour of the model as a whole must be considered when choosing the final estimates of the parameters. [] 2. Model Evaluation

When a dynamic model has been constructed its performance must be evaluated. This means that the values for the state variables produced by the system is to be compared to the 'real' values, past, present, or future, of the state variables. The better the values produced by the model fit the real values, the better is the model. But how is one to estimate the degree of fit between model output and actual values? Frijda [13] notes that There is hardly any methodology existing here. As much ingenuity as has been invested in the making of computer programs, as little has been spent on the assessment of their value. Next to high precision there always seem to be spots of rough approximations which undercut this very precision. We are left largely to our subjective impressions of what we consider good or bad correspondence. (p. 65) There exists a large number of criteria which can be used to evaluate dynamic

258

PETER G.g,R D E N F O R S

models. Here, I will not attempt at a systematic presentation but confine myself to a discussion of some of the most important criteria.

A. Goodness o f Fit The quantitative measure of how closely individual state variables track their corresponding data series that is most often used is called the root-meansquare (RMS) error. In order to define this measure for a discrete state variable x(t), we distinguish between xs(t), the value o f x ( t ) at t predicted by the given model under the given initial conditions, and Xa(t), the actual value of x(t) at t. DEFINITION 4. Let xs(to), Xs(tl ) . . . . . xs(tn) be a series of values obtained from a discrete dynamic model and let xa(to ), Xa(tl ), . . . , xa(tn) be the corresponding data series. Then the RMS error for this time interval is

x/Zi(xs(ti) -- xa(ti))2/(n + 1). [] The square root of the mean square error is taken in order to obtain a measure which has the same dimension as the predicted and actual values themselves. If a dimensionless measure is desired, one can use either the RMS percent error which is defined as

x/Zi(xs(ti) -- xa(tt)/xa(ti))2/(n + 1) or Theil's inequality coefficient as it is defined in [31], which is just a way of normalising the RMS error. All these measures can be decomposed into three parts. The first part, the error in central tendency, is zero if and only if the average predicted change from the mean of the predicted values coincides with the average actual change. The second part, the error due to unequal variation, is zero if and only if the standard deviations of predicted and actual changes are equal. The third term, the error due to incomplete covariation, is zero if and only if the covariance of predicted and actual values takes its maximum value. This kind of decomposition is often useful since each term refers to a particular kind of prediction error, s

B. The R 2 Measure Another measure which is often used as an estimate of the goodness of fit is

T O O L S F O R F O R E C A S T I N G AND P L A N N I N G

259

the R 2 measure. It measures the proportion of the variation of a discrete state variable x(t) which is 'explained' by the system equation. R 2 is often informally used to compare the validity of alternative specifications of system equations for a state variable. To define this measure we begin by defining the total variation of x(t) about its mean (or total sum of squares) as TSS = ~i(xa(ti) -- ~a) 2. Here, Xa(ti) are the actual values, as before, and xa is the mean of these values during the time interval t o , . . . , tn. The residual variation o f x ( t ) ( o r error sum o f squares) is ESS = "ti(xa(ti)-- xs(ti)) 2. As before, xs(ti) denotes the value of x(t) at t i produced by the model. Finally, the explained variation o f x ( t ) (or regression sum o f squares) is RSS

=

~i(xs(ti)-- ~a) 2.

Now, if the mean error of the predicted values is zero and the errors are perfectly uncorrelated with the variables in the equation, it can be shown that TSS = ESS + RSS. DEFINITION 5. The R 2 measure for a given discrete equation during the time interval to . . . . . t,, isR 2 = RSS/TSS. [] When the above mentioned equality holds, this is equivalent to R 2 = 1 -- ESS/ TSS. Thus R 2 is the proportion of the total variation in x(t) explained by the model equation. The R 2 ranges in value between 0 and 1, and is higher the better the xs(t) values correspond with the Xa(t) values. An R ~ value of 1 can occur only in case o f a perfect fit. A high R 2 value cannot be the only criterion to determine whether an equation describes some data 'correctly', since if further variables are added to the model equation, the R 2 value is never lowered and it is likely to be raised (the addition of a new explanatory variable does not alter TSS, but is likely to increase RSS). There must be additional reasons for including a variable in an equation - theoretical reasons or, if the theory is 'weak', tests for statistical significance.

260

PETER G ~ R D E N F O R S

In Gardenfors [14] the R 2 measure is further criticised as a measure of the explanatory value of a model equation. In that paper it is argued that the explanatory value of a forecasting equation is to be measured by the improvement of fit to the actual values obtained from the forecasting model in comparison to the values obtained from a 'naive' model. Here, 'naive' means that the forecast of a variable is determined without the aid of any other explanatory variables, as e.g. in a time series model. C. The t Statistic

A mistake that is difficult to discover when evaluating a system of dynamic equations is that some variable is missing in an equation. This problem occurs mainly when the theory of the process to be modelled does not give much guidance as to which variables have a direct influence on a given variable. For this reason it is easy to include too many variables in the first attempts at formulating the model equations and then later exclude some variables when the statistical analysis shows that they have no explanatory value. When the model is linear, one of the main tools in this statistical anaysis is the so called t statistic which, for each variable, determines whether the estimated parameter coefficient for the variable is significantly different from 0, i.e., that the null hypothesis that the coefficient equals 0 can be rejected with a certain degree of confidence. It is often demanded, for each estimated coefficient in a linear equation, that the t statistic should show that the null hypothesis can be rejected at the 5 percent level of significance. However, if this demand is followed too strictly, then it may very well happen that some important variables will be excluded from the model equations. If, for example, an input variable (say, oil price), which has a strong influence on the variable one wishes to explain; is constant or almost constant during the time interval which is used for the parameter estimation, then it is likely that the t statistic for the coefficient for this input variable is not large enough to warrant the rejection of the null hypothesis. If there are theoretical reasons for including a certain variable in an equation, my recommendation is that it should be included even if the t statistic for the corresponding parameter is low. It should be noted, however, that, in these cases, the estimated value of the parameter may not be reliable. Example 7. The following quotation from Pindyck and Rubinfeld [29], p.

379, illustrates the argument above:

TOOLS FOR FORECASTING

AND P L A N N I N G

261

Despite the fact that the estimated coefficient of the long-term interest rate is not significant at the 5 percent level, we do not drop the variable from the equation. Because we believe a priori that the interest rate variable provides the link between monetary policy and investment, it is necessary that we include an interest rate as one of the determinants of interest. [] The conclusion to be drawn is that one should not rely blindly on the statistical tools when choosing and specifying model equations. The parameter values, acquired by some estimation procedure, must be interpreted and their reasonableness checked against whatever theory is available.

D. Dynamic Properties When studying a 'real' process one acquires some intuitive knowledge about the dynamic properties o f the process. One may know that it is stable under quite general conditions and one may have a fairly detailed knowledge of how sensitive the process is to different changes in the input variables. When trying to represent such a process by a dynamic model it is important that the model has roughly the same dynamic properties. By using the definitions of stability presented earlier, it may be possible to express some o f the properties o f the model in a precise manner. But even if the available knowledge about the stability and sensitivity properties of a process cannot be expressed formally, it is nevertheless an important complement to the data series for the state and control variables when the tentative dynamic model is to be evaluated. Since a system o f equations, by including more and more variables, can be made to fit actual data series for the state variables arbitrarily well, the goodness o f fit cannot be the only critierion of the validity o f the model, but the dynamic properties also play an important part in the evaluation. Kmenta [21], p. 593, notes that The stability condition is clearly important from the economic point of view. It should also be realised that the existence of stability (or, at worst, of a regular oscillatory pattern) is, in fact, assumed in the process of estimation. The assumption that the predetermined variables of the system have finite variances as t-~ ~ applies also to the lagged endogenous variables, and this assumption would be violated ff the endogenous variables were to grow or decline without limit. If this assumption were not made, there would be difficulties in proving the asymptotic properties of estimators.

Example 8. Pindyck [28] aims at formulating an equation for inventory investment, which is denoted by I(t). As explanatory variables he chooses

262

PETER G~,RDENFORS

Y(t), disposable income, C(t), consumption, together with lagged values of I(t), Y(t) and C(t). The model is discrete and the equation is assumed to be linear in these variables. The following two equation forms are then selected for estimation: (i)

I(t)

= alY(t) + d~I(t--

(ii)

I(t)

+ bl(Y(t)

- - Y ( t - - 2)) + c l ( C ( t ) - - C ( t - - 2)) +

i) + k~ ;

= a 2 ( Y ( t ) - - Y ( t - - 1)) + b 2 ( C ( t ) - - C ( t - - 1)) + + e 2 I ( t - - 1) + k2.

The results of the estimations are: (i)

I ( t ) = 0.0113Y(t) + 0.4647(Y(t) - Y ( t - - 2)) - 0.6002(C(t) ---C(t-

(ii)

2)) + 0.4219I(t -- 1 ) - 2.4615;

I(t) = 0.8407(Y(t) -- Y ( t - 1)) -- 1.0087(C(t) -- C ( t - - 1)) + + 0.7362I(t - 1) + 0.8726.

The R 2 value for (i) is 0.740 and the standard error is 2.228 while the R 2 value for (ii) is 0.905 and the standard error 1.349. Pindyck has the following comment on these statistical results: [Equation (ii)] gives better statistical fit along the regression line, which is not surprising since a large proportion of the high frequency change in Y can be accounted for by fluctuations in inventory investment. But ff [Equation (ii)] is used in simulation of the model as a whole, it results in extreme instability. This is not surprising - a sudden large increase results in a large I, which in turn results in a larger increase in Y and thus an even larger I, etc. This is an example of statistical fit that is somewhat artificial - we in effect are regressing a noisy variable against its own noise. For these reasons [Equation (i)] was found preferable for use in the final model. (p. 59) There are no particular reasons within macroeconomic theory for choosing (i) instead of (ii). Pindyck's argument shows that the good statistical fit of Equation (ii) is outweighed by its dynamic properties. The interpretation of the equation coefficients shows that Equation (i) is preferable to (ii). [] E x a m p l e 9. In connection with the world models constructed by Forrester

[11] and Meadows e t al. [23] some interesting sensitivity analyses have been

TOOLS FOR F O R E C A S T I N G AND P L A N N I N G

263

made. The authors are well aware of the problems connected with the estimation of the equation parameters in their models. It is argued, however, that accuracy is not necessary since the main variables of the models show the same qualitative behaviour, which essentially means an impending population catastrophe, even if the parameter values are allowed to vary within 'reasonable' limits. In support of this claim both Forrester and Meadows et al. provide the results of a number of simulations of their models where various equation parameters and initial values are changed. For instance, Meadows et al. show that even if the assumed initial amount of natural resources is doubled, there is no essential change in the behaviour of the population variable, but a catastrophe is also in this case predicted to occur in the middle of the 21st century. It is, however, not sufficient to make sensitivity tests for some model parameters only in order to evaluate the reliability of the models, but a more systematic investigation is necessary. Zwicker [32] has pointed out a most undesirable case of sensitivity in Meadows' model. He studies the effects of changing three coefficients in some equations concerning capital. In Meadows' model, the constants are called ALIC (average lifetime of industrial capital), ICOR (industrial capital coefficient) and FIOAC (consumption fraction). Zwicker increases the values for ALIC and ICOR by 10% and decreases the value for FIOAC by 10%. These changes are clearly within the error limits of the estimations of the coefficients. Zwicker shows that if these constants are changed in the year 1900 in a simulation run, then all important state variables (population, natural resources, food per capita, industrial production, and pollution) will remain at a constant or almost constant level for all foreseeable future. In another simulation it is shown that if these changes of the equation coefficients are made in the year 1975, instead of 1900, then the curves will have almost the same form as those considered desirable by Meadows et al. Zwicker's investigation shows, contrary to what is maintained in Limits to Growth, that Meadows' model is unwarrantably sensitive to changes in the values of the equation parameters. His simulations are alone sufficient to considerably lower the trustworthiness of the conclusions and recommendations presented in Limits to Growth. []

264

PETER G ~ R D E N F O R S

3. Forecasting Once a dynamic model has been constructed, forecasting with the aid of the model is, from a methodological point of view, comparatively simple. What is needed when using a deterministic model is a set of initial values for the state variables and a set of values for the input variables for the time interval of the forecast. With the aid of these requisites the model is then run into the future. If the model is stochastic it is necessary to rely on so-called stochastic simulation. Such a simulation can be made by including various kinds of errors in the description of the model - making the external variables and the equation parameters random variables and adding error terms to the equations. By running a large number of simulations with this kind of model one gains a comprehension o f the most likely values of the forecasting variables and with the aid of the distribution of the simulation curves one also obtains a rough measure of the uncertainty of the forecast. 9 The acquisition of future values of the input variables may be problematic. As regards policy variables, the forecaster chooses their values himself or relies on the decision makers' plans, but when it comes to external variables, their values must be obtained from already given forecasts. A. Conditional and Unconditional Forecasts A distinction is often made between conditional and unconditional forecasts. In an unconditional forecast, it is assumed that values for all the input variables are known with certainty. In a conditional forecast, values for one or more of the input variables are not known with certainty, but the forecast is made under the condition that certain assumed values of the input variables will become the actual values. If the dynamic model, which is used for making a forecast, is a closed model, i.e., without input variables, then all forecasts will, by definition, be unconditional. When models with control variables are used, which are the most common and most important cases, unconditional forecasts will occur only in extreme cases, for instance when the input variables influence the state variables only after some lag and the forecast interval is so short that only known values of the input variables are needed. In all other cases, the required values of the input variables have to be found in some other way. For all external variables utilised when forecasting with a dynamic model

TOOLS FOR F O R E C A S T I N G AND P L A N N I N G

265

one needs forecasts of their future values. These forecasts are afflicted with some uncertainty and this uncertainty is the source of a corresponding uncertainty in the forecast made by the dynamic model. If the uncertainty of the external variables can be described by some probability distribution around an expected mean, then, with the aid of statistical analysis or stochastic simulation, this distribution can be used to determine the sensitivity of the forecast obtained by the dynamic model to errors in the forecasts of the external variables. As regards the policy variables, one cannot, in the same sense, forecast their future values, since they are dependent, at least to some extent, on decisions which are not yet made. This means that a forecast which depends on policy variables is made under the condition that certain decisions are made. Thus, the distinction between forecasting and planning is dissolved in these cases, since the forecasts are dependent on the planning for the future.

B. The Uncertainty of Forecasts A fundamental source of uncertainty is the model used when producing the forecast. The choices of variables and model equations are, as pointed out earlier, always dependent on the theory of the process which is to be modelled; and within the fields where forecasts are most desirable, i.e., much of the social sciences, theories are often vaguely formulated and considered to be first approximations only. Dynamic models which are constructed with the aid of such theories are, at best, rough simplifications of the process of interest. This kind of uncertainty is unavoidable and it is extremely difficult to judge how much the simplifications and idealisations influence a given forecast. When judging the uncertainty of forecast obtained from econometric models it is often assumed, simply, that the model is a correct description of the real process to the extent that the right variables are included in the model and the mathematical form, at least, of the model equations is correct. Even if this strong assumption is made, there are several sources of uncertainty and error in the forecast. I have already mentioned the uncertainty due to the forecasts of the values of the external variables. Other uncertain factors are the parameters in the model equations. These are almost never the 'real' coefficients but only estimated values. The process of estimating the parameters may give some clues to the degree of uncertainty associated with

266

PETER G ~ R D E N F O R S

these values. For instance, the standard error of the parameter can be estimated. If a model equation contains a stochastic error term, this is another source of uncertainty since the future values of this error term have to be predicted. Here it is possible to form a picture of the properties of the error term by studying the residuals of the estimation from the set of available data. For instance, one may estimate the mean of the error (in general zero), the standard error and the serial correlation of the errors. These estimated properties are then assumed to be valid also for the errors in the forecasting period. For each of these types of uncertainty it may be possible to use some kind of statistical analysis to compute a set of confidence intervals for the forecasts of the state variables. However, this kind of analysis is complicated even if strong assumptions are made and as soon as several kinds of uncertainty are to be combined one has to use other tools. Here stochastic simulation can be helpful.

4. Policy Experiments A. Trialand Error One of the main advantages of using dynamic models is that they are very suitable for policy experiments. If we assume that reliable forecasts for the external variables are available, then by studying the behaviour of the model for various choices of the future values of the policy variables one may get a rough idea of which planning leads to the most desired combinations of values for the input and state variables. As was pointed out above, forecasting with the aid of a dynamic model in general involves a certain amount of planning and there is thus little difference between the two kinds of activities. Necessary prerequisites for this method of planning are (i)a certain knowledge of the dynamic properties of the model, (ii)enough energy to investigate a large number of choices of future values of the policy variables in a systematic manner, and (iii)a sufficient amount of computer time (analytic solutions are, in general, too complicated and therefore expensive to use). By repeated simulation trials and by learning from mistakes, it is possible to pick out some plans for the future which will result in acceptable consequences.

TOOLS FOR FORECASTING AND PLANNING

267

Example 10. Both Forrester and Meadows et al. use their dynamic models as tools for policy recommendations. Their world models are, as pointed out above, defined as closed models, but, in practice, some of the equation parameters function as policy variables. In both cases, the authors' aim is an equilibrium state of the world with constant population and constant quality of life. Forrester shows, by some examples, that such an equilibrium is possible in his model, but he does not give any account of an attempt to show that the final equilibrium is the 'optimal' one (which may mean that it has as high quality of life as possible). The changes of model parameters which leads to the equilibrium state in his final example can hardly be taken as recommendations for future planning since one change is that food production is reduced by 20% in 1970. The method used by Meadows et al. is similar. It is not argued that the recommended changes are optimal, but it is claimed that if the changes are postponed (to 2000 instead of 1975), then it is no longer possible to maintain the equilibrium state. [] B. Optimal Control Trial-and-error is no efficient method of finding the optimal planning for the future with a dynamic model and there is no guarantee that it will ever be found. As a consequence, one has tried to use certain results from'control theory by constructing the optimal planning problem as a mathematical optimisation problem. Such an optimisation problem contains the following components: (i) a dynamic model; (ii) a time interval for which the optimal planning is to be computed; (ii0 a set of initial values of the state variables, and, possibly, a set of final values which the state variables must reach in the final state; (iv) restrictions on the state and input variables (these restrictions may consist of maximum or minimum limits which must not be overstepped); (v) a function which measures the utility of the behaviour of the model for various choices of values for the input variables (this utility function is a quantitative aggregation of the planner's goals, values and costs). When these five components are given, the optimisation problem is then to find the set of values for the control variables during the time interval such that the utility of the behaviour of the model is maximised.

268

PETER G ~ R D E N F O R S

For certain special cases there are general algorithms for solving this optimisation problem. The cases which are most extensively studied are linear time-invariant models. As regards the utility measure, it is constantly assumed that the utility of the behaviour of the model during the time interval is the sum of the utilities of the states of the model during that interval. The utility of a state of the model is then, normally, determined in the following way: for each state and input variable, first determine an 'optimal' value of the variable, and then let the utility of the actual value of the variable in the given state depend on its distance from this optimum. It is often assumed that the utility of a variable is inversely proportional to the square of the distance from the optimal value. The utility of a state, finally, is then computed as a weighted sum of the utilities of the state and input variables, where the weight of a variable depends on its relative importance. For the kind of optimisation problem outlined above, there are numerical methods for computing the optimal sequence of values for the input variables (cf. e.g. Chow [6] and Pindyck [28]). The computations soon get complex and a computer is unavoidable. I f the dynamic model to be used corresponds well to the process to be analysed and if the utility function realistically describes actual goals, costs and values, then one may avoid intuitive planning and instead rely on the computed solution.

Example 11. Pindyck [28] describes a small macroeconometric model for the American economy which he then uses as a basis for optimisation investigations. He uses the same kind of utility function as described above. He computes the optimal series of values for the input variables during a five year period and studies the effects of using different variants of the utility function (obtained by varying the weights of the variables). In this way it is possible to acquire a conception of how sensitive the optimal planning is to the uncertainty of the utility valuations. Pindyck also studies how sensitive the optimal planning is to small variations of the parameters in the model equations. He shows, by an example, that a change in a parameter (for the autoregressive term of the price equation) which is definitely within the margin of error, will radically change the series of optimal values for the poficy variables. Pindyck confesses that these results are rather disconcerting, since they say that a small change in a coefficient value can result in a large change in the optimal policy and in the resulting behaviour of

TOOLS FOR FORECASTING

AND P L A N N I N G

269

the economy. This throws some doubt on the usefulness of our particular econometric model for policy planning. In particular, the explanatory power of the model as indicated by the standard regression statistics is probably misleading because autoregressive terms were used so extensively. In any case, a sensitivity testing of the optimal policy solution is, as we have here seen, essential before that policy is used. (pp. 146, 148) Somewhat later, in connection with a similar result for a small change in the autoregressive term in the consumption function, he complements the argum e n t above with the following methodological remark: What this says is that the optimal policy is very sensitive to a change in a particular coefficient because the model itself is sensitive to such a change. In particular, the use of autoregressive terms in building models must be done with caution, since the apparent increase in explanatory power that results is just an extrapolation of past trends, the effects of changes in other coefficients are magnified, and a change in the autoregressive coefficient itself has a large effect on the behaviour of the model. (pp. 149, 151) Pindyck's remark that the standard way of judging the explanatory value o f a model may be misleading accords well with the discussion in Ggrdenfors [14] of the appropriateness o f the R 2 value as a measure of the explanatory value. [] The discussion so far has been limited to the optimal control of deterhainistic systems. When stochastic systems are considered one faces additional problems when determining the optimal sequence of control values. The computational techniques become even more complicated. But, apart from this complication, there is a more fundamental problem which arises from the fact that for a stochastic system we cannot 'know' the state o f the system at a future point o f time, even if all values o f the input variables are known up to that point. When considering deterministic systems the planner may specify the time paths o f the policy variables at the beginning o f the planning period. These paths can then be followed without regard to future events. For stochastic systems another type of policy making m a y be necessary. Here the policy variables should be specified as functions o f observations of the state of the system y e t to be made, so that the future values o f the variables depend on future observations. This type o f control is called feedback control or a closed loop policy, while the former type is called an open loop policy.

270

PETER G ~ R D E N F O R S IV. CONCLUSION

Dynamic models are, no doubt, among the best tools for forecasting and planning. The conceptual framework is rich enough to cover most mathematically formulated models in which temporal aspects are central, and it allows precise formulations of causal relationships and mutual dependencies between variables. The equations in a model are formal representations of the structure of a complex situation. Our intuitions and theoretical education may be of good help when formulating these equations. But when the pieces of the model have been assembled, our intuitions are poor when estimating the dynamic interactions of the parts of the model. Here the computer may be extremely helpful. The mould of a dynamic model, as given by the definitions, is well suited for computer representations. Much of the estimation and experimenting in connection with the forecasting and planning process can be efficiently carried out on a computer. The fundamental problem when using dynamic models is how to find the model which is the most suitable representation of the 'real' system or process one wishes to forecast. Since dynamic models are such powerful tools for forecasting and planning it is easy to forget that the choice of model for a particular process presents the greatest methodological problems. In my opinion, too much emphasis has been put on mathematical and statistical methods for analysing given dynamic models and too little on how to choose among the potential models. Only when it comes to parameter estimation, some detailed work has been done. However, the efforts put into the construction of efficient estimation procedures have tended to overshadow other, more fundamental, problems of modal construction. Also, the work with estimation procedures has concentrated on the statistical fit of the model to given data and neglected the dynamic properties of the resulting model. Even though the problems connected with choice of variables and choice of model equations do not lend themselves to mathematical formalism as parameter estimation does, these problems should not be neglected since they are of fundamental importance for the success of the dynamic model to be constructed. Throughout the paper I have illustrated my arguments with examples taken from world models and macroeconomic models. I will now, finally, make some brief remarks on the methodological status of these models.

TOOLS

FOR

FORECASTING

AND

PLANNING

271

The main advantage of a small world model is that it may present a general view of a complex structure. Even if such a small model must be based on gross simplifications, it may have a basically correct dynamic structure, which can be studied and evaluated. Already in models of this small size our mental powers are not sufficient to keep count of the dynamic changes in the model, but a computer is necessary. I have already commented upon some of the drawbacks of Forrester's and Meadows' world models. The variables in the models are difficult to measure, and this is mainly due to their high level of aggregation. This makes any numerical relationships in the system equations hard to test. Even if simulations of the world models show a reasonable fit with actual data, the dynamic properties of the models must be considered highly unrealistic. They are, for instance, unwantedly sensitive to changes of some parameters, and thus the forecasts produced by the models are on the verge of nonsense. The macroeconomic models fare somewhat better. Variables are only allowed in these models if they are objectively measureable. A great deal of effort is spent on estimating the parameters in the model equations. Some analysis of the sensitivity of the models is often made, in particular by studying dynamic multipliers. From a methodological point of view, the standard way of constructing and exploiting dynamic macroeconomic models nevertheless has some flaws. The class of permitted model equations is too often restricted to linear equations. This restriction is motivated by the applicability of the 'best' estimation procedures. However, this argument points to another drawback: too much emphasis is put on the statistical fitting of the model to data and too little on the dynamic properties of the model. Finally, among the many measures used to evaluate a model, few have been supplied with a good motivation for being appropriate for forecasting and planning models. Lund University, Sweden NOTES * The work with this paper has been supported by a research project on future studies sponsored by the Planning Division o f the Research Institute of Swedish National Defence (FOA P). I wish to t h a n k Professor Eckhart Zwicker for extensive and constructive criticism o f earlier versions o f this paper.

272

PETER

GXRDENFORS

i I have befitted from the presentation in AstrSm [3]. 2 A note on terminology: Economists sometimes distinguish between models, where the parameters in the equations are undetermined variables, and structures, which are models where the parameters have been replaced by fixed numbers. a For an elementary presentation of the solutions of difference equations and their economic interpretations, the reader is referred to Baumol [5 ]. 4 For a presentation of simulation languages, cf. Kiviat [19]. A popular tool for econometric forecasts is the TROLL system. The use of the language DYNAMO is presented in Pugh [30], and the SIMNON language, which is a general programming language for simulating (linear and non-linear) dynamic systems, is presented in Elmqvist [9]. 5 Cf. Forrester [12] and [11] and Pugh [30]. 6 This assumption is not realistic, but it will simplify the mathematics. For stability analyses of difference equations, cf. Baumol [5]. 8 This decomposition was first presented and discussed by Thei1 [ 31 ]. 9 In the case of non-linear stochastic models Howrey and Keleijan [17] argue that only stochastic simulations of models containing stochastic errors are appropriate since the non-stochastic portions of the model may yield inconsistent forecasts. Their arguments are questioned by Aigner [2]. Also cf. Haavelmo's paper which was mentioned earlier.

REFERENCES Aberg, C. J., Samhiillsekonomisk prognosteknik, Norstedts, Stockholm, 1966. Aigner, D. J., 'A Note on Verification of Computer Simulation Models', Management Science 18 (1972), 615 -619. [3 ] AstrSm, C. J., Reglerteori, 2rid ed., Almqvist & Wiksell, Uppsala, 1976. [4] AstrSm, C. J., Introduction to Stochastic Control Theory, Academic Press, New York, 1970. [5] Baumol, W. J., Economic Dynamics, 3rd ed., Macmillan, Toronto, 1970. [6] Chow, G. C., Analysis and Control of Dynamic Economic Systems, Wiley, New York, t975. [7] Cole, H. S. D. and Curnow, R. C., 'An Evaluation of the World Models', Futures 5 (1973), 108-134. [8] Dutton, J. M. and Starbuck, W. H., Computer Simulation of Human Behaviour, Wiley, New York, 1971. [9] Elmqvist, H., SIMNON: User's Manual, Lund Institute of Technology, Lund, 1975. [10] Evans, M. K., Macroeconomic Activity, Harper & Row, New York, 1969. [ 11 ] Forrester, J. W., World DynamiCs, Wright-Allen Press, Cambridge, Mass., 1971. [12] Forrester, J. W., Principles of Systems, 2nd ed., Wright-Allen Press, Cambridge, Mass., 1968. [ 13 ] Frijda, N. H., 'Problems of Computer Simulation', Behavioural Science 12 (1967), 59-67. (Reprinted in Dutton & Starbuck [8] .) [14] G~denfors, P., 'On the Information Provided by Forecasting Models', Technological Forecasting and Social Change 16 (1980), 351- 361. [15] Haavelmo, T., 'The Inadequacy of Testing Dynamic Theory by Comparing Theoretical Solutions and Observed Cycles', Econometrica 8 (1940), 312-321. [1] [2]

T O O L S F O R F O R E C A S T I N G AND P L A N N I N G [16]

273

Hanna, J.F., 'Information-theoretic Techniques for Evaluating Simulation Models', in Dutton & Starbuck [8], pp. 682-692. [17] Howrey, E. P. and Keleijan, H. H., 'Simulation Versus Analytical Solutions: The Case of Econometric Models', in Naylor [26], pp. 299-319. [18] Johnston, J.,EeonometrieMethods, McGraw-Hill,New York, 1963. [19] Kiviat, P. J., 'Simulation Languages', in Naylor [26], pp. 406-408. [20] Klein, L. R., An Essay on the Theory of Economic Prediction, Yrj6 Jahnssonin Siiiiti/5,Helsinki, 1968. [21] Kmenta, J., Elements of Eeonometrics, Macmillan, New York, 1971. [22] Maier,H. and Hugger, W., 'Invariant Structures of Prognosis Curves in Forrester's World Dynamics', Theory and Decision 3 (1973), 231-261. [23] Meadows, D. L., Meadows, D. H., Randers, J., and Behrens, W. W., The Limits to Growth, Wright-AllenPress, Cambridge, Mass., 1972. [24] Meadows,D. L. et al., Dynamics of Growth in a Finite World, Wright-AllenPress, Cambridge, Mass., 1974. [25] Meadows, D. L. etal., 'A Response to Sussex', Futures 5 (1973), 135-152. .~ [26] Naylor, T.H., Computer Simulation Experiments with Models of Economic Systems, Wiley, New York, 1971. [27] Nordhaus, W.D., 'World Dynamics: Measurement Without Data', Economic Journal 9 (1973), 1156-1183. [28] Pindyck, R. S., Optimal Planning for Economic Stabilisation, North-Holland, Amsterdam, 1973. [29] Pindyck, R. S. and Rubinfeld, D. L., Econometric Models and Economic Forecasts, McGraw-Hill,New York, 1976. [30 ] Pugh,A. L., DYNAMO User'sManual, the MIT Press, Cambridge, Mass., 1963. [31] Theft,H.,Economic Forecasts and Policy, North-Holland, Amsterdam, 1961. [32] Zwicker, E., 'M6glichkeiten und Grenzen der modellgesttitzten Prognose sozio6konomischer Entwicklungen: Dargestellt am Beispiel der Weltmodelle yon Meadows und Forrester', Preprint Series of the International Institute of Management 1/76-84, Berlin.

Suggest Documents