arXiv:physics/9901034v1 [physics.data-an] 21 Jan 1999
How to quantify deterministic and random influences on the statistics of the foreign exchange market Rudolf Friedrich, Institute f¨ ur Theoretische Physik, Universit¨at Stuttgart, D-70550 Stuttgart, Joachim Peinke, Christoph Renner Fachbereich 8 Physik, Universit¨at Oldenburg, D-26111 Oldenburg email
[email protected]; peinke@uni-oldenburg February 2, 2008 Abstract It is shown that prize changes of the US dollar - German Mark exchange rates upon different delay times can be regarded as a stochastic Marcovian process. Furthermore we show that from the empirical data the Kramers-Moyal coefficients can be estimated. Finally, we present an explicite Fokker-Planck equation which models very precisely the empirical probabilitiy distributions.
PACS: 02.50-r;05.10G Since high-frequency intra-day data are available and easy to access, research on the dynamics of financial markets is enjoying a broad interest. [1, 2, 3, 4, 5, 6, 7]. Well-founded quantitative investigations now seem to be feasible. The identification of the underlying process leading to heavy tailed probability density function (pdf) of price changes and the volatility clustering (see fig.1) are of special interest. The shape of the pdf expresses an unexpected high probability of large price changes on short time scales which is of utmost importance for risk analysis. In a recent work [8], an analogy between the short-time dynamics of foreign exchange (FX) market and hydrodynamic turbulence has been proposed. This analogy postulates the existence of hierarchical features like a cascade process from large to small time scales in the dynamics of prices, similar to the energy cascade in turbulence c.f. [9]. This postulate has been supported by some work [7] and questioned by others [6]. 1
2
Friedrich, Peinke, Renner
One main claim of the hypothesis of cascade processes is that the statistics of the time series of the financial market can be determined. The aim of the present paper is to discuss a new kind of analysis capable to derive the underlying mathematical model directly from the given data. This method yields an estimation of an effective stochastic equation in the form of a Fokker-Planck equation (also known as Kolmogorov equation). The solutions of this equation yields the probability distributions with sufficient accuracy (see fig. 1). This means that our method is not based on the conventional phenomenological comparison between models and several stochastic aspects of financial data. Our approach demonstrates how multiplicative noise and deterministic forces interact leading to the heavy tailed statistics. [10] Recently it has been demonstrated that from experimental data of turbulent flow the hierarchical features induced by the energy cascade can be extracted in form of a Fokker-Planck equation for the length scale dependence of velocity fluctuations [12]. In the following it will be shown the dynamics of foreign exchange rates can also be described by a Fokker-Planck equation. Here we use a data set consisting of 1,472,241 quotes for US dollar-German mark exchange rates from the years 1992 and 1993 as used in reference [8]. We investigate the statistical dependence of prize changes ∆xi := x(t+ ∆ti ) − x(t) upon the delay time ∆ti . Here x(t) denotes the exchange rate at time t. We will show that the prize changes ∆x1 , ∆x2 for two delay times ∆t1 , ∆t2 are statistically dependent, provided the difference ∆t2 − ∆t1 is not too large and that the shorter time interval is nested inside the longer one. In order to characterize the statistical dependency of price changes ∆xi , we have evaluated the joint probability density functions p(∆x2 , ∆t2 ; ∆x1 , ∆t1 )
(1)
for various time delays ∆t2 < ∆t1 directly from the given data set. One example of a contourplot of the logarithms of these functions is exhibited in figure 2. If the two price changes ∆x1 , ∆x2 were statistically independent, the joint pdf would factorize into a product of two probability density functions: p(∆x2 , ∆t2 ; ∆x1 , ∆t1 ) = p(∆x2 , ∆t2 )p(∆x1 , ∆t1 ).
(2)
The tilted form of the probability density p(∆x2 , ∆t2 ; ∆x1 , ∆t1 ) clearly shows that such a factorization does not hold and that the two price changes are statistically dependent. This dependency is in accordance with observations of M¨ uller et al. for cross-correlation functions of the same data [5]. To analyze these correlations in more detail, we address the question: What kind of statistical process underlies the price changes over a series of nested time delays ∆ti of decreasing length? In general, a complete characterization of the statistical properties of the data set requires the evaluation of joint pdfs pN (∆x1 , ∆t1 ; ....; ∆xN , ∆tN ) depending on N variables (for arbitrarily large N).
3
Friedrich, Peinke, Renner
In the case of a Markov process (a process without memory), an important simplification arises: The N-point pdf pN are generated by a product of the conditional probabilities p(∆xi+1 , ∆ti+1 |∆xi , ∆ti ), for i = 1, ..., N − 1. As a necessary condition, the Chapman-Kolmogorov equation [13] p(∆x2 , ∆t2 |∆x1 , ∆t1 ) =
Z
d(∆xi ) p(∆x2 , ∆t2 |∆xi , ∆ti ) p(∆xi , ∆ti |∆x1 , ∆t1 )
(3)
should hold for any value of ∆ti embedded in the interval ∆t2 < ∆t3 < ∆t1
.
(4)
We checked the validity of the Chapman-Kolmogorov equation for different ∆ti triplets by comparing the directly evaluated conditional probability distributions p(∆x2 , ∆t2 |∆x1 , ∆t1 ) with the ones calculated (pcal ) according to (3). In figure 3, the contour lines of the two corresponding pdfs are superimposed for the purpose of illustration; the red lines corresponding to pcal . Only in the outer regions, there are visible deviations probably resulting from a finite resolution of the statistics. As it is well-known, the Chapman-Kolmogorov [13] equation yields an evolution equation for the change of the distribution function p(∆x, ∆t) across the scales ∆t. For the following it is convenient (and without loss of generality) to consider a logarithmic time scale τ = −ln∆t . (5) Then, the limiting case ∆t → 0 corresponds to τ → ∞. The ChapmanKolmogorov equation formulated in differential form yields a master equation, which can take the form of a Fokker-Planck equation (for a detailed discussion we refer the reader to [13]): d ∂ ∂2 p(∆x, τ ) = [− D (1) (∆x, τ ) + D (2) (∆x, τ )]p(∆x, τ ) dτ2 ∂∆x ∂∆x2
(6)
The drift and diffusion coefficients D (1) (∆x, τ ), D (2) (∆x, τ ) can be estimated directly from the data as moments M (k) of the conditional probability distributions (c.f fig. 3): 1 lim∆τ →0 M (k) k! Z 1 = d∆x′ (∆x′ − ∆x)k p(∆x′ , τ + ∆τ |∆x, τ ). ∆τ
D (k) (∆x, τ ) = M (k)
(7)
As indicated by the functional ∆x dependency of the moments M (k) , it turns out that the drift term D (1) is a linear function of ∆x, whereas the diffusion term
4
Friedrich, Peinke, Renner
D (2) is a function quadratic in ∆x. In fact, from a careful analysis of the data set we obtain the following approximation: D (1) = −0.44∆x D (2) = 0.003 exp (−τ /2) + 0.019(∆x + 0.04)2
(8)
(∆x is measured in units of the standard deviation of ∆x at ∆t = 40960s). With these coefficients we can solve the Fokker-Planck equation (6) for the pdf at times τ > τ0 with a given distribution at τ0 . Figure 1 shows that the solutions of our model nicely fit the experimentally determined pdf’s [14]. In contrast to the use of phenomenological supposed fitting functions c.f [8, 2], we obtained the changing forms of the pdfs by a differential equation. This method provides the evolution of pdfs from large time delays to smaller ones. This definitely is a new quality in describing the hierarchical structure of such data sets. At last, it is important to note that our finding of the Fokker-Planck equation for the cascade is in good agreement with the previously found phenomenological description in [8]. Based on the given exact solution of our Fokker-Planck equation [11], we see that the chosen type of fitting function for the pdfs in [8] was the correct one. Furthermore, we see that the observed quadratic dependency of the diffusion term D 2 corresponds to the found logarithmic scaling of the intermittency parameter in [8], which was taken as an essential point to propose the analogy between turbulence and the financial market. We remind the reader that the Fokker-Planck equation is equivalent to a Langevin equation of the form (we use the Ito interpretation [13]): q d ∆x(τ ) = D (1) (∆x(τ ), τ ) + D (2) (∆x(τ ), τ )F (τ ) dτ
.
(9)
Here, F (τ ) is a fluctuating force with gaussian statistics δ-correlated in τ : < F (τ ) >= 0
,
< F (τ )F (τ ′ ) >= δ(τ − τ ′ )
(10)
In our approximation (8) the stochastic process underlying the prize changes is a linear stochastic process with multiplicative noise, at least for large values of ∆x: √ d . (11) ∆x(τ ) = −0.44∆x(τ ) + .019∆x(τ )F (τ ) dτ This stochastic equation yields realizations of prize changes ∆x(τ ), whose ensemble averages can be described by the probability distributions p(∆x, τ ). Thus, the Langevin equation (9) produces the possibility to simulate the price cascades for time delays from about a day down to several minutes. Furthermore, with this last presentation of our results it becomes clear that we are able to separate the deterministic and the noisy influence on the hierarchical structure of the finance data in terms of the coefficients D (1) and D (2) , respectively.
Friedrich, Peinke, Renner
5
Summarizing, it is the concept of a cascade in time hierarchy that allowed us to derive the results of the present paper, which in turn quantitatively supports the initial concept of an analogy between turbulence and financial data. Furthermore, we have shown that the smooth evolution of the pdfs down along the cascade towards smaller time delays is caused by a Markov process with multiplicative noise. Helpful discussions and the careful reading of our manuscript by Wolfgang Breymann, Shoaleh Ghashghai and Peter Talkner are acknowledged. The FX data set has been provided by Olsen & Associates (Z¨ urich).
References [1] M¨ uller, U. A. et al. Journal of Banking and Finance 14, 1189–1208 (1990). [2] Mantegna, R. N., Stanley, H. E. Nature 376, 46–49 (1995). [3] Vassilicos, J. C. Nature 374, 408–409 (1995). [4] The 1st International Conference on High Frequency Data in finance, Olsen & Associates, Z¨ urich, 1995). [5] M¨ uller, U. A., et al., J. Empirical Fin. (in the press). [6] Mantegna, R. N. & Stanley, H.E. N., Nature 383, 587-588 (1996). [7] Beck, C. and Hilgers, Int. J. Bifurcation and Chaos, in press; Schmitt,F., Schertzer, D., Lovejoy,S. : Turbulence fluctuations in financial markets: a multifractal approach, in Chaos, Fractals and Models, G. Salvadori ed., Italian University Press, 1998. Arneodo, A., Muzy, J.-F., Sornette, D., Eur. Phys. J. B 2, 277-282 (1998). [8] Ghashghaie, S., Breymann, W., Peinke, J., Talkner, P., and Dodge, Y., Nature 381, 767-770 (1996). [9] Frisch, U.: Turbulence,(Cambridge ,1995). [10] Based on a recent work by Donkov et.al [11] it is easily seen that the obtained Fokker-Planck equation is consistent with the phenomenological results presented previously [8]. [11] Donkov,A.A., Donkov, A.D., Grancharova, E.I., The exact Solution of one Fokker-Planck Type Equation . . . , math-ph/9807010. [12] Friedrich, R., Peinke, J., Physica D 102, 147 (1997); Friedrich, R., Peinke, J., Phys. Rev. Lett. 78, 863 (1997)
Friedrich, Peinke, Renner
6
[13] Risken, H., The Fokker-Planck equation, (Springer-Verlag Berlin, 1984); H¨anggi, P. and Thomas, H., Physics Reports 88, 207 (1982); Van Kampen,N. G., Stochastic processes in physics and chemistry (North Holland, Amsterdam, 1981); Gardiner, C.W. Handbook of Stochastic Methods, (SpringerVerlag Berlin, 1983) [14] We remind the reader that the description of the statistics of price changes by a the Fokker-Planck equation may only be an approximate one, since the sample paths need not be continuous functions in time. In this more general case one is led to an extension of the Fokker-Planck equation by terms taking care of finite size jumps [13].
7
Friedrich, Peinke, Renner
figure captions Figure 1: Probability densities (pdf) p(∆x, ∆t) of the price changes ∆x = x(t + ∆t) − x(t) for the time delays ∆t = 5120, 10240, 20480, 40960s (from bottom to top). Symbols: the results obtains from the analysis of middle prices of 1,472,241 bit-ask quotes for the US dollar-German Mark exchange rates from 1 October until 30 September 1993. Full lines: results form a numerical iteration of an effective Fokker-Planck equation with the initial condition of the probability distribution for ∆t = 40960s. As drift term D (1) = −0.44∆x and as diffusion term D (2) = 0.003exp(−τ /2) + 0.019(∆x + 0.04)2 were taken. The units of ∆x are multiples of the standard deviation σ of ∆x with ∆t = 40960s. The pdfs are shifted in vertical directions for convenience of presentation.
Figure 2: Joint pdf p(∆x2 , ∆t2 ; ∆x1 , ∆t1 ) for the simultaneous occurrence of price differences ∆x1 (∆t1 ) and ∆x2 (∆t2 ) for US dollar-Deutsch mark exchange rates. The contour plot is shown for ∆t1 = 6168s and ∆t2 = 5120s. σ = 0.0635DM denotes the standard deviation of x(t). The contour lines correspond to log p = −1, . . . , −4. If the two price changes were statistically independent the joint pdf would factorize into a product of two pdfs: p(∆x2 , ∆t2 ; ∆x1 , ∆t1 ) = p(∆x2 , ∆t2 )p(∆x1 , ∆t1 ). The tilted form of the joint pdf provides evidence that such a factorization does not appear for small values of | log(∆t1 /∆t2 )|. Figure 3 : Contour plot of the conditional pdf p(∆x2 , ∆t2 |∆x1 , ∆t1 ) = p(∆x2 , ∆t2 ; ∆x1 , ∆t1 )/ p(∆x1 , ∆t1 ) for ∆t2 = ∆t1 /(1.2)2 and t1 = 5120, in the range (−3σ ≤ ∆xi ≤ 3σ, i = 1, 2). σ denotes the standard deviation of x(t), see fig.2. The contour lines correspond to log p = −1, . . . , −4. For statistically unrelated quantities the conditional pdf would reduce to the probability density p(∆x2 , ∆t2 ). In order to verify the Chapman-Kolmogorov equation, the directly evaluated pdf (black lines) is compared with the pdf calculated by pcal (∆x2 , ∆t2 |∆x1 , ∆t1 ) = R dxi p(∆x2 , ∆t2 |∆xi , ∆ti )p(∆xi , ∆ti |∆x1 , ∆t1 ) for ∆t2 = ∆t1 /(1.2)2 ; ∆ti = ∆t1 /1.2 (red lines). Assuming an statistical error of the square root of the number of events of each bin we find that both pdfs are statistically identical.
8
Friedrich, Peinke, Renner
Figure 1:
9
Friedrich, Peinke, Renner
Friedrich, Peinke, Renner Fig.2
Figure 2:
10
Friedrich, Peinke, Renner
Friedrich, Peinke, Renner Fig.3
Figure 3: