Dec 22, 2011 - The Heath-Jarrow-Morton equation (HJM-equation) of interest rate theory ([26]; ..... 0 (Hi)) ⤠C for all t â [0,ε) with some C > 0 and ε > 0, where ...... David Heath, Robert Jarrow, and Andrew Morton, Bond pricing and the term ...
EFFICIENT SIMULATION AND CALIBRATION OF GENERAL HJM MODELS BY SPLITTING SCHEMES
arXiv:1112.5330v1 [math.PR] 22 Dec 2011
¨ PHILIPP DORSEK AND JOSEF TEICHMANN
Abstract. We introduce efficient numerical methods for generic HJM equations of interest rate theory by means of high-order weak approximation schemes. These schemes allow for QMC implementations due to the relatively low dimensional integration space. The complexity of the resulting algorithm is considerably lower than the complexity of multi-level MC algorithms as long as the optimal order of QMC-convergence is guaranteed. In order to make the methods applicable to real world problems, we introduce and use the setting of weighted function spaces, such that unbounded payoffs and unbounded characteristics of the equations in question are still allowed. We also provide an implementation, where we efficiently calibrate an HJM equation to caplet data.
1. Introduction The Heath-Jarrow-Morton equation (HJM-equation) of interest rate theory ([26]; see [14, 5, 15] for expositions) is a stochastic partial differential equation (SPDE) on the state space of forward rate curves, which is flexible enough to describe complicated dynamical features such as non-constant (local or stochastic) volatility, non-constant correlation, or jumps, or dependence structures. An analysis of geometric properties was performed in [16]. As forward rate curves already encode all the market’s information on default-free bond prices, it only remains to estimate volatilities either from the time series or from option prices or from both of them. For this purpose it is required that the numerical treatment of the HJM-equation can be performed efficiently: it is the purpose of this article to actually show that efficient numerical methods for the HJM-equation are at hand, how to construct and how to implement them. In the case of generic SPDEs we usually neither have sufficient analytical information on the marginal’s distribution, nor on its Fourier-Laplace transform, nor its short-time asymptotics. We are therefore forced to apply simulation techniques to approximate the random variables in question and we face two main sources of problems in such a procedure: 1.1. Discretization error. Numerical weak or strong approximation schemes with probabilistic flavor are built upon stochastic Taylor expansion and its iteration along n steps due to the Markov property. Depending on the local error of the method this leads (at least for some class of test functions) to a global error O(1/ns ), which is called error of order s. The method is called high order method if s > 1, and standard or low order method otherwise. There are schemes, e.g. cubature methods [31, 32, 35] or splitting methods [38, 37], that substantially increase s and therefore reduce the global discretization error for a fixed number of discretization steps n. When applying the theory of weighted spaces we can also enlarge the sets of test functions and generic equations, which can be treated by the discretization method. 1.2. Integration error. Having discretized the SPDE problem we still have to evaluate the random variables involved in each local step, which usually leads to a 1
2
¨ PHILIPP DORSEK AND JOSEF TEICHMANN
numerical integration problem on some Rdn , where d is the fixed number of dimensions which are needed for each local discretization step. Here we can apply three approaches: (deterministic) numerical integration, Monte-Carlo algorithms (MC) or Quasi-Monte-Carlo algorithms (QMC). Due to the n-dependence of integration space we do not try a direct numerical integration method, even though we have some hope that such an √ approach could possibly work. MC algorithms lead to integration errors O(1/ K), where K denotes the number of integration points, whereas QMC algorithms lead to integration errors approximately O(1/K). In both cases the integration error dominates the total error asymptotically, which can be seen by complexity analysis. Let us be more precise on this: we assume that O(n) operations are performed to calculate the value of the functional which we intend to integrate. Here we tacitly assume that dealing with elements in state space is O(1), which is strictly speaking only guaranteed to be true in a finite dimensional setting. However dealing, e.g., with curves on the real line numerically can still be of O(1) if only the relevant parts of the curve are actually calculated. Hence the total complexity C of the method is d3 nK, where d3 is a constant. Given an accuracy ǫ the following inequality has to hold true additionally, d1 d2 + √ ≤ ǫ, ns K whence we end up with the simple constraint minimization problem to minimize the complexity C = d3 nK → min given the previous inequality on accuracy. Its asymptotic solution is given by C = O(ǫ−2−1/s ) with K = O(ǫ−2 ) and n = O(ǫ−1/s ). This can be improved by multi-level methods [27, 18, 7] to a complexity estimate of order almost O(ǫ−2 ), which is in turn the complexity of one dimensional MC integration. In other words, the complexity is equal to the integration of a functional where evaluating at a single point is of order O(1). Multi-level methods improve by telescoping errors on different levels of discretization n. However, in this case the asymptotic complexity is not improved by higher-order methods anymore, since it depends on weak and strong convergence orders so that one is restricted to low order Euler-like methods. If we perform the same complexity analysis in case of higher order discretization schemes with a QMC algorithm instead of an MC algorithm we obtain an asymptotic complexity C = O(ǫ−1−1/s ) with K = O(ǫ−1 ) and n = O(ǫ−1/s ), which is indeed considerably better than multi-level MC in case of higher order methods (s > 1). On the other hand it is not better than multi-level QMC [19] theoretically could be. We emphasize that a multi-level QMC is theoretically far from being understood, and additionally we would need strong order 1 methods which are not always at hand. An additional problematic aspect is the need of high-dimensional integration spaces, where QMC is not known to perform well anymore. We claim that standard QMC with high-order weak approximation schemes is superior to multilevel MC due to the low dimensionality of the integration space Rdn as long as the QMC order of convergence is understood to hold true. For accuracy ǫ the dimension of integration space is of order O(ǫ−1/s ), which in real world implementations is often sufficiently small such that the QMC order of convergence is ensured. The goal of this work is therefore twofold: first we want to show how actually the theory of weighted spaces applies to the HJM equation. We even show that we have a simple weak approximation method of order 2 within this setting. Second, we claim that a QMC algorithm integrating the resulting functional is numerically efficient. We underline this statement by a calibration of a time-homogeneous,
EFFICIENT NUMERICS FOR HJM MODELS
3
non-linear, diffusive HJM-equation to caplet data, i.e., we calibrate this equation to ten volatility smiles simultaneously. Our method is not only fast enough for the calibration of the model, but also the computer programming itself is almost as easy as a standard Euler-Maruyama scheme due to the use of a splitting approach. Let us compare our results to well-known and recent results on splitting schemes and weak approximation methods for SPDEs. In contrast to classical results on the Lie-Trotter splitting such as [2, 3, 17, 1, 4, 39, 33, 42, 28, 20, 22, 21], we focus on a higher order method for nonlinear problems in the spirit of [38], hence allowing us to conclude the practical efficiency of the method as explained above. The topic of weak approximation for SPDEs was recently analysed in [9]. While there, the focus was on space-time white noise driving the system, we consider only finite-dimensional noise, and can obtain the same rate of convergence as in the finite-dimensional setting with bounded and smooth vector fields. Contrary to [36], our model is inherently infinite-dimensional and does not allow a reduction to a low-dimensional stochastic differential equation. 2. Weighted spaces and analysis of stochastic partial differential equations We provide an overview of the theory of weighted spaces that is at the core of the presented numerical method. For more details, see also [40, 12, 11, 10]. 2.1. The generalised Feller condition. Given a fixed ℓ ≥ 1, we consider the following setup. (1) For i = 0, · · · , ℓ, (Hi , h·, ·iHi ) is a separable Hilbert space, and its norm is 1/2 denoted by kxkHi := hx, xiHi . (2) Hi+1 is compactly and densely embedded into Hi for i = 0, . . . , ℓ − 1. (3) A : dom A ⊂ H0 → H0 is the generator of a strongly continuous semigroup of contractions (St )t≥0 on H0 . (4) For i = 0, . . . , ℓ − 1, A : Hi+1 → Hi is bounded. (5) For i = 1, . . . , ℓ, (St )t≥0 is a strongly continuous semigroup of contractions on Hi . In many cases, it will be adequate to choose Hi := dom Ai , e.g., if A is a differential operator on a bounded domain. If, however, A is a differential operator on an unbounded domain, dom A will usually not be compactly embedded into H0 . As we are interested in the HJM equation, where the underlying space variable varies in [0, ∞), we consider the above, more general setup. Definition 1. Let i ∈ {0, . . . , ℓ}. Given a left-continuous, increasing function ρ : [0, ∞) → (0, ∞) with limu→∞ ρ(u) = +∞, set ψi (x) := ρ(kxkHi ), we define the i k k enveloping space Bψ k (Hi ) := f ∈ C (Hi ) : kf kψi ,k < ∞ , where C (X) denotes the space of k times continuously Fr´echet differentiable functions and (1)
kf kψi ,k :=
k X i=0
|f |ψi ,j
with |f |ψ,j := sup ψi (x)−1 kDj f (x)kLj (Hi ) . x∈Hi
Here, Lj (Hi ) is the linear space of bounded multilinear forms a : (Hi )j → R endowed with the norm (2)
kakLj (Hi ) :=
sup x1 ,...,xj ∈Hi kxi kHi ,i=1,...,j
|a(x1 , . . . , xj )|,
which makes (Lj (Hi ), k·kLj (Hi ) ) a Banach space.
¨ PHILIPP DORSEK AND JOSEF TEICHMANN
4
Given an orthonormal basis (ej )j∈N , define the space A(Hi ) of bounded smooth cylindrical functions Hi → R by (3) N A(Hi ) := f : f = g(h·, e1 iHi , · · · , h·, eN iHi ) for some N ∈ N and g ∈ C∞ b (R ) . ψi i The closure of A(Hi ) in Bψ k (Hi ) is denoted by Bk (Hi ), k ≥ 0.
Remark 2. The above assumptions on the weight function ψi are very restrictive. A weaker assumption on the weight function on which our analysis can be performed would be that the sets {x ∈ Hi : ψi (x) ≤ R} are weakly compact, and hence bounded, in Hi , and that ψi is bounded on bounded sets. This is applied in Section 5.1. Applying [40, Corollary 5.3, Remark 5.4], we see that our space B0ψi (Hi ) coincides ockner and Z. Sobol. Hence, the following with the space W Cψi defined by M. R¨ result is proved in [40, Theorem 5.1]. Proposition 3. There exists an isometric isomorphy from B0ψ (Hi )∗ , the dual space to B0ψ (Hi ), to the space (4) Z ψi (x)|µ(dx)| < ∞ , Mψ (Hi ) := µ : µ is a signed Borel measure on Hi with Hi
R where the latter space is endowed with the norm kµkψi ,∗ := Hi ψi (x)|µ(dx)|, |µ| denoting Rthe total variation measure to µ. The inverse of this isometry is given by ℓµ (f ) := Hi f (x)µ(dx) for all f ∈ B0ψi (Hi ) and µ ∈ Mψ (Hi ).
This result allows us to obtain a generalisation of the well-known Feller condition for the strong continuity of operator semigroups on C0 (D), D a locally compact topological space, to the infinite-dimensional setting. Corollary 4 (generalised Feller condition). Fix i ∈ 0, . . . , ℓ. Let (Pt )t≥0 be a family of continuous operators on B0ψi (Hi ) satisfying the generalised Feller condition, i.e.,
(1) P0 = I, the identity on B0ψi (Hi ), (2) Pt+s = Pt Ps for s, t ≥ 0, (3) kPt kL(Bψi (H )) ≤ C for all t ∈ [0, ε) with some C > 0 and ε > 0, where 0
i
L(B0ψi (Hi )) is the space of bounded and linear operators on B0ψi (Hi ) and endowed with the operator norm, and finally (4) limt→0+ Pt f (x) = f (x) for all x ∈ X and f ∈ B0ψi (Hi ).
Then, (Pt )t≥0 is a strongly continuous semigroup on B0ψi (Hi ), i.e., for every f ∈ B0ψi (Hi ), limt→0+ kPt f − f kψi = 0.
Proof. This is an easy consequence of Proposition 3 and [13, Theorem I.5.8]: We only need to prove that limt→0+ ℓ(Pt f ) = ℓ(f ) for all f ∈ B0ψi (Hi ) and ℓ ∈ B0ψi (Hi )∗ . R But by Proposition 3, ℓ(f ) = X f (x)µ(dx) for all f ∈ B0ψi (Hi ) with some µ ∈ Mψi (Hi ). As limt→0+ Pt f (x) = f (x) for all x ∈ Hi , an application of Lebesgue’s dominated convergence theorem yields the claim. Hence, in contrast to the weak continuity of Markov semigroups for infinite dimensional stochastic equations [6], the above result allows us to work with standard strongly continuous semigroups. Usually, the difficult part in verifying the generalised Feller condition for a given Markov semigroup (Pt )t≥0 is proving that Pt (B0ψi (Hi )) ⊂ B0ψi (Hi ). The following result can often be applied to this problem. Theorem 5. For k ≥ 0 and i = 1, . . . , ℓ, Ckb (Hi−1 ) ⊂ Bkψ (Hi ) is dense.
EFFICIENT NUMERICS FOR HJM MODELS
5
Proof. Apply Proposition 16 to obtain an orthonormal basis (en )n∈N of Hi−1 that is simultaneously orthogonal in Hi . Defining A(Hi ) using (en /ken kHi )n∈N , we see that every f = g(h·, e1 /ke1 kHi iHi , . . . , h·, eN /keN kHi iHi ) ∈ A(Hi ) can be extended to a smooth cylindrical function on Hi−1 , as (5)
g(h·, e1 /ke1 kHi iHi , . . . , h·, eN /keN kHi iHi )
= g(ke1 kHi h·, e1 iHi−1 , . . . , keN kHi h·, eN iHi−1 ).
Whence A(Hi ) ⊂ Ckb (Hi−1 ). Next, we show Ckb (Hi−1 ) ⊂ Bkψ (Hi ). Given f ∈ Ckb (Hi−1 ) and ε > 0, we shall construct fε ∈ A(Hi ) such that kf − fε kψi ,k < ε. Let πN denote the Hi−1 orthogonal projection onto span{ej : j = 1, . . . , N }. For R > 0 arbitrary, we estimate kf − f ◦ πN kψi ,k ≤ (6)
k X j=1
sup x∈Hi kxkHi ≤R
ψi (x)−1 kDj f (x) − Dj f (πN x)kLj (Hi )
+ ρ(R)−1 kf − f ◦ πN kCkb (Hi )
As πN : Hi → Hi−1 is of operator norm one for all N ∈ N, it is easy to see by the properties of ρ that the final term goes to zero as R goes to infinity, and hence can be made smaller than ε/3 by choosing Rε , depending on f but not on N , large enough. For the first term, note that B := {x ∈ Hi : kxkHi ≤ Rε } is precompact in Hi−1 . Hence, there exists δ > 0 such that kDj f (x) − Dj f (y)kLj (Hi ) < ε/3 for x, y ∈ B with kx − ykHi−1 < δ. Choose Nδ according to Corollary 17 to obtain that kx − πN xkHi−1 < δ whenever N ≥ Nδ and x ∈ B. Pk Finally, choose fε : Hi,Nδ → R in such a way that j=0 supx∈Hi,Nδ kDj fε (x) − kxkHi
Dj (f ◦πNδ )(x)kLj (Hi ) < ε/3 and kfε kCkb (Hi,N ) ≤ kf ◦πNδ kCkb (Hi,N ) . Here, Hi,Nδ := δ δ span {ej : j = 1, . . . , Nδ }. Such a choice is always possible, as Hi,Nδ is finite dimensional and we can thus apply a standard mollifying argument. It follows similarly as above that kfε − f ◦ πNδ kψi ,k < ε/3, and plugging the results together, we obtain (7)
kf − fε kψi ,k < ε.
Thus, A(Hi ) ⊂ Ckb (Hi−1 ) ⊂ Bkψi (Hi ), and the claim follows.
Theorem 6. Fix i ∈ {1, . . . , ℓ}. Let (x(t, x0 ))t≥0 be a time homogeneous Markov property on the stochastic basis (Ω, F , (Ft )t≥0 , P) with values in Hi−1 . Assume that (1) the mapping Hi−1 → Hi−1 , x0 7→ x(t, x0 ) is almost surely continuous with respect to the norm topology on Hi for every t ≥ 0, (2) if x0 ∈ Hi and t ≥ 0, then x(t, x0 ) ∈ Hi almost surely, (3) for some ε > 0 and C > 0, E[ψi (x(t, x0 ))] ≤ Cψ(x0 ) for all x0 ∈ Hi and t ∈ [0, ε], and (4) (x(t, x0 ))t≥0 has almost surely c` adl` ag paths in the weak topology of Hi . Then, Pt ∈ L(B0ψ (X)) for all t ≥ 0, where Pt f (x0 ) := E[f (x(t, x0 ))], (Pt )t≥0 satisfies the generalised Feller condition, and hence, (Pt )t≥0 is a strongly continuous semigroup on B0ψ (X).
6
¨ PHILIPP DORSEK AND JOSEF TEICHMANN
Proof. First, we prove that limt→0+ Pt f (x0 ) = f (x0 ) for fixed f ∈ B0ψi (Hi ) and x0 ∈ Hi . Let R > kx0 kHi . Set BR := {x ∈ Hi : kxkHi ≤ R}, then |Pt f (x0 ) − f (x0 )| ≤ E[|f (x(t, x0 )) − f (x0 )|]
≤ E[|f (x(t, x0 )) − f (x0 )|χBR (x(t, x0 ))] + E[|f (x(t, x0 ))|χHi \BR (x(t, x0 ))]
(8)
+ |f (x0 )|P[kx(t, x0 )kHi > R]
with χA (x) := 1, x ∈ A, 0 otherwise the indicator function of the set A. The Markov inequality yields (9)
P[kx(t, x0 )kHi > R] ≤ ρ(R)−1 E[ψi (x(t, x0 ))],
and this term goes to zero as R goes to infinity. Furthermore, (10)
E[|f (x(t, x0 ))|χHi \BR (x(t, x0 ))] ≤ kf kψi ,0 E[ψi (x(t, x0 ))χHi \BR (x(t, x0 ))],
and dominated convergence proves that this also goes to zero as R goes to infinity. Finally, note that f |BR is weakly continuous, as f ∈ B0ψi (Hi ). Weak compactness of BR yields that |f (x) − f (x0 )| ≤ 2 supx∈BR |f (x)| < ∞ for x ∈ BR , and monotone convergence proves limt→0+ E[|f (x(t, x0 )) − f (x0 )|χBR (x(t, x0 ))] = 0. Hence, we have shown limt→0+ Pt f (x0 ) = f (x0 ). Next, note that Pt (Cb (Hi−1 )) ⊂ Cb (Hi−1 ), which is a consequence of the assumption of almost sure continuity of the mapping x0 7→ x(t, x0 ). As kPt f kψi ,0 = sup ψi (x0 )−1 |E[f (x(t, x0 ))]| x0 ∈Hi
(11)
≤ kf kψi ,0 sup ψi (x0 )−1 E[ψi (x(t, x0 ))] ≤ Ckf kψi ,0 x∈Hi
for t ∈ [0, ε], this proves by Theorem 5 that Pt ∈ L(B0ψi (Hi )) for t ∈ [0, ε]. As the semigroup property is satisfied due to the Markov property, an induction shows Pt ∈ L(B0ψi (Hi )) for all t ≥ 0. Thus, Corollary 4 proves the claim. 2.2. Application to stochastic partial differential equations. Let (x(t, x0 ))t≥0 be the solution of the stochastic partial differential equation (12a)
dx(t, x0 ) = (Ax(t, x0 ) + V0 (x(t, x0 )))dt +
d X j=1
(12b)
x(0, x0 ) = x0 .
Vj (x(t, x0 )) ◦ dWtj ,
Here, (Wt )t≥0 is a d-dimensional standard Brownian motion. The vector fields Vj N are assumed to be of the form Vj (x) = gj (Lx), where gj ∈ C∞ b (R ; Hℓ ) is a smooth N N function on R with values in Hℓ , and L : H0 → R is a bounded linear mapping. These are typical assumptions for HJM models to be applied in practice, see [16]. Then, it follows that (12) admits unique solutions in every space Hi , i = 0, . . . , ℓ, given that the initial value x0 is smooth enough. Lemma 7. Fix β > 0 and i ∈ {0, . . . , ℓ}. For some ε > 0, there exists C > 0 such that (13)
E[cosh(βkx(t, x0 )kHi )] ≤ C cosh(βkx0 kHi )
for x0 ∈ Hi and t ∈ [0, ε].
EFFICIENT NUMERICS FOR HJM MODELS
7
Proof. We apply Itˆ o’s formula. For m ≥ 2, 2(m−1)
hx(t, x0 ), dx(t, x0 )iHi dkx(t, x0 )k2m Hi = mkx(t, x0 )kHi 1 2(m−2) + m (m − 1)kx(t, x0 )kHi hx(t, x0 ), dx(t, x0 )i2Hi 2 2(m−1) hdx(t, x0 ), dx(t, x0 )iHi + kx(t, x0 )kHi 2(m−1) hx(t, x0 ), Ax(t, x0 )iHi dt = mkx(t, x0 )kHi + hx(t, x0 ), V0 (x(t, x0 ))iHi dt +
d X j=1
(14)
hx(t, x0 ), Vj (x(t, x0 ))iHi dWtj
d X 1 2(m−2) (m − 1)hx(t, x0 ), Vj (x(t, x0 ))i2Hi + mkx(t, x0 )kHi 2 j=1 + kx(t, x0 )k2Hi hVj (x(t, x0 )), Vj (x(t, x0 ))iHi dt.
Taking expectations, the boundedness of the Vj and the dissipativity of A yield, as all moments are uniformly bounded by [8, Theorem 7.3.5], a constant C > 0 independent of m ≥ 2 such that (15) Z t h i 2(m−1) 2m−1 2m ds. + Cm ] ≤ kx k + mkx(s, x0 )kHi E[kx(t, x0 )k2m E kx(s, x0 )kH 0 Hi Hi i 0
For m = 1, we similarly obtain (16)
E[kx(t, x0 )k2Hi ] ≤ kx0 k2Hi + C
Z
t
E[kx(s, x0 )kHi + 1]ds,
0
and trivially, E[kx(t, x0 )k0 ] = 1. Note that cosh(u) = the monotone convergence theorem proves
u2m m=0 (2m)! .
P∞
Summing up,
E[cosh(βkx(t, x0 )kHi )] ≤ cosh(βkx0 kHi ) Z t "X ∞ E + Cβ 0
m 2m−1 β 2m−1 kx(s, x0 )kH i (2m)! m=1
∞ X m2 2(m−1) 2(m−1) β kx(s, x0 )kHi +β (2m)! m=1
#
≤ cosh(βkx0 kHi ) Z β t +C E sinh(βkx(s, x0 )kHi ) 2 0 (17) + β cosh(βkx(s, x0 )kHi ) ds. P∞ m u2m−1 , and that 2m−1 ≤ 1 for m ≥ 1. Here, we have used that sinh(u) = m=1 (2m−1)! As sinh(u) ≤ cosh(u), we obtain that with a constant C > 0 depending on β, (18)
E[cosh(βkx(t, x0 )kHi )] ≤ cosh(βkx0 kHi ) Z t +C E[cosh(βkx(s, x0 )kHi )]ds. 0
The method of the moving frame (see [43]) allows us to conclude that E[cosh(βkx(t, x0 )kHi )] < ∞ for t ≥ 0. Hence, an application of Gronwall’s inequality proves the claim.
¨ PHILIPP DORSEK AND JOSEF TEICHMANN
8
Hence, the choice of weight function ψi,β (x) := cosh(βkxkHi ), β > 0, is appropriate. This is particularly important in the application of our results to the HJM equation, see Section 5. Corollary 8. Given i ∈ {1, . . . , ℓ} and β > 0, the Markov semigroup (Pt )t≥0 of ψ (x(t, x0 ))t≥0 is strongly continuous on B0 i,β (Hi ). Proof. Under the given assumptions, we can prove the conditions of Theorem 6 using [8, Theorem 7.3.5]. Choose some ℓ0 ∈ {1, . . . , ℓ} and β0 > 0. We perform an analysis of the infinitesimal generator G with domain dom G of (Pt )t≥0 , considered as strongly continuous semigroup on B ψℓ0 ,β0 (Hℓ0 ). In the following, V f (x) := Df (x)(V (x)) denotes the directional derivative for sufficiently smooth functions f : Hi → R and vector fields V : Hi → H i . Lemma 9. Fix i ∈ {0, . . . , ℓ}. For j = 0, . . . , d and f ∈ A(Hi ), Vj f ∈ A(Hi ). Furthermore, the directional derivative f → 7 Vj f defines a bounded linear operator ψi,β ψi,β from Bk (Hi ) to Bk−1 (Hi ), k ≥ 1. Proof. The special form of Vj proves Vj f ∈ A(Hi ) for f ∈ A(Hi ). The estimate kVj f kψi,β ≤ Ckf kψi,β can be shown by a direct calculation using the boundedness of ψ Vj and its derivatives, and the result follows from the density of A(Hi ) in Bk i,β (Hi ). Lemma 10. Fix i ∈ {1, . . . , ℓ} and β1 < β2 . The operator f 7→ Df (·)(A·) maps ψ A(Hi−1 ) to A(Hi ), and defines a bounded linear operator from Bk i−1,β1 (Hi−1 ) to ψi,β2 Bk−1 (Hi ), k ≥ 1. Proof. Given f ∈ A(Hi−1 ), there exists a Hi−1 -orthogonal projection π with finitedimensional range such that f ◦ π = f . Hence, Df (x)(Ax) = Df (x)(πAx), and it is easy to see that this function is in A(Hi ). The boundedness is again shown by a direct calculation, where we apply that u cosh(β1 u) ≤ C cosh(β2 u) for all u ∈ [0, ∞) with some constant C > 0. An application of Itˆ o’s formula, see [8, Theorem 7.2.1], yields that for i ∈ {1, . . . , ℓ} and f ∈ A(Hi−1 ), d
(19)
Gf (x) = Df (x)(Ax) + (V0 f )(x) +
1X 2 (V f )(x) 2 j=1 j
for x ∈ Hi .
Theorem 11. Fix i ∈ {1, . . . , ℓ}. For j ≥ 0 and 0 < β1 < β2 , the operator ψ ψβ 1 (Hi−1 ) → Bj β2 (Hi ), given by the right hand side of (19), is well-defined as G˜ : Bj+2 ψ
a bounded linear operator. Furthermore, for β ∈ (0, β0 ), B2 ℓ0 −1,β (Hℓ0 −1 ) ⊂ dom G, ˜ and on this space, G = G.
Proof. The boundedness of G˜ follows from Lemmas 9 and 10. For the second ψ ψ property, note that G˜ maps B2 ℓ0 −1,β (Hℓ0 −1 ) into B0 ℓ0 ,β0 (Hℓ0 ) as a bounded linear operator, G = G˜ on A(Hℓ0 −1 ), and that G is a closed operator. Hence, G = G˜ follows from a density argument. Corollary 12. Fix β ∈ (0, β0 ). Given k ∈ {0, . . . , ℓ0 − 1}, we have the Taylor expansion (20)
Pt f =
k X tj j=0
j!
G j f + Rt,k f
ψ
ℓ0 −(k+1),β for f ∈ B2(k+1) (Hℓ0 −(k+1) ),
EFFICIENT NUMERICS FOR HJM MODELS
9
ψ
ℓ0 −(k+1),β where the operator Rt,k : B2(k+1) (Hℓ0 −(k+1) ) → B ψℓ0 ,β0 (Hℓ0 ) is bounded uniformly in t ∈ [0, ε] for given ε > 0.
ψ
ψ
ℓ0 −(k+1),β Proof. Theorem 11 proves that G j : B2(k+1) (Hℓ0 −(k+1) ) → B0 ℓ0 ,β0 (Hℓ0 ) is a bounded linear operator for j = 0, . . . , k + 1. Hence, a standard Taylor expansion argument can be applied to prove the stated theorem.
ψ
ψ
Lemma 13. For β > 0, k ≥ 0 and i ∈ {1, . . . , ℓ}, Pt : Bk i,β (Hi ) → Bk i,β (Hi ) is a bounded linear operator. Its operator norm is bounded uniformly for t ∈ [0, T ], where T > 0 can be chosen arbitrarily. Proof. This is consequence of smooth dependence on the initial value in Hi−1 . By considering the sensitivity equations, see [8, Theorem 7.3.6], all derivatives Dxj 0 x(t, x0 )(h1 , . . . , hj ) are shown to satisfy bounds of the type (21)
p
E[kDxj 0 x(t, x0 )(h1 , . . . , hj )kpHi ] ≤ Cp (kh1 kHi · · · khj kHi )
for p ≥ 2,
where Cp is independent of x0 . The boundedness of Pt in the norms given above then follows from the Cauchy-Schwarz inequality together with the property c cosh(2u) ≤ cosh(u)2 ≤ C cosh(2u) for some constants c, C > 0. Due to Theoψ rem 5, it follows that Pt (A(Hi−1 )) ⊂ Bk i,β (Hi ). A density argument proves the claim. ψ
Corollary 14. For k ≥ 2, i ∈ {0, . . . , ℓ0 − 1} and β ∈ (0, β0 ), Bk i,β (Hi ) is a core for G. ψ
Proof. Applying [13, Proposition II.1.7], this is clear from Lemma 13, as B2 i,β (Hi ) ⊂ ψ dom G is invariant with respect to the semigroup and dense in B0 i,β (Hi ). 3. The rate of convergence of splitting schemes for stochastic partial differential equations As numerical discretisation scheme, we suggest the use of a splitting scheme. PM Decomposing the drift coefficient further, V0 = m=1 V0,m , we define the split problems d (22a) x0,0 (t, x0 ) = Ax0,0 (t, x0 ), dt d (22b) x0,m (t, x0 ) = V0,m (x0,m (t, x0 )), m = 1, . . . , M, dt (22c) dxj (t, x0 ) = Vj (xj (t, x0 )) ◦ dWtj , j = 1, . . . , d.
We stress that all of these problems can be solved by finding the corresponding deterministic flows; in the case of j = 1, . . . , d; we need to evaluate the flow induced by the vector field Vj at the stochastic time Wtj . In particular, the processes (x0,m (t, x0 ))t≥0 , m = 0, . . . , M , are deterministic. The split semigroups are defined by Pt0,m f (x0 ) := f (x0,m (t, x0 )) and Ptj f (x0 ) := E[f (xj (t, x0 ))], j = 1, . . . , d. We consider the following splitting schemes. Lie-Trotter splitting, forward ordering: The Lie-Trotter splitting with forward ordering is of first order and reads (23)
0,0 0,1 0,M 1 d QLTfwd (∆t) f := P∆t P∆t . . . P∆t P∆t . . . P∆t f
for f ∈ B ψℓ0 ,β0 (Hℓ0 ).
Lie-Trotter splitting, backward ordering: The Lie-Trotter splitting with backward ordering is obtained by reversing the order of the operators in the Lie-Trotter splitting with forward ordering, (24)
0,M 0,1 0,0 d 1 QLTbwd (∆t) f := P∆t . . . P∆t P∆t . . . P∆t P∆t f
for f ∈ B ψℓ0 ,β0 (Hℓ0 ),
¨ PHILIPP DORSEK AND JOSEF TEICHMANN
10
and is also of first order. Ninomiya-Victoir splitting: The Ninomiya-Victoir splitting is a generalisation of the well-known Strang splitting to more than two generators and reads 1 0,0 0,1 0,M 1 d P . . . P∆t P∆t . . . P∆t P f := QNV (∆t) 2 ∆t/2 ∆t
(25)
0,0 0,M 0,1 d 1 P∆t/2 f + P∆t . . . P∆t P∆t . . . P∆t
for f ∈ B ψℓ0 ,β0 (Hℓ0 ).
It is of second order. The theory of Section 2 now applies not only to the continuous semigroup (Pt )t≥0 , but also to every split semigroup (Pt0,m )t≥0 and (Ptj )t≥0 , yielding spaces invariant to the dynamics of (Pt )t≥0 on which we can apply the generators G, G0,m and Gj , m = 0, . . . , M and j = 1, . . . , d, and (26)
G=
M X
m=0
G0,m +
d X j=1
Gj .
Hence, we obtain the following result. Theorem 15. Let β ∈ (0, β0 ), and assume that (Q(∆t) )∆t≥0 is any splitting approximation of (Pt )t≥0 based on the split semigroups (Pt0,m )t≥0 and (Ptj )t≥0 , m = 0, . . . , M , j = 1, . . . , d, which is of formal order s ∈ {1, . . . , ℓ0 − 1}. For ψ0,β f ∈ B2(s+1) (H0 ), (27)
kPt f − Qn(t/n) f kψs+1,β0 ≤ CT n−s kf kψ0,β ,2(s+1) .
Proof. The theory of [25] yields this result in the following manner. Clearly, all split ψ semigroups are stable on the space B0 s+1,β0 (Hs+1 ) in the sense that the operator norms of the operators are bounded by exp(Ct) with some constant C > 0 independent of t and of the semigroup. Furthermore, for every t ≥ 0, Pt is a bounded linear ψ0,β operator on B2(s+1) (H0 ) by Lemma 13, and on this space, we have that all generators of the split semigroups and the original semigroup are well-defined together with their products, and satisfy α M d X X (28) Gj = G α , α = 0, . . . , s + 1. G0,m + m=0
j=1
Hence, we obtain the claimed result from [25, Theorem 2.3, Sections 4.1, 4.4].
4. Symmetrically weighted sequential splitting Applying the theory of [23, 24] allows us to obtain asymptotic expansions for the forward and backward ordering of the Lie-Trotter splitting and the NinomiyaVictoir splitting if the function f is sufficiently smooth. Using symmetry, we can even prove that the Ninomiya-Victoir splitting and the symmetrically weighted sequential splitting, going back at least to [41, equation (25)] and given by 1 LTfwd n n (29) QSWSS f := (Q(t/n) ) f + (QLTbwd ) f , t,n (t/n) 2 have asymptotic expansions not only in n−1 , but even n−2 . Hence, every extrapolation step would improve convergence by two orders. In particular, the symmetrically weighted sequential splitting is of second order. Comparing the dimension of integration space of different second order schemes and in view of possible extrapolations we use SWSS in our numerical computations detailed below. Indeed
EFFICIENT NUMERICS FOR HJM MODELS
11
dimension of integration space for the Ninomiya-Victoir scheme is n(d+ 1), whereas sequential splitting leads to dimension nd + 1. 5. Application: the Heath-Jarrow-Morton equation As application of our theoretical results, we provide a numerical method for the efficient simulation of the Heath-Jarrow-Morton equation of interest rate theory. It is of the form specified in (12), where the infinitesimal generator is given by d the differential operator dx . In order to include a stochastic volatility process, the Hilbert spaces Hi , i = 0, . . . , ℓ are specified as follows. We set Hi := h ∈ L1loc ((0, ∞)) : h is i + 1 times weakly differentiable and (30) h′ ,. . . ,h(i+1) ∈ L2αi ((0, ∞)) × R. Here, 0 < α0 < · · · < αℓ , and (
(31)
L2α ((0, ∞)) :=
h ∈ L1loc ((0, ∞)) :
Z
(0,∞)
)
r(x)2 exp(αx)dx < ∞ .
It is easy to see that Hi+1 ⊂ Hi for i ∈ {0, . . . , ℓ − 1}, and that every function in H0 is continuous and bounded (see also [14]). The scalar product on Hi reads h(h1 , v1 ), (h2 , v2 )iHi := h1 (0)h2 (0) i Z X (m) (m) (32) h1 (x)h2 (x) exp(αi x)dx + v1 v2 . + m=1
(0,∞)
With the induced norm, (33)
A : Hi+1 → Hi ,
(h, v) 7→ (h′ , −αv),
where α ≥ 0 is a constant, becomes a bounded linear operator. It agrees with the generator of the shift semigroup on the first component of Hi , i ∈ {0, . . . , ℓ − 1}. Consider the Heath-Jarrow-Morton equation with stochastic volatility in Itˆ o form, dr(t, r0 , v0 ) = (Ar(t, r0 , v0 ) + αHJM (r(t, r0 , v0 ), v(t, v0 )))dt (34a)
+
d X
σj (r(t, r0 , v0 ), v(t, v0 ))dWtj ,
j=1
(34b)
dv(t, v0 ) = −αv(t, v0 )dt +
(34c)
r(0, r0 , v0 ) = r0 ,
(34d)
v(0, v0 ) = v0 .
d X
γj dWtj ,
j=1
The stochastic volatility v(t, v0 ) is chosen as a mean-reverting Ornstein-Uhlenbeck process. The HJM drift satisfies the condition Z x d X σj (h, v)(x) σj (h, v)(ξ)dξ. (35) αHJM (h, v)(x) = 0
j=1
We assume that σj are of the form required in Section 2.2, i.e., σj (h, v) = gj (Lh, v), N +1 where gj ∈ C∞ ; Hℓ ) and L : H0 → RN is bounded linear. Rewriting the b (R equation in Stratonovich form, we see that d
(36)
V0 (h, v) = αHJM (h, v) −
1X Dσj (h, v)(σj (h, v)), 2 j=1
12
¨ PHILIPP DORSEK AND JOSEF TEICHMANN
N +1 and it follows easily that V0 (h, v) = g0 (Lh, v) with some g0 ∈ C∞ ; Hℓ ). b (R Hence, Theorem 15 applies to prove the optimal rate of convergence of s of a splitting scheme for sufficiently smooth functions f : H0 → R, given that the initial value satisfies (r0 , v0 ) ∈ Hs+1 .
5.1. The money market account. In order to calculate standard payoffs, we not only need the instantaneous forward curve, but also the money market account (Bt )t≥0 . It is given by Bt = exp(z(t, 0)), where (37)
dz(t, r0 , v0 , z0 ) = Rt dt,
z(0, r0 , v0 , z0 ) = z0 ,
and can therefore be easily included into our splitting scheme. Here, we denote by Rt := r(t, r0 , v0 )(0) the short rate induced by our HJM model. To recover the optimal rate of convergence, we argue as follows. On the product ˜ i := Hi × R, we consider the weight function ψ˜i,β (h, v, z) := ψi,β (h, v) + z 2 space H (see Remark 2). As proved before, (38)
E[ψi,β (r(t, r0 , v0 ), v(t, v0 ))] ≤ exp(Ct)ψi,β (r0 , v0 ).
Furthermore, as Rt ≤ ψi,β (r(t, r0 , v0 )), Z t Z t 2 2 2 E[z(t, r0 , v0 , z0 ) ] ≤ z0 + E[z(t, r0 , v0 , z0 ) ]ds + E[Rt2 ]ds 0 0 Z t 2 (39) ≤ z0 + E[ψ˜i,β (r(t, r0 , v0 ), v(t, v0 ), z(t, r0 , v0 , z0 ))]ds. 0
Altogether, an application of Gronwall’s inequality proves (40)
E[ψ˜i,β (r(t, r0 , v0 ), v(t, v0 ), z(t, r0 , v0 , z0 ))] ≤ exp(Ct)ψ˜i,β (r0 , v0 , z0 ), ˜ ψ
and we can apply the above theorems to all functions contained in Bk i,β (Hi × R) by evident modifications of the above proofs. Now, the money market account is not included in the above setting. More precisely, Bt = exp(z(t, r0 , v0 , 0)), and this growth is larger that the quadratic growth admitted by ψ˜i,β . We deal with this problem in the following way: Actually, z(t, r0 , v0 , 0) should be nonnegative from an economic point of view. Hence, we ˜t := exp(Φ(z(t, r0 , v0 , 0))), where Φ : R → R replace the money market account by B is C∞ with bounded derivatives, satisfies Φ(z) = z for all z ≥ −K, and is bounded from below by −2K with some K > 0. In our numerical experiments, performed using the model calibrated to the data from [30], we never encountered paths with z(t, 0) ≤ 0. Furthermore, even if z(t, 0) becomes slightly negative on some paths, this is numerically innocent, as we can adjust K accordingly. We want to stress that our modification only acts on economically dubious paths where the money market account falls significantly in the long run, and neither limits temporary decrease, or any increase whatsoever. ˜t−1 ≤ exp(2K). Hence, the modified payoff of a zero coupon bond with Clearly, B time to maturity δ, Z δ h(s)ds), (41) f (h, v, z) := exp(−Φ(z)) exp(− 0
˜ ψ Bk 0,β (H0
is included in our setup, and lies in × R) for all k ≥ 0 if β > 0 is chosen large enough: first, note that f depends on h only via the bounded linear functional Rδ ℓ : H0 → R, ℓ(h) := 0 h(s)ds. It follows that
(42)
|f (h, v, z)| ≤ exp(2K) exp(CkhkH0 ).
EFFICIENT NUMERICS FOR HJM MODELS
13
Choosing β > C, the claim is proved, as it is clear that we can approximate f by functions of the form (h, z) → 7 exp(−Φ(z))ϕ(ℓ(h)) with ϕ ∈ C∞ b (R) in the norm of ˜ ψ
Bk 0,β (H0 × R). While standard payoffs, such as caplets and swaptions, do not satisfy the smoothness assumptions required in our results, we can at least prove that they are contained in a space on which convergence – albeit without rates – is ensured. A similar argument as for the bond price can be used to prove that the modified payoffs of caplets, (43) where Lδ (h) := f (h, v, z) = = exp(−Φ(z))
1 δ
f (h, v, z) = exp(−Φ(z))(Lδ (h) − K)+ , R δ exp 0 h(τ )dτ − 1 is the LIBOR rate, and payer swaptions,
" I X
exp(−
i=1
are contained in B
˜0,β ψ
Z
iδ
h(τ )dτ ) exp
0
Z
iδ
(i−1)δ
h(τ )dτ
!
!#
− (1 + δK)
,
+
(H0 × R). Here, however, taking the positive part makes ˜ ψ
0,β these functions nonsmooth. As the space B2(s+1) (H0 × R) of functions on which a
˜
rate of convergence is proved is dense in B ψ0,β (H0 × R), we still obtain convergence. 6. Numerics for the Heath-Jarrow-Morton equation
We present the results of numerical computations for a Heath-Jarrow-Morton model. We do neither claim that the chosen HJM model is particularly well suited nor that the chosen calibration strategy is the best. We only want to demonstrate that a non-linear infinite-dimensional HJM model with stochastic volatility can be efficiently calibrated to market data with a satisfactory result. First, a numerical calibration to caplet prices is performed, afterwards, a payer swaption is priced using the calibrated model. In our numerics, space discretisation is performed using piecewise affine and continuous functions, where the mesh is aligned with the time mesh. Hence, the partial differential equation ∂ ∂ r0,0 (t, r0 )(x) = r0,0 (t, r0 )(x), r0,0 (0, r0 )(x) = r0 (x), (44) ∂t ∂x is solved exactly by shifting r0 . 6.1. Calibration. We demonstrate the efficiency of the presented method by performing the calibration of a parametrised, time-homogeneous Heath-Jarrow-Morton model to the caplet volatility surface provided in [30]. Note that the bond prices given there are automatically reproduced in our model by choosing them as the initial value. We set d = 3, and specify σj (h, v) = gj (h, v)λj . Here, λj is assumed to be of the Pi0 exponential-polynomial type [14], λj (x) = i=0 αj,i xi exp(−βx). It is easy to see that under such assumptions, the regularity required in [14, Section 5.2] is satisfied. In our experiments, we choose i0 = 2. There are several economically sound possibilities forp choosing gj . Guided by the Cox-Ingersoll-Ross model, one could choose gj (h, v) = |vh(tj )| with some tj ≥ 0, where the absolute values are necessary as we cannot guarantee positive interest rates by this approach. This ansatz, however, is not contained in our general setup, as gj is not a smooth function of h. Rt Instead, we assume gj (h, v) = tanh(cj exp(v) 0 j h(s)ds). This ensures that the Rt volatilities are bounded and vanish if the benchmark yields 0 j h(s)ds driving the equation go to zero. We discretize the HJM-equation by the symmetrically weigthed
¨ PHILIPP DORSEK AND JOSEF TEICHMANN
14
ttm: 1, L=0.044842
ttm: 2, L=0.049291
0.3
0.3
0.25
0.25
0.2
0.2 0.05 0.1 ttm: 3, L=0.051787
market data calibration results
0.05 0.1 ttm: 4, L=0.053765
ttm: 5, L=0.055473
0.25
0.25
0.25
0.2
0.2
0.2
0.15
0.15
0.15 0.05 0.1 ttm: 6, L=0.056694
0.05 0.1 ttm: 7, L=0.057394
0.05 0.1 ttm: 8, L=0.057104
0.25
0.25
0.25
0.2
0.2
0.2
0.15
0.15
0.15
0.05 0.1 ttm: 9, L=0.056574
0.05 0.1 ttm: 10, L=0.057849
0.05
0.1
0.25 0.2
0.2
0.15
0.15 0.05
0.1
0.1
0.05
0.1
Figure 1. Calibration of the tanh-type volatilities sequential splitting scheme, as described in Section 4. The calibration is performed by combining a custom-written genetic algorithm, searching for global minima, with the Levenberg-Marquardt implementation from [34] to optimise locally. The model caplet values are calculated numerically, using 12 time steps per year and 2048 quasi-Monte Carlo paths, based on the direction vectors for Sobol′ sequences of Joe and Kuo [29]. All in all, 13 parameters are used to match 120 prices, and total calibration time is 14.5 minutes running on 16 cores of a Primergy RX200 S6 spotting 4 Intel Xeon CPU X5650 processor, each of which provides 6 cores. The calculation of 120 option prices takes about .5 seconds and therefore merits to be called efficient. We are able to match the market volatilites taken from [30] very well using the tanh-type volatilies. Only the error in the earlier time slices is significant, see Figure 1. This is typical for models without jumps. These are well known to misprice options close to maturity. This behaviour can also be connected to the short end of interest rates depending more on announcements by central banks than random fluctuations. With respect to the martingale property of traded assets, numerical calculations show that bond prices and LIBOR rates satisfy the expected value property to a very high precision already using 2048 quasi-Monte Carlo paths. 6.2. Pricing. As an application, we price an at the money payer swaption with a time to maturity of T = 5 years, where the underlying swap pays out quarter annually over three years, i.e., at the times Ti = T + iδ for i = 1, . . . , 12 and
EFFICIENT NUMERICS FOR HJM MODELS
15
δ = .25. A reference computation with 16384 paths and 120 time steps per year yields the value 0.0281579. Using 2048 paths and 12 time steps per year, as in the calibration, we obtain 0.028074. The relative error is thus approximately .003. As the calculation of the coarser approximation takes .25 seconds, we have established the efficiency of the suggested method. 7. Conclusions We introduce an analytic setup for the analysis of weak approximation methods for stochastic partial differential equations. The Heath-Jarrow-Morton equation of interest theory is shown to be included in the class where this approach is applicable. Moreover, the set of admissible test functions contains important payoffs such as caplets and swaptions. We argue that higher-order weak approximation schemes can be used together with QMC algorithms to obtain an efficient pricing method, which is even superior to multi-level MC. The efficiency of our numerical method is proved by the calibration of the model to given caplet data. Appendix A. Functional analytic results Proposition 16. Let (X, h·, ·iX ), (Y, h·, ·iY ) be separable Hilbert spaces with norms k·kX and k·kY such that Y is compactly and densely embedded into X. Then, there exists an orthonormal basis (en )n∈N ⊂ Y of X that is simultaneously orthogonal in Y . Furthermore, limn→∞ ken k−1 Y = 0. Proof. By the Riesz representation theorem, there exists a bounded operator κ : X → Y such that (45)
hκx, yiY = hx, yiX
for all x ∈ X and y ∈ Y .
With ι : Y → X the compact embedding, we set K := ικ. K is clearly compact and also symmetric, as (46)
hKx1 , x2 iX = hκx1 , κx2 iY = hx1 , Kx2 iX .
Thus, there exists an orthonormal basis (en )n∈N of X and a sequence (λn )n∈N ⊂ R decreasing monotonically to zero such that Ken = λn en , and we see that (en )n∈N ⊂ Y . We obtain (47)
−1 −1 hen , em iY = λ−1 n hKen , em iY = λn hen , em iX = λn δn,m −1/2
whence (en )n∈N is orthogonal in Y and ken kY = λn
for n, m ∈ N,
, and the claim is proved.
Corollary 17. Under the assumptions of Proposition 16, let πN denote the Xorthogonal projection onto XN := span {en : n ∈ N}. Then, (48)
sup ky − πN ykX = 0.
lim
n→∞
y∈Y kykY ≤1
Proof. By Parseval’s identity, ky − πN yk2X = (49)
∞ X
hy, en i2X =
n=N +1
≤ sup λn n>N
∞ X
∞ X
hy, Ken i2Y
n=N +1
2 hy, λ1/2 n en iY ≤ λN +1 kykY ,
n=N +1
−1/2 λn
where we apply that ken kY = and that (en )n∈N is orthogonal in Y . As (λn )n∈N decreases to zero, the claim follows.
16
¨ PHILIPP DORSEK AND JOSEF TEICHMANN
Acknowledgements. The numerical calculations were performed on the computing facilities of the Departement Mathematik of ETH Z¨ urich. Parts of the computer implementation were written by Dejan Veluˇsˇcek, whom the authors thank for his support. Financial support from the ETH Foundation is gratefully acknowledged. References 1. A. Bensoussan, Splitting up method in the context of stochastic PDE, Stochastic partial differential equations and their applications (Charlotte, NC, 1991), Lecture Notes in Control and Inform. Sci., vol. 176, Springer, Berlin, 1992, pp. 22–31. MR 1176767 2. A. Bensoussan and R. Glowinski, Approximation of Zakai equation by the splitting up method, Stochastic systems and optimization (Warsaw, 1988), Lecture Notes in Control and Inform. Sci., vol. 136, Springer, Berlin, 1989, pp. 257–265. MR 1180784 3. A. Bensoussan, R. Glowinski, and A. R˘ a¸scanu, Approximation of the Zakai equation by the splitting up method, SIAM J. Control Optim. 28 (1990), no. 6, 1420–1431. MR 1075210 (91m:65243) 4. , Approximation of some stochastic differential equations by the splitting up method, Appl. Math. Optim. 25 (1992), no. 1, 81–106. MR 1133253 (92k:60139) 5. Ren´ e A. Carmona and Michael R. Tehranchi, Interest rate models: an infinite dimensional stochastic analysis perspective, Springer Finance, Springer-Verlag, Berlin, 2006. MR 2235463 (2008a:91001) 6. Sandra Cerrai, A Hille-Yosida theorem for weakly continuous semigroups, Semigroup Forum 49 (1994), no. 3, 349–367. MR 1293091 (95f:47058) 7. Jakob Creutzig, Steffen Dereich, Thomas M¨ uller-Gronbach, and Klaus Ritter, Infinitedimensional quadrature and approximation of distributions, Found. Comput. Math. 9 (2009), no. 4, 391–429. MR 2519865 (2010h:65027) 8. Giuseppe Da Prato and Jerzy Zabczyk, Second order partial differential equations in Hilbert spaces, London Mathematical Society Lecture Note Series, vol. 293, Cambridge University Press, Cambridge, 2002. MR 1985790 (2004e:47058) 9. Arnaud Debussche, Weak approximation of stochastic partial differential equations: the nonlinear case, Math. Comp. 80 (2011), no. 273, 89–117. MR 2728973 (2011j:65014) 10. Philipp D¨ orsek, Numerical Methods for Stochastic Partial Differential Equations, Ph.D. thesis, Vienna University of Technology, October 2011. , Semigroup Splitting And Cubature Approximations For The Stochastic Navier-Stokes 11. Equations, ArXiv e-prints (2011). 12. Philipp D¨ orsek and Josef Teichmann, A Semigroup Point Of View On Splitting Schemes For Stochastic (Partial) Differential Equations, ArXiv e-prints (2010). 13. Klaus-Jochen Engel and Rainer Nagel, One-parameter semigroups for linear evolution equations, Graduate Texts in Mathematics, vol. 194, Springer-Verlag, New York, 2000, With contributions by S. Brendle, M. Campiti, T. Hahn, G. Metafune, G. Nickel, D. Pallara, C. Perazzoli, A. Rhandi, S. Romanelli and R. Schnaubelt. MR MR1721989 (2000i:47075) 14. Damir Filipovi´ c, Consistency problems for Heath-Jarrow-Morton interest rate models, Lecture Notes in Mathematics, vol. 1760, Springer-Verlag, Berlin, 2001. MR 1828523 (2002e:91001) , Term-structure models, Springer Finance, Springer-Verlag, Berlin, 2009, A graduate 15. course. MR MR2553163 16. Damir Filipovi´ c and Josef Teichmann, On the geometry of the term structure of interest rates, Proc. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci. 460 (2004), no. 2041, 129–167, Stochastic analysis with applications to mathematical finance. MR 2052259 (2005b:60145) 17. Patrick Florchinger and Fran¸cois Le Gland, Time-discretization of the Zakai equation for diffusion processes observed in correlated noise, Stochastics Stochastics Rep. 35 (1991), no. 4, 233–256. MR 1113256 (92i:60139) 18. Michael B. Giles, Multilevel Monte Carlo path simulation, Oper. Res. 56 (2008), no. 3, 607– 617. MR 2436856 (2009g:65008) 19. Michael B. Giles and Benjamin J. Waterhouse, Multilevel quasi-Monte Carlo path simulation, Advanced financial modelling, Radon Ser. Comput. Appl. Math., vol. 8, Walter de Gruyter, Berlin, 2009, pp. 165–181. MR 2648461 (2011c:91261) 20. Istv´ an Gy¨ ongy, Approximations of stochastic partial differential equations, Stochastic partial differential equations and applications (Trento, 2002), Lecture Notes in Pure and Appl. Math., vol. 227, Dekker, New York, 2002, pp. 287–307. MR 1919514 (2003b:60096) 21. Istv´ an Gy¨ ongy and Nicolai Krylov, On the rate of convergence of splitting-up approximations for SPDEs, Stochastic inequalities and applications, Progr. Probab., vol. 56, Birkh¨ auser, Basel, 2003, pp. 301–321. MR 2073438 (2005f:65012)
EFFICIENT NUMERICS FOR HJM MODELS
22. 23. 24. 25. 26.
27. 28.
29. 30. 31.
32.
33.
34.
35.
36.
37.
38.
39.
40.
41. 42.
43.
17
, On the splitting-up method and stochastic partial differential equations, Ann. Probab. 31 (2003), no. 2, 564–591. MR 1964941 (2004c:60182) , Expansion of solutions of parameterized equations and acceleration of numerical methods, Illinois J. Math. 50 (2006), no. 1-4, 473–514 (electronic). MR 2247837 (2008c:65003) , Accelerated numerical schemes for pdes and spdes, Stochastic Analysis 2010 (Dan Crisan, ed.), Springer Berlin Heidelberg, 2011, pp. 131–168. Eskil Hansen and Alexander Ostermann, Exponential splitting for unbounded operators, Math. Comp. 78 (2009), no. 267, 1485–1496. MR MR2501059 David Heath, Robert Jarrow, and Andrew Morton, Bond pricing and the term structure of interest rates: A new methodology for contingent claims valuation., Econometrica 60 (1992), no. 1, 77–105 (English). Stefan Heinrich, Multilevel Monte Carlo methods., Berlin: Springer, 2001 (English). Kazufumi Ito and Boris Rozovskii, Approximation of the Kushner equation for nonlinear filtering, SIAM J. Control Optim. 38 (2000), no. 3, 893–915 (electronic). MR 1756900 (2001b:93074) Stephen Joe and Frances Y. Kuo, Constructing Sobol′ sequences with better two-dimensional projections, SIAM J. Sci. Comput. 30 (2008), no. 5, 2635–2654. MR 2429482 (2009j:65066) Wolfgang Kluge, Time-inhomogeneous l´ evy processes in interest rate and credit risk models, Ph.D. thesis, University of Freiburg, 2005. Shigeo Kusuoka, Approximation of expectation of diffusion process and mathematical finance, Taniguchi Conference on Mathematics Nara ’98, Adv. Stud. Pure Math., vol. 31, Math. Soc. Japan, Tokyo, 2001, pp. 147–165. MR 1865091 (2003k:60198) , Approximation of expectation of diffusion processes based on Lie algebra and Malliavin calculus, Advances in mathematical economics. Vol. 6, Adv. Math. Econ., vol. 6, Springer, Tokyo, 2004, pp. 69–83. MR MR2079333 (2005h:60124) Fran¸cois Le Gland, Splitting-up approximation for SPDEs and SDEs with application to nonlinear filtering, Stochastic partial differential equations and their applications (Charlotte, NC, 1991), Lecture Notes in Control and Inform. Sci., vol. 176, Springer, Berlin, 1992, pp. 177–187. MR 1176783 M.I.A. Lourakis, levmar: Levenberg-marquardt nonlinear least squares algorithms in C/C++, [web page] http://www.ics.forth.gr/~lourakis/levmar/, Jul. 2004, [Accessed on 14 Jun. 2011.]. Terry Lyons and Nicolas Victoir, Cubature on Wiener space, Proc. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci. 460 (2004), no. 2041, 169–198, Stochastic analysis with applications to mathematical finance. MR MR2052260 (2005b:35306) Mariko Ninomiya, Application of the Kusuoka approximation with a tree-based branching algorithm to the pricing of interest-rate derivatives under the HJM model, LMS J. Comput. Math. 13 (2010), 208–221. MR 2669158 (2011d:65020) Mariko Ninomiya and Syoiti Ninomiya, A new higher-order weak approximation scheme for stochastic differential equations and the Runge-Kutta method, Finance Stoch. 13 (2009), no. 3, 415–443. MR 2519839 (2010f:65013) Syoiti Ninomiya and Nicolas Victoir, Weak approximation of stochastic differential equations and application to derivative pricing, Appl. Math. Finance 15 (2008), no. 1-2, 107–121. MR MR2409419 (2009d:60227) A. R˘ a¸scanu and C. Tudor, Approximation of stochastic equations by the splitting up method, Qualitative problems for differential equations and control theory, World Sci. Publ., River Edge, NJ, 1995, pp. 277–287. MR 1372759 (96m:60131) Michael R¨ ockner and Zeev Sobol, Kolmogorov equations in infinite dimensions: wellposedness and regularity of solutions, with applications to stochastic generalized Burgers equations, Ann. Probab. 34 (2006), no. 2, 663–727. MR 2223955 (2007b:35323) Gilbert Strang, Accurate partial difference methods. I. Linear Cauchy problems, Arch. Rational Mech. Anal. 12 (1963), 392–402. MR 0146970 (26 #4489) M. Sun and R. Glowinski, Pathwise approximation and simulation for the Zakai filtering equation through operator splitting, Calcolo 30 (1993), no. 3, 219–239 (1994). MR 1353268 (96g:93067) Josef Teichmann, Another approach to some rough and stochastic partial differential equations, Stoch. Dynam. 11 (2011), no. 2–3, 535–550.