Estimating the Wishart Affine Stochastic Correlation

Seediscussions,stats,andauthorprofilesforthispublicationat:https://www.researchgate.net/publication/228266805
EstimatingtheWishartAffineStochastic
CorrelationModelUsingtheEmpirical
CharacteristicFunction
ArticleinStudiesinNonlinearDynamics&Econometrics·August2012
DOI:10.2139/ssrn.1054721
CITATIONS
READS
16
217
3authors:
JoseDaFonseca
MartinoGrasselli
AucklandUniversityofTechnology
UniversityofPadova
23PUBLICATIONS488CITATIONS
25PUBLICATIONS586CITATIONS
SEEPROFILE
SEEPROFILE
FlorianIelpo
UniversitédeParis1Panthéon-Sorbonne
44PUBLICATIONS195CITATIONS
SEEPROFILE
Allin-textreferencesunderlinedinbluearelinkedtopublicationsonResearchGate,
lettingyouaccessandreadthemimmediately.
Availablefrom:FlorianIelpo
Retrievedon:11August2016
Estimating the Wishart Affine Stochastic Correlation Model
using the Empirical Characteristic Function∗
José Da Fonseca†
Martino Grasselli‡
Florian Ielpo§
First draft: November 27, 2007
This draft: November 10, 2008
Abstract
This paper provides the first estimation strategy for the Wishart Affine Stochastic
Correlation (WASC) model. We provide elements to show that the utilization of empirical characteristic function-based estimates is advisable: this function is exponential
affine in the WASC case. We use a GMM estimation strategy with a continuum of
moment conditions based on the characteristic function. We present the estimation
results obtained using a dataset of equity indexes. The WASC model captures most
of the known stylized facts associated with financial markets, including the leverage
and asymmetric correlation effects.
Keywords: Wishart Process, Empirical Characteristic Function, Stochastic Correlation,
Generalized Method of Moments.
∗
Acknowledgements: We are particularly indebted to Marine Carrasco for remarkable insights and
helpful comments. We are also grateful to Christian Gourieroux, Fulvio Pegoraro, François-Xavier Vialard
and the CREST seminar participants for useful remarks. We are thankful to the seminar participants of
the 14th International Conference on Computing in Economics and Finance, Paris, France (2008), the 11th
conference of the Swiss Society for Financial Market Research, Zürich (2008), Mathematical and Statistical
Methods for Insurance and Finance, Venice, Italy (2008), the 2nd International Workshop on Computational and Financial Econometrics, Neuchâtel, Switzerland (2008), the First PhD Quantitative Finance
Day, Swiss Banking Institute, Zürich (2008), Inference and tests in Econometrics, in the honor of Russel
Davidson, Marseille, France (2008), the Inaugural conference of the Society for Financial Econometrics
(SoFie), New York, USA (2008), the 28th International Symposium on Forecasting, Nice, France (2008),
the ESEM annual meeting, Milano, Italy (2008), the Oxford-Man Institute of Quantitative Finance Vast
Data Conference, Oxford, UK (2008), the Courant Institute Mathematical Finance seminar, New York,
USA (2008) and the Bloomberg Seminar, New York, USA (2008) for their comments and remarks. Any
errors remain ours.
†
Ecole Supérieure d’Ingénieurs Léonard de Vinci, Département Mathématiques et Ingénierie Financière,
92916 Paris La Défense, France. Email: jose.da [email protected] and Zeliade Systems, 56, Rue JeanJacques Rousseau, 75001 Paris.
‡
Università degli Studi di Padova , Dipartimento di Matematica Pura ed Applicata, Via Trieste 63,
Padova, Italy. E-mail: [email protected] and ESILV.
§
Pictet & Cie, Route des Acacias 60, CH-1211 Genève 73. E-mail: [email protected].
1
Electronic copy available at: http://ssrn.com/abstract=1054721
1
Introduction
The estimation of continuous time processes under the physical measure attracted a lot
of attention over the few past years, and several estimation strategies have been proposed
in the literature. When the transition density is known in closed form, it is possible to
perform a maximum likelihood estimation of the diffusion parameters, as presented e.g.
in Lo (1988). However, the number of models for which the transition density is known
in a closed form expression is somewhat limited. Moreover, the existence of unobserved
factors such as the volatility process in the Heston (1993) model makes it difficult – if not
impossible – to estimate such models using a conditional maximum likelihood approach.
A possible solution consists in discretizing and simulating the unobservable process: for
example, Duffie and Singleton (1993) used the Simulated Methods of Moments to estimate financial Markov processes (methods of this kind are reviewed in Gouriéroux and
Monfort, 1996). However, as pointed out in Chacko and Viceira (2003), even though these
methods are straightforward to apply, it is difficult to measure the numerical errors due
to the discretization. What is more, the computational burden of this class of methods
precludes its use for multivariate processes.
For the special class of affine models, another estimation strategy can be used. The
affine models present tractable exponentially affine characteristic functions that can in
turn be used to estimate the parameters under the historical measure. Singleton (2001)
and Singleton (2006) present a list of possible estimation strategies that can be applied
to recover these parameters from financial time series, using the characteristic function of
the process. Methodologies of this kind have been applied to one-dimensional processes,
like the Cox-Ingersoll-Ross process (e.g. Zhou, 2000), the Heston process and a mixture
of stochastic volatility and jump processes (e.g. Jiang and Knight, 2002, Rockinger and
Semenova, 2005 and Chacko and Viceira, 2003) and affine jump diffusion models (e.g.
Yu, 2004), yielding interesting results. Still, it involves additional difficulties. First, as
remarked in Jiang and Knight (2002) and Rockinger and Semenova (2005), numerically
integrating the characteristic function of a vector of the state variable is computationally
intensive. In our multivariate case the state variable is already a vector: for this type of
methodology, the integral discretization is likely to lead to numerical errors. Second, with
the Spectral GMM method presented in Chacko and Viceira (2003), the use of a more
limited number of points of the characteristic function settles the numerical problem, but
leads to a decrease in estimates efficiency. Carrasco and Florens (2000) and Carrasco
2
Electronic copy available at: http://ssrn.com/abstract=1054721
et al. (2007) present a method that uses a continuum of moment conditions built from the
characteristic function. With this method, the estimates obtained reach the efficiency of
the maximum likelihood method, thanks to the instrument used in this strategy. These
features make this methodology particularly well-suited for the estimation of affine multivariate continuous time processes.
Here, we propose to use a Spectral Generalized Method of Moments estimation strategy to
estimate the Wishart Affine Stochastic Correlation model, an affine multivariate stochastic volatility and correlation model introduced in Da Fonseca et al. (2007). Based on the
previous models of Gourieroux and Sufana (2004), this affine model can be regarded as a
multivariate version of the Heston (1993) model: in fact, the volatility matrix is assumed
to evolve according to the Wishart dynamics (mathematically developed by Bru, 1991),
the matrix analogue of the mean reverting square root process. In addition to the Heston
model, it allows for a stochastic conditional correlation, which makes it very promising
process for financial applications. Buraschi et al. (2006) independently proposed a related
model corresponding to a constrained correlation version of the WASC model.
Multivariate stochastic volatility models have recently attracted a great deal of attention.
Bauwens et al. (2006) and Asai et al. (2006) present a survey of the existing models, along
with estimation methodologies. When compared to the previously mentioned processes,
several important differences must be underlined. (1) The volatility being a latent factor,
the observable state variable (the asset log returns) is not Markov anymore and the ML
efficiency cannot be reach. (2) What is more, as we discuss it in the paper, since the
process involves latent volatilities and correlations, the instrument must be set to be equal
to one for the usual GMM methodology to be used. This precludes the use of the Double
Index instruments procedure presented in Carrasco et al. (2007). (3) Since correlations
are also stochastic, there are more latent variables than in the stochastic volatility models,
making simulation-based methods useless. (4) Finally, the dimensionality of the problem
makes the characteristic function difficult to invert.
In view of these difficulties, we propose to estimate the WASC model using its characteristic function, following an approach that is closed to the ones presented in Chacko
and Viceira (2003) and Carrasco et al. (2007). We present a Monte Carlo investigation of
the estimates’ behavior in a small sample and we discuss the empirical results obtained
using a real dataset composed of the prices of the SP500, FTSE, DAX and CAC 40. Our
3
results unfold as follow. (1) The estimated WASC parameters are comparable to what
is obtained in the univariate empirical literature. (2) Thanks to its ability to describe
dynamic correlation, the WASC model can encompass most of the desired features of financial markets: it reveals asymmetric correlation and leverage effects. (3) Our estimates
reject systematically the particular correlation structure chosen by Gourieroux and Sufana
(2004) and Buraschi et al. (2006), favoring the flexibility of the specification presented in
Da Fonseca et al. (2007).
The paper is organized as follows. First we present the WASC process, along with the
computation of its conditional characteristic function. In Section 3, we present the estimation methodology used in this paper and briefly review the main theoretical results.
Finally, in Section 4, we present the estimation results obtained with both simulated and
real datasets and discuss their interpretation.
2
The model
In this section, we present the Wishart Affine Stochastic Correlation model introduced in
Da Fonseca et al. (2007): we detail the diffusion that drives this multidimensional process
and present the conditional characteristic function together with its derivatives.
2.1
The dynamics
The Wishart Affine Stochastic Correlation (WASC) model is a new continuous time process
that can be considered as a multivariate extension of the Heston (1993) model, with a more
accurate correlation structure. The framework of this model was introduced in Gourieroux
and Sufana (2004). It relies on the following assumption.
Assumption 2.1. The evolution of asset returns is conditionally Gaussian while the
stochastic variance-covariance matrix follows a Wishart process.
In formulas, we consider a n-dimensional risky asset St whose risk-neutral dynamics are
given by
p
dSt = diag[St ] µdt + Σt dZt ,
(1)
where µ is the vector of returns and Zt ∈ Rn is a vector Brownian motion. Following
Gourieroux and Sufana (2004), we assume that the quadratic variation of the risky assets
4
is a matrix Σt which is assumed to satisfy the following dynamics:
p
p
dΣt = ΩΩ> + M Σt + Σt M > dt + Σt dWt Q + Q> (dWt )> Σt ,
(2)
with Ω, M, Q ∈ Mn , Ω invertible, and Wt ∈ Mn a matrix Brownian motion (> denotes
transposition). In the present framework we assume that the above dynamics are inferred
from observed asset price time series, hence the stochastic differential equation is written
under the historical measure.
Equation (2) characterizes the Wishart process introduced by Bru (1991), and represents
the matrix analogue of the square root mean-reverting process. In order to ensure the
strict positivity and the typical mean-reverting feature of the volatility, the matrix M is
assumed to be negative semi-definite, while Ω satisfies
ΩΩ> = βQ> Q,
β > n − 1,
(3)
with the real parameter β > n − 1 (see Bru, 1991 p. 747).
In full analogy with the square-root process, the term ΩΩ> is related to the expected longterm variance-covariance matrix Σ∞ through the solution to the following linear equation:
−ΩΩ> = M Σ∞ + Σ∞ M > .
(4)
Moreover, Q is the volatility of the volatility matrix, and its parameters will be crucial in
order to explain some stylized observed effects in equity markets.
Last but not least, Da Fonseca et al. (2007) proposed a very special yet tractable correlation structure that is able to accommodate the leverage effects found in financial time
series and option prices. Since it is well known that it is possible to approximately reproduce observed negative skewness within the Heston (1993) model by allowing for negative
correlation between the noise driving returns and the noise driving variance, they proposed
the following assumption:
Assumption 2.2. The Brownian motions of the asset returns and those driving the covariance matrix are linearly correlated.
Da Fonseca et al. (2007) proved that Assumption 2.2. leads to the following relation:
dZt = dWt ρ +
p
1 − ρ> ρdBt ,
5
(5)
with Zt = (dZ1 , dZ2 , . . . , dZn )> , B a vector of independent Brownian motions orthogonal
to W , as defined in equation (2), and ρ = (ρ1 , ρ2 , . . . , ρn )> .
With this specification, the model is able to generate negative skewness, given the possibly
negative correlation between the noise driving the log returns of the assets and the matrixsized noise perturbating the covariance matrix. This is easy to show in the special case of
two assets (n = 2), fow which the variance-covariance matrix is given by


12
Σ11
Σ
t
t 
Σt = 
.
12
Σt Σ22
t
(6)
The correlations between assets’ returns and their volatilities admit a closed form expression, highlighting the impact of the ρ parameters on its value and positivity:
ρ1 Q11 + ρ2 Q21
(7)
corr d log S1 , dΣ11 = p 2
Q11 + Q221
ρ1 Q12 + ρ2 Q22
corr d log S2 , dΣ22 = p 2
,
(8)
Q12 + Q222
√
√
where we recall that Σ11 (resp. Σ22 ) represents the volatility of the first (resp. second)
asset. Therefore, the sign and magnitude of the skew effects are determined by both the
matrix Q and the vector ρ. When Q is diagonal, we obtain the following skews for asset
1 and 2:
corr d log S1 , dΣ11 = ρ1
corr d log S2 , dΣ22 = ρ2 ,
(9)
thus allowing a negative skewness for each asset whenever ρi < 0, ∀i (see Gourieroux and
Jasiak (2001) on this point). This correlation structure is similar the one obtained in an
Heston model.
Other less general specifications close to the WASC model, actually nested within the
WASC correlation structure, have been proposed in the literature. First, Gourieroux and
Sufana (2004) imposed ρ = 0n ∈ Rn , a choice that leads to a zero correlation case (see
equations (5), (7) and (8)) by analogy with the well known properties of the Heston model.
With this specification, the log returns’ univariate distribution is symmetric.
Second, Buraschi et al. (2006) proposed to impose ρ = (1, 0)> . Their model is thus able
to display negative skewness for asset 1 (resp. asset 2), depending on the positivity of
Q11 (resp. Q12 ). This model is actually close to the WASC and is able to display similar
6
features. Their choice of ρ is less restrictive than it seems, in so far as this parameter
is only defined up to a rotation. Thus, their hypothesis is reduced to ||ρ|| = 1. With
these settings, the vector-sized noise in the returns is fully generated by the Brownian
motions of the covariances W . This hypothesis having no a priori justification, the WASC
model of Da Fonseca et al. (2007) eliminates it, assuming the more general correlation
structure compatible with an exponential affine characteristic function (see Proposition 1
in Da Fonseca et al. (2007) on this point).
2.2
The Characteristic functions
In the WASC model, the characteristic function of Σt and Yt = log St is an exponential
affine function of the state variables. For the log returns, the characteristic function of
Yt+τ conditional on Yt and Σt is denoted:
h
i
ΦYt ,Σt (τ, ω) = E eihω,Yt+τ i |Σt , Yt ,
(10)
where E[.|Σt , Yt ] denotes the conditional expected value with respect to the historical
measure, ω ∈ Rn , i2 = −1 and h., .i is the scalar product in Rn .
Proposition 2.1. (Da Fonseca et al., 2007) The characteristic function of the asset returns in the WASC model is given by
h
i
ΦYt ,Σt (τ, ω) = E eihω,Yt+τ i |Σt , Yt
(11)
= exp {Tr [A(τ )Σt ] + hiω, Yt i + c(τ )} ,
(12)
where ω = (ω1 , . . . , ωn )> ∈ Rn and the deterministic function A(t) ∈ Mn is as follows:
A (τ ) = A22 (τ )−1 A21 (τ ) ,
(13)
with




Q> ρiω >
−2Q> Q

 A11 (τ ) A12 (τ ) 
 M+
.

 = exp τ 



 P
n
1
>−
> + iωρ> Q
(iω)(iω)
iω
e
−
M
A21 (τ ) A22 (τ )
j jj
j=1
2
(14)
7
The function c(τ ) can be obtained by direct integration, thus giving:
i
h
i
β h
c(τ ) = − Tr log (A22 (τ )) + τ M > + τ iω(ρ> Q) + τ Tr µiω > .
2
(15)
The characteristic function of the Wishart process is defined as:
h
i
ΦΣt (τ, ∆) = E eiTr[∆Σt+τ ] |Σt ,
(16)
where ∆ ∈ Mn .
Proposition 2.2. (Da Fonseca et al., 2007) Given a real symmetric matrix D, the conditional characteristic function of the Wishart process Σt is given by:
h
i
ΦΣt (τ, ∆) = E eiTr[∆Σt+τ ] |Σt
= exp {Tr [B(τ )Σt ] + C(τ )} ,
(17)
where the deterministic complex-valued functions B(τ ) ∈ Mn (Cn ), C(τ ) ∈ C are given by
B (τ ) = (i∆B12 (τ ) + B22 (τ ))−1 (i∆B11 (τ ) + B21 (τ ))
(18)
i
β h
C(τ ) = − Tr log(i∆B12 (τ ) + B22 (τ )) + τ M > ,
2
with



 B11 (τ ) B12 (τ ) 
 M

 = exp τ 



0
B21 (τ ) B22 (τ )

−2Q> Q
−M >

.

What is more, these characteristic functions can be derived with respect to β and the
elements of M , Q and ρ, using the results of Daleckii (1974), on the derivative of a matrix
function. We provide detailed calculations of these derivatives in the Appendix.
3
Spectral GMM in the WASC setting
In this section, we present the detailed estimation methodology used for the WASC model.
In this paper, we are in the special setting where the correlation and variance processes are
unobserved. In this case, the feasible estimation strategies are: (1) to filtrate covariances
8
out of return time series either (a) using DCC estimates, (b) a GARCH-like discretization
of the continuous time process or (c) a linearized Kalman filter; (2) to estimate the process
using the conditional characteristic function. We favor the second type of methodologies
since (1) DCC-based estimates of the ρ parameter are biased1 and (2) any type of discretization or linearization will lead to additional estimation errors. In the WASC case,
the characteristic function is known in a closed form expression, thus being a very suitable
tool for the estimation of vector-processes, especially when compared to simulation-based
estimators. For further discussion on the estimation strategies of Wishart-based models,
see Gourieroux (2006).
Recent articles presented estimation methodologies using the empirical characteristic function as an estimation tool, since this function has a tractable expression for many continuous time processes. In this section, we present how to estimate the WASC in this
framework, building on the approaches developed in Chacko and Viceira (2003) and Carrasco et al. (2007).
The usual way to present the generalized method of moments based on spectral moment
conditions unfold as follows. Let ht be the conditional moment condition such that
ht = eihw,Yt+τ −Yt i − Xt ,
(19)
with the notations developed earlier and Xt a stochastic process such that E[ht |Yt ] = 0.
Therefore Xt = E[eihw,Yt+τ −Yt i |Yt ]. Then the estimation can be based on unconditional
moment conditions of the form E[ht g(Yt )] = 0. However, this approach can not be implemented here because E[eihw,Yt+τ −Yt i |Yt ] does not have a known expression, principally
because the distribution of Σt given Yt is unknown. The solution we adopt is to use unconditional moment condition, which is equivalent to set the instrument g(Yt ) equal to one,
as in Chacko and Viceira (2003) (see page 272)2 . This setting stems from the fact that we
integrate the volatility out when computing X. Were Σt observable, a more general form
of instruments would be readily used.
‘
Now ht is simply
ht = eihw,Yt+τ −Yt i − E[eihw,Yt+τ −Yt i |Σ0 ]
1
We ran Monte Carlo test to prove this point empirically. The tables are available upon request.
2
We thank Marine Carrasco for pointing out this fact.
9
(20)
where the initial value of Σ0 is treated as an unknown parameter to be estimated. We
have
E[eihw,Yt+τ −Yt i |Σ0 ] = ec(τ ) E[ehA(τ ),Σt i |Σ0 ] = ec(τ ) ΦΣ0 (t, −iA(τ )),
(21)
with c(τ ) defined in equation (15), A(τ ) defined in equation (13) and ΦΣt (.) defined in
equation (17) (the conditional expectation (21) can also be computed using twice the
function (10)).
In order to increase the efficiency of our estimates, we use a continuum of moment conditions, as presented in Carrasco et al. (2007). Note that the fact that we set the instruments
to be equal to 1 naturally prevents us from reaching the ML efficiency of CGMM estimates
of Carrasco et al. (2007). Anyway, Yt conditionally upon its past is no longer a Markov
process, since the covariance matrix is unobservable. ML efficiency cannot be reach for
non-Markov process: the special instruments chosen here does not necessarily jeopardize
the estimation results.
Let now ĥ(.) be the sample mean of the moment condition, that is a function from R2n to
C. In an infinite conditions framework, Carrasco et al. (2007) showed that the objective
function to minimize is:
θ̂ = arg min kK −1/2 ĥ(θ)k,
θ
(22)
where K is the covariance operator, that is the counterpart of the covariance matrix in
finite dimension – as in standard GMM approach and k.k is the weighted norm
Z Z
2
kf k =
f (ω)f (ω)π(ω)dω
Rn
Rn
where π denotes any probability measure. As in Carrasco et al. (2007), we chose it to be
the normal distribution. Carrasco et al. (2007) showed that the covariance operator K
can be written as follows:
Z
Kf (ω) =
k(ω, τ )f (τ )π(τ )dτ
where the function k is the so called kernel of the integral operator K and is defined by:
k(ω, τ ) =
+∞
X
Eθ0 ht (ω; θ0 )ht−j (τ ; θ0 ) .
j=−∞
Since our approach is nested within the Carrasco et al. (2007)’s, we now thoroughly follow
their settings. Our approach can also be related to the methodology presented in Rockinger
10
and Semenova (2005). In order to construct an estimator of the covariance operator,
Carrasco et al. (2007) proposed a two-step procedure. The first step consists in finding:
θ̂1 = arg min kĥ(θ)k.
(23)
θ
Since ht is autocorrelated the second step consists in estimating the kernel k as follows:
T−1
X
T
j
k̂(ωs , ωr , ωv , ωw ) =
ω
Γ̂T (j),
(24)
T−q
ST
j=−T+1
with
Γ̂T (j) =



1
T
1
T
PT
t=j+1 ht (ωs , ωr , θ̂1 )ht−j (ωv , ωw , θ̂1 ), j
PT
≥0
,
(25)
t=−j+1 ht+j (ωs , ωr , θ̂1 )ht (ωv , ωw , θ̂1 ), j < 0,
where w(.) is any kernel satisfying some regularity conditions (see Carrasco et al. (2007)
Appendix A.6) and ST is a bandwidth parameter.
Once the covariance operator is estimated, the minimization in equation (22) requires
the computation of the inverse of K. Unfortunately, K has typically a countable infinity
of eigenvalues decreasing to zero, so that its inverse is not bounded. We need then to
regularize the inverse of K, which can be done by replacing K by a nearby operator that
has a bounded inverse, due to the presence of a penalizing term. Carrasco et al. (2007)
used the Tikhonov approximation of the generalized inverse of K. Let α be a strictly
positive parameter, then K −1 is replaced by:
(K α )−1 = (K 2 + αI)−1 K.
(26)
As outlined in Carrasco et al. (2007), the choice of α is important but does not jeopardize
the consistency of the estimates. Carrasco and Florens (2000) investigated an empirical
method to select its value, and the optimal value for it should represent a trade-off between the instability of the generalized inverse (for small values of α) and the distance
from the true inverse as α increases. Furthermore we found much more convenient to
compute (K α )−1 using the Cholesky’s decomposition than the spectral decomposition: it
is sufficient for the evaluation of equation (22) and avoids the numerically difficult problem
of eigenvectors computation and requires the discretization of the integrals.
Under mild regularity conditions (conditions A.1. to A.5. in Carrasco et al., 2007), it can
be proved that the optimal C-GMM estimator of θ is obtained by:
θ̂ = arg min k(KTα )−1/2 ĥT (θ)k
θ
11
(27)
and is asymptotically Normal with
√
5/4
as T and T a αT
L
T (θ̂T − θ0 ) → N 0, (hE θ0 (∇θ h), E θ0 (∇θ h)iK )−1 ,
(28)
go to infinity and α goes to zero. (∇θ h denotes the Jacobian matrix of
h(.)).
Finally, it is important to mention that Carrasco et al. (2007) present a matrix-based
version of their estimation method that may be more appealing than the one presented
here for a WASC model based on more than two assets or for other models.
4
Empirical Results
We now review the empirical results obtained with the aforementioned estimation methodology. First, we provide insight into the model and the parameters interpretation. Then
we review the results of a Monte Carlo experiment investigating the empirical behavior
of the estimation methodology. Finally, we present the estimates obtained using equity
indexes and discuss the results obtained.
Before moving to the detailed presentation of the results, it is noteworthy to mention that
with this type of model, no forecasting exercise can be performed for two main reasons.
First, with this kind of continuous time stochastic covariances process and the chosen
estimation strategy, we are restricted to the estimation of the parameters driving the
process: we cannot filtrate correlation or volatility time series out of returns and hence
forecast these quantities. Second, since the volatility and correlation are unobserved on
financial markets, it would naturally be impossible to compare – when existing – any
forecast to ”true” values. For these reasons, we cannot perform any test of the model
based on forecasts.
4.1
Preliminary considerations
Unlike the Heston (1993) model, the Wishart Affine Stochastic Correlation model is a new
model for which the parameters interpretation is not immediate. Such an interpretation is
however essential to the understanding of the model and for its estimation. For the sake of
simplicity, we focus on the case where n = 2, i.e. the case for which we observe two assets.
Yt is the vector containing the log of the asset prices, and Σt is its covariance matrix given
12
by equation (6). Yt1 being the log return of the first asset, its volatility is given by
p
Σ11
t .
In the WASC framework, individual parameters can hardly be interpreted on their own:
on the contrary, combinations of these parameters have standard financial interpretations,
such as the mean-reverting parameter or the volatility of volatility. Now, we review the
computation of these quantities.
For the first asset, the quadratic variation of the volatility can be computed as follow:
2
2
dhΣ11 , Σ11 it = 4Σ11
t (Q11 + Q21 )dt.
(29)
Therefore the first column of Q parametrizes the volatility of volatility of the first asset.
Similar results can be obtained for the second asset.
Then, as presented in Section 2,
Q11 ρ1 + Q21 ρ2
,
corr dY1 , dΣ11 = p 2
Q11 + Q221
(30)
where corr(.) is the correlation coefficient. As already mentioned, the short term behavior
of the smile and the skewness effect heavily depend on the correlation structure given by
the vector ρ. If Q and ρ are such that this quantity is negative, then the volatility of S 1 will
rise in response to negative shocks in returns of this asset. We expect this correlation to
be large and negative, in order to account for the large skewness found in financial datasets.
The Gindikin coefficient β insures the positiveness of the Wishart process. What is more,
an increase of it will shift the distribution of the smallest eigenvalue to positive values.
Thus, this parameter can be interpreted as a global variance shift factor. From equations
(3) and (4), if β is multiplied by a factor α, the long term covariance matrix Σ∞ will
be multiplied by the same factor. β also impacts the mean reverting and variance of the
correlation process. The higher the β parameter and the lower the persistence and the
variance of the correlation process. Thus, there is a trade-off in the WASC model between
volatility of the returns and volatility of the correlation process.
The M matrix can be compared to the mean reverting parameter in the Cox-IngersollRoss model. Like for the parameters previously investigated, the elements of this matrix
can hardly be interpreted directly. However, we can compute in a closed form expression
13
the drift part of the dynamics of Σij . In the case of the first asset:
"
#
p
22
Σ
11
dΣ11
2M11 + 2M12 p t ρ12
+ ...,
t = . . . + Σt
t
Σ11
t
(31)
where ρ12
t is the instantaneous correlation between the log-returns of the two assets. Thus,
the mean reverting parameter for Σ11 is a combination of the elements of M . What is
more, this drift term
√ 22is made of two parts: a deterministic part (2M11 ) and a stochastic
Σ
correction (2M12 √ t11 ρ12
t ), linked to the joint dynamics of both assets. Thus, the drift
term of
Σ11
t
Σt
is influenced by one of the off-diagonal elements of M . This feature cannot
be replicated by most of the multivariate GARCH-like models. We can perform similar
22
calculations for Σ12
t and Σt . These quantities can then be used to compare the half life
of the variances and covariance processes and thus evaluate their relative persistence in
financial markets.
The instantaneous correlation between assets has also a closed form expression:
q
2
12 2
12
dρ12
=
A
ρ
+
B
ρ
+
C
1 − ρ12
dt
+
(...)d(N oiset )
t
t t
t
t
t
t
(32)
22
with At , Bt , Ct recursive functions of Σ11
t , Σt and the model parameters. We present the
drift coefficients and the diffusion term in the Appendix. The drift associated to the correlation is quadratic, and the linear term has a negative coefficient Bt < 0, thus presenting
the typical mean reverting behavior of ρ12
t (at least around zero where the quadratic part
is negligible). The linear part can thus be used to analyze the persistence of the correlation
and its mean-reverting characteristics, during low correlation periods. When the absolute
value of the correlation is higher, the quadratic part of the drift get the upper hand and
the correlation process looses most of its persistence. By comparing the values of Bt and
At , when can thus compare the correlation behavior during low and high correlation cycles. This information has not been documented until now, whereas it is important to
understand the joint behavior of financial assets.
The WASC model can also be used to investigate potential contagion effects in financial
markets. By computing the correlation between the correlation process and the returns, we
can discuss under which condition the model is able to display an asymmetric correlation
effect3 . Asymmetric correlation effect leads correlation to go up whilst returns are getting
3
On asymmetric correlation effects, see Roll (1988) and Ang and Chen (2002).
14
down. As already noticed in Da Fonseca et al. (2007), we have:
s
2
Σ11
1 12
t
dhY , ρ it =
(1 − ρ12
) × (Q12 ρ1 + Q22 ρ2 ) .
t
22
{z
}
|
Σt
(33)
Sign of asset 2 skew
Thus, the sign of the skews determines the one of the covariance between correlation and
returns. Were the skew to be negative and the model would also display increases in
the correlation following negative returns. Thus, the WASC model is also able to display
an asymmetric correlation effect, whose sign is driven by the skewness associated to the
returns. Since the asset returns are negatively correlated to their own volatility (leverage
effect), we thus expect volatilities to be positively correlated to correlation: negative
returns periods correspond to both higher correlation and higher volatility periods. In
fact, simple computations given in Appendix lead to
s
12 11 Σ11
12 2
t
1
−
ρ
Q̃
Q̃
+
Q̃
d ρ ,Σ t =
12
11
22 dt.
t
Σ22
t
where Q̃ is the symmetric positive definite matrix associated to the polar decomposition
of Q4 . A positive value for Q̃12 would mean that the WASC model is able to accomodate
stylized effects of the type mentioned earlier. Due to the increase in the drift term of the
correlation dynamics, situation of this kind are expected not to last for long.
We now turn our attention toward a series of Monte Carlo experiments, so as to investigate
the empirical performance of the chosen estimation strategy.
4.2
Monte Carlo study
Following Carrasco et al. (2007), we present the results of a Monte Carlo study of the
CGMM estimation methodology applied to the WASC. We first present the technical details of the simulation and then we review the results obtained.
For the ease of the presentation, we restrict to the two-assets case. The parameters used
4
Any invertible matrix Q can be uniquely written as the product of a rotation matrix and a symmetric
positive definite matrix Q̃, see the Appendix.
15
in the simulation are the following:

Σ0 = 

M =
0.0225
−0.0054
−0.0054
0.0144
−5 −3


(34)


−3 −5
h
i
ρ = 0.3 0.4


0.1133137 0.03335871

Q=
0.0000000 0.07954368
(35)
β = 15.
(38)
(36)
(37)
The Q matrix is obtained by inverting the relation that links Q to M , Σ∞ and β:
Q> Q = −
o
1n
M Σ∞ + Σ∞ M > .
β
(39)
This ensure the stationarity of the correlation process. When Q is selected arbitrary and
given the mean reverting property of Σt , the first part of the simulated sample will be
tainted by the collapse of the process toward its long term average. Situations of this
kind should be discarded. The Figure 1 presents a simulated path for both volatilities
and correlation, using the previous parameter values. The figure displays mean-reverting
dynamics for each of these moments.
The Figure 2 shows the characteristic function used in the spectral GMM method used in
the paper, as presented in equation (??). The grid used for the numerical integration of
the objective function ranges on the real line from -300 to 300. We used Gaussian kernels
with appropriate variance parameter to maintain as much information as possible. The
integral is computed numerically using the Trapezoidal Rule that seemed to performed
well over the simulated dataset. The objective function is minimized using a simulated
annealing method, as described in Belisle (1992).
We present the results of different Monte Carlo experiments. Each of them comes out after
1000 iterations, but they differ by the length of the simulated sample and the sampling
frequency: daily, weekly and monthly. For each sampling frequency, we used two different
samples, one of which contains 500 observations and the other one 1500 observations. The
Table 6.2 presents the Mean Bias and the Root Mean Square Error (RMSE) obtained. We
did not reported the median bias insofar it was close to the median bias, thus indicating
16
that the estimators have a symmetric empirical distribution.
The results can be analyzed as follows. The Monte Carlo results obtained for Q, β and ρ
show that an increase in the sample depth globally results in a reduction of the variance of
the estimates. The bias obtained are small and not significative. For the weekly frequency,
ρ1 displays a noticeable difference as the variance of the estimate grows with the sample
size. This feature will have to be considered when analyzing the real dataset-based estimation results. The M parameter also presents this variance increase feature. However,
this behavior is not very surprising: a large number of articles emphasize the difficulties
involved by the estimation of the mean reverting parameter in continuous time diffusions
(see e.g. Gouriéroux and Monfort, 2007). The Monte Carlo results indicate that this mean
reverting parameter is estimated with less volatility with daily series. What is more, diagonal elements (resp. off-diagonal elements) of M are estimated with a small positive
(resp. negative) bias and thus may be underestimated (resp. overestimated) when working
with a real-time dataset. Finally, the correlation vector displays a remarkably small bias
and small RMSE for the daily datasets, even in the small sample version. This fact is
somewhat constant for each sampling frequency. This point is important for the WASC
model, given that we are interested in the analysis of the fine correlation structure implicit
in asset dynamics.
We now detail the empirical results obtained with stock indexes.
4.3
Estimation on stock indexes
In this last subsection, we present the empirical results obtained when estimating the
WASC using the C-GMM method on a real dataset. We used the following stock indexes:
SP500, FTSE, DAX and CAC 40. For each stock, the time series starts on January 2nd
1990 and ends on June 30th 2007. This period excludes the 1987 crash and the subprime
crisis. It nonetheless includes a lot of financial turmoils, as pointed in Rockinger and
Semenova (2005). The table 2 presents the descriptive statistics for the sample used in the
estimation, at a weekly sampling frequency. We used daily and weekly time series. We discarded the use of monthly ones since the sample would be far too small. In many articles
devoted to the estimation of continuous time models, the change in the sampling frequency
usually leads to an interesting analysis of the subtle dynamics of financial markets (see
e.g. Chacko and Viceira, 2003). Since the characteristic-function based estimators do
17
not suffer from discretization errors, we can actually use any sampling frequency. Like in
Carrasco et al. (2007), α = 0.02 were found to perform well. The integration grid is the
same as the one used for the previous simulation exercises. We chose to use the Bartlett
kernel for the GMM covariance matrix estimation, following the procedure presented in
Newey and West (1994).
For numerical sake, we focus again on a two-assets case (n = 2). We estimated the parameters driving the WASC process for the following couples of indexes: (SP500,FTSE),
(SP500,DAX), (SP500,CAC), (DAX/CAC), (DAX,FTSE) and (FTSE/DAX). This way,
we will be able to compare the characteristics specific to individual stock while estimated
with each of the others. For example, we will be able to compare the volatility of volatility
of the SP500, when it is estimated with the DAX, CAC and FTSE as a second asset. It
will highlight the impact of joint dynamics on idiosyncratic behaviors, which has hardly
be documented until now.
The estimation results are presented in table 3 for the daily results and in table 4 for the
weekly results. Most of the estimates are significative up to a 5 or 10% risk level. What
is more, in the weekly observation case, the size of the sample is strongly reduced and so
is the efficiency of the estimation method. Nevertheless, the estimation results yield interesting information both about the WASC process and the dynamics of the stock indexes.
As presented in the previous subsection, it is difficult to compare the individual parameters
and we will thus focus on combinations of these parameters most of which are comparable
with the ones of the Heston (1993) model.
For the estimated parameters presented in Table 3, the associated volatility of volatility
are presented in Table 5. The estimation of this quantity is essential to test the ability
of the model to capture financial market features: as pointed out in Chacko and Viceira
(2003), this parameter controls the kurtosis of the underlying process. Several remarks can
be made. First, the global results match what is generally expected from stock indexes.
Such markets are known to lead to a volatility of volatility ranging from 5% to 25%.
Second, the results obtained for the SP500 are remarkably stable when the second asset
changes at least for the daily sample: it ranges from 14.6% to 24.4%, thus matching the
results obtained in Eraker et al. (2003). Still, it is below the estimates obtained in Chacko
and Viceira (2003) and Rockinger and Semenova (2005): this may be explained by the
18
fact that the model that is estimated here is multivariate, whereas existing attempts to
capture stochastic volatility has been made in a univariate framework. Da Fonseca et al.
(2007) showed that the correlation between the volatility of each stock is non 0 insofar as
dhΣ11 , Σ22 i = Σ12 dt.
(40)
Hence, whenever Σ12 is positive, the WASC model is able to model volatility transmission phenomena among assets. It is noteworthy to remark that these results are globally
stable across the datasets and close to the existing results. Third, when the sampling
frequency reduces, the volatility of volatility parameter globally increases. The few lacks
of consistency for this fact may be due to the fact that the number of observations in the
weekly dataset is far below the one used in the daily dataset. These results are different
from those obtained in Chacko and Viceira (2003). However, this is in line with what is
observed for the volatility of the log-returns when reducing the sampling frequency. This
divergence may also be explained by the effect of the correlation between variances that
cannot be mimicked in a Heston-like framework.
Now, let us discuss an important parameter for the specification of the WASC model, that
is the correlation between the returns and the volatility. We already mentioned that this
parameter is essential to have a model that is consistent with many stylized facts, such as
negative skewness and thus skewed implied volatility surfaces. It can be computed using
its expression given in equation (30). The Table 6 displays the results obtained.
This time, we have results that are comparable to the one obtained in the existing literature, and especially for the SP500. The correlation for this index is reported in the first
line of the previous Table. In the literature, it actually ranges from -0.27 (Rockinger and
Semenova, 2005) to -0.62 (Chacko and Viceira, 2003), which is close to what is obtained
here. The parameter values obtained for the CAC, DAX and FTSE indexes are not surprising either, since their sign is negative. The main problem here lies in the instability
of the ρ parameter for the different estimation involved, when comparing both the sample
frequency and the couple of indexes that are estimated. The change in sampling frequency
does not lead to a similar behavior across the datasets: depending upon the couples and
the sampling frequency, the skewness in the dataset can considerably differ, underlining
the fact that correlation processes implicit in financial markets are complex. Beyond the
remarks made in the previous paragraph on the importance of the dataset depth, we also
emphasize that the computation of this parameter dwells on the inverse of the square root
19
of a quantity that is small. In this situation, the inverse of something small can be found
to be very variable: any error in the estimation of Q11 and Q21 will have a strong impact
on √
1
.
Q211 +Q221
Thus, this skewness quantity must be cautiously interpreted. Last but not
least, since the skews are negative, the fitted WASC models also display asymmetric correlation effects: negative returns are likely to be followed by a higher correlation between
the two assets.
We now turn our attention toward the mean reverting matrix M . For SP500, CAC, DAX
and FTSE, we find the same structure for the matrix M . They are definite negative thus
ensuring a mean reverting behavior for the Wishart process and have positives off diagonal
terms. As presented in the previous subsection, the drift can be decomposed into two different part: an idiosyncratic part (denoted κ1 in the tables) and a joint part (denoted κ2 in
the tables). For univariate stochastic volatility models, the estimation results usually lead
to an estimate of κ = κ1 + κ2 , that is the sum of the two preceding elements. Thanks to
the complexity of the WASC process, we are now able to disentangle and analyse these two
different elements. The estimation results are reported in table 7. The κ values should
be compared with the mean reverting value of the Heston. We are close to Rockinger
and Semenova (2005) results who found 6.3352 (see their Table 1) for the S&P500, even
though their results are obtained on a different sample and using daily data. However,
when analysing κ1 and κ2 , we find that the idiosyncratic mean reverting component is
always higher than the usual Heston parameter. This idiosyncratic element is dampened
by the negative joint mean reverting component: its negativity is to be related to the
negative non-diagonal elements of the estimated M matrices. Again, when the sampling
frequency changes, each of these values vary, suggesting that the mean reverting parameter associated to the volatility strongly depends on the sampling frequency, as pointed out
in Chacko and Viceira (2003). Globally, the associated half lives are around one month,
which is a realistic value.
As presented in the previous section, it is possible to perform similar computations for
the drift term of the stochastic correlation. This drift is a non linear function of ρ12
t and
the usual comments have to be adapted. We present in figure 3 and 4 the instantaneous
12
variation of ρ12
t as a function of ρt , highlighting the contribution of the quadratic term
when the correlation gets very high – that is during crisis period. Our results suggest
that the correlation process is much more persistent than volatility when the correlation is
below its long term level, since in such a case its mean reverting parameter can be reduced
20
to (−Bt ): in this situation, the half life is around 2 months, which is again realistic. When
correlation is high, the quadratic term gets higher and the persistence goes down, since
At is added to Bt as both these elements are negative. This is consistent with what is
empirically observed during financial market crises: the correlation gets very high on a
very short period, to finally go back to its long term behavior rapidly. The aforementioned
figures display reaction functions of this kind, underlining the ability of the WASC model
to encompass this standard feature of financial markets.
Another quantity of importance is ||ρ||, since the WASC can be seen as a generalization
of the processes proposed in Gourieroux and Sufana (2004) and Buraschi et al. (2006),
with a more complex correlation structure. Since the WASC model is only defined up to
a rotation matrix, the model presented in Buraschi et al. (2006) encompasses any correlation structure that satisfies ||ρ|| = 1. Testing such an assumption is thus of a tremendous
importance to judge whether the complexity of the WASC is empirically justified. The
table 10 presents the norm of this vector parameter. Each of the estimated value strongly
differs from 1, suggesting the general correlation structure imposed in Da Fonseca et al.
(2007) is empirically grounded.
As presented earlier, a contagion effect can be handled by the WASC through a positive
value for Q̃12 . In table 11 we report the estimated values for this parameter. We found
positive values for all couples of indexes as expected. Therefore, the estimated WASC
model is able to detect the existence of potential contagion effects in the dataset. As
mentioned earlier, these findings may be due to the fact that the dataset includes several
financial crises, periods during which dramatic contagion effects are expected.
5
Conclusion
In this paper we investigated the estimation of a new continuous time model: the Wishart
Affine Stochastic Correlation model, presented in Da Fonseca et al. (2007). After having presented the problem that arise when trying to estimate a discrete version of this
model, this paper proposes to estimate the process using its exponential affine characteristic function. The estimation method uses a continuum of spectral moment conditions
in a GMM framework. After a preliminary Monte Carlo investigation of the estimation
methodology, we show that real-dataset estimation results bring support to the WASC
21
process. First, the empirical results are comparable to those obtained in the literature
(when comparable). Second, the general correlation structure of the WASC casts light on
not-so-well documented features of international equities, allowing us to discuss for example the persistence of the correlation process, contagion effects or asymmetric correlation
effects. Third, the generality of the correlation structure is not rejected by the dataset,
bringing empirical support to the model presented in Da Fonseca et al. (2007).
References
Ang, A. and Chen, J. (2002). Asymmetric Correlations of Equity Portfolios. Journal of
Financial Economic, (63):443–494.
Asai, M., McAleer, M., and Yu, J. (2006). Multivariate Stochastic Volatility: a Review.
Econometric Review, 25(2-3):145–175.
Bathia, R. (2005). Matrix Analysis. Graduate Texts in Mathematics, Springer-Verlag.
Bauwens, L., Laurent, S., and Rombouts, J. (2006). Multivariate garch models: a survey.
Journal of Applied Econometrics, 21(1):79–109.
Belisle, C. J. P. (1992). Convergence Theorems for a Class of Simulated Annealing Algorithms. Rd J Applied Probability, 29:885–895.
Bru, M. F. (1991). Wishart Processes. Journal of Theoretical Probability, 4:725–743.
Buraschi, A., Porchia, P., and Trojani, F. (2006). Correlation risk and optimal portfolio
choice. Working paper, SSRN-908664.
Carrasco, M., Chernov, M., Florens, J.-P., and Ghysels, E. (2007). Efficient Estimation
of Jump Diffusions and General Dynamic Models with a Continuum of Moment Conditions. Journal of Econometrics, (140):529–573.
Carrasco, M. and Florens, J. (2000). Generalization of GMM to a Continuum of Moment
Conditions. Econometric Theory, (16):797–834.
Chacko, G. and Viceira, L. M. (2003). Spectral GMM Estimation of Continuous-Time
Processes. Journal of Econometrics, 116(1-2):259–292.
Da Fonseca, J., Grasselli, M., and Tebaldi, C. (2007). Option Pricing when Correlations
are Stochastic: an Analytical Framework. Review of Derivatives Research, 10(2):151–
180.
22
Da Fonseca, J., Grasselli, M., and Tebaldi, C. (2008). A Multifactor Volatility Heston
Model,. Quantitative Finance, 8(6):591–604. An earlier version of this paper circulated
in 2005 as ”Wishart multi–dimensional stochastic volatility”, RR31, ESILV.
Daleckii, J. (1974). Differentiation of Non-Hermitian Matrix Functions Depending on a
Parameter. AMS Translations, 47(2):73–87.
Daleckii, J. and Krein, S. (1974). Integration and Differentiation of Functions of Hermitian
Operators and Applications to the Theory of Perturbations. AMS Translations, 47(2):1–
30.
Donoghue, W. J. (1974). Monotone matrix functions and analytic continuation. SpringerVerlag.
Duffie, D. and Singleton, K. (1993). Simulated Moments Estimation of Markov Models of
Asset Prices. Econometrica, 61:929–952.
Eraker, B., Johannes, M., and Polson, N. (2003). The Impact of Jumps in Volatility and
Returns. The Journal of Finance, 58(3):1269–1300.
Faraut, J. (2006). Analyse sur les groupes de Lie. Calvage & Mounet.
Gourieroux, C. (2006). Continuous Time Wishart Process for Stochastic Risk. Econometric
Review, 25(2-3):177–217.
Gourieroux, C. and Jasiak, J. (2001). Financial Econometrics. Princeton University Press.
Gourieroux, C. and Sufana, R. (2004). Derivative Pricing with Multivariate Stochastic
Volatility: Application to Credit Risk. Les Cahiers du CREF 04-09.
Hall, B. C. (2003). Lie Groups, Lie Algebras, and Representations: An Elementary introduction. Graduate Texts in Mathematics 222, Springer-Verlag.
Heston, S. (1993). A Closed-Form Solution for Options with Stochastic Volatility with
Applications to Bond and Currency Options. The Review of Financial Studies, 6(2).
Jiang, G. J. and Knight, J. L. (2002). Estimation of Continuous-Time Processes via
the Empirical Characteristic Function. Journal of Business & Economic Statistics,
20(2):198–212.
Lo, A. W. (1988). Maximum Likelihood Estimation of Generalized Ito Processes with
Discretely Sampled Data. Econometric Theory, 4:231–247.
23
Newey, W. K. and West, K. D. (1994). Automatic lag selection in covariance matrix
estimation. Review of Economic Studies, 61(4):631–53.
Rockinger, M. and Semenova, M. (2005). Estimation of Jump-Diffusion Process via Empirical Characteristic Function. FAME Research Paper Series rp150, International Center
for Financial Asset Management and Engineering.
Roll, R. (1988). The International Crash of October, 1987. Financial Analysts Journal,
(September-October):19–35.
Singleton, K. (2001). Estimation of Affine Pricing Models Using the Empirical Characteristic Function. Journal of Econometrics, (102):111–141.
Singleton, K. J. (2006). Empirical Dynamic Asset Pricing: Model Specification and Econometric Assessment. Princeton University Press.
24
6
6.1
Appendix
Computing the gradient
The gradient of the characteristic function is needed to study the asymptotic distribution
of the estimates but also in the optimization process underlying the estimation procedure.
Therefore we turn our attention to the differentiation of matrix function depending on a
parameter. We illustrate the theoretical framework with the characteristic function of the
assets’ log returns and we give without technical details the results for the forward characteristic function needed in our empirical study. We mainly rely on the work of Daleckii
(1974) for the general case (i.e. in the non-Hermitian matrix case) and to Daleckii and
Krein (1974), Donoghue (1974) and Bathia (2005) for the Hermitian matrix case.
Let us first state some basic results on linear algebra. Denote by {λi ; i = 1..n} the set of
eigenvalues of a matrix X ∈ Mn and mi the multiplicity of λi as a root of the characteristic
polynomial of X. Define Li = Ker(X − λi I) and Pi the projection operator from Cn onto
P
Li , then we have ni=1 Pi = I. Define also Ji such that (X − λi I)Pi = Ji . The Jordan
P
normal form of X if given by the well known decomposition X = ni=1 (λi Pi + Ji ).
Let f be a function from Mn into Mn : the derivative of f at X in direction H, denoted
Df,X (H), is by definition kf (X +tH)−f (X)−Df,X (H)k = to(kHk) and can be computed
using the following formula Daleckii (1974):
j1 −1 mj2 −1
n mX
X
X 1 ∂ r1 +r2 f (λj ) − f (λj ) 1
2
Df,X (H) =
Pj1 Jjr11 HPj2 Jjr22 .
r1 r2 ∂λrj11 ∂λrj22
λj1 − λj2
j1 ,j2 r1 =0
(41)
r2 =0
Remark 6.1. Whenever λj1 = λj2 the term within the bracket should be replaced by
f 0 (λj1 ).
Remark 6.2. When X can be diagonalized then mj = 1 for each j and we are lead to the
very simple form
n X
f (λj1 ) − f (λj2 )
Pj1 HPj2 .
Df,X (H) =
λj1 − λj2
(42)
j1 ,j2
If X is Hermitian then P −1 = P ∗ and we recover the result presented for example in
Daleckii and Krein (1974).
25
Simple algebra leads to Df,X (H) = P Mf ◦ (P −1 HP )P −1 where P is the matrix of the
eigenvectors of X 5 , ◦ is the Schur product6 and Mf = (Mf (λk , λl )){k=1...n,l=1...n} is the
Pick matrix associated to the function f , which is defined by

 f (λ)−f (µ) if λ =
6 µ
λ−µ
Mf (λ, µ) =
 f 0 (λ)
if λ = µ
(43)
This formulation is well known and can be found for example in Donoghue (1974) p. 79
and Bathia (2005) p123-124.
The gradient of the characteristic function involves ∂α A where A is given by (13). In fact
for any parameter value α which may be equal to Mkl , Qkl , Rkl or β the gradient is given
by
∂α ΦYt ,Σt (τ, z) = (Tr(∂α AΣt ) + ∂α c(τ )) ΦYt ,Σt (τ, z).
From (13) and ∂α (L−1 ) = −L−1 (∂α L)L−1 implied by the derivation of L−1 L = I we
conclude that
∂α A (τ ) = −A22 (τ )−1 ∂α (A22 (τ ))A(τ ) + A22 (τ )−1 ∂α A21 (τ ) ,
Therefore we are lead to the computation of


∂ A (τ ) ∂α A12 (τ )
 α 11
,
∂α A21 (τ ) ∂α A22 (τ )
that is the derivative with respect to a parameter of a function of a matrix (in this case
the exponential function). In order to use formula (41) we specify in the following table
for each parameter of the WASC model the choice of the matrice X and H. As usual
{el ; l = 1 . . . n} resp. {ekl ; k, l = 1 . . . n} stands for the canonical basis of Rn resp. Mn (R)
(the function f being f (x) = ex ).
Parameter
X
H

Mkl
τG
τ

Qij
ρl
5
τG τ 
τG
ekl
0
(ekl )> ρiω >
0

τ

0
−ekl
>
−2(Q> ekl
Q> el iω >
0
+
(ekl )> Q)
−iωρ> ekl

0

−iωe>
Q
l


If pi is the ith eigenvector of X and qi is the ith row of P −1 then the projection operator on Li is given
by Pi = pi qi>
6

Given two matrix of same size X kl and Y kl then X ◦ Y = X kl Y kl
26
To fulfill the analytical computation of the gradient we need the derivative of c(τ ) with
respect to a model parameter and particularly the term log A22 (τ ).
∂α c(τ ) = −
i
βh
Tr (∂α (log A22 (τ ))) + τ Tr ∂α M > + iω∂α (ρ> Q)
2
(44)
In order to apply (41) we just need to define P22 from M2n into Mn such that P22 L = L22
with

L=
L11 L12
L21 L22

.
(45)
Then it is easy to see that
∂α log A22 (τ ) = Dlog,A22 (τ ) (P22 Dexp,τ G (H))
(46)
Once again using (41) with the log function gives the result.
Remark 6.3. If f is the exponential function then we can also compute the derivative of
the exponential of a matrix using the Baker-Hausdorff ’s formula (see e.g. Hall (2003) p.
71 formula (3.10) for the details)
∂α eτ G = D expτ G ∂α (τ G),
−adX
where D expX = eX I−eadX
(47)
and adX Y = [X, Y ] = XY − Y X is the Lie bracket.
The empirical study is based on the forward characteristic function of assets’ log returns
defined by
ΦΣ0 (t, −iA(τ ))ec(τ ) = exp {T r [B(t)Σ0 ] + C(t) + c(τ )}
(48)
with
B (t) = (A(τ )B12 (t) + B22 (t))−1 (A(τ )B11 (t) + B21 (t))
i
β h
C(t) = − Tr log(A(τ )B12 (t) + B22 (t)) + tM >
2
i
β h
c(τ ) = − Tr log(A22 (τ )) + τ M > + τ iγ(ρ> Q)
2
As for the characteristic function of assets’ log returns it is straightforward to show that
for any given model parameter α we have:
∂α ΦΣ0 (t, −iA(τ ))ec(τ ) = ΦΣ0 (t, −iA(τ ))ec(τ ) (Tr(∂α B(t)Σ0 ) + ∂α C(t) + ∂α c(τ )))
27
where the matrix derivatives are given by
∂α B(t) = −(A(τ )B12 (t) + B22 (t))(∂α A(τ )B12 (t) + A(τ )∂α B12 (t) + ∂α B22 (t))−1 B(t)
+ (A(τ )B12 (t) + B22 (t))−1 (∂α A(τ )B11 (t) + B21 (t)),
β
∂α C(t) = − Tr(Dlog,A(τ )B12 (t)+B22 (t) (∂α A(τ )B12 (t) + A(τ )∂α B12 (t) + ∂α B22 (t))).
2
This completes the analytical computation of the gradient.
6.2
Dynamics of the correlation process
In this Appendix we compute in the 2-dimensional case the drift and the diffusion coefficients of the correlation process ρ12
t defined by
Σ12
t
p
ρ12
=
.
t
11
Σt Σ22
t
We differentiate both sides of the equality ρ12
t
2
(49)
2
=
(Σ12
t )
22
Σ11
t Σt
. We refer to Da Fonseca et al.
(2008) for the explicit computation of all covariations involved in the below formulas. We
obtain:
12
2ρ12
t dρt
2Σ12
12 2
= 11 t 22 dΣ12
t + Σt
Σt Σt
so that
1
dρ12
t = p 11 22
Σt Σt
1
d
Σ22
t
1
Σ11
t
1
+ 11 d
Σt
1
Σ22
t
+ (.)dt,
Σ12
Σ12
t
t
11
22
dΣ12
−
dΣ
−
dΣ
+ (.)dt.
t
t
t
2Σ11
2Σ22
t
t
By using the covariations among the Wishart elements we have
11 2
1
2
12
22
2
2
Σ
Q
+
Q
+
2Σ
(Q
Q
+
Q
Q
)
+
Σ
Q
+
Q
11
12
21
22
t
12
22
t
t
11
21
22
Σ11
t Σt
2 Q211 + Q221 Q212 + Q222
Σ12
t
+ Σ12
+
+
2
(Q
Q
+
Q
Q
)
11 12
21 22
t
22
Σ11
Σ22
Σ11
t
t
t Σt
Σ12
t
12
Σ11
Q211 + Q221
− 2 11
t (Q11 Q12 + Q21 Q22 ) + Σt
Σt
Σ12
t
12
2
2
22
−2 22 Σt Q12 + Q22 + Σt (Q11 Q12 + Q21 Q22 ) dt,
Σt
d ρ12 t =
which leads to:
2 d ρ12 t = 1 − ρ12
t
ρ12
Q212 + Q222 Q211 + Q221
t (Q11 Q12 + Q21 Q22 )
p
+
−
2
22
11
22
Σt
Σt
Σ11
t Σt
Now let us compute the drift of the process ρ12
t .
28
!
dt.
12
Σt
We differentiate both sides of the equality ρ12
t = √ 11
Σt Σ22
t
and we consider the finite
variation terms:
dρ12
t
=
1
p
dΣ12
t
11
22
Σt Σt
+
Σ12
t d
!
1
p
22
Σ11
t Σt
1
+ d Σ12 , √
Σ11 Σ22 t
1
=p
Ω11 Ω21 + Ω12 Ω22 + M21 Σ11
+ M12 Σ22
+ (M11 + M22 ) Σ12
dt
t
t
t
22
Σ11
Σ
t t


1
1
p
− q
 Ω211 + Ω212 + 2M11 Σ11
+ Σ12
+ 2M12 Σ12
t
t
t
3
Σ22
t
2 Σ11
t


1
1 
22
 Ω221 + Ω222 + 2M21 Σ12
− q
+
2M
Σ
+p
22
t
t
3
Σ11
t
2 Σ22
t
11 22 3
3
+ p
2 d Σ t + p 11 22 22 2 d Σ t
11
22
11
8 Σt Σt Σt
8 Σt Σt Σt



1
1 
1
 d Σ11 , Σ12
+ q
d Σ11 , Σ22 t  dt + p
− q
t
22
3
Σt
22 3
4 Σ11
2 Σ11
t Σt
t


1
1 
 d Σ12 , Σ22 + Diffusions.
− q
+p
t
3
Σ11
t
2 Σ22
t
Now we use the formulas of the covariations of the Wishart elements and we arrive to an
expression which can be written as follows:
12 2
12
dρ12
=
A
ρ
+
B
ρ
+
C
dt + Diffusions,
t
t
t
t
t
t
where7 :
s
From the definition of Ω =
√
Σ22
t
M12 −
Σ11
t
s
Σ11
t
At = p
(Q11 Q12 + Q21 Q22 ) −
M21
11
22
Σ22
Σt Σt
t
Ω2 + Ω 2
Q2 + Q2
Q2 + Q2
Ω2 + Ω 2
Bt = − 11 11 12 − 21 22 22 + 11 11 21 + 12 22 22 h0
2Σt
2Σt
2Σt
2Σt
1
(Ω11 Ω21 + Ω12 Ω22 − 2 (Q11 Q12 + Q21 Q22 ))
Ct = p
22
Σ11
t Σt
s
s
Σ22
Σ11
t
t
+
M12 +
M21 .
11
Σt
Σ22
t
1
βQ> and the Gindikin condition we deduce that Bt is
negative. As a by-product, we easily deduce the instantaneous covariation between the
7
Notice that the diffusion term and both the expressions for Bt and Ct are different from the ones
obtained by Buraschi et al. (2006).
29
1.5 2.0 2.5
0.5 1.0
Volatility
Time Varying Volatilities
0
200
400
600
800
1000
800
1000
Index
0.4
0.0
Correlation
0.8
Time Varying Correlation
0
200
400
600
Index
Figure 1: Time varying (simulated) volatilities (top) and correlations (bottom).
This figure displays simulated volatilities and correlation in the two dimensional case
(n = 2). The simulation has been produced using the parameters used in thepMontep
Carlo
11 ,
experiments. Given Σt the dynamic covariancematrix, the volatilities
are
Σ
Σ22
t
t .
p
p
12
11
22
The correlation is obtained by computing Σt /
Σt + Σt .
Wishart element Σ11
t and the correlation process:
1
Σ12
Σ12
d ρ12 , Σ11 t = p
dhΣ11 , Σ12 it − t11 dhΣ11 , Σ11 it − t22 dhΣ22 , Σ22 it
2Σt
2Σt
Σ11 Σ22
st t
Σ11
12 2
t
=2
(Q11 Q12 + Q21 Q22 ) dt.
1
−
ρ
t
Σ22
t
Using the fact that Q ∈ GL(n, R)8 there exists a unique couple (K, Q̃) ∈ O(n) × Pn 9 such
that Q = K Q̃. We refer to Faraut (2006) for basic results on matrix analysis. The law of
the Σt being invariant by rotation of Q we rewrite this covariation as
s
12 11 Σ11
12 2
t
d ρ ,Σ t = 2
1
−
ρ
Q̃
Q̃
+
Q̃
12
11
22 dt.
t
Σ22
t
8
GL(n, R) is the linear group, the set of invertible matrices.
9
O(n) stands for the orthogonal group ie O(n) = {g ∈ GL(n, R)|g > g = In } and Pn is the set of
symmetric definite positive matrices.
30
0.10
0.08
Objective
0.06
0.04
0.02
−300
−200
−100
300
200
100
0
w
0
100
−100
200
w
−200
300 −300
Figure 2: Integrand of the C-GMM estimation criterion.
The figure displays the characteristic function with the integrated volatility presented in
equation (??). The parameters used for to compute this characteristic function are those
used in the Monte Carlo experiments.
31
M11
M12
M21
M22
Q11
Q12
Q21
Q22
β
ρ1
ρ2
Number of obs.
Bias
RMSE
Bias
RMSE
Bias
RMSE
Bias
RMSE
Bias
RMSE
Bias
RMSE
Bias
RMSE
Bias
RMSE
Bias
RMSE
Bias
RMSE
Bias
RMSE
Daily
500
1500
0.0133 0.0382
1.5126 1.5213
-0.04
-0.014
1.0136 1.0341
-0.056
-0.04
0.981
1.0205
0.1667 0.1784
1.5378 1.5502
0.1077 0.1069
0.088
0.0872
0.0309 0.0287
0.0664 0.0675
-3E-04 -6E-04
0.0876 0.0862
0.0792 0.0772
0.0678 0.0661
0.0632 -0.066
4.335
4.3472
0.0055 -0.007
0.4617 0.4913
-0.004 0.0265
0.5231 0.4734
Weekly
500
1500
0.227
0.0706
1.529
1.6011
-0.048 -0.036
1.098
1.0896
-0.085 -0.036
1.104
1.1101
0.070
0.0657
1.511
1.5778
0.132
0.1166
0.476
0.3568
0.022
0.0208
0.554
0.4467
-0.059 -0.008
0.553
0.371
0.057
0.0574
0.455
0.4424
0.044
-0.107
4.546
4.4364
0.035
0.0268
0.641
0.8031
0.063
0.0445
0.653
0.6405
Table 1: Results of the Monte Carlo experiments
This table displays the results for the Monte Carlo simulations performed using the following parameters:
0.0225
−0.0054
−5 −3
Σ0 =
.M =
, ρ = 0.3 0.4 ,
(50)
−0.0054
0.0144
−3 −5
0.1133137 0.03335871
Q=
, β = 15.
(51)
0.0000000 0.07954368
Two different types of simulations are presented: one of the sample includes 500 daily observations and a
second one uses 1500 daily observations, as in Carrasco et al. (2007).
SP500
Min. :-0.128129
1st Qu.:-0.009940
Median : 0.002491
Mean : 0.001535
3rd Qu.: 0.013506
Max. : 0.123746
FTSE
Min. :-0.141420
1st Qu.:-0.010858
Median : 0.002101
Mean : 0.001014
3rd Qu.: 0.013424
Max. : 0.135879
DAX
Min. :-0.197775
1st Qu.:-0.014283
Median : 0.003727
Mean : 0.001523
3rd Qu.: 0.019416
Max. : 0.171546
CAC 40
Min. :-0.149149
1st Qu.:-0.015173
Median : 0.002117
Mean : 0.001095
3rd Qu.: 0.018025
Max. : 0.166252
Table 2: Descriptive statistics for the real dataset.
The table summarizes the descriptive statistics for the available dataset. This dataset is made of the
SP500, FTSE, DAX and CAC time series, on a weekly sampling frequency. The dataset starts on January
2nd 1990 and ends on June 30th 2007.
32
33
SP500/CAC
-2.94
(0.217)
0.33
(0.267)
0.96
(0.372)
-3.41
(0.261)
-0.07
(0.005)
0.01
(0.007)
0.02
(0.005)
0.12
(0.005)
12.25
(0.776)
0.6
(0.103)
-0.4
(0.063)
SP500/FTSE
-2.65
(0.309)
0.74
(0.519)
1.79
(0.491)
-3.04
(0.384)
-0.01
(0.008)
-0.08
(0.007)
-0.08
(0.007)
-0.03
(0.01)
12.7
(1.236)
0.4
(0.098)
0.28
(0.081)
DAX/CAC
-3.22
(0.15)
1.51
(0.266)
1.51
(0.168)
-3.49
(0.193)
0.06
(0.004)
-0.04
(0.004)
-0.08
(0.007)
-0.1
(0.004)
11.34
(0.664)
0.16
(0.076)
0.4
(0.06)
FTSE/DAX
-2.62
(0.158)
0.04
(0.358)
0.74
(0.153)
-3.53
(0.215)
0.06
(0.005)
-0.04
(0.004)
0.09
(0.009)
0.08
(0.005)
10.74
(0.738)
-0.2
(0.081)
-0.26
(0.076)
FTSE/CAC
-2.61
(0.181)
0.5
(0.236)
0.76
(0.162)
-2.48
(0.165)
0.12
(0.003)
0.04
(0.003)
0.01
(0.004)
0.06
(0.003)
10.87
(0.638)
-0.3
(0.069)
-0.81
(0.11)
The table presents the parameters estimated using the C-GMM method developed in Carrasco et al. (2007), using a simple index procedure. The integration grid
is ω = [−300; 300]. The datasets are made of daily observations, from 01/01/1990 until 09/12/2007, using active days closing prices. Standard deviations are given
between brackets.
Table 3: Historical parameter estimates for the WASC model using the daily dataset.
M11
Std. Dev.
M12
Std. Dev.
M21
Std. Dev.
M22
Std. Dev.
Q11
Std. Dev.
Q12
Std. Dev.
Q21
Std. Dev.
Q22
Std. Dev.
β
Std. Dev.
ρ1
Std. Dev.
ρ2
Std. Dev.
SP500/DAX
-3.44
(0.239)
1.17
(0.211)
0.22
(0.469)
-2.87
(0.21)
-0.05
(0.01)
-0.1
(0.012)
0.07
(0.007)
0.07
(0.014)
10.65
(0.733)
0.3
(0.067)
-0.2
(0.071)
34
SP500/CAC
-2.67
(1.077)
1.28
(0.566)
1.54
(1.661)
-3.79
(0.393)
-0.05
(0.012)
0.02
(0.014)
0.03
(0.004)
0.13
(0.006)
13.58
(1.376)
0.01
(0.041)
-0.62
(0.039)
SP500/FTSE
-3.43
(0.671)
1.13
(0.538)
0.82
(0.72)
-3.21
(0.291)
0.07
(0.006)
0.08
(0.005)
-0.02
(0.004)
-0.05
(0.004)
13.17
(1.929)
-0.4
(0.031)
0.64
(0.058)
DAX/CAC
-3.14
(0.326)
0.44
(0.343)
0.63
(0.38)
-3.9
(0.264)
0.01
(0.003)
-0.15
(0.002)
-0.11
(0.007)
-0.1
(0.008)
14.85
(0.392)
0.39
(0.038)
0.26
(0.076)
FTSE/DAX
-2.89
(0.279)
0.83
(0.564)
0.89
(0.264)
-1.27
(0.326)
0.04
(0.011)
0.06
(0.007)
0.09
(0.006)
0.01
(0.005)
11.56
(1.847)
-0.23
(0.078)
-0.49
(0.065)
FTSE/CAC
-3.42
(0.267)
0.01
(0.527)
1.09
(0.26)
-3.76
(0.312)
-0.12
(0.006)
-0.05
(0.005)
-0.02
(0.006)
-0.07
(0.004)
14.37
(1.29)
0.36
(0.062)
0.11
(0.054)
The table presents the parameters estimated using the C-GMM method developed in Carrasco et al. (2007), using a simple index procedure. The integration grid is
ω = [−300; 300]. The datasets are made of weekly observations, from 01/01/1990 until 09/12/2007, using active days closing prices. Standard deviations are given
between brackets.
Table 4: Historical parameter estimates for the WASC model using the weekly dataset.
M11
Std. Dev.
M12
Std. Dev.
M21
Std. Dev.
M22
Std. Dev.
Q11
Std. Dev.
Q12
Std. Dev.
Q21
Std. Dev.
Q22
Std. Dev.
β
Std. Dev.
ρ1
Std. Dev.
ρ2
Std. Dev.
SP500/DAX
-2.83
(0.671)
1.87
(0.376)
0.42
(1.002)
-3.29
(0.305)
0.04
(0.003)
0.05
(0.005)
0
(0.005)
-0.12
(0.007)
12.44
(1.205)
-0.75
(0.067)
0.35
(0.035)
35
Vol
Vol
Vol
Vol
vol
vol
vol
vol
Asset
Asset
Asset
Asset
1
2
1
2
SP500/CAC
0.146
0.241
0.117
0.263
SP500/FTSE
0.161
0.171
0.146
0.189
DAX/CAC
0.2
0.215
0.221
0.361
FTSE/DAX
0.216
0.179
0.197
0.122
FTSE/CAC
0.241
0.144
0.243
0.172
Asset
Asset
Asset
Asset
1
2
1
2
SP500/FTSE
-0.327
-0.468
-0.56
-0.678
DAX/CAC
-0.224
-0.431
-0.219
-0.466
FTSE/DAX
-0.327
-0.143
-0.541
-0.307
Table 6: Skew estimated from the different datasets, with different sample frequency.
Skew
Skew
Skew
Skew
SP500/CAC
-0.687
-0.349
-0.328
-0.611
FTSE/CAC
-0.366
-0.84
-0.373
-0.299
Q11 ρ1 + Q21 ρ2
Corr dYt1 , dΣ11
= p
t
Q211 + Q221
Q12 ρ1 + Q22 ρ2
.
Corr dYt2 , dΣ22
= p
t
Q212 + Q222
In the table, the first line corresponds to the skew associated to the first asset. For example, for the column SP500/CAC, the first line corresponds to the skew of
the SP500. The second line displays the one obtained for the CAC. The skew is obtained when computing:
Weekly
Daily
SP500/DAX
-0.337
-0.36
-0.75
-0.612
In the table, the first line corresponds to the volatility of volatility of the first asset. For example, for the column SP500/CAC, the first line corresponds to the
volatility of volatility of the SP500. The second line displays the one obtained for the CAC. The volatility of volatility is obtained when computing:
q
Vol/vol Asset 1 = 2 Q211 + Q221
q
Vol/vol Asset 2 = 2 Q212 + Q222 .
Table 5: Volatility of volatility estimated from the different datasets, with different sample frequency.
Weekly
Daily
SP500/DAX
0.172
0.244
0.08
0.26
36
κ1
κ2
κ
κ1
κ2
κ
SP500/CAC
Asset 1 Asset 2
5.88
6.82
-0.363
-0.208
5.517
6.612
5.34
7.58
-2.288
-1.308
3.052
6.272
SP500/FTSE
Asset 1 Asset 2
5.3
6.08
-0.612
-0.586
4.688
5.494
20.86
6.42
-1.509
-1.536
19.351
4.884
DAX/CAC
Asset 1 Asset 2
6.44
6.98
-2.149
-2.489
4.291
4.491
6.28
7.8
-0.672
-0.796
5.608
7.004
FTSE/DAX
Asset 1 Asset 2
5.24
7.06
-0.038
-0.074
5.202
6.986
5.78
2.54
-0.844
-1.781
4.936
0.759
−κ
FTSE/CAC
Asset 1 Asset 2
5.22
4.96
-0.589
-0.983
4.631
3.977
6.84
7.52
-0.012
-0.021
6.828
7.499
κ1 is the idiosyncratic part of the mean reverting parameter. κ2 is the cross-component in the parameter and κ is the standard Heston-like
mean reverting parameter. (ρ12 ) = Corr(dlogS1 , dlogS2 ) is the long term correlation between the asset log returns. The drift is evaluated
at its stationary value, using the values presented in table 7.
dΣ11
t




s


22


Σt
11 
12 
(ρt ) dt + ...
= ... + Σt 2M11 + 2M12
11
{z }
Σt
|−κ

|
{z
}
 1


−κ2
|
{z
}
Table 7: Mean reverting parameter associated to Σt .
This table displays the mean reverting parameter associated to the volatilities. For the first asset, we have:


Weekly
Daily
SP500/DAX
Asset 1 Asset 2
6.88
5.74
-1.493
-0.74
5.387
5
5.66
6.58
-3.739
-1.804
1.921
4.776
37
SP500/CAC
0.158
0.206
0.417
0.18
0.238
0.676
SP500/FTSE
0.158
0.158
0.405
0.18
0.18
0.674
DAX/CAC
0.221
0.206
0.766
0.26
0.238
0.831
FTSE/DAX
0.221
0.158
0.669
0.26
0.18
0.739
Table 8: Long term covariance matrix implied by the estimates.
SP500/DAX
0.158
0.221
0.449
0.18
0.26
0.694
FTSE/CAC
0.206
0.158
0.761
0.238
0.18
0.781
A(t)
B(t)
C(t)
A(t)
B(t)
C(t)
SP500/CAC
-1.108
-3.246
1.722
-2.76
-3.723
3.988
SP500/FTSE
-2.376
-3.345
3.929
-1.056
-3.89
4.664
DAX/CAC
-2.902
-2.544
4.21
-0.868
-7.545
3.945
FTSE/DAX
-0.918
-2.779
2.283
-1.761
-1.986
2.849
FTSE/CAC
-1.199
-2.782
2.874
-1.21
-4.761
4.571
The computation of this drift is presented in the appendices. The drift is computed at its stationary value, using the values presented in
table 7.
12 2
12
dρ12
t = (At (ρt ) + Bt (ρt ) + Ct )dt + diffusion part.
Table 9: Mean reverting parameters associated to ρ12
t .
12
This table displays the drift part of the correlation coefficient (ρt ) in the two-asset case, for the value estimated in the previous tables.
The drift is given by:
Weekly
Daily
SP500/DAX
-1.522
-3.022
4.371
-2.922
-2.485
3.629
22
12
The long term covariance matrix is estimated through the usual moment estimators. Its diagonal elements are Σ11
t and Σt . (ρt ) is the long term correlation
computed from the estimated long term covariance matrix.
Weekly
Daily
√
√Σ11
Σ22
ρ12
√
√Σ11
Σ22
ρ12
drho
0.70
0.8
0.80
0.9
rho
DAX
CAC
rho
1.0
SP500
DAX
0.90
1.1
1.00
0.4
0.2
0.0
0.55
0.60
0.40
rho
0.65
0.70
DAX
FTSE
rho
0.45
SP500
CAC
0.75
0.50
0.80
0.55
2.0
1.5
1.0
0.60
0.70
0.80
CAC
FTSE
rho
0.75
0.85
0.90
rho
0.65 0.70 0.75 0.80 0.85 0.90
0.65
SP500
FTSE
12
The blue line is the linear term of the latter relation. The red one is the total expected variation of d(ρ12
t ) depending upon the (ρt ) values.
The green line signals the long term correlation implied by the estimated WASC model. This long term correlation is the only positive root
of the polynomial defined by the stationary drift of the correlation, using the values presented in table 7.
12 2
12
dρ12
t = (At (ρt ) + Bt (ρt ) + Ct )dt + diffusion part.
12
Figure 3: d(ρ12
t ) as a function of (ρt ) using the daily dataset.
12
This figure displays the drift term associated to d(ρ ) as a function of (ρ12 ), using daily estimation results. This relation is specified in the
following equation:
drho
0.0 0.5 1.0 1.5 2.0
−1.0
2
1
0
−1
drho
drho
−0.2
−0.4
0.6
0.2
−0.2
−0.6
drho
drho
0.5
0.0
−1.0
1.0
0.5
0.0
−0.5
38
drho
0.40
0.45
rho
0.50
DAX
CAC
rho
0.55
0.60
0.65 0.70 0.75 0.80 0.85 0.90
SP500
DAX
2.0
1.5
1.0
0.5
0.65
0.60
0.75
0.65
0.75
rho
0.85
DAX
FTSE
rho
0.70
SP500
CAC
0.95
0.80
0.85
1.5
1.0
0.9
CAC
FTSE
rho
1.0
1.1
rho
0.65 0.70 0.75 0.80 0.85 0.90 0.95
0.8
SP500
FTSE
12
The blue line is the linear term of the latter relation. The red one is the total expected variation of d(ρ12
t ) depending upon the (ρt ) values.
The green line signals the long term correlation implied by the estimated WASC model. This long term correlation is the only positive root
of the polynomial defined by the stationary drift of the correlation, using the values presented in table 7.
2
12
dρ12 = (At (ρ12
t ) + Bt (ρt ) + Ct )dt + diffusion part.
12
Figure 4: d(ρ12
t ) as a function of (ρt ) using the weekly dataset.
12
This figure displays the drift term associated to d(ρt ) as a function of (ρ12
t ), using weekly estimation results. This relation is specified in
the following equation:
drho
0.0 0.5 1.0 1.5 2.0
−1.0
1.0
0.5
0.0
−0.5
drho
drho
0.0
−1.0
1.5
1.0
0.5
0.0
−0.5
drho
drho
0.5
0.0
−1.0
1.5
1.0
0.5
0.0
−1.0 −0.5
39
40
SP500/FTSE
0.484
0.755
SP500/DAX
0.058
0.012
SP500/CAC
0.009
0.016
SP500/FTSE
0.02
0.049
DAX/CAC
0.028
0.034
FTSE/DAX
0.025
0.022
FTSE/DAX
0.328
0.541
FTSE/CAC
0.03
0.038
FTSE/CAC
0.864
0.376
decomposition of Q is unique and is refered to as polar decomposition. We refer to Faraut (2006) for further results on matrix analysis.
O(n) stands for the orthogonal group ie O(n) = {g ∈ GL(n, R)|g > g = In } and Pn is the set of symmetric definite positive matrices, such that Q = K Q̃. This
Using the fact that Q ∈ GL(n, R), where GL(n, R) is the linear group, i.e. the set of invertible matrices, there exists a unique couple (K, Q̃) ∈ O(n) × Pn , where
Table 11: Off diagonal term of the polar decomposition of Q estimated from the different datasets, with different sample
frequency.
Daily
Weekly
DAX/CAC
0.431
0.466
Table 10: Norm of the ρ vector.
SP500/CAC
0.721
0.620
p
The norm of ρ is computed as ||ρ|| = (ρ> ρ), which is the Euclidian norm.
Daily
Weekly
SP500/DAX
0.361
0.828