Estimating the Dynamic Mixed Hitting Time Model Using Characteristic Function Based Moments Yogo Purwono∗ Irwan Adi Ekaputra† Zaafri A.Husodo‡ May 25, 2015 Abstract We propose a characteristic function-based moment method to estimate the dynamic mixed hitting time model which specify duration between events as the successive passage times of components of an underlying multivariate Brownian motion relative to in itself random boundaries, while the other, correlated, Brownian components generate the marks. The proposed estimation method afcilitates computation and overcomes problems related to the discretization error in moment conditions and to the non-tractable probability density function. An empirical application using transaction level data on shares of a company traded on the Indonesia Stock Exchange (ISX) are illustrated. The findings suggests that durations and return volatility have strong persistence and they are contemporanously negatively correlated. The assessment of instantaneous volatility levels accounting for instantaneous causality between volatilities and durations is also investigated. Keywords: duration modeling, mixed hitting time, market microstructure, characteristic function JEL codes: C13, C41, G12 1 Introduction Dynamic Mixed hitting-time (DMHT) model is a structural model of duration between events and associated marks. In this model, durations between events are specified as the first times an ∗ Corresponding author, Graduate Program in Management, University of Indonesia, email:[email protected] Graduate Program of Management, University of Indonesia ‡ Department of Management, University of Indonesia † 1 underlying latent process, which is one specific component of multivariate Brownian motion, hits random positive boundaries, and the other, correlated, Brownian components generate the marks. The DMHT model has been developed to facilitate the analysis about asset return movements when they are observed at endogenous random times. Observation times are endogenous precisely because they are closely related to the price and volatility process. This situation primarily appears in tick by tick financial data where the prices, volumes and other marks are recorded at irregularly spaced points in time. Endogeneity in the duration between trades has been suggested in several literatures. Asymmetric information theory of market microstructure (e.g Easley and OHara (1987)) explains that trades are induced by market participants reacting in part to new information hitting financial markets. Therefore, it is very likely that the time between trades is not a random exogenous process. It ought to interact with the price formation process. So if we use the information that is in the durations between trades on top of the information from the returns between trades, it could potentially improve the accuracy of our estimates of asset return as well as volatility. Other theoretical models (e.g., Admati and Pfleiderer (1988) and Foster and Viswanathan (1990)) capture the endogeneity of durations from the relationship between volatilities and length of trading periods. These studies explain that high (low) volatilities occur during exchange trading (nontrading) periods. Empirical evidences about the endogeneity in the durations between trades has also been documented in the literature. Li, et al.(2009) find evidence of endogeneity of sampling time in high-frequency data and establish a central limit theorem for realized volatility in an endogenous time setting. Using semiparametric (GMM) approach, Renault and Werker (2011) document some empirical evidence of instantaneous causality between durations and volatility. Beside facilitating the analysis of price movements under endogenous observation times, the DMHT model allows one to address problems that cannot be handled by autoregressive conditional duration (ACD) or stochastic conditional duration (SCD) models (e.g Engle and Russell (1998) and Bauwens and Veredas (2004)). Examples include the modeling of optimal execution time in dynamic trading and the modeling of durations between trades of several assets. Under the ACD/SCD approach, we are not allowed to incorporate new information that appeared since the previous event and could be relevant for predicting the timing of the next event. Within the DMHT model, the arrival time of such new information can be considered a separate event leading to an update of the distribution of the remaining duration. A major difficulty in continuous-time financial modeling is the lack of efficient tools for estimating and making an inference with discretely observed samples, especially when samples are 2 observed at endogenous random times. This is particularly striking for the DMHT model studied in this paper. The maximum likelihood (ML) estimation is usually inapplicable because the transition density is rarely in closed-form. The frequently used simulation-based methods are difficult to implement since the model is hard to simulate and the traditional GMM is computationally demanding since high order derivatives need to be calculated. Fortunately, for most of continuous time financial models are in the Levy class, and their analytical characteristic functions are obtainable. Since the characteristic function is equivalent to and contains the same information as the probability density function, the estimation using charateristic function can be as efficient as the ML estimator. This paper considers characteristic function based GMM with continuum of moments to estimate the DMHT model of the joint behavior of durations between events and asset return in which both durations and returns are generated by an underlying multivariate Brownian motions. Since the model that we want to estimate represent the joint process of two variables, we propose a characteristic function-based method to jointly estimate model by using information contained in both duration between consecutive trades and the trade prices. Given the concrete specification of duration and return proces, we derive the conditional joint characteristic function of return and duration. Following Carrasco and Florens (2000) and Carrasco et al (2007), the estimator of the model are then derived using the characteristic function based continuum GMM. Monte Carlo and empirical studies show that the method is computationally less costly than the other methods and can be easily adapted to different specifications of the model and the definition of durations. Estimation results indicate that durations and volatility have strong persistence and they are contemporanously negatively correlated. Estimate results also confirm some stylist facts in high frequency financial data as suggested in some market microstructure theories. The remainder of this paper is organized as follows. Section 2 builds the DMHT model. Section 3 describes the characteristic function-based estimation method and provides a Monte Carlo study. Section 4 discusses empirical results and their implications. Lastly, Section 5 concludes the paper. 3 2 Dynamic Mixed Hitting Time Model 2.1 Specification and Characterization Let (ti )ni=0 be the increasing sequence of non-negative random times that we will call event times for expository reasons. These times may, in general, represent any calender time, such as the transaction (trading) times of a stock, or any event of interest, for instance, the time at which a certain level of volume in an asset has been traded. The modeling interest is to describe the stochastic behavior of the event time ti conditional on the information available at some time t < ti as well as the joint stochastic behavior of event time ti and its associated mark, such as asset price Sti . Let (Ft )t≥0 be the continuous-time filtration that represents the time evolution of information available to the econometrician. We assume that the filtration (Ft )t≥0 satisfies the usual condition. The DMHT model specifies an event time as the first time a real-valued (underlying) Wiener process (Wd,t )t≥0 , defined relatively to the filtration (Ft )t≥0 , hits a given positive random boundary whose values depend on past observations as well as unobservable mixing variables Mi . More precisely, if we have just seen an event at time ti , the time ti+1 , when the next event occurs is defined by ti+1 = inf{t > ti : Wd,t − Wd,ti + µ̄ti (t − ti ) = h̄ti } (1) where µ̄ti and h̄ti are the drift and the hitting boundary respectively. The dirft and hitting boundary are assumed to be strictly positive Fti -measurable random variables. In addition, they are also assumed to may depend on Mi . In other words, µ̄ti and h̄ti are given by µ̄ti = µti (Mi ), (2) h̄ti = hti (Mi ). (3) The drift µ̄ti and hitting boundary h̄ti altogether capture observed heterogeneity in the thresholds and associated hitting times. The heterogeneity in µ̄ti and h̄ti can generate all kinds of autoregressive or log–autoregressive dynamics in order to reproduce stylized facts for the durations of interest. In the DMHT model specified in (1), mixing variables (Mi ), for i = 1, 2, ..., n, represent unobserved risk factor that determine duration between events and are assumed to be i.i.d. positive random variables that are independent of Wd . The DMHT model is introduced by Renault, van der Heijden, and Werker (2014) in or- 4 der to provides a general framework to perform inference about asset returns when they are observed at endogenous random times, either in the univariate or multivariate context. This modeling scheme also offers flexibility in understanding the stochastic behaviour of financial duration data beyond the SCD/ACD specifications. In the DMHT model, the mixing component Mi potentially generates a large class of (conditional) duration distributions. Depending on the functional form on the way the mixing variable explains µ̄ti and h̄ti and the assumed distribution for Mi , the DMHT model can be expanded to accommodate stylized facts in intraday trading like clustering of short durations or fat tails of the duration distribution, see, e.g., Valenti, Spagnolo, and Bonanno (2007). Although we can impose some distributions with non negative support for Mi , the gamma distribution turns out to be sufficiently flexible to accommodate both very short as well as very long durations (at least as shown in Whitmore (1986)). This general duration model nests a number of specific models, for instance, SCD and ACD models, depending on the appropriate restrictions on the imposed parameters. For example, the SCD duration model can be obtained by imposing the expected duration hti µti to be a constant. Therefore, in the estimation, we can naturally impose the testable hypothesis about the existence of SCD or ACD specification. It is known (see, e.g., Borodin and Salminen (2002), Relation 2.2.0.1) that, when the event times occur according to the model specified in (1), the conditional distribution of dti+1 given Fti can be characterized by its conditional moment generating function (MGF) s Eti [exp(udti+1 )|Mi ] = exp µ̄ti h̄ti − µ̄ti h̄ti 2u 1 − ¯2 µ ! (4) ti Here, Eti is short-hand for the conditional expectation given Fti . The conditional MGF in (4) implies dti+1 |Fti , Mi ∼ IG h̄ti 2 , h̄ µ̄ti ti where IG(α, θ) denotes the Inverse Gaussian distribution with mean α and shape parameter θ. Following the terminology introduced by Ghysels, Gourieroux, and Jasiak (2004), the DMHT model has a two-factor SVD like model that can be characterized by the factors (µ̄ti , h̄ti ). The two-factor structure has an advantage in the ability to accommodate separate dynamic patterns for the conditional mean and variance of durations. From (4), the first two moments 5 of the conditional distribution of duration are h̄ti , µ̄ti h̄t V arti [dti+1 |Mi ] = 3i µ̄ti Eti [dti+1 |Mi ] = (5) To ensure that the first and second moment for the duration are finite, we need a strictly positive drift in the model. In order to apply the characteristic function based estimation method to the model, we need to represent the conditional distribution of dti+1 given Fti and Mi in term of its conditional characteristic function (CF). For MGF MX (t) and CF ϕX (t) there is a one-to-one correspondences, where ϕX (t) = MX (it). Based on this relationship, the conditional CF of dti+1 given Fti and Mi is given by s Cµ̄t i ,h̄ti 2.2 (u) = Eti [exp(iudti+1 )|Mi ] = exp µ̄ti h̄ti − µ̄ti h̄ti 2iu 1 − ¯2 µ ! (6) ti Model Including Asset Return The DMHT model discuss in section 2.1 covers the modeling framework for the duration processes between transaction times only. We can extend the model to include observed marks, for instance asset returns, at these time points as well. Let Sti be the univariate mark, where in Section 4 they refer to the logarithm of the prevailing mid-quote at time ti . For ease of exposition, also in this theoretical section, we already refer to as Sti the price at time ti . Suppose that the prices Sti are embeded in a Wiener process Ws , correlated with Wd . More precisely, as in Renault et al (2014), we assume that for ti < t ≤ ti+1 , q St − Sti = ᾱti (t − ti ) + V̄ti (Ws,ti+1 − Ws,ti ) (7) where ᾱti = αti (Mi ) and V̄ti = Vti (Mi ) are the prevailing drift and volatility of the price process respectively. It is assumed that Ws has correlation with the Wiener process Wd that define durations such that Wd,t = Z1,t Ws,t = ρ̄s,ti Z1,t + 6 q 1 − ρ̄2s,ti Z2,t where their coefficient correlation is ρ̄ti . For reasons of identiability, it is assumed that the Wiener process Z1,t is independent of Z2,t . The coefficient Correlation between durations and prices is modeled through the coefficient ρ̄ti . Hence, the concrete model for the prices Sti are St − Sti = ᾱti (t − ti ) + q h q i V̄ti ρ̄s,ti (Z1,t − Z1,ti ) + 1 − ρ̄2s,ti (Z2,t − Z2,ti ) (8) As with µ¯ti dan h¯ti in section 2.1, the drift ᾱti , the spot variance V̄t , and ρ̄ may all depend on the mixing variable Mi . Renault et al. (2014) in the Theorem 3.1 show that the distribution of the price change Sti+1 − Sti = Rti+1 in (8) conditional on Fti , Mi , and dti+1 is Gaussian with mean Eti [Rti+1 |dti+1 , Mi ] = ᾱti dti+1 q + (h̄ti − µ̄ti dti+1 ) V̄ti ρ̄ti (9) and variance V arti [Rti+1 |dti+1 , Mi ] = (1 − ρ̄2s,ti )V̄ti dti+1 (10) The theorem allows us to directly characterize the joint conditional distribution of returns and durations in terms of its characteristic function. In other words, it paves the way for efficient inference by GMM with a continuum of moment conditions in the spirit of Carrasco and Florens (2000) and also Carrasco et al (2007). Because the conditional distribution of returns Rti+1 given Fti , Mi , and dti+1 is Gaussian, then its conditional characteristic function is given by 1 Eti [exp(ivRti+1 )|dti+1 , Mi ] = exp imti v − s2ti v 2 2 where mti and s2ti are in (9) and (10) respectively. Equation (9) and (10) explain that both the mean and the variance of the price changes are affine (linear in their states) in the contemporaneous duration dti+1 . Therefore, using the law of iterated expectation, one can derive the conditional joint characteristic function of duration dti+1 and price change Rti+1 q Eti [exp(iudti+1 + ivRti+1 )|Mi ] = exp[iv V̄ti ρ̄,ti h̄ti ] q iv 2 Cµ̄t ,h̄t u + v ᾱ − v V̄ti ρ̄ti h¯ti + (1 − ρ̄2ti )V̄ti i i 2 where Cµ̄t i ,h̄ti (11) (.) is conditional CF in (6). In order to apply the GMM estimation with con- 7 tinuum of moment conditions, one needs to derive the conditional joint characteristic function Eti [exp(iudti+1 + ivRti+1 )] by integrating out the mixing variable Mi . We will present explicitly this conditional joint characteristic function in Section 4 when we discuss the empirical application. 3 Econometric Methodology Assume we observe at random event points in time (ti ) a log-price process (Sti ), defined on some filtered probability space (Ω, F, (Ft )t≥0 , P) that has dynamics as specified in (1). Our interest in this paper is estimation of parameters for process Sti which is robust to the specification of the rest of the components in the model, i.e., the drift term αt and the correlation between log-price process and duration between events. Importantly, we will be interested in developing an estimation method that makes full use of information contained in the duration between events and exploits tractability of the characteristic function of the model. Let Xt ∈ Rm (t = 1, 2, ..., T ) be an m-dimensional random vector process whose distribution is indexed by a finite dimensional parameter θ with true value θ0 . When the process Xt is stationary and Markovian, Feuerverger and McDunnough (1981b) show that Generalized Method of Moments (GMM) estimation can be applied to a discrete set of the following moment conditions by restricting the continuous index u ∈ Rm to a discrete grid u ∈ (u1 , u2 , ..., uN ), E[y(u, Xt )(exp[iu0 Xt+1 ] − ϕ(u; θ, Xt ))] = 0 (12) where u ∈ Rm is the Fourier transformation variable, i is imaginary number such that i2 = 1, ϕ(u; Θ, Xt ) = E θ (exp(iu0 Xt+1 )|Xt ) is the conditional characteristic function (CCF) of Xt and exp[iu0 Xt+1 ] is its empirical counterpart (ECF), and E θ is the expectation operator with respect to the data generating process indexed by θ. The Estimation consists of minimizing a norm of the sample average of the moment function y(u, Xt )(exp[iu0 Xt+1 ] − ϕ(u; θ, Xt )) with objective function involves an optimal weighting function −1 Z ∞ z(u, Yt ) = (2π) −∞ ∂ log f (Yt+s |Yt , θ) exp(−iuYt+s )dYt+s , ∂θ (13) that depends on the true unknown probability density function. They show that the asymptotic variance of the resulting estimator can be made arbitrarily close to the Cramer-Rao bound by selecting the grid for u sufficiently fine and extended. However this estimation is difficult to 8 apply because for most of continuous time models, it is impossible to have the density function for the model. Similar discretization approaches are used in Singleton (2001) and Chacko and Viceira (2003). The basic idea behind their approach is to construct moment conditions by firstly dividing the domain of the characteristic index into a finite number of grids and then approximating the instrument. However, one critical problem of their approach is when the number of points in the grid of u is too refined or too extended in order to improve efficiency, the associated covariance matrix of the discrete set of moment conditions becomes not invertible. As a result, it is necessary to apply operator methods in a suitable Hilbert space to be able to handle the estimation procedure at the limit. Carrasco and Florens (2000) develop a characteristic function based GMM that permits to efficiently use the whole continuum of moment conditions (CGMM) that is computationally less demanding than the commonly used simulation-based method and traditional GMM as well as solves the problems of singularity and instability induced by the discrete moment condition methods. Carrasco, Chernov, Florens and Ghysels (henceforth, CCFG (2007)) extend the scope of the CGMM procedure to Markov and weakly dependent models. Because our model deal with the joint dynamics of return and duration between transaction, we follow CCFG (2007) approach, but with some modification due to the nature of our process. When the process Xt is directly observed at non random integer times, one can match the empirical and model-implied conditional characteristic function at a given lag K using some forms of objective function. In the CGMM procedure, when Xt is Markov instead of being IID, CCFG (2007) propose to use the moment function based on the conditional CF (CCF): mt (τ, θ) = (exp[iu0 Xt+1 ] − ϕ(u; θ, Xt )) exp(iv 0 Xt ) (14) where as before ϕ(u; θ, Xt ) denote the CCF of Xt and τ = (u0 , v 0 ) ∈ R2m . CCFG (2007) show that the instruments {exp(iv 0 Xt ), v ∈ Rm } in (8) are optimal given the Markovian structure of the model. Moment functions defined by (8) describes a martingale difference sequence. Under certain regularity conditions, the efficient estimator of CGMM based on CCF (which we will call CCF–CGMM for simplicity) is given by D E −1/2 −1/2 θbT (λT ) = arg min KT,λT m b T (., θ), KT,λT m b T (., θ) (15) θ where m b T (., θ) = 1 T PT t=1 mt (τ, θ) −1/2 is the sample conterpart of moment functions, KT,λT is the 9 weighting covariance operator associated with pre-determined moment function mt (τ, θ), and λT denotes the optimal regularization parameter for a given sample size T . In the CGMM estimator given in (9), the objective function D E −1/2 −1/2 Q = KT,λT m b T (., θ), KT,λT m b T (., θ) is an inner product defined on Hilbert space L2 (π) of complex valued functions that are square integrable, which is defined as: 2 L (π) = {f : R m Z → C; f (u)f (u)π(u)du < ∞} (16) where f is the complex conjugate of f for all f ∈ C. Here π is an arbitrary finite measure on R2m that has nothing to do with the data generating process of Xt . As long as π > 0, the choice of π does not affect the estimation efficiency in large sample. In practice, it is customary to set π(τ ) = exp(−τ 0 τ ) in order to be able to compute (9) using Hermitian quadratures. The inner product on L2 (π) may be defined as Z hf, gi = f (τ )g(τ )π(τ )dτ where the overline denotes complex conjugate. The CCF–CGMM estimator θbT (λT ), under some regularity conditions (see Carrasco and Kotchoni, 2010), is asymptotically normal T 1/2 θbT (λT ) − θ0 → N (0, Iθ−1 ) 0 (17) as T and λ2T T go to infinity and λT goes to zero, where Iθ−1 denotes the inverse of the Fisher 0 Information Matrix, and θ0 is the true parameter vector. Carrasco and Florens (2000) show that the asymptotic variance of the optimal CGMM estimator is VT b t (., θ) = where G ∂m b T (.,θ) ∂θ i h√ T θbT (λT ) − θ0 = V ar D E−1 −1/2 b t (., θ) , K −1/2 E G b t (., θ) = KT,λ E G T,λ b t,j (., θ) = is a column vector of length l whose j th element is G (18) ∂m b T (.,θ) , ∂θj and for every two vector functions g and h, we have: hg, hii,j = hgi , hj i. The asymptotic 10 variance (12) is consistently estimated by: D E−1 −1/2 b −1/2 b \ VbT = AV ar θb = KT,λ G T (τ, θ), KT,λ GT (τ, θ) b T (τ, θ) = where G 1 T PT (19) t=1 Gt (τ, θ). As in (9), here KT is the covariance operator with the i kernel k(s, τ ) = E mt (s, θ), mt (τ, θ) such that b h Z Kf (τ ) = k(s, τ )f (s)π(s)ds (20) Because KT is not invertible, it need to be stabilized by a regularization parameter λT ∈ [0, 1] such that −1 KT,λ = KT2 + λT IT T −1 KT (21) where IT is the identity operator and T is the sample size. Hence, consistent estimate for KT,λT can also be obtained from the sample moment conditions. See Carrasco and Florens (2000) and Carrasco and Kotchoni (2013) for further discussion of the consistency and root–T asymptotic normality of the CCF–CGMM estimator. In our model, the estimation problem is more complicated as we observe the price process Sti at endogeneous random times ti . The endogeneity of observation times are precisely because they are probably related to the price process and its volatility. We overcome this problem by jointly model the marks and duration between events. Hence for the estimation problem here we can only work with the joint conditional characteristic function ϕti (u, v; Θ, Yti+1 ) (JCCF) instead of CCF ϕt (u; θ, Xt+1 ), where Θ is now the parameters vector that we will estimate and Yti+1 = (Rti+1 , dti+1 ) is the joint process of return and duration between events. Using the above CCF–CGMM estimation analysis, we propose to base inference on matching the model implied ϕti (u, v; Θ, Yti+1 ) with its sample estimate exp(iudti+1 + ivRti+1 ). If the joint process of duration and return is stationary and Markov within a day, then exactly as above appropriate weighting of these moment conditions will yield the Cramer-Rao efficiency bound based on daily direct observations of (Rti+1 , dti+1 ). More specifically, our vector of moment conditions is given by Eti mti (τ, Θ) = (exp(iudti+1 + ivRti+1 ) − ϕti (u, v; Θ, Yti+1 )) exp(iwdti ) = 0 (22) where Eti is short-hand for the conditional expectation given Fti and τ = (u, v, w) is the Fourier transform. In equation (22), the set of basis functions {exp(iwdti ), w ∈ Rm } is being used as 11 instruments. These instrument are choosed by considering the efficiency of the estimation given the Markovian structure of the model (as suggested by CCFG, 2007). Because our joint process (Rti+1 , dti+1 ) are Markov, then the above instrument must be optimal. Based on the moment functions mti (τ, Θ) = (exp(iudti+1 + ivRti+1 ) − ϕti (u, v; Θ, Yti+1 )) exp(iwdti ), then our estimator is minimum distance (GMM) with estimating equations in the vector given c converging in probability to a positive definite matrix W : in (22) and some weight matrix W D E b N (λN ) = arg min W cm cm Θ b T (., Θ), W b T (., θ) Θ where m b N (., Θ) = 1 N PN ti =1 mti (τ, Θ) (23) is the sample conterpart of moment functions mN (τ, Θ). c to be an estimate of the optimal weight covariance As in CCF–CGMM analysis, we set W operator defined by the asymptotic variance of the empirical moments to be matched, that c = K −1/2 , while as before λN denotes the optimal regularization parameter for a given is W N,λN sample size N . Note that because of the separability of data from parameters in the moment vector mN (τ, Θ), we construct our optimal weight matrix by using only the data. This in particular means that all moments are weighted the same way regardless of the model that is estimated, provided the set of moments used in the estimation is kept the same of course. The consistency and root–T asymptotic normality of our CJCF-CGMM estimator follow Carrasco and Florens (2000) and Carrasco and Kotchoni (2013) studies. Although in their original paper, CCFG (2007) have proved that the optimal weighting c = K −1/2 , we also use the identity matrix as the weighting covariance covariance operator W N,λN operator in order to compare the numerical instability. It is shown in Cochrane (2005) that the use of identity matrix as the weighting matrix in GMM-based estimation, will produces more robust and stable estimates. We conduct a Monte Carlo study to evaluate the loss of efficieny in using other matrix such as the identity matrix as the weighting covariance operator. So far, the Monte Carlo study shows that the loss of efficiency in using the identity matrix is tiny and that the algorithm is more stable and faster than that based on the optimal weighting operator. However we do not show yet the results in this paper, because we still have not been sure about the validity of the results since the number of trial in the Monte Carlo study is relatively little. Perhaps we present the Monte Carlo result in the next draft of paper. 12 4 Empirical Illustration In this section, we illustrate the application of the proposed estimation method using event-byevent data for a large cap stock, and discuss their implication. Section 4.1 presents the data, Section 4.2 explains the concrete specification of the DMHT model that we want to analyse, Section 4.3 presents the estimation procedure, and Section 4.4 reports the parameter estimates and discusses the results. 4.1 Data The data used in this paper is intraday transactions of a large cap stock Telkom (ticker: TLKM) traded in the Indonesia Stock Exchange (ISX) during March 2015, for a total of 20 trading days. They were obtained from Thomson Reuters Tick History. The dataset contains the following series within a day: Stock Ticker, Trading Date, Trading Time, Trading Price, Best Bid and Ask Prices, Trading Volume, and Others. TLKM is one of the largest corporations listed on the ISX, with a total of about 1.24B shares outstanding and a market capitalization around US180B. . The Thomson Reuters Tick History dataset originally contains microsecond time-stamps transactions. Because we are interested not only in durations, but also in returns and volatility per unit of time, we aggregate events that occur within a single second. If we include very short (microsecond) durations in the analysis, the volatility per unit of time tends to inflate when a nonzero return is observed. We follow the common practice (e.g., Engle and Russell, 1998 and Bauwens and Veredas, 2004) of aggregating these zero return trades as a single transaction. For each duration, the corresponding change in stock price (return) is recorded as well. We occasionally see multiple trades recorded with the same timestamp. They are trades from multiple buyers or sellers or split-transactions occurring when the volume of an order on one side of the market is larger than the available opposing orders. We take, as in Engle (2000), the stock price at each transaction from the log of the geometric average of the prevailing ask and bid quotes. For simplicity, we refer to this value as the midquote. After aggregation, the final data sets contain 35284 observed durations. We use only single exchange to allows us discarding effects on the transaction process caused by different market structures for various exchanges (as suggested in Engle, Ferstenberg, and Russell, 2008). Hence, in this illustration we implicitly assume that a transaction is executed on a given exchange just by chance and not because it has specific characteristics. Summary statistics for both the durations dti+1 and the price changes Rti+1 for consecutive 13 Series Duration (sec) Return (%) Series Duration (sec) Return (%) Mean Std dev Min Median 95% 99% Max 12.72 15.99 1 3 38 72 384 -8.55E-7 9.88E-4 -1.59E-2 0 1.73E-3 1.77E-3 1.11E-2 Skewness Kurtosis ACF(1) ACF(2) ACF(3) ACF(4) ACF(5) 4.9663 64.1980 0.1729 0.1401 0.1352 0.1225 0.1155 0.1716 111.8810 -0.4253 -0.0033 -0.0044 0.0065 0.0120 Table 1: Descriptive statistics of TLKM durations (dti+1 ) and returns (Rti+1 = Sti+1 − Sti ) Figure 1: Histogram of duration between trades of TLKM during March 2015 trades are presented in Table 1, while the histograms of these durations and price changes are plotted in Figures 1 and 2 respectively. The table and figures document well-known stylized facts of durations, such as excess dispersion, and a large fraction of very short durations combined with a fat right tail. Just over 30% of all durations for TLKM stock has a length of one second. The distribution of the durations turns out to be skewed to the right and the sample kurtosis is also larger than the normal value of three. It is interesting to know the largest intervals between trades in this sample is around five minutes and of course the minimal interval is one second. The average waiting time for a trade to happen is slightly less than 15 seconds. For the returns, we observe that returns are very much concentrated; more than half of the TLKM trade durations do not induce a change in the midquote. The return distribution is smoother because the tick size is smaller relative to the share price. As for temporal dependence, we see the usual strong negative autocorrelation of order one in the return series, often associated to the price bouncing between the bid and ask. We also see decaying (persistent) dependence in the duration series but the magnitude of the autocorrelation of order one is smaller for the durations (17%) than for the returns (42%) . Figures 3 and 4 display average durations and average volatility per seconds that we calcu- 14 Figure 2: Histogram of return of TLKM during March 2015 Figure 3: Diurnal pattern of durations, averaged over 30 minutes Figure 4: Diurnal pattern of return volatility per second, averaged over 30 minutes 15 lated every 30 minutes respectively. It is widely known that transaction intensities and return volatilities exhibit intraday seasonality. For instance, average durations between transactions and quote changes tend to be longer around lunch time than near the opening and the closing of the trading day. This seasonality pattern, that is usually call the diurnal pattern, holds true for our data as well, as shown in Figures 3 and 4. These pattern also true for the adjusted data following the correction for intraday seasonality which we described in Section 4.2.1. 4.2 Parameterization The modeling scheme considered in Section 2 so far did not make any specific assumptions on the distribution of the mixing variable Mi or on the way µ̄ti and h̄ti depend on Mi . In order to apply the estimation framework to the data, we need to impose a concrete specification for all component in DMHT model that allows us to understanding of the features of the model more precisely. This specification is also the one that will be estimated using the TLKM data. A fully specified DMHT model consists of assumptions about 1) the correction for intraday seasonality, 2) the distribution of the mixing variable Mi , 3) the dynamic properties of µti and hti , and 4) return volatility dynamics. 4.2.1 Intraday Seasonality It is well known that there is deterministic seasonal pattern during the business day for volatility (e.g., see Andersen and Bollerslev (1997)) and trade intensity (e.g., see Engle and Russell (1998)). For assets that are not traded around the clock, volatility is usually higher when the markets are opening, it lowers during the middle part of the day before increasing again after that. A similar pattern exist for durations: they are relatively shorter at the opening and closing of the market, and relatively longer during the middle of the business day. Trade intensity being inversely related to durations, have a smiler pattern as volatility: relatively higher trade intensity at the beginning and end of the business day than during the middle of the day. To capture these diurnal patterns, we introduce two deterministic functions of the calender time of the day, gd (t̃i ) and gd (t̃i ) for the duration and return volatility respectively. Following Engle and Russell (1998) and functional form inspired by the intraday seasonality corrections in Hasbrouck (1999) and Andersen, Dobrev, and Schaumburg (2008), we adjust duration and 16 return volatility using the following function respectively: gd (t̃i ; g1 , g2 , g3 ) = exp[−(g1 − g3 )t̃i − g3 ] + exp[−g2 (1 − t̃i )] [exp(−g1 ) − exp(−g3 )]/(g3 − g1 ) + [1 − exp(−g2 )]/g2 (24) 1 − exp(−gS2 ) gS (t̃i ; gS1 , gS2 , gS3 , gS4 ) = 1 + gS1 exp(−gS2 t̃i ) − gS2 1 − exp(−gS3 ) + gS4 exp(−gS3 (1 − t̃i )) − gS3 (25) We assume that adjusted duration takes the form dd ti+1 ≡ g(t̃i )dti+1 , where dti+1 is the observed durations. The realized return volatility per unit of time will follow the same correction accordingly. The intraday seasonality correction for both durations and prices will be normalized R1 R1 by imposing 0 g(t)dt = 1 and 0 gS (t)dt = 1, so that the correction at any point is relative to the average duration (volatility) in the sample. The parameters of gd (t̃i ) and gd (t̃i ) will be calibrated by using the least square error principles. 4.2.2 Mixing Distribution For simplicity, we assume multiplicative form for hitting barrier h̄ti = hti and constant form for the drift µ̄ti = µti . We endow the mixing variable Mi with a Gamma distribution with parameter B and imposing E(Mi ) = 1 as an identifying restriction. Its density thus reads fM (m) = B B B−1 m exp(−Bm) Γ(B) (26) Depending on the value of B, the density at zero may be vanishing, nite or diverge to infinity, which turns out to be an empirically desirable feature. While Mi has a constant mean 1, its variance is inversely proportional to B. 4.2.3 Specification for Duration Components Intraday financial durations exhibit significant autocorrelation that dies out only slowly. For comparison purposes, we endow the expected durations hti µti with some GARCH-type dynamics as in Engle and Russell (1998). More precisely, we specify ht hti = β0 + β1 (dti )β3 + β2 i−1 µti µti−1 in which β0 , β1 , β2 > 0 dan β3 ∈ [0, 1]. 17 (27) In order to allow, but not to impose, an SCD specification, we model µti = ω4 + ω1 + ω2 hti /µti ω6 /2 + ω3 (dti )ω5 (28) To test the existence of the SCD specification, we can test the following hypothesis ω1 = ω3 = ω4 = 0 and ω6 = 1, because the SCD specification would need the constant µti hti . 4.2.4 Return Model For the model of the returns, the drift of the price process per unit of time is assumed to be constant αti = α and also for the coefficient correlation between the Brownian motions W and Z (generating durations and returns, respectively) ρti = ρ . For the return volatility per unit of time we assume that V̄ti = (δM Vti + θM Mi )2 (29) where δM and θM are restricted by the normalization E (δM 1 =1 + θ M Mi ) 2 (30) This normalization identifies the unconditional level of V̄ti and Vti . Finally, we use a GARCH model that accounts for the irregularly observed data following Meddahi, Renault, and Werker (2006), Vti = α0 gS (t̃i ) + (1 − dt α1 )α2 i Rti − αdti p dti !2 dt + α1 α2 i Vti−1 (31) where gS (t̃i ) is as specified in section 4.2.1. 4.3 Conditional JCF–CGMM Based Inference The concrete model in section 4.2 is then estimated using the GMM with continuum of moment based on the (conditional) joint characteristic function (CJCF) as presented in section 3. However, we need to derive the CJCF that only depend on flow of information Fti in order to apply the proposed estimation method. From the above concrete specification, applying the law of iterated expectation to the CCF of return given contemporaneous duration and mixing variable, we have the conditional JCF of returns and durations conditional on Mi , Eti [exp(iudti+1 + ivRti+1 )|Mi = mi ] = 18 s " exp iv ! # s Vti 2ir ρ hti mi + µti hti mi − µti hti mi 1 − 2 (δm + θm mi )2 µti (32) where s r = u + vα − v ! Vti iv 2 Vti 2 ρ hti mi + (1 − ρ ) (δm + θm mi )2 2 (δm + θm mi )2 Integrating out the mixing variable Mi , one would then get a continuum of moment conditions (indexed by (u, v) ) for the purpose of CGMM inference. The following proposition gives the CJCF of duration and return that we will use in the estimation of TLKM data Proposition 1 The CJCF of duration and return with concrete specification defined in section 4.2 is given by: ψti (u, v; θ) = Eti [exp(iudti+1 + ivRti+1 )] Z ∞ Eti [exp(iudti+1 + ivRti+1 )|Mi = mi ] = 0 B B B−1 m exp(−Bmi )dmi Γ(B) i (33) where Eti [exp(iudti+1 + ivRti+1 )|Mi = mi ] is defined in (11), Γ(.) is the Gamma function, and θ ∈ Θ is the set of parameters. The estimation then will be conducted using the following moment conditions: E[ exp(iudti+1 + ivRti+1 ) − ψti (u, v; θ) Y (., dti )] = 0 (34) where we set Y (., dti ) = exp(iwdti ) as the instrument for the optimality of our estimation as suggested by CCFG (2007). Based on the moment conditions (28), our CJCF–CGMM estimator for the model using the procedure suggested in section 3 is D E b N (λN ) = arg min K −1/2 m Θ b N (., Θ), K −1/2 m b N (., Θ) Θ (35) where as above m b N (., Θ) is the sample counterpart of the moment functions m(u, v, w; Θ) which form the moment condition (28) m(u, v, w; Θ) = exp(iudti+1 + ivRti+1 ) − ψti (u, v; θ) exp(iwdti ) = 0 The estimation and inference then just follows the procedure discusses in section 3. 19 (36) 4.4 Estimation Results Table 2 reports estimation results including parameter estimates and their standard deviations. Models are estimated by the Joint CCF-CGMM described in Section 3. The standard errors, which are computed using the formula (23) combined with (25), (26), and W = K, are presented in brackets. θ = (β0 , β1 , β2 , β3 , ω1 , ω2 , ω3 , ω4 , ω5 , ω6 , α, α0 , α1 , α2 , ρs , ρu , δM , θM , σu2 , B) Parameter Estimate (Std Errors) Parameter Estimate (Std Errors) Parameter Estimate (Std Errors) β0 1.2E-13 0.0081 ω4 0.00 0.012 ρ -0.075 0.006 β1 0.100 0.006 ω5 0.171 0.016 δM 0.000 0.005 β2 β3 ω1 0.955 0.915 0.000 0.002 0.0019 0.013 ω6 α α0 2.300 0.000 3.7E-8 0.068 0.00 3.0E-10 θM B 1.095 20.518 0.000 0.005 ω2 2.325 0.052 α1 0.625 0.014 ω3 0.042 0.011 α2 0.795 0.005 Table 2: Descriptive statistics of TLKM trade durations (dti+1 ) and logprice changes (Sti+1 − Sti ) The first thing to notice is all the estimates are statistically significant. A second thing to look at is the dynamic nature in the return process when they are observed at endogeneous random times. When we look at expected duration related parameters, we find that the conditional expected durations are non-explosive because the estimate of β1 and β2 are both highly significant and their sum is greater than 1 (1.055), while at the same time both beta2 < 1 and beta3 < 1. This confirm the properties of DMHT-based duration model in Remark 4.1. of Renault et al (2013). From the estimates of the observe heterogeneity µ̄ti parameters ω1 , ω2 , ω3 , ω4 , ω5 , ω6 , we can test if the dynamic of durations follow the SCD (and ACD) specification. Based on discussion in the section 4.2, the null we need to test is H0 : ω1 = ω4 = ω6 − 1 = 0. This hypotheses can be tested using a standard Wald test. Under the null, the test statistic is asymptotically χ23 distributed. It’s 99% quantile equals 11.34. Because the sample value of the test statistic is 786, so the null is (strongly) rejected for TLKM stock. Therefore, there is no need to further examine the additional constraint within H0 that defines the SCD model, that is H01 : ω3 = 0. Would H0 not have been rejected, then a test of H01 could be conducted using a Wald test. Since the ACD model is a strict subclass of the SCD model (see Proposition 2.2 in Renault et al., 2013) and the SCD model is rejected, then the ACD model is also rejected in favor of the more flexible DMHT model. 20 Looking at the estimated value of B, we have indication that the density function of mixing variable Mi for TLKM stock is relatively concentrated, because the variance of Mi equals 1/B = 0.049. This density vanishes at zero. Recall that the mixing variable multiplies the conditional expected duration, so a vanishing density at zero implies that a significant decrease in the conditional expected duration due to the mixing variable is very unlikely. Since we only observe durations of length at least one second in the data, it does not appear surprising to find such result. While very small values of the mixing variable are unlikely, it can still take on relatively large values, so as to enable the model to fit the occasional large duration found in the data. We have a slightly positive, but poorly identified, drift α for the parameters of the return process, while the GARCH process that updates volatility per unit of time Vti putting almost equal weights (α̂1 = 0.625) on both the lagged volatility and the realized squared midquote return per second. For TLKM stock, if the duration between trades increases, the effect of both the lagged volatility and realized squared midquote return decays geometrically with exponent α̂2 ≈ 20.80 per second. Hence for long durations, the predicted return volatility per unit of time is about equal to the diurnally adjusted constant term α0 gS (t∗i ) i), and this at least indicate that there is a considerably intraday variation in volatility per unit of time. The parameters estimate of the mixing part of the mixed variance return V̄t2i are δM = 0.000 and θM = 1.095 respectively, where the value for θM follows from the identifying constraint (30). Hence, the return variance V̄t2i is estimated to be inversely related to the value of the mixing variable Mi . Since Mi scales the hitting boundary, hence, a larger value of Mi implies both a longer expected duration as well as a smaller variance, which is consistent with the observations in for example Engle (2000) and Dufour and Engle (2000). There is significant correlation between the Wiener process that generate price and the Wiener process that generate duration. The correlation is estimated to be negative (ρ̂ = 0.075), so when there is a positive shock to the return volatility, then on average there is a negative shock to the duration between trades, which means shorter durations. Therefore, our estimation results are consistent with Easley and O’Hara (1992) arguing that a long duration indicates the absence of new information. Factors such as characteristics of the news, market participant and market mechanism affect each individuals trading decision. Announcement of pubic news normally induces an increased trading intensity and a substantial up or down movement in the price on the stock market very quickly, while private news has less impact on the market. Informed trader tends to take advantage of the information before certain time without moving market price to the adverse direction. There are type of investors who are 21 more willing to wait and have no tolerance of the relative high price, and type of the investor who want to close the position soonest, such as liquidity trader. Block trade is normally divided into orders of smaller size. The time of waiting for the order execution depends on how much price impact the trader can take. However, we dont observe these individual level characteristics to identify type of each trade and buyer or seller and news, what we have are variables in a market scale, e.g. stock price, quotes, trade arrival time and volume. But we know all factors that playing roles in affecting the trading intensity and price formation together. The negative correlation, although the magnitude is slightly small, in our result also justifies the empirical support of the DMHT model to the predictions about the relation between a contemporaneous duration and moments of the conditional return distribution as put forward in several market micro structure theories (e.g., Diamond and Verrecchia (1987), Easley and O’Hara (1992)). This empirical support can be confirmed through the analytic expressions of the first two moments of the return distribution, conditional upon past information and the contemporaneous duration derived in Renault et al. (2013): Eti Rti+1 |dti+1 dt i+1 V arti Rti+1 |dti+1 dti+1 √ = α + ρ V ti Eti [f (hti , µti , Mi )|dti+1 ] = (1 − ρ2 )Vti Eti [g(Mi )|dti+1 ] + (37) ρ2 Vti V arti [h(hti , µti , Mi )|dti+1 ] (38) dti+1 where f (.), g(.), and h(.) are some different functions. From (18) and (19), it is clear that in case the correlation parameter ρ equals zero, both the expected return per unit of time and its variance per unit of time do not depend on the contemporaneous duration. However, if the estimated correlation is not zero, although with small magnitude, we can state that the correlation plays important role in explaining the above conditional moments of return distribution. This result in line with Renault, et al. (2013). The graphical illustration is used to analyse the net effect of ρb on the expected return and return standard deviation per unit of time as a function of the length of the contemporaneous duration, as specified in (18) and (19). They show that while expected returns are slightly increasing with the contemporaneous duration, the return volatility per unit of time are decreasing with the contemporaneous duration. Hence, this pattern is consistent with the Easley and O’Hara (1992) and other market microstructure models which explain that return volatility tends to decrease when there are no trading in relatively long period. 22 4.5 Monte Carlo Studies In this section, we implement two Monte Carlo studies using model specification described in Section 4.2. The goal of the Monte Carlo studies are two folds. The first is to manifest the estimation efficacy of the Conditional JCF-CGMM, and the second is to show the efficiency gain in estimating asset return variances when jointly considering the stock price and duration data. Parameter True Value Identity Weighting Matrix Mean Median RMSE Optimal Weighting Matrix Mean Median RMSE Parameter True Value Identity Weighting Matrix Mean Median RMSE Optimal Weighting Matrix Mean Median RMSE Parameter True Value Identity Weighting Matrix Mean Median RMSE Optimal Weighting Matrix Mean Median RMSE β0 1.2E-13 β1 0.100 β2 0.955 β3 0.915 ω1 0.000 ω2 2.325 ω3 0.042 1.18E-13 0.250 1.21E-13 0.200 0.05E-13 1.225 0.957 0.955 0.085 0.920 0.918 0.057 0.001 0.000 0.0005 2.355 0.053 2.350 0.040 0.057 0.0145 1.25E-13 1.23E-13 0.04E-13 ω4 0.00 0.150 0.125 0.925 ω5 0.171 0.965 0.960 0.080 ω6 2.300 0.918 0.915 0.045 α 0.000 0.000 0.001 0.0002 α0 3.7E-8 2.350 0.048 2.345 0.045 0.060 0.0135 α1 α2 0.625 0.795 0.00 0.00 0.00 0.175 0.173 0.051 2.320 2.315 0.065 0.000 0.000 0.055 0.00 0.00 0.00 ρ -0.075 0.173 0.171 0.045 δM 0.000 2.340 2.320 1.214 θM 1.095 0.000 3.69E-8 0.628 0.798 0.000 3.72E-8 0.624 0.792 0.000 0.063E-8 0.069 0.0241 B 20.518 -0.074 -0.075 0.0057 0.000 1.092 20.525 0.000 1.096 20.528 0.000 0.0162 0.151 -0.076 -0.074 0.005 0.000 1.097 20.522 0.000 1.094 20.517 0.000 0.0071 0.122 3.75E-8 0.635 0.799 3.73E-8 0.625 0.785 0.072E-8 0.057 0.0252 Table 3: The Results of Monte Carlo Study I The first Monte Carlo study is based on 5000 simulation of a data set of one thousand durations and corresponding price changes using the estimated parameter values as inputs. Since the minimum observed duration in the data set has a length of one second, we round up all durations between half a second and one second up to one second. Durations shorter than 23 half a second are discarded. The model is then estimated by the Conditional JCF-CGMM using the simulated stock prices and durations, and both the identity and optimal weighting matrices are used. We find that the optimization is less sensitive to the initial values when using the identity weighting matrix since using the optimal weighting matrix results in 350 exploding estimates among 5000 simulations/estimations. Table 3 presents the Monte Carlo study results, which indicate that the loss of efficiency is tiny. In the second simulation, we simulate data from a model with endogenous durations, which is the DMHT model as described in Section 2 and its concrete specification Section 4.2. The same true values as in the first Monte Carlo study is used in this simulation. Each simulation is done with 20 days worth of observations (more than 30,000 trades). For each sample, we re-estimate two models: (1) a model allowing for endogenous durations estimated with the simulated data and (2) the restricted version of our model where the durations are assumed exogenous (when ρt = 0). Each time after estimating the models, The estimates of the return variance are also calculated and compared to the true variance. The Mean Square Error (MSE) and the Mean Absolute Error (MAE) of the estimated variance are then reported. The results for the MSE and MAE for each of the 20 replications are reported in Table 4. Replication 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Mean MSE Endogenous 0.2620 0.2605 0.2618 0.2704 0.2703 0.2619 0.2509 0.3171 0.2995 0.2505 0.2717 0.2518 0.2593 0.2761 0.2362 0.2562 0.2418 0.2428 0.2575 0.2518 0.2486 MSE Exogenous 0.3021 0.3452 0.3123 0.3031 0.3150 0.2908 0.4193 0.2969 0.3511 0.3453 0.3174 0.2895 0.3095 0.2989 0.3175 0.3075 0.2875 0.2980 0.3185 0.2895 0.3209 MSE Endogenous 0.4075 0.3957 0.4092 0.4098 0.4099 0.4043 0.3969 0.4068 0.4353 0.3964 0.3934 0.3998 0.4064 0.4198 0.3852 0.3943 0.3952 0.3899 0.3862 0.3998 0.3982 MSE Exogenous 0.4373 0.4662 0.4272 0.4389 0.4388 0.4255 0.4662 0.4301 0.4319 0.4663 0.4463 0.4255 0.4195 0.4353 0.4496 0.4368 0.4465 0.4389 0.4501 0.4255 0.4403 Table 4: MSE and MAE of the estimated variances when durations are either endogenous or exogenous 24 From Table 4, we can see that, not surprisingly, for 19 out of 20 simulations the MSE or MAE of the model that allows for endogenous durations is smaller than for the model with exogenous durations. Averaging the results over the 15 replications, we see that the MSE and MAE of the exogenous duration model are 29% and 11% bigger than for the endogenous duration model. 5 Conclussion In this paper we propose an efficient method to estimate the dynamic mixed hitting time model which specifies durations between events as the time it takes a standard Brownian motion with drift to hit a certain random boundary, and the corresponding values of the marks are modeled by additional, correlated, Brownian motions. The estimation is based on the conditional characteristic function proposed in and Carrasco et al. (2007). The technique is particularly tractable and easy to apply when the joint conditional characteristic function of the model is known in closed form or up to a relatively easy numerical integration. The emprirical study using high frequency data on a stock, indicates some stylized facts of the duration data where a duration is defined as the time span between two transactions. These stylized facts include high persistence in the durations and fat tails, as well as a large fraction of very short durations. More importantly, the empirical study helps characterizing the causality relations between return volatility and durations between trades. As widely documented in the literature, there is a negative instantaneous relation between volatility and contemporaneous duration. In this paper, we estimate the models for one asset or stock. We have not yet considered the estimation of the models in terms of multivariate modeling involving different assets, different exchanges, etc. For asset pricing, risk management and asset allocation involving more than single assets or single exchanges, one needs to estimate the joint movements in prices and transaction times for several assets in the portfolio. In this regard, the estimation method for multivariate version of the DMHT models are needed to develop. Therefore, it may be interesting to study the estimation methods and their implications using tick history information from several assets in the portfolio. References [Abb12] Abbring, J. H. (2012). Mixed Hitting-Time Models. Econometrica, 80, 783–819. 25 [Adm88] Admati, A. R., and P. Pfleiderer (1988). A theory of intraday trading patterns: volume and price variability. Review of Financial Studies, 1, 3–40. [Aitk07] Aı̈t-Sahalia, Y., and Kimmel, R. (2007). Maximum Likelihood Estimation of Stochastic Volatility Models. Journal of Financial Economics, 83, 413–452. [AitM03] Aı̈t-Sahalia, Y., and Mykland, P.A. (2003). The Effects of Random and Discrete Sampling When Estimating Continuous Time Diffusions. Econometrica, 71, 483–549. [AitMZ05] Aı̈t-Sahalia, Y., Mykland, P. A., and Zhang, L. (2005). How Often to Sample a Continuous-Time Process in the Presence of Market Microstructure Noise. Review of Financial Studies, 18, 351–416. [Ander08] Andersen, T. G., D. Dobrev, and E. Schaumburg (2008). Duration-Based Volatility Estimation. Northwestern University Working Paper. [Ander97] Andersen, T. G. and T. Bollerslev (1997). Heterogeneous information arrivals and return volatility dynamics: uncovering the long-run in high frequency returns.Journal of Finance, 52, 975-1005. [Ander00] Andersen, T. G., T. Bollerslev, F. X. Diebold, and P. Labys (2000). Exchange rate returns standardized by realized volatility are (nearly) gaussian. Multinational Finance Journal, 4, 159-179. [Ander01] Andersen, T. G., T. Bollerslev, F. X. Diebold, and P. Labys (2001). The distribution of exchange rate volatility. Journal of the American Statistical Association, 96, 42-55. [Back91] Back, K. (1991). Asset pricing for general processes. Journal of Mathematical Economics, 20:371395, 1991. [Bandir06] Bandi, F. M., and Russell, J. R. (2006). Separating microstructure noise from volatility. Journal of Financial Economics, 79, 655–692. [Band07] Bandi, F., and Russell, J. (2007). Volatility, in Handbook of Financial Engineering, 183-222. North Holland, Elsevier. [Barns02] Barndorff-Nielsen, O.E., and Shephard, N. (2002). Econometric Analysis of Realized Volatility and Its Use in Estimating Stochastic Volatility Models. Journal of Royal Statistical Society: Series B, 64, 253–280. [Bauw00] Bauwens, L., and P. Giot (2000). The Logarithmic ACD Model: An Application to the Bid-Ask Quote Process of Three NYSE Stocks. Annales d’Economie et de Statistique, 60, 117–149. 26 [Bauw04] Bauwens, L., P. Giot, J. Gramming, and D. Veredas (2004). A Comparison of Financial Duration Models via Density Forecasts. International Journal of Forecasting, 20, 589–609. [Baut06] Bauwens, L., and N. Hautsch (2006). Stochastic Conditional Intensity Processes. Journal of Financial Econometrics, 4(3), 450–493. [Bauw041] Bauwens, L., and D. Veredas (2004). The Stochastic Conditional Duration Model: A Latent Variable Model for the Analysis of Financial Durations. Journal of Econometrics, 119, 381–412. [Bol86] Bollerslev, T. (1986). Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics. 21, 307-328. [Bol87] Bollerslev, T. (1987). A conditionally heteroskedastic time series model for speculative prices and rates of return. Review of Economics and Statistics. 69, 542-547. [Bolgz10] Bollerslev, T., M. Gibson, and H. Zhou (2010). Dynamic estimation of volatility risk premia and investor risk aversion from option-implied and realized volatilities. Journal of Econometrics, 160, 235-245. [Bolic06] Bollerslev, T., J. Litvinova, and G. Tauchen (2006). Leverage and volatility feedback effects in high-frequency data. Journal of Financial Econometrics, No. 3, 4:353-384. [Bolzu02] Bollerslev, T. and H. Zhou (2002). Estimating stochastic volatility diffusion using conditional moments of integrated volatility. Journal of Econometrics, 109:33-65. [Bor02] Borodin, A. N., and P. Salminen (2002). Handbook of Brownian Motion: Facts and Formulae(2nd edition). Birkhäuser, Basel. [Carn04] Carnero, M. A., Peńa, D., Ruiz, E. (2004). Persistence and kurtosis in GARCH and stochastic volatility models. Journal of Financial Econometrics, 2, 319-342. [Carr00] Carrasco, M., and J. P. Florens (2000). Generalization of GMM to a Continuum of Moment Conditions. Econometric Theory, 16(6), 797–834. [Ccfg07] Carrasco, M., Chernov, M., Florens J. P. and E. Ghysels (2007). Efficient estimation of general dynamic models with a continuum of moment conditions. Journal of Econometrics, 140, 529–573. [Ckot13] Carrasco, M., and R. Kotchoni (2013). Efficient Estimation using the Characteristic Function. Working Paper. University of Montreal. 27 [Dia87] Diamond, D. W., and R. E. Verrecchia (1987). Contraints on short-selling and asset price adjustment to private information. Journal of Financial Economics, 18(2), 277– 311. [Dufg04] Duffie, D., and Glynn, P. (2004). Estimation of Continuous-Time Markov Processes Sampled at Random Times. Econometrica, 72, 1773–1808. [Duf00] Dufour, A., and R. F. Engle (2000). Time and the Price Impact of a Trade. Journal of Finance, 55(6), 2467–2498. [Eas87] Easley, D., and OHara, M. (1987). Price, Trade Size and Information in Securities Markets. Journal of Financial Economics, 19: 6990. [Eas92] Easley, D., and M. O’Hara (1992). Time and the process of security price adjustment. Journal of Finance, 47, 577–605. [Eng82] Engle, R. F. (1982). Autoregressive conditional heteroskedasticity with estimates of the variance of United Kingdom inflation. Econometrica, 50, 987-1008. [Eng00] Engle, R. F. (2000). The econometrics of ultra-high frequency data. Econometrica, 68, 1–22. [Eng08] Engle, R. F., R. Ferstenberg, dan J. R. Russell (2008). Measuring and modeling execution cost and risk. NYU Stern School of Business Working Paper. [Eng03] Engle, R. F., dan A. Lunde (2003). Trades and Quotes: A Bivariate Point Process. Journal of Financial Econometrics, 1(2), 159–188. [Eng98] Engle, R. F., and J. R. Russell (1998). Autoregressive Conditional Duration: A New Model for Irregularly Spaced Transaction Data. Econometrica, 66(5), 1127–1162. [Fos90] Foster, D. F., dan S. Viswanathan (1990). A theory of the intraday variations in volumes, variances and trading costs in security markets. Review of financial studies, 3, 593–624. [Fukr12] Fukasawa, M., and Rosenbaum, M. (2012). Central Limit Theorems for Realized Volatility under Hitting Times of an Irregular Grid. Stochastic Processes and Their Applications, 122, 3901–3920. [Ghys04] Ghysels, E., C. Gouriéroux, and J. Jasiak (2004). Stochastic Volatility Duration Models. Journal of Econometrics, 119, 413–433. [Ghys98] Ghysels, E., and J. Jasiak (1998). GARCH for Irregularly Spaced Financial Data: The ACD-GARCH Model. Studies in Nonlinear Dynamics and Econometrics, 2(4), 133– 149. 28 [Gour05] Gouriéroux, C. and J. Jasiak (2005). Autoregressive Gamma Processes. Journal of Forecasting, 25, 129–152. [Gram02] Gramming, J., and Wellner, M. (2002). Modeling the interdependence of volatility and intertransaction duration processes. Journal of Econometrics, 106, 369–400. [Ham02] Hamilton, J. D., and Ó. Jordá (2002). A Model of the Federal Funds Rate Target. Journal of Political Economy, 110(5), 1135–1167. [Han04] Hansen, P., and Lunde, A. (2004). An unbiased Measure of Realized Variance. Working Paper. [Hansenl04] Hansen, P. R., and Lunde, A. (2004a). Realized Variance and IID Market Microstructure Noise. Technical report, Brown University, Dept. of Economics. [Hasb99] Hasbrouck, J. (1999). The dynamics of discrete bid and ask quotes. Journal of Finance, 54(6), 2109–2142. [Hajayo11] Hayashi, T., Jacod, J., and Yoshida, N. (2011). Irregular Sampling and Centeral Limit Theorems for Power Variations: The Continuous Case. Ann. Inst. H. Poincar Probab. Statist, 47, 1197–1218. [Jack09] Jackson, K., A. Kreinin, and W. Zhang (2009). Randomization in the frst hitting time problem. Statistics and probability letters, 79, 2422–2428. [Japro98] Jacod, J., and Protter, P. (1998). Asymptotic Error Distributions for the Euler Method for Stochastic Differential Equations. The Annals of Probability, 26, 267–307. [Jai09] Jaimungal, S., A. Kreinin, and A. Valov (2009). Randomized First Passage Times. Submitted to Applied Probability Trust. [Jia99] Jiang, W., and M. A. Tanner (1999). Hierarchical Mixtures-Of-Experts for Exponential Family Regression Models: Approximation and Maximum Likelihood Estimation. The Annals of Statistics, 27, 987–1011. [Kars91] Karatzas, I., and Shreve, S. E. (1991). Brownian Motion and Stochastic Calculus. Springer. [Kotch12] Kotchoni, R. (2012). Applications of characteristic function based continuum GMM in finance.Computational statistics and data analysis, 13, 601–622. [Lirmyz13] Li, Y., Renault, E., Mykland, P. A., Zhang, L., and Zheng, X. (2013). Realized Volatility When Sampling Times are Possibly Endogenous. Manuscript. Available at SSRN: http://ssrn.com/abstract=1525410. 29 [Lizz13] Li, Y., Zhang, Z., and Zheng, X. (2013). Volatility Inference in The Presence of Both Endogenous Time and Microstructure Noise. Stochastic Processes and their Applications, 123(7), 2696-2727 [Lizzs13] Li, Y., Zhang, Z., and Zheng, X. (2013). Supplement to ”Volatility Inference in The Presence of Both Endogenous Time and Microstructure Noise”. [Lom90] Lo, A. W., MacKinlay, C. (1990). An econometric analysis of nonsynchronous trading. Journal of Econometrics, 45, 181-212. [Malt04] Malmsten, H., Tersvirta, T. (2004). Stylized Facts of Financial Time Series and Three Popular Models of Volatility. Working Paper, Series in Economics and Finance 563, Stockholm School of Economics. [Medrew06] Meddahi, N., Renault, E., and Werker, B. (2006). GARCH and Irregularly Spaced Data. Economics Letters, 90, 200–204. [Mykz06] Mykland, P. A. and Zhang, L. (2006). ANOVA for Diffusions and Itó Processes, Annals of Statistics, 34, 1931–1963. [Mykzh09] Mykland, P.A., and Zhang, L. (2009) Inference for Continuous Semimartingales Observed at High Frequency. Econometrica, 77, 1403–1455. [Mykzh12] Mykland, P.A., and Zhang, L. (2012). The Econometrics of High Frequency Data. (to appear in Statistical Methods for Stochastic Differential Equations, M. Kessler, A. Lindner, and M. Sorensen eds., Chapman and Hall/CRC Press), p. 109–190. [Old08] Oldham, K., J. Myland, and J. Spanier (2008). An Atlas of Functions (2nd edition). Springer. [Pel12] Pelletier, D., and H. Zheng (2012). Joint modeling of high-frequency price and duration data. North Carolina State University working paper. [Podvet09] Podolskij, M., and Vetter, M. (2009). Estimation of Volatility Functionals in the Simultaneous Presence of Microstructure Noise and Jumps. Bernoulli, 15 634–658. [Renker11] Renault, E., T. van der Heijden, and B. J. M. Werker (2011). The Dynamic Mixed Hitting-Time Model for Multiple Transaction Prices and Times. Working Paper, University of North Carolina and Tilburg University. [Rena11] Renault, E., and B. J. M. Werker (2011). Causality Effects in Return Volatility Measures with Random Times. Journal of Econometrics, 160(1), 272–279. [Rev99] Revuz, D., and Yor, M. (1999). Continuous Martingales and Brownian Motion. Springer-Verlag, Berlin. 30 [Rus99] Russell, J. R. (1999). Econometric Modeling of Multivariate Irregularly-Spaced HighFrequency Data. University of Chicago Graduate School of Business Working Paper. [Russ05] Russell, J. R., and R. F. Engle (2005). A Discrete-State Continuous-Time Model of Financial Transactions Prices and Times: The Autoregressive Conditional Multinomial– Autoregressive Conditional Duration Model. Journal of Business and Economic Statistics, 23(2), 166–180. [Sing01] Singleton, K. (2001). Estimation of Affine Pricing Models Using the Empirical Characteristic Function. Journal of Econometrics, 102, 111–141. [Tay86] Taylor, S. J. (1986). Modelling Financial Time Series. Chichester: Wiley. [Twee57] Tweedie, M. (1957). Statistical properties of inverse Gaussian distributions. Annals of Mathematical Statistics, 28, 362–377. [Val07] Valenti, D., B. Spagnolo, and G. Bonanno (2007). Hitting Time Distributions in Financial Markets. Physica A, 382, 311–320. [Whit86] Whitmore, G. (1986). Normal-Gamma Mixtures of Inverse Gaussian Distributions. Scandinavian Journal of Statistics, 13, 211–220. [Zha01] Zhang, M. Y., J. R. Russell, and R. S. Tsay (2001). A Nonlinear Autoregressive Conditional Duration Model with Applications to Financial Transaction Data. Journal of Econometrics, 104, 179–207. [Zha05] Zhang, L., Mykland, P., and Aı̈t-Sahalia, Y. (2005). A Tale of Two Time Scales: Determining Integrated Volatility with Noisy High-Frequency Data. Journal of the American Statistical Association, 100, 1394-1411. [Zha01] Zhang, L. (2001). From Martingales to ANOVA: Implied and Realized Volatility, Ph.D. thesis, The University of Chicago, Department of Statistics. [Zha06] Zhang, L. (2006). Efficient Estimation of Stochastic Volatility Using Noisy Observations: A Multi-Scale Approach. Bernoulli, 12, 1019–1043. [Zhou96] Zhou, B. (1996). High Frequency Data and Volatility in Foreign Exchange Rates. Journal of Business and Economic Statistics, 14: 45-52. 31
© Copyright 2026 Paperzz