Journal of Applied Statistics Vol. 37, No. 7, July 2010, 1089–1111 Detection of changes in a random financial sequence with a stable distribution Dong Hana , Fugee Tsungb∗ , Yanting Lic and Jinguo Xiana a Department of Mathematics, Shanghai Jiao Tong University, Shanghai, 200030, People’s Republic of China; b Department of Industrial Engineering and Logistics Management, Hong Kong University of Science and Technology, Hong Kong; c Department of Industrial Engineering and Management, School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai, China 200030, People’s Republic of China (Received 19 February 2008; final version received 19 March 2009) Quick detection of unanticipated changes in a financial sequence is a critical problem for practitioners in the finance industry. Based on refined logarithmic moment estimators for the four parameters of a stable distribution, this article presents a stable-distribution-based multi-CUSUM chart that consists of several CUSUM charts and detects changes in the four parameters in an independent and identically distributed random sequence with the stable distribution. Numerical results of the average run lengths show that the multi-CUSUM chart is superior (robust and quick) on the whole to a single CUSUM chart in detecting the shift change of the four parameters. A real example that monitors changes in IBM’s stock returns is used to demonstrate the performance of the proposed method. Keywords: logarithmic moment estimators; multi-CUSUM charts; detection of changes; random sequence with stable distribution 1. Introduction Tremendous amounts of time series data from the financial markets that contain important information for investors and financial analysts can be collected. Investment bankers, derivatives traders, stockbrokers, and securities and bonds investors make their decisions based on their experience and historical data. They are always exposed to high risk of economic loss. Thus, one of the most important tasks of financial institutions is to evaluate exposure to market risks. A common methodology that they use for estimation of market risks is the value at risk (VaR), which is the highest possible loss over a certain period of time at a given confidence level. For instance, a daily VaR for a given portfolio of assets of 2 million dollars at a 95% confidence level implies that without abrupt changes in the market conditions, there is a 5% chance that one-day losses will exceed 2 million dollars. The VaR is obtained based on the the cumulative ∗ Corresponding author. Email: [email protected] ISSN 0266-4763 print/ISSN 1360-0532 online © 2010 Taylor & Francis DOI: 10.1080/02664760902914433 http://www.informaworld.com 1090 D. Han et al. distribution function of portfolio returns in one period. In order to obtain reliable and precise VaR estimates, financial firms must be keenly alert to changes in the underlying distribution of portfolio returns. Ignoring changes in the distribution leads to incorrect VaR estimates, which eventually could cause bad decisions and enormous economic losses. Therefore, the ability to detect changes in financial time series is essential. In order to avoid or prepare for the unanticipated changes, it is critical to have a proper tool that monitors such financial data efficiently. A number of researchers have studied tests for structural breaks, in particular, those in the tail index. Perron [34] and Andrews [2] deal with tests for structural breaks in model parameters with known and unknown dates (see also [17] and the references therein for a review on this subject). Quintos et al. [35] present methods for testing structural breaks in tail index of heavy-tailed (possibly dependent) time series with possibly unknown break date. Ibragimov [18] studied the efficiency, peakedness, and majorization properties of linear estimators under heavy-tailedness assumptions. On the other hand, to monitor and detect changes in a sequence, various statistical process control (SPC) schemes have been developed in industrial quality control, such as the Shewhart chart [43], the cumulative sum chart [33], the exponentially weighted moving average [39], the Shiryayev–Roberts procedure [40,44], Bayes-type statistics [7], the regression control chart [28], residual-based control chart [1]), the GLR test [45], and its applications [3,48], the autoregressive moving average chart [20], the proportional integral derivative chart [21], etc. These SPC schemes have been extensively applied in industrial quality control and automated fault detection in dynamical systems. However, their dependence on the assumption that the underlying distribution of the charting characteristics is Gaussian unfortunately limits their application to the financial world. Mandelbrot [29,30] noticed long ago that many types of economic time series, such as stock return (indexes, funds), do not fit the Gaussian distribution because of their heavy tails and strong asymmetries. This led Mandelbrot to suggest a stable distribution, Sα (β, γ , μ), as a possible model for the distributions of income and speculative prices. In recent years, the modeling of financial processes and their analysis has become a very active and fast-developing branch of applied mathematics and quantitative finance [4,5,8,11,36–38]. One reason for these developments, based on stable distributions, is that the financial environment has tended to be more impulsive than a Gaussian distribution can describe. The popular stable distribution, being a generalization of the Gaussian distribution, shares many “nice” properties with the Gaussian distribution. For instance, the sum of independent stable random variables has only a stable distribution, which is similar to the central limit theorem for distributions with a finite second moment. Also, it is known that all one-dimensional stable distributions can be uniquely characterized by their characteristic function [47], which is given by ⎧ πα ⎪ α ⎪ exp iμt − γ |t| 1 − iβsign(t) tan , if α = 1 ⎨ 2 φ(t) = ⎪ 2 ⎪ ⎩exp iμt − γ |t| 1 + iβ sign(t) log |t| , if α = 1 π (1) where sign(t) is the sign of t defined by sign(t) = 1 if t > 0, sign(0) = 0, and sign(t) = −1 otherwise. There are four parameters in the distribution function. The parameter α (0 < α ≤ 2) is the characteristic exponent that controls the heaviness of the tails. The number μ(−∞ < μ < ∞) is the location parameter that corresponds to the mean for 1 < α ≤ 2 and the median for 0 < α ≤ 1 and β = 0. The number γ (γ > 0) is the scale parameter that sets the dispersion around its location parameter and, therefore, is analogous to the standard deviation. The number β(−1 ≤ β ≤ 1) is the symmetry parameter that characterizes the skewness. Journal of Applied Statistics 1091 Note that in almost all cases, the stable distributions denoted by Sα (β, γ , μ) do not have a closed probability density function. The only exceptions to this are the cases when α = 2 (Gaussian distributions), α = 1, β = 0 (Cauchy distributions) and α = 0.5, β = ±1 (Lévy distributions). Since the distribution of a stable random variable X ∼ Sα (β, γ , μ) with α ∈ (0, 2) obeys the power decay P (|X| > x) ∼ Cx −α , it follows that the stable distributions have infinite second (or higher) moments except for the case when α = 2. In fact, the stable distributions with α < 2 have finite moments only when order p is lower than α; that is, ⎧ ⎪ ⎨= ∞ if p ≥ α, α < 2, p E(|Xα | ) < ∞ if 0 ≤ p < α < 2, ⎪ ⎩ < ∞ if p ≥ 0, α = 2, where the random variable is Xα ∼ Sα (β, γ , μ). For more details about the properties of stable distributions, see Uchaikin and Zolotarev [47]. It should be stressed that most of the recent empirical papers report that financial return series show tail indices around three, that is, variances of the variables are finite but their fourth moments are infinite (see [15,24,26] and the book by [14]). We believe that this conclusion is true when the proportion, (Tail length)/(Size of simple), is small in Hill’s estimation (see Table 6 in Section 5). When the proportion is nearly 0.15 ≈ 310/2070, the 95% confidence interval for α is (1.79, 2.24). This means that the tail indices may be less than two. Moreover, Ibragimov et al.’s [19] paper shows that economic losses from natural disasters and the time series in catastrophe insurance markets have infinite variance tails. Because stable distributions have no closed probability density function with infinite variance, conventional SPC charting schemes are not easily applicable to monitoring and detecting changes in sequences that have stable distributions. Our main purpose in this article is to present a stabledistribution-based multi-CUSUM chart to detect simultaneously the changes in four parameters, α, β, γ and μ, in an independently and identically distributed (i.i.d.) random sequence with a common stable distribution, Sα (β, γ , μ). The multi-CUSUM chart can, to a considerable extent, achieve the following three goals in monitoring and detecting unknown changes in a stabledistributed random sequence: it can (1) signal an alarm as quickly as possible when there is a change in the sequence, (2) accurately indicate the possible type/amount/size/etc. of the change, and (3) easily handle computational complexity [16]. To this end, a critical step is to solve the problem of sequentially estimating the parameters of a stable distribution, since an unbiased and consistent parameter estimation approach would be the key to obtaining effective statistical functions and establishing an efficient multi-CUSUM chart. In the next section, we present the proposed estimators and discuss their unbiasness and consistency. The corresponding multi-CUSUM chart is presented in Section 3. Section 4 presents the numerical simulation results that compare the average run lengths (ARLs) of detecting changes in the four parameters of a stable distributed sequence. Section 5 provides a real example that illustrates the step-by-step procedure used to detect changes in IBM’s stock returns, and gives a simple comparison of detection performance between the multi-CUSUM chart with other CUSUM tests defined by Hill’s estimator which was used by Quintos et al. [35] to estimate α. Conclusions and suggestions for future research are presented in Section 6, with the proofs of two propositions given in Appendix. 2. Refined logarithmic moment estimators There have been many studies in the literature addressing the problem of estimating the parameters of stable distributions. Most of these studies focus on the special case of symmetric stable distributions with β = 0 [16,13,27]. Various estimation techniques for estimating the parameters of 1092 D. Han et al. general (possibly skewed) stable distributions and further references can be found in Kuruoğlu [23] and Nolan [31,32]. Here, we are especially interested in the logarithmic moment (LM) estimators suggested by Kuruoğlu [23], since the estimators not only have closed-form solutions and require less computation, but they can also compete with the characteristic function techniques. However, the LM estimators for the parameter β can only estimate |β|; that is, they cannot detect the sign of β. This weakness will limit their applications to some extent. In this section, we first refine the LM estimators to resolve the existing weakness in Kuruoğlu [23]. We also assess the unbiasness and consistency properties of the estimators. 2.1 Refined LM estimators Let Xk , k ≥ 1, be i.i.d. stable random variables with the parameters α, β, γ and μ, that is, Xk ∼ Sα (β, γ , μ) for k ≥ 1. Since the sum of independent stable random variables is also stable, the distribution of a weighted sum of i.i.d. stable variables with weights, ck , can be obtained from the characteristic function [41]: n n n n <α> k=1 ck α (2) ck Xk ∼ Sα β n ,γ |ck | , μ ck , α k=1 |ck | k=1 k=1 k=1 where we denote the signed pth power of the number x as x p = sign(x)|x|p . From Equation (2), we can further generate three new sequences, {XkL }, {XkS } and {XkLS }, of i.i.d. stable variables with a zero location parameter, a zero symmetry parameter, or zero values for both the location parameter and the symmetry parameter (except when α = 1), respectively: 2 − 2α L α Xk = X3k + X3k−1 − 2X3k−2 ∼ Sα β , γ (2 + 2 ), 0 (3) 2 + 2α XkS = X3k + X3k−1 − 21/α X3k−2 ∼ Sα (0, 4γ , μ(2 − 21/α )) XkLS = X2k − X2k−1 ∼ Sα (0, 2γ , 0) (4) (5) for k ≥ 1. The random variables XkLS and XkS are usually referred to as symmetrized random variables. Next, we list three formulae given by Kuruoğlu [23], one for signed fractional moments and the others for LMs of skewed stable distributions, that are used in the following discussion. Let X ∼ Sα (β, γ , 0). Then (1 − p/α) γ p/α sin(pθ/α) E(X <p> ) = (6) (1 − p) cos θ sin(pπ/2) for p ∈ (−2, −1) ∪ (−1, α), and γ 1 1 E(log |X|) = λ0 1 − + log α α cos θ θ2 π2 1 1 E([log |X| − E(log |X|)]2 ) = − 2, + 2 6 α 2 α where (.) is the gamma function, λ0 = −0.57721566 . . . and πα θ = arctan β tan . 2 (7) (8) (9) Journal of Applied Statistics 1093 Since | cos θ | and θ 2 occur, respectively, in Equations (7) and (8), one can only estimate |θ | and therefore, |β|. To estimate θ , we need to introduce a number that is relative to θ . In fact, the required number is just E(sign(X)), which has been considered by Zolotarev [49]. It follows from Equation (6) that E(sign(X)) = E(X 0 ) = lim E(X p ) = p→0 2θ . πα (10) Thus, by using Equation (7), (8) and (10), we can estimate θ as well as the other parameters. Now, we introduce five sample statistics for estimating the parameters: An = 1 sign(XkL ), n k=1 Bn = 1 log |XkL | n k=1 Cn = 1 log |XkLS |, n k=1 Dn = 1 (log |XkLS | − Cn )2 n − 1 k=1 n n n and n E(n) = S X(n+1/2) if n is odd, S S X(n+2/2) + X(n/2) if n is even, (11) S ,1 ≤ k ≤ where {XkL }, {XkS } and {XkLS } are defined in Equations (3), (4) and (5), respectively, X(k) S S S n, is the increasing order of arranging Xk , 1 ≤ k ≤ n; that is, X(k) , 1 ≤ k ≤ n satisfies X(1) < S S < · · · < X(n) . X(2) In this article, we mainly consider the case: α = 1, 0 < α < 2. The resulting estimators, α̂, β̂, γ̂ and μ̂, respectively, for α, β, γ and μ are summarized as follows. LM Estimator for α (α = 1, 0 < α < 2) Note that {XkLS } is i.i.d. with XkLS ∼ Sα (0, 2γ , 0). It follows from Equation (8) that π2 E([log |X| − E(log |X|)] ) = 6 2 that is, α= 1 1 + ; α2 2 6E([log |X| − E(log |X|)]2 ) 1 − π2 2 (12) −1/2 . Thus, we may use the following statistic α̂ = 1 6Dn − 2 π 2 −1/2 (13) as an estimator of the parameter α. Note that if Xk ∼ Sα (0, 1, 0), by Equation (7) we can choose α̂ = as an estimator of the parameter α. λ0 − (1/n) λ0 n k=1 log |Xk | (14) 1094 D. Han et al. LM Estimator for β (|β| ≤ 1) Since XkL ∼ Sα 2 − 2α α β , γ (2 + 2 ), 0 , 2 + 2α (15) it follows from Equation (10) that θ= παE(sign(X)) . 2 We therefore take θ̂ = π α̂An 2 (16) as an estimator of the parameter θ . Thus, the estimator of parameter β can be written as β̂ = 2 + 2α̂ tan(θ̂) . 2 − 2α̂ tan(π α̂/2) (17) LM Estimator for γ (γ > 0) By Equations (7), (13), (15) and (16), we may take γ̂ = 1 | cos(θ̂ )| exp{(Bn − λ0 )α̂ + λ0 } 2 + 2α̂ (18) as an estimator of the parameter γ . Median Estimator for μ It is known that the parameter μ is the mean of the stable distributions when 1 < α ≤ 2 and γ = 1 [47]. Thus, the parameter μ can be estimated by the simple mean 1 Xk , n k=1 n μ̂ = where Xk , k ≥ 1, is an i.i.d. stable random variable with 1 < α ≤ 2. However, the dissipation of the value of the sample mean will be large since the variances of the stable distributions with α < 2 is infinite. Moreover, it is not applicable to use the simple mean for an estimator of the parameter μ when 0 < α ≤ 1. Next, we will use the median technique presented by Zolotarev [49 p. 240] to estimate μ. It follows from Equation (5) that the median of XkS coincides with μ̃, where μ̃ = μ(2 − 21/α ). Taking the sample median E(n) in Equation (11) as an estimator of the parameter μ̃, we can obtain the median estimator for μ in the following: μ̂ = E(n) . 2 − 21/α̂ (19) Remark 1 The estimator θ̂ in Equation (16) requires substantially less computation than that suggested by Kuruoğlu [23]. Remark 2 The estimators above require either 3n + 2 or 2n + 1 observations. Journal of Applied Statistics 1095 2.2 Unbiasness and consistency of the LM estimators In this section, we assess the unbiasness and consistency of the proposed estimators. However, these properties cannot be studied directly. This is because the statistic α̂ cannot be computed if 6Dn /π 2 − (1/2) is less than or equal to zero, and β̂, γ̂ and μ̂ cannot be computed also when θ̂ = π/2 and α̂ = 1 or α̂ = 2. To ensure that the required statistics cannot only be computed but also are consistent and asymptotically unbiased estimators of the parameters, we have to modify the five estimators: α̂, θ̂ , β̂, γ̂ and μ̂. Let A = E(sign(X)). Since α = 1, it follows from Equations (9) and (10) that θ = π/2, i.e. −π/2 < θ < π/2, and therefore, −1 < Aα < 1. The Modified Estimator α̂ˆ To avoid α̂ = 1 or α̂ = 2, we take two positive numbers a, b such that a < and define ⎧ 6Dn 1 ⎪ ⎪ − ⎪ 2 ⎪ π 2 ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ 1 + n−a ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪1 − n−a ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ 6D 1 n − Dn (a, b) = 2 π 2 ⎪ ⎪ ⎪ ⎪ ⎪ ⎪1 ⎪ −a ⎪ ⎪ ⎪4 + n ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ 1 ⎪ ⎪ − n−a ⎪ ⎪ 4 ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ 6Dn 1 ⎪ ⎪ − ,b ⎩max π2 2 if √ b/2 and b < 1/9, 1 6Dn − > 1 + n−a , 2 π 2 if 1 ≤ 6Dn 1 − ≤ 1 + n−a , 2 π 2 if 1 − n−a ≤ 6Dn 1 − < 1, π2 2 if 1 6Dn 1 + n−a < 2 − < 1 − n−a , 4 π 2 if 1 6Dn 1 1 < 2 − ≤ + n−a , 4 π 2 4 if 1 6Dn 1 1 − n−a ≤ 2 − ≤ , 4 π 2 4 if 1 6Dn 1 − < − n−a , 2 π 2 4 where n ≥ n0 such that 1/4 − b > n−a . Thus, we replace the estimator α̂ with the following statistic α̂ˆ = (Dn (a, b))−1/2 (20) √ as an estimator of the parameter α. Obviously, the modified estimator α̂ˆ is bounded, i.e. α̂ˆ ≤ 1/ b, and not equal to 1 or 2. Since the estimator α̂ = λ0 − 1 n λ0 n k=1 log |Xk | has no definition when 1/n nk=1 log |Xk | = λ0 for Xk ∼ Sα (0, 1, 0), so we may modify it by the same way such that the modified estimator α̂ˆ is finite when 1/n nk=1 log |Xk | = λ0 . 1096 D. Han et al. The Modified Estimators θ̂ˆ and β̂ˆ To avoid θ̂ = ±π/2, we take the following statistic θ̂ˆ to replace θ̂ as an estimator of the parameter θ : ⎧ ˆ n ⎪ π α̂A ⎪ ˆ n > 1 + n−a ⎪ if α̂A ⎪ ⎪ ⎪ 2 ⎪ ⎪ ⎪ ⎪ π(1 + n−a ) ⎪ ⎪ ˆ n ≤ 1 + n−a ⎪ if 1 < α̂A ⎪ ⎪ 2 ⎪ ⎪ ⎪ −a ⎪ ⎪ ⎪ π(1 − n ) ˆ n≤1 ⎪ if 1 − n−a ≤ α̂A ⎪ ⎪ 2 ⎪ ⎪ ⎪ ⎨ ˆ ˆθ̂ = π α̂A n ˆ n | < 1 − n−a if |α̂A ⎪ ⎪ 2 ⎪ ⎪ ⎪ ⎪ π(1 − n−a ) ⎪ ⎪ ˆ n ≤ −(1 − n−a ) ⎪ − if − 1 ≤ α̂A ⎪ ⎪ 2 ⎪ ⎪ ⎪ ⎪ ⎪ π(1 + n−a ) ⎪ ˆ n < −1 ⎪ − if − (1 + n−a ) ≤ α̂A ⎪ ⎪ 2 ⎪ ⎪ ⎪ ⎪ ⎪ ˆ n ⎪ π α̂A ⎪ ˆ n < −(1 + n−a ). ⎩ if α̂A 2 Thus, the modified estimator of parameter β can be written as ˆ 2 + 2α̂ tan(θ̂) . β̂ˆ = ˆ 2 − 2α̂ˆ tan(π α̂/2) ˆ (21) ˆ is also bounded. The resulting estimator, β̂, The Modified Estimators γ̂ˆ and μ̂ˆ To make the modified estimator γ̂ˆ bounded, we introduce a bounded statistic a log n if Bn > a log n, Bn (a) = Bn if Bn ≤ a log n. Thus, we take the following bounded statistic γ̂ˆ = 1 | cos(θ̂ˆ )| exp{(Bn (a) − λ0 )α̂ˆ + λ0 } 2 + 2α̂ (22) as a modified estimator of the parameter γ . Similarly, the modified estimator μ̂ˆ can be written as μ̂ˆ = E(n) 2 − 21/α̂ˆ . (23) On the unbiasness and consistency of the modified estimators, we give two propositions in the following (see Appendix for the proofs): ˆ β̂, ˆ γ̂ˆ and μ̂, ˆ θ̂, ˆ are consistent and asymptotically unbiased Proposition 1 All five statistics, α̂, estimators of the parameters, α, θ, β, γ and μ. Journal of Applied Statistics 1097 Proposition 1 shows that the modified estimators indeed have good statistical properties, although they need more computation. Proposition 2 For any small > 0, P(|α̂ˆ − α̂| ≥ ) −→ 0, P(|θ̂ˆ − θ̂ | ≥ ) −→ 0, P(|β̂ˆ − β̂| ≥ ) −→ 0, P(|γ̂ˆ − γ̂ | ≥ ) −→ 0, P(|μ̂ˆ − μ̂| ≥ ) −→ 0 as n → ∞. Proposition 2 shows that there is actually little difference in the sense of probability convergence between the modified estimators and the previous refined estimators. That is, the five refined estimators, α̂, θ̂ , β̂, γ̂ and μ̂, have the same statistical properties as well as the five modified estimators do in the sense of probability convergence. The main reason for us to introduce and study the the modified estimators is that the unbiasness and consistency of the five refined estimators, α̂, θ̂ , β̂, γ̂ and μ̂, cannot be studied directly since they cannot be computed in some cases. From the Propositions 1 and 2, we see that the five refined estimators, α̂, θ̂ , β̂, γ̂ and μ̂, may have the same good statistical properties such as the unbiasness asymptotically and consistency as well as the five modified estimators do in the sense of probability convergence. Moreover, the five refined estimators require less computation than the modified ones do. Thus, we will use the refined estimators to construct the multi-CUSUM charts in the next section. 3. Multi-cusum charts for random sequences A multi-chart is a combination of several control charts or a set of control charts used together for obtaining a better detection power [9,10,25]. Specifically, a multi-chart with multiple CUSUM charts, i.e. the multi-CUSUM chart, is known to have particularly good properties in detecting and diagnosing an abrupt change in a process [16]. In this section, we apply the multi-CUSUM chart to monitoring changes in random sequences with stable distributions based on the LM estimators discussed in the previous section. Let Xk , k = 1, 2, . . . be the kth independent observation of a random sequence with a known common stable distribution Sα0 (β0 , γ0 , μ0 ). For simplicity, we take α0 = 1.5, β0 = 0, γ0 = 1 and μ0 = 0. Here, we take α0 = 1.5 since the tail index α of heavy-tailed time series in financial markets is usually between 1 and 2, and the number μ in the stable distribution Sα (β, γ , μ) is the mean when 1 < α ≤ 2. Suppose that at some time period, τ , which is usually called a change point, the probability distribution of Xk changes from S1.5 (0, 1, 0) to Sα (β, γ , μ). In other words, from time period τ onwards, Xi has the common distribution Sα (β, γ , μ). In practice, the change point and the post-change distributions are usually unknown, that is, τ and the changed parameters vector (α, β, γ , μ) are unknown. To identify the multi-CUSUM chart for detecting unknown abrupt changes from the parameter vector, ξ0 = (1.5, 0, 1, 0) to ξ = (α, β, γ , μ), let ᾱ1 , β̄1 , γ̄1 and μ̄1 be the four positive, pre-specified known reference values that describe the change values from ξ0 = (α0 , β0 , γ0 , μ0 ) = (1.5, 0, 1, 0) to ξ = (α1 , β1 , γ1 , μ1 ), where ᾱ1 = (1/α0 ) − (1/α1 ), β̄1 = β1 − β0 , γ̄1 = γ1 − γ0 and μ̄1 = μ1 − μ0 . A basic upward-side CUSUM chart, TC , can be defined as n TC = inf n : max (Xi − δ/2) ≥ c , (24) 1≤k≤n i=n−k+1 where δ/2 > 0 is the reference value related to the magnitude of the mean shift, δ, and c > 0 is a control limit. 1098 D. Han et al. We first present upward-side CUSUM charts, T (α1 ), T (β1 ), T (γ1 ) and T (μ1 ), in the following for detecting the change in parameters, α, β, γ and μ, respectively. Let λ0 − log |Xk | , λ0 Yk = Zk = λ0 λ0 − (1/min{k, m}) ki=k−m+1 log |Xi | and Uk = k πZk sign(Xi ). 2 min{k, m} i=k−min{k,m}+1 It follows from Equation (7) that {Yk , k ≥ 1} is i.i.d. with the mean E(Yk ) = 1/α0 = 1/1.5. Hence, to detect the change from α0 = 1.5 to α, we use the following CUSUM chart: ⎧ ⎨ ⎡ ⎛ n ⎝ T (α1 ) = min n : max ⎣ ⎩ 1≤j ≤n k=j ⎫ ⎞⎤ ⎬ Yi ᾱ1 ⎠⎦ 1 − − ≥ lα1 , ⎭ min{k, m} α0 2 i=k−min{k,m}+1 k (25) to detect the mean shift of {Yk } from 1/α0 to 1/α, where the positive number, lα1 , is the upward control limit. Similarly, by Equations (9) and (16), the CUSUM chart T (β1 ) for detecting β is given by ⎧ ⎨ ⎡ ⎛ n ⎝ T (β1 ) = min n : max ⎣ ⎩ 1≤j ≤n k=j ⎫ ⎞⎤ ⎬ g(X1 , . . . , Xi ) β̄1 ⎠⎦ − β0 − ≥ lβ1 , ⎭ min{k, m} 2 i=k−min{k,m}+1 k (26) where tan(Uk ) . tan(π Zk /2) g(X1 , . . . , Xk ) = By Equations (7) and (11), the CUSUM charts T (γ1 ) and T (μ1 ) for detecting γ and μ, respectively, are written as ⎧ ⎨ ⎡ ⎛ n ⎝ T (γ1 ) = min n : max ⎣ ⎩ 1≤j ≤n k=j ⎞⎤ k ⎫ ⎬ h(X1 , . . . , Xi ) γ̄1 − γ0 − ⎠⎦ ≥ lγ1 ⎭ min{k, m} 2 i=k−min{k,m}+1 (27) and ⎧ ⎨ ⎡ T (μ1 ) = min n : max ⎣ ⎩ 1≤j ≤n ⎫ ⎤ ⎬ μ̄1 ⎦ E(k) − μ0 − ≥ lμ1 , ⎭ 2 n k=j where h(X1 , . . . , Xk ) = | cos(Uk )| exp k 1 log |Xi | − λ0 Zk + λ0 . k i=1 (28) Journal of Applied Statistics 1099 Similarly, the downward-side CUSUM charts are written as ⎫ ⎧ ⎡ ⎛ ⎞⎤ n k ⎬ ⎨ Y ᾱ 1 i 2 ⎝ + ⎠⎦ ≤ −lα2 , − T (α2 ) = min n : min ⎣ ⎭ ⎩ 1≤j ≤n min{k, m} α0 2 k=j i=k−min{k,m}+1 ⎫ ⎧ ⎡ ⎛ ⎞⎤ n k ⎬ ⎨ g(X ) β̄ i 2 ⎝ T (β2 ) = min n : min ⎣ − β0 + ⎠⎦ ≤ −lβ2 , ⎭ ⎩ 1≤j ≤n min{k, m} 2 k=j i=k−min{k,m}+1 ⎫ ⎧ ⎡ ⎛ ⎞⎤ n k ⎬ ⎨ h(X ) γ̄ i 2 ⎝ T (γ2 ) = min n : min ⎣ − γ0 + ⎠⎦ ≤ −lγ2 ⎭ ⎩ 1≤j ≤n min{k, m} 2 k=j i=k−min{k,m}+1 and ⎧ ⎨ ⎫ ⎡ ⎤ n ⎬ μ̄ 2 ⎦ E(k) − μ0 + T (μ2 ) = min n : min ⎣ ≤ −lμ2 , ⎩ 1≤j ≤n ⎭ 2 k=j where ᾱ2 = (1/α2 ) − (1/α0 ), β̄2 = β0 − β2 , γ̄2 = γ0 − γ2 and μ̄2 = μ0 − μ2 are four positive numbers and lα2 , lβ2 , lγ2 and lμ2 are the downward control limits. Since (α0 , β0 , γ0 , μ0 ) = (1.5, 0, 1, 0), it follows that β̄1 = β1 , μ̄1 = μ1 , β̄2 = −β2 and μ̄2 = −μ2 . Now, the multi-CUSUM chart, TM , for detecting the change of ξ = (α, β, γ , μ) can be defined by TM = min{T (α1 , α2 ), T (β1 , β2 ), T (γ1 , γ2 ), T (μ1 , μ2 )}, (29) where T (α1 , α2 ) = min{T (α1 ), T (α2 )}, T (β1 , β2 ) = min{T (β1 ), T (β2 )}, T (γ1 , γ2 ) = min{T (γ1 ), T (γ2 )} and T (μ1 , μ2 ) = min{T (μ1 ), T (μ2 )} are four two-sided CUSUM charts. Remark 3 If Xk , k ≥ 1, has the common known distribution, Sα0 (μ0 , β0 , γ0 ), where β0 = 0 or γ0 = 1 and μ0 = 0, then the above statistics, Yk , Zk , Uk , g(X1 , . . . , Xk ), h(X1 , . . . , Xk ), E(k) and E(Yk ) should be replaced with 6 1 6Dk 1 −1/2 − Yk = [log |XkLS | − E(log |XkLS |)]2 − , Zk = π 2 π2 2 Uk = π Zk sign(XkL ), 2 g(X1 , . . . , Xk ) = 1 h(X1 , . . . , Xk ) = | cos(Uk )| exp 2 + 2 Zk 2 + 2Zk tan(Uk ) 2 − 2Zk tan(π Zk /2) 1 log |XiL | − λ0 Zk + λ0 , k i=1 k E(k) 2 − 21/Zk and E(Yk ) = 1/α02 , respectively. 4. Performance evaluation by simulation comparison For the convenience of discussion, we use the standard quality control terminology. Let P0 (·) and E0 (·) denote the probability and expectation when there is no change in the mean. Denote P (·) and E(·) as the probability and expectation when the change point is at τ = 1, and the mean is changed from x0 to x. For a stopping time T as the alarm time with a detecting procedure, two most frequently used operating characteristics are the in-control average run length (ARL0 ) and the out-of-control ARL, defined by ARL0 (T ) = E0 (T ) and ARL(T ) = E(T ). Usually, comparisons 1100 D. Han et al. of the control charts’ performance are made by designing the common ARL0 and comparing the ARLs of the control charts for a given mean shift. The chart with the smaller ARL is considered to have better performance. In this section, we evaluate the performance of the proposed monitoring scheme by comparing the numerical simulation results of the ARL of the four separate two-sided CUSUM charts, T (α1 , α2 ), T (β1 , β2 ), T (γ1 , γ2 ), T (μ1 , μ2 ), and the multi-CUSUM chart, TM . Here, the statistics, Yk , Zk , Uk , g(X1 , . . . , Xk ), h(X1 , . . . , Xk ), and E(k) in the CUSUM charts T (α1 , α2 ), T (β1 , β2 ), T (γ1 , γ2 ), T (μ1 , μ2 ) in Section 3 have been replaced, respectively, by 6 1 6Dk 1 −1/2 LS LS 2 Yk = [log |Xk | − E(log |Xk |)] − , Zk = − π 2 π2 2 Uk = π Zk sign(XkL ), 2 g(X1 , . . . , Xk ) = 1 h(X1 , . . . , Xk ) = | cos(Uk )| exp 2 + 2 Zk 2 + 2Zk tan(Uk ) 2 − 2Zk tan(π Zk /2) 1 log |XiL | − λ0 Zk + λ0 , k i=1 k E(k) 2 − 21/Zk since we consider more general circumstances that it is not necessary to assume β0 = 0, γ0 = 1 and μ0 = 0. This has been pointed out in Remark 3. For comparison purposes, the in-control ARL (ARL0 ) of all candidate charts are forced to be equal; in other words, the control limits of each control chart are determined to correspond to the same level of type-I error. The chart that has the lowest out-of-control ARL at the desired shift size presents the highest detection power to the pre-specified shift. Here, for illustration purposes, the common ARL0 for the five control charts is chosen to be around 500, which is a typical value in practice. We denote control limits of four two-sided CUSUM charts, T (α1 , α2 ), T (β1 , β2 ), T (γ1 , γ2 ) and T (μ1 , μ2 ) as (lα 1 , lα 2 ), (lβ 1 , lβ 2 ), (lγ 1 , lγ 2 ) and (lμ 1 , lμ 2 ), respectively. For the eight control limits of the multi-CUSUM chart, TM , we denote them as (lα1 , lα2 , lβ1 , lβ2 , lγ1 , lγ2 , lμ1 , lμ2 ). Here, we take α1 = 1.3, α2 = 1.7, β1 = 0.5, β2 = −0.5, γ1 = 2, γ2 = 0, μ1 = 1 and μ2 = −1. The main reason for us to take such reference values is that they are symmetric about ξ0 = (1.5, 0, 1, 0) since α0 − α1 = α2 − α0 , β0 − β1 = β2 − β0 , γ0 − γ1 = γ2 − γ0 and μ0 − μ1 = μ2 − μ0 . In order to make T (α1 , α2 ), T (β1 , β2 ), T (γ1 , γ2 ), T (μ1 , μ2 ) and TM have a common ARL0 = 500, the values of the control limits are taken as follows: lα 1 = 12, lα 2 = 8, lα1 = 28, lα2 = 15.1, lβ 1 lβ 2 lβ1 = 155.1, lβ2 = 155.1, = 75.1, = 75.1, lγ 1 = 66, lγ 2 = 3, lγ1 = 145.1, lγ2 = 5.2, lμ 1 lμ 2 lμ1 = 173.2, lμ2 = 173.2, = 102, = 102, where the common ARL of the CUSUM charts, T (α1 , α2 ), T (β1 , β2 ), T (γ1 , γ2 ), T (μ1 , μ2 ), in TM need to satisfy ARL0 (T (α1 , α2 )) = ARL0 (T (β1 , β2 )) = ARL0 (T (γ1 , γ2 )) = ARL0 (T (μ1 , μ2 )) = 2200. That is, we can get the ARL0 (TM ) ≈ 500 for the multi-chart TM by numerical computation when the ARL0 s of its four component CUSUM charts all approximate to 2200. Of course, we can also obtain the ARL0 (TM ) ≈ 500 when the four component CUSUM charts of TM have different ARL0 . In this case, the ARL0 of some charts must be larger than 2200, some must be smaller than 2200 among the four component charts. We make the four component CUSUM charts have the same ARL0 here, since the four parameters α, β, γ and μ are considered in the article to have Journal of Applied Statistics 1101 the same importance. The control limits can be determined by repeated computation for taking different control limits. Tables 1 and 2 illustrate the numerical simulation results of the ARLs of the four two-sided CUSUM chars, T (α1 , α2 ), T (β1 , β2 ), T (γ1 , γ2 ), T (μ1 , μ2 ) and the multi-CUSUM chart, TM . The numerical results of the ARLs were obtained based on 1,000,000-repetition experiments. The in-control value of the four parameters are α = 1.5, β = 0.0, γ = 1, μ = 0.0. The true values of α, β, γ and μ of a stable distribution are listed in the first four columns in each table. Concerning the out-of-control scenario, we consider two extreme cases. Table 1 lists the simulation results when only one parameter out of α, β, γ and μ changes and the other three parameters do not change. Table 2 contains the simulation results of the ARLs when all four parameters change. The notation over the ARL values in the two tables indicates that the value is greater than the in-control ARL of 500. The ARL values in boldface mean that it is the minimal value among the four ARL values of T (α1 , α2 ), T (β1 , β2 ), T (γ1 , γ2 ) and T (μ1 , μ2 ) in each row of Tables 1 and 2. Table 1. Comparisons of the ARLs of the CUSUM charts and the multi-CUSUM chart with ARL0 ≈ 500. Shifts T (α {α} {β} {γ } {μ} 1.5 0 1 0 499.7 1.7 1.9 1.3 1.1 0 0 0 0 1 1 1 1 0 0 0 0 486.5 429.1 426.6 279.8 1.5 1.5 1.5 1.5 0.5 0.9 −0.5 −0.9 1 1 1 1 0 0 0 0 1.5 1.5 1.5 1.5 0 0 0 0 1.5 2.0 0.5 0.2 0 0 0 0 ∗ 509.2 ∗ 502.5 ∗ 508.1 1.5 1.5 1.5 1.5 0 0 0 0 1 1 1 1 1 2 −1 −2 T (β1 , β2 ) 1 , α2 ) T (γ1 , γ2 ) T (μ1 , μ2 ) TM 500.9 499.8 498.7 501.2 ∗ 536.9 ∗ 603.5 ∗ 597.4 ∗ 594.8 ∗ 940.3 ∗ 1672.0 ∗ 562.0 ∗ 507.1 482.4 ∗ 519.2 389.7 299.2 257.6 142.6 300.2 162.3 215.6 216.4 215.4 216.6 349.0 213.9 368.8 231.4 272.2 282.5 270.0 280.1 98.2 82.5 97.7 84.3 109.5 97.8 111.7 102.4 498.5 497.5 494.9 493.9 500.8 153.4 82.7 78.9 25.3 143.3 65.8 ∗ 3774.0 ∗ 5821.0 138.0 69.6 107.2 39.1 498.5 497.1 496.9 491.1 496.3 225.4 100.9 223.2 100.4 334.7 185.8 332.8 184.3 ∗ 506.4 500.0 ∗ 504.5 ∗ 504.4 497.3 494.3 493.0 Table 2. Comparisons of the ARLs of the CUSUM charts and the multi-CUSUM chart with ARL0 ≈ 500. Shifts {ξ } {α} {β} {γ } {μ} ξ0 ξ1 ξ2 ξ3 ξ4 ξ5 ξ6 ξ7 ξ8 ξ9 1.5 1.7 1.7 1.7 1.9 1.9 1.9 1.1 1.1 1.1 0.0 0.5 0.9 −0.5 0.5 0.9 −0.5 0.5 0.9 −0.5 1.0 1.5 2.0 0.5 2.0 0.5 1.5 0.5 1.5 2.0 0.0 1.0 2.0 −1.0 −1.0 1.0 2.0 2.0 −1.0 1.0 T (α 1 , α2 ) 499.7 215.9 214.5 215.4 214.7 216.1 215.7 215.4 215.1 211.3 T (β1 , β2 ) T (γ1 , γ2 ) T (μ1 , μ2 ) TM 500.9 358.3 213.8 370.8 213.9 214.7 372.1 350.8 214.4 368.5 499.8 116.2 73.7 177.5 72.2 184.0 116.7 138.3 118.1 74.4 498.7 35.6 16.5 127.9 22.2 101.8 43.5 258.5 16.4 15.9 501.2 49.2 25.3 129.4 30.3 104.0 54.8 139.4 25.8 22.7 1102 D. Han et al. From Table 1, we see that 1. When only α changes, control charts T (β1 , β2 ) and T (γ1 , γ2 ) fail when there is a respective increase in α. Although T (μ1 , μ2 ) responds to the decrease in α in the quickest manner, it fails when α increases. Among the five control charts, although T (α1 , α2 ) has the best overall performance for detecting decreases or an increase in α, the multi-CUSUM chart, TM , is also good at detecting changes in α, except when there is an increase in α. 2. When only β changes, both T (μ1 , μ2 ) and TM are sensitive to changes in β. The other three CUSUM charts lose detection capability no matter β increases or decreases. 3. Both T (α1 , α2 ) and T (β1 , β2 ) are not sensitive to either decreases or increases in γ . T (μ1 , μ2 ) deteriorates when γ decreases. Although T (γ1 , γ2 ) is the quickest one to detect decreases in γ , the multi-CUSUM chart, TM , has the most satisfactory performance in detecting both increases and decreases in γ . 4. The three CUSUM charts T (α1 , α2 ), T (β1 , β2 ) and T (γ1 , γ2 ) are not sensitive to either decreases or increases in μ. Although T (μ1 , μ2 ) is the best among all charts in detecting the mean shifts, the multi-CUSUM chart, TM , shows a performance similar to that of T (μ1 , μ2 ). 5. The main reason for some ARL1 larger than 500 is that the variances of the estimators became smaller than that in the in-control case. For example, by 1,000,000-repetition experiments, the variance of the estimator E(k) /(2 − 21/Zk ) changes from 4.51 to 3.44, 2.75, 8.17 and 19.8 when the true value (α, β, γ , μ) changes from (1.5, 0, 1, 0) to (1.7, 0, 1, 0), (1.9, 0, 1, 0), (1.3, 0, 1, 0) and (1.1, 0, 1, 0), respectively. Thus, the corresponding ARL1 of T (μ1 , μ2 ) changes from the in-control value 498.7 to 940.3, 1672, 257.6, and 142.6. When the true value (α, β, γ , μ) changes from (1.5, 0, 1, 0) to (1.5, 0, 1.5, 0), (1.5, 0, 2.0, 0), (1.5, 0, 0.5, 0) and (1.5, 0, 0.2, 0), the variance of E(k) /(2 − 21/Zk ) changes from 4.51 to 6.96, 8.92, 2.30, 0.91, respectively. So, the corresponding ARL1 of T (μ1 , μ2 ) changes from 498.7 to 143.3, 65.8, 3774 and 5821, respectively. In summary, when only one of the four parameters changes, the multi-CUSUM chart, TM , is found to be superior to other charts on the whole for its robustness. It is not only robust (never has the worst performance) but is also competent in its detection capability. When all the four parameters change, we consider three shift sizes in each parameter. Here α ∈ {1.1, 1.7, 1.9}, β ∈ {−0.5, 0.5, 0.9}, γ ∈ {0.5, 1.5, 2} and μ ∈ {−1, 1, 2}. Thus there will be 81 possible shifts from ξ0 = (1.5, 0, 1, 0) to ξ = (α, β, γ , μ). To reduce the number of simulation experiments, we may use Taguchi’s orthogonal arrays [46], L9 (34 ), to choose the following nine possible change vectors in the parameters: ξ1 = (1.7, 0.5, 1.5, 1), ξ2 = (1.7, 0.9, 2, 2), ξ3 = (1.7, −0.5, 0.5, −1), ξ4 = (1.9, 0.5, 2, −1), ξ5 = (1.9, 0.9, 0.5, 1), ξ6 = (1.9, −0.5, 1.5, 2), and ξ7 = (1.1, 0.5, 0.5, 2), ξ8 = (1.1, 0.9, 1.5, −1), ξ9 = (1.1, −0.5, 2, 1). Table 2 shows that T (α1 , α2 ), T (β1 , β2 ) and T (γ1 , γ2 ) all fail in some cases. T (μ1 , μ2 ) and TM are the best two control charts among all the candidates. From ξ1 to ξ6 and ξ9 , the multiCUSUM chart, TM , can compete with T (μ1 , μ2 ) for quickly detecting the change. Moreover, TM outperforms T (μ1 , μ2 ) in case ξ7 . Therefore, the multi-CUSUM chart is considered to show the best and most robust performance when all the parameters change. In summary, no matter which of the four parameters of a stable distribution changes, the multi-CUSUM chart always shows the most satisfactory and robust performance. Journal of Applied Statistics 1103 5. An illustration with real stock returns We use IBM’s stock market returns from 1993 to 2007 to illustrate the implementation of the multiCUSUM chart for monitoring financial data. Figure 1 illustrates the daily IBM returns during 1993–2007. Statistical distributions of financial returns have been a primary area of investigation in both statistics and financial economics. Many researchers [12,30,42] have supported the theory that stock prices and/or stock returns follow stable distributions. The multi-CUSUM monitoring scheme should therefore be useful. At the end of this section, the detection performance of other kinds of CUSUM charts defined by Hill’s estimator are illustrated in Table 4. The successive daily closing prices are P1 , . . . Pn , where Pk is the closing price on trading day k. Let P0 be the closing price of the stock immediately before these n days. The stock returns for IBM are denoted as Pk k ≥ 1. Xk = 100 log Pk−1 We consider IBM’s stock returns as an i.i.d random sequence with a stable distribution. To use the multi-CUSUM chart to detect changes in IBM’s stock returns, we have to obtain an estimate of the true values of the four parameters in the stock return distribution. We use the data before 2001 as Phase I data to estimate the four parameters for the stable distribution. The in-control values of the four parameters are ξ0 = (α0 , β0 , γ0 , ξ0 ) = (1.5, 0.28, 0.25, 0). Then, we aim to evaluate the detection capability of the multi-CUSUM chart to detect drops in IBM stock returns starting from 2001. First, we describe the step-by-step procedures for implementing the proposed multiCUSUM scheme: 1. Obtain the in-control value of the four parameters, α, β, γ , μ, of a stable distribution. If the in-control values are unknown, estimate them from the Phase I historical data (e.g. the IBM stock returns before 2001). 2. Based on the shift magnitudes in α, β, γ , μ, determine the reference values of ᾱ1 , ᾱ2 , β̄1 , β̄2 , γ̄1 , γ̄2 , μ̄1 , μ̄2 . 3. Determine the in-control ARL and find the control limits of the multi-CUSUM charts (as described in Section 3). 4. Establish the multi-CUSUM chart and monitor the new data over time. Figure 1. IBM stock price from 1993 to 2006. 1104 D. Han et al. Table 3. Numerical results for monitoring IBM’s stock returns with ξ0 = (1.5, 0.28, 0.25, 0) and ARL0 ≈ 500. Control charts Control limit Reference values Out-of-control TM T (α) T (β) T (γ ) T (μ) lα = 9.8 α1 = 0.12 α2 = 0.12 lβ = 70.6 β1 = 0.12 β2 = 0.12 lγ = 2.4 γ1 = 0.15 γ2 = 0.15 lμ = 2.78 μ1 = 0.02 μ2 = 0.02 l = (59, 193.5, 6.7, 6.75) 234 294 54 255 60 In this illustrative case, by taking the reference values of ᾱ1 = ᾱ2 = 0.12, β̄1 = β̄2 = 0.12, γ̄1 = γ̄2 = 0.15 and μ̄1 = μ̄2 = 0.02, we compare the detection ability of four two-sided CUSUM charts, T (α1 , α2 ), T (β1 , β2 ), T (γ1 , γ2 ), T (μ1 , μ2 ) and the multi-CUSUM chart, TM , as defined by Equations (24)–(28). The control limits of each control chart are determined by achieving an in-control ARL0 of 500. Table 3 presents the numerical results for monitoring changes in IBM’s stock returns starting from 1 January, 2001. Using the CUSUM chart for γ is the quickest way to detect the changes in the underlying distribution of IBM’s stock returns. It triggers an out-of-control signal at the end of the second month in 2001, which could have given an early warning to the economic recession in US in the first three quarters of 2001. The multi-CUSUM chart also outperforms the other three single CUSUM charts by a large margin and has comparable detection capability with the the CUSUM chart for γ . Remark 4 In order to obtain the stable estimated values of the four parameters we ignore the possible changes of the parameters, during the period 1993–2001. In fact, it is feasible to estimate the distribution parameters by using data of one month. We now discuss how to define the CUSUM charts by Hill’s estimator. The Hill’s estimator −1 mJ 1 J J log X(J −i+1) − log X(J −mJ +1) α̂J = mJ i=1 J has been used by Quintos et al. [35] to estimate α, where X(i) denote the ith ordered statistic of the sample of size J , and the number mJ is the m largest observations of a sample of size J . We consider four rolling estimators α̂j k , 1 ≤ j ≤ 4 which are modifications of Hill’s estimator. −1 mj 1 Jj k Jj k α̂j k = log X(Jj k −i+1) − log X(Jj k −mj +1) mj i=1 where m1 = 12, m2 = 14, m3 = 26, m4 = 65 and J1k = {k − 11, k − 10, . . . , k − 1, k} J2k = {k − 13, k − 12, . . . , k − 1, k} J3k = {k − 25, k − 24, . . . , k − 1, k} J4k = {k − 64, k − 65, . . . , k − 1, k}. By using the four rolling estimators, we can define four CUSUM charts in the following: n ᾱ1 α̂j k − α0 − ≥ lj Tj (α) = min n : max 1≤i≤n 2 k=i Journal of Applied Statistics 1105 Table 4. Numerical results for monitoring IBM’s stock returns with α0 ≈ 1.5. Control charts Control limit Reference values T1 (α) (502) l1 = 9.5 α1 = 0.2 α2 = 0.2 T2 (α)(501) l2 = 8.8 α1 = 0.2 α2 = 0.2 T3 (α)(502) l3 = 6.3 α1 = 0.2 α2 = 0.2 T4 (α)(498) l4 = 1.6 α1 = 0.2 α2 = 0.2 280 255 290 1150 Out-of-control Table 5. Estimation of the tail index using 2020 data points from January 1993 to January 2001 and the result of detection starting from January 2001. Window size Tail length Tail index(α0 ) Control limit Change point ARL0 12 14 26 65 1.5094 1.5019 1.516 1.4871 9.5 8.8 6.3 1.6 280 255 290 1150 502 501 502 498 40 50 100 250 Table 6. Estimation of the tail index and its confidence interval using 2070 data points from January 1993 to March 2001. Tail length Tail index Confidence interval 310 2.0139 (1.79, 2.24) 200 2.6433 (2.28, 3.01) 100 3.4224 (2.75, 4.09) 50 3.6449 (2.63, 4.65) for 1 ≤ j ≤ 4. Taking m1 = 12, m2 = 14, m3 = 26, m4 = 65 can make average values of the four estimators α̂j k , 1 ≤ j ≤ 4 be equal to α01 = 1.5094, α02 = 1.5019, α03 = 1.516 and α04 = 1.4871, respectively, where the data {Xk } is the stock returns of IBM during the period 1993– 2001. The following Table 4 is the simulation results for the CUSUM chart Tj (α), 1 ≤ j ≤ 4 to monitor changes in stock returns of IBM starting from 1 January, 2001. The numbers in the parentheses in Table 4 denote the ARL0 of Tj (α), 1 ≤ j ≤ 4, respectively. Table 4 indicates that the CUSUM chart T2 (α) shows the best performance among the four charts, Tj (α), 1 ≤ j ≤ 4. As can be seen from Tables 3 and 4 that the CUSUM charts defined by rolling Hill’s estimators can compete with the CUSUM charts considered in the paper in detecting the change of the tail index α. In fact, the CUSUM charts defined by rolling Hill’s estimators may have better performance than that in Table 4, if we take different reference values for each chart, and the windows mj , 1 ≤ j ≤ 4, can vary in a suitable way with the simples. Remark 5 Table 5 in the following illustrates the Hill’s estimation of the tail index using 2020 data points from January 1993 to January 2001 of IBM’s stock returns, and the results of detecting the data starting from January 2001. Here, the tail lengths are taken to be 12, 14, 26 and 65 respectively in the four rolling estimators. As can be seen in Table 6 that the Hill’s estimations of the tail indices may be less than two for (Tail length)/(Size of simple)≈ 0.1498 = 310/2070 (Hill’s estimation of the tail index is 2.0139 in the case). 6. Discussion and conclusion Effective monitoring schemes of financial time series provide powerful tools to decision makers in the financial world. However, empirical observations show that the financial time series exhibit fat 1106 D. Han et al. tails, excessive kurtosis, some time, infinite variation, and such series cannot be easily monitored by conventional Gaussian-based SPC control schemes. On the other hand, stable distributions are a rich class of probability distributions that allow skewness and heavy tails and have extensive applications in finance and economics. We propose a stable-distribution-based control scheme using a multi-CUSUM chart. In fact, except for economics and finance, a wide range of modeling areas such as engineering, physics, astronomy, computer science, networks, and so on have been modeled through stable distributions (see [32], in which about 1300 references relative to the stable distributions are listed). We believe that the proposed stable-distribution-based multi-CUSUM scheme can be extended to a variety of different practical problems. To identify a multi-CUSUM chart for detecting changes in the four parameters, α, β, γ and μ in an i.i.d. random sequence with the stable distribution Sα (β, γ , μ), we first refined LM estimators for the four parameters, and then proved that the five estimators α̂, θ̂ , β̂, γ̂ and μ̂ not only have unbiasness and consistency in the sense of probability convergence but also need less computation than the modified ones. Based on the refined LM estimators, we construct a multi-CUSUM chart. The numerical results of the ARLs in Tables 1 and 2 illustrate that the multi-chart is superior (robust and quick) on the whole to the single CUSUM charts in detecting the shifts of the four parameters in i.i.d. random sequence with a stable distribution. Moreover, the example of monitoring changes in IBM’s stock returns also shows that the multi-CUSUM chart indeed has good detection performance. The numerical results for the multi-CUSUM charts are based on the condition that the constituent charts of the multi-CUSUM charts have a common ARL0 . It would be of interest to study if the same results still hold for the multi-CUSUM chart when its constituent charts have different ARL0 . In fact, if the change of one parameter, μ, is considered to be more important than the other one, for instance, α, then the ARL0 of the control chart for detecting μ may be chosen to be smaller than that of the control chart for detecting α, so that the change of the parameter μ can be detected more quickly. It should also be possible to find a control chart that is sensitive to changes in β. Other interesting problems that also warrant further research include how the procedures considered in the article may be modified or adapted to account for dependence in the time series, how to determine the sample size in order for the resulting control limits to have their required properties when the known parameter values in the multi-CUSUM chart are replaced by the estimated values, and how to make the multi-CUSUM chart such that it can be used efficiently over a period of time that may include several out-of-control signals. Acknowledgements The authors are grateful to the editor and the anonymous referees for their valuable comments, which have helped improve this article greatly. This research was supported by RGC Competitive Earmarked Research Grants 620707 and 620508. References [1] L.C. Alwan and H.V. Roberts, Time-series modeling for statistical process control, J. Bus. Econom. Statist. 6 (1988), pp. 87–95. [2] D.W.K. Andrews, Tests for parameter instability and structural change with unknown change point, Econometrica 61 (1993), pp. 821–856. [3] D.W. Apley and J. Shi, The GLRT for statistical process control of autocorrelated processes, IIE Trans. 31 (1999), pp. 1123–1134. [4] L. Belkacem, V.J. Lévy and C. Walter, Capm, risk and portfolio selection in alpha-stable markets, Fractals 8 (1996), pp. 99–115. [5] J. Belov, A. Kabasinskas and L. Sakalauskas, A study of stable models of stock markets, Inf. Technol. Control 35 (2006), pp. 34–56. [6] B.W. Brorsen and S.R. Yang, Maximum likelihood estimates of symmetric stable distribution parameters, Comm. Statist. Simulation Comput. 19 (1990), pp. 1459–1464. Journal of Applied Statistics 1107 [7] H. Cherno and S. Zacks, Estimating the current mean of a normal distribution which is subject to changes in time, Ann. Math. Statist. 35 (1964), pp. 999–1018. [8] R. Cont, Empirical properties of asset returns: Stylized facts and statistical issues, Quant. Finance 1 (2001), pp. 223–236. [9] V. Dragalin, The optimality of generalized CUSUM procedure in quickest detection problem, in Statistics and Control of Stochastic Processes (Proc. Steklov Inst. Math. vol. 202) (1993), pp. 132–148. [10] V. Dragalin, The design and analysis of 2-CUSUM procedure, Comm. Statist. Simulation Comput. 26 (1997), pp. 67–81. [11] J. Fajardo and E. Mordecki, Symmetry and duality in Lévy markets, Quant. Finance 6 (2006), pp. 219–227. [12] E. Fama, The behavior of stock market prices, J. Bus. 38 (1965), pp. 34–105. [13] Z. Fan, Parameter estimation of stable distributions, Commun. Statist. Theory Methods 35 (2006), pp. 245–255. [14] X. Gabaix, Power laws, in The New Palgrave Dictionary of Economics, 2nd Ed. S.N. Durlauf and L.E. Blume, eds. MacMillan, Basingstoke, UK, 2008. [15] X. Gabaix, P. Gopikrishnan, V. Plerou and H. Stanley, A theory of power-law distributions in financial market fluctuations, Nature 423 (2003), pp. 267–270. [16] D. Han and F.G. Tsung, Detection and diagnosis of unknown abrupt changes using CUSUM multi-chart schemes, Sequential Anal. 26 (2007), pp. 225–249. [17] B.E. Hansen, The new econometrics of structural change: Dating breaks in US labor productivity, J. Econ. Perspect. 15 (2001), pp. 117–128. [18] R. Ibragimov, Efficiency of linear estimators under heavy-tailedness: Convolutions of alpha-symmetric distributions, Econometric Theory 23 (2007), pp. 501–517. [19] R. Ibragimov, D. Jaffee and J. Walden, Non-diversification traps in markets for catastrophic risks, Rev. Financ. Stud. (2008). Available at http://rfs.oxfordjournals.org/cgi/content/full/hhn021v1. [20] W. Jiang, K.L. Tsui and W.H. Woodall, A new SPC monitoring method: The ARMA chart, Technometrics 42 (2000), pp. 399–416. [21] W. Jiang, H. Wu, F. Tsung, V.N. Nair and K.L. Tsui, Proportional integral derivative charts for process monitoring, Technometrics 44 (2002), pp. 205–214. [22] A.S. Kapadia, W. Chan and L. Moyé, Mathematical Statistics with Applications, Chapman Hall/CRC Press, Boca Raton, FL, 2005. [23] E.E. Kuruoðlu, Density parameter estimation of skewed α-stable distributions, IEEE Trans. Signal Process. 49 (2001), pp. 2192–2201. [24] B. LeBaron, Robust properties of stock return tails, Working Paper, Brandeis University 2008. Available at http://people.brandeis.edu/ blebaron/wps/robusttail.pdf. [25] G. Lorden and I. Eisenberger, Detection of failure rate increases, Technometrics 15 (1973), pp. 167–175. [26] M. Loretan and P.C.B. Phillips, Testing the covariance stationarity of heavy-tailed time series, J. Empir. Finance 1 (1994), pp. 211–248. [27] X.Y. Ma and C.L. Nikias, Parameters estimation and blind channel identification in impulsive signal environments, IEEE Trans. Signal Process. 43 (1995), pp. 2884–2897. [28] B.J. Mandel, The regression control chart. J. Qual. Technol. 1 (1969), pp. 1–9. [29] B. Mandelbrot, The Pareto–Lévy law and the distribution of income, Internat. Econom. Rev. 1 (1960), pp. 79–106. [30] B. Mandelbrot, The variation of certain speculative prices, J. Bus. 26 (1963), pp. 394–419. [31] J.P. Nolan, Maximum likelihood estimation and diagnostics for stable distributions, in Lévy Processes: Theory and Applications, O.E. Barndorff-Nielsen, T. Mikosch and S.I. Resnick, eds., Birkhauser, Boston, 2001, pp. 379–400. [32] J.P. Nolan, Bibliography on Stable Distributions, Processes and Related Topics, 2005. Available at http://www.academic2.american.edu∼jpnolan. [33] E.S. Page, Continuous inspection schemes, Biometrika 41 (1954), pp. 100–115. [34] P. Perron, The great crash, the oil price shock and the unit root hypothesis, Econometrica 57 (1989), pp. 1361–1401. [35] C. Quintos, Z.H. Fan and P.C.B. Phillips, Structural change tests in tail behaviour and the Asian crisis, Rev Econom. Stud. 68 (2001), pp. 633–663. [36] S. Rachev, C. Menn and F. Fabozzi, Fat-Tailed and Skewed Asset Return Distributions: Implications for Risk Management, Portfolio selection, and Option Pricing, Wiley, Hoboken, NJ, 2005. [37] S. Rachev and S. Mittnik, Stable Paretian Models in Finance, Wiley, Chichester; New York, 2000. [38] S. Rachev, E. Schwartz and I. Khindanova, Stable Modeling of Market and Credit Value at Risk, in Handbook of Heavy Tailed Distributions in Finance, S. Rachev, ed., Elsevier, North-Holland, 2003, 249–328. [39] S.W. Roberts, Control chart tests based on geometric moving averages, Technometrics 1 (1959), pp. 239–250. [40] S.W. Roberts, A comparison of some control chart procedures, Technometrics 8 (1966), pp. 411–430. [41] G. Samorodnitsky and M.S. Taqqu, Stable Non-Gaussian Random Processes, Chapman & Hall, New York, 1994. [42] P. Samuelson, Efficient portfolio selection for Pareto–Levy investments, J. Finan. Quant. Anal. 2 (1967), pp. 107–117. [43] W. Shewhart, Economic Control of Quality of Manufactured Product, Van Nostrand, Princeton, 1931. [44] A.N. Shiryayev, On optimum methods in quickest detection problems, Theory Probab. Appl. 13 (1963), pp. 22–46. 1108 D. Han et al. [45] D. Siegmund and E.S. Venkatraman, Using the generalized likelihood ratio statistic for sequential detection of a change-point, Ann. Statist. 23 (1995), pp. 255–271. [46] G. Taguchi, Taguchi Methods: Research and Development, American Supplier Institute, Dearborn, Michigan, 1992. [47] V.V. Uchaikin and V.M. Zolotarev, Chance and Stability: Stable Distributions and Their Applications, VSP, Utrecht, The Netherlands, 1999. [48] S.A. Vander Wiel, Monitoring processes that wander using integrated moving average models, Technometrics 38 (1996), pp. 139–151. [49] V.M. Zolotarev, One-Dimensional Stable Distributions, American Mathematical Society, Provindence, R.I., 1986. APPENDIX 1. Proofs of Proposition 1 and Proposition 2 Proof of Proposition 1. By Equations (5) and (12), we can check that the statistic 6Dn /π 2 − 1/2 is a unbiased estimator of the parameter 1/α 2 . It is known that [22, P.105] Var(Dn ) = 1 2 (E[log X1L − E(log X1L )]4 − [Var(log X1L )]2 ) + [Var(log X1L )]2 . n n(n − 1) As n → ∞, we have E 6Dn 1 1 − − 2 π2 2 α 2 = 36 Var(Dn ) = O π4 1 −→ 0. n (A1) This means that the statistic 6Dn /π 2 − 1/2 is a consistent unbiased estimator of the parameter 1/α 2 . Note that 1/4 < 1/α 2 < 1 or 1/α 2 > 1 and 0 < b < 1/9. Let amax = max{|1 − n−a − 1/α 2 |, |1 + n−a − 1/α 2 |}, amin = min{|1 − n−a − 1/α 2 |, |1 + n−a − 1/α 2 |}. Obviously, amax → |1 − 1/α 2 |, amin → |1 − 1/α 2 | as n → ∞. It follows from the definition of α̂ˆ that (Dn (a, b))−1 ≤ 1/b and 6Dn 1 2 a2 1 1 2 E Dn (a, b) − 2 E ≤ 1 + max − − 2 α π2 2 α2 amin (A2) for large n, since, for large n, 1 Dn (a, b) − 2 α 2 a2 ≤ max 2 amin 6Dn 1 1 − − 2 π2 2 α 2 for 1 − n−a ≤ 6Dn /π 2 − 1/2 ≤ 1 + n−a and 1 Dn (a, b) − 2 α 2 ≤ 6Dn 1 1 − − 2 π2 2 α 2 for 1/4 − n−a ≤ 6Dn /π 2 − 1/2 ≤ 1/4 + n−a . Thus, by Equations (A1) and (A2), the statistic α̂ˆ is a consistent, asymptotically unbiased estimator of the parameter α since ! 1 (Dn (a, b) − 1/α 2 )2 1 2 2 2 ˆ E(α̂ − α) = α E −→ 0 = O = O E D (a, b)− √ n 2 2 α n Dn (a, b)( Dn (a, b) + 1/α) as n → ∞. Journal of Applied Statistics 1109 By the definition of θ̂ˆ , we have 2 " ˆ n π α̂A π παA ˆ 2 ˆ n − 1| ≤ n−a E(θ̂ − θ ) ≤ E + (1 + n−a + |αA|)2 P |α̂A − 2 2 2 # ˆ n + 1| ≤ n−a +P |α̂A ≤ # π π2 " E(An )2 E(α̂ˆ − α)2 + α 2 E(An − A)2 + (2 + |αA|)2 4 2 " # −a ˆ ˆ × P |α̂An − 1| ≤ n + P |α̂An + 1| ≤ n−a . Note that −1 < αA < 1. It follows from Chebyshev’s inequality that ˆ n − 1| ≤ n−a = P 1 − αA − n−a ≤ An (α̂ˆ − α) + α(An − A) ≤ 1 − αA + n−a P |α̂A ≤ P |An (α̂ˆ − α)| ≥ (1 − αA − n−a )/2 (1 − αA − n−a ) + P |α(An − A)| ≥ 2 2 {[E(An )2 E(α̂ˆ − α)2 ]1/2 + α[E(An − A)2 ]1/2 } |1 − αA − n−a | for n ≥ n0 , where the number n0 satisfies | ± 1 − αA − n−a 0 | > 0. Similarly, we have 2 ˆ n + 1| ≤ n−a ≤ P |α̂A {[E(An )2 E(α̂ˆ − α)2 ]1/2 + α[E(An − A)2 ]1/2 } | − 1 − αA − n−a | for n ≥ n0 . Since E(An )2 ≤ 2E(An − A)2 + 2A2 and, as in Equation (A1), the statistic An is a consistent unbiased estimator of the parameter A, it follows that the statistic θ̂ˆ is a consistent ≤ asymptotically unbiased estimator of the parameter θ . ˆ ≤ c na and |(2 − Note that there exist two positive numbers c1 and c2 such that | tan(θ̂)| 1 −1 a ˆ 2 ) tan((π α̂)/2)| ≤ c2 n . By using the following facts 1 1 x y x y |Z − Z | ≤ max{Z , Z }|x − y|, | tan(x) − tan(y)| ≤ max , |x − y|, cos(x) cos(y) (A3) where Z > 1, we can check that there is a positive constant c that depends on c1 and c2 such that ⎛ ⎞2 ˆ) α̂ˆ α tan( θ̂ tan(θ ) 2 + 2 2 + 2 ⎠ (β̂ˆ − β)2 = ⎝ − ˆ 2 − 2α tan(π α/2) 2 − 2α̂ˆ tan(π α̂/2) α̂ˆ ≤ cn6a (α̂ˆ − α)2 . √ Since a < b/2, that is, 6a < 1, it follows that E(β̂ˆ − β)2 ≤ cn6a E(α̂ˆ − α)2 = O n6a n −→ 0 as n → ∞. This means β̂ˆ is a consistent, asymptotically unbiased estimator of the parameter β. Note that √ a exp{(Bn (a) − λ0 )α̂ˆ + λ0 } ≤ exp √ log n + λ0 ≤ eλ0 na/ b b √ and 2a/ b < 1. We can similarly show that γ̂ˆ is a consistent, asymptotically unbiased estimator of the parameter γ . 1110 D. Han et al. From Theorem 9.5.1. in Uchaikin and Zolotarev [47], it follows that the statistic E(n) is a consistent, unbiased estimator of the parameter μ̃ = μ(2 − 21/α ). Thus, the statistic μ̂ˆ is a consistent, asymptotically unbiased estimator of the parameter μ since E(μ̂ˆ − μ)2 ≤ O(n2a E(α̂ˆ − α)2 ) + O(n2a E(E(n) − μ̃)2 ) = O n2a n −→ 0. as n → ∞. This completes the proof. Proof of Proposition 2 By the definition of α̂ˆ we have 6Dn 1 1 1 P |α̂ˆ − α̂| ≥ = P √ −$ ≥ , 2 − − 1 ≤ n−a 1 ± n−a π 2 6Dn /π 2 − 1/2 6Dn 1 1 1 1 + P $ −$ ≥ , 2 − − ≤ n−a 1/4 ± n−a π 2 4 6Dn /π 2 − 1/2 1 6Dn 1 1 + P √ − $ − <b . ≥ , 2 2 b π 2 6Dn /π − 1/2 Obviously, 1 1 lim P √ −$ ≥ , n→∞ 1 ± n−a 6Dn /π 2 − 1/2 6Dn 1 −a =0 π 2 − 2 − 1 ≤ n and 1 1 lim P $ −$ ≥ , n→∞ 1/4 ± n−a 6Dn /π 2 − 1/2 6Dn 1 1 −a = 0. π2 − 2 − 4 ≤ n For the third term, we have 1 1 P √ − $ ≥ , b 6Dn /π 2 − 1/2 6Dn 1 1 1 1 6Dn > − − − b . < b ≤ P − π2 π2 2 2 α2 α2 Note that 1/α 2 − b > 0. By Chebyshev’s inequality and (A1), it follows that 2 6Dn 1 1 1 1 1 6D 1 n P 2 − − 2 > 2 − b ≤ O E = O − −→ 0, − 2 2 π 2 α α π 2 α n as n → ∞. Thus, P(|α̂ˆ − α̂| ≥ ) → 0 as n → ∞. Journal of Applied Statistics 1111 For θ̂ˆ , θ̂, we have πAn ˆ −a −a ˆ ˆ ˆ P |θ̂ − θ̂ | ≥ = P (α̂ − α̂) ≥ , |α̂An | < 1 − n , or |α̂An | > 1 + n 2 π ˆ n − 1| ≤ n−a + P (1 ± n−a − α̂An ) ≥ , |α̂A 2 π ˆ n + 1| ≤ n−a + P (1 ± n−a + α̂An ) ≥ , |α̂A 2 π ≤P |An ||α̂ˆ − α̂)| ≥ 2 π ˆ n | ≥ , |α̂A ˆ n − 1| < n−a +P |1 ± n−a − α̂A 2 2 π ˆ +P |An ||α̂ − α̂)| ≥ 2 2 π −a ˆ ˆ n + 1| < n−a +P |1 ± n + α̂An | ≥ , |α̂A 2 2 π ˆ +P |An ||α̂ − α̂)| ≥ . 2 2 Since π ˆ n | ≥ , |α̂A ˆ n − 1| < n−a = 0, lim P |1 ± n−a − α̂A n→∞ 2 2 π ˆ −a −a ˆ lim P =0 |1 ± n + α̂An | ≥ , |α̂A n + 1| < n n→∞ 2 2 and π π P |An ||α̂ˆ − α̂)| ≥ ≤ [E(A2n )E(α̂ˆ − α̂)2 ]1/2 , 2 2 it follows that P(|θ̂ˆ − θ̂ | ≥ ) → 0 as n → ∞. By using Equation (A3) and the fact that both Bn and E(n) are consistent, unbiased estimators of the parameters B and μ̃, respectively, we can similarly prove that P |β̂ˆ − β̂| ≥ −→ 0, P |γ̂ˆ − γ̂ | ≥ −→ 0, P |μ̂ˆ − μ̂| ≥ −→ 0 as n → ∞. Copyright of Journal of Applied Statistics is the property of Routledge and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use.
© Copyright 2026 Paperzz