School on Data Assimilation Stockholm, 26 - 30 April 2011 Ensemble Kalman filter: main schemes Pavel Sakov Nansen Environmental and Remote Sensing Center, Norway NERSC Introduction. Why EnKF? Problems with EKF I I I I Non-scalable: P is an n × n matrix Can be numerically inconsistent: positive definiteness of P Requires linearisation → prone to instability Purely sequential; can not assimilate asynchronous observations. Ensemble Kalman filter (EnKF) EnKF: Ensemble E of model states carries both the state mean x and the state error covariance P. EnKF: role of the ensemble (propagation) EnKF: workflow (update) (observations) x (analysis) E xf xa Af Aa Ef A Ea (propagation) P (model) Advantages and disadvantages of EnKF Compared to KF and EKF: Advantages: I Implicit propagation of the state error covariance to suitable for large-scale problems I Does not require model linearisation I Involves approximation of sensitivities → robust I Makes it possible to treat rank problems by localisation I Makes it possible to assimilate asynchronous observations Disadvantages: I None Compared to 3D-Var and 4D-Var: Advantages: I Simplicity (does not require TLM or AM) I (Does not require adjoint) I Maintains flow-dependent covariance between DA cycles Disadvantages: I A linear method I Possible dynamic imbalances and suboptimalities due to localisation Why does one need a dynamic covariance? Some conventions n m p (superscripts) f , a En×m An×m Pn×n x y Rp×p Hp×n Kn×p T − − − − − − − − − − − − − state vector size ensemble size number of observations for forecast/analysis ensemble ensemble anomalies, A = E − x 1T state error covariance 1 E1 ensemble mean, x = m observations observation error covariance (assumed diagonal) observation matrix Kalman gain, K = PHT (HPHT + R)−1 an ensemble transform matrix (can be right- or left-multiplied) Two kinds of EnKF Kalman covariance update: Pa = (I − KH)Pf Traditional EnKF: ESRF: (1) Stochastically approximates (1) Exactly satisfies (1) Some properties of the traditional EnKF I I The approximation error ∼ O(m−1/2 ) Tends to underestimate Pa Some properties of the ESRF: I For a linear perfect-model systems the ESRF is equivalent to KF I For such a system it is sufficient to have an ensemble of size m = dimension of model subspace + 1. I Any analysis scheme that satisfies (1) is an ESRF. Two kinds of EnKF: an example RMSE versus ensemble size 0.6 1.2 ETKF, symmetric ETKF, symmetric EnKF 0.4 RMSE RMSE 0.8 0.6 0.3 0.4 0.2 0.2 0.1 0 EnKF 0.5 best fit 1 20 40 60 80 ensemble size 100 120 Figure: Linear advection model experiment 0 0 10 20 30 40 50 60 ensemble size 70 80 Figure: Lorentz-40 model 90 Ensemble square root filter (ESRF) ESRF update: E → x, A xa = xf + K(y − Hxf ) Aa = Af Tm×m T: or Aa = Tn×n Af , Pa = (I − KH)Pf , Af Aa T = Right multiplied ESRF Aa 1 = 0 Aa T = Left multiplied ESRF Af Matrix square root I There are two non-compatible definitions of matrix square root. Definition 1 (eng.): B is called a square root of A if A = B(B)∗ . Definition 2 (math.): B is called a square root of A if A = BB. I Generally, matrix square root is non-unique. I (A)1/2 denotes a specific unique square root of A Definition: If A is symmetric positive (semi)definite then (A)1/2 denotes the unique (symmetric) positive (semi)definite square root of A. A = UL(U)T , (A)1/2 = U(L)1/2 (U)T . Definition: If A has eigenvalue decomposition A = VL(V)−1 with positive eigenvalues only, (L)ii > 0 then (A)1/2 = V(L)1/2 (V)−1 . A short derivation From now on An×m − so that P = AAT 1 (E − x 1T ), scaled ensemble anomalies, A = √ m−1 A short derivation Pa = (I − KH)Pf = Pf (I − KH)T Pa = (I − KH)1/2 Pf (I − KH)T/2 a a T 1/2 f f T (Appendix 1) T/2 A (A ) = (I − KH) A (A ) (I − KH) h ih iT = (I − KH)1/2 Af (I − KH)1/2 Af • Aa = (I − KH)1/2 Af Derivation (continued) • Aa = (I − KH)1/2 Af h i−1 1/2 a f f T f T A = I − A (HA ) HP (H) + R H Af ”Matrix shift lemma” F(AB) A = A F(BA) → h i1/2 • Aa = Af I − (HAf )T (HPf (H)T + R)−1 (HAf ) (Evensen, 2004) Matrix inversion lemma → • Aa = [I + Pf HT (R)−1 H]−1/2 Af h i−1/2 • Aa = Af I + (HAf )T (R)−1 (HAf ) (ETKF, Bishop et al. 2001) Four forms of the “symmetric” solution (I − KH)1/2 Af O o MSL MIL [I + Pf (H)T (R)−1 H]−1/2 Af / Af I − (HAf )T (M−1 )(HAf )1/2 O MIL o where: MIL is matrix inversion lemma, MSL is matrix shift lemma, and M ≡ HP(H)T + R MSL / Af I + (HAf )T (R)−1 (HAf )−1/2 Solution space h i−1/2 Tsymm = I + (HAf )T (R)−1 (HAf ) Other solutions: I Let A be ensemble anomalies that factorise a given covariance P: P = A(A)T I Then AU, where U is an arbitrary orthonormal matrix, U(U)T = I, also factorises P: AU(AU)T = A[UUT )](A)T = A(A)T I To be a valid solution for the ESRF, AU also needs to be zero-centred: AU1 = 0 All solutions: T = Tsymm U , U : U(U)T = I, U1 = 1 Effect of random rotations: two examples With L3 model With L40 model −2.7 −4.0425 −2.75 x2 x2 −4.043 −4.0435 −2.8 −4.044 −4.0445 −10.951 −2.85 −10.95 x −10.949 −2.77 −2.76 −2.75 −2.74 x 1 1 As above, rotated As above, rotated −2.7 −4.0425 −2.75 x2 x2 −4.043 −4.0435 −2.8 −4.044 −4.0445 −10.951 −2.85 −10.95 x 1 −10.949 ensemble ensemble mean true field −2.77 −2.76 −2.75 −2.74 x 1 Traditional (stochastic, perturbed observations) EnKF Kalman analysis equation: xa = xf + K(y − Hxf ), Evensen (1994): h i−1 K = Pf (H)T HPf (H)T + R h i (Ea )i = (Ef )i + K y − H(Ef )i Burgers et al. (1998) (also Houtekamer and Mitchell 1998): h i (Ea )i = (Ef )i + K y + (D)i − H(Ef )i , or xa = xf + K(y − Hxf ) Aa = Af + K(D − HAf ). D: 1 D(D)T = R m−1 D1 = 0 → Pa = (I − KH)Pf + O(m−1/2 ) Some other schemes • “Double EnKF” (“DEnKF”) by Houtekamer and Mitchell (1998) Runs two ensembles in parallel; uses covariance from other ensemble to update this ensemble I an attempt to improve performance of the stochastic EnKF for small ensembles • Singular evolutive interpolated Kalman filter (“SEIK”) by Pham (2001) • Ensemble adjustment Kalman filter (“EAKF”) by Anderson (2001) • Serial ESRF (“EnSRF”) by Whitaker and Hamill (2002) • Maximum likelihood ensemble filter (“MLEF”) by Zupanski et al. (e.g. Zupanski et al., 2008) • “Deterministic EnKF” (“DEnKF”) by Sakov and Oke (2008a) • Ensemble Randomised maximum likelihood filter (“EnRML”) by Gu and Oliver (2007) • Finite-size EnKF (“EnKF-N”) by Bocquet (2011) • Other iterative schemes Some other schemes • “Double EnKF” (“DEnKF”) by Houtekamer and Mitchell (1998) • Singular evolutive interpolated Kalman filter (“SEIK”) by Pham (2001) I an ESRF • Ensemble adjustment Kalman filter (“EAKF”) by Anderson (2001) • Serial ESRF (“EnSRF”) by Whitaker and Hamill (2002) • Maximum likelihood ensemble filter (“MLEF”) by Zupanski et al. (e.g. Zupanski et al., 2008) • “Deterministic EnKF” (“DEnKF”) by Sakov and Oke (2008a) • Ensemble Randomised maximum likelihood filter (“EnRML”) by Gu and Oliver (2007) • Finite-size EnKF (“EnKF-N”) by Bocquet (2011) • Other iterative schemes Some other schemes • “Double EnKF” (“DEnKF”) by Houtekamer and Mitchell (1998) • Singular evolutive interpolated Kalman filter (“SEIK”) by Pham (2001) • Ensemble adjustment Kalman filter (“EAKF”) by Anderson (2001) I an ESRF • Serial ESRF (“EnSRF”) by Whitaker and Hamill (2002) • Maximum likelihood ensemble filter (“MLEF”) by Zupanski et al. (e.g. Zupanski et al., 2008) • “Deterministic EnKF” (“DEnKF”) by Sakov and Oke (2008a) • Ensemble Randomised maximum likelihood filter (“EnRML”) by Gu and Oliver (2007) • Finite-size EnKF (“EnKF-N”) by Bocquet (2011) • Other iterative schemes Some other schemes • “Double EnKF” (“DEnKF”) by Houtekamer and Mitchell (1998) • Singular evolutive interpolated Kalman filter (“SEIK”) by Pham (2001) • Ensemble adjustment Kalman filter (“EAKF”) by Anderson (2001) • Serial ESRF (“EnSRF”) by Whitaker and Hamill (2002) Potter scheme: K̃ = αK, " α= 1+ R HPf HT + R 1/2 #−1 I requires ensemble update after assimilating each observations I makes it possible to use covariance localisation • Maximum likelihood ensemble filter (“MLEF”) by Zupanski et al. (e.g. Zupanski et al., 2008) • “Deterministic EnKF” (“DEnKF”) by Sakov and Oke (2008a) • Ensemble Randomised maximum likelihood filter (“EnRML”) by Gu and Oliver (2007) • Finite-size EnKF (“EnKF-N”) by Bocquet (2011) • Other iterative schemes Some other schemes • “Double EnKF” (“DEnKF”) by Houtekamer and Mitchell (1998) • Singular evolutive interpolated Kalman filter (“SEIK”) by Pham (2001) • Ensemble adjustment Kalman filter (“EAKF”) by Anderson (2001) • Serial ESRF (“EnSRF”) by Whitaker and Hamill (2002) • Maximum likelihood ensemble filter (“MLEF”) by Zupanski et al. (e.g. Zupanski et al., 2008) I use of gradient method in the analysis for handling nonlinear observations I for linear case is equivalent to ESRF I does not handle strongly nonlinear model • “Deterministic EnKF” (“DEnKF”) by Sakov and Oke (2008a) • Ensemble Randomised maximum likelihood filter (“EnRML”) by Gu and Oliver (2007) • Finite-size EnKF (“EnKF-N”) by Bocquet (2011) • Other iterative schemes Some other schemes • “Double EnKF” (“DEnKF”) by Houtekamer and Mitchell (1998) • Singular evolutive interpolated Kalman filter (“SEIK”) by Pham (2001) • Ensemble adjustment Kalman filter (“EAKF”) by Anderson (2001) • Serial ESRF (“EnSRF”) by Whitaker and Hamill (2002) • Maximum likelihood ensemble filter (“MLEF”) by Zupanski et al. (e.g. Zupanski et al., 2008) • “Deterministic EnKF” (“DEnKF”) by Sakov and Oke (2008a) I a robust approximation to the ESRF in the case of small corrections • Ensemble Randomised maximum likelihood filter (“EnRML”) by Gu and Oliver (2007) • Finite-size EnKF (“EnKF-N”) by Bocquet (2011) • Other iterative schemes Some other schemes • “Double EnKF” (“DEnKF”) by Houtekamer and Mitchell (1998) • Singular evolutive interpolated Kalman filter (“SEIK”) by Pham (2001) • Ensemble adjustment Kalman filter (“EAKF”) by Anderson (2001) • Serial ESRF (“EnSRF”) by Whitaker and Hamill (2002) • Maximum likelihood ensemble filter (“MLEF”) by Zupanski et al. (e.g. Zupanski et al., 2008) • “Deterministic EnKF” (“DEnKF”) by Sakov and Oke (2008a) • Ensemble Randomised maximum likelihood filter (“EnRML”) by Gu and Oliver (2007) I Gauss-Newton minimisation in stochastic framework I handles nonlinearity of both model and observations • Finite-size EnKF (“EnKF-N”) by Bocquet (2011) • Other iterative schemes Some other schemes • “Double EnKF” (“DEnKF”) by Houtekamer and Mitchell (1998) • Singular evolutive interpolated Kalman filter (“SEIK”) by Pham (2001) • Ensemble adjustment Kalman filter (“EAKF”) by Anderson (2001) • Serial ESRF (“EnSRF”) by Whitaker and Hamill (2002) • Maximum likelihood ensemble filter (“MLEF”) by Zupanski et al. (e.g. Zupanski et al., 2008) • “Deterministic EnKF” (“DEnKF”) by Sakov and Oke (2008a) • Ensemble Randomised maximum likelihood filter (“EnRML”) by Gu and Oliver (2007) • Finite-size EnKF (“EnKF-N”) by Bocquet (2011) I derived by Bayesian rule using some specific assumptions about the prior distribution I does not require inflation • Other iterative schemes Some other schemes • “Double EnKF” (“DEnKF”) by Houtekamer and Mitchell (1998) • Singular evolutive interpolated Kalman filter (“SEIK”) by Pham (2001) • Ensemble adjustment Kalman filter (“EAKF”) by Anderson (2001) • Serial ESRF (“EnSRF”) by Whitaker and Hamill (2002) • Maximum likelihood ensemble filter (“MLEF”) by Zupanski et al. (e.g. Zupanski et al., 2008) • “Deterministic EnKF” (“DEnKF”) by Sakov and Oke (2008a) • Ensemble Randomised maximum likelihood filter (“EnRML”) by Gu and Oliver (2007) • Finite-size EnKF (“EnKF-N”) by Bocquet (2011) • Other iterative schemes EnKF: resume ESRF: xa = xf + K(y − Hxf ), Aa = Af T, Traditional EnKF: −1 K = Pf (H)T HPf (H)T + R −1/2 T = I + (HA)T R−1 (HA) xa = xf + K(y − Hxf ) Aa = Af + K(D − HAf ) Overall: h i K = Af (HAf )T (HAf )(HAf )T + R xa = xf + Af w Aa = Af T for any scheme or → Ea = Ef X5 (Evensen, 2003) EnKF: summary I The EnKF represents a state space formulation of the KF I The traditional (perturbed observations, stochastic) EnKF treats ensemble members more like particles, while the ESRF treats the ensemble as a whole unit. I There are a number of schemes in regard to the update of the ensemble anomalies; the ensemble mean update is scheme independent. I For a linear perfect-model system the ESRF is fully equivalent to KF; while the traditional EnKF is a Monte-Carlo approximation of the KF. I The ESRF generally yields a better performance than the traditional EnKF. I The analysed ensemble is a linear transform of the forecast ensemble, Ea = Ef X5 . I The ensemble transform matrix X5 only depends on the ensemble observations HEf ; it does not depend on the state values in unobserved elements. I There is no demonstrated difference in performance of different ESRF schemes. I By rescaling the ESRF can be turned into a (derivative-less) state space formulation of the EKF. References Anderson, J. L., 2001: An ensemble adjustment Kalman filter for data assimilation. Mon. Wea. Rev., 129, 2884–2903. Bierman, G. J., 1977: Factorization Methods for Discrete Sequential Estimation. Academic Press, 241 pp. Bishop, C. H., B. Etherton, and S. J. Majumdar, 2001: Adaptive sampling with the ensemble transform Kalman filter. part I: theoretical aspects. Mon. Wea. Rev., 129, 420–436. Bocquet, M., 2011: Ensemble Kalman filtering without the intrinsic need for inflation. Nonlinear Proc. Geoph., 18, 735–750. Burgers, G., P. J. van Leeuwen, and G. Evensen, 1998: Analysis scheme in the ensemble Kalman filter. Mon. Wea. Rev., 126, 1719–1724. Evensen, G., 1994: Sequential data assimilation with a nonlinear quasi-geostrophic model using Monte-Carlo methods to forecast error statistics. J. Geophys. Res., 99, 10143–10162. — 2003: The Ensemble Kalman Filter: theoretical formulation and practical implementation. Ocean Dynamics, 53, 343–367. — 2004: Sampling strategies and square root analysis schemes for the EnKF. Ocean Dynamics, 54, 539–560. Gu, Y. and D. S. Oliver, 2007: An iterative ensemble Kalman filter for multiphase fluid flow data assimilation. SPE Journal, 12, 438–446. Houtekamer, P. L. and H. L. Mitchell, 1998: Data assimilation using an ensemble Kalman filter technique. Mon. Wea. Rev., 126, 796–811. Ott, E., B. R. Hunt, I. Szunyogh, A. V. Zimin, E. J. Kostelich, M. Corazza, E. Kalnay, D. J. Patil, and J. A. Yorke, 2003, rev. 2005: A local ensemble Kalman filter for atmospheric data assimilation. http://arxiv.org/abs/physics/0203058 . Pham, D. T., 2001: Stochastic methods for sequential data assmilation in strongly nonlinear systems. Mon. Wea. Rev., 129, 1194–1207. Sakov, P. and P. R. Oke, 2008a: A deterministic formulation of the ensemble Kalman filter: an alternative to ensemble square root filters. Tellus, 60A, 361–371. — 2008b: Implications of the form of the ensemble transformations in the ensemble square root filters. Mon. Wea. Rev., 136, 1042–1053. Whitaker, J. S. and T. M. Hamill, 2002: Ensemble data assimilation without perturbed observations. Mon. Wea. Rev., 130, 1913–1924. Zupanski, M., I. M. Navon, and D. Zupanski, 2008: The Maximum Likelihood Ensemble Filter as a non-differentiable minimization algorithm. Q. J. R. Meteorol. Soc., 134, 1039–1050. Appendix 1: To the ESRF derivation This appendix shows that (I − KH)P = (I − KH)1/2 P(I − KH)1/2 : h i (I − KH)P = I − PHT (HPHT + R)−1 H P h i1/2 h i1/2 = I − PHT (HPHT + R)−1 H I − PHT (HPHT + R)−1 H P h i1/2 h i1/2 = I − PHT (HPHT + R)−1 H P I − HT (HPHT + R)−1 HP h i1/2 h iT/2 = I − PHT (HPHT + R)−1 H P I − PHT (HPHT + R)−1 H = (I − KH)1/2 P (I − KH)T/2 Appendix 2: Optimality of ESRF solutions Ott et al. (2003, rev. 2005) I Right-multiplied symmetric ETM is a minimal distance solution in state space with a norm P−1 T0 = arg min T:Pa =(I−KH)Pf T0 = Tsymm I i h trace (Aa − Af )T (Pf ,a )−1 (Aa − Af ) Left-multiplied symmetric ETM is a minimal distance solution in state space with a norm I. T1 = arg h i trace (Aa − Af )T (Aa − Af ) h i1/2 (Pf )−1/2 = (Pf )−1/2 (Pf )1/2 Pa (Pf )1/2 min T:Pa =(I−KH)Pf T1 = T(l) symm Appendix 3: Some massaging of the update schemes Let us introduce √ s = (R)−1/2 (y − Hxf )/ m − 1 (standardised innovation) √ −1/2 f S ≡ (R) HA / m − 1 (standardised ensemble observation anomalies) Then xa − xf ≡ δx = Af G s, Aa − Af ≡ δA = Af T where (dropping brackets around (S)T ) G = (I + ST S)−1 ST = ST (I + SST )−1 EnKF: ETKF: DEnKF: T = G(D − S) T = (I + ST S)−1/2 − I 1 T = − GS 2 Appendix 4: DEnKF First way Aa = (I − KH)1/2 Af → Aa = (I − 1 KH)Af , 2 kKHk kIk Second way h i (Ea )i = (Ef )i + K d − H(Ef )i , i = 1, . . . , m ↓ (after subtracting ensemble mean) a A = (I − KH)Af ↓ Pa = 1 Aa (Aa )T = (I − 2KH)Pf + (KH)Pf (KH)T m−1 Third way K̃ = αK, α = ( R 1+ HPf (H)T + R 1/2 )−1 → α≈ 1 , 2 kHPf (H)T k kRk Some properties of the DEnKF DEnKF: I performance-wise is similar, often indistinguishable from ESRF I relates ESRF and EnKF; inherits some features from both (ESRF) (I − KH)1/2 Af a A = (I − 21 KH)Af (DEnKF) (I − KH)Af + KD (EnKF) I robust in extreme situations 1 Pa = (I − KH)Pf + KHPf (KH)T 4 I requires smaller inflation than the ESRF I has a simple form that allows easy numerical or analytical treatment and enhancement, as well as easy adaptation from existing EnKF systems
© Copyright 2026 Paperzz