SPACE WEATHER, VOL. 10, S07004, doi:10.1029/2012SW000777, 2012 Ionosphere state and parameter estimation using the Ensemble Square Root Filter and the global three-dimensional first-principle model D. V. Solomentsev,1 B. V. Khattatov,2 M. V. Codrescu,3 A. A. Titov,1 V. Yudin,4 and V. U. Khattatov1 Received 14 February 2012; revised 4 June 2012; accepted 7 June 2012; published 27 July 2012. [1] In the present paper we discuss the setup and the results of series of numerical experiments aiming to recover the ~ E ~ B plasma drift and neutral wind velocities using the Ensemble Square Root Filter together with the ionospheric numerical model. One of the objectives of the current research was assessing the performance of the upper atmosphere state and parameter ensemble estimation technique in the framework of the Observational System Simulation Experiment (OSSE). The other purpose was to improve calculation accuracy for the major driving forces in the ionosphere and to increase modeling reliability in real-data operational cases. In the current paper we describe the setup of the modeling system used to obtain the presented results. In the first section we introduce the background physics-based model used in the simulations and discuss its main assumptions along with ~ E ~ B drift and the neutral wind velocity calculation algorithms. Further we present the observations simulation system and describe the data used for assimilation and parameter estimation. We also provide a brief description of the Ensemble Square Root Filter and its application in the current study. In the last few sections the results of the numerical experiments are presented and discussed. Citation: Solomentsev, D. V., B. V. Khattatov, M. V. Codrescu, A. A. Titov, V. Yudin, and V. U. Khattatov (2012), Ionosphere state and parameter estimation using the Ensemble Square Root Filter and the global three-dimensional first-principle model, Space Weather, 10, S07004, doi:10.1029/2012SW000777. 1. Introduction [2] During the past decade, extensive research has been performed by the community in the field of ionospheric model development. Because of the wide variety of factors which influence ion and electron distribution in the ionosphere, pure theoretical numerical models are not able to provide a good estimate of the current ionospheric state or come up with a short-term forecast, as pointed out by Fuller-Rowell et al. [1996]. On the other hand, the empirical models (such as International Reference Ionosphere (IRI) introduced by Bilitza [2001]), even describing most of the probable conditions, cannot be relied on in high-precision applications, because of the finite data samples, which 1 Central Aerological Observatory, Dolgoprudny, Russia. Fusion Numerics International LLC, Boulder, Colorado, USA. NOAA, Boulder, Colorado, USA. 4 National Center for Atmospheric Research, Boulder, Colorado, USA. 2 3 Corresponding author: D. V. Solomentsev, Central Aerological Observatory, 141700, Pervomayskaya str., 3, Dolgoprudny, Russia. ([email protected]) ©2012. American Geophysical Union. All Rights Reserved. they originate from. As an example, a work by Luhr and Xiong [2010] revealed large discrepancies between the IRI model and experimental data. Due to these circumstances, data assimilation techniques are widely used to combine numerical modeling with current state measurements. Some examples of assimilative ionospheric models are presented by Schunk et al. [2004], Pi et al. [2003], and Khattatov et al. [2005]. [3] Along with the adjustment of partially observed state variables, modern data assimilation methods (such as the Ensemble Kalman Filter) provide a possibility to estimate the perturbed model drivers, which are difficult or impossible to be measured directly. Due to the complexity and high cost of precise ionospheric measurements, the opportunity of model parameters estimation becomes rather attractive. Several papers were published on the parameter estimation using data assimilation techniques, applied to the thermosphere-ionosphere system [e.g., Pi et al., 2003; Codrescu et al., 2004]. In the paper by Pi et al. [2003], the authors use the first-principle ionospheric model and 4D-Var data assimilation technique to estimate the model drivers. The paper by Codrescu et al. [2004] shows the advantages of ensemble-based data S07004 1 of 13 SOLOMENTSEV ET AL.: IONOSPHERE PARAMETER ESTIMATION ENSRF S07004 assimilation scheme, implementing one to thermospheric composition parameters retrieval. The current work introduces the OSSE results of the Ensemble Square Root Filter in the estimation of crucial ionospheric parameters, thus combining approaches of the two papers, mentioned above. [4] Another study has been performed by Scherliess et al. [2009], on the state and parameter estimation in the ionosphere using the Ensemble Kalman Filter technique. In addition to the results presented by Scherliess et al. [2009], in the current study we give a detailed description of the algorithms and the datasets used, together with the technical notes on data assimilation implementation. Moreover, in the current study we apply the technique of the Empirical Orthogonal Functions, described by Navarra and Simoncini [2010] to reduce the size of the augmented state, in particular, the neutral winds, and present the results of this method. In the current paper both neutral winds and ~ E ~ B drift velocities are estimated simultaneously, while Scherliess et al. [2009] derive horizontal winds and electric fields at low latitudes. To verify the accuracy of the parameter estimation within our data assimilation system, we introduce the OSSE framework. Since neither neutral winds in the thermosphere, nor ~ E ~ B drift can be globally observed, this is a necessary step to check the reliability of the data assimilation system. Our paper continues the work on the evaluation of data assimilation schemes for the ionospheric studies and provides an estimate of the errors that can be expected for our method under given conditions. 2. Underlying Models [5] The core physics-based model used in the current research, with minor modifications, was presented by Khattatov et al. [2005]. The model solves three hydrodynamic equations: for densities, velocities and temperatures. All the equations are solved along the magnetic field lines. To reproduce the Earth’s magnetic field, we use a tilted eccentric dipole reference frame (as described by Millward et al. [1996]) with three independent coordinates: l for the magnetic longitude, p for the altitude of the highest point of the flux tube expressed in meters above Earth’s surface, and q. The value of q changes from 1 to 1 along each flux tube and equals to zero at the magnetic equator. The continuity equation (1), as well as the momentum equation (2) are solved for seven major ions: H+, N+, N+2 , NO+, He+, O+, O+2 and electrons. ∂ i i ∂Ni b2s bs þ Ni rp Vi þ Vi rp Ni ¼ Pi Li Ni ∂t ∂s NV 1 Vi ¼ P n in þ neutrals þ P ions g sinðI Þ þ X ions n ij Vj þ n ij ð1Þ bs k Ti ∂Ni Te ∂Ne ∂ðTi þ Te Þ þ þ Ne ∂s mi Ni ∂s ∂s X n in ðVn sinðDÞ Un cosðDÞÞ cosðI Þ [6] In equation (1) i denotes the ion type number, Ni is the i-th ion concentration, RE is the Earth’s radius at the current location, s = qRE, Vi is the ion velocity along the field line, Vp is the ion velocity in p direction, bs ¼ pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 3 1 þ 3 cosf2 RRdipE is the result of transformation to the eccentric dipole coordinates, Rdip is the radius- vector in the dipole coordinate system, f is the dipole latitude, Pi is the ion production term, Li is the ion loss rate due to recombination and ion exchange reactions. The rp denotes the gradient in the direction of the p coordinate. [7] In equation (2), Vi the velocity of the i-th ion type along the field line, mi is i-th ion mass, vin and vij are the ion-neutral and ion-ion collision frequencies respectively. The ion-neutral collision term depends strongly on the neutral wind velocity components: Vn (meridional component, positive northward) and Un (zonal component, positive eastward). Since this term enters equation (2) as a multiplication factor, it has a strong influence on the ion velocity. As for the rest of the terms in equation (2), Ti and Te are the ion and electron temperatures, I and D are the inclination and the declination angles of the local magnetic flux tube respectively, k is the Boltzman’s constant, and g is the acceleration due to gravity. The momentum equation is solved assuming no ion inertia (which is relevant at lowand middle-latitudes). [8] In the energy equation, we take into consideration only three species of ions: (H+, He+ and O+) and the electrons, in order to save computing resources. All other ion temperatures are assumed to be equal to the temperature of O+. Numerical representation of the modeling results is interpolated to the regular global grid of the geographical coordinates with 60 steps in longitude, 80 steps in latitude and 150 steps in altitude. The grid covers the Earth’s surface from 80 N to 80 S in latitude and from 80 up to 20000 km above the surface. [9] The ~ E ~ B drift is accounted for in the continuity equation as the velocity of charged particle motion in p direction. In fact, the continuity equation is solved in two distinct steps: transport along the field line and transverse transport in the direction of p coordinate. The transport of ions and electrons along the magnetic longitude is neglected in the current version of the model, assuming that the meridional electric field is zero. [10] The parametrization used to calculate plasma drift velocity at the magnetic equator is taken from Scherliess and Fejer [1999]. It is an empirical model, based on incoherent scatter radar observations. This model takes current location, local time and solar activity (expressed in F10.7 index) and calculates the velocity v at the magnetic equator in p direction according to equation (3). vðt; f; d; SÞ ¼ ! S07004 13 X 8 X 6 X i¼1 j¼1 ai;j;k fk Nl;4 ðt ÞNj;4 ðfÞ ð3Þ k¼1 ð2Þ neutrals 2 of 13 S07004 SOLOMENTSEV ET AL.: IONOSPHERE PARAMETER ESTIMATION ENSRF S07004 Figure 1. Horizontal wind velocities, obtained from HWM for the time of the modeling experiment, 01-Jan-2011, 02:00 UTC. The points on the plot are horizontal projections of ionospheric model grid points in the tilted eccentric dipole coordinate system. [11] Here t is local time, f is the geographic longitude, d is the day of current year, S is the solar activity index, and N is the univariate normalized cubic-B spline of order four. The coefficients fk on the right side of the equation represent seasonal dependency and are calculated as follows: f1 = 1 in May–August, f2 = 1 in November– February, f3 = 1 in March–April, September–October and f4,5,6 = f1,2,3(S 140). The coefficients ai,j,k represent seasonal patterns and linear variations of vertical drifts with solar activity and solar cycle. [12] Despite the fact that this drift model is widely used and provides sufficient accuracy during stable conditions, such estimated ~ E ~ B cannot account for all the transients which may occur in the region of thermosphere-ionosphere interactions. Since the ~ E ~ B drift is one of the major driving forces in the ionosphere controlling vertical distribution of the charged particles (including parameters, such as thmF2, which are crucial for radio waves propagation), it is important to get good estimates of its value for better representation of the ionospheric state. [13] Another major driving force which is accounted for in the model is the neutral wind in the model grid points. The neutral wind velocity is included in the momentum equation as influencing ion-neutral collision frequencies, which is shown in equation (2). The current ionospheric model uses zonal (Un) and meridional (Vn) neutral winds obtained from the Horizontal Wind Model (HWM) which is described by Hedin et al. [1988]. The HWM is based on the statistical processing of large amounts of experimental data. Figure 1 shows a sample wind pattern at 02:00 UTC obtained from the Horizontal Wind Model for the time period corresponding to the numerical experiment described here. [14] As it is the case of the ~ E ~ B drift, there is an option for our ionospheric model to use wind velocities Un and Vn from a file. This option, in particular, was used during current study, to pass the results of data assimilation to the ensemble member models. 3. Data Assimilation Algorithm [15] As the data assimilation algorithm in this work we used the Ensemble Square Root Filter (EnSRF), described by Whitaker and Hamill [2002]. We implement an ensemblebased data assimilation scheme, because it does not require development of complex tangent and adjoint codes of the model (following Evensen [2007]). Since the underlying ionospheric model is not linear, this advantage becomes significant. 3 of 13 SOLOMENTSEV ET AL.: IONOSPHERE PARAMETER ESTIMATION ENSRF S07004 [16] The base of all Kalman filter implementations is outlined in the following set of equations, as it is shown by Lorenc [1986] xb ðt Þ ¼ Mðxa ðt 1ÞÞ ð4Þ 1 K ¼ Pb HT HPHT þ R ð5Þ xa ¼ xb þ K y Hxb : ð6Þ [17] In the present system of equations the state vector is denoted by x. The superscript b denotes the background state, and the superscript a stands for the analysis. The non-linear model operator M propagates the background state xb from the time step (t 1) to the time step t. P is the covariance of the background state, operator H is converting the state vector to the observations y and K is the so-called Kalman gain matrix, which determines the amount of change applied to the background state, given the observations. [18] In the ensemble approach, Pb can be calculated as a covariance of an ensemble of model forecasts. However, in most EnKF implementation there is no need to store all the Pb values. In the presented system, the observations are assimilated sequentially, one by one. As shown by Houtekamer and Mitchell [1998], this is enough to compute the values of PbHT and HPbHT directly from the ensemble runs. Also, random Gaussian noise should be added to the synthetic observations in order to represent reasonable measurement errors and to avoid underestimation of the analysis covariance. But, as it is shown by Whitaker and Hamill [2002], this approach can lead to spurious background-observation error covariances. Therefore, to obtain better accuracy, using the same ensemble size, one should re-define Kalman gain matrix K as follows: e¼ K sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi!1 R K: 1þ b T HP H þ R ð7Þ [19] In equation (7), as earlier, H is the observational operator, R is observations error, Pb is the background covariance, K is the Kalman gain, defined in equation (5), e is the modified Kalman gain, used to adjust the and K deviations from the ensemble mean. 4. Augmented State [20] The ensemble-based algorithm described in the previous section allows to augment the state vector with the model drivers which are afterwards estimated from measurements. In the presented system, the state vector containing electron densities in each grid point is augmented with an additional set of parameters, which describe the ~ E ~ B drift and the neutral wind velocities. The ion densities are estimated after each analysis step S07004 by multiplying them by a factor which equals to the ratio of post-analysis to pre-analysis electron density at each grid point. The post-analysis sum of all ion concentrations then equals the post-analysis electron density [21] Note that though the state vector is passed to nonlinear model operator M in equation (4) as a whole, the augmented part is not affected on this stage. The values of ~ E ~ B drift and horizontal wind velocities which were obtained from EnKF on the previous time step (t 1) and are stored in the vector xa(t 1), are used in the model equations (1) and (2). The variables of the augmented state of the ensemble members are not recalculated, or by any means corrected by ensemble members models, but are adjusted at the analysis step. [22] As it comes out from equation (3), ~ E ~ B drift significantly changes with magnetic longitude and altitude. Assuming a known dependence on altitude, the number of unknown parameters, related to ~ E ~ B equals to the number of points along the magnetic longitude. In the current system configuration, the ionospheric model has 24 points along the magnetic longitude, so the number of the augmented parameters in this case is rather small. All of the ~ E ~ B drift velocities at different magnetic longitudes can be added to the state vector without noticeably increasing computational burden. [23] Of course, the assumption of known ~ E ~ B drift dependency on altitude sets a limitation on the results of the current research. However, this is a common practice in usage of the empirical parameterizations of ~ E ~ B drift, since the real dependency is poorly known. Despite the applied algorithm does not eliminate this limitation, it allows us to estimate realistic drift velocities, which explain most of the difference between the model and the E ~ B drift. observations, caused by the presence of ~ [24] Since the velocity of the neutral wind is represented by two numbers at each model grid point, it is technically impossible to add all the velocities to the state vector due to the large increase in computational requirements. To reduce the amount of parameters in the augmented state, we use Empirical Orthogonal Functions (EOFs) technique. Derivation of the EOFs from an arbitrary dataset, used in the current work, is described, for example, by Navarra and Simoncini [2010]. To calculate the EOFs, we chose a monthly dataset (from 01/12/2010 to 31/12/2010) with 10-minute time resolution. This means that the Horizontal Wind Model was run for each 10-minute interval within the pointed dates and the results of the run were interpolated to the ionospheric model grid points. Thus, the results of HWM run each time formed a vector containing Un and Vn values for the corresponding grid points. These vectors were then collected into a matrix, rows of which were representing time evaluation of the different variables (i.e. Un or Vn value in a particular grid point). The columns of the data matrix were defining the neutral wind velocities in all the model grid points at the same time step. This matrix was decomposed into the EOFs, representing space distribution patterns of the variables and a set of 4 of 13 S07004 SOLOMENTSEV ET AL.: IONOSPHERE PARAMETER ESTIMATION ENSRF S07004 Figure 2. Representation of differences between original HWM horizontal wind patterns on 01-Jan-2011, 02:00 UTC, and the ones, obtained from EOFs with 7 first coefficients. coefficients representing the temporal evolution. The EOFs are sorted in the descending order of the corresponding eigenvalues. [25] The derived EOFs represent the main spatial patterns of both zonal and meridional wind components. The original matrix can be restored by multiplying the EOFs by the set of coefficients which correspond to the particular time step. Since the number of the coefficients equals to the number of the EOFs, we do not achieve any compression. Nevertheless, as shown by Navarra and Simoncini [2010], the original matrix can be restored using the truncated set of the EOFs and the coefficients without significant loss in accuracy. The study has been performed to estimate the number of the EOFs which can represent the wind patterns with reasonable precision. Figure 2 shows the errors in the recovery of the horizontal wind fields with only 7 major empirical functions. [26] The wind pattern recovered from the EOFs is cut, since in the observation system simulation we have used only those GPS ground stations’ coordinates which are located close to the equator (for proper ~ E ~ B drift recovery purposes, see section 5 for details). Thus, we have made a decomposition of the wind velocities only in 40 S to 40 N region. The date used for modeling was 01/01/2010. The results from the HWM for that day were not included into the EOFs calculation. [27] After the EOFs decomposition, we added K EOFs coefficients into the state vector, together with ~ E ~ B drift velocities. The value of K was changed during the simulation runs from 7 to 12 coefficients. As shown in Figure 2, even 7 EOFs can give reasonable accuracy of the whole wind dataset recovery process. The resulting control vector for the whole state and parameter estimation is given in the following equation: 8 9 Ne ð1Þ > > > > > > > > :: > > > > > > > Ne ðN Þ > > > > > > > > < vExB ð1Þ > = b :: x ¼ : > > > > vExB ðMÞ > > > > > > > > > EC1 > > > > > > > > > :: > > : ; ECK ð8Þ [28] Here Ne denotes the electron density in the particular grid point of the ensemble member, vExB is the velocity of the ~ E ~ B drift at one of the magnetic equator points from 1 to M, and EC are the EOFs coefficients for neutral winds representation. Using simulated observations, we aimed to recover the ~ E ~ B drift velocities, as well as Un and Vn values. In a free running assimilation scheme the state vector is gradually adjusted with time as more and more data points are added to the system, resulting in the initial “spin-up” period where estimation results are of poor 5 of 13 S07004 SOLOMENTSEV ET AL.: IONOSPHERE PARAMETER ESTIMATION ENSRF S07004 Figure 3. Map of GPS reference stations. Data from these stations were modeled in the simulation experiment. quality. In order to simplify data handling in our OSS experiment, we instead used iterative adjustments at each model step until convergence is reached (see section 5 for details). 5. Simulation of Observations and Ensemble Setup [29] Most of the ionospheric nowcasting systems which use data assimilation techniques to obtain an accurate estimate of the space weather conditions, utilize GPS ground network information as the primary data source (e.g. presented in papers by Schunk et al. [2003], Bailey and Balan [1996], and others). This is due to public availability of most of the archives (e.g. SOPAC), their stable functionality and regular updates. The data, that comes from GPS ground segment networks contains satellite ephemeris and observation records. The latter include records of the satellite-receiver pseudorange and the carrier phase on the two basic GPS frequencies. This allows one to calculate the TEC along the line of sight between the satellite and the receiver. [30] To obtain the simulated observations equivalent to those commonly used in the ionospheric data assimilation systems, we took the coordinates of some of the IGS (International GNSS Service) network base stations. They are plotted in Figure 3. Then we simulate the TEC along the satellite-receiver line of sight. Most of the stations, used in the experiment are located close to the equator. We chose this set of ground receivers coordinates because we assume that the influence of the ~ E ~ B is most significant in this region. [31] To obtain the slant TEC values, we perform a model run using empirical ~ E ~ B drift and Un and Vn values (calculated based on Scherliess and Fejer [1999] and Hedin et al. [1988] respectively) as inputs. The results of this run after adding the 15% noise (to represent the physical model uncertainties) are now treated as the true ionospheric state. We have performed a number of ensemble runs using different noise levels. The run with a 15% noise level produced the best results. We therefore accepted this number as our best estimate of the physical model error. After the model run, we calculate all the GPS satellites coordinates and determine which satellites are visible from each of the selected ground stations. Then the following steps are performed: calculations of the lines’ of sight coordinates, interpolation of the “true” state on the calculated lines of sight and integration of the interpolated electron density profiles. Thus, we have simulated the observations which contain the information about the ~ E ~ B drift, as well as of the Un and Vn values. [32] In the data assimilation scheme, the initial guess for the ~ E ~ B drift values on each time step is zero, i.e. we assume to have no ~ E ~ B drift at the beginning of the data assimilation step. Similarly, we use a zero vector of the EOFs coefficients as the initial guess for the neutral wind patterns. The wind velocities outside the recovery box were not changed. This initial guess results in relatively large differences between the model and the observations. For example, Figure 4 shows the differences between the synthetic observations and the ensemble member with zero initial guess of the ~ E ~ B drift in terms of the vertical TEC. [33] The state vector of each ensemble member contains approximately 25000 elements. The augmented state has size of 24 extra parameters for the ~ E ~ B drift velocities and from 7 to 12 EOFs coefficients, representing the neutral winds in the region of interest. After adding Gaussian random component, the data is passed to each ensemble member. Within the ionospheric model, the data is preprocessed for further calculations since the solution of the continuity equation requires knowledge of the ~ E ~ B drift 6 of 13 S07004 SOLOMENTSEV ET AL.: IONOSPHERE PARAMETER ESTIMATION ENSRF S07004 Figure 4. Differences in vertical TEC values, obtained from model with no E B drift and with E B drift, calculated according the model (3). Time is 03-Jan-2011, 03:00 UTC. velocity at each point of the magnetic equator. To do so, we assume that the drift velocity is constant up to 5000 km above the Earth’s surface. From 5000 to 9000 km, the velocity is multiplied by an adjustment term to reproduce slow linear decrease. After 9000 km, the ~ E ~ B drift velocity is supposed to be zero. As for the neutral winds, the coefficients from the control vector were multiplied by the EOFs matrix. The resulting vector of this multiplication contains the neutral wind velocities Un and Vn at the model grid points within the selected region. [34] At each iteration we introduced small (5–10% in terms of relative value) perturbations in the initial state vector (which accounts for model error, rising from discretization and misrepresentation of ionospheric physics) and in the adjusted state. After perturbing the state, the system performs the ensemble run for one time step. For most of the OSSE versions, the model time step equaled to 10 min. To make the situation more realistic, we did not use all satellite-receiver links at the moment of data assimilation. Instead we chose about 900 pieces of data, which also set moderate constraints on computing resources. [35] To avoid spurious correlations in the results due to estimating error covariance from a relatively small number of ensemble members, we have used the covariance localization technique which is outlined in equation (9). We also introduce small covariance inflation factor (at a level of 1.1–1.3) to reduce the sampling errors in the finite-size ensemble. ! ðfi fstation Þ4 ðqi qstation Þ4 ðhi hstation Þ : PH ¼ PH exp Lh Lf Lq T T ð9Þ [36] Here, as earlier, P is the background state covariance, H is the observations mapping operator. The sign ∘ denotes a Schur product. The geographical coordinates are denoted as f (latitude, positive northward), q (longitude, positive eastward) and h (altitude). Values of Lf, Lq and Lh denote the model correlation lengths. Such an artificial covariance adjustment allows us to use the ensemble of moderate size for parameter estimation purposes. Usually, the ensemble contained 60–80 members. The correlation lengths values were estimated by optimizing the observations-minus-forecast (OmF) residuals from the sequential assimilation system currently working at Central Aerological Observatory. The system uses the same underlying physics-based model as the one used in this study. The values used in the current OSSE are as follows: Lf = 1.25 , Lq = 1.6 and Lh = 40 km. [37] As mentioned in the previous section, we use an iterative approach to recover the augmented state vector at 7 of 13 S07004 SOLOMENTSEV ET AL.: IONOSPHERE PARAMETER ESTIMATION ENSRF S07004 Figure 5. Scheme of EnSRF process iterations. each time step. This means, that after the ensemble is run and the EnSRF equations are solved, we use the resulting analysis values of the ~ E ~ B drift and the EOFs coefficients again. We are able to obtain sufficient accuracy at each time step using from 8 to 12 iterations. After the fixed amount of iterations or when the system converges to a stable solution (see for example Figure 11), we pass all the analysis values to the next time step, where the iterative process begins again. The loop of iterations for one time step is shown in Figure 5. [38] The challenge of the ~ E ~ B drift and neutral wind velocity recovery using the ensemble-based assimilation methods, is that all of them enter the model equations as multiplicative, time-dependent terms, adjustment of which can result in filter divergence and model “blow-up”. Several heuristic techniques have been presented in the recent papers [e.g., Yang and DelSole, 2009] to avoid such problems. In the current work, we have obtained reasonable accuracy and stability without any additional filter-result manipulations. [39] The underlying model was implemented in C++ in an object-oriented manner. The ensemble data assimilation system presented in the current paper was implemented using MatLab. Several steps of the data assimilation algorithm with independent data flows were parallelized with MatLab Parallel Computing Toolbox : [40] 1. Ensemble runs. Since separate ensemble members are independent from each other at the stage of forward model integration, this step can be implemented in parallel. The model run is often one of the most time-consuming stages of the whole procedure. By distributing this task over 16 independent MATLAB instances, the timing at this stage can be improved up to a factor of 7–9. [41] 2. GPS data modeling and Hxb calculation. These processes can also be run in parallel, since the data flows are totally independent. The timing at this stage can be improved by a factor of 3–4 through the use of MatLab distributed arrays. [42] Given this software setup, the most time-consuming part of the algorithm is the EnSRF procedure. The calculation speed thus strongly depends on the amount of data, the ensemble size and model grid resolution. 6. Results and Discussion: State Estimation [43] The primary goal of any data assimilation algorithm is estimation of the system state vector. In the introduced system, the state vector is updated according to equation (8). We estimate the accuracy of the state estimation using the following algorithm. In the process of forming vectors ~ y and HXb , approximately 10% of all the available data is left for validation purposes. The corresponding slant TEC values are then compared with the observations values at 8 of 13 S07004 SOLOMENTSEV ET AL.: IONOSPHERE PARAMETER ESTIMATION ENSRF S07004 Figure 6. Time evolution of RMS error of data assimilative model, comparing to the TEC data, not assimilated in the model. each time step. The results of such a process are shown in Figure 6. [44] The RMS error evolution shown in the figure was calculated using all the validation values at each time step. Usually 10% of the available data results in 100–150 TEC observations. The system showed sufficient accuracy in TEC estimation along with parameters retrieval. The temporary increase in the relative error after 17:00 UT could be caused by small absolute TEC values, approaching the absolute observational error in those data points that have been at random used for validation. More accurate results could be obtained by using more observations at each time step and making the simulated/real network more dense. 7. Results and Discussion: Parameters Estimation [45] Following the experimental setup described in the previous section, we have performed series of the ensemble runs. Within each run we have slightly varied the ensemble size and the number of observations to investigate the ensemble convergence speed and the resulting accuracy of the calculations. Sample results are described below. [46] Figure 7 represents the ~ E ~ B drift longitudinal dependence. Estimation results for two different ensemble setups and the initial guess are shown as well as the ~ E ~ B drift velocities used for preparing the synthetic observations. Most of the drift velocities which were used in the observations modeling were recovered with reasonable accuracy. The first ensemble setup with 80 members have resulted in a big discrepancy in the region from 0 to 25 W in longitude. Initially, we have assumed low data density in the region to be the cause of the large errors, but after th ensemble size increase, the results improved. Initial guess, shown in Figure 7, slightly differs from zero, because of perturbations, introduced at the first iteration. Figure 8 demonstrates the evolution of the RMS error distributions for all observations. [47] We also present the EnSRF inversion results for the neutral wind velocities. Recovered along with ~ E ~ B drift, EOFs coefficients, after multiplication with EOFs matrix, give the values of Un and Vn. Errors of the estimation are shown in Figures 9 and 10. The errors are calculated at each grid point at each time step and averaged over all time steps within a single-day run. [48] Current ensemble setup consisted of 80 members and 1000 pieces of simulated GPS data. As it can be figured 9 of 13 S07004 SOLOMENTSEV ET AL.: IONOSPHERE PARAMETER ESTIMATION ENSRF S07004 Figure 7. Recovery results for two different ensemble configurations, 01-Jan-2011, 12:00 UTC. Circles represent initial guess values, triangles designate E B drift values, used for observations simulations, squares represent recovery results for 80-member ensemble on 10th iteration and diamonds are results for 100-member ensemble run on 10th iteration. out from the comparison of Figure 9 with Figure 1, the technique of Empirical Orthogonal Functions allows us to restore most significant components of neutral wind velocity patterns. The results of another ensemble configurations (not shown) demonstrate that increasing the density of observations in the selected region, as expected, increases quality of parameter estimation. [49] Figure 11 shows the evolution of RMS error of ~ E ~ B drift recovery for different ensemble configurations. Even a moderate ensemble setup with 80 members and 800 observations can be used to recover the augmented state with reasonable accuracy. A bigger ensemble (100 members), utilizing more data (1000 TEC observations) shows better results, as expected. Nevertheless, the speed of filter convergence varies very little with the ensemble size and data amount. Since we have used only coordinates of real GPS reference stations, the more of them we include in the simulation, the more distant they are from the magnetic Figure 8. Evolution of recovery RMS error distribution between ~ E ~ B drift, used in observations simulation and all the ensemble members results for each point on the magnetic equator separately. (left) First iteration of 80-member ensemble and (right) 10th iteration. 10 of 13 S07004 SOLOMENTSEV ET AL.: IONOSPHERE PARAMETER ESTIMATION ENSRF S07004 Figure 9. Map of absolute errors in estimation of two horizontal wind components on 10-th iteration of 100-member ensemble. Time is 01-Jan-2011, 03:00 UTC. Figure 10. RMS errors distribution of horizontal wind recovery errors for 100-member ensemble. Time is 01-Jan-2011, 03:00 UTC. 11 of 13 SOLOMENTSEV ET AL.: IONOSPHERE PARAMETER ESTIMATION ENSRF S07004 S07004 Figure 11. RMS error evolution with iterations for different ensemble configurations. Red filled rounds represent ensemble of 80 members, utilizing 800 observations. Blue filled triangles designate ensemble of 100 members, using 1000 observations. equator. Thus, the weight of each additional TEC measurement is likely to decrease. [50] Our current data assimilation system is a trade-off between the accuracy and computing resources consumption, so the information provided in Figure 11 might also be useful for further practical implementations. 8. Conclusions [51] In the present paper, the setup and results of the numerical data assimilation experiment are discussed. Using the first-principle model of the ionospheric plasma which accounts for most major driving forces and the Ensemble Square Root Filter, we have recovered the values of the ~ E ~ B drift and the neutral wind velocities, based on the simulated GPS observations. Augmentation of the state vector with Un and Vn wind components also required data compression using the EOFs technique. [52] We introduce the results of several ensemble setups which have achieved reasonable accuracy in the augmented state estimation. The results show different accuracy levels, depending on the ensemble size and the amount of assimilated data. The filter convergence speed, on the contrary, seems to depend little on these parameters. [53] Currently, major improvements of ionospheric model and data assimilation system are considered for future work. Usage of more sophisticated ionospheric model could help to sufficiently increase the parameter estimation accuracy. Also, the developed data assimilation system allows one to add more parameters, if more experimental data are available. Since the ensemble approach is relatively easy to implement, we are planning to use other sources of the ionospheric data. Also, given positive results of the data simulation experiment, we plan to modify the assimilative model to use actual data for augmented state estimation. These improvements will be implemented in the future work. [54] Acknowledgments. Authors would like to thank the collective of Central Aerological Observatory for providing scientific advice and funding of the current research. We are grateful to Louis J. Lanzerotti and two anonymous reviewers for their valuable comments and constructive suggestions. References Bailey, G. J., and N. Balan (1996), A low-latitude ionosphereplasmasphere model, in STEP Handbook on Ionospheric Models, edited by R. W. Schunk, pp. 173–206, Utah State Univ., Logan. Bilitza (2001), International Reference Ionosphere 2000, Radio Sci., 36(2), 261–275. Codrescu, M. V., T. J. Fuller-Rowell, and C. F. Minter (2004), An ensemble-type Kalman filter for neutral thermospheric composition during geomagnetic storms, Space Weather, 2, S11002, doi:10.1029/ 2004SW000088. Evensen, G. (2007), Data Assimilation: The Ensemble Kalman Filter, Springer, Berlin. Fuller-Rowell, T. J., D. Rees, S. Quegan, R. J. Moffett, M. V. Codrescu, and G. H. Millward (1996), A coupled thermosphere-ionosphere model (CTIM), in STEP Handbook on Ionospheric Models, edited by R. W. Schunk, pp. 217–238, Utah State Univ., Logan. Hedin, A. E., N. W. Spencer, and T. L. Killeen (1988), Empirical global model of upper thermosphere winds based on Atmosphere and Dynamics Explorer satellite data, J. Geophys. Res., 93, 9959–9978. Houtekamer, P. L., and H. L. Mitchell (1998), Data assimilation using an ensemble Kalman filter technique, Mon. Weather Rev., 126, 796–811. Khattatov, B., M. Murphy, M. Gnedin, J. Sheffel, J. Adams, B. Cruickshank, V. Yudin, T. Fuller-Rowell, and J. Retterer (2005), Ionospheric nowcasting via assimilation of gps measurements 12 of 13 S07004 SOLOMENTSEV ET AL.: IONOSPHERE PARAMETER ESTIMATION ENSRF of ionospheric electron content in a global physics-based timedependent model, Q. J. R. Meteorol. Soc., 131, 3543–3559. Lorenc, A. C. (1986), Analysis methods for numerical weather prediction, Q. J. R. Meteorol. Soc., 112, 1177–1194. Luhr, H., and C. Xiong (2010), IRI-2007 model overestimates electron density during the 23/24 solar minimum, Geophys. Res. Lett., 37, L23101, doi:10.1029/2010GL045430. Millward, G. H., R. J. Moffett, S. Quegan, and T. J. Fuller-Rowell (1996), A coupled thermosphere-ionosphere-plasmasphere model (CTIP), in STEP Handbook on Ionospheric Models, edited by R. W. Schunk, pp. 239–279, Utah State Univ., Logan. Navarra, A., and V. Simoncini (2010), A Guide to Empirical Orthogonal Functions for Climate Data Analysis, Springer, London. Pi, X., C. Wang, G. A. Hajj, G. Rosen, B. D. Wilson, and G. J. Bailey (2003), Estimation of E B drift using a global assimilative S07004 ionospheric model: An observation system simulation experiments, J. Geophys. Res., 108(A2), 1075, doi:10.1029/2001JA009235. Scherliess, L., and B. G. Fejer (1999), Radar and satellite global equatorial F region vertical drift model, J. Geophys. Res., 104(A4), 6829–6842. Scherliess, L., D. C. Thompson, and R. W. Schunk (2009), Ionospheric dynamics and drivers obtained from a physics-based data assimilation model, Radio Sci., 44, RS0A32, doi:10.1029/2008RS004068. Schunk, R. W., L. Scherliess, and J. J. Sojka (2003), Recent approaches to modeling ionospheric weather, Adv. Space Res., 31(4), 819–828. Schunk, R. W., et al. (2004), Global Assimilation of Ionospheric Measurements (GAIM), Radio Sci., 39, RS1S02, doi:10.1029/ 2002RS002794. Whitaker, J. S., and T. M. Hamill (2002), Ensemble data assimilation without perturbed observations, Mon. Weather Rev., 130, 1913–1924. Yang, X., and T. DelSole (2009), Using the ensemble Kalman filter to estimate multiplicative model parameters, Tellus, 61, 601–609. 13 of 13
© Copyright 2026 Paperzz