This article was downloaded by: [Nkrintra Singhrattna] On: 23 January 2012, At: 16:41 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House, 37-41 Mortimer Street, London W1T 3JH, UK Hydrological Sciences Journal Publication details, including instructions for authors and subscription information: http://www.tandfonline.com/loi/thsj20 Hydroclimate variability and long-lead forecasting of rainfall over Thailand by large-scale atmospheric variables a a Nkrintra Singhrattna , Mukand S. Babel & Sylvain R. Perret a a b Water Engineering and Management, Asian Institute of Technology, Pathumthani, Thailand b Centre de Coopération Internationale en Recherche Agronomique pour le Développement, UMR G-Eau, Montpellier, France Available online: 20 Jan 2012 To cite this article: Nkrintra Singhrattna, Mukand S. Babel & Sylvain R. Perret (2012): Hydroclimate variability and long-lead forecasting of rainfall over Thailand by large-scale atmospheric variables, Hydrological Sciences Journal, 57:1, 26-41 To link to this article: http://dx.doi.org/10.1080/02626667.2011.633916 PLEASE SCROLL DOWN FOR ARTICLE Full terms and conditions of use: http://www.tandfonline.com/page/terms-and-conditions This article may be used for research, teaching, and private study purposes. Any substantial or systematic reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any form to anyone is expressly forbidden. The publisher does not give any warranty express or implied or make any representation that the contents will be complete or accurate or up to date. The accuracy of any instructions, formulae, and drug doses should be independently verified with primary sources. The publisher shall not be liable for any loss, actions, claims, proceedings, demand, or costs or damages whatsoever or howsoever caused arising directly or indirectly in connection with or arising out of the use of this material. 26 Hydrological Sciences Journal – Journal des Sciences Hydrologiques, 57(1) 2012 Hydroclimate variability and long-lead forecasting of rainfall over Thailand by large-scale atmospheric variables Nkrintra Singhrattna1 , Mukand S. Babel1 and Sylvain R. Perret1,2 1 Water Engineering and Management, Asian Institute of Technology, Pathumthani, Thailand [email protected], [email protected] 2 Centre de Coopération Internationale en Recherche Agronomique pour le Développement, UMR G-Eau, Montpellier, France Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 Received 27 May 2010; accepted 25 March 2011; open for discussion until 1 July 2012 Editor Z.W. Kundzewicz Citation Singhrattna, N., Babel, M.S. and Perret, S.R., 2012. Hydroclimate variability and long-lead forecasting of rainfall over Thailand by large-scale atmospheric variables. Hydrological Sciences Journal, 57 (1), 26–41. Abstract The development of statistical relationships between local hydroclimates and large-scale atmospheric variables enhances the understanding of hydroclimate variability. The rainfall in the study basin (the Upper Chao Phraya River Basin, Thailand) is influenced by the Indian Ocean and tropical Pacific Ocean atmospheric circulation. Using correlation analysis and cross-validated multiple regression, the large-scale atmospheric variables, such as temperature, pressure and wind, over given regions are identified. The forecasting models using atmospheric predictors show the capability of long-lead forecasting. The modified k-nearest neighbour (k-nn) model, which is developed using the identified predictors to forecast rainfall, and evaluated by likelihood function, shows a long-lead forecast of monsoon rainfall at 7–9 months. The decreasing performance in forecasting dry-season rainfall is found for both short and long lead times. The developed model also presents better performance in forecasting pre-monsoon season rainfall in dry years compared to wet years, and vice versa for monsoon season rainfall. Key words rainfall; hydroclimate variability; ENSO; large-scale atmospheric variables; long-lead forecasting; statistical approach; modified k-nn model; cross-validated multiple regression; Chao Phraya River Basin; Ping River Basin; Thailand Variabilité hydroclimatique et prévision à long terme des précipitations en Thaïlande à l’aide de variables atmosphériques de grande échelle Résumé L’établissement de relations statistiques entre variables pluviométriques locales et variables atmosphériques de grande échelle permet une meilleure compréhension de la variabilité hydroclimatique. Les précipitations dans le bassin d’étude (le bassin supérieur de la Rivière Chao Phraya, Thaïlande) sont influencées par la circulation atmosphérique dans l’Océan Indien et dans l’Océan Pacifique tropical. Par l’analyse de corrélations et régressions multiples en validation croisée, les prédicteurs de variables atmosphériques de grande échelle, à savoir température, pression et vent, sont identifiés sur des régions cibles, et montrent une bonne capacité de prévision à long terme. Le modèle modifié des k-plus proches voisins (k-nn), développé par les prédicteurs identifiés pour prévoir les pluies, et évalué par fonction de probabilité, montre une prévisibilité à long terme (7–9 mois) des pluies de mousson. La moindre performance dans la prévision des pluies de saison sèche est prouvée pour la prévision à court et à long terme. Le modèle développé présente également une meilleure performance pour les pluies de pré-mousson en année sèche, par rapport à une année humide, et inversement pour les pluies de mousson. Mots clefs précipitations; variabilité hydroclimatique; ENSO; variables atmosphériques de grande échelle; prévision à long terme; approche statistique; modèle k-nn modifié; régression multiple à validation croisée; bassin du Fleuve Chao Phraya; bassin du Fleuve Ping; T haïlande 1 INTRODUCTION The links between large-scale atmospheric variables and local hydroclimates are widely studied and reported to diagnose the variability of local ISSN 0262-6667 print/ISSN 2150-3435 online © 2011 IAHS Press http://dx.doi.org/10.1080/02626667.2011.633916 http://www.tandfonline.com hydroclimates (Smith and O’Brien 2001). The anomalies of the climate variables known as atmospheric indices, e.g. the anomalous sea-surface temperature (SST) index or El Niño-Southern Oscillation Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 Hydroclimate variability and long-lead forecasting of rainfall over Thailand by large-scale atmospheric variables (ENSO), can strengthen or weaken a monsoon (Mason and Goddard 2001, Smith and O’Brien 2001). Thus, the atmospheric anomalies are responsible for the variability of local hydroclimates, with respect to temperature (Gershunov 1998, Pavia et al. 2006), precipitation (Masiokas et al. 2006) and streamflow (Gutiérrez and Dracup 2001, Meko and Woodhouse 2005). Due to an anomalous phase of the ocean and atmosphere, the responses of local hydroclimates may not be experienced with the same level of variability. In some regions, storms develop more strongly and frequently (Cañón et al. 2007, Kim et al. 2006), whereas in other regions, dry conditions are experienced with droughts (Mendoza et al. 2005) during longer periods (Schöngart and Junk 2007). The effects of anomalous oceanic–atmospheric circulation have changed climate patterns not only along the coast, but also in regions located far from it (Saravanan and Chang 2000). Moreover, their effects are difficult to identify because the local climate may be influenced by nearby and distant oceans through the coupled atmospheric–oceanic circulation (Wu and Kirtman 2004, Nagura and Konda 2007). The teleconnection influences could be determined by the lag time of anomalous events up to several months after a phenomenon occurs (Sourza Filho and Lall 2003, Grantz et al. 2005), which makes it difficult to understand. The annual variability of the Southeast Asian climate is influenced by the interactions between the annual reversal of surface monsoonal winds from the Indian Ocean to the equatorial western Pacific Ocean, including the South China Sea, and the complex distribution of land, sea and terrain. For the seasonal development of surface winds, the Asian summer monsoon that develops due to the land–ocean temperature gradient is influenced by the sea-surface temperature (SST) anomalies over the equatorial eastern Indian Ocean (Goswami et al. 2006) and the tropical Pacific Ocean (Fasullo and Webster 2002). The non-convective anomalies over the equatorial Indian Ocean tend to weaken the convection over the Bay of Bengal, the eastern Indian Ocean, Southeast Asia and the equatorial western Pacific Ocean (Krishnan et al. 2000). Thus, the anomalous oceanic–atmospheric circulation, e.g. ENSO, affects the Asian summer monsoon and subsequently the monsoon rainfall. A warm phase of ENSO (El Niño) is associated with a weak monsoon and below-normal rainfall, whereas it is the opposite for a cold phase of ENSO, i.e. La Niña (Shrestha 2000, Shrestha and Kostaschuk 2005). However, the influences over a particular region 27 have to be carefully investigated due to the different responses which make it hard to define a certain pattern of local hydroclimates (An et al. 2007). The lagging relationships of local hydroclimates with anomalous atmospheric events are also of interest when developing hydroclimatic forecasting models. Several models, such as artificial neural networks (ANNs) (Silverman and Dracup 2000), multiple regression model (Schöngart and Junk 2007) and principal component analysis (Eldaw et al. 2003), have been developed, and show better performance using the predictors of large-scale atmospheric information (Singhrattna et al. 2005a, Zehe et al. 2006). The basic variables, e.g. SST, surface air temperature, atmospheric pressure, wind and ENSO standard indices, are applied in the forecasting models (Gershunov 1998, Clark et al. 2001, Hamlet et al. 2002, McCabe and Dettinger 2002, Grantz et al. 2005) based on the significant relationships between large-scale atmospheric variables and hydroclimates. The forecasting models enable successful short-lead projections to be made (Gutiérrez and Dracup 2001, Schöngart and Junk 2007); however, the model performance of long-lead forecasts still needs to be improved (Krishna Kumar et al. 1995, DelSole and Shukla 2002). The decadal variability of large-scale atmospheric variables used as the predictors of a model could weaken the relationships with hydroclimates and subsequently decrease the model performance of long-lead forecasts. The objectives of this research are to study the climate characteristics and their variability in part of the Upper Chao Phraya River Basin located in Thailand (Southeast Asia), and to identify the predictors of large-scale atmospheric variables at long-lead times for rainfall forecasting. Using the identified variables as predictors, the rainfall forecasting model, based on a statistical-stochastic approach, is ultimately developed to forecast the seasonal rainfall at long lead times. 2 THE STUDY BASIN Thailand, located between the Indian Ocean and the Gulf of Thailand which is connected to the Pacific Ocean (Fig. 1(a)), covers an area of 513 115 km2 with a population of 62.4 million (2005). Its climate is influenced by the Indian Ocean and the Pacific Ocean due to the land–ocean interrelated temperature gradients. The summer season, from mid-February to midMay, is responsible for developing the land–ocean temperature gradient that strengthens the southwest 28 Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 (a) Nkrintra Singhrattna et al. (b) (c) Fig. 1 (a) Map of Thailand, (b) Chao Phraya River Basin, and (c) Ping River Basin and locations of rainfall and temperature stations. monsoon from the Indian Ocean in the following rainy season. The rainy season is caused by the Inter-Tropical Convergence Zone (ITCZ) and by the southwest monsoon, and lasts from mid-May to midOctober. Due to the low intensity and uncertainty of rainfall occurrence, the period from May to July is called the pre-monsoon season, or transition period from summer to monsoon season. In contrast, heavy rainfall occurs from August to October. The average annual rainfall in the country ranges from 1200 to 1600 mm. The winter season is characterized by dry and cool winds brought by the northeast monsoon from the mid-latitudes, and lasting from mid-October to mid-February. In terms of streamflow, Thailand generates about 289 × 109 m3 per year of total average runoff in 25 major river basins. However, only 38 × 109 m3 per year or 13.1% of the annual runoff can be stored in reservoirs and supplied to various sectors for use within the country (RID 2007). The Chao Phraya River Basin (Fig. 1(b)) is the largest among the 25 major basins, covering an area of 178 000 km2 or 35% of the country’s land area. Four major tributaries, the Ping, Wang, Yom and Nan rivers, merge at Nakhon Sawan and form the Chao Phraya River. The portion above the confluence at Nakhon Sawan is called the Upper Chao Phraya River Basin, whereas that below the confluence is the Lower Chao Phraya River Basin. The upper basin covers an area of 102 635 km2 or 58% of the Chao Phraya River Basin. The study basin, the Ping River Basin, is located in northern Thailand, between 15◦ –19◦ N latitude and 98◦ –100◦ E longitude, and covers an area of 33 899 km2 (Fig. 1(c)). The Ping River is 740 km long and many storage dams are located along the river and its tributaries. The most important is the Bhumipol Dam, constructed for the purposes of hydropower, agriculture, flood mitigation, fishery and transportation. It is 154 m high and 486 m long with a maximum storage capacity of 13.462 × 109 m3 and installed capacity of 779.2 MW of hydropower. In terms of land use, 71.46% of basin area is covered with forests, which are mostly located in the upstream or the origin of the Ping River. The remainder, along both sides of the river and the plain downstream, is covered by agricultural and residential areas and water bodies. 3 DATA 3.1 Hydroclimatic data Two hydroclimate data sets—rainfall and temperature—from the climatic stations located in and around the Ping River Basin, are used in this study. Both data are recorded on a daily basis. Out of 208 rainfall stations operated by the Royal Irrigation Hydroclimate variability and long-lead forecasting of rainfall over Thailand by large-scale atmospheric variables Department (RID), Thailand Meteorological Department (TMD) and Department of Water Resources (DWR) of the Royal Thai Government, 50 stations are selected based on the length of time series and the occurrence of incomplete data. Daily rainfall data observed at the selected stations are obtained for the period 1950–2007, with incomplete data constituting less than 5% of the recent 30 years’ data. Fisher’s Transformation (z ) is used to test the hypothesis on significance of a sample correlation coefficient by (Haan 2002): 1+r z = 0.5 ln 1−r (1) where r is the correlation coefficient between two independent samples, and z is approximately normally distributed, and is used to compare with z of the standard normal distribution to obtain the significance level (p) of correlation. Coefficients of correlation obtained for monthly rainfall of all selected stations varied from 0.46 to 0.96 and are well correlated at the 95% significance level by Fisher’s Transformation. Therefore, the average values are calculated over 50 selected stations. For daily air temperature, 11 stations of the TMD are selected with the length of data varying from 16 to 56 years. The average is estimated over 11 selected stations which are statistically correlated at 95% confidence level. The locations of selected rainfall and temperature stations are shown in Fig. 1. the region of 2◦ N–2◦ S latitude and 70◦ –90◦ E longitude. The ION data set is provided by the Physical Sciences Division of NOAA (PSD 2007b). The SST over the South China Sea (SCS) is the SST averaged over the region of 5◦ N–5◦ S latitude and 100◦ –115◦ E longitude. The SCS data set is provided by the PSD (PSD 2007a). 3.3 Large-scale atmospheric data The large-scale atmospheric variables used in this study are obtained from re-analysis-derived data, which are provided by the NCEP/NOAA (Kalnay et al. 1996). Several variables, on both a daily and a monthly basis, are available from 1948 to the present, and cover a global grid of 2.5◦ latitude × 2.5◦ longitude (PSD 2007a). This study uses monthly data of four principal atmospheric variables from 1948–2007: the surface air temperature (SAT); sealevel pressure (SLP); surface zonal wind (SXW); and surface meridian wind (SYW). These variables play an important role in strengthening or weakening a monsoon, and influence the convection over the study basin. The significant ability of forecasting models can be obtained using combinations of these variables (Sahai et al. 2003). 4 CLIMATE DIAGNOSTIC 4.1 Inter-annual climate The annual cycles of air temperature and rainfall in the Ping River Basin are shown in Fig. 2. The summer 3.2 Standard sea-surface temperature (SST) 250 This study used monthly data of SST for 1950–2007 over four regions: NINO 3, NINO 4, ION and SCS, as follows. The standard SSTs over the tropical Pacific Ocean used in this study are: These SSTs are provided by the Climate Prediction Center (CPC 2007) of the US National Oceanic and Atmospheric Administration (NOAA). The standard SST over the Indian Ocean (ION) is the SST obtained from two data sets: the Comprehensive Ocean–Atmosphere Data Set (COADS) (Woodruff et al. 1993) and real-time surface marine data from the National Centers for Environmental Prediction (NCEP), and averaged over 32 Rainfall Temperature 200 30 150 28 100 26 50 24 Temperature (°C) – NINO 3: average SST over the area of 5◦ N–5◦ S latitude and 90◦ –150◦ W longitude; and – NINO 4: average SST over the area of 5◦ N–5◦ S latitude and 150◦ W–160◦ E longitude. Rainfall (mm) Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 29 22 0 J F M A M J J A S O N D Fig. 2 Annual cycle of mean monthly air temperature from 11 selected stations for 1951–2006 and mean monthly rainfall from 50 selected stations for 1950–2007. season occurs during March-April-May (MAM). The maximum temperature of 30.2◦ C is observed in April, and the minimum of 22.9◦ C in December. Figure 2 shows that rainfall is bi-modal with two peaks: one in May and another in August. Considering the primary peak, the rainy or monsoon season in the basin occurs during August-September-October (ASO). Moreover, the secondary peak is observed during May-June-July (MJJ), which is the pre-monsoon season. The secondary peak of the pre-monsoon is related to the southwest monsoon and the ITCZ passing from the Indian Ocean to Thailand in May and to the South China Sea and central China in mid-June. However, the primary peak is associated with the ITCZ as it moves back over Thailand during ASO. From 58 years of rainfall data, the maximum MJJ rainfall of 651.8 mm and the maximum ASO rainfall of 948.3 mm are found in 1950. The minimum MJJ rainfall of 254.0 mm is in 1997 and the minimum ASO rainfall of 387.7 mm in 2004. The total annual rainfall varies from 843.0 to 1605.6 mm. The MJJ and ASO rainfall is about 88% of the total annual rainfall. The remaining rainfall is the dry-season rainfall which falls in November–April and ranges from 49.4 to 295.2 mm over 58 years. 4.2 Inter-seasonal climate MAM Temperature (°C) The air temperature is related to rainfall in terms of developing the temperature gradient between land and sea that subsequently strengthens the monsoon. In this study, the summer season temperature is averaged for MAM. Figure 3 shows the inverse relationship between the dry-season rainfall and MAM air temperature. As expected, if more rainfall is received during November–April, it would tend to cool the land and atmosphere, thus decreasing the air temperature during MAM and vice versa. 31 30 29 28 T = -0.0011R + 29.47 27 50 100 150 200 250 Dry Season Rainfall (mm) 300 Fig. 3 Scatter plots between dry-season (November–April) rainfall and March-April-May (MAM) air temperature (T: temperature; R: rainfall). MJJ Rainfall (mm) Nkrintra Singhrattna et al. 800 (a) R = -16.103T + 914.42 600 400 800 200 (b) 600 400 R = 18.276T + 23.72 200 27.5 28.0 28.5 29.0 29.5 30.0 30.5 31.0 MAM Temperature (°C) ASO Rainfall(mm) Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 30 Fig. 4 Scatter plots between March-April-May (MAM) air temperature and (a) May-June-July (MJJ) rainfall, and (b) August-September-October (ASO) rainfall (T: temperature; R: rainfall). The MAM temperature subsequently plays a role in driving pre-monsoon and monsoon season rainfall. Figure 4 shows the relationships between the MAM air temperature and both MJJ rainfall and ASO rainfall. During the pre-monsoon season or the transition period, the land–ocean temperature gradient is not fully developed. Hence, inverse relationships between MAM temperature and MJJ rainfall are found. During the monsoon season, as the positive relationships indicate, the higher MAM air temperature increases the ASO rainfall, whereas the lower MAM air temperature decreases the ASO rainfall. To further enhance the understanding of the land–sea temperature gradient that influences the seasonal rainfall, the relationships between the MAM surface temperature of nearby oceans and MJJ and ASO rainfall are investigated (Fig. 5). The positive relationships between MAM air temperature over the study basin and ASO rainfall are stronger than those of MJJ rainfall, which is consistent with the relationships in Fig. 4. For the ASO rainfall (Fig. 5(b)), the positive correlations at 95% significance level (i.e. upper and lower bounds of +0.26 and –0.26, respectively) are associated with the MAM surface temperature especially over the study basin; negative correlations at 99% significance level (between +0.34 and–0.34) are associated with MAM surface temperature over the South China Sea. This indicates the developing land– sea temperature gradient. To confirm the influence of land–sea temperature gradient on the monsoon rainfall, the differences in MAM temperature over the study basin and over the Hydroclimate variability and long-lead forecasting of rainfall over Thailand by large-scale atmospheric variables 31 (a) Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 (b) Fig. 5 Correlation maps for 1950–2007 between March-April-May (MAM) surface air temperature and (a) May-June-July (MJJ) rainfall, and (b) August-September-October (ASO) rainfall. The 95% and 99% significance levels correspond to ±0.26 and ±0.34, respectively. regions of nearby oceans such as the Pacific Ocean (NINO 3 and NINO 4), the Indian Ocean (ION) and the South China Sea (SCS) are estimated and correlated to the MJJ and ASO rainfall (Table 1). With the 90% confidence level, the significant Table 1 Cross-correlations between seasonal rainfall during May-June-July (MJJ) and August-September-October (ASO) and temperature differences during March-AprilMay (MAM) over the study basin and four different regions over the oceans. Season Correlation coefficient: NINO 3 MJJ rainfall ASO rainfall relationships between MJJ rainfall and MAM temperature differences are found to correspond to the NINO 3 region in the Pacific Ocean. The ASO rainfall develops stronger relationships than MJJ rainfall with the MAM temperature differences corresponding to the regions of NINO 4, ION and SCS. Thus, the land–sea temperature gradients developed from different sources have an influence on the seasonal rainfall. However, more investigation and detailed study are required to distinguish the influences on local hydroclimates. ∗ 0.23 0.18 NINO 4 ION SCS 0.04 0.25∗ −0.07 0.25∗ −0.12 0.35∗∗ NINO 3: a region over the central equatorial Pacific Ocean between 5◦ N–5◦ S latitude and 90–150◦ W longitude; NINO 4: a region over the western Pacific Ocean between 5◦ N–5◦ S latitude and 150◦ W–160◦ E longitude; ION: a region over the Indian Ocean between 2◦ N–2◦ S latitude and 70–90◦ E longitude; SCS: a region over the South China Sea between 5◦ N–5◦ S latitude and 100–115◦ E longitude. Figures with ∗ and ∗∗ represent the correlations being significant at 90% and 99% confidence level, respectively. 4.3 Trend of seasonal hydroclimates To support the influences of temperature on the seasonal rainfall and to investigate the variability of seasonal hydroclimates over the decades, trend analysis of seasonal rainfall and temperature is carried out. The time series of seasonal rainfall, i.e. dry (November–April), pre-monsoon (MJJ) and monsoon (ASO) season, are presented in Fig. 6, and the time series of MAM temperature are shown in Fig. 7. Based on the linear trend, the dry-season rainfall Rainfall (mm) 32 Nkrintra Singhrattna et al. 1000 (a) Time series Linear regression fit 800 600 P = 0.2858t - 424 400 200 1000 0 (b) 800 600 400 (c) 800 600 400 200 P = -1.8239t + 4167 2005 2010 2000 1990 1995 1985 1975 1980 1970 1960 1965 1950 0 1955 Rainfall (mm) 0 1000 Year (t) Fig. 6 Time series of rainfall during: (a) the dry season (November–April), (b) the pre-monsoon season (MJJ), and (c) the monsoon season (ASO) (P: rainfall). 31 T = -0.0107t + 50.379 30 29 5 PREDICTOR IDENTIFICATION 28 5.1 Correlations with large-scale atmospheric variables Time series Linear regression fit 2010 2005 2000 1995 1990 1985 1980 1975 1970 1965 1960 1955 27 1950 MAM Temperature (°C) Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 200 Rainfall(mm) P = -0.6109t + 1649 where β̂ is the slope of a fitting regression (yi = α + β̂xi + εi ), β 0 is the specific value for testing, i.e. 0 in this case, and SE is the standard error (ε i ) of leastsquares of the estimates. The decreasing trend of MAM temperature subsequently causes the decreasing trends in the MJJ and ASO rainfall, as shown in Fig. 6, due to the weakening of development of land–sea temperature gradient. For the period 1950–2007, the MJJ rainfall shows a decreasing trend of 34.2 mm, and the ASO rainfall shows a decreasing trend of 102.1 mm. It is also important to note that the years reported as the warm phase of ENSO or El Niño (CPC 2006, COAPS 2006) coincided with below-normal MJJ and ASO rainfall, and vice versa during the La Niña years. This is corroborated by Singhrattna et al. (2005b) and Chen and Yoon (2000). Moreover, the decreasing trend of rainfall is associated with that reported by the Intergovernmental Panel on Climate Change (Trenberth et al. 2007). Based on historical observations (1900–2005), they found that precipitation over the areas between 10◦ N and 30◦ N tended to decline since 1970 due to global warming and climate change. However, the precipitation variability is dependent upon region and season. Year (t) Fig. 7 Time series of MAM air temperature (T). shows a slightly increasing trend of 16.0 mm over 57 years. Based on the inverse correlations between dry-season rainfall and MAM temperature (Fig. 3), the increasing trends of dry-season rainfall tend to decrease the MAM temperature, as shown in Fig. 7. There is a decrease of 0.6◦ C in the MAM temperature over 56 years of data. This trend is statistically significant at 95% confidence level by the standard t-test (Haan 2002), which is applied to test a null hypothesis of the slope of a regression line or linear trend being different from a specific value. The standard t-test equation is as follows: t= β̂ − β0 SE (2) The climate diagnostic shows that the increasing land–sea temperature gradient strengthens the monsoons and influences the rainfall. Since an ultimate objective of this study is to develop a model to forecast rainfall at long lead times using the large-scale atmospheric variables, this section aims to develop the relationships between rainfall in the study basin and other large-scale atmospheric variables such as SAT, SLP, SXW and SYW at various lead times to identify the predictors of rainfall. The relationships are developed using correlation maps which are the interactive plots and analysis provided by the Earth System Research Laboratory (ESRL 2008). Although the physical long lead–lag relationship is not plausible, Sahai et al. (2003) found that the correlation patterns between Indian summer monsoon rainfall (ISMR) and SST show a slow and consistent temporal evolution which addresses the significant lead time relationships of SST even four years prior to the monsoon season. Clark et al. (2000) also used various Indian Ocean indices at long lead Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 Hydroclimate variability and long-lead forecasting of rainfall over Thailand by large-scale atmospheric variables times to develop the relationships for rainfall predictions. Nicholls (1983) found relationships between ISMR and 16-month leading SST near Indonesia that can show the long lead time relationships and the influences of large-scale atmospheric circulation at distance from the source (Saravanan and Chang 2000, Harshburger et al. 2002, Tereshchenko et al. 2002). Therefore, in this study, the correlation maps are adopted using the large-scale atmospheric variables, e.g. SAT, SLP and surface wind with varying lead times from 4 to 15 months prior to season starting. The correlation maps also cover the oceans, e.g. the Indian Ocean, the Pacific Ocean and the South China Sea that indicate influences on rainfall in the study basin. The predictors are selected based on longleading relationships and over regions giving high correlations at 95% significance level. Then, the optimal combination cases of predictors will be identified by cross-validated multiple regression and used in the forecasting model. 33 estimation at the dropped point obtained from the fitting regression, n is the total number of observed data, and m is the number of independent variables or predictors applied to the fitting multiple regression. From the 2k – 1 combination cases, the best combination of predictors is selected due to the minimum GCV which is associated with the smallest error from the fitting regression using the minimum number of mutually exclusive variables. 6 FORECASTING MODEL 6.1 Description of modified k-nearest neighbour (k-nn) model The nonparametric regression is an alternative of the fitting data model. The regression equation is as follows: y = f (x1 , x2 , x3 , . . . , xk ) + e (4) 5.2 Combination cases of the predictors To avoid redundancy in the case of several identified predictors, the optimal sub-set of predictors that is composed of the minimum number of mutually exclusive variables is selected by model selection methods. In this study, a cross-validated multiple regression, namely the generalized cross-validation (GCV) with the leave-one-out technique is adopted to select an optimal combination of predictors. For k multiple independent variables, there are 2k – 1 crosscombination cases of variables. Each combination case is evaluated using the leave-one-out technique, which is applied to all points of data (n) by dropping one point out of the dependent (yi ) and independent (xi ) data set. Then, the regression is developed to fit the remaining points of data (n – 1). At the dropped point, the estimation of dependent variable (yi ) is done with the developed regression using the dropped independent variables (xi ). The residual (yi – yi ) at this point is also calculated. The procedure is repeatedly applied to all points of data (n), and the GCV is calculated by: GCV = 2 n yi − y i n i=1 (1 − m/n)2 (3) where yi is observed data at the dropped point (xi ) based on the leave-one-out technique, yi is the where f is the fitting function of the independent variables x1 , x2 , x3 , . . ., xk (in this case, the identified large-scale atmospheric variables); y is the dependent variable (in this case, seasonal rainfall); and e is the error or residual assumed to be normally distributed with mean = 0 and variance = σ . Unlike the parametric approach, the prior assumption of relationships between dependent and independent variables such as linear relationships, which is one of the drawbacks of parametric models, is not required for the nonparametric approach. The fitting functions (f ) capture relationships locally or by a small set of neighbours (k) at a given point (xi ). Because the functions are flexible and able to fit any arbitrary data, for example, bivariate data and multivariate data, the nonparametric approach can determine relationships better than the parametric regression. Moreover, a drawback of global fitting in the parametric approach, where all points of data may have a large influence on an individual point of a curve, can be solved by the local fitting functions. There are several approaches of nonparametric regression. One that grants the discontinuities of the derivative curve is the spline approach. Another that applies a regression locally to a given point (xi ) and the neighbours around xi is referred to as the local polynomials approach. This approach includes locally-weighted polynomials (Loader 1999) and knearest neighbour (k-nn) local polynomials (Owosina 34 Nkrintra Singhrattna et al. Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 1992, Rajagopalan and Lall 1999). Two parameters— the size of neighbourhood (k), referred to as bandwidth, and the order of polynomials (p)—are required to develop the fitting regression. Both parameters can be estimated by various criteria and objective functions such as GCV and likelihood. In terms of forecast, the modified k-nn model is adopted to forecast seasonal rainfall using the identified large-scale atmospheric predictors as the independent variables (xi ). The fitting of a regression, rainfall forecast and resample data to achieve rainfall ensembles are described in the following steps: 1. For the fitting process, the size of neighbours (k) and the order of polynomial (p), normally 1 or 2, will be selected as the combination of k and p obtaining the minimum GCV score by: GCV(k, p) = 2. 3. 4. 5. n e2 i n i=1 (1 − m/n)2 (5) where ei is the error, n is the number of data points (xi ), and m is the number of parameters. The regression is fitted locally with k and p obtained from Step 1. The rainfall according to the developed fitting regressions is then estimated and called the mean estimations (ȳ1 , ȳ2 , ȳ3 , . . . , ȳn ). Then, the residuals (e1 , e2 , e3 , . . . , en ) are computed. At a new point of independent variable (xnew ), the forecast of rainfall is required. The mean estimation (ȳnew ) is obtained from the developed fitting regression. An ensemble forecast is then obtained by adding the residual (ei ) to ȳnew . The residual ei corresponds to one of the k-nearest neighbours (k-nn) of xnew which is randomly selected by using a weight function as follows: W (j) = 1/j k (6) (1/i) i=1 where W (j) is the weight value of a neighbour of xnew whose distance from xnew falls in the jth rank, and k is the size of the neighbours, which does not have to √ be the same as for fitting polynomial. The term n − 1 is used, in practice, to estimate k where n is the total number of xi . The weight function gives more weight to the nearest neighbour and less weight to the farthest neighbour. From the weight equation, the distances between xnew and all points of xi have to be estimated. There are several methods to calculate distance between two points of data, such as the Euclidean distance and Mahalanobis distance. The Euclidean distance is applied in this study and the equations of the Euclidean distance for univariate and multivariate data are shown in equations (7a) and (7b), respectively: n di = (xnew − xi )2 (7a) i=1 2 m di = xnew,j − xi,j (7b) j=1 where i = 1, 2, 3, . . . , n, and m is the number of predictors. 6. Repeat Step 5 as many times as required to achieve the forecasting ensembles of rainfall, or a number of simulations (N); in this case N = 300. 7. Repeat steps 4–6 for each forecasting point. The N ensembles of forecasting rainfall can develop a box-plot or a probability density function (pdf). It can provide an exceedence and a nonexceedence probability of the extreme events, e.g. wet or dry. The exceedence and non-exceedence probabilities can support water resources managers to implement a strategic planning of water allocation and management and agricultural practices. 6.2 Model evaluation The likelihood function (lhf) is applied to evaluate the modified k-nn model on capturing the pdf which is an essential tool of decision making for water resources management. The initial step of lhf calculation is to divide the climatological data of 1950–2007 into three categories: below-normal (dry), normal and above-normal (wet). In this case, three categories are determined separately for each season of rainfall by dividing the seasonal rainfall data at the 33rd percentile and the 67th percentile: the rainfall data below the 33rd percentile fall into the dry category, whereas those above the 67th percentile are in the wet category; the remaining data are in the normal category. Next, the categorical probabilities of climatology that are the proportion of historical data in each category are calculated. Subsequently, for a given year, rainfall ensembles in each category Hydroclimate variability and long-lead forecasting of rainfall over Thailand by large-scale atmospheric variables (out of 300 ensemble members obtained from the modified k-nn model) are also determined and the categorical probabilities are estimated. The categorical probabilities of model ensembles for a given year are calculated by the proportion of ensemble members falling in each category—in this case, they are computed separately for each season, year and forecasting lead time. Since the historical rainfall data in this case are divided at the 33rd and 67th percentiles, the climatological categorical probability of each category is one third. Then, the lhf for a given year is estimated as: Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 n lhft = t=1 n P̂j,t (8) Pcj,t t=1 where n is the number of years; j is the category of the observed value in year t; P̂j,t is the probability of rainfall ensembles in category j in year t; P̂j,t = (P̂1,t , P̂2,t , P̂3,t , . . . , P̂k,t ), in which k is the number of categories; and Pcj,t is the climatological probability of category j in year t, which in this case indicates an equal value for all three categories, i.e. 1/3. For example, for 1972, the observed MJJ rainfall falls in the dry category (i.e. <410 mm, or less than the 33rd percentile of 1950–2007 MJJ rainfall). Out of 300 ensemble members obtained from the modified k-nn model, there are 212 ensembles having less than 410 mm. Therefore, P̂dry,1972 = 212/300, and lhf1972 = (212/300)/(1/3) = 2.12. The lhf score varies from 0.0 to number of categories, i.e. 3.0 in this study. A score of 0.0 indicates lack of model performance in capturing the pdf of climatology, while a score greater than 1.0 indicates better performance than the climatology. A score of 3.0 indicates perfect performance of the model. 7 RESULTS AND DISCUSSION 7.1 Identifying predictors and combination cases To develop a long-lead forecasting model, the statistical relationships between rainfall during MJJ, ASO, NDJ and FMA and the large-scale atmospheric variables are developed by correlation maps with varying lead times of the atmospheric variables. Figure 8 shows examples of developed relationships between pre-monsoon season (MJJ) rainfall and SAT for 12 different lead times varying from 35 4 to 15 months before the beginning of the premonsoon season in May. The solid box represents the selected region of predictor which has high correlation at the 95% significance level (upper and lower bounds of +0.26 and –0.26, respectively). Statistical relationships from correlation maps were also developed for the variables SLP, SXW and SYW and ASO, NDJ and FMA rainfall data (not shown). The significant relationships of monsoon rainfall (i.e. MJJ and ASO rainfall) show development with the large-scale atmospheric variables over the study basin and the nearby seas and oceans, e.g. South China Sea, the Indian Ocean and the Pacific Ocean. The developed relationships are found to be associated with long lead times of atmospheric variables of 8–15 months prior to the start of the season. This is corroborated by Sahai et al. (2003) and Nicholls (1983), who presented long-lead relationships between Indian summer monsoon rainfall and atmospheric variables, e.g. SST. The significant relationships between monsoon (i.e. MJJ and ASO) rainfall and large-scale atmospheric variables suggest the ability of the rainfall forecasting models to make long-lead forecasts using the identified atmospheric predictors, which is hardly found in any study of Thailand rainfall forecasting (Hung et al. 2009, Singhrattna et al. 2005a, Weesakul and Lowanichchai, 2005). However, the dry-season (i.e. NDJ and FMA) rainfall gave shorter lead-time relationships with large-scale atmospheric variables of 7–10 months. Furthermore, significant links are found over distant regions, e.g. Sumatra and Java (Indonesia), and northeast India, which shows the remote influence of atmospheric circulation (Harshburger et al. 2002, Tereshchenko et al. 2002). Table 2 summarizes the identified predictors of largescale atmospheric variables. It is also noted that the significant relationship between SLP and NDJ rainfall is hardly found over the region, e.g. the study basin, the Pacific Ocean and the South China Sea. Moreover, the observed rainfall data (RAIN) are included in the identified predictors because, from the lag autocorrelation analysis (Fig. 9), the negative correlations at 95% significance level are associated with a 6-month lag (and the positive correlations – with a 12-month lag), which indicates the ability of forecasting models to give long-lead forecasts using RAIN as the predictor. Hence, there are five identified predictors each of the MJJ, ASO and FMA rainfall, and four predictors for the NDJ rainfall. Subsequently, an optimal combination of predictors is selected by cross-validated multiple regression. Based on k independent predictors, the 2k – 1 Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 36 Nkrintra Singhrattna et al. (a) (b) (c) (d) (e) (f) (g) (h) (i) (j) (k) (l) Fig. 8 Correlation maps between MJJ rainfall and surface air temperature (SAT) at 15- to 4-month lead times, respectively: (a) FMA, (b) MAM, (c) AMJ, (d) MJJ, (e) JJA, (f) JAS, (g) ASO, (h) SON, (i) OND, (j) NDJ, (k) DJF and (l) JFM. The 95% significance levels correspond to ±0.26. cross-combination cases can be set. In this study, the optimal sub-sets are identified separately for each season of rainfall and each lead period of predictors that can provide forecasts varying from 1 to 12 months prior to the season starting. For example, for MJJ rainfall, the 15-month lead-time predictors, which are averaged during February-March-April (FMA) in the current year, are used to forecast the MJJ rainfall of the following year. The forecast can be issued on 1 May, i.e. 12 months in advance. From the GCV, an optimal sub-set of predictors is selected from the 2k – 1 cross-combination cases corresponding to the minimum GCV. Table 3 summarizes the best combination cases of predictors for rainfall during MJJ, ASO, NDJ and FMA. The selected combinations consist of 1–2 mutually exclusive predictors, except for Hydroclimate variability and long-lead forecasting of rainfall over Thailand by large-scale atmospheric variables Table 2 Summary of the large-scale atmospheric predictors identified from the correlation maps for seasonal rainfall during May-June-July (MJJ), August-SeptemberOctober (ASO), November-December-January (NDJ) and February-March-April (FMA). Season of rainfall MJJ NDJ FMA 20.0◦ N 7.5–10.0◦ N 0◦ 0–2.5◦ N 15.30–19.36◦ N 2.5–5.0◦ N 17.5–20.0◦ N 10.0◦ N 10.0◦ N 15.30–19.36◦ N 2.5–7.5◦ S 2.5–5.0◦ S 20.0–22.5◦ N 15.30–19.36◦ N 7.5◦ S 15.0◦ N 17.5◦ N 2.5◦ S–2.5◦ N 15.30–19.36◦ N SAT SLP SXW SYW RAIN SAT SLP SXW SYW RAIN SAT SXW SYW RAIN SAT SLP SXW SYW RAIN Longitude 97.5–102.5◦ E 102.5–107.5◦ E 82.5–87.5◦ E 172.5–175.0◦ E 98.05–100.70◦ E 107.5–110.0◦ E 97.5–100.0◦ E 100.0–102.5◦ E 95.0–97.5◦ E 98.05–100.70◦ E 97.5–102.5◦ E 62.5–67.5◦ E 85.0◦ E 98.05–100.70◦ E 110.0–112.5◦ E 187.5–192.5◦ W 140.0–150.0◦ E 95.0–97.5◦ E 98.05–100.70◦ E ACF 0.0 0.5 1.0 SAT: surface air temperature; SLP: sea-level pressure; SXW: surface zonal wind; SYW: surface meridian wind; RAIN: observed rainfall over study basin. –0.5 Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 ASO Atmospheric Spatial coverage variables Latitude 0 5 10 15 Lag 20 Fig. 9 Correlogram or lag autocorrelation (ACF) of monthly rainfall for 1950–2007. The dashed lines represent the correlations being significant at the 95% confidence level. two combination cases for ASO rainfall corresponding to the 6- and 10-month forecasting lead times that address three predictors of SAT, SXW and SYW, and SAT, SYW and RAIN, respectively. The identified sub-sets indicate that the univariate regression of large-scale atmospheric variables (e.g. SXW or SYW) and the multivariate regression (e.g. SYW and RAIN 37 or SLP and SXW) are more efficient than the univariate regression of observed rainfall (i.e. RAIN). This suggests the better performance of a fitting regression and forecasting model with the incorporation of large-scale atmospheric predictors, e.g. SAT and SLP (Basson and Rooyen, 2001, Trenberth et al. 2006, Sadhuram and Ramana Murthy, 2008) compared to developing a regression from only lagging relationships of hydroclimatic variables, e.g. RAIN. 7.2 Forecasting evaluation The lhf scores used to evaluate the modified k-nn model with the leave-one-out technique are computed separately for each season of rainfall, for each year of 1950–2007 and each forecasting lead time. Figure 10 shows plots of median lhf for seasonal rainfall during MJJ, ASO, NDJ and FMA. A score of 0.0 indicates lack of model performance in capturing the pdf of climatology, whereas a score greater than 1.0 indicates better model performance than climatology. The darker shading represents higher lhf scores or better model performance. From the median lhf of 1950–2007 rainfall ensembles (Fig. 10(a)), high scores can be found corresponding to the forecast of MJJ and ASO rainfall at long lead times (e.g. 7–9 months and 11 months). The successful longlead forecasting of the modified k-nn model indicating higher skills than climatology (i.e. lhf scores greater than 1.0) was hardly found in previous studies (Kulkarni, 2000, DelSole and Shukla, 2002). The modified k-nn model also performed better than the climatology of MJJ and ASO rainfall at short forecasting lead times, e.g. 1–2 months (Singhrattna et al. 2005a, Grantz et al. 2005). However, associated with lhf scores of less than 1.0, the forecasting performance of the model with dry-season (i.e. NDJ and FMA) rainfall decreases considerably even at short lead times because, in terms of physical mechanism, the dry-season rainfall over the study basin is influenced by unstable local conditions, e.g. higher surface temperature and humidity, at finer time scales, e.g. hourly and daily (TMD 2007). Hence, the modified k-nn model, which is developed based on the relationships between rainfall and large-scale atmospheric variables at seasonal time scales, performs better in monsoon rainfall forecasting than dry-season rainfall forecasting. To compare the prediction ability of the modified k-nn model in extreme years, Fig. 10(b) and (c) present plots of median lhf for dry rainfall years (rainfall below the 33rd percentile) and wet rainfall years 38 Nkrintra Singhrattna et al. Table 3 Summary of the optimal sub-sets of predictors (•) identified from the cross-validated multiple regression for seasonal rainfall during (a) May-June-July (MJJ), (b) August-September-October (ASO), (c) November-December-January (NDJ) and (d) February-March-April (FMA). (SAT: surface air temperature; SLP: sea-level pressure; SXW: surface zonal wind; SYW: surface meridian wind; RAIN: observed rainfall over study basin.) Predictor SAT SLP Season of predictors (forecasting lead time in month) SXW SYW RAIN • • • • • • Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 (a) MJJ Rainfall FMA (12) MAM (11) AMJ (10) MJJ (9) JJA (8) JAS (7) ASO (6) SON (5) OND (4) NDJ (3) DJF (2) JFM (1) (b) ASO Rainfall MJJ (12) JJA (11) JAS (10) ASO (9) SON (8) OND (7) NDJ (6) DJF (5) JFM (4) FMA (3) MAM (2) AMJ (1) • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • (c) NDJ Rainfall ASO (12) SON (11) OND (10) NDJ (9) DJF (8) JFM (7) FMA (6) MAM (5) AMJ (4) MJJ (3) JJA (2) JAS (1) • • • • • • • • • • • • • • (d) FMA Rainfall NDJ (12) DJF (11) JFM (10) FMA (9) MAM (8) AMJ (7) MJJ (6) JJA (5) JAS (4) ASO (3) SON (2) OND (1) • • • • • • • • • • • • Hydroclimate variability and long-lead forecasting of rainfall over Thailand by large-scale atmospheric variables 2 8 6 1 4 0 FMA 2 NDJ 2 3 10 ASO 4 12 MJJ FMA NDJ MJJ 2 Forecasting Lead Time (month) 4 6 NDJ 6 8 FMA 8 10 ASO 10 12 MJJ Forecasting Lead Time (month) 12 ASO Forecasting Lead Time (month) Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 (c) (b) (a) 39 Fig. 10 Median likelihood (lhf) scores for seasonal rainfall during May-June-July (MJJ), August-September-October (ASO), November-December-January (NDJ) and February-March-April (FMA) for: (a) 1950–2007, (b) dry years, and (c) wet years. (rainfall above the 67th percentile), respectively. For MJJ rainfall, the lhf scores of dry years are higher than those of wet years, especially at long lead times, e.g. 9–12 months. In contrast, the modified k-nn model shows the greater predictability of ASO rainfall in wet years than in dry years. However, the highest lhf scores (i.e. 2.0–2.5) for ASO rainfall forecast in both dry and wet years can be found associated with the 6-month forecasting lead time, which shows an impressive ability to forecast extreme events with a long lead period. It is also noted that the better performance than climatology (i.e. lhf greater than 1.0) of wet ASO rainfall can be observed for all 12 forecasting lead times. In addition, the good performance of long-lead rainfall forecasts is obtained for dry NDJ (and short-lead forecasts for wet NDJ). For FMA rainfall, the forecast of dry years shows better performance than wet years, in particular, at the forecasting lead time of 5–6 months. Therefore, the large-scale atmospheric variables are identified and used as predictors of the modified k-nn model to forecast seasonal rainfall in the Ping River Basin for 12 forecasting lead periods. The optimal combination sets of predictors that are determined by GCV consist of 1–2 predictors. This can confirm the comparative performance of a fitting regression and forecasting model using the largescale atmospheric predictors. Since the irrigated area of the study basin covers 2332 km2 , and the agriculture depends on both the rainfall during the monsoon season and the irrigated water (also based on rainfall), the ability of the modified k-nn model to make long-lead forecasts for monsoon rainfall, e.g. 7–9 months prior to the season starting, is very useful for water resources planning and management, agricultural practices and cropping strategies. Moreover, the operation planning of several storage dams along the Ping River including the Bhumipol Dam can be done in advance to deal with the extreme events, i.e. dry and wet episodes, and to efficiently serve the water demands of the Ping and Chao Phraya river basins, consisting of 6127 and 11 000 × 106 m3 per year, respectively. The non-exceedence and exceedence probabilities of extreme events obtained from the pdf of rainfall forecasting ensembles serve as a tool for decision making in reservoir operation, and the modified k-nn model shows better performance in capturing the pdf than the climatology. 8 CONCLUSION This study enhances the understanding of local hydroclimate variability and develops statistical relationships between rainfall and large-scale atmospheric variables. The land–sea temperature gradient plays a key role in strengthening or weakening the monsoon over the Upper Chao Phraya River Basin in Thailand. The amount of dry-season (November–April) rainfall determines the MAM temperature which is responsible for setting up the land–sea temperature gradient. Higher MAM temperature tends to increase monsoon (i.e. ASO) rainfall. Due to the anomalous atmospheric variables, the below-normal rainfall corresponds to the El Niño phase of ENSO and, conversely, the above-normal rainfall corresponds to the La Niña phase. Using the correlation maps, the large-scale atmospheric variables are identified as the predictors Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 40 Nkrintra Singhrattna et al. of seasonal rainfall. The significant relationships between rainfall and large-scale atmospheric variables indicate the long-lead forecasting ability of forecasting models. There are five identified predictors for each of MJJ, ASO and FMA rainfall, and four identified predictors for NDJ rainfall. Based on the cross-validated multiple regression, optimal combinations of predictors are selected separately for each season of rainfall and forecasting lead period. The selected combinations composed of one or two mutually exclusive predictors present the comparative performance of a regression and forecasting model by incorporation of the large-scale atmospheric variables. The modified k-nn model is ultimately developed to forecast rainfall at long lead times using the optimal sets of predictors. Based on lhf scores, the modified k-nn model shows the ability to give long-lead forecasts of MJJ and ASO rainfall at 7–9 months lead time. However, decreasing performance is found in dry-season rainfall forecasts at both short and long lead times. The ability to forecast rainfall in dry MJJ is more significant than in wet MJJ, especially at lead times of 9–12 months. The best performance of ASO rainfall forecast in extreme years, i.e. dry and wet, is observed at a 6-month lead time. The long-lead forecasting ability of the modified k-nn model for rainfall in the Ping River Basin is an impressive result that can be applied to manage water resources and develop the long-term plans for reservoirs. Acknowledgements The authors would like to thank Dr Sutat Weesakul and Dr Kiyoshi Honda for their valuable comments. The authors also thank the Royal Irrigation Department of Thailand, the Thailand Meteorological Department and the Department of Water Resources for providing the data for the study. The first author is grateful to the Asian Institute of Technology, Thailand for the graduate fellowships, to the Department of Public Works and Town & Country Planning for its kind support and to Piyawat Wuttichaikitcharoen for providing some data used in this research. The research funding from CIRAD is gratefully acknowledged. Finally, the authors would like to thank the anonymous reviewers whose comments improved the manuscript. REFERENCES An, S.I., Kug, J.S., Timmermann, A., Kang, I.S. and Timm, O., 2007. The influence of ENSO on the generation of decadal variability in the North Pacific. Journal of Climate, 20 (4), 667–680. Basson, M.S. and Rooyen, J.A., 2001. Practical application of probabilistic approaches to the management of water resource systems. Journal of Hydrology, 241 (1–2), 53–61. Cañón, J., González, J. and Valdés, J., 2007. Precipitation in the Colorado River basin and its low frequency associated with PDO and ENSO signal. Journal of Hydrology, 333 (2–4), 252–264. Chen, T.-C. and Yoon, J.-H. (2000) Interannual variation in Indochina summer monsoon rainfall: possible mechanism. Journal of Climate, 13 (11), 1979–1986. Clark, C.O., Cole, J.E. and Webster, P.J., 2000. Indian Ocean SST and Indian summer rainfall: predictive relationships and their decadal variablitiy. Journal of Climate, 13, 2503–2519. Clark, M.P., Serreze, M.C. and McCabe, G.J., 2001. Historical effects of El Niño and La Niña events on the seasonal evolution of the montane snowpack in the Columbia and Colorado River basins. Water Resources Research, 37 (3), 741–757. COAPS, 2006. ENSO index according to JMA SSTA (1868 – present). http://coaps.fsu.edu/jma.shtml. CPC (Climate Prediction Center), 2006. Cold and warm episodes by season. http://www.cpc.ncep.noaa.gov/products/ analysis_monitoring/ensostuff/ensoyears.shtml. CPC (Climate Prediction Center), 2007. The tropical Pacific Ocean SST indices. http://www.cpc.noaa.gov/data/indices. DelSole, T. and Shukla J., 2002. Linear prediction of Indian monsoon rainfall. Journal of Climate, 15, 3645–3658. Eldaw, A.K., Salas, J.D. and Garcia, L.A., 2003. Long-range forecasting of the Nile River flows using climatic forcing. Journal of Applied Meteorology, 42 (7), 890–904. ESRL (Earth System Research Laboratory), 2008. Interactive plotting and analysis: linear monthly/seasonal correlations. http:// www.esrl.noaa.gov/psd/data/correlation. Fasullo, J. and Webster, P.J., 2002. Hydrological signatures relating the Asian summer monsoon and ENSO. Journal of Climate, 15(21), 3082–3095. Gershunov, A., 1998. ENSO influence on intraseasonal extreme rainfall and temperature frequencies in the contiguous United States: implications for long-range predictability. Journal of Climate, 11 (12), 3192–3203. Goswami, B.N., Wu, G. and Yasunari, T., 2006. The annual cycle, intraseasonal oscillation, and roadblock to seasonal predictability of the Asian summer monsoon. Journal of Climate, 19 (20), 5078–5099. Grantz, K., Rajagopalan, B., Clark, M. and Zagona, E., 2005. A technique for incorporating large-scale climate information in basin-scale ensemble streamflow forecasts. Water Resources Research, 41, W10410, doi:10.1029/2004WR003467. Gutiérrez, F. and Dracup, J.A., 2001. An analysis of the feasibility of long-range streamflow forecasting for Colombia using El Niño-Southern Oscillation indicators. Journal of Hydrology, 246 (1–4), 181–196. Haan, C.T., 2002. Statistical methods in hydrology, second edition. Iowa: Iowa State Press. Hamlet, A.F., Huppert, D. and Lettenmaier, D.P., 2002. Economic value of long-lead streamflow forecasts for Columbia River hydropower. Journal of Water Resource Planning – ASCE 128 (2), 91–101. Harshburger, B., Ye, H. and Dzialoski, J., 2002. Observation evidence of the influence of Pacific SSTs on winter precipitation and spring stream discharge in Idaho. Journal of Hydrology, 246 (1–4), 157–169. Hung, N.Q., Babel, M.S., Weesakul, S. and Tripathi, N.K., 2009. An artificial neural network model for rainfall forecasting in Bangkok, Thailand. Hydrology and Earth System Sciences, 13, 1413–1425. Kalnay et al., 1996. The NCEP/NCAR 40-year reanalysis project. Bulletin of the American Meteorological Society, 77, 437–470. Downloaded by [Nkrintra Singhrattna] at 16:41 23 January 2012 Hydroclimate variability and long-lead forecasting of rainfall over Thailand by large-scale atmospheric variables Kim, T.W., Valdés J.B.., Nijssen, B. and Roncayolo, D., 2006. Quantification of linkages between large-scale climatic patterns and precipitation in the Colorado River basin. Journal of Hydrology, 321 (1–4), 173–186. Krishna Kumar, K., Soman, M.K., and Rupakumar, K. (1995) Seasonal forecasting of Indian summer monsoon rainfall: a review. Weather, 50, 449–467. Krishnan, R., Zhang, C. and Sugi, M., 2000. Dynamics of breaks in the Indian summer monsoon. Journal of Atmospheric Science, 57 (9), 1354–1372. Kulkarni, A., 2000. A note on the performance of IMD prediction model for ISMR. Annual Monsoon Workshop of Indian Meteorological Society, Pune Chapter, 20 December 2000. Loader, C. (1999) Local regression and likelihood. New York: Springer. Masiokas, M.H., Villalba, R., Luckman, B.H., Quesne, C.L. and Aravene, J.C., 2006. Snowpack variations in the central Andes of Argentina and Chile, 1951–2005: large-scale atmospheric influences and implications for water resources in the region. Journal of Climate, 19 (24), 6334–6352. Mason, S.J. and Goddard, L., 2001. Probabilistic precipitation anomalies associated with ENSO. Bulletin of the American Meteorological Society, 82, 619–638. McCabe, G.J. and Dettinger, M.D., 2002. Primary modes and predictability of year-to-year snowpack variation in the western United States from teleconnections with Pacific Ocean climate. Journal of Hydrometeorology, 3 (1), 13–25. Meko, D.M. and Woodhouse, C.A., 2005. Tree-ring footprint of joint hydrologic drought in Sacramento and upper Colorado river basins, western USA. Journal of Hydrology, 308 (1–4), 196–213. Mendoza, B. et al., 2005. Historical droughts in central Mexico and their relation with El Niño. Journal of Applied Meteorology, 44 (5), 709–716. Nagura, M. and Konda, M., 2007. The seasonal development of an SST anomaly in the Indian Ocean and its relationship to ENSO. Journal of Climate, 20 (1), 38–52. Nicholls, N., 1983. Predicting Indian monsoon rainfall from sea surface temperature in the Indonesia–north Australia area. Nature, 306, 576–577. Owosina, A., 1992. Methods for assessing the space and time variability of ground water data. Thesis (MSc), Utah State University, Utah, USA. Pavia, E.G., Graef, F. and Reyes, J., 2006. PDO-ENSO effects in the climate of Mexico. Journal of Climate, 19 (24), 6433–6438. PSD (Physical Sciences Division, NOAA), 2007a. Create a monthly/seasonal mean time series from the NCEP reanalysis data set. http://www.esrl.noaa.gov/psd/cgi-bin/data/timeseries/ timeseries1.pl. PSD (Physical Sciences Division, NOAA), 2007b. The Indian Ocean SST index. http://www.esrl.noaa.gov/psd/forecasts/sstlim/ timeseries/index.html. Rajagopalan, B. and Lall, U., 1999. A k-nearest-neighbor simulation for daily precipitation and other weather variables. Water Resources Research, 35 (10), 3089–3101. RID (Royal Irrigation Department), 2007. The 25 major river basins in Thailand. http://www.rid.go.th. Sadhuram, Y. and Ramana Murthy, T.V., 2008. Simple multiple regression model for long range forecasting of Indian summer 41 monsoon rainfall. Meteorology and Atmospheric Physics, 99 (1–2), 17–24. Sahai, A.K., Grimm, A.M., Satyan, V. and Pant, G.B., 2003. Long-lead prediction of Indian summer monsoon rainfall from global SST evolution. Climate Dynamics, 20, 855–863. Saravanan, R. and Chang, P., 2000. Interaction between tropical Atlantic variability and El Niño-Southern Oscillation. Journal of Climate, 13 (13), 2177–2194. Schöngart, J. and Junk, W.J., 2007. Forecasting the flood-pulse in Central Amazonia by ENSO-indices. Journal of Hydrology, 335 (1–2), 124–132. Shrestha, A. and Kostaschuk, R., 2005. El Niño/Southern Oscillation (ENSO)-related variability in mean-monthly streamflow in Nepal. Journal of Hydrology, 308 (1–4), 33–49. Shrestha, M.L., 2000. Interannual variation of summer monsoon rainfall over Nepal and its relation to Southern Oscillation index. Meteorolical and Atmospheric Physics, 75 (1–2), 21–28. Silverman, D. and Dracup, J.P., 2000. Artificial neural networks and long-range precipitation prediction in California. Journal of Applied Meteorology, 39 (1), 57–66. Singhrattna, N., Rajagopalan, B., Clark, M. and Kumar, K.K., 2005a. Forecasting Thailand summer monsoon rainfall. International Journal of Climatology, 25 (5), 649–664. Singhrattna, N., Rajagopalan, B., Kumar, K.K. and Clark, M., 2005b. Interannual and interdecadal variability of Thailand summer monsoon season. Journal of Climate, 18 (11), 1697–1708. Smith, S.R. and O’Brien J.J., 2001. Regional snowfall distributions associated with ENSO: implications for seasonal forecasting. Bulletin of the American Meteorological Society, 82 (6), 1179– 1191. Sourza Filho, F.A. and Lall, U., 2003. Seasonal to interannual ensemble streamflow forecasts for Cearra, Brazil: applications of a multivariate semiparametric algorithm. Water Resources Research, 39 (11), doi:10.1029/2002WR001373. Tereshchenko, I., Filonov, A., Gallesgos, A., Monzón C. and Rodríguez, R., 2002. El Niño 997–98 and the hydrometeorological variability of Chapala, a shallow tropical lake in Mexico. Journal of Hydrology, 264 (1–4), 133–146. TMD (Thailand Meteorological Department), 2007. The natural disasters in Thailand (in Thai). http://www.tmd.go.th/info/risk. pdf. Trenberth, K.E. et al., 2007. Observations: surface and atmospheric climate change. New York: Cambridge University Press. Trenberth, K.E., Moore, B., Karl, T.R. and Carlos, N., 2006. Monitoring and prediction of earth’s climate: a future perspective. Journal of Climate, 19 (20), 5001–5008. Weesakul, U. and Lowanichchai, S., 2005. Rainfall forecast for agricultural water allocation planning in Thailand. Thammasat International Journal of Science and Technology, 10 (3), 18–27. Woodruff, S.D. et al., 1993. Comprehensive Ocean – Atmosphere Data Set (COADS) Release 1a: 1980–92. Earth System Monitor, 4 (1), 1–8. Wu, R. and Kirtman, B.P., 2004. Understanding the impacts of the Indian Ocean on ENSO variability in a coupled GCM. Journal of Climate, 17 (20), 4019–4031. Zehe, E., Singh, A.K. and Bárdossy, A., 2006. Modelling of monsoon rainfall for a mesoscale catchment in North-West India II: stochastic rainfall simulations. Hydrology and Earth System Sciences 10, 807–815.
© Copyright 2026 Paperzz