Relating European temperature extremes to trends in mean

DISS. ETH NO. 20714
Relating European temperature extremes to trends in mean
temperature
A dissertation submitted to
ETH Z URICH
for the degree of
D OCTOR OF S CIENCES
presented by
A NDREAS L USTENBERGER
MSc in Atmospheric and Climate Science, ETH Zurich
born on 9 June 1982
citizen of Lucerne (LU)
accepted on the recommendation of
Prof. Dr. Reto Knutti, examiner
Dr. Erich M. Fischer, co-examiner
Dr. Mark Liniger, co-examiner
2012
Abstract
Extreme weather events have the potential to impact ecosystems, society and infrastructure.
Global and regional climate models are used to estimate changes in the frequency, intensity
and duration of extreme events. However, climate model simulations are costly and timeconsuming. Therefore, there are time periods or emission scenarios for which no simulations
are available. The aim of this thesis is to evaluate the skill of simple statistical models used to
fill in gaps in the time series of the frequency and intensity of temperature extremes for which
no data is available.
The main hypothesis of this thesis is that changes in the frequency and intensity of temperature
extremes are related to trends in European mean temperature. Thus, the objective of the first
part of this thesis is to investigate whether expected changes in the frequency of cold and warm
extremes are linearly related to trends in the annual and seasonal mean temperature. To explore
the potential relationships, linear regression models and two different methods to estimate the
slope parameter are used. The objective of the second part is to approximate expected changes
in the intensity of warm temperature extremes by shifting and scaling seasonal temperature
distributions.
The results show that the frequency of warm extremes is linearly related to changes in the annual mean temperature. The skill of the estimated frequencies depends on the slope parameter
estimation method, the trends in the mean temperature and the degree of linearity. The linearity
is determined by the shape of the parent temperature distribution. In addition, linear trends in
the frequency are higher for temperature distributions with small variance. The magnitude of
the trends and the interannual variability affect the robustness of the estimated slope parameter.
However, the uncertainty due to interannual variability is smaller than the uncertainty induced
by the different climate models. The skill of estimated frequency for temperature extremes
based on absolute temperature thresholds is improved when seasonal means are used. The direction and velocity of the shifted temperature distribution are the reasons that the estimated
frequency of cold extremes performs always worse than the estimated frequency of warm extremes, especially for cold extreme events lasting several consecutive days.
Linear regression models are not appropriate for discrete count data or when the exceedance
probability converges to 0% or 100%. The dependency of these issues on the used climate
model and the location is the reason that alternative methods such as, for instance, logistic regression models do not lead to improved estimates over all of Europe.
i
Concerning the second objective, the results show that changes in the intensity of warm extremes can be characterized reasonably well by shifting the full temperature distribution towards higher temperatures for one quarter to one half of Europe across all examined climate
models. In winter, cold extremes warm faster than warm extremes. Otherwise, summer warm
extremes warm faster than summer cold extremes. Therefore, the expected temperature variability is reduced in winter and enhanced in summer. If the variability change is asymmetric,
the additional scaling of the temperature distributions leads to large estimations biases because
the use of central moments is not appropriate. In these cases, the shifted and scaled intensities
of warm extremes perform better when the corresponding half of the temperature distribution is
used to estimate the variance. Over western, northern and parts of central Europe, there are significant changes in the shape of the upper tail of the projected temperature distributions. This
implies that the shape of the parent temperature distribution changes. To account for these complex changes in higher moments, detailed knowledge about future temperature distributions is
necessary and, therefore, contradicts the initial assumption of no available data for future time
periods.
ii
Zusammenfassung
Extreme Wetterereignisse haben ein Auswirkungspotenzial auf Ökosysteme, Gesellschaft
und Infrastruktur. Globale und regionale Klimamodelle werden verwendet um zukünftige
Änderungen in der Häufigkeit, Intensität und Dauer von Extremereignissen zu schätzen. Simulationen mit Klimamodellen sind jedoch zeitaufwendig und kostspielig. Es gibt daher Zeitperioden oder Emissionsszenarien ohne verfügbare Simulationen. Das Ziel dieser Studie ist
die Eignung von einfachen statistischen Modellen zu beurteilen unvollständige Zeitreihen der
Häufigkeit und Intensität von extremen Temperaturereignissen zu vervollständigen für die keine Daten verfügbar sind.
Die zugrundeliegende Hypothese dieser Studie ist, dass Änderungen in der Häufigkeit
und Intensität von Temperaturextreme linear zusammenhängen mit den Trends in den europäischen mittleren Temperaturen. Der erste Teil dieser Studie beschäftigt sich daher damit zu untersuchen, ob es einen linearen Zusammenhang gibt zwischen den zu erwartenden
Häufigkeitsänderungen und den Trends in den mittleren jährlichen und saisonalen Temperaturen. Lineare Regressionsmodelle und zwei verschiedene Methoden zur Schätzung des Steigungsparameters werden verwendet, um mögliche Zusammenhänge zu studieren. Das Ziel des
zweiten Teils der Studie ist die zu erwartende Intensitätsänderungen von warmen Temperaturextreme annährend zu beschreiben durch skalieren und verschieben von saisonalen Temperaturverteilungen.
Die Resultate zeigen, dass es einen linearen Zusammenhang gibt zwischen der Häufigkeit
von warmen Temperaturextreme und den Änderungen in den mittleren jährlichen Temperaturen. Die Schätzung der Häufigkeiten hängt dabei ab von der Methode zur Schätzung des
Steigungsparameters, den Trends in der mittleren Temperatur und der Linearität. Die Linearität selber ist festgelegt durch die Form der zugrundeliegenden Temperaturverteilung. Ausserdem sind die linearen Trends in der Häufigkeit grösser für Temperaturverteilungen mit kleiner Varianz. Die Grössenordnung des Trends und die interannuelle Variabilität bestimmen
wie robust der geschätzte Steigungsparameter ist. Die Unsicherheit aufgrund der interannuellen Variabilität ist jedoch kleiner als die Unsicherheit durch die verschiedenen Klimamodelle. Häufigkeitsschätzungen für extreme Temperaturereignisse, die auf absoluten Temperaturgrenzwerten basieren, sind besser, wenn saisonale Mittelwerte verwendet werden. Die Richtung und Geschwindigkeit der verschobenen Temperaturverteilung sind Gründe dafür, dass die
Häufigkeitsschätzungen von kalten Extremereignissen immer schlechter sind als für warme
Extremereignisse, vor allem für kalte Extremereignisse, die über mehrere aufeinanderfolgende
Tage bestehen.
iii
Lineare Regressionsmodelle sind nicht geeignet für diskrete Daten wie Zähldaten oder wenn
die Überschreitungswahrscheinlichkeit gegen 0% oder 100% konvergiert. Die Abhängigkeit
dieser Einschränkungen vom verwendeten Klimamodell und Ort ist der Grund dafür, dass alternative Methoden, wie z.B. das logistische Regressionsmodell, nicht überall in Europa zu
verbesserten Schätzungen führen.
Die Resultate des zweiten Teils zeigen, dass für ein Viertel bis die Hälfte des europäischen
Festlands Änderungen in der Intensität von warmen Temperaturextreme erstaunlich gut beschrieben werden können durch Verschieben der gesamten Temperaturverteilung in Richtung
höhere Temperaturen. Für Wintertemperaturen ist die Erwärmung der kalten Temperaturextreme grösser als für warmen Temperaturextreme. Andererseits ist für die Sommertemperaturen die Erwärmung grösser für warme Temperaturextreme als für kalte Temperaturextreme.
Daher ist die zukünftige Temperaturvariabilität im Winter geringer und im Sommer grösser.
Wenn diese Änderung in der Variabilität asymmetrisch ist, dann führt eine zusätzliche Skalierung der Temperaturverteilungen zu grossen Schätzungsfehlern, weil die Verwendung der
zentralen Momente dafür nicht geeignet ist. Bessere Schätzungen für die verschobenen und
skalierten Intensitäten der warmen Temperaturextreme erhält man, wenn für die Schätzung der
Varianz die entsprechende Hälfte der Temperaturverteilung verwendet wird (d.h. die untere
Hälfte für kalte Extreme und die obere Hälfte für warme Extreme). Über Westeuropa, Nordeuropa und Teilen von Zentraleuropa hat es, zusätzlich zu den Änderungen in der Variabilität,
signifikante Veränderungen im Verteilungsschwanz der simulierten warmen Temperaturextreme. Dies bedeutet, dass sich über diesen Gebieten die Form der Temperaturverteilung ändert.
Um diese komplexen Änderungen in den höheren Momenten zu berücksichtigen, benötigt man
detailliertes Wissen über die zukünftigen Temperaturverteilungen und dies widerspricht der
ursprünglichen Annahme, dass für zukünftige Zeitperioden keine Daten verfügbar sind.
iv
Contents
Abstract
i
Zusammenfassung
iii
1 Introduction
1.1 European heat wave event 2003: A taste of the future climate? . .
1.2 The climate system: A short overview about a complex system . .
1.3 The probabilistic nature of extreme events . . . . . . . . . . . . .
1.4 Data availability: Prerequisite for new insights . . . . . . . . . . .
1.5 Statistical modelling of extreme events . . . . . . . . . . . . . . .
1.5.1 Extreme value theory . . . . . . . . . . . . . . . . . . . .
1.5.2 Extreme indices . . . . . . . . . . . . . . . . . . . . . . .
1.6 Are temperature extremes related to European mean temperature?
1
1
3
6
8
10
10
12
14
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
2 The potential of pattern scaling for projecting temperature-related extreme indices
2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2.2 Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2.2.1 Extreme Indices . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2.2.2 ENSEMBLES regional climate models . . . . . . . . . . . . . . . . .
2.3 Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2.3.1 Estimation of the response pattern . . . . . . . . . . . . . . . . . . . .
2.3.2 Evaluation of the estimation skills . . . . . . . . . . . . . . . . . . . .
2.4 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2.5 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2.6 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
17
20
21
21
22
22
22
24
25
29
32
3 Sensitivity of European extreme daily temperature return
trends in mean and variance
3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . .
3.2 Data and Methods . . . . . . . . . . . . . . . . . . . . . .
3.2.1 ENSEMBLES regional climate models . . . . . .
3.2.2 Return level estimation . . . . . . . . . . . . . . .
3.2.3 Adjustment models . . . . . . . . . . . . . . . . .
3.3 Results and Discussion . . . . . . . . . . . . . . . . . . .
35
38
39
39
40
41
42
v
levels to projected
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
3.4
3.5
3.3.1 Performance for the summer season .
3.3.2 Performance for the winter season . .
3.3.3 Limitations of the adjustment models
Conclusions . . . . . . . . . . . . . . . . . .
Auxiliary material . . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
4 Conclusions and outlook
4.1 Trends in central moments and temperature extremes:
4.2 Skill and limitations of the fill in approaches . . . . .
4.2.1 Linear regression model . . . . . . . . . . .
4.2.2 Adjustment models . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Are they related?
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
42
44
48
51
53
.
.
.
.
55
55
56
56
57
References
59
Acknowledgments
71
vi
Chapter 1
Introduction
1.1 European heat wave event 2003: A taste of the future
climate?
The French weekly newspaper ”Le Nouvel Observateur” asked on August 2003:
”What if the 2003 heatwave is just the beginning?”
This is a legitimate question given the global temperature increase observed over the past
decades and the various socio-economic and ecological impacts caused by the summer heat
wave in 2003. The study of Della-Marta et al. (2007) shows that the summer heat wave length
has doubled over western Europe since 1880. The centre of the heat wave event 2003 was
mainly located over France, Switzerland and Germany and caused record-breaking maximum
temperatures in June and August. Such prolonged time periods with high temperatures affect
the human physiology (Kalkstein and Smoyer, 1993; Changnon et al., 1996; Semenza et al.,
1996; Smoyer, 1998; Basu, 2002; Tan et al., 2007; Fischer and Schär, 2010) and increase the
mortality, especially for elderly people (Koppe et al., 2004; Kovats and Kristie, 2006; D’Ippoliti
et al., 2010). In France, for instance, the heat wave was responsible for nearly 15,000 excess
deaths during the time period from August 1st to 20th (Fouillet et al., 2006). In Switzerland,
there was an increase in daily mortality of about 7% for the time period from June to August,
mainly in suburban areas and cities such as, for instance, Basel, Geneva and Lausanne (Grize
et al., 2005). The higher mortality in urban areas is due to the amplification of heat stress
(Fischer et al., 2012a). The mortality risk is also increased due to the lack of mobility, thermal
insulation and sleeping on the top floor (Vandentorren et al., 2006). Despite of the heat stress,
people suffered additionally from high amount of photochemically produced ozone and air pollution (Fiala et al., 2003; Pellegrini et al., 2007; Solberg et al., 2008). However, not only the
human physiology was affected but also the ecosystems and, therefore, the agricultural sector.
The estimated economical loss in agriculture was valued at approximately e 13 billion (Létard
et al., 2004). The efficiency of the European ecosystems to produce organic matter from atmospheric carbon dioxide (gross primary productivity) was reduced by nearly 30%. As a result,
the ecosystems act as net carbon sources to the atmosphere (Ciais et al., 2005). Numerous
forest fires in Greece, France, Spain, Italy and Portugal were responsible for the air pollution
and caused loss of forest areas and wildlife (Garcı́a-Herrera et al., 2010). Further ecological
1
2
C HAPTER 1: I NTRODUCTION
impacts were large losses of Alpine glacier mass (Fink et al., 2004), shortage of pollen season
(Gehrig, 2006), water temperature records in lakes (BUWAL, 2004; Jankowski et al., 2006) and
reduced river discharge (BUWAL, 2004). The latter led to problems in power plant cooling and
hydro-electric power restrictions (Beniston and Diaz, 2004).
The unprecedented heat wave event in summer 2003 and the broad range of impacts raised
the question in the scientific community about how to put this event into the context of the
current and future climate. In this regard, Figure 1.1 shows the distribution of seasonal JuneJuly-August mean temperature over northwestern Switzerland for the time period from 1864
to 2000.
Figure 1.1: Distribution of seasonal June-July-August mean temperature for the time period from 1864
to 2000 and northwestern Switzerland (Schär et al., 2004).
Figure 1.2: European land-surface summer temperature anomalies for the time period July 20th to
August 20th for the year 2003 relative to the mean of the years 2000, 2001, 2002 and 2004. The radiative
temperature is derived from the TERRA MODIS (MODerate resolution Imaging Spectroradiometer)
satellite (Image by Reto Stöckli, Robert Simmon and David Herring, NASA Earth Observatory, based
on data from the MODIS land team).
1.2 T HE
CLIMATE SYSTEM :
A SHORT
OVERVIEW ABOUT A COMPLEX SYSTEM
3
The temperature anomaly of summer 2003 is about 5.1◦ C relative to the 1864-2000 mean corresponding to more than five times the standard deviation as depicted in Figure 1.1. In Basel,
the observed deviation of the summer mean maximum temperature from the 1961-1990 seasonal mean was up to 6◦ C standing out as record-breaking event since 1901 (Beniston and
Diaz, 2004). The large summer temperature anomalies in 2003 are impressively shown in Figure 1.2 for the time period from July 20th to August 20th . The figure visualizes the land-surface
temperature anomalies over Europe derived from the TERRA MODIS (MODerate resolution
Imaging Spectroradiometer) satellite with spatial resolution of 1km. The mean is computed for
all cloud free data of the years 2000, 2001, 2002 and 2004 and then subtracted from the measured radiative temperature for the year 2003. The satellite image depicts the spatial structure
of the heat wave and shows that the highest anomalies are found over France. The radiative
temperature was up to nearly 10◦ C hotter than for the other years. In historical context and
under consideration of the uncertainties in palaeoclimatological data, the summer 2003 was
the hottest since the 16th century at continental scale (Chuine et al., 2004; Luterbacher et al.,
2004; Xoplaki, 2005; Menzel, 2005; Coumou and Rahmstorf, 2012). Otherwise, the temperature anomalies in summer 2003 could be an analogue for the future climate caused by the
anthropogenic rise in the Earth’s temperature (Beniston and Stephenson, 2004; Beniston and
Diaz, 2004; Beniston, 2004; Meehl and Tebaldi, 2004; Schär et al., 2004; Stott et al., 2004).
Despite of large uncertainties in the statistics of extreme events, there is a need to investigate long-term trends, predictability and the underlying physical processes. The motivation and
main objective of this thesis is to investigate potential relationships between changes in the frequency and intensity of extreme events and the warming trends in European mean temperature
using the state-of-the-art regional climate models from the European multi-model project ENSEMBLES (Van Der Linden and Mitchell, 2009). Under consideration of gaps in temperature
time series, these relationships are used to project the frequency and intensity of warm extreme
events and, therefore, to fill in the gaps. The skill of the projections and the limitations of the
fill in methods are evaluated and discussed. The findings of this project are interesting for the
impact community, i.e. the insurance companies, engineers and public authorities. The methods could be used to fill the gap between data availability and the need for climatological data
as input for impact models. The purpose of the impact models is to assess the future evolution
of ecological and socio-economic impacts in order to plan appropriate adaptation strategies.
1.2 The climate system: A short overview about a complex
system
The climate system is driven by incoming solar short-wave radiation. Figure 1.3 shows
schematically what happens to the incoming solar radiation within the atmosphere. In the
figure are shown the annual and global mean estimates for the different energy fluxes (Kiehl
and Trenberth, 1997).
The left part of Figure 1.3 sketches that a fraction of the incoming short-wave radiation is
diffused within the atmosphere to all directions due to reflection, refraction and diffraction on
air molecules and aerosols, i.e. fog, rain droplets and clouds. As a result, part of the incoming
solar radiation is scattered back to the space and another part is absorbed by the Earth’s atmo-
4
C HAPTER 1: I NTRODUCTION
Figure 1.3: Estimated global and annual mean short-wave and long-wave energy fluxes (Kiehl and
Trenberth, 1997).
sphere. The remaining fraction of diffused radiation together with the not-diffused direct solar
radiation reaches the Earth’s surface. The surface properties determine the amount of shortwave radiation which will be absorbed. The not absorbed radiation is again reflected. The ratio
between reflected and absorbed short-wave radiation is termed albedo. Snow covered areas and
clouds, for instance, have high albedo values while the albedo of soil is lower and depends on
its composition. The absorbed short-wave radiation is converted to and emitted as long-wave
radiation as shown on the right part of Figure 1.3. The emitted long-wave radiation is partially
absorbed by the greenhouse gases (CO2 , CH4 , water vapour) and aerosols and re-emitted in
all directions. The back radiation shown in Figure 1.3 is the long-wave radiation re-emitted
towards the Earth’s surface. To avoid continuous warming or cooling of the Earth over a long
time period, the absorbed solar short-wave radiation has to be balanced by the long-wave radiation released by the Earth’s atmosphere and surface. This energy balance determines at least
the Earth’s climate and, particularly, the air and surface temperature. The greenhouse gases in
the atmosphere and the resulting long-wave back radiation are the reason for the global mean
temperature of about 14◦ C (Jones et al., 1999).
Because of the Earth’s shape and declination, the amount of incoming solar radiation per
surface area depends on the latitude and season. Thus, the Earth system strives to balance the
differences in the received energy between tropics and high latitudes via oceanic and atmospheric circulation. For the latter, the main driver is the release of energy through condensation
of water vapour depicted in Figure 1.3 as latent heat flux. The circulation patterns and the
amount of received radiation determine the weather patterns and temperature and, therefore,
the climate at a certain place and season. But even for the same latitude and season, the climate
varies between distant locations. The reason is schematically shown in Figure 1.4. Figure 1.4 is
a scheme of components of the climate system, physical processes as well as interactions. The
site-specific processes and interactions affect the occurrence, intensity and duration of weather
events. There are, for instance, milder winter temperatures on the coasts than for continental climate due to the thermal inertia of the oceans. Another example is the onset of rainfall
1.2 T HE
CLIMATE SYSTEM :
A SHORT
OVERVIEW ABOUT A COMPLEX SYSTEM
5
Figure 1.4: Scheme of physical processes, interactions and major components of the climate system
(IPCC, 2007).
triggered by the orography. In case of heat waves, the land-climate interactions can amplify
anomalous warm conditions.
Changes in the Earth’s climate system are triggered by changes in the energy fluxes (Figure
1.3), i.e. in the incoming solar short-wave radiation, the Earth’s albedo and the amount of longwave radiation emitted back to the Earth’s surface. The incoming solar shortwave radiation is
determined by the Sun’s intensity and the Earth’s orbit parameter. Both change in time scales
of 104 to 106 years. Reasons for changes in the Earth’s albedo are changes in the cloud cover,
vegetation, land-use and the amount of aerosols in the atmosphere. The fraction of long-wave
back radiation is related to the greenhouse gas concentration, i.e. mainly to the amount of
carbon dioxide and water vapour. During the last few decades, the evidence increased that human affect the radiative fluxes by changing the composition of the atmosphere and the albedo
through land-use (IPCC, 2001, 2007). As a consequence, the climate system responds directly
or indirectly through various positive and negative feedback processes such as, for instance,
the snow-albedo feedback. This feedback implies that for cold temperatures the snow cover
remains and increases the albedo and, as a consequence, the amount of short-wave radiation
reflected back to the atmosphere and space. Due to the decreased amount of available energy
at the surface, the temperature decreases further. Conversely, if the temperature increases and
the snow cover vanish, the darker soil can absorb more short-wave radiation leading to further
warming. The snow-albedo feedback is an example of a positive feedback and influences the
temperature variability for very cold winters (Yiou et al., 2009; Cohen et al., 2012). Generally, positive feedback processes intensify the initial process while negative feedbacks damp it.
Nevertheless, the climate system responds at least to the anthropogenically induced changes in
the radiation fluxes with an observed increase in the surface mean temperature on various spa-
6
C HAPTER 1: I NTRODUCTION
tiotemporal scales (Stott, 2000; Folland et al., 2001; IPCC, 2001; Hansen et al., 2006; IPCC,
2007; Zorita et al., 2008; Easterling and Wehner, 2009; Hansen et al., 2010; Ceppi et al., 2012).
Attribution studies are used to establish the most likely causes for the recent observed
changes in particular climate elements. For the attribution studies, climate models are run with
combined anthropogenic and natural forcings as well as natural forcings only (Stott, 2003; Stott
et al., 2010). The respond of the climate system is then compared with the observed time series
of climate elements such as, for instance, the temperature. Likelihood measures are derived
to determine the most probable explanation for the observed changes (Min et al., 2004). The
attribution procedure is constrained by the ability of the climate models to represent reliably
the effects of external forcings and natural variability, especially on small spatial scales (Stott
et al., 2010). The attribution of changes in extreme events to specific causes is still challenging
due to gaps in the understanding and the difficulty to represent reliably extremes in climate
models (Stott et al., 2010). Nonetheless, there are studies available trying to link particular
extreme weather events and changes in extremes with changes in the climate system (e.g. Stott
et al., 2004; Meehl et al., 2007; Min et al., 2011; Pall et al., 2011; Dole et al., 2011). But, it is
easier to evaluate how the frequency of extreme events changes for a particular warming rate
when the underlying distribution is known.
To detect significant changes in the climate state, i.e. in the mean, variance and extremes,
the definition of a climatological standard normal as reference climate is crucial. According
to the technical regulations of the World Meteorological Organisation (WMO), normals are
period averages for time periods longer than three consecutive ten-year periods (WMO, 2007).
The discussion about normals and their optimal length started already in the 19th century. During the early 20th century, the suggestions for the optimal length of the reference period ranged
from 11 years (solar cycle) up to 50 years to capture more of the inter-decadal variability. In
1935, the International Meteorological Committee suggested during the Warsaw conference to
take 30 years as time period (WMO, 2007; Hulme et al., 2009). This suggestion is more a compromise because the knowledge about the climate system and its variability was poor at that
time. Hence, there is no robust physical or statistical argument for the recommended normal
length. At the moment, the suggested and widely used normal period ranges from 1961 to 1990
and is employed as baseline for computing anomalies and changes in the climate system.
1.3 The probabilistic nature of extreme events
The frequency, intensity and persistence of weather events at a given location is inferred by
measuring several climate elements such as, for instance, temperature, wind, precipitation or
water vapour over a long time period, i.e. over several decades. Then, the measurements are
statistically characterized using estimators such as, for instance, the mean and variance. These
two estimators describe the expected weather and its variability for a certain time period and
location. Probability (PDF) and cumulative (CDF) density functions are used to summarize
and visualize the whole set of measurements. This implies that PDFs summarize all sitespecific processes and feedbacks shown in Figure 1.4, i.e. they represent at least the climate.
In this context, climate is a statistical quantity and is not directly measurable. An example
1.3 T HE
PROBABILISTIC NATURE OF EXTREME EVENTS
7
of a PDF used to describe statistically temperature is shown in Figure 1.5. In this example,
the temperature data are assumed to be normally distributed. To assume normally distributed
temperature data is a reasonable and often applied approximation. However, this assumption is
critical in the tails which are not well constrained because of the rare nature of extreme events
(Coumou and Rahmstorf, 2012). Nevertheless, the normal distribution family is often chosen
as appropriate statistical model to fit the data.
Figure 1.5: Fitted probability density function (PDF) of temperature. Blue and red areas represent cold
and warm extremes, respectively.
Figure 1.5 illustrates that there are events with small likelihood in the tails of the fitted PDF.
Because extreme events are rare events, this means that the tails of the PDF, coloured as blue
and red areas, represent cold and warm extreme events. In addition, Figure 1.5 implies that
the statistical characterization of extreme events depends on the underlying PDF. The resulting
question is where the tails begin and, therefore, how to define the term ”small likelihood”. A
widely used approach is to set the boundaries of the tails where the probability to exceed a
particular absolute or percentile threshold is less than 10% (IPCC, 2012), i.e. the 90th and 10th
percentile. The statistical definition of the tails and, therefore, of extreme events is not related
to their potential socio-economic or ecological impacts.
Instead of using probabilities, extreme events are often characterized in terms of return periods (Coles, 2001). This quantity describes how long the average time period between two
extreme events of equal or larger intensity is. Thus, 20-year return period events occur with
the probability of 5% per year and are expected on average every 20 years. Because return
periods are related to probabilities, the quantiles of them for a given PDF are the return levels.
These values are more useful for the insurance industry or engineers because they are used as
critical threshold values to design infrastructures such as, for instance, dams and dikes and for
8
C HAPTER 1: I NTRODUCTION
human health. In this context, the exceedance of the threshold has an impact potential. Hence,
return levels as absolute temperature thresholds are another possibility to define extreme events
(IPCC, 2012).
To date, the analysis of extremes is limited to particular climate element, i.e. temperature,
precipitation or wind. However, recent literature suggests that extreme events can be understood as compound events (IPCC, 2012). This means that the occurrence of an extreme event
depends conditionally on the occurrence of one or more other events. These other events do
not necessarily have to be extreme. As an example, droughts are extreme concerning their persistence but the absolute temperatures for each day might not be extreme. Further, the drought
itself can trigger the occurrence of other extreme events. For example, the preceding drought in
spring 2003 and the resulting soil-moisture deficit in summer 2003 led to a positive feedback,
i.e. to an amplification of the warm temperature anomaly in summer 2003 (Seneviratne et al.,
2006; Fischer et al., 2007; Fischer and Schär, 2009, 2010).
1.4 Data availability: Prerequisite for new insights
The data quantity and quality constrains the potential to detect trends in the frequency, intensity and persistence of extreme events as well as the understanding of underlying physical
processes. The latter is the precondition for attribution studies (Stott et al., 2010; IPCC, 2012)
and the development of reliable climate change scenarios. The detection of significant trends
in extreme events is limited by the rareness, i.e. high return periods. As a consequence, the
tails of the temperature distributions are not well constrained (Rahmstorf and Coumou, 2011).
This implies that the full range of possible extreme events is not captured for a given stationary climate and any derived estimates are associated with large sampling uncertainties. As
an example, the study of Frei and Schär (2001) investigates the sensitivity of the detection
probability to the rarity of heavy precipitation in the Alpine region. They estimated that the
probability to detect a frequency change by a factor of 1.5 in a 100-year record of seasonal
counts is about 20% for events with 100 days return period.
Another issue is the data quality, i.e. the completeness and homogeneity. Inhomogeneous
time series exhibit abrupt or gradually changes, i.e. so-called change points, caused by changes
in the station location, instrumentation, observing practices or environment. The correction of
these non-climatic changes is crucial because they can introduce artificial trends or systematic
biases. There are statistical methods available to detect and remove change points for annual,
seasonal and monthly data (Alexandersson and Moberg, 1997; Peterson et al., 1998; Vincent,
1998; Menne and Williams, 2005; Gérard-Marchant et al., 2008). For daily data, the development of procedures is still a topic of ongoing research and only a few methods are currently
available (e.g. Della-Marta and Wanner, 2006).
In general, the data quality and quantity depends on the measurement technique, station
density, the length of the time period as well as the definition of global standards and periodic
calibration. The international coordination of atmospheric observations was improved after the
International Geophysical Year (IGY) in 1957/1958. Near-global observations with satellites
started in the 1970s. Large-scale data for the Earth’s surface are available since the middle
1.4 DATA
AVAILABILITY:
P REREQUISITE
FOR NEW INSIGHTS
9
of the 19th century. For example, the land temperature time series from the Climate Research
Unit (CRU) goes back to 1856 (Brohan et al., 2006). Nowadays, there are monthly, seasonal
and annual temperature and precipitation data available for most parts of the world albeit with
different spatial and temporal coverage. However, extreme events such as, for instance, gusts
of wind (Della-Marta et al., 2010) or heavy precipitation events have generally much smaller
spatiotemporal scales. For instance, heavy precipitation events have temporal scales in the
order of hours. Therefore, data on daily or sub-daily time scales are needed to sample these
events and to detect trends. However, such data have only been available since the second half
of the 20th . These few decades with available data are not long enough to capture large-scale,
i.e. continental to hemispheric scale, and low-frequency climate variability and extremes. In
addition, there are still large areas around the Earth with very sparse and unreliable or even no
data such as, for instance, South America or Africa. The data availability is even more limited
for climate elements other than temperature or precipitation such as for the soil-moisture measurements (Seneviratne et al., 2010). This makes it more difficult to investigate extreme events
as compound events. Further, the inhomogeneous station density and the insufficient temporal
resolution imply that not all extreme events are captured and, therefore, the derived extreme
statistics might be biased. Thus, the lack of knowledge about the processes and feedbacks triggering extreme events together with the issues associated with data quality and quantity are at
least responsible for the difficulties to conduct attribution studies with extremes (Stott et al.,
2010; Schiermeier, 2011).
Extending the data availability with palaeoclimatological reconstructions or numerical climate models might help to complete the gaps in the PDF. However, the palaoclimatological
reconstructions have often spatiotemporal resolutions much coarser than for instrumental measurements and, additionally, larger uncertainties. For the climate model simulations, the reliability of the derived statistics depends generally on the appropriate representation of the
climate system, i.e. the physical processes. Because of the limited horizontal resolution of
climate models, it is not possible to simulate directly processes such as, for instance, convection. These sub-grid processes are parameterized, i.e. they are related empirically to the
resolved large-scale features. To simulate large-scale circulation, among other things, General
Circulation Models (GCM) are used with horizontal resolution in the order of 100-300 km.
To investigate regional or local-scale changes in extremes with GCMs, statistical or dynamical downscaling techniques are applied (Wilby and Wigley, 1997; Frei, 2006; Haylock et al.,
2006; Schmidli et al., 2007; Maraun et al., 2010). The dynamical downscaling procedures use
the GCM outputs as boundary conditions driving regional climate models (RCM) with higher
spatial resolutions up to 25km. Examples for large dynamical downscaling experiments are the
EU-funded FP5 PRUDENCE (Christensen et al., 2002) and FP6 ENSEMBLES multi-model
projects (Van Der Linden and Mitchell, 2009). For the latter, 6 GCMs drives 9 RCMs with
spatial resolutions of 25km and 50km. The skill of the dynamically downscaling experiments
depend once again on the reasonable representation of large-scale features in the GCM (Déqué
et al., 2007). However, studies such as Fischer et al. (2012b) and Déqué et al. (2007) have
shown that the contribution of the RCMs to the overall uncertainty has increased for the newest
generation of regional climate models. In contrast to dynamical downscaling with RCMs, statistical downscaling methods use statistical links between large-scale variables, i.e. grid points
in climate models, and observed values, i.e. point values. The statistical downscaling approach
10
C HAPTER 1: I NTRODUCTION
provides projected extremes on local scale, i.e. with smaller spatial extent relative to the area
averages (Klein-Tank et al., 2009). However, it is not possible to downscale extreme events
with magnitudes larger than observed.
As mentioned before, the regional climate models from the European multi-model project
ENSEMBLES (Van Der Linden and Mitchell, 2009) are used in this thesis. From the available
GCM-RCM pairs, a subset of 12 GCM-RCM chains is selected. The chosen set of GCM-RCM
pairs has the same rotated pole grid and, therefore, there is no further interpolation necessary.
It would be more difficult to compare the interpolated with the non-interpolated values because
of the additional uncertainties and the smoothing of extremes (Haylock et al., 2008).
1.5 Statistical modelling of extreme events
The theoretical background to characterize the distributional properties of extreme events is
formulated in the extreme value theory. The properties are determined by the tails of the underlying parent distribution. Therefore, as discussed in the chapter before, large samples of high
quality data are important to constrain the upper and lower tails.
1.5.1 Extreme value theory
The French mathematician Maurice Fréchet (1878-1973) suggested an asymptotic distribution
for samples of maxima in a paper published in 1927 (Fréchet, 1927). This distribution type has
a heavy upper tail and bounded lower tail and is known as Fréchet distribution. The Fréchet
distribution is nowadays one of the three possible extreme value distributions and applied to
model, for instance, market-returns. Two further distributions were developed to address problems in hydrology, engineering and materials science. The Swedish engineer and scientist
Waloddi Weibull (1887-1979) described the Weibull distribution in 1951. The Weibull distribution was proposed in the context of strength of material and fatigue analysis and was applied
to model failure times. The German mathematician Emil Julius Gumbel (1891-1966) applied
the extreme value theory, i.e. the Gumbel distribution, to a broad range of applications and,
therefore, various types of parent distributions (Gumbel, 1954).
The reason for the applicability of extreme value theory to different research fields is implicitly stated in the extreme value theorem (Fisher and Tippett, 1928; Gnedenko, 1943; Coles,
2001). The theorem asserts that the limit distribution of the largest values can only be one of
the three aforementioned extreme value distributions, independently of the underlying parent
distribution. Therefore, samples of the largest values, i.e. max (X 1 , · · · , X n ), converges for
n → ∞ as following:
max (X 1 , · · · , X n ) − bn
lim P
≤ z → G (z) .
n→∞
an
an > 0 and bn is a sequence of normalizing parameters and G (z) represents one of the three
possible extreme value distribution types. The parent distribution of (X 1 , · · · , X n ) determines
how fast the largest values converge to the limit distribution. In earlier studies, the first step
was to assume an extreme value distribution for a specific application and to estimate then
1.5 S TATISTICAL
11
MODELLING OF EXTREME EVENTS
the unknown distribution parameters. The disadvantage of this proceeding is that the a priori
assumption of an extreme value distribution needs a selection technique. All the following
inferences are based on the assumption that the extreme value distribution suggested by the
selection procedure is appropriate. As a consequence, any uncertainties in the inferences arising
from the distribution selection are not taken into account (Coles, 2001). This issue of a priori
distribution selection is circumvented through combining the three distribution families into
one distribution family (Jenkinson, 1955; Coles, 2001). The so-called generalized extreme
value distribution (GEV) family is defined as following:
− 1ξ !
z−µ
G (z) = exp − 1 + ξ ·
σ
for
1+ξ·
z−µ
σ
>0.
The location µ, shape ξ and scale σ parameters satisfy the conditions −∞ < µ < ∞,
−∞ < ξ < ∞ and σ > 0. The advantage of using the generalized extreme value distriˆ determine which of the three extreme
bution family is that the estimated parameters (µ̂, σ̂, ξ)
value types is appropriate for a particular application. Thus, the Gumbel family is chosen for
the limit ξ → 0 while ξ > 0 and ξ < 0 corresponds to Fréchet and Weibull family, respectively.
The extreme value distribution families are applied to model the largest values, i.e. minima or maxima. However, there is a further extreme value distribution family used to model
exceedances, i.e. (X − u|X > u), where u is a large threshold. The limit distribution was
introduced by the Italian economist Vilfredo Pareto (1848-1923) and is known as generalized
Pareto distribution (GPD) (Jenkinson, 1955; Pickands, 1975; Davison and Smith, 1990; Coles,
2001). The two-parameter generalized Pareto distribution family is defined as follows:
(X − u|X > u) · ξ
H (X − u|X > u) = 1 − 1 +
σ GP
− 1ξ
.
ξ is the shape parameter and σ GP the scale parameter. The shape and scale parameters of the
GPD and GEV distribution families are related to each other. The shape parameter ξ is the same
for both distribution families. The scale parameters of GPD and GEV are linked as following:
σ GP = σ + ξ (u − µ) .
Vilfredo Pareto applied this power-law type distribution to model income and wealth (Pareto,
1897). The tail behaviour of the Pareto distribution is, similar as for the GEV distribution, determined by the shape parameter ξ. For ξ < 0 the distribution of exceedances (X − u|X > u)
σ
has an upper bound at u − GP
. In contrast to this, there is no upper limit for ξ > 0. For
ξ
the limit case ξ → 0, the distribution converges to the exponential distribution with parameter
1
. Similar to the GEV distribution, the appropriate type of extreme value distribution is deσGP
termined by the estimated parameters.
Depending on the extreme value distribution family, i.e. GPD or GEV, there are two often
applied procedures for sampling extremes from datasets. When modelling largest values (maximum or minimum), the block maxima approach is used (Coles, 2001). For this approach, the
12
C HAPTER 1: I NTRODUCTION
whole sample is split into blocks of equal length. Then, the maxima or minima are drawn from
each block. For the resulting sample of largest values, the GEV distribution parameters and
the confidence intervals are estimated. To estimate the distribution parameters, the maximum
likelihood approach (e.g. Coles, 2001) or the L-moments method (Hosking, 1990; Coles, 2001)
are applied. To estimate the confidence interval, the Delta-method (Coles, 2001) or parametric or nonparametric resampling procedures are suggested (Davison and Hinkley, 1997). The
difficulty of the block maxima approach is to find an appropriate block size to minimize the
sampling uncertainty and to avoid biases caused by too small block lengths. Another issue is
that not all available information of the tails is used. This is the advantage of the peak-overthreshold approach using all data beyond the selected threshold u. The peak-over-threshold
method is used to derive the exceedances. For the parameter and confidence interval estimations, the same methods can be applied as for the block maxima approach. The difficulty of the
peak-over-threshold method is to determine an appropriate threshold u0 to minimize the sampling uncertainty and biases. Threshold selection methods base on the idea that for high enough
thresholds u0 the GPD parameter estimates converges to a nearly constant value and the mean
excess E (X − u|X > u) is a linear function of u (Coles, 2001). This is tested visually with
appropriate plots and, therefore, provides no objective criteria for the threshold selection. As
a consequence, the approaches are not applicable to large gridded data sets. For this purpose,
goodness-of-fit procedures (Choulakian and Stephens, 2001) and test statistics (e.g. Kyselý
et al., 2008) are used in this thesis. Generally, the GPD distribution is preferred in this thesis
because the maximum possible amount of information is used to model extreme values. This
is especially important in the context of extremes because the amount of information is already
reduced due to their rarity. For comparison, the GEV distribution is also fitted to the data in
order to test the sensitivity of the temperature extreme estimates to the chosen extreme value
distribution family. The distribution parameters of both distribution families are estimated with
the maximum-likelihood method. This approach is preferred because it provides confidence
intervals for the estimated distribution parameters (Coles, 2001). The quantiles of the fitted
extreme value distributions corresponds to the return levels. This value is exceeded on average
once every T years while T is the return period. When the return period T is given in years,
an equivalent proposition is that the return level is exceeded in one year with the probability
1
. The return periods in this thesis are estimated according to Coles (2001). Their confidence
T
intervals estimates are derived from the Delta-method (Coles, 2001).
Extreme value theory provides a statistical framework to model very rare events. However,
as discussed in the chapter before, the inferences are sensitive to the quantity and quality of the
data. To circumvent part of the difficulties arising from the rarity of extreme events, the use
of extreme indices (Frich et al., 2002; Klein-Tank et al., 2009) has established during the last
decade. For this reason, the extreme indices are also used in this thesis.
1.5.2 Extreme indices
Extreme weather events have often time scales in the order of daily or sub-daily. However,
climate scientists are mainly focused on datasets with monthly time scales up to the 1990s.
Consequently, there were no international datasets of long-term daily terrestrial data available
at that time (Peterson and Manton, 2008). This was also stated in the IPCC second assess-
1.5 S TATISTICAL
MODELLING OF EXTREME EVENTS
13
ment report in 1995 (IPCC, 1995) where is written that ”there are inadequate data to determine
whether consistent global changes in climate variability or weather extremes have occurred
over the 20th century”. Analysis of extremes were available for a few countries such as, for instance, the United States, Canada or Australia. However, it was difficult to compare the results
among these countries because they used different measures of extremes (Peterson and Manton, 2008). In addition, because the time scales of extremes are smaller than for the available
datasets, information about extremes is smoothed and partly filtered out, respectively. These
issues motivated not only to develop daily datasets but also to intensify international cooperation. International cooperation is important to generate and to analyze datasets from Asia,
Africa and South America where gaps in the data exists. For this purpose, the meeting in
November 1999 of the joint Expert Team on Climate Change Detection and Indices (ETCCDI)
in Geneva agreed to set the focus on the analysis and development of extreme indices derived
from daily meteorological data and to fill in gaps in the world map of available data by organizing regional workshops. An example for such a successful workshop is the Asia-Pacific
network (ANP) meeting (e.g. Manton et al., 2001; Zhang et al., 2011). The general idea of the
workshops is that participants bring their own data and they are then guided to apply quality
control and inhomogeneity detection procedures. At least, they compute with software provided by the ETCCDI the extreme indices (Zhang et al., 2011).
The current 27 ETCCDI extreme indices (see: http://cccma.seos.uvic.ca/
ETCCDI/list_27_indices.html) represent partly modified versions of the European
Climate Assessment and Data (ECA&D) indices (Klein-Tank and Können, 2003). The extreme
indices differ in their definition of the underlying distribution and in the selected threshold, i.e.
how far the threshold is located within the tail of the parent distribution. Relative to the rare
extreme events, the extreme indices are generally more robust, more frequent and simple to
compute. From a statistical point of view they represent moderate extreme precipitation and
temperature events, i.e. they occur a few times per year. Further advantages of the indices
are that they have a higher signal-to-noise ratio and represent broad aspects of the local and
regional climate, i.e. temperature, precipitation, humidity, cloudiness, wind and so on. The use
of these standardized indices allows now to compare the results among different studies and
regions, respectively. They are based on absolute (e.g. 0 ◦ C) or percentile thresholds (e.g. 90th
percentile). The former are not suitable to compare the results over very large areas because
they do not describe the same part of the underlying distribution. In contrast to this, extreme
indices with percentile thresholds are related to the local climate and take the mean annual
cycle into account. Because they represent in each area of interest the same part of the underlying distribution, their rarity is fixed. Therefore, the number of days exceeding the threshold
is evenly distributed in space. This allows at least to compare the results between different
regions (Klein-Tank et al., 2009). The percentile thresholds are computed using data of the
reference period, i.e. the time period 1961-1990. But, the study of Zhang et al. (2005) shows
that the sampling uncertainty in the estimated percentile causes discontinuities at the beginning
and the end of the selected reference period. They suppose a bootstrap procedure to estimate
the percentile thresholds and, therefore, to remove the inhomogeneities.
During the last decade, extreme indices are used to monitor changes in the frequency, intensity and persistence of moderate extreme events (e.g. Frich et al., 2002; Klein-Tank and
Können, 2003; Alexander et al., 2006; Russo and Sterl, 2011; Moberg et al., 2006), to evaluate
14
C HAPTER 1: I NTRODUCTION
climate models, to explore expected changes (e.g. Russo and Sterl, 2011; Tebaldi et al., 2006)
as well as for attribution studies (e.g. Kiktev et al., 2003).
Based on studies such as Katz and Brown (1992), Klein-Tank and Können (2003), Simolo
et al. (2011) and Ballester et al. (2009a), there is evidence that the exceedance frequencies,
measured by some extreme indices, are related to changes in the parent distribution. Therefore,
the general question is how extreme temperature events are related to the mean temperature.
1.6 Are temperature extremes related to European mean
temperature?
There are physical and statistical arguments that the observed increase in global mean temperature might affect the frequency, intensity and duration of warm and cold extreme events (Katz
and Brown, 1992; Klein-Tank and Können, 2003; Della-Marta et al., 2007; Ballester et al.,
2009b,a; Simolo et al., 2010, 2011; de Vries et al., 2012) as well as record-breaking events
(Wergen and Krug, 2010; Rahmstorf and Coumou, 2011). This is schematically illustrated in
Figure 1.6a showing two temperature distributions, i.e. a reference distribution and a future
distribution shifted by one standard deviation. For the reference period, the initial exceedance
probability is 10% for the warm and cold extremes, respectively. The red area in Figure 1.6a
illustrates the expected increase in the exceedance frequency of warm extremes while the blue
area represents the decrease for cold extremes. Figure 1.6b shows additionally the exceedance
probability as a function of change in the mean ∆µ. For comparison, the evolution of the exceedance probability in Figure 1.6b is also shown for a PDF with variance two times as large
as for the PDF in Figure 1.6a (dashed lines).
(a)
(b)
Figure 1.6: (a) Reference and future temperature distribution shifted by one standard deviation. The
red and blue areas depict the trend-induced changes in the exceedance probabilities of warm and cold
extremes, respectively. (b) Changes in the exceedance probabilities as a function of change in the mean
∆µ. The blue solid and dashed lines represent cold extremes while the red ones correspond to warm
extremes. The variance for the distribution corresponding to the dashed lines is twice as large as for the
distribution shown in (a).
1.6 A RE
TEMPERATURE EXTREMES RELATED TO
E UROPEAN
MEAN TEMPERATURE ?
15
As mentioned before, the trend in the mean temperature is approximately depicted as a shift of
the whole temperature distribution towards higher temperatures (Frich et al., 2002; Klein-Tank
and Können, 2003; Scherrer et al., 2005; Simolo et al., 2010; Hansen et al., 2010; Simolo et al.,
2011). Figure 1.6 illustrates that the shift induces changes in the exceedance probabilities. The
expected change in exceedance probability is larger for warm extremes than for cold extremes
as depicted in Figure 1.6b. The rate of change for the frequency of cold and warm extremes is
determined by the shape of the distribution, shift direction and velocity as well as the variance.
Figure 1.6b demonstrates also that for a particular trend in the mean, the resulting changes
in the exceedance probabilities of warm and cold extremes are higher when the temperature
distribution has a small variance. In other words, the rate of change decreases with increasing
variance.
The link between the occurrence probability and changes in the mean are already investigated in the 1980s in studies such as, for instance, Mearns et al. (1984) and Wigley (1988). A
few years later the study of Katz and Brown (1992) investigated the sensitivity of exceedance
probabilities to changes in the mean and variance. The study shows that the sensitivity to
changes in the variance is higher than for the mean. This is crucial because studies show that
each part of a temperature distribution warms differently leading at least to changes in the temperature variability (Klein-Tank and Können, 2003; Yiou et al., 2009). This is evident when
trend magnitudes in the warm and cold extremes are compared with each other (Klein-Tank and
Können, 2003; Alexander et al., 2006; Moberg et al., 2006; Russo and Sterl, 2011; de Vries
et al., 2012). Consequently, there is a general tendency of decreasing winter temperature variability and increasing summer temperature variability caused by higher trend magnitudes in the
cold extremes for winter and, respectively, in the warm extremes for summer (Fischer et al.,
2012b).
The adjustments of the location (µ) and scale (σ) parameters of the reference temperature
distribution in order to fit the future distribution is not uncontroversial (Katz, 2010). It is not
clear if the shift of the whole temperature distribution is consistent with the changes in the
physical processes. Nevertheless, several studies show that there is a link between changes in
the mean temperature and frequency. Therefore, the main objective of this thesis is to investigate the relationship between mean temperature and frequency of temperature extremes in
regional climate models representing the recent state-of-the-art of physical understanding and
regional climate modelling. Absolute and percentile threshold based extreme indices are used
as robust measures for the frequency of moderate cold and warm extreme events. But to focus
only on the frequency of occurrence ignores any changes in the severity. Therefore, a further
objective is to assess the relationship between the mean climate and the return levels of warm
extremes. The relationships are statistically modelled and the applicability of these models to
project the frequency and intensity of temperature extremes will be tested. The findings are
interesting for the impact community because they can use these simple statistical models to
complete datasets with gaps in the time series.
Chapter 2
The potential of pattern scaling for
projecting temperature-related extreme
indices
17
19
The potential of pattern scaling for projecting
temperature-related extreme indices
Andreas Lustenberger, Reto Knutti and Erich M. Fischer
Institute for Atmospheric and Climate Science, ETH Zurich, Switzerland
(Published in International Journal of Climatology, 2013, doi:10.1002/joc.3659)
Abstract
Pattern scaling can be used to linearly relate changes in extreme indices to changes in the annual or seasonal mean temperature. This study demonstrates the skills and limitations of two
often used pattern scaling approaches in filling-in gaps in the time series of six temperaturerelated extreme indices. The extreme indices over Europe are derived from daily temperature
output of 12 regional climate models of the multi-model project ENSEMBLES. The response
pattern is estimated using one of the two future time periods (2021-2050 or 2070-2099) and the
reference period (1961-1990). The simulated values from the remaining future time period are
used for evaluating the skills. Both pattern scaling approaches perform reasonably well particularly for percentile-based and over most of the regions also for fixed temperature indices.
Uncertainties due to internal variability can be large if the time period used for estimating the
response pattern is close to the reference period. Limitations of pattern scaling due to violations
of the linearity assumption are related to the shape of the temperature distribution. As a result,
differences in the skills among the extreme indices can be related to the magnitude and shift
direction of the whole temperature distribution. Therefore, skills for estimated extreme indices
derived from the upper tail of the underlying temperature distribution are generally high. Over
some areas, linear regression models used in this study are not appropriate statistical models
because of the bounded and discrete nature of the data. Alternative pattern scaling methods
such as, for instance, the logistic regression model lead to improvements over particular areas
but not over the whole integration area.
20
C HAPTER 2: L INEAR
SCALING OF TEMPERATURE EXTREMES
2.1 Introduction
Changes in the frequency or intensity of extreme events as a result of a changing climate would
have major impacts on society and the environment. In this context, not only climate scientists but also different impact communities and end-users such as, for instance, the reinsurance
companies are highly interested in trends in the frequency, intensity and duration of extreme
events and the understanding of the driving mechanisms. From a statistical point of view, their
long return periods are a major challenge in the assessment of extreme events. Consequently,
there are in general not enough observed or simulated extreme events to reliably estimate their
statistical properties. Additionally, climate models still have difficulty simulating the typically
highly non-linear actions across scales and small-scale nature of extreme events. To avoid
some of these problems we here use a set of extreme indices with return periods in the order of
weeks. Such moderate definitions of extremes include events that do not correspond to the extremeness of events that are generally perceived as weather extremes often associated with high
impacts. However, the moderate definitions used here and thus the short return periods ensure a
sufficiently large sample size for robust trend analysis. According to the extreme value theory,
these events are statistically not extreme because they do not follow the extreme value distributions. Extreme indices are used for assessing and monitoring changes in extremes on global
scale (Alexander et al., 2006) or in regional trend studies (Moberg et al., 2006; Toreti and Desiato, 2008). We here use the definitions from the European Climate Assessment and Dataset
(ECA&D) initiative (http://eca.knmi.nl/indicesextremes/index.php).
Simulations performed with regional (RCM) or global climate models (GCM) are an important tool in climate science to assess and understand changes in the climate system and their
driving processes. However, the associated costs limit the availability of simulated data for
some emission scenarios or time periods. This is not only an issue for climate scientists but
also for the impact community using these simulations as input for their impact models. A
widely-used approach to fill in these gaps is pattern scaling, a method that provides climate
change scenarios for time periods or emission scenarios for which no simulations are available.
The underlying idea is that changes in a regional variable are linearly related to large-scale
or global mean temperature changes. A forcing and time-independent response pattern determines the variable of interest from the selected set of predictors. A fast and easy way to
estimate the response pattern is the time-slice method (Mitchell, 2003). The response pattern
is then given by the differences in the estimated means of the variable of interest scaled by
the mean temperature change between two distinct time periods or emission scenarios. However, any deviation from the underlying Gaussian assumption or, more general, from symmetry
concerning the underlying probability density functions (pdf) leads to biases in the response
pattern. A more robust method is to fit a linear regression using the least-squares method because this approach takes all available data points into account, minimizing the mean-squared
error (Mitchell, 2003). Mitchell and Hulme (1999) and Huntingford and Cox (2000) applied
the least-squares method to a sequence of decadal means. Ruosteenoja et al. (2007) used the
least-squares approach in order to fit several simulations of four different emission scenarios
for a specific GCM on the global mean temperature change simulated by a simple climate
model. Kennett and Buonomo (2006) scaled the time-varying mean and standard deviation of
the regional climate model with the smoothed global mean surface air temperature from the
2.2 DATA
21
driving GCM. In the framework of the ENSEMBLES project, Kendon et al. (2010) described
how pattern scaling can be used to increase the number of GCM-RCM chains.
The main idea of regression is to estimate the conditional mean of a variable of interest.
This implies that a trend-induced change in the location parameter of the dependent variable
has to be related to the trend-induced change in the mean of the explanatory variable (Simolo
et al., 2010). But especially extremes and extreme indices are further sensitive to changes in
the scale or shape parameter of the underlying probability density function (Katz and Brown,
1992). In this context, the objective of this study is to evaluate the applicability of the timeslice and least-squares method to a set of temperature-related extreme indices using the IPCC
SRES A1B emission scenario runs from a subset of GCM-RCM pairs in the framework of the
ENSEMBLES project (Van Der Linden and Mitchell, 2009). The pattern scaling methods are
used in this study to fill in gaps in the time series of extreme indices. Since there is mainly
one emission scenario available in the ENSEMBLES project, the skills of the scaling methods
are therefore tested using different time periods rather than different emission scenarios. The
examination of the underlying assumptions such as, for instance, the linearity assumption is
crucial.
The structure of this study is as follows: Section 2 introduces the regional climate models
and the set of temperature-related extreme indices. In section 3, the least-squares and time-slice
method and their associated assumptions are discussed. The results are presented in section 4
and discussed in section 5. The resulting conclusions are the topic of section 6.
2.2 Data
2.2.1 Extreme Indices
In this study, six temperature-related extreme indices are used, each of them derived from daily
2-m temperature data. The extreme indices are defined as follows:
• Number of frost days (FD): If TN is the daily minimum temperature, FD is the number
of days per year when TN < 0◦ C.
• Number of summer days (SU): SU counts the number of days per year when the daily
maximum temperature TX > 25◦ C.
• Percentage of warm days (TX90P) and cold days (TX10P) per year: For each calendar
day of the reference period 1961-1990, the 90th and 10th percentile of the daily maximum
temperature is computed using a five day time window centred at the calendar day.
• Percentage of warm nights (TN90P) and cold nights (TN10P) per year: Same as above
but for daily minimum temperature.
• Cold-spell days index (CWFI): If TG is the daily mean temperature and TG10P the calendar day 10th percentile of a five day window centred at the calendar day, CWFI counts
the number of days per year when TG < TG10P for at least 6 consecutive days.
22
C HAPTER 2: L INEAR
SCALING OF TEMPERATURE EXTREMES
• Warm-spell days index (HWFI): If TG is the daily mean temperature and TG90P the
calendar 90th percentile of a five day window centred at the calendar day, HWFI counts
the number of days per year when TG > TG90P for at least 6 consecutive days.
All six extreme indices are computed with the Climate Data Operators (CDO). CDO is
an open source tool developed by the Max-Planck-Institute (https://code.zmaw.de/
projects/cdo/) and provides a set of operators for analyzing climate model outputs. The
extreme indices implemented in CDO are European Climate Assessment & Dataset (ECA&D)
indices. However, six of the extreme indices (FD, SU, TX10P, TX90P, TN10P, TN90P) used
in this study follow the recommended definitions by the Expert Team on Climate Change Detection and Indices (ETCCDI). HWFI and CWFI have definitions which are very similar to the
warm-spell duration (WSDI) and cold-spell duration (CSDI) index suggested by ETCCDI. The
main difference is that the daily mean temperature is used to compute CWFI and HWFI. In
contrast to this, WSDI and CSDI employ daily maximum and minimum temperature, respectively.
The suggested extreme indices are day-count indices based on fixed (FD, SU) or percentile
(TX90P, TX10P, TN90P, TN10P, HWFI, CWFI) thresholds. The fixed extreme indices make a
spatial comparison difficult because they do not sample the same part of the underlying probability distribution of temperature at different sites. The motivation of using extreme indices
based on fixed thresholds is that they are often associated with observed impacts. For spatial
comparison the percentile-based extreme indices are more suitable because they have a constant rarity in context of the local climate.
SU and FD do not take the annual cycle into account in their definitions. This means that
in regions with a pronounced annual cycle one can expect that the seasons affect the internal
variability of these two indices.
2.2.2 ENSEMBLES regional climate models
The daily maximum, minimum and mean 2-m temperatures are simulated by different one-way
nested GCM-RCM chains. The GCM-RCM pairs are defined in the framework of the ENSEMBLES project (Van Der Linden and Mitchell, 2009). Due to limited computational resources,
the ENSEMBLES community decided to use mainly the SRES A1B emission scenario. In this
study, a set of 12 GCM-RCM chains is used with a common grid and transient runs covering
the time period from 1951 to 2099. The horizontal resolution of the RCMs is about 25 km.
Only grid points exhibiting a land area fraction of more than 50% are considered in this study.
Table 2.1 gives an overview on the 12 GCM-RCM pairs, i.e. the institutions, the name of the
RCM and the name of the driving GCM.
2.3 Methods
2.3.1 Estimation of the response pattern
There are two ways to estimate the response pattern, the time-slice approach (Mitchell, 2003)
and the least-squares approach (Mitchell, 2003; Huntingford and Cox, 2000). The annual mean
2.3 M ETHODS
23
Institution
RCM model
Driving GCM
Irish National Meteorological
Service (C4I)
RCA3
HadCM3Q16
Danish Meteorological Institute (DMI)
DMI-HIRHAM
ECHAM5
Danish Meteorological Institute (DMI)
DMI-HIRHAM
ARPEGE
Swiss Federal Institute of
Technology (ETH)
CCLM
HadCM3Q0
Royal Netherlands Meteorological
Institute (KNMI)
RACMO
ECHAM5
Met Office Hadley Centre (HC)
HadRM3Q0
HadCM3Q0
Met Office Hadley Centre (HC)
HadRM3Q3
HadCM3Q3
(low sensitivity)
Met Office Hadley Centre (HC)
HadRM3Q16
HadCM3Q16
(high sensitivity)
Swedish Meteorological and
Hydrological Institute (SMHI)
RCA
ECHAM5
Swedish Meteorological and
Hydrological Institute (SMHI)
RCA
BCM
Swedish Meteorological and
Hydrological Institute (SMHI)
RCA
HadCM3Q3
Max-Planck-Institute for
Meteorology (MPI)
REMO
ECHAM5
Table 2.1: The ENSEMBLES regional climate models (RCM) with the same rotated pole grid and available simulations for the time period from 1951 to 2099.
24
C HAPTER 2: L INEAR
SCALING OF TEMPERATURE EXTREMES
of the daily mean, daily maximum and daily minimum 2-m temperatures are used as explanatory variables. Additionally, in case of the FD and SU index, the seasonal mean summer (JJA)
and mean winter (DJF) temperatures are used. In cases where the seasonal mean temperature
is employed, the extreme indices are computed per season. The anomalies for each variable are
computed by subtracting the mean of the reference period (1961-1990) from the mid-century
(2021-2050) and late-century (2070-2099) period mean. For the least-squares approach, a linear regression model is used with an unknown regression coefficient β1 and the intercept β0
which is assumed to be zero. The errors ǫi have to satisfy the Gauss-Markov conditions (Plackett, 1950). The basic idea of the least-squares method is to minimize the residual sum of squares
with respect to the regression coefficient β1 . Therefore, β1 can be estimated in the following
way:
Pn
(xi − x) · (yi − y)
β̂1 = i=1Pn
2
i=1 (xi − x)
The subscript i denotes all years of the reference and the scenario period. For each grid point
the significance of the assumed linear trend is tested with a simple t-test of the coefficient β1
with the significance level α = 5%. If the Gauss-Markov conditions are not violated then T
is distributed like Student’s t with n-2 degrees of freedom. The advantage of the least-squares
approach relative to the time-slice approach is that all available data are used. Additionally,
due to minimizing a quantity measuring the errors we can be sure that, if all assumptions are
fulfilled, this approach fits the data better than the time-slice method.
Another possibility to estimate the slope coefficient β1 is the time-slice method. This approach estimates β1 in the following way:
β̂1 =
y Scenario − y Ref erence
xScenario − xRef erence
β1 is only estimated when there is a significant trend-induced change of the mean at the 5%
significance level. In this context, a simple t-test is applied with the null hypothesis H0 that the
means are equal.
The scaling factor β1 is estimated for each grid point and for each of the 12 GCM-RCM
pairs using both pattern scaling approaches. In addition, β1 is only estimated for a particular
grid point when the associated significance test rejects the null hypothesis.
2.3.2 Evaluation of the estimation skills
The response pattern is derived using one of the two future time periods and the reference period. The estimated response pattern is used to (1) interpolate the extreme indices for the midcentury period 2021-2050 based on information from the control period and the late-century
2070-2099, or to (2) extrapolate them for the late-century 2070-2099 based on information
from the control period and the mid-century, respectively. The simulated values from the remaining time period are used for evaluating the skill of the extreme index estimates. The
motivation of this study is to stay on the level of the RCMs, i.e. we assume that we have
only information from the 12 GCM-RCM chains presented in subsection 2.2. An alternative
procedure would be to use the global mean temperature change as predictor or regional information derived from statistical downscaling. However, because the information of the RCM
2.4 R ESULTS
25
are more closely related to regional climate simulated by the GCM-RCM chains, the choice of
RCM-based information delivers better insight into the limitation of pattern scaling applied to
extreme indices.
As a measure of skill, two different statistical quantities are used here. First, the root mean
squared error (RMSE). It is important to keep in mind that the RMSE is sensitive to outliers.
Another statistical quantity is the Pearson’s correlation coefficient ρ. The correlation coefficient measures the degree of correspondence between the estimated and simulated values. The
squared value ρ2 expresses the fraction of the variance in the simulated values explained by
the estimated linear regression. However, ρ2 does not take into account any bias. Therefore,
a high value of ρ2 can still have a large systematic error. For this purpose, the RMSE will be
used additionally in order to quantify potential biases. Similar as the RMSE, ρ2 is sensitive to
outliers.
2.4 Results
Figure 2.1 shows in the top panel the mean TX90P values simulated with the HadCM3Q16HadRM3Q16 model and in the bottom panel the values fitted with the least-squares approach
for the time slice 2070-2099 using the yearly mean daily maximum temperature as explanatory
variable.
Over central and northern Europe, there is an underestimation of the fitted mean TX90P
values. In contrast to this, the TX90P values over southern Europe are in relatively good agreement with the simulated ones. The figure representing the fitted values implies some spatial
dependency of the skills which will be discussed later. Figure 2.2 shows the ρ2 and RMSE
values for the estimated TX90P ((a), (b)) and TX10P ((c), (d)) indices over the integration area
of each GCM-RCM pair. Changes in the yearly mean daily maximum temperature are used as
explanatory variable. The range in ρ2 and RMSE across grid points is depicted with the empirical 25th and 75th percentile. Figure 2.2a and 2.2c show the skill values for the time period
2021-2050 (interpolation) whereas Figure 2.2b and 2.2d those for 2070-2099 (extrapolation).
For both extreme indices, the least-squares approach performs better than the time-slice
approach. The multi-model median RMSE values in case of the least-squares approach are
for both time periods of interest about 23%-57% lower than for the time-slice approach. In
addition, the interpolation for the time period 2021-2050 yields substantially better results than
the extrapolation for 2070-2099. The multi-model interquartile range, as a measure for the
inter-model differences, is up to 50% lower for the least-squares approach and the time period
2021-2050 relative to the time-slice approach. When estimating the extreme indices for the
time period 2070-2099, the inter-model differences in the RMSE values become larger. Especially for the TX10P index, the RCMs driven with the high climate sensitivity GCM can be
characterized as outliers in terms of the RMSE values.
Concerning the ρ2 values, Figure 2.2 shows that for both pattern scaling approaches the
correlation between the simulated and fitted TX90P values are between 1.5 to three times
higher than for the TX10P index. Additionally, the multi-model interquartile range is in almost all cases smaller for the TX90P index. This behaviour implies that, given increasing
mean temperatures, estimating extreme indices associated with the upper tail of the underly-
26
C HAPTER 2: L INEAR
SCALING OF TEMPERATURE EXTREMES
(a)
60° N
45° N
30° N
30° W
15° W
0°
15° E
30° E
45° E
30° W
15° W
0°
15° E
30° E
45° E
(b)
60° N
45° N
30° N
30
40
50
% / year
60
70
80
Figure 2.1: Mean TX90P for the time slice 2070-2099 (a) simulated with the HadCM3Q16HadRM3Q16 model and (b) fitted with the least-squares approach using the annual mean daily maximum temperature as predictor.
ing temperature distribution (e.g. TX90P) seems to work generally better than for those in the
lower tail. This behaviour is due to the fact that the TX10P index approaches 0% per year
as a result of the warming. Therefore, the relationship between the extreme index and mean
temperature tends to flatten and thus violate the assumption of linearity. The same behaviour
is observed for the TN10P and TN90P index (not shown) derived from the daily minimum
temperature distribution. However, there are very small differences in the skill values between
the TN10P and TX10P extreme index and the TN90P and TX90P index, respectively. In case
of the fixed threshold based extreme indices (FD and SU; not shown), the multi-model median
RMSE values are still lower for the least-squares approach than for the time-slice approach
for the interpolation and extrapolation. Seasonal mean temperature as predictor for FD and
SU leads to multi-model median RMSE values up to 50% smaller than for the corresponding
annual based values. For both pattern scaling approaches, this decrease is for the interpolated
estimations larger than for extrapolated values.
2.4 R ESULTS
27
(a) TX90P: 2021-2050
(b) TX90P: 2070-2099
10
RMSE (% per year)
RMSE (% per year)
10
8
6
4
2
0
0
0.2
0.4
2
ρ
0.6
0.8
4
2
0.2
0.4
ρ2
0.6
0.8
1
(d) TX10P: 2070-2099
10
10
RMSE (% per year)
RMSE (% per year)
6
0
0
1
(c) TX10P: 2021-2050
8
8
6
4
2
C4I−RCA3
DMI−HIRHAM−ARPEGE
8
DMI−HIRHAM−ECHAM5
ETH−CLM
KNMI−RACMO
6
HC−HadRM3Q0
HC−HadRM3Q3
4
HC−HadRM3Q16
MPI−REMO
SMHI−RCA−BCM
2
SMHI−RCA−ECHAM5
SMHI−RCA−HadCM3Q3
0
0
0.2
0.4
ρ2
0.6
0.8
1
0
0
0.2
0.4
2
ρ
0.6
0.8
1
Figure 2.2: Skill of the pattern scaling approach for all 12 GCM-RCM chains expressed as squared
correlation ρ2 and RMSE values. Skills are shown for TX90P in (a) 2021-2050, and (b) 2070-2090, and
for TX10P in (c) 2021-2050, and (d) 2070-2099. The indices are estimated with the time-slice (dashed
line) and least-squares approach (solid line). The annual mean daily maximum temperature is used as
explanatory variable. The range of values over the integration area is represented with the empirical
25th and 75th percentile.
For the multi-model ρ2 values, an improvement up to 28% is evident mainly for the leastsquares approach applied on FD. In general, better skills are found over northern Europe for
FD and over the Mediterranean in case of SU. This is due to the fact that over northern Europe
trends in annual mean temperatures are dominated by trends in the seasonal DJF mean temperature while over southern Europe trends in the JJA mean temperature have a larger influence.
For yearly FD, the ρ2 values (multi-model median and interquartile range) are comparable
with the values of the TX10P and TN10P values. In case of SU, the multi-model median values of this particular skill measurement are lower and the multi-model interquartile range is in
almost all cases higher than for the values of the TX90P and TN90P index. In addition, the
higher interquartile range values for each GCM-RCM pair in case of FD and SU imply a higher
spatial dependency of the ρ2 values relative to the quantile-based extreme indices.
The lowest correlation values (not shown) are found between CWFI and the annual mean
temperature. The median value over the integration area of each GCM-RCM chain is between
-0.5 and -0.56 with a maximum interquartile range of 0.11. The small inter-model differences
give a hint that for this particular extreme index the linearity assumption is violated. In contrast
to the CWFI, the skill values for the HWFI indicate larger systematic errors in the estimations
but better correspondence with multi-model median ρ2 values in the order of 0.35 to 0.75 (not
shown).
To investigate the uncertainty induced by internal variability on the estimations of the slope
28
C HAPTER 2: L INEAR
SCALING OF TEMPERATURE EXTREMES
Estimated Slope (%/K)
parameter β1 , a 30-year moving window is used together with the fixed reference period 19611990. The moving time window is shifted by one year at each step and moves through the time
period from 1991 to 2099. Figure 2.3 shows the estimated β1 values for the TX90P index based
on the time-slice method (blue line) and the least-squares method (red line) for a single grid
point over northern Scandinavia from the ECHAM5-RCA model. The TX90P index is chosen
because this extreme index does not exhibit any significant violation of the linearity assumption
over the whole integration area. The green lines represent the range of estimated β1 values for
that particular grid point for all 12 GCM-RCM chains.
12
11
10
9
8
7
6
5
4
3
2001−2030 2021−2050 2041−2070 2061−2090
Time Window
Figure 2.3: Grid point from the ECHAM5-RCA model with β1 slopes (TX90P per Kelvin warming of
annual mean daily maximum temperature) estimated with the least-squares (red) and time-slice (blue)
approach using the reference period (1961-1990) and a 30-year moving time window. The horizontal
axis shows the time period covered by the moving 30-year time window. The green lines represent the
range of estimated values for all 12 RCMs.
The blue and red curves indicate that the uncertainty in the slope parameter estimates is largest
when the moving window is close to the reference period. This implies that the warming
signal is partly masked by the internal variability (noise) for the coming decades,. However,
the signal-to-noise ratio increases with time and, therefore, the estimated slope parameter converges to approximately 9 %/K. In addition, the uncertainty in the estimates decreases when the
moving window is shifted towards the end of the 21st century, i.e. both curves tend to flatten.
For the least-squares approach, for instance, the uncertainty decreases by approximately 75%
when the moving time window reaches the end of the 21st century. However, the uncertainty
induced by internal variability and the differences in the estimated slope parameters between
both pattern scaling approaches are at any time much smaller than the inter-model differences
depicted as green lines. Even the largest uncertainty of the estimated β1 due to internal variability, found when the time window is close to the reference period, is smaller than 10% of
the model range. However, the magnitude of decrease in the uncertainty of the estimated β1
exhibits some spatial dependency. The largest decrease in the uncertainty occurs over Scandinavia for three quarters of the GCM-RCM chains in case of the least-squares method and
for all RCMs for the time-slice method. For both pattern scaling methods and all GCM-RCM
2.5 D ISCUSSION
29
chains, significant decreases in the uncertainty induced by internal variability take place over
the Mediterranean.
The aforementioned results are mainly observed for warm extreme indices associated with
temperature thresholds in the upper tail of the underlying temperature distribution. For extreme
indices with temperature thresholds in the lower tail of the temperature distribution (cold extreme indices) other issues, such as nonlinearities and the convergence of the extreme index
values towards zero, have a greater influence on the estimations.
2.5 Discussion
In the last section there is evidence for a violation of the underlying linearity assumption for
some of the investigated extreme indices and especially in case of the CWFI. The degree of
linearity and the trend magnitude of the extreme index are related to trends in the explanatory
variable and the probability of exceeding a particular temperature threshold. Figure 2.4 illustrates this link in case of the TX10P and TX90P index over the Mediterranean simulated from
1951 to 2099 with the HadCM3Q3-HadRM3Q3.
The trend in the yearly mean maximum temperature for the time period from 1951 to 2099,
shown in Figure 2.4a, can be characterized by a second order polynomial regression model
(blue solid line). The fitted curve shows that under the assumption of constant central moments
higher than 1st order the trend implies a non-uniform shift of the whole temperature distribution
towards higher temperatures. The shift itself induces changes in the probabilities of exceeding
a particular temperature threshold and, therefore, in the extreme index. The magnitude and
direction of this induced trend depends on the shift direction and the shape of the temperature
distribution. Figure 2.4c shows how the probability of exceeding the 90th percentile changes
as a function of changing location parameter ∆µ in case of a standard normal distribution with
three different values for the scale parameter σ. Figure 2.4c implies that a shift towards higher
temperatures induces first a nonlinear increase in the percentage of days per year exceeding
the 90th percentile of the reference period but then the trend becomes nearly linear after some
point in time. This is exactly what is seen in Figure 2.4b showing the time series of the TX90P
and TX10P index. In addition, because the induced nonlinear trend in the time series of the
TX90P index resembles the nonlinear trend in the time series of the yearly mean temperature
and both time series are in phase, the resulting trend between the extreme index and the explanatory variable is nearly linear. But Figure 2.4c illustrates that the linear trend magnitude
can be very different for temperature distributions with the same constant shape but different
scale parameter. Further, the figure implies that if changes in the scale parameter occur during
the shift of the whole temperature distribution the resulting trend becomes nonlinear.
In contrast to this, changes in the exceedance probabilities at the lower tail of the temperature distribution, illustrated with the TX10P index, exhibit a decrease in the exceedance
probability. However, the magnitude of the decreasing trend is much smaller than for the increasing trend in the TX90P index. This explains why the skill values for warm extreme indices
(temperature threshold in the upper tail of the temperature distribution) are often much better
than for cold extreme indices (thresholds in the lower tail). Furthermore, this explains why
the estimated response patterns for the warm indices (i.e. TX90P, TN90P, HWFI and SU) are
30
C HAPTER 2: L INEAR
SCALING OF TEMPERATURE EXTREMES
Annual mean daily maximum temperature
(a)
296
Temperature (K)
295
294
293
292
291
290
289
1951
2000
(b)
2050
2099
2050
2099
Time (Year)
Percentage of days per year
50
TX90P
TX90P fit
TX10P
TX10P fit
40
30
20
10
1951
2000
(c)
Time (Year)
1
σ/σ
Exceedance Probability
Reference
= 1.5
σ/σReference = 1.0
0.8
σ/σReference = 0.5
0.6
0.4
0.2
0
−3
−2
−1
0
1
∆µ (σ)
2
3
4
5
Figure 2.4: a) Time series of the annual mean daily maximum temperature (red circles) simulated
over the Mediterranean from 1951 to 2099 with the HadCM3Q3-HadRM3Q3 under the SRES A1B
emission scenario. The blue solid line represents the second-order polynomial fit. b) Time series of the
TX90P index (red dots) and TX10P (green dots) and the second-order polynomial fits (blue line). c)
Probabilities of exceeding the 90th percentile of a standard normal distribution as a function of changes
in the location parameter for three different values of the scale parameter.
2.5 D ISCUSSION
31
less affected by the internal variability. As a consequence of this trend magnitude and linearity
dependence on the shift direction of the whole temperature distribution, interpolated estimates
for extreme indices (i.e. 2021-2050 based on 1961-1990 and 2070-2099) have higher skill than
the extrapolated values. This implies that under the assumption of linearity there is a dependency of the estimated response pattern on the chosen time periods. The moving time window
applied on the TX90P index shows that if the second time slice is placed in the second half of
the 21st century the uncertainty of the estimated β1 decreases and the estimation converges to
a nearly constant value. This effect is especially pronounced over northern Europe, northeast
Europe and over the Mediterranean for the TX90P and TN90P index. This corresponds to the
regions where the largest warming in the annual mean surface air temperature is expected at
the end of the 21st century (Van Der Linden and Mitchell, 2009). For the TN10P index, TX10P
index and CWFI, the violation of the linearity assumption is the more dominant issue.
A further issue is that the extreme indices used in this study are so called count data, i.e.
they count the number of days or events per time period. Therefore, the range of possible values for the predictands is bounded. In addition, the data is discrete, i.e. for a specific range
of yearly mean temperature it is possible to get the same number of days or events per time
period. In such cases, simple linear regression models are not appropriate statistical models
for dealing with this kind of data. The bounded nature of the data results from the typical
property of a cumulative distribution function being bounded between zero and one, as evident
in Figure 2.4c. The question is whether it is possible to modify the pattern scaling method
taking the aforementioned issues into account. There are two potential solutions. First, there
is an ad-hoc approach setting the estimated values to the boundary value when they reach the
limit of reasonable values (i.e. no days or all days exceed the temperature threshold). Another
possibility is to use a logistic regression model instead of the simple linear regression model
(not shown). In general, there is no best alternative regression model or modification of the
established simple linear regression model leading to an overall improvement over the whole
integration area. The improvement depends on the most dominant issue in each of the simulated trends, i.e. if the nonlinearity or the bounded and/or discrete nature of the data is the
main issue. How pronounced these issues are depends on the extreme index and the simulated
trends. The magnitude of the trends determines how fast an extreme index reaches a particular
boundary value. This is especially pronounced in case of CWFI and HWFI because they have
a higher sensitivity on trends in the underlying temperature distribution. As a result, the trend
magnitude for both is much higher than for the other extreme indices. All these dependencies
explain why the improvements vary between different regions.
For FD and SU, the results suggest that an improvement of the results is possible using seasonal means instead of annual means. This is not surprising because these two indices depend
on how pronounced the annual cycle is. The trends in the annual mean temperature over northern Europe are dominated by trends in the DJF mean temperature and over southern Europe by
trends in the JJA mean temperature. For this purpose, the largest increase in the skills for FD
when using seasonal means are over northern Europe and for SU over the Mediterranean.
32
C HAPTER 2: L INEAR
SCALING OF TEMPERATURE EXTREMES
2.6 Conclusions
This study demonstrates the potential and limitations of pattern scaling approaches to fill-in
gaps in projections of temperature-related extreme indices based on local mean temperature
changes. For all extreme indices considered the least-squares approach performs better in interpolating and extrapolating changes in extreme indices from local mean temperature changes
than the time-slice approach. The skill for both methods is mainly limited by the degree of linearity of the underlying relationship between mean temperature and index. The linearity of this
relationship is determined by the shape of the underlying temperature distribution. In addition,
changes in the scale parameter during the shift of the temperature distribution can affect the
linearity. However, the shape of the temperature distribution is the main reason why in case of
increasing temperatures violations of the linearity assumption occur mainly for cold extremes
such as, for instance, the TN10P index or CWFI. As a result, the interpolated and extrapolated
estimations have lower skills for cold extreme indices (i.e. TX10P, TN10P, CWFI and FD) than
for warm extreme indices (i.e. TX90P, TN90P, HWFI and SU). Given an underlying warming
trend, these higher skills are observed for both pattern scaling approaches.
Trends in CWFI are highly sensitive to annual mean temperatures due to the duration minimum of 6 days. Therefore, this extreme index converges faster towards zero than, for instance, the TN10P index. The same problem will occur with ongoing warming for extreme
indices characterizing the upper tail of the temperature distribution (e.g. TX90P, HWFI). For
the HadCM3Q16-RCA3 model which projects a strong warming, this violation is already evident for the TX90P index over parts of southern Europe at the end of the 21st century.
The results of this study further illustrate that for both pattern scaling methods the skills
for the interpolation are better than for the extrapolation. This is to a large extent due to the
internal variability that obscures the long-term trend in the near-future period used to derive
the extrapolation. Testing the uncertainty induced by internal variability on estimations of β1
reveals that estimates of the slope parameter β1 are more robust for the second half of the 21st
century than for time periods close to the reference period. Setting the second time slice at the
end of the 21st century, reductions in the uncertainty of β1 up to 75% are observed especially
over Scandinavia and the Mediterranean for both pattern scaling approaches. Nevertheless, it
is important to note that the model uncertainty exceeds that from internal variability by far.
The results show that simulated trends in the time series of extreme indices can be related
to the direction and magnitude of the change in mean temperatures. Based on a particular
GCM-RCM pair we further demonstrate that the skill of the methods is dependent on regions
and seasons considered. Over northern Europe, for instance, the largest projected changes in
seasonal mean surface air temperature in case of the SRES A1B scenario occur for the season
December-February (Van Der Linden and Mitchell, 2009). On the other hand, for the season
June-August the largest projected changes are expected over southern Europe. This implies
that annual trends over some regions are dominated by particular seasonal trends. As a consequence of this, over such regions it is possible to improve the skills for the estimated FD and
SU values when seasonal means are used instead of annual means. Applying, for instance, the
least-squares approach on FD using seasonal DJF mean temperature changes lead to RMSE
values up to 50% smaller than for annual values.
Another important limitation of the pattern scaling approaches is that count data is bounded
2.6 C ONCLUSIONS
33
and/or discrete leading to non-physical or unreasonable estimations such as, for instance, negative number of days per year. All these issues lead to the conclusion that the traditional pattern
scaling approaches are only useful when the user knows where over the area of interest the
aforementioned issues can occur and how pronounced they are. Alternative pattern scaling
techniques such as, for instance, the logistic regression model or ad-hoc procedures discarding
unreasonable values can improve the estimations. But there is no overall solution possible due
to the spatial dependency of the discussed issues and the dependency on the GCM-RCM pair.
Chapter 3
Sensitivity of European extreme daily
temperature return levels to projected
trends in mean and variance
35
37
Sensitivity of European extreme daily temperature return levels
to projected trends in mean and variance
Andreas Lustenberger, Reto Knutti and Erich M. Fischer
Institute for Atmospheric and Climate Science, ETH Zurich, Switzerland
(Submitted to Journal of Geophysical Research)
Abstract
Both the mean and variance in European daily temperature are projected to change in the future.
Such a change affects also the return period of extreme events. The aim of this study is to investigate the relationship between changes in statistical moments of the daily temperature distribution and return levels of warm extremes. For this purpose, the distributions of daily maximum
temperature during summer and daily minimum temperature during winter for the control period 1961-1990 are scaled and shifted by the expected changes in the mean and variance. The
estimated return levels of warm summer day and winter night extremes are then compared with
the projections for the scenario period 2070-2099, actually simulated by regional climate models. This study shows that changes in return levels of warm summer and winter extremes can be
approximated as a simple shift of the full temperature distribution for one quarter to one half of
Europe across all examined climate models. In other areas, changes in the variance have to be
considered since models project reduced temperature variance in winter and enhanced variance
in summer. When for a particular temperature distribution the lowermost und uppermost percentiles change at different rates, the variance changes asymmetrically. Therefore, scaling the
control temperature distribution symmetrically, i.e. with central moments, does not always lead
to an improvement of the estimates compared with the location-adjusted return level estimates.
Nevertheless, the study shows that if the variance change associated with the upper half of the
temperature distribution is known then the location-scale-adjusted return levels perform better
than the location-adjusted return levels over most areas.
38
C HAPTER 3: S ENSITIVITY
OF TEMPERATURE EXTREMES TO WARMING
3.1 Introduction
Probability density functions (PDFs) describe quantities of the climate system such as, for
instance, the surface temperature distribution over typically several decades. Extreme events
are often defined as events exceeding a certain percentile of the PDF (Alexander et al., 2006;
Frich et al., 2002; Klein-Tank and Können, 2003). An appropriate statistical description of
frequency, intensity and duration of extreme events in the past, present and future climate is
important to detect significant trends. Such trends are of public and scientific interest due to the
large socio-economic and ecological impacts of extremes (Fouillet et al., 2006; Garcı́a-Herrera
et al., 2010; Barriopedro et al., 2011; Coumou and Rahmstorf, 2012). The observed trends in
temperature extremes are often characterized as a first approximation as a simple shift of the
full temperature distribution towards higher temperatures (Simolo et al., 2011, 2010; Ballester
et al., 2009a; Kharin et al., 2007, 2013). This shift in the mean accounts for much but not all of
the changes in the frequency of temperature extremes, and existing work argues that changes in
higher order statistical moments need to be taken into account (e.g. Schär et al., 2004; Scherrer
et al., 2005; Clark et al., 2006; Vidale et al., 2007; Fischer and Schär, 2009, 2010) to understand
changes in the tails of the parent temperature distribution.
Mearns et al. (1984) discussed the nonlinear relationships between changes in the mean
temperature and the probability of high temperature events. The study by Wigley (1988) shows
that return periods of warm extremes exhibit a nonlinear decrease for increasing trends in the
mean. Katz and Brown (1992) extended this work and demonstrated that extremes are even
more sensitive to changes in the variance (scale parameter) than in the mean. Klein-Tank and
Können (2003) stated that observed trends in cold and warm temperature extremes over Europe
from 1946 to 1990 are not in contradiction with the assumption of a simple shift of the temperature distribution. They show on their study that an asymmetry between the trends in the
cold and warm extremes becomes evident when they applied the trend analysis to sub-periods.
These observed asymmetric trends suggest an additional change in the variance. In this context,
Ferro et al. (2005) investigated how changes in distinct parts of the distribution are related to
changes in the moments without any assumption about the distribution family.
Several studies discuss the link between changes in the frequency of exceeding a particular temperature threshold and the moments (e.g. Mearns et al., 1984; Wigley, 1988; Katz and
Brown, 1992; Frich et al., 2002; Klein-Tank and Können, 2003; Ferro et al., 2005; Moberg
and Jones, 2005; Alexander et al., 2006; Moberg et al., 2006; Beniston et al., 2007; Toreti and
Desiato, 2008; Ballester et al., 2009b,a; Simolo et al., 2010, 2011). However, these studies use
moderate definitions of extremes (exceedance of 90th or 95th percentile) that do not account for
any severity variations (Coelho et al., 2008; Katz, 2010).
Therefore, the aim of this project is to investigate the potential link between changes in the
return levels of warm extremes and changes in the mean and higher order central moments of
the underlying temperature distribution using regional climate model data from the European
multi-model experiment ENSEMBLES (Van Der Linden and Mitchell, 2009).
3.2 DATA
AND
M ETHODS
39
3.2 Data and Methods
3.2.1 ENSEMBLES regional climate models
The simulations are forced with the SRES A1B emission scenario. A set of 12 model chains
(see table 3.1), in which a global climate model (GCM) is driving a regional climate model
(RCM), is used. These GCM-RCM chains have a common grid and the transient runs cover the
time period from 1951 to 2099. The horizontal resolution of the GCM-RCM chains is about
25 km. Only grid points with a land area fraction of more than 50% are considered in this study.
Institution
RCM model
Driving GCM
Irish National Meteorological
Service (C4I)
RCA3
HadCM3Q16
Danish Meteorological Institute (DMI)
DMI-HIRHAM
ECHAM5
Danish Meteorological Institute (DMI)
DMI-HIRHAM
ARPEGE
Swiss Federal Institute of
Technology (ETH)
CCLM
HadCM3Q0
Royal Netherlands Meteorological
Institute (KNMI)
RACMO
ECHAM5
Met Office Hadley Centre (HC)
HadRM3Q0
HadCM3Q0
Met Office Hadley Centre (HC)
HadRM3Q3
HadCM3Q3
(low sensitivity)
Met Office Hadley Centre (HC)
HadRM3Q16
HadCM3Q16
(high sensitivity)
Swedish Meteorological and
Hydrological Institute (SMHI)
RCA
ECHAM5
Swedish Meteorological and
Hydrological Institute (SMHI)
RCA
BCM
Swedish Meteorological and
Hydrological Institute (SMHI)
RCA
HadCM3Q3
Max-Planck-Institute for
Meteorology (MPI)
REMO
ECHAM5
Table 3.1: The ENSEMBLES regional climate models (RCM) with the same rotated pole grid and available simulations for the time period from 1951 to 2099.
40
C HAPTER 3: S ENSITIVITY
OF TEMPERATURE EXTREMES TO WARMING
3.2.2 Return level estimation
Return levels of warm extremes in European climate are estimated from daily maximum temperature (TMAX) for the summer season June-July-August (JJA) and daily minimum temperature (TMIN) for the winter season December-January-February (DJF). The reason for exploring warm winter night instead of warm summer night extremes is that periods of warm summer
nights are correlated to high maximum temperatures in the daytime (Klein-Tank, 2010). Warm
winter nights due to warm winter spells have impacts on the avalanche risk, seasonal snow
cover, growing period and the income of the alpine ski resorts (Beniston, 2005). To get identically distributed samples, the data is linearly detrended at each grid point for the control
(1961-1990) and scenario (2070-2099) period. The removal of trends is also important for reducing the inflation of the estimated scale parameter (e.g. Scherrer et al., 2005) caused by the
presence of a trend. A further issue is that daily data is serially correlated due to the synoptic time scales of weather events (e.g. blockings or low pressure systems). This correlation
affects the estimated parameters of the extreme value distribution. To reduce this effect, we
declustered the samples by taking every 5th day from the detrended samples. The 5-day time
lag is selected because this value represents the mean persistence of synoptic systems, i.e. low
and high pressure systems, in the extratropics (Austin and Lentz, 1999). The 5-day time lag
reduces substantially the risk to sample several times from the same event and ensures a reasonable sample size. To investigate the sensitivity of the results to the declustering procedure,
several subsamples are generated beginning each time with another day.
An extremely warm temperature event E is defined as E = (Ti > u) with Ti as the temperature for the corresponding day i = 1, 2, · · · , n drawn from a sample of size n. The exceedances
yi = Ti − u for a high enough temperature threshold u have a two-parameter limiting distribution of the form
− ξ1
ξ·y
,
(3.1)
y ∼ GP D(y) = 1 − 1 +
σGP
where y > 0 and 1 + ξ · y/σGP > 0 (Coles, 2001). ξ is the shape parameter and σGP > 0 the
scale parameter of the Generalized Pareto distribution (GPD). If ξ < 0, then the distribution
has a bounded upper tail. Although the block maxima approach has been widely used (e.g.
Kharin and Zwiers, 2005; Nikulin et al., 2011) for modelling extreme events, the peak-overthreshold (POT) approach is preferable (e.g. Brown et al., 2008; Kyselý et al., 2008) because
more information from the sample is used. But to investigate the sensitivity of the approach in
this study, the analysis has been repeated with the block maxima approach (not shown). The
return level estimates are very similar to those derived from GPD and, therefore, the results for
the POT approach will be discussed only.
The main difficulty of the POT approach is to find an appropriate temperature threshold u.
The threshold should be as small as possible, so that the variance of the estimated parameter
is small but not too small in order to avoid any biases due to the sampling of events which are
statistically not extreme.
Often thresholds are only selected visually and are not chosen based on an objective criterion. This makes it difficult to apply these methods to large gridded data sets. The threshold
choice here is based on the Anderson-Darling statistic (Choulakian and Stephens, 2001) and the
dispersion index statistic (e.g. Kyselý et al., 2008). The null hypothesis H0 of the Anderson-
3.2 DATA
AND
M ETHODS
41
Darling statistic is that the sample yi = Ti − u comes from a GPD with unknown shape and
scale parameter. Similar to the approach of Davison and Smith (1990), the procedure starts
at the 70th percentile of the corresponding temperature distribution and the percentile is raised
successively until H0 is not rejected at the significance level α = 0.1. If the chosen percentile
threshold u is appropriate, the number of exceedances per time period follows a Poisson distribution. Thus, the dispersion index statistic DI = s2 N /mN should have a value of 1. mN and sN
are the estimated mean and standard deviation of the number of exceedances per time period
over N years. The confidence intervals of DI are derived from a χ-square distribution with N-1
degrees of freedom. Based on the results of both tests, the lowest possible percentile threshold,
common to all GCM-RCM chains, time periods and all grid points over the integration area
was assigned so that the sampling uncertainty of (ξ, σGP ) is minimized and the proportion of
points exceeding the percentile threshold is equal for each grid point.
For the POT approach, the N-year return levels zN and their confidence intervals are estimated according to Coles (2001) for each grid point of a particular GCM-RCM chain, each
season and for the return periods N = 2, 5, 10, 15 and 20 years. The parameters (σGP , ξ) and
their 95% confidence intervals are estimated with the maximum likelihood approach. The confidence intervals are used to detect significant changes in the parameters between the control
and scenario period. Over most of the integration area, the temperature PDFs have a bounded
upper tail (not shown), i.e. the shape parameter ξ is negative but can be close to zero. The latter
indicates a more moderate upper tail.
The last step is to evaluate the sensitivity of the N-year return level estimates to the methodological approaches, i.e. the temperature threshold selection and declustering procedure, using
the root mean squared error (RMSE). A threshold is considered robust, i.e. independent of
the methodological approaches, if the RMSE of the return level estimates between the chosen
and another appropriate threshold has a minimum value and, additionally, this value remains
approximately constant. To minimize the uncertainty arising from the sample size, the smallest
temperature threshold that was suggested by both significance tests and proved to be robust was
selected. Based on the test statistics and sensitivity analysis we find the 85th percentile to be an
appropriate temperature threshold for all grid points, GCM-RCM pairs and each season. Next
we examine the sensitivity to the declustering procedure by testing the differences between the
return level estimates derived from different sub-samples. The estimates are more sensitive for
winter TMIN than for summer TMAX presumably due to the higher variance in winter (e.g.
Yiou et al., 2009; Hurrell et al., 2003). However, the sensitivity is negligible compared to the
trends and estimation uncertainties of the return levels.
3.2.3 Adjustment models
We test two models, one that accounts for the change in the temperature mean (location parameter) only and a second that additionally accounts for a change in the temperature variance
(scale parameter). The underlying assumption of this study is that we have data for the control
period (CTL) and we know the changes in the location (µ) and scale parameter (σ) of the par-
42
C HAPTER 3: S ENSITIVITY
OF TEMPERATURE EXTREMES TO WARMING
ent daily minimum and maximum temperature distribution TiCT L . Similar to the approach of
Ferro et al. (2005), the first hypothesis, i.e.
HL : TiSCN = µSCN + TiCT L − µCT L ,
(3.2)
assumes that the temperature PDF of the control period TiCT L is shifted by the change in the
mean temperature ∆µ. The subscript i = 1, 2, · · · , M, where M is the sample size, denotes
the daily temperature values. This implies that the Pareto distribution and, therefore, the parameters (ξ, σGP ) remain constant, and, hence, the return levels exhibit the same trend-induced
change as the location parameter µ of the parent temperature distribution. This assumption is
a simplification because changes in extremes are affected by various physical processes on different spatio-temporal scales (Katz, 2010) and, consequently, the response of each process to
changes in the mean temperature might be different. When the location-adjusted return levels
are outside of the 95% confidence intervals of the projected return levels, then the locationscale-adjusted model will be applied. This model adjusts the location and scale of the daily
minimum and maximum temperature PDF, i.e. the hypothesis
CT L
Ti
− µCT L
SCN
(3.3)
= µSCN + σSCN ·
HLS : Ti
σCT L
is tested. The location-scale adjustment model assumes that the temperature PDF of the control
period is additionally scaled by the expected change in the variance σ.
To evaluate the potential of the location and location-scale adjustment models, the rootmean squared error (RMSE) and the percentage error are used. The percentage error is defined
as
T̂NSCN − T̃NSCN
,
(3.4)
T̃NSCN − T̃NCT L
where T̂NSCN and T̃NSCN are the adjusted and projected N-year return levels for the scenario
period. T̃NSCN − T̃NCT L are the projected N-year return level changes between scenario and
control period. The motivation is to show especially the results for the location adjustment
model because it is the simplest model and there is high confidence in projected patterns of
mean changes (the results for the location-scale adjustment model are shown in auxiliary material). In addition, other applications such as, for instance, pattern scaling (Mitchell and Hulme,
1999; Huntingford and Cox, 2000; Mitchell, 2003) use projected changes in the mean temperature to provide climate change scenarios for time periods or emission scenarios for which no
simulations are available.
3.3 Results and Discussion
3.3.1 Performance for the summer season
In order to explore how well the two approaches work, we compare the adjusted changes (here
used to refer to changes inferred with a statistical adjustment model) against projected changes
(here used to refer to future simulations with a physical climate model) in 2 to 20-year return levels of warm temperature extremes. As an example, Figure 3.1 shows the 20-year JJA
3.3 R ESULTS
AND
D ISCUSSION
43
return levels for the control period (a), the expected changes for the scenario period (b), the
location-adjusted (c) and the location-scale-adjusted method (d) in case of the HadCM3Q0CCLM model, i.e. the GCM-RCM pair with the lowest RMSE values between the locationadjusted and projected 2 to 20-year return levels. The stippled areas in Figure 3.1b represent
significant changes in the projected 20-year JJA return levels.
(a) CTL
(b) SCN-CTL
50° N
°
40 N
°
0
°
10 E
°
20 E
°
7
6
°
60 N
5
4
3
°
50 N
2
°
1
40 N
0°
30 E
(c) HL -CTL
°C
JJA
60 N
8
70° N
HadCM3Q0−CCLM
°
46
44
42
40
38
36
34
32
30
28
26
°C
70° N
10° E
20° E
30° E
0
(d) HLS -CTL
70° N
8
70° N
60° N
50° N
40° N
°
0
°
10 E
°
20 E
°
30 E
6
60° N
5
4
3
50° N
2
40° N
1
°
0
°
10 E
°
20 E
°
30 E
Figure 3.1: 20-year return level estimates for the daily JJA maximum temperature in case of the
HadCM3Q0-CCLM model. (a) 20-year JJA return level estimates for the control period. (b) Projected 20-year JJA return level changes in 2070-2099 relative to 1961-1990. Stippled areas represent
significant changes. (c, d) Adjusted 20-year JJA return level changes in 2070-2099 for location-adjusted
HL (c) and location-scale-adjusted HLS (d) temperature distributions.
The HadCM3Q0-CCLM model projects the largest increase in the 20-year JJA return level over
south-eastern Europe, parts of eastern and central Europe, western France and northern Iberian
Peninsula (Figure 3.1b). This is consistent with the other GCM-RCM pairs projecting the
largest warming in the 2 to 20-year JJA return levels over most of southern and south-eastern
Europe as well as for parts of western, eastern and central Europe. However, no significant
0
°C
HadCM3Q0−CCLM
HadCM3Q0−CCLM
7
44
C HAPTER 3: S ENSITIVITY
OF TEMPERATURE EXTREMES TO WARMING
changes are projected for 5 of 12 GCM-RCM pairs over Scandinavia and parts of the British
Isles. Two GCM-RCM chains project no significant changes over central Europe. This agrees
with the projected changes in the 90th to 99th percentiles of the parent temperature distributions
which are not substantial over these areas. Figure 3.1c shows that the location-adjusted 20year JJA return levels agree well with the projections over south-eastern, western and eastern
Europe. Further, the figure shows that the largest differences between projected and adjusted
20-year JJA return levels are found over Scandinavia, i.e. over the area with no significant
changes in the projected return levels. This is also evident for other GCM-RCM pairs when
no significant warming in the return levels is projected (see Figure 3.2). Figure 3.2 shows
the performance of the location-adjusted method for all GCM-RCM pairs measured as percentage error, i.e. the differences between location-adjusted and projected 20-year JJA return
level estimates (error) relative to the projected increase (signal). The stippled areas represent
location-adjusted estimates which are within the 95% confidence intervals of the actually projected return levels, i.e. where the method works well.
All GCM-RCM pairs tend to underestimate the projections over most of Europe when the
location-adjusted method is used. The fraction of grid points with 2 to 20-year JJA return level
estimates within the projected 95% confidence intervals is about 30-50% for all GCM-RCM
chains. This means that for these grid points the assumption of constant scale σGP and shape
ξ parameter of the Generalized Pareto distribution (GPD) is valid for the location-adjusted
approach. For the remaining grid points, changes in the variance or higher moments have
to be considered. In this context, 20-35% of the grid points exhibit significant changes in
σGP caused by changes in the variance of the parent summer TMAX distribution. Thus, the
location-adjusted model fails mainly over areas where a significant increase in summer temperature variance is projected (Yiou et al., 2009; Fischer and Schär, 2009, 2010; Nikulin et al.,
2011; Fischer et al., 2012b). For the HadCM3Q0-CCLM model, for instance, the variance
increase ranges from 10% to 40% over areas along the Mediterranean and parts of western,
central and eastern Europe.
Taking into account changes in variance, the 2nd central moment, lead to improved return
level estimates (Figure 3.1d) in terms of the RMSE values but they can be still outside of the
confidence intervals. For all climate models, the location-scale-adjusted 2 to 20-year JJA return
level estimates fall within the confidence interval of the corresponding model projections for
40-75% of the European area. However, there are areas over Europe for almost all GCM-RCM
pairs where the location-scale-adjusted approach performs worse than the location-adjusted
approach (see Figure S1 of the auxiliary material). This raises the question whether there are
substantial changes in the shape of the parent temperature distribution affecting the shape of
GPD. This is supported by the fact that the number of grid points with significant changes in
the shape parameter ξ of GPD ranges from 20% to 30% for all GCM-RCM pairs.
3.3.2 Performance for the winter season
The issues discussed for summer warm extremes are also found for adjusted winter warm
extremes. Figure 3.3 depicts the same as Figure 3.1 but now for the 20-year return levels of
winter warm extremes in TMIN and for the HadCM3Q0-CCLM model. This GCM-RCM chain
D ISCUSSION
60° N
50° N
°
40 N
°
0
°
10 E
°
20 E
°
40 N
°
0
°
10 E
°
20 E
°
50 N
°
°
°
10 E
°
20 E
°
60 N
°
50 N
°
°
30 E
0
°
°
10 E
°
20 E
°
50 N
°
40 N
°
0
°
10 E
°
20 E
°
50 N
°
°
°
10 E
°
20 E
50° N
°
°
°
10 E
°
20 E
°
30 E
°
30 E
100
80
60
40
20
0
−20
−40
−60
−80
−100
°
°
50 N
°
10 E
°
20 E
°
30 E
°
10 E
°
20 E
°
30 E
0
°
70° N
°
60 N
50° N
°
60 N
50° N
°
40 N
0
20 E
°
°
°
40 N
°
60 N
30 E
HadCM3Q3−RCA
ECHAM5−RCA
°
10 E
°
40 N
°
70° N
60 N
°
°
50 N
40 N
40 N
0
70° N
°
°
°
°
30 E
70 N
60 N
30 E
°
0
ECHAM5−REMO
HadCM3Q16−HadRM3Q16
°
20 E
°
°
70 N
60 N
°
60 N
30 E
°
70 N
10 E
°
40 N
0
°
100
80
60
40
20
0
−20
−40
−60
−80
−100
70 N
°
40 N
°
40 N
0
HadCM3Q0−HadRM3Q0
°
50° N
°
°
60 N
60° N
30 E
70 N
ECHAM5−RACMO
HadCM3Q0−CCLM
50° N
°
°
HadCM3Q3−HadRM3Q3
60° N
30 E
70 N
BCM−RCA
70° N
ECHAM5−DMI−HIRHAM
70° N
ARPEGE−DMI−HIRHAM
HadCM3Q16−RCA3
70° N
45
%
AND
%
3.3 R ESULTS
40 N
°
0
°
10 E
°
20 E
°
30 E
0
°
Figure 3.2: Percentage errors defined as differences between location-adjusted (HL ) and projected 20year return level estimates (error) for the scenario period and daily JJA maximum temperature related
to the expected changes between control and scenario period (signal). The location-adjusted estimates
over the stippled areas are within the 95% confidence intervals of the projected return levels.
46
C HAPTER 3: S ENSITIVITY
OF TEMPERATURE EXTREMES TO WARMING
has again the lowest spatially averaged RMSE values for the location-adjusted 2 to 20-year DJF
return levels. As in Figure 3.2, Figure 3.4 shows the percentage error of the location-adjusted
method for all GCM-RCM pairs.
(a) CTL
(b) SCN-CTL
12
8
°
70 N
10
8
DJF
60 N
4
2
50° N
0
−2
°
40 N
°C
6
°
HadCM3Q0−CCLM
7
6
°
60 N
5
4
3
50° N
2
°
1
40 N
°
0
°
10 E
°
20 E
°
−4
0°
30 E
(c) HL -CTL
°C
°
70 N
10° E
20° E
30° E
0
(d) HLS -CTL
70° N
8
70° N
60° N
50° N
°
40 N
6
60° N
5
4
3
50° N
2
°
1
40 N
0°
10° E
20° E
30° E
0°
10° E
20° E
30° E
Figure 3.3: 20-year return level estimates for the daily DJF minimum temperature in case of the
HadCM3Q0-CCLM model. (a) Adjusted 20-year DJF return level estimates for the control period.
(b) Projected 20-year DJF return level changes in 2070-2099 relative to 1961-1990. Stippled areas represent significant changes. (c, d) Adjusted 20-year DJF return level changes in 2070-2099 for locationadjusted HL (c) and location-scale-adjusted HLS (d) temperature distributions.
For the HadCM3Q0-CCLM model, the largest projected increases in the 20-year DJF return
levels are found over parts of the Mediterranean, central Europe, south-eastern Europe as well
as parts of eastern Europe and Scandinavia (Figure 3.3b). This agrees with the regions found
in the other climate models. Figure 3.4 illustrates for the HadCM3Q0-CCLM model that the
location-adjusted 20-year DJF return levels are within the projected 95% confidence intervals
over parts of the Mediterranean, southern France, eastern Iberian Peninsula, northern parts of
the British Isles and parts of Scandinavia. For all GCM-RCM chains, the location-adjusted 2 to
0
°C
HadCM3Q0−CCLM
HadCM3Q0−CCLM
7
D ISCUSSION
60° N
50° N
°
40 N
°
0
°
10 E
°
20 E
°
40 N
°
0
°
10 E
°
20 E
°
50 N
°
°
°
10 E
°
20 E
°
60 N
°
50 N
°
°
30 E
0
°
°
10 E
°
20 E
°
50 N
°
40 N
°
0
°
10 E
°
20 E
°
50 N
°
°
°
10 E
°
20 E
50° N
°
°
°
10 E
°
20 E
°
30 E
°
30 E
100
80
60
40
20
0
−20
−40
−60
−80
−100
°
°
50 N
°
10 E
°
20 E
°
30 E
°
10 E
°
20 E
°
30 E
0
°
70° N
°
60 N
50° N
°
60 N
50° N
°
40 N
0
20 E
°
°
°
40 N
°
60 N
30 E
HadCM3Q3−RCA
ECHAM5−RCA
°
10 E
°
40 N
°
70° N
60 N
°
°
50 N
40 N
40 N
0
70° N
°
°
°
°
30 E
70 N
60 N
30 E
°
0
ECHAM5−REMO
HadCM3Q16−HadRM3Q16
°
20 E
°
°
70 N
60 N
°
60 N
30 E
°
70 N
10 E
°
40 N
0
°
100
80
60
40
20
0
−20
−40
−60
−80
−100
70 N
°
40 N
°
40 N
0
HadCM3Q0−HadRM3Q0
°
50° N
°
°
60 N
60° N
30 E
70 N
ECHAM5−RACMO
HadCM3Q0−CCLM
50° N
°
°
HadCM3Q3−HadRM3Q3
60° N
30 E
70 N
BCM−RCA
70° N
ECHAM5−DMI−HIRHAM
70° N
ARPEGE−DMI−HIRHAM
HadCM3Q16−RCA3
70° N
47
%
AND
%
3.3 R ESULTS
40 N
°
0
°
10 E
°
20 E
°
30 E
0
°
Figure 3.4: Percentage errors defined as differences between location-adjusted (HL ) and projected 20year return level estimates (error) for the scenario period and daily DJF minimum temperature related
to the expected changes between control and scenario period (signal). The location-adjusted estimates
over the stippled areas are within the 95% confidence intervals of the projected return levels.
48
C HAPTER 3: S ENSITIVITY
OF TEMPERATURE EXTREMES TO WARMING
20-year DJF return levels are within the confidence intervals for 25-35% of Europe, mainly over
parts of western, southern and south-eastern Europe. However, the location-adjusted changes
are overestimated over most of Europe (see Figure 3.4) and underestimated over parts of southern and south-eastern Europe. Again, most of the climate models have areas with no significant
changes in the projected 2 to 20-year DJF return levels, mainly located over western Europe
and parts of central Europe (see Figure 3.3b and 3.4). But their location is less consistent across
the different climate models compared with the projected JJA return level changes.
The general projected decrease in winter minimum temperature variance (Kjellström et al.,
2007; Fischer et al., 2012b) causes significant changes in the scale parameter σGP for 30-55%
of the grid points. The largest winter variance decrease is found for 10 of the 12 GCM-RCM
pairs over northern and eastern Europe and, therefore, the location-adjusted DJF return levels
exceed the confidence intervals there at almost all grid points. Similar as for the JJA return
level estimates, there are areas where the location-scale adjustment model fails although the
changes in the scale parameter of the parent distribution are significant (see Figure S2 of the
auxiliary material). In addition, there are significant changes in the shape parameter ξ for 3050% of the European area. Areas with largest changes in the shape parameter ξ exhibit also the
largest biases in the location-adjusted and location-scale-adjusted return levels (dark-green and
dark-red areas in Figure 3.2 and 3.4).
3.3.3 Limitations of the adjustment models
Figure 3.2 and 3.4 imply that the applicability of the location adjustment model is limited to
specific regions where the GCM-RCM chains do not project significant changes in the temperature variance. For the other regions, changes in the variance or even higher moments play
an important role. For all climate models, the location-scale-adjusted 2 to 20-year DJF return
level estimates fall within the confidence interval of the corresponding model projections for
25-65% of Europe. The associated reduction in the spatially averaged RMSE values over all
return periods ranges from 15% to 60%. In addition, there is a reduction in the spatial variance
of the RMSE values of about 20-70% implying lower dependency of the results to the location.
To understand the role of changes in the shape of the parent distribution and to answer why
the location-scale adjustment model does not improve the return level estimates over all of Europe, we explored the changes in the percentiles and return levels for the location-scale-adjusted
and projected temperature PDFs. The estimated changes are given relative to the projected differences in the 50th percentile between control and scenario period. To discuss some particular
aspects, two illustrative examples are chosen and shown in Figure 3.5. The estimated changes
are depicted for summer TMAX in the RCA-ECHAM5 model (Figure 3.5a) and for winter
TMIN in the HadCM3Q0-HadRM3Q0 model (Figure 3.5b) over the PRUDENCE region Middle Europe (Christensen and Christensen, 2007).
Figure 3.5a illustrates that summer warm extremes warm more than the mean and summer
cold extremes less than the mean resulting in an increasing variance. In addition, Figure 3.5a
implies that the increase in the variance is nearly symmetric over most of the area. For all
GCM-RCM pairs, the increasing importance of soil moisture-temperature coupling in a future
3.3 R ESULTS
AND
D ISCUSSION
49
climate (Fischer and Schär, 2009; Fischer et al., 2012b) might partly explain this particular
PDF evolution,
(a)
3.5
3.5
Projected
Location−scale−adjusted
3
1.5
)
50th,CTL
−q
50th,SCN
) / (q
p,CTL
1
2
N
−q
1.5
p,SCN
2
50th,SCN
2.5
∆z / (q
2.5
−q
50th,CTL
)
3
(q
1
0.5
0.5
0
0
1
5
10
25
75
90
95
99
Percentile
5
10
20
Return Period (year)
(b)
3.5
3.5
Projected
Location−scale−adjusted
3
1.5
)
50th,CTL
−q
50th,SCN
) / (q
p,CTL
1
2
N
−q
1.5
p,SCN
2
50th,SCN
2.5
∆z / (q
2.5
−q
50th,CTL
)
3
(q
1
0.5
0.5
0
0
1
5
10
25
75
90
95
99
5
10
20
Return Period (year)
Percentile
5
Figure 3.5: Projected and location-adjusted changes in percentiles and 5, 10 and 20-year return levels relative to the projected changes in the 50th percentile between scenario (SCN) and control (CTL)
period over the PRUDENCE region Middle Europe. The ratios are shown for the daily JJA maximum
temperature and the ECHAM5-RCA model (a) as well as for the daily DJF minimum temperature and
the HadCM3Q0-HadRM3Q0 model (b).
50
C HAPTER 3: S ENSITIVITY
OF TEMPERATURE EXTREMES TO WARMING
especially over western, central and parts of eastern Europe. This is consistent with the other
climate models showing this variance increase in a zonal area extending from the Mediterranean to the North Sea. The increased variance, as shown in Figure 3.5a, implies that the
location adjustment model underestimates the 20-year JJA return levels over Middle Europe
(not shown) and, additionally, they are not within the 95% confidence intervals of the projected
return levels. After considering changes in the summer temperature variance, the adjusted return levels are within the confidence intervals.
For winter TMIN, the cold tail warms more and the warm tail less than the mean leading
to a decreased variance (de Vries et al., 2012; Fischer et al., 2012b). This behaviour is evident
in all GCM-RCM chains over northern and eastern Europe. One factor driving these differing trends in the winter minimum temperature might be the shortening of the snow season
(Lawrence and Slater, 2010) and changes in the large-scale atmospheric variability (Hurrell
et al., 2003). Figure 3.5b shows that the warming in the cold tails is larger than for the mean.
However, the warming in the warm tails is similar to the warming in the mean. As a result,
the variance reduction is dominated by the projected changes in the cold tails, i.e. the variance
decreases asymmetrically. Therefore, the location-scale-adjusted 2 to 20-year return levels for
winter TMIN are underestimated and not within the confidence intervals of the projected return
levels after considering the asymmetric variance decrease.
The location-scale adjustment method assumes symmetric changes in the variance. However, Figure 3.5b shows that this assumption is not always justified. The asymmetric variability
change, also mentioned in the study of Klein-Tank and Können (2003), is responsible that the
location-scale-adjusted return levels can become worse than the location-adjusted ones. The
resulting under- and overestimation are amplified in the tails due to their higher sensitivity to
changes in the parent temperature distribution as discussed by Katz and Brown (1992). This
leads to very large differences between adjusted and projected return levels corresponding to
the dark-green and dark-red areas in Figure 3.2 and 3.4.
Looking at the mathematical definition of return levels, it is evident that changes in the location parameter of the parent distribution adjust the temperature threshold u. Trend-induced
changes in higher order moments affect the scale and shape parameter, i.e. σGP and ξ, of the
Pareto distribution. We have seen that the information about changed variance is only useful
when the change is symmetric. For all other cases, we need to know how much of the asymmetric scale change can be attributed to the upper tail of the parent temperature distribution.
As an example for this asymmetric scale change, Figure 3.6 shows the Pareto distributions of
the projected, location-adjusted and location-scale-adjusted winter TMIN distributions for the
HadCM3Q3-RCA model and a single grid point located in southern Europe.
Figure 3.6 shows that the location-adjusted 2 to 20-year DJF return levels are within the
projected confidence intervals. However, the slightly different scale parameter σGP for the
location-adjusted winter TMIN distribution leads to increasing disagreement for small return
periods and an overestimation for high return periods. Because the change in the scale parameter of the parent distribution is dominated by the larger trends in the lower half of the
distribution (not shown), the variance decrease is overestimated in the upper half when central
moments are used. Consequently, the location-scale-adjusted return levels are all outside of
the projected confidence intervals. Thus, we computed first the differences between the mean
and all temperature values in the upper half of the PDF. These differences are then used to
3.4 C ONCLUSIONS
51
Probability Density Function
0.8
Scenario
Location adjusted
Location−scale adjusted
Location−scale adjusted (modified)
0.6
0.4
0.2
0
0
2
4
6
8
10
th
Temperature exceeding the 85 percentile threshold [°C]
Figure 3.6: Projected (black dashed line), location (L) and location-scale adjusted (LS) Pareto probability density functions of winter minimum temperature for a single grid point over the Mediterranean
and the HadCM3Q3-RCA model. For estimating the variability change, on the one hand the whole
sample was used, on the other hand only the differences between the mean of the whole sample and all
temperature values in the upper half of the PDF (LS (modified)). The black dots are the projected 2 to
20-year DJF return levels with confidence intervals.
estimate the change in the variance in the upper half labeled in the figure as “modified”. Using
this estimation for the variance change as input for the location-scale adjustment model, the
adjusted Pareto PDF fits almost perfectly the projected PDF. With this approach, the fraction
of grid points with adjusted 2 to 20-year return levels within the projected confidence intervals can be raised, relative to the location-scale adjustment model with variance estimated with
the whole sample, by additional 20% for summer TMAX and by 25% for winter TMIN. But
for this approach, we would need the information about the temperature PDF for the scenario
period contradicting the initial assumption of no knowledge about future temperature PDFs.
3.4 Conclusions
We show that estimated changes in return levels of warm temperature extremes in summer
and winter based on a simple shift of the full daily temperature distribution (location-adjusted)
are within the projected confidence intervals for one quarter to one half of Europe across all
examined climate models. The location-adjusted estimate performs better for warm extremes
in summer than in winter. The shifted return level estimates tend to underestimate the projected changes in warm extremes in summer, and to overestimate the projected changes in
warm extremes in winter over most of Europe. The primary reason is the significant projected
changes in daily temperature variance. In summer, changes in warm extremes are larger than
for cold extremes leading to enhanced variance (see also Schär et al., 2004; Vidale et al., 2007;
52
C HAPTER 3: S ENSITIVITY
OF TEMPERATURE EXTREMES TO WARMING
Kjellström et al., 2007; Fischer and Schär, 2009, 2010; Fischer et al., 2012b). In winter, cold
extremes warm more than warm extremes leading to reduced variance (Kjellström et al., 2007;
de Vries et al., 2012; Fischer et al., 2012b).
If the change in the variance is also accounted for (location-scale-adjusted), the spatially
averaged mean error is generally reduced in both winter and summer (not shown). However,
there are still areas where large estimation biases remain for the location-scale-adjusted estimates. We even find cases where return levels adjusted with location and scale are worse than
those adjusted by location only. This is due to asymmetric changes in the variance, which
causes large estimation biases and implies that the use of central moments is not appropriate.
Therefore, the location-scale-adjusted return levels perform poorly in those regions. The performance of the location-scale adjustment method can be further improved if the change in the
variance is estimated separately for the corresponding half (i.e. lower half for cold extremes
and upper half for warm extremes) of the temperature distribution. With this approach, the
return level estimates are within the projected confidence intervals for 60-80% of Europe in
summer and for 40-70% in winter. But this implies detailed knowledge about the future summer maximum and winter minimum temperature distributions. The variance change and the
degree of asymmetry depend on the regional climate model, i.e. changes in the large-scale
atmospheric circulation and feedback mechanisms including the representation of snow-cover,
soil-moisture, clouds and the surface energy budget.
This study further illustrates that there are significant changes in the shape of the upper
tail of the summer maximum and winter minimum temperature distributions and, therefore, in
the shape parameter of the Pareto distribution over northern, western and central Europe. This
implies changes in the shape or higher-order moments of the parent temperature distribution.
Further work is necessary to understand the physical processes behind these changes. Over
these areas, the use of the location and location-scale adjustment models leads to large estimation biases. This is especially the case for extremes with low probabilities due to their higher
sensitivity to changes in the moments of the parent distribution.
In general, information on future temperature distribution is often not available to account
for asymmetric changes in the variance. One promising option would be to identify relationships between mean and variance changes associated with the upper and lower half of the
temperature distribution, similar to the approach proposed by Yiou et al. (2009).
The evaluation of the performance of both adjustment models in this study was limited to
one emission scenario. However, simulations for several emission scenarios would help investigating the robustness of potential relationships between distribution parameters and return
levels of temperature extremes.
3.5 AUXILIARY
53
MATERIAL
°
°
50 N
°
40 N
°
°
10 E
°
20 E
°
50 N
°
40 N
°
°
0
°
°
10 E
°
20 E
°
50 N
°
°
°
10 E
°
20 E
°
60 N
°
50 N
°
°
30 E
0
°
10 E
°
20 E
50° N
°
40 N
°
0
°
10 E
°
20 E
50° N
°
°
10 E
°
20 E
°
°
60 N
°
50 N
10° E
20° E
30° E
30 E
100
80
60
40
20
0
−20
−40
−60
−80
−100
°
50° N
0
°
10 E
°
20 E
°
30 E
°
0°
10° E
20° E
30° E
°
60 N
°
50 N
°
40 N
0°
°
°
°
40 N
20 E
70 N
HadCM3Q3−RCA
ECHAM5−RCA
°
50 N
°
60° N
°
°
°
10 E
°
40 N
30 E
70 N
60 N
°
°
50 N
°
°
°
°
40 N
0
70 N
30 E
0
40 N
°
°
70° N
60° N
30 E
20 E
°
°
ECHAM5−REMO
60° N
°
60 N
30 E
70° N
HadCM3Q16−HadRM3Q16
70° N
10 E
°
40 N
0
°
100
80
60
40
20
0
−20
−40
−60
−80
−100
70 N
°
40 N
°
40 N
0
HadCM3Q0−HadRM3Q0
°
°
50 N
°
°
60 N
°
60 N
30 E
70 N
ECHAM5−RACMO
HadCM3Q0−CCLM
°
60 N
30 E
70 N
HadCM3Q3−HadRM3Q3
ECHAM5−DMI−HIRHAM
ARPEGE−DMI−HIRHAM
HadCM3Q16−RCA3
°
60 N
0
BCM−RCA
°
70 N
70 N
%
°
70 N
%
3.5 Auxiliary material
40 N
0°
10° E
20° E
30° E
Figure S1: Percentage errors defined as differences between location-scale-adjusted (HLS ) and projected 20-year return level estimates (error) for the scenario period and daily JJA maximum temperature related to the expected changes between control and scenario period (signal). The location-scaleadjusted estimates over the stippled areas are within the 95% confidence intervals of the projected return
levels.
°
°
°
50 N
°
40 N
°
°
10 E
°
20 E
°
60 N
°
50 N
°
40 N
°
°
30 E
0
°
°
10 E
°
20 E
°
50 N
°
°
°
10 E
°
20 E
°
60 N
°
50 N
°
°
30 E
0
°
°
10 E
°
20 E
°
50 N
°
40 N
0°
10° E
20° E
°
°
°
50 N
°
10° E
20° E
°
10° E
20° E
30° E
°
30 E
100
80
60
40
20
0
−20
−40
−60
−80
−100
°
°
°
50 N
0°
10° E
20° E
30° E
0°
10° E
20° E
30° E
°
°
60 N
°
50 N
°
60 N
°
50 N
°
40 N
0°
20 E
70 N
°
40 N
°
60 N
30° E
HadCM3Q3−RCA
ECHAM5−RCA
°
50 N
10 E
°
40 N
°
°
°
°
°
50 N
40 N
40 N
70 N
60 N
°
°
0°
70 N
30 E
70 N
60 N
30° E
°
0
ECHAM5−REMO
HadCM3Q16−HadRM3Q16
°
20 E
°
°
70 N
60 N
°
60 N
30 E
°
70 N
10 E
°
40 N
0
°
100
80
60
40
20
0
−20
−40
−60
−80
−100
70 N
°
40 N
°
40 N
0
HadCM3Q0−HadRM3Q0
ECHAM5−RACMO
°
°
50 N
°
70 N
60 N
°
60 N
30 E
°
70 N
HadCM3Q0−CCLM
ECHAM5−DMI−HIRHAM
°
60 N
0
HadCM3Q3−HadRM3Q3
°
70 N
70 N
ARPEGE−DMI−HIRHAM
HadCM3Q16−RCA3
70 N
BCM−RCA
OF TEMPERATURE EXTREMES TO WARMING
%
C HAPTER 3: S ENSITIVITY
%
54
40 N
0°
10° E
20° E
30° E
Figure S2: Percentage errors defined as differences between location-scale-adjusted (HLS ) and projected 20-year return level estimates (error) for the scenario period and daily DJF minimum temperature related to the expected changes between control and scenario period (signal). The location-scaleadjusted estimates over the stippled areas are within the 95% confidence intervals of the projected return
levels.
Chapter 4
Conclusions and outlook
The aim of this thesis was to relate projected changes in the frequency and intensity of European temperature extremes to trends in the mean temperature. The objective of the first part
of the thesis was to investigate whether changes in the frequency of European warm and cold
extremes scale linearly with trends in the annual mean temperature. In the second part, presentday summer maximum and winter minimum temperature distributions were shifted and scaled
by the expected changes in the mean and variance in order to approximate the projected changes
in the intensity of warm extremes. The skill of both approaches to fill in gaps in the time series
of the frequency and intensity was evaluated. Daily data of the recent state-of-the-art regional
climate models from the European multi-model project ENSEMBLES were used in this thesis.
4.1 Trends in central moments and temperature extremes:
Are they related?
A simple shift of the full temperature distribution towards higher temperatures is consistent
with the increasing frequency of warm extremes and the decreasing frequency of cold extremes.
For long-term trends in the intensity of warm extremes, the assumption is reasonable for one
quarter to one half of Europe across all regional climate models. Further, the assumption works
better for the intensities of warm summer extremes than for warm winter extremes. Shift direction and shift velocity affect the long-term trends in the frequency and intensity of temperature
extremes. The variance of the present-day temperature distribution has an additional effect on
the trends in the frequency. The linearity in the frequency trends is determined by the shape of
the parent temperature distribution which is approximately described as normally distributed.
Therefore, given the projected linear trends in the annual mean temperature, the projected increasing frequency in warm extremes scales linearly with the annual mean temperature over
all of Europe. Due to the shape of the cold tail in the temperature distributions, this relationship is nonlinear for the decreasing frequency in cold extremes and, additionally, the trends are
smaller than for warm extremes. For temperature extremes lasting several consecutive days, the
nonlinearity is more pronounced, especially for cold extremes, and the trends are larger. For
absolute threshold-based extremes, the frequency changes are dominated by trends in seasonal
mean temperature.
55
56
C HAPTER 4: C ONCLUSIONS
AND OUTLOOK
The differences in percentiles between present-day and future seasonal temperature distributions illustrate that summer warm extremes warm faster than the summer cold extremes. In
winter, cold extremes warm faster than warm extremes. Therefore, the temperature variability
is expected to be reduced in winter and enhanced in summer (see also Schär et al., 2004; Vidale et al., 2007; Kjellström et al., 2007; Fischer and Schär, 2009, 2010; Fischer et al., 2012b;
de Vries et al., 2012). When the rate of change is different between warm and cold extremes,
the change in the temperature variability is asymmetric. Thus, central moments are not appropriate to account for asymmetric changes. The degree of asymmetry and the variability change
depend on the regional climate model, i.e. the representation of large-scale atmospheric circulation and feedback mechanism.
Significant changes in the temperature variability cause changes in the scale parameter of
the extreme value distributions and, therefore, affect trends in the intensity of warm extremes
over substantial parts of Europe. In addition, different trends in each part of the seasonal temperature distribution change the shape of the parent temperature distribution and, therefore, the
shape of the warm tails. These changes in the warm tails are significant over northern, western
and parts of central Europe.
4.2 Skill and limitations of the fill in approaches
4.2.1 Linear regression model
The linear approach, used to reproduce the frequency of temperature extremes, is mainly limited by violations in the linearity assumption. The shape of the parent temperature distribution
and nonlinear trends in the annual mean temperature affect the degree of linearity in the frequency trends. In this context, the shape and the shift direction of the temperature distribution
are the reason why this approach works reasonably well for warm extremes. The estimates
perform even better for temperature distributions with small variance. Warm extremes lasting
for several consecutive days exhibit a higher sensitivity to changes in the annual mean temperature. Therefore, the trends for these multi-day warm extremes are larger and partly nonlinear.
As a result, the estimates have large systematic errors. In contrast to this, the skill of the linear fill in approach is limited for cold extremes due to the occurrence of nonlinear trends, the
convergence to 0% and the smaller trends compared with the warm extremes. The degree of
nonlinearity depends on the representation of the cold tail by the corresponding climate model.
For multi-day cold extreme events, these issues are even more pronounced and, therefore, the
approach performs poorly. In case of absolute-threshold based warm and cold extremes, the
skill of the linearly scaled frequencies is improved when seasonal mean temperatures are used
instead of annual mean temperatures.
Because of the internal variability, the skill of the estimated frequencies depends on the
available data used to estimate the trends, i.e. the slope parameter. The trend estimates are
less robust for data from scenario periods close to the reference period, and more robust for
data from scenario periods at the end of the 21st century. The latter is especially the case for
warm extremes because of their larger trends. In addition, methods such as the least-squares
approach use all available data to estimate the trends and, therefore, lead to more robust trend
4.2 S KILL
AND LIMITATIONS OF THE FILL IN APPROACHES
57
estimates. As a consequence, the frequencies interpolated to the first half of the 21st century
perform better than those extrapolated to the second half of the 21st . However, the uncertainty
in the trend estimates due to internal variability is smaller than the uncertainty due to the different climate models.
Logistic regression models are appropriate to deal with discrete count data and to capture
nonlinear trends in the frequency. However, the dependency of these issues on the used climate
model and the location is the reason that the estimates are not improved over all of Europe.
4.2.2 Adjustment models
Intensity estimates, derived from shifted seasonal temperature distributions, are overestimated
in winter and underestimated in summer due to the projected changes in temperature variance.
For 40-75% of Europe in summer and for 25-65% in winter, the intensity estimates perform
reasonably well when the shifted temperature distributions are additionally scaled with the expected changes in the variance. For areas with asymmetric variance changes, the symmetric
scaling of shifted present-day temperature distributions leads to large biases in the intensity
estimates of warm extremes. The approach performs better when the variance is estimated
with temperature data from the upper half of the seasonal temperature distributions. With this
method of variance estimation, the shifted and scaled intensities perform reasonably well for
60-80% of Europe in summer and for 40-70% in winter. Both adjustment models are not
appropriate over areas with significant changes in the shape of the tails of the temperature distributions.
Another promising approach to capture the asymmetry would be to explore the relationships between mean and variance changes associated with the upper and lower half of the
parent temperature distribution, similar as in Yiou et al. (2009). With this approach, changes
in the intensity could be estimated using the expected change in the mean as only information.
However, even when asymmetric variance changes are considered, there are still areas with
large estimation biases caused by significant changes in the shape of the warm tails. In this
context, quantile regression models are appropriate to study how trends in different percentiles
are related to changes in the moments of the temperature distribution and how the shape of the
temperature distribution evolves in time. This is crucial to understand how much of the changes
in the shape of the tails are related to changes in the shape of the temperature distribution. At
least, the aim is to develop a statistical model which could be used to derive directly the parameters of the extreme value distributions from the estimated changes in the parameters of the
temperature distributions, similar as in the study of Nogaj et al. (2007).
For skewed temperature distributions, the sensitivity of extremes to changes in the moments
of the parent temperature distribution might also change. To investigate this issue, the approach
in Katz and Brown (1992) could be applied to skew-normal distributions which have a shape
parameter to account for non-zero skewness (Azzalini, 2005).
In this thesis, the skill and limitations of the two fill in approaches are discussed for one
emission scenario. However, more emission scenarios are necessary to test the robustness of
these methods and to evaluate the uncertainty arising from different emission scenarios. The
motivation is to develop statistical models used to fill in gaps in emission scenarios for which
58
C HAPTER 4: C ONCLUSIONS
AND OUTLOOK
no data are available. However, not only more emission scenarios are necessary but also more
ensemble members for each climate model. In the ENSEMBLES project, there is one run
available for each climate model. But more ensemble members are crucial to quantify the
uncertainties in the estimated extremes arising from natural variability and to evaluate more reliably the benefits of regional climate models to represent, among other things, the small-scale
nature of present-day extreme events.
Both fill in approaches do not work reliably for extreme precipitation events because
changes in precipitation extremes do not scale with temperature changes. A possibility would
be to use the moments of the precipitation distribution as explanatory variable, similar as for
the adjustment method. The next step would then be to investigate how changes in the tails are
related to changes in the moments.
Another possibility is to define a set of explanatory variables which affects the intensity
and frequency of precipitation extremes. This approach is motivated by the idea that extreme
events can be viewed as compound events (IPCC, 2012). This implies that the occurrence of
an extreme precipitation event depends on the occurrence of other events which could be extreme or not. To explore the potential relationships between the precipitation extremes and the
selected explanatory variables, artificial neural networks are appropriate mathematical models.
The advantage of artificial neural networks is that they are able to capture non-linear and complex relationships.
Studies such as, for instance, Boberg and Christensen (2012) have shown that there are
temperature-dependent biases over certain areas due to the insufficient spatial resolution of the
climate models. In this context, the motivation is to develop a temperature-dependent bias correction procedure for temperature extremes. After applying the fill in approaches, the correction
procedure is used to derive frequency and intensity estimates which are physically consistent.
Bibliography
Alexander, L., et al., 2006: Global observed changes in daily climate extremes of temperature
and precipitation. Journal of Geophysical Research-Atmospheres, 111 (D5), D05 109.
Alexandersson, H. and A. Moberg, 1997: Homogenization of Swedish temperature data. Part
I: Homogeneity test for linear trends. International Journal of Climatology, 17 (1), 25–34.
Austin, J. A. and S. J. Lentz, 1999: The relationship between synoptic weather systems and
meteorological forcing on the north carolina inner shelf. Journal of Geophysical ResearchOceans, 104 (C8), 18 159–18 185.
Azzalini, A., 2005: The skew-normal distribution and related multivariate families. Scandinavian Journal of Statistics, 32 (2), 159–188.
Ballester, J., F. Giorgi, and X. Rodó, 2009a: Changes in European temperature extremes can
be predicted from changes in PDF central statistics. Climatic Change, 98 (1-2), 277–284.
Ballester, J., X. Rodó, and F. Giorgi, 2009b: Future changes in Central Europe heat waves
expected to mostly follow summer mean warming. Climate Dynamics, 35 (7-8), 1191–1205.
Barriopedro, D., E. M. Fischer, J. Luterbacher, R. M. Trigo, and R. Garcı́a-Herrera, 2011: The
hot summer of 2010: redrawing the temperature record map of Europe. Science, 332 (6026),
220–224.
Basu, R., 2002: Relation between elevated ambient temperature and mortality: A review of the
epidemiologic evidence. Epidemiologic Reviews, 24 (2), 190–202.
Beniston, M., 2004: The 2003 heat wave in Europe: A shape of things to come? An analysis
based on Swiss climatological data and model simulations. Geophysical Research Letters,
31 (2), L02 202.
Beniston, M., 2005: Warm winter spells in the Swiss Alps: Strong heat waves in a cold season?
A study focusing on climate observations at the Saentis high mountain site. Geophysical
Research Letters, 32 (1), L01 812.
Beniston, M. and H. F. Diaz, 2004: The 2003 heat wave as an example of summers in a greenhouse climate? Observations and climate model simulations for Basel, Switzerland. Global
and Planetary Change, 44 (1–4), 73–81.
Beniston, M. and D. B. Stephenson, 2004: Extreme climatic events and their evolution under
changing climatic conditions. Global and Planetary Change, 44 (1-4), 1–9.
59
60
B IBLIOGRAPHY
Beniston, M., et al., 2007: Future extreme events in European climate: an exploration of regional climate model projections. Climatic Change, 81 (Suppl. 1), 71–95.
Boberg, F. and J. H. Christensen, 2012: Overestimation of Mediterranean summer temperature
projections due to model deficiencies. Nature Climate Change, 2 (6), 433–436.
Brohan, P., J. J. Kennedy, I. Harris, S. F. B. Tett, and P. D. Jones, 2006: Uncertainty estimates
in regional and global observed temperature changes: A new data set from 1850. Journal of
Geophysical Research-Atmospheres, 111 (D12), D12 106.
Brown, S. J., J. Caesar, and C. A. T. Ferro, 2008: Global changes in extreme daily temperature
since 1950. Journal of Geophysical Research-Atmospheres, 113 (D5), D05 115.
BUWAL, M., BWG, 2004: Auswirkungen des Hitzesommers 2003 auf die Gewässer. Tech.
rep., BUWAL, BWG, MeteoSchweiz, 174 S. pp. Schriftenreihe Umwelt Nr. 369.
Ceppi, P., S. C. Scherrer, A. M. Fischer, and C. Appenzeller, 2012: Revisiting Swiss temperature trends 1959-2008. International Journal of Climatology, 32 (2), 203–213.
Changnon, S. A., K. E. Kunkel, and B. C. Reinke, 1996: Impacts and Responses to the 1995
Heat Wave: A Call to Action. Bulletin of the American Meteorological Society, 77 (7), 1497–
1506.
Choulakian, V. and M. A. Stephens, 2001: Goodness-of-fit tests for the generalized pareto
distribution. Technometrics, 43 (4), 478–484.
Christensen, J. and O. Christensen, 2007: A summary of the PRUDENCE model projections
of changes in European climate by the end of this century. Climatic Change, 81 (Suppl. 1),
7–30.
Christensen, J. H., T. R. Carter, and F. Giorgi, 2002: PRUDENCE employs new methods to
assess European climate change. Eos, Transactions American Geophysical Union, 83 (13),
147.
Chuine, I., P. Yiou, N. Viovy, B. Seguin, V. Daux, and E. Le Roy Ladurie, 2004: Historical
phenology: grape ripening as a past climate indicator. Nature, 432 (7015), 289–290.
Ciais, P., et al., 2005: Europe-wide reduction in primary productivity caused by the heat and
drought in 2003. Nature, 437 (7058), 529–533.
Clark, R. T., S. J. Brown, and J. M. Murphy, 2006: Modeling Northern Hemisphere Summer Heat Extreme Changes and Their Uncertainties Using a Physics Ensemble of Climate
Sensitivity Experiments. Journal of Climate, 19 (17).
Coelho, C. A. S., C. A. T. Ferro, D. B. Stephenson, and D. J. Steinskog, 2008: Methods
for exploring spatial and temporal variability of extreme events in climate data. Journal of
Climate, 21 (10), 2072–2092.
B IBLIOGRAPHY
61
Cohen, J. L., J. C. Furtado, M. A. Barlow, V. A. Alexeev, and J. E. Cherry, 2012: Arctic warming, increasing snow cover and widespread boreal winter cooling. Environmental Research
Letters, 7 (1), 014 007.
Coles, S., 2001: An introduction to statistical modeling of extreme values. Springer, London,
208 S. pp.
Coumou, D. and S. Rahmstorf, 2012: A decade of weather extremes. Nature Climate Change,
2 (7), 491–496.
Davison, A. C. and D. V. Hinkley, 1997: Bootstrap Methods and Their Application, Vol. 42.
Cambridge University Press, 582 pp.
Davison, A. C. and R. L. Smith, 1990: Models for exceedances over high thresholds. Journal
of the Royal Statistical Society. Series B (Methodological), 52 (3), 393–442.
de Vries, H., R. J. Haarsma, and W. Hazeleger, 2012: Western European cold spells in current
and future climate. Geophysical Research Letters, 39 (4), L04 706.
Della-Marta, P. M., M. R. Haylock, J. Luterbacher, and H. Wanner, 2007: Doubled length
of western European summer heat waves since 1880. Journal of Geophysical ResearchAtmospheres, 112 (D15).
Della-Marta, P. M., M. A. Liniger, C. Appenzeller, D. N. Bresch, P. Köllner-Heck, and V. Muccione, 2010: Improved estimates of the European winter windstorm climate and the risk of
reinsurance loss using climate model data. Journal of Applied Meteorology and Climatology,
49 (10), 2092–2120.
Della-Marta, P. M. and H. Wanner, 2006: A method of homogenizing the extremes and mean
of daily temperature measurements. Journal of Climate, 19 (17), 4179–4197.
Déqué, M., et al., 2007: An intercomparison of regional climate simulations for Europe: assessing uncertainties in model projections. Climatic Change, 81 (Suppl. 1), 53–70.
D’Ippoliti, D., et al., 2010: The impact of heat waves on mortality in 9 European cities: results
from the EuroHEAT project. Environmental Health, 9 (1), 37.
Dole, R., et al., 2011: Was there a basis for anticipating the 2010 Russian heat wave? Geophysical Research Letters, 38 (6), L06 702.
Easterling, D. R. and M. F. Wehner, 2009: Is the climate warming or cooling? Geophysical
Research Letters, 36 (8), L08 706.
Ferro, C. A. T., A. Hannachi, and D. B. Stephenson, 2005: Simple nonparametric techniques
for exploring changing probability distributions of weather. Journal of Climate, 18 (21),
4344–4354.
62
B IBLIOGRAPHY
Fiala, J., L. Cernikovsky, F. de Leeuw, and P. Kurfuerst, 2003: Air pollution by ozone in
Europe in summer 2003 - Overview of exceedances of EC ozone threshold values during the
summer season april-august 2003 and comparisons with previous years. EEA Topic Report
No. 3, European Environment Agency, Copenhagen.
Fink, A. H., T. Brücher, A. Krüger, G. C. Leckebusch, J. G. Pinto, and U. Ulbrich, 2004: The
2003 European summer heatwaves and drought - synoptic diagnosis and impacts. Weather,
59 (8), 209–216.
Fischer, E. M., K. W. Oleson, and D. M. Lawrence, 2012a: Contrasting urban and rural heat
stress responses to climate change. Geophysical Research Letters, 39 (3), L03 705.
Fischer, E. M., J. Rajczak, and C. Schär, 2012b: Changes in European summer temperature
variability revisited. Submitted to Geophysical Research Letters.
Fischer, E. M. and C. Schär, 2009: Future changes in daily summer temperature variability:
driving processes and role for temperature extremes. Climate Dynamics, 33 (7-8), 917–935.
Fischer, E. M. and C. Schär, 2010: Consistent geographical patterns of changes in high-impact
European heatwaves. Nature Geoscience, 3 (6), 398–403.
Fischer, E. M., S. I. Seneviratne, P. L. Vidale, D. Lüthi, and C. Schär, 2007: Soil moisture Atmosphere interactions during the 2003 European summer heat wave. Journal of Climate,
20 (20), 5081–5099.
Fisher, R. A. and L. H. C. Tippett, 1928: Limiting forms of the frequency distribution of
the largest or smallest member of a sample. Proceedings of the Cambridge Philosophical
Society, 24, 180–190.
Folland, C. K., et al., 2001: Global temperature change and its uncertainties since 1861. Geophysical Research Letters, 28 (13), 2621–2624.
Fouillet, A., et al., 2006: Excess mortality related to the august 2003 heat wave in France.
International Archives of Occupational and Environmental Health, 80 (1), 16–24.
Fréchet, M., 1927: Sur la loi de probabilité de lécart maximum. Annales de la Société Polonaise
de Mathématique, 18, 93–116.
Frei, C., 2006: Eine Länder übergreifende Niederschlagsanalyse zum August Hochwasser
2005. Ergänzung zu Arbeitsbericht 211. MeteoSchweiz, Zürich, 10 pp.
Frei, C. and C. Schär, 2001: Detection probability of trends in rare events: Theory and application to heavy precipitation in the alpine region. Journal of Climate, 14 (7), 1568–1584.
Frich, P., L. V. Alexander, P. Della-Marta, B. Gleason, M. Haylock, A. M. G. K. Tank, and
T. Peterson, 2002: Observed coherent changes in climatic extremes during the second half
of the twentieth century. Climate Research, 19 (3), 193–212.
B IBLIOGRAPHY
63
Garcı́a-Herrera, R., J. Dı́az, R. M. Trigo, J. Luterbacher, and E. M. Fischer, 2010: A Review of
the European Summer Heat Wave of 2003. Critical Reviews in Environmental Science and
Technology, 40 (4), 267–306.
Gehrig, R., 2006: The influence of the hot and dry summer 2003 on the pollen season in
Switzerland. Aerobiologia, 22 (1), 27–34.
Gérard-Marchant, P. G. F., D. E. Stooksbury, and L. Seymour, 2008: Methods for starting the
detection of undocumented multiple changepoints. Journal of Climate, 21 (18), 4887–4899.
Gnedenko, B., 1943: The limited distribution of the maximum term of a random series. Annals
of Mathematics, 44, 423–453.
Grize, L., A. Huss, O. Thommen, C. Schindler, and C. Braun-Fahrländer, 2005: Heat wave
2003 and mortality in Switzerland. Swiss Medical Weekly, 135 (13/14), 200–205.
Gumbel, E., 1954: Statistical theory of extreme values and some practical applications: a
series of lectures. Applied Mathematics Series, U.S. Govt. Print. Office.
Hansen, J., R. Ruedy, M. Sato, and K. Lo, 2010: Global surface temperature change. Reviews
of Geophysics, 48 (4), RG4004.
Hansen, J., M. Sato, R. Ruedy, K. Lo, D. W. Lea, and M. Medina-Elizade, 2006: Global
temperature change. Proceedings of the National Academy of Sciences of the United States
of America, 103 (39), 14 288–14 293.
Haylock, M. R., G. C. Cawley, C. Harpham, R. L. Wilby, and C. M. Goodess, 2006: Downscaling heavy precipitation over the United Kingdom: a comparison of dynamical and statistical
methods and their future scenarios. International Journal of Climatology, 26 (10), 1397–
1415.
Haylock, M. R., N. Hofstra, A. M. G. K. Tank, E. J. Klok, P. D. Jones, and M. New, 2008: A
European daily high-resolution gridded data set of surface temperature and precipitation for
1950-2006. Journal of Geophysical Research-Atmospheres, 113 (D20), D20 119.
Hosking, J. R. M., 1990: L-moments: Analysis and estimation of distributions using linear
combinations of order statistics. Journal of the Royal Statistical Society. Series B (Methodological), 52 (1), 105–124.
Hulme, M., S. Dessai, I. Lorenzoni, and D. R. Nelson, 2009: Unstable climates: Exploring the
statistical and social constructions of ”normal” climate. Geoforum, 40 (2), 197–206.
Huntingford, C. and P. M. Cox, 2000: An analogue model to derive additional climate change
scenarios from existing GCM simulations. Climate Dynamics, 16 (8), 575–586.
Hurrell, J. W., Y. Kushnir, G. Ottersen, and M. Visbeck, 2003: The North Atlantic Oscillation:
Climatic significance and environmental impact, Vol. 134. AGU, Washington, DC, 279 pp.
64
B IBLIOGRAPHY
IPCC, 1995: Climate Change 1995: The Science of Climate Change. Contribution of Working
Group I to the Second Assessment Report of the Intergovernmental Panel on Climate Change.
Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, 572 pp.
IPCC, 2001: Climate Change 2001: The Scientific Basis. Contribution of Working Group I to
the Third Assessment Report of the Intergovernmental Panel on Climate Change. Cambridge
University Press, Cambridge, United Kingdom and New York, NY, USA, 881 pp.
IPCC, 2007: Climate Change 2007: The Physical Science Basis. Contribution of Working
Group I to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change.
Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, 996 pp.
IPCC, 2012: Managing the Risks of Extreme Events and Disasters to Advance Climate Change
Adaptation. Cambridge University Press, Cambridge, United Kingdom and New York, NY,
USA, 582 pp.
Jankowski, T., D. Livingstone, H. Buhrer, R. Forster, and P. Niederhauser, 2006: Consequences
of the 2003 European heat wave for lake temperature profiles, thermal stability, and hypolimnetic oxygen depletion: Implications for a warmer world. Limnology and Oceanography,
51 (2), 815–819.
Jenkinson, A. F., 1955: The frequency distribution of the annual maximum (or minimum)
values of meteorological elements. Quarterly Journal of the Royal Meteorological Society,
81 (348), 158–171.
Jones, P. D., M. New, D. E. Parker, S. Martin, and I. G. Rigor, 1999: Surface air temperature
and its changes over the past 150 years. Reviews of Geophysics, 37 (2), 173–199.
Kalkstein, L. S. and K. E. Smoyer, 1993: The impact of climate change on human health: some
international implications. Experientia, 49 (11), 969–979.
Katz, R. W., 2010: Statistics of extremes in climate change. Climatic Change, 100 (1), 71–76.
Katz, R. W. and B. G. Brown, 1992: Extreme events in a changing climate: Variability is more
important than averages. Climatic Change, 21 (3), 289–302.
Kendon, E. J., R. G. Jones, E. Kjellström, and J. M. Murphy, 2010: Using and designing
GCM-RCM ensemble regional climate projections. Journal of Climate, 23 (24), 6485–6503.
Kennett, E. and E. Buonomo, 2006: Methodologies of pattern scaling across the full range of
RT2A GCM ensemble members. UK Met Office, Hadley Centre for Climate Prediction and
Research: Exeter.
Kharin, V. V. and F. W. Zwiers, 2005: Estimating extremes in transient climate change simulations. Journal of Climate, 18 (8), 1156–1173.
Kharin, V. V., F. W. Zwiers, X. Zhang, and G. C. Hegerl, 2007: Changes in temperature and
precipitation extremes in the IPCC ensemble of global coupled model simulations. Journal
of Climate, 20 (8), 1419–1444.
B IBLIOGRAPHY
65
Kharin, V. V., F. W. Zwiers, X. Zhang, and M. Wehner, 2013: Changes in temperature and
precipitation extremes in the CMIP5 ensemble. Climatic Change, 1–13.
Kiehl, J. T. and K. E. Trenberth, 1997: Earth’s annual global mean energy budget. Bulletin of
the American Meteorological Society, 78 (2), 197–208.
Kiktev, D., D. M. H. Sexton, L. Alexander, and C. K. Folland, 2003: Comparison of modeled
and observed trends in indices of daily climate extremes. Journal of Climate, 16 (22), 3560–
3571.
Kjellström, E., L. Barring, D. Jacob, R. Jones, G. Lenderink, and C. Schär, 2007: Modelling daily temperature extremes: recent climate and future changes over Europe. Climatic
Change, 81 (Suppl. 1), 249–265.
Klein-Tank, A. M. G., 2010: High resolution observations for climate change monitoring of
extremes. Triennial Scientific Report 2007-2009, KNMI, 28–33 pp.
Klein-Tank, A. M. G. and G. P. Können, 2003: Trends in indices of daily temperature and
precipitation extremes in Europe, 1946-99. Journal of Climate, 16 (22), 3665–3680.
Klein-Tank, A. M. G., F. W. Zwiers, and X. Zhang, 2009: Guidelines on analysis of extremes
in a changing climate in support of informed decisions for adaptation. WMO, Geneva, 52
pp.
Koppe, C., S. Kovats, G. Jendritzky, and B. Menne, 2004: Heat-waves: risks and responses.
Health and Global Environmental Health Series, 2 (2).
Kovats, R. S. and L. E. Kristie, 2006: Heatwaves and public health in Europe. European Journal of Public Health, 16 (6), 592–599.
Kyselý, J., R. Beranová, J. Picek, and P. Štepánek, 2008: Simulation of summer temperature
extremes over the Czech Republic in regional climate models. Meteorologische Zeitschrift,
17 (5), 645–661.
Lawrence, D. and A. Slater, 2010: The contribution of snow condition trends to future ground
climate. Climate Dynamics, 34 (7), 969–981.
Létard, V., H. Flandre, and S. Lepeltier, 2004: France and the french face the canicule: the
lessons of a crisis: appendix to the minutes of the session of february 3, 2004. Information
Report No. 195, Sénat, 59-62 pp.
Luterbacher, J., D. Dietrich, E. Xoplaki, M. Grosjean, and H. Wanner, 2004: European seasonal
and annual temperature variability, trends, and extremes since 1500. Science, 303 (5663),
1499–1503.
Manton, M., et al., 2001: Trends in extreme daily rainfall and temperature in Southeast Asia
and the South Pacific: 1961–1998. International Journal of Climatology, 21 (3), 269–284.
66
B IBLIOGRAPHY
Maraun, D., et al., 2010: Precipitation downscaling under climate change: Recent developments to bridge the gap between dynamical models and the end user. Reviews of Geophysics,
48 (3), RG3003.
Mearns, L. O., R. W. Katz, and S. H. Schneider, 1984: Extreme high-temperature events:
Changes in their probabilities with changes in mean temperature. Journal of Climate and
Applied Meteorology, 23 (12), 1601–1613.
Meehl, G. A., J. M. Arblaster, and C. Tebaldi, 2007: Contributions of natural and anthropogenic
forcing to changes in temperature extremes over the United States. Geophysical Research
Letters, 34 (19), L19 709.
Meehl, G. A. and C. Tebaldi, 2004: More intense, more frequent, and longer lasting heat waves
in the 21st century. Science, 305 (5686), 994–997.
Menne, M. J. and C. N. Williams, 2005: Detection of undocumented changepoints using multiple test statistics and composite reference series. Journal of Climate, 18 (20), 4271–4286.
Menzel, A., 2005: A 500 year pheno-climatological view on the 2003 heatwave in Europe
assessed by grape harvest dates. Meteorologische Zeitschrift, 14 (1), 75–77.
Min, S.-K., A. Hense, H. Paeth, and W.-T. Kwon, 2004: A bayesian decision method for
climate change signal analysis. Meteorologische Zeitschrift, 13 (5), 421–436.
Min, S.-K., X. Zhang, F. W. Zwiers, and G. C. Hegerl, 2011: Human contribution to moreintense precipitation extremes. Nature, 470 (7334), 378–381.
Mitchell, T. D., 2003: Pattern scaling - An examination of the accuracy of the technique for
describing future climates. Climatic Change, 60, 217–242.
Mitchell, T. D. and M. Hulme, 1999: Predicting regional climate change: living with uncertainty. Progress in Physical Geography, 23 (1), 57–78.
Moberg, A. and P. D. Jones, 2005: Trends in indices for extremes in daily temperature and
precipitation in central and western Europe, 1901-99. International Journal of Climatology,
25 (9), 1149–1171.
Moberg, A., et al., 2006: Indices for daily temperature and precipitation extremes in Europe analyzed for the period 1901-2000. Journal of Geophysical Research-Atmospheres, 111 (D22),
D22 106.
Nikulin, G., E. Kjellström, U. Hansson, G. Strandberg, and A. Ullerstig, 2011: Evaluation
and future projections of temperature, precipitation and wind extremes over Europe in an
ensemble of regional climate simulations. Tellus A, 63 (1), 41–55.
Nogaj, M., S. Parey, and D. Dacunha-Castelle, 2007: Non-stationary extreme models and a
climatic application. Nonlinear Processes in Geophysics, 14 (3), 305–316.
B IBLIOGRAPHY
67
Pall, P., T. Aina, D. A. Stone, P. A. Stott, T. Nozawa, A. G. J. Hilberts, D. Lohmann, and M. R.
Allen, 2011: Anthropogenic greenhouse gas contribution to flood risk in England and Wales
in autumn 2000. Nature, 470 (7334), 382–385.
Pareto, V., 1897: Cours d’Economie Politique. Droz, Lausanne.
Pellegrini, E., G. Lorenzini, and C. Nali, 2007: The 2003 European Heat Wave: Which Role
for Ozone? Some Data from Tuscany, Central Italy. Water, Air, & Soil Pollution, 181 (1),
401–408.
Peterson, T. C. and M. J. Manton, 2008: Monitoring changes in climate extremes: A tale of
international collaboration. Bulletin of the American Meteorological Society, 89 (9), 1266–
1271.
Peterson, T. C., et al., 1998: Homogeneity adjustments of in situ atmospheric climate data: a
review. International Journal of Climatology, 18 (13), 1493–1517.
Pickands, J., 1975: Statistical Inference Using Extreme Order Statistics. The Annals of Statistics, 3 (1), 119–131.
Plackett, R. L., 1950: Some theorems in least squares. Biometrika, 37 (1–2), 149–157.
Rahmstorf, S. and D. Coumou, 2011: Increase of extreme events in a warming world. Proceedings of the National Academy of Sciences, 108 (44), 17 905–17 909.
Ruosteenoja, K., H. Tuomenvirta, and K. Jylha, 2007: GCM-based regional temperature and
precipitation change estimates for Europe under four SRES scenarios applying a superensemble pattern-scaling method. Climatic Change, 81 (Suppl. 1), 193–208.
Russo, S. and A. Sterl, 2011: Global changes in indices describing moderate temperature
extremes from the daily output of a climate model. Journal of Geophysical ResearchAtmospheres, 116 (D3), D03 104.
Schär, C., P. L. Vidale, D. Lüthi, C. Frei, C. Häberli, M. A. Liniger, and C. Appenzeller,
2004: The role of increasing temperature variability in European summer heatwaves. Nature,
427 (6972), 332–336.
Scherrer, S. C., C. Appenzeller, M. A. Liniger, and C. Schär, 2005: European temperature
distribution changes in observations and climate change scenarios. Geophysical Research
Letters, 32 (19), L19 705.
Schiermeier, Q., 2011: Climate and weather: Extreme measures. Nature, 477 (7363), 148–149.
Schmidli, J., C. M. Goodess, C. Frei, M. R. Haylock, Y. Hundecha, J. Ribalaygua, and
T. Schmith, 2007: Statistical and dynamical downscaling of precipitation: An evaluation
and comparison of scenarios for the European Alps. Journal of Geophysical ResearchAtmospheres, 112 (D4), D04 105.
68
B IBLIOGRAPHY
Semenza, J. C., C. H. Rubin, K. H. Falter, J. D. Selanikio, W. D. Flanders, H. L. Howe, and
J. L. Wilhelm, 1996: Heat-related deaths during the July 1995 heat wave in Chicago. The
New England Journal of Medicine, 335 (2), 84–90.
Seneviratne, S., D. Lüthi, M. Litschi, and C. Schär, 2006: Land-atmosphere coupling and
climate change in Europe. Nature, 443 (7108), 205–209.
Seneviratne, S. I., T. Corti, E. L. Davin, M. Hirschi, E. B. Jäger, I. Lehner, B. Orlowsky, and
A. J. Teuling, 2010: Investigating soil moisture-climate interactions in a changing climate:
A review. Earth-Science Reviews, 99 (3-4), 125–161.
Simolo, C., M. Brunetti, M. Maugeri, and T. Nanni, 2011: Evolution of extreme temperatures
in a warming climate. Geophysical Research Letters, 38 (16), L16 701.
Simolo, C., M. Brunetti, M. Maugeri, T. Nanni, and A. Speranza, 2010: Understanding climate change induced variations in daily temperature distributions over Italy. Geophysical
Research Letters, 115 (D22), D22 110.
Smoyer, K. E., 1998: A comparative analysis of heat waves and associated mortality in St.
Louis, Missouri–1980 and 1995. International Journal of Biometeorology, 42 (1), 44–50.
Solberg, S., et al., 2008: European surface ozone in the extreme summer 2003. Journal Geophysical Research, 113 (D7), D07 307.
Stott, P. A., 2000: External control of 20th century temperature by natural and anthropogenic
forcings. Science, 290 (5499), 2133–2137.
Stott, P. A., 2003: Attribution of regional-scale temperature changes to anthropogenic and
natural causes. Geophysical Research Letters, 30 (14), 1728.
Stott, P. A., N. P. Gillett, G. C. Hegerl, D. J. Karoly, D. A. Stone, X. Zhang, and F. Zwiers, 2010:
Detection and attribution of climate change: a regional perspective. Wiley Interdisciplinary
Reviews: Climate Change, 1 (2), 192–211.
Stott, P. A., D. A. Stone, and M. R. Allen, 2004: Human contribution to the European heatwave
of 2003. Nature, 432 (7017), 610–614.
Tan, J., Y. Zheng, G. Song, L. S. Kalkstein, A. J. Kalkstein, and X. Tang, 2007: Heat wave
impacts on mortality in Shanghai, 1998 and 2003. International Journal of Biometeorology,
51 (3), 193–200.
Tebaldi, C., K. Hayhoe, J. M. Arblaster, and G. A. Meehl, 2006: Going to the extremes: an intercomparison of model-simulated historical and future changes in extreme events. Climatic
Change, 79 (3), 185–211.
Toreti, A. and F. Desiato, 2008: Changes in temperature extremes over Italy in the last 44 years.
International Journal of Climatology, 28 (6), 733–745.
B IBLIOGRAPHY
69
Van Der Linden, P. and J. F. B. Mitchell, 2009: ENSEMBLES: Climate Change and its Impacts:
Summary of research and results from the ENSEMBLES project. Met Office Hadley Centre
FitzRoy Road Exeter EX1 3PB UK, 160, 160.
Vandentorren, S., et al., 2006: August 2003 heat wave in France: risk factors for death of
elderly people living at home. European Journal of Public Health, 16 (6), 583–591.
Vidale, P. L., D. Lüthi, R. Wegmann, and C. Schär, 2007: European summer climate variability
in a heterogeneous multi-model ensemble. Climatic Change, 81 (Suppl. 1), 209–232.
Vincent, L. A., 1998: A Technique for the identification of inhomogeneities in canadian temperature series. Journal of Climate, 11 (5), 1094–1104.
Wergen, G. and J. Krug, 2010: Record-breaking temperatures reveal a warming climate. EPL
(Europhysics Letters), 92 (3), 30 008.
Wigley, T. M. L., 1988: The effect of changing climate on the frequency of absolute extreme
events. Climate Monitor, 17, 44–55.
Wilby, R. and T. Wigley, 1997: Downscaling general circulation model output: a review of
methods and limitations. Progress in Physical Geography, 21 (4), 530–548.
WMO, 2007: The role of climatological normals in a changing climate. Tech. Rep. WCDMPNO.61,WMO-TD/1377, World Meteorological Organziation (WMO).
Xoplaki, E., 2005: European spring and autumn temperature variability and change of extremes
over the last half millennium. Geophysical Research Letters, 32 (15), L15 713.
Yiou, P., D. Dacunha-Castelle, S. Parey, and T. T. Huong Hoang, 2009: Statistical representation of temperature mean and variability in Europe. Geophysical Research Letters, 36 (4),
L04 710.
Zhang, X., L. Alexander, G. C. Hegerl, P. Jones, A. K. Tank, T. C. Peterson, B. Trewin, and
F. W. Zwiers, 2011: Indices for monitoring changes in extremes based on daily temperature
and precipitation data. Wiley Interdisciplinary Reviews: Climate Change, 2 (6), 851–870.
Zhang, X., G. Hegerl, F. W. Zwiers, and J. Kenyon, 2005: Avoiding inhomogeneity in
percentile-based indices of temperature extremes. Journal of Climate, 18 (11), 1641–1651.
Zorita, E., T. F. Stocker, and H. von Storch, 2008: How unusual is the recent series of warm
years? Geophysical Research Letters, 35 (24), L24 706.
Acknowledgments
I would like to express my sincere gratitude to my supervisor Prof. Dr. Reto Knutti for his
patient guidance, the constructive criticism, his readiness to help at any time and his motivating
encouragement. I am very grateful to make my first experiences as a scientist in his group.
I would like to offer my special thanks to Dr. Erich Fischer for his support, constructive criticism, proof-reading, numerous hints and the very interesting and intellectually stimulating
discussions.
I would like to thank all the members of the climate physics group for the helpfulness and the
good time at the IAC.
I would also like to thank Dr. Mark Liniger and Prof. Dr. Christoph Schär for their feedbacks
and hints in the two PhD interviews and to be a part of my PhD Defense.
A special thank goes to my office mate Nathalie Schaller for the good time in the last years and
her readiness to help.
I would like to sincerely thank my parents and brothers for their encouragement and willingness
to help wherever requested.
My biggest and sincere thanks go to my wife Jenny and my son Noah. They enrich my life in
every possible way.
71