Behavioral Ecology doi:10.1093/beheco/arl041 Advance Access publication 31 August 2006 The blue tit’s song is an inconsistent signal of male condition Timothy H. Parker,a,b Iain R. Barr,c,d and Simon C. Griffitha,e Edward Grey Institute of Field Ornithology, Department of Zoology, University of Oxford, South Parks Road, Oxford OX1 3PS, UK, bDivision of Biology, Ackert Hall, Kansas State University, Manhattan, KS 66506, USA, cDepartment of Animal and Plant Sciences, University of Sheffield, Sheffield S10 2TN, UK, dSchool of Biological Sciences, University of East Anglia, Norwich NR4 7TJ, UK, and eSchool of Biological, Earth, and Environmental Sciences, University of New South Wales, Sydney, New South Wales 2052, Australia a Sexually selected traits are often hypothesized to signal male condition or quality, though empirical evidence is mixed, and a number of alternative models of sexual selection do not require condition dependence. We examined the relationship between various measures of condition and dawn songs in male blue tits (Cyanistes caeruleus). We detected 6 largely independent measures of variation (i.e., variables) in these songs. None of these variables were related to blue tits’ ultraviolet–blue plumage, a demonstrated sexual signal, thus failing to support the redundant signal hypothesis. We found some evidence that the song variables we measured signaled male quality. There were correlations between body size and certain song traits, though neither male age nor male recapture in the subsequent breeding season (apparent local survival) predicted any song variation. We combined our results with published effect sizes comparing blue tit song with male quality variables using meta-analysis and found that a few song measures are correlates of male quality, though as in our field data, neither male age nor survival appeared related to song. Our relatively large sample sizes (.60), combined with our meta-analytical integration of 89 effect sizes, make the results regarding the signaling value of our measured components of blue tit song robust. These results demonstrate that 1) only certain aspects of signal variation may be condition dependent and 2) even when components of a sexual signal appear correlated with condition in some studies, these signal components may be unrelated or inconsistently related to a variety of condition indices. Key words: condition dependence, dawn song, meta-analysis, Parus caeruleus, survival, ultraviolet. [Behav Ecol 17:1029–1040 (2006)] umerous studies have demonstrated that variation in the expression of sexual signals amongst individuals within a population can be related to differences in the condition of those individuals (Andersson 1994; Ligon 1999). Condition-dependent ornamental traits convey information about male quality (Andersson 1986), and thus might facilitate female choice of high quality mates (Griffith and Pryke 2006), or male choices concerning competitive interactions (e.g., Parker and Ligon 2002). However, not all hypotheses regarding the evolution and maintenance of sexual signals predict strong condition dependence of ornamental traits. For instance, some sexual signals may induce a response in a receiver because of a sensory bias on the part of the receiver (Ryan 1998). Sexual signals may convey information about heritable fitness entirely due to sexual attractiveness, and sometimes males with reduced condition or survival may be most attractive (Kokko et al. 2002). Other sexual signals, especially those with multimodal expression, may help receivers identify individual signalers (Dale 2000). Alternatively, certain signal measurements may not vary with male condition or quality but the context in which the signal is used may (Mennill et al. 2002). Empirical generalizations about whether sexual signals are typically expressed with a high level of condition dependence remain difficult. In many species, individuals display multiple sexual signals (e.g., Ligon et al. 1998; Pryke et al. 2001), and a number of N Address correspondence to T.H. Parker, who is now at the Department of Biology, Whitman College, 345 Boyer Ave, Walla Walla, WA 99362, USA. E-mail: [email protected]. Received 1 February 2006; revised 8 May 2006; accepted 3 August 2006. The Author 2006. Published by Oxford University Press on behalf of the International Society for Behavioral Ecology. All rights reserved. For permissions, please e-mail: [email protected] hypotheses have been proposed to explain this phenomenon. The multiple message hypothesis proposes that different signals each convey different information about the signaler, although it could be that different signals convey redundant information instead (Møller and Pomiankowski 1993). It may also be that if one signal is condition dependent, selection will not favor condition dependence in other signals (Iwasa and Pomiankowski 1994) or multiple signals might evolve through a process of sensory drive, where new, more detectable signals spread rapidly because females notice them (Schluter and Price 1993). Although many aspects of bird song have been extensively studied, the possibility that it can be a condition-dependent signal has only recently begun to receive extensive scrutiny, for instance with experimental manipulation (e.g., Spencer et al. 2003). Song is a sexual signal in birds, typically produced by males in the context of both territory defense and mate attraction (Catchpole and Slater 1995). At least 3 different mechanisms for condition dependence of bird song have been investigated. A component of the avian brain important to song learning, the high vocal center, has been shown to be sensitive to manipulations of condition in captive studies (Buchanan et al. 2004). This translates to an influence of condition, especially during brain development, on song learning (Nowicki et al. 1998), and both correlative and manipulative studies indicate that song complexity (e.g., repertoire size, the number of distinct song types an individual learns and sings) is condition dependent in some species (Nowicki et al. 2000; Spencer et al. 2003, 2004, but see Forstmeier and Leisler 2004). It has long been supposed that song production might be energetically demanding and that variables such as rate, length, or consistency of song production might therefore show condition dependence. There is support for this idea, Behavioral Ecology 1030 but evidence varies among studies and species. For instance, evidence from the field suggests that fat reserves may be needed for long duration song output (e.g., Thomas 2002), but respirometry chamber results are mixed, with some suggesting singing is metabolically demanding (Eberhardt 1994; but see Gaunt et al. 1996) and others suggesting it is not (e.g., Ward et al. 2004). A third possibility is that constraints on song production are enforced by social interactions such that poor condition males of low status produce a substandard signal because of the risk of punishment from higher status males. For instance, dominant and subordinate individuals might alter aspects of their song production, such as the timing of the song phrase, in relation to other males singing in the vicinity (e.g., Mennill et al. 2002). The blue tit (Cyanistes caeruleus) has multiple sexual ornaments. Particularly striking are the ultraviolet (UV)–blue plumage patches. The UV–blue crown plumage is sexually dichromatic (Andersson et al. 1998) and has been found to be related to both overwinter survival (Griffith et al. 2003) and the sex ratio of offspring (Sheldon et al. 1999). One measurement of this plumage trait, UV chroma (the proportion of the reflectance curve in the UV wavelengths), has consistently appeared important in sexual signaling (Andersson et al. 1998; Sheldon et al. 1999; Griffith et al. 2003). It has also been suggested that another plumage patch, the yellow breast, may be a signal of parenting ability (Senar et al. 2002). In addition to these ornamental colors, the male blue tit sings a moderately complex song (Bijnens and Dhondt 1984) that has been the focus of several studies (Bijnens 1988; Kempenaers et al. 1997; Doutrelant et al. 2000; Poesel et al. 2001; Foerster et al. 2002; Dreiss et al. 2006). Much of this work has focused on recordings made during the dawn chorus, the only extended period of uninterrupted singing in this species. As blue tits only participate in the dawn chorus after pairing and territory formation, dawn singing is thought to serve functions such as mate guarding (Mace 1989; Kempenaers et al. 1997), attraction of extrapair partners (Mace 1989; Kempenaers et al. 1997), or territory defense (Slagsvold et al. 1994). Components of dawn song may signal male quality or condition, but significant relationships between song production and variables such as body size, hormone level, and survival have not been replicated among studies (Appendix 1). For instance, tarsus length showed significantly positive relationships with song variables in 2 studies, but in one study, the song variable was the number of song types sung in a single chorus (Doutrelant et al. 2000), and in another it was performance time, the proportion of the song’s duration that a male was actually vocalizing (Poesel et al. 2001). In a third study, performance time was significantly related to male mass and testosterone level, but a relationship with tarsus length was not reported (Foerster et al. 2002). Such results suggest a role for song in sexual selection, but the lack of consistency between studies is a concern, and clearly further work is necessary before strong, general conclusions can be drawn. When attempting to draw general conclusions about a study system, meta-analytical synthesis of published data can be fruitful (e.g., Sheldon and West 2004; Parker et al. 2005) by combining published results into single statistical tests of hypotheses (Rosenthal 1984; Gurevitch and Hedges 1999). A meta-analytical approach is well suited to cases such as blue tit song condition dependence, where results differ among studies. This approach will allow identification of particular song variables that tend to be most strongly related to condition or vice versa. Our objectives with this study of blue tits were to 1) define the major forms of variation (i.e., variables) in dawn song production and thus determine the extent to which different aspects of song were redundant signals, 2) test for redundancy between a well established plumage color signal and dawn song variables, 3) assess whether various potential condition or quality indices were predictors of dawn song variables, and 4) decide, based on a combination of published data and our own field data, whether current evidence supports the hypothesis that blue tit song production signals condition and, if so, determine which components of song appear involved. METHODS Study site We studied blue tit song in 3 years (2002–2004) in Wytham Wood, a 380 ha woodland in Oxfordshire, UK (120#W, 5147#N). Between 330 and 450 blue tit pairs bred in artificial nest boxes in each breeding season. Song recording During a female blue tit’s fertile period (from several days before egg laying begins until the penultimate day of the laying period [Mace 1989]), each day before dawn her mate produces one bout of song typically lasting 15–60 min. This is the most predictable singing bout produced by male blue tits, and it is usually also their longest period of singing uninterrupted by other activities. Because of the link between this male signal and the female’s fertile period and because earlier research suggested that dawn song may be sexually selected in blue tits (Doutrelant et al. 2000; e.g., Kempenaers et al. 1997; Poesel et al. 2001), we chose to focus on this aspect of male song. As with some other studies of blue tit song (e.g., Doutrelant et al. 2000), birds on our study site were not individually color ringed. Therefore, we carefully observed behavior of individual birds in the weeks leading up to recording to determine territory boundaries associated with particular nest boxes. This involved confirming that males spent at least 5 min unchallenged in the vicinity of the nest box (e.g., Doutrelant et al. 2000) and following males to map locations of boundary conflicts and thus delineate territories. A male’s dawn song was recorded from his first vocalizations of the day until the termination of his dawn song bout. The end of the dawn song is generally unambiguous and occurs when the pair copulates. After this, the song rate drops dramatically, and the male can be observed in other activities such as foraging. If we did not observe copulation, then the termination was defined as .5 min of either 1) no singing or 2) a switch to other activities, typically foraging, with long (.30 s) pauses between singing. Not all dawn recording efforts were successful. In some cases, we could not confirm unambiguous association with a nest box during or after the dawn chorus and such recordings were not used in our analyses. Because we were conservative in applying these standards, we are confident that few, if any, recordings were attributed to an incorrect male. Although females are fertile before egg laying begins, we limited our assessment of songs to those recorded after the start of laying to increase standardization of the conditions the pairs were experiencing at the time of data gathering. Several males were recorded in more than one year. When comparing song variables with color and condition variables, we used only the first recording made for a given male to avoid pseudoreplication. After excluding from our data set the duplicate recordings, those that were recorded before or after the laying period, recordings from males we never subsequently captured and measured (see below), and recordings that could not be confidently attributed to particular males, we were left with 63 usable recordings over the 3 years. Parker et al. • Blue tit song Approximately half the songs were recorded using Marantz PMD680 digital recorders and Sennheiser short shotgun microphones (ME66). The other songs were recorded using a Sennheiser long gun microphone (MKH816T) and a Sony Pro-2 professional Walkman. Song processing Each recording was processed by one of 2 observers (IRB or THP) using programs Raven 1.0–1.2 (Cornell Lab of Ornithology; THP) or Avisoft SASLab 3.2 (Avisoft Bioacoustics, Berlin; IRB). Before beginning this process, we developed a standard library of note and strophe (group of notes sung together in a predictable pattern, typically lasting 0.5–2.0 s, Figure 1) types to facilitate repeatable classification of all songs. Broad frequency, harmonic scold-type vocalizations (types B1-B7 in Bijnens and Dhondt 1984) constituted varying portions of dawn song. When these notes were produced (with or without high frequency, pure tone introductory notes), they were excluded from our analyses. These calls are often much more variable in length and form than other dawn song strophe types (IR Barr and TH Parker, personal observation), are produced year round and at any time of day in response to threats (Bijnens and Dhondt 1984), and in blue tits and related species, the rate of scold note production is a function of the type and proximity of a threat (Bijnens and Dhondt 1984; Templeton et al. 2005). Thus, we decided not to consider these notes as part of the blue tit sexual signal. For each strophe in a male’s entire dawn song, we classified its type, measured its length, and measured the length between adjacent strophes (Figure 1). When summing number of strophe types produced, we did not include types produced only once during the song. In the rare case when a strophe type was sung just once, we could not be confident that it was ever sung again and was actually a part of the individual’s repertoire rather than an accidental combination of notes produced, for instance, by being interrupted or switching strophe types in mid-strophe. Because a male’s dawn song typically ceases when his mate emerges from the nest box, the duration of his song is largely dependent on his mate’s decision rather than his own. Therefore, we did not use duration of the song as a variable in the analyses, and where relevant, all song variables were converted to rates. The first song rate variable, number of strophes per minute, may be likely to reflect condition if singing is metabolically demanding. The next 3 rate variables were designed to capture information about song complexity. For the number of strophe types per minute and the number of switches between strophe types per minute, the total number of strophe types or the total number of times the bird switched between strophe types for the entire dawn song was divided by the total number of minutes in the dawn song. For the Figure 1 Sound spectrogram of an example of blue tit song. Depicted are 3 strophes, 2 strophe types (A and B), and one switch between strophe types. 1031 third song rate variable designed to address song complexity, number of strophe types per strophe, the number of different strophe types produced over the entire dawn song was divided by the total number of strophes sung during the entire dawn song. One measure of song complexity we did not consider was absolute repertoire size. This is because we recorded each male on only one morning and so had insufficient data to determine asymptotic repertoire size. However, the song complexity variables described above provide information regarding the repertoire per unit time or song and thus may be influenced by the same factors hypothesizes to influence overall repertoire size. We also calculated mean strophe length and mean pause length. Both of these might be influenced by metabolic demands. For mean pause length, we excluded the pauses between strophe types, as these pauses were often longer than pauses between strophes of the same type. We calculated performance time, another variable possibly influenced by metabolic demands, as the length of a strophe divided by the sum of the length of the strophe and the length of the preceding pause. These calculations excluded the initial strophe in a series of given strophe type because, as stated above, pause length between strophe types was often longer than pause lengths within strophes. We used the mean performance time value in subsequent analyses. We calculated the coefficient of variation (CV) for strophe length and pause length because we hypothesized that consistency of strophe length or timing among strophes might reflect aspects of condition such as current metabolic fatigue or a legacy of developmental stress. These were calculated separately for all strophe types, though strophe types sung fewer than 10 times (and their associated pauses) were excluded. The strophe and pause CVs for each strophe type were then averaged within a song to produce the average CV of strophe length and the average CV of pause length. It has been shown that while blue tits are singing a given strophe type, the pause length between individual strophes tends to increase between successive strophes, possibly because of fatigue (Poesel and Kempenaers 2000). Because strophe length does not tend to change, the increase in pause length leads to a decrease in performance time (Poesel et al. 2001). This is termed drift. Although it occurs within a series of strophes of the same type (strophe series), it is reset after each change in strophe type (Poesel and Kempenaers 2000). We estimated the strength of drift for individual birds as the slope from a general linear model (PROC GLM, SAS 8.2) in which performance time was the dependent variable, time was the predictor, and strophe series was a covariate. By controlling for strophe series, a separate intercept was allowed for each run of strophes of a given type, thus accounting for the tendency for drift not to carry over when song type is switched. The slope was negative for nearly all birds, demonstrating that drift was close to ubiquitous. Some authors have classified songs into those showing drift and those not showing drift based on a significance test for negative slope (Poesel and Kempenaers 2000); however, we considered drift to be a continuous variable. First, unlike a previously reported pattern (Poesel et al. 2001), we observed a normal rather than a bimodal distribution of slope. Second, sample size differed dramatically based on the length of the song in our data set, so a significantly negative slope would be more likely from longer songs if the strength of the relationship between time and performance time was held constant. Other field methods At each box containing a blue tit nest, we noted first egg date, clutch size, and hatching date. We attempted to capture each Behavioral Ecology 1032 adult between day 6 and day 14 of the chick-feeding period. Any unringed adults were given a uniquely numbered BTO ring at this point. For each captured adult, we quantified the color of the crown and primary coverts. Spectral reflectance was measured using a USB2000 spectrometer (Ocean Optics, Dunedin, FL) with illumination from a xenon light source (Ocean Optics PX-2). A sheath was fixed to the fiber end to standardize measuring distance (7 mm) and exclude ambient light. The fiber-optic reflectance probe was held (not pressed) perpendicular to the plumage, and 3–5 scans were taken from the center of each patch, removing the probe between each. The reflectance was measured relative to a WS-2 white standard scanned prior to each individual. We chose to limit our index of color to the one measurement that has consistently appeared important to sexual signaling in this species, UV chroma (R320-400/R320-700). This measurement addresses the specific importance of UV (Andersson and Amundsen 1997; Andersson et al. 1998; Sheldon et al. 1999). Its relevance in signaling is indicated by previous observations of its sexual dimorphism, correlation with mate choice, prediction of offspring sex ratio, and prediction of over-winter survival (Andersson et al. 1998; Sheldon et al. 1999; Griffith et al. 2003). We also measured 5 potential condition indices. For each male, we measured tarsus and wing length (millimeters) and mass (grams) and determined age (second year vs. after second year) based on wing covert molt. We did not combine the 3 body size measurements prior to analyses because they were not strongly correlated with each other (r ¼ 0.17–0.33) and because we wished to compare our results with those from other studies that have considered these body size measurements separately. Our fifth measure of male quality or condition was recapture in the subsequent year. Because most individuals bred in nest boxes, all boxes were monitored, and a major effort was made to capture the breeding male at every nest box; most individuals that survived to breed on the site in a subsequent year were detected, and thus, the variable ‘‘recapture’’ can be considered a good approximation of local survival. None of these variables alone are ideal condition or quality indices, but they have been used in this context before (e.g., Bijnens 1988; Doutrelant et al. 2000; Nowicki et al. 2000). Age often predicts expression of condition-dependent signals, presumably because yearling males are in poorer condition or of lower dominance rank on average (e.g., Greene et al. 2000; Parker et al. 2003; Griffith and Pryke 2006). Survival has been shown to be influenced by aspects of condition (e.g., Gosler 1996; e.g., Lambrechts and Dhondt 1986). Tarsus length has been used as an index of condition experienced during development (Nowicki et al. 2000) and mass as an index of current condition (but see, Gosler and Harper 2000). In blue tits, older males have longer wings than yearling males (TH Parker, IR Barr, and SC Griffith, unpublished data) and so wing length may be a condition indicator as well. Strong correlations between one of these condition indices and song production would be consistent with a role of song in condition signaling. Data analyses All our continuously distributed variables that were not normally distributed were transformed to approximate normality. Recapture and age were classified as binomial variables. Birds were either recaptured or not in the next breeding season and were either second year or after second year adults. We next identified axes of independent variation among our original 10 song variables. The first step was to generate a correlation matrix to determine which, if any, song variables were strongly correlated with each other. We then included each group of correlated song variables in its own principal component analysis (PCA) to generate a single principal com- ponent that would capture the majority of the variation in this group of variables. In all such cases, we substituted this principal component for the group of correlated variables in subsequent analyses. Before conducting further data analyses, we identified potential covariates that we hypothesized might influence song production or condition variable values. Any of our variables of interest could have varied by year or geographically among the 9 subareas of the study site. Song variables could have been influenced by the timing of the recording, and so we considered recording date and both the number of days after clutch initiation and the number of days before egg laying ended that the bird was recorded. We considered an effect of low temperature on the dawn of recording because a cold night might force birds to forage at the expense of singing. We also considered the possibility of an effect of identity of the scorer of the song recording (IRB vs. THP). For the morphological measurements, we considered a role of age (wing length and mass), identity of the measurer, and date of measurement (mass). We then determined whether our song variables were providing the same information as the best-studied signal in this species, UV–blue plumage color. We compared each of our song variables with the UV chroma of the wing coverts and cap using general linear models, which allow inclusion of relevant covariates. We determined which hypothesized condition correlates, if any, predicted our song production variables by conducting general linear model analyses in a 2-step process. First, for each song variable, we analyzed a global model containing all 5 condition variables as predictors. If any condition variables had at least a marginally significant (P , 0.1) utility in predicting the song variable in question, a second analysis excluding all nonsignificant condition variables was conducted for that song variable. We obtained effect sizes and parameter estimates for the nonsignificant predictor variables from the global model and for the significant predictor variables from the reduced model. Appropriate covariates were included in each analysis, as discussed below in Results. Published data We located each published study comparing blue tit song with potential indices of individual quality or condition (Appendix 1). We then extracted all statistical information from each comparison of a song variable with a condition variable. Our intention was to compare results based on sign and magnitude of correlation coefficients. If no correlation coefficient was presented for a given relationship, we estimated one based on statistics available in the published paper with the statistical calculator option in the meta-analysis program MetaWin 2.1 (Rosenberg et al. 1997). In the one paper (Dreiss et al. 2006) where most data reporting was insufficient for inclusion in meta-analysis, we obtained more detailed information from the authors. We located 5 studies reporting 42 relationships between blue tit song variables and condition-related variables in sufficient detail for meta-analysis. We received unpublished information for an additional 11 relationships from one of these studies (Appendix 1). We plotted all these effect sizes (correlation coefficients, y axis) describing relationships between song variables and aspects of condition in blue tits against the sample size for the respective studies (x axis). This should produce a funnel-shaped plot with a large vertical spread of points to the left, where sample sizes are small and sampling variance is thus large, and a narrower distribution of points to the right, where sample sizes are larger and sampling variance is thus reduced (Palmer 1999). The points should converge Parker et al. • Blue tit song 1033 on the true effect size(s) as the sample size increases on the right side of the plot. If, when plotted, the published effect sizes do not follow a funnel-shaped distribution, publication bias may be to blame (Palmer 1999). Typical publication bias is against negative or nonsignificant results, especially in studies with small sample sizes. If the expected effect size is positive, then negative and modestly positive effects are likely to be nonsignificant when samples are small, and these results may often go unpublished (Rosenthal 1984). This can lead to a linear, rather than a funnel-shaped, distribution, with published effects from small-sample studies mostly high and positive, but becoming lower with increasing sample sizes (Palmer 1999). To supplement our visual inspection of the funnel plot, we conducted a regression analysis to test for the negative relationship between sample size and effect size predicted in the case of publication bias. Before making this comparison, we converted all correlation coefficients (r) to Fisher’s z-transformation fz ¼ 1=2ln½ð11r Þ=ð1 r Þg and log-transformed sample sizes. We included all published effects in this analysis, as well as the 11 effects for which we obtained necessary details from the authors. Because not all the details of these data were published, this is not strictly a test for publication bias, but rather a test for bias in the data we obtained from published and unpublished sources. We did not include our field data in the test for bias in previously available data. Finally, we conducted meta-analyses to determine whether certain song variables appeared more likely to be related to condition or certain condition variables seemed more likely to be related to song. We combined our field results with those obtained from other published and nonpublished sources. For these analyses, we used mixed models and bootstrapgenerated significance tests (based on 9999 iterations) in program MetaWin version 2.1. Mixed models in meta-analysis account for expected heterogeneity in true effect sizes among song and condition variables and bootstrapping accounts for violations of distributional assumptions (Rosenberg et al. 2000). These analyses were based on Fisher’s z-transformation of correlation coefficients (r) and variance inversely proportional to the sample size (vz ¼ 1/n3, where n ¼ sample size for the given study [Rosenberg et al. 2000]). With program MetaWin, it is only possible to model the effect of one class of variables at a time. Thus, we first tested for differences among effect sizes (z) related to song variable, and then we tested for difference among effect sizes related to male quality or condition variables. For each song and condition variable, we asked whether the bootstrap-generated 95% confidence interval (CI) included zero. RESULTS Identifying independent variation in songs Of the 10 song variables we compared with each other using a correlation matrix (PROC PRINCOMP, SAS 8.2), 4 were not strongly correlated with any others (all jr j , 0.42) and the remaining 6 fell into 2 groups with high intragroup correlation (each variable with at least one jr j . 0.65, Table 1). Thus, a minimum of 6 largely independent types of variation (i.e., 6 variables) were present in these blue tit songs. The 4 variables not strongly correlated with any others, mean strophe length, drift, CV of strophe length, and CV of pause length, were considered separately in further song analyses. The variation in the 2 groups of correlated variables was summarized in 2 PCAs (PROC PRINCOMP, SAS 8.2). In the first PCA, we included the variables strophes per minute, performance time, and mean pause length. Principal component 1 (PC1) explained 88% of the variation in these 3 variables and had strong loading from all 3 variables (negative for pause length), so we used PC1 in further analyses, and we termed this variable ‘‘singing intensity.’’ In the second PCA, we included the number of strophe types per minute, the number of strophe types per strophe, and the number of strophe type switches per minute. PC1 from this analysis explained 74% of the variation in these 3 variables, and all 3 variables had strong loadings. Thus, we used this new variable, which we termed ‘‘strophe turnover,’’ in all further analyses. We were thus left with 6 song variables for further analyses. Identifying covariates of song and condition variables We identified covariates of song and condition variables and included them in further analyses of these traits. Strophe length had no covariates, but drift was related to subarea of the study site; singing intensity was a function of the number of days after clutch initiation that the recording was made; and strophe turnover, CV strophe length, and CV pause length were all related to the scorer of the song recording (IRB vs. THP). The effect of identity of the song scorer may have resulted from the nonrandom geographic and temporal distributions of the sets of songs analyzed by the 2 scorers. We examined the scorer effect for 38 recordings made in one subarea of the study site (28 scored by IRB, 10 scored by THP), and the scorer effect disappeared for strophe turnover and CV pause length. We retained scorer as a covariate in these cases with the understanding that factors correlated with scorer identity were likely influencing the patterns we observed. Among male quality or condition variables, age had no covariates Table 1 Correlation coefficients describing relationships between pairs of blue tit song variables Strophe length Pause length 0.32 Performance time 0.20 Strophes per minute 0.42 CV Strophe length 0.04 CV Pause length 0.15 Drift 0.09 Strophe types per minute 0.08 Strophe types per strophe 0.29 Switches per minute 0.07 Pause length Performance time Strophes per minute CV Strophe length CV Pause length Drift 0.83 0.93 0.34 0.05 0.01 0.69 0.41 0.04 0.11 0.25 0.04 0.00 0.19 0.14 0.07 0.00 0.02 0.16 0.15 0.30 0.28 0.56 0.27 0.46 0.23 0.46 0.32 0.30 0.01 0.18 0.29 0.25 0.34 Strophe types Strophe types per minute per strophe 0.79 0.65 0.36 The purpose of this table is to describe the magnitude and direction of relationships among song variables in our recordings to determine how variables might be grouped into composite variables, not to test for statistical significance. Behavioral Ecology 1034 Table 2 Correlation coefficients describing the proportion of variation in blue tit color measurements described by blue tit song variables Strophe Singing length intensity Drift Strophe CV strophe CV pause turnover length length good predictors of the 2 color variables in the 61 individuals for whom we had sufficient color data (Table 2). No relationships were significant even before correction for multiple comparisons, and all correlations (jr j) were less than 0.24. Condition indices as predictors of song output Head chroma 0.13 Wing chroma 0.12 0.21 0.04 0.03 0.12 0.00 0.16 0.23 0.09 0.11 0.19 N ¼ 61 for all comparisons. but wing length was a function of age, measurer, and subarea of the study site; tarsus length was related to measurer and year; and year also influenced both mass and recapture. Assessing redundancy of song and color variables According to our general linear model analyses (PROC GLM, SAS 8.2), the 6 largely independent song variables were not We found evidence that some measured aspects of condition predicted song production in blue tits. Using general linear models with normal error (PROC MIXED, SAS 8.2), we identified one or more body size traits that predicted variation in 4 of 6 song variables (jr j , 0.41, Table 3). Birds with longer tarsi tended to sing longer strophes (Table 3). Tarsus length also explained the distribution of CV of strophe length, but counter to prediction, males with longer tarsi sang strophes that were less consistent in length within a strophe type (Table 3). One of the 2 strongest pattern was for heavier males to have a higher singing intensity (a greater proportion of time singing) (Table 3), and this was the only song–condition relationship to remain significant (P , 0.05) if we conducted a conservative Bonferroni adjustment for 30 comparisons (6 song variables 3 5 condition variables). The relationship Table 3 The strength of condition variables as predictors of song variables in male blue tits Song (dependent) variable Condition (independent) variable n F df P Slope Standard error ra Strophe length Mass Tarsus Wing length Age Recapture Mass Tarsus Wing length Age Recapture Mass Tarsus Wing length Age Recapture Mass Tarsus Wing length Age Recapture Mass Tarsus Wing length Age Recapture Mass Tarsus Wing length Age Recapture 61 63 61 61 61 61 61 61 61 61 61 61 61 61 61 61 61 61 61 61 61 63 61 61 61 61 63 61 61 61 0.34 6.27 0.86 1.23 1.39 10.96 1.69 0.14 1.83 1.33 9.72 6.60 1.22 0.78 0.03 1.82 0.88 0.97 0.01 0.01 0.03 5.30 0.56 0.53 1.39 0.20 1.94 0.16 0.00 2.11 1,52 1,58 1,52 1,52 1,52 1,56 1,51 1,51 1,51 1,51 1,52 1,52 1,49 1,49 1,49 1,51 1,51 1,51 1,51 1,51 1,51 1,57 1,51 1,51 1,51 1,51 1,57 1,51 1,51 1,51 0.56 0.02 0.35 0.27 0.24 ,0.01b 0.20 0.71 0.18 0.25 ,0.01c 0.01 0.27 0.38 0.86 0.18 0.35 0.33 0.92 0.93 0.87 0.02 0.46 0.47 0.24 0.66 0.17 0.69 0.99 0.15 0.009 0.032 0.005 0.016 0.018 1.27 0.56 0.06 0.54 0.51 0.0002 0.0001 0.00002 0.00004 0.00000 0.56 0.39 0.16 0.04 0.03 0.02 0.25 0.03 0.08 0.15 0.006 0.016 0.002 0.0002 0.021 0.015 0.013 0.006 0.014 0.015 0.38 0.43 1.67 0.40 0.44 0.00005 0.00005 0.00002 0.00005 0.00005 0.42 0.41 0.17 0.38 0.42 0.13 0.11 0.05 0.11 0.13 0.014 0.012 0.006 0.0132 0.014 0.08 0.31 0.13 0.15 0.16 0.40 0.18 0.05 0.19 0.16 0.40 0.34 0.16 0.13 0.02 0.19 0.13 0.14 0.01 0.01 0.02 0.29 0.10 0.10 0.16 0.06 0.18 0.06 0.00 0.20 Singing intensity Drift Strophe turnover CV strophe CV pause a b c Covariates are described in Results but not shown in this table. Reported statistics were derived from the global model (all condition variables and covariates for a given song variable) except in cases where one or more condition variables were minimally effective predictors (P , 0.1) of a given song variable, in which case statistics for those predictors were derived from the reduced model including only the effective condition predictors and relevant covariates. This explains variation in degrees of freedom among condition variables for a given song variable model. Any model containing the variable mass had an n ¼ 61 rather than 63 because we lacked mass data for 2 males. Correlation coefficient (r) equivalents were derived from degrees of freedom and the F - values using the statistical calculator in program MetaWin. These correlation coefficients were the standard effect sizes required for inclusion of these data in subsequent meta-analyses. In this table and in our meta-analyses, positive correlation coefficients indicate higher or more consistent (lower CV of strophe or pause length and less drift) song production from individuals in better condition (see Appendix 1). Exact P value (0.0016) remains significant (experiment-wide alpha ¼ 0.05) after correcting for multiple (30) comparisons. Exact P value (0.003) does not remain significant (experiment-wide alpha ¼ 0.05) after correcting for multiple (30) comparisons. Parker et al. • Blue tit song 1035 between body size and drift was inconsistent, with a strong trend for heavier males to show more drift, but a somewhat weaker pattern of males with longer tarsi showing less drift. Neither age nor recapture significantly predicted any song variables (Table 3). These results are consistent with certain song variables signaling at least some aspects of condition in blue tits. However, these results must be interpreted with caution because the evidence for publication bias suggests that our rather low meta-analysis– generated effect size estimates may be biased upward. Summarizing published relationships between song and condition DISCUSSION Published correlations between song variables and male condition variables ranged from r ¼ 0.29 to 0.86 (Appendix 1). Both a visual assessment of the funnel plot (see especially open and filled circles, Figure 2) and a quantitative assessment of the same data indicate that there may have been bias against publishing of negative or nonsignificant relationships between condition variables and song production in blue tits (F1,51 ¼ 11.8, P ¼ 0.001, r ¼ 0.43, slope ¼ 0.49 6 0.14 standard error [SE]). Studies with small sample sizes tended to report large positive effect sizes, and reported effect sizes declined with increasing sample size. Several of the particularly strong effects at small sample sizes describe relationships between testosterone level and song variables. Because effect sizes may tend to be larger for studies of physiological effects (Møller and Jennions 2002), we conducted our test for publication bias a second time, excluding the testosterone effects. We still found a significant negative slope with this reduced data set (F1,48 ¼ 4.0, P ¼ 0.050, r ¼ 0.28, slope ¼ 0.31 6 0.15 SE), again supporting the hypothesis of publication bias, though the effect seemed less marked. When we combined our results with previously published results using meta-analysis, we found evidence that certain song traits more consistently correlated with condition than others (Table 4). Singing intensity and strophe length both had significantly positive effect sizes. Within-strophe drift, a variable assessed by only one study, also showed a significant positive effect size. A similar pattern was observed when the testosterone studies were excluded, but the strophe length effect was no longer significant (Table 4). Only one possible condition or quality indicator, testosterone level, had a significantly positive relationship with song variables, although the 95% CI for mass was only marginally nonsignificant (Table 4). Figure 2 Funnel plot depicting the relationship between sample size and effect size (correlation coefficient, r) for published tests of the relationship between aspects of blue tit song and measures of male condition or quality (filled circles). Open circles designate the 3 relationships between testosterone level and song production. Filled squares represent the 11 published effects for which additional details were needed and obtained from the authors. Open triangles depict our field data published herein, and open squares represent our unpublished data from a condition manipulation with a small sample size (n ¼ 15) described in the appendix. There are 5 major results of this study. We show that 1) a minimum of 6 largely independent variables exist in dawn blue tit song, 2) none of these variables correlated strongly with variation in an established measurement of color signal expression, 3) of male quality or condition indicators, aspects of body size, but not age or a survival index, predicted variation in certain song measurements, especially singing intensity, 4) a bias toward publication of strong positive results may have inflated the perceived importance of blue tit song as a condition-dependent signal, and 5) a subset of song variables, particularly strophe length and singing intensity, may be most likely to signal condition in this species. Our large sample sizes, combined with our thorough meta-analysis, make our conclusions particularly robust. In blue tits, the dawn song appears to be a multicomponent signal, and we identified 6 largely independent forms of variation. There are any number of song variables that we did not quantify, such as amplitude (Forstmeier et al. 2002), withinstrophe drift (Bijnens 1988), and the ratio of trill rate to frequency breadth (Podos et al. 2004), and with a complex signal such as song, the number of variables examined is limited only by the researcher’s time and creativity. Thus, 6 independent song variables is a minimum estimate. However, just because one can measure a song component does not mean that it serves as a signal. Without demonstrated signal utility through study of conspecific responses to different song components, it will remain unknown what aspects of blue tit song serve as signals and therefore how many meaningful forms of variation exist. We demonstrated that our measured song variables are not strongly correlated with UV chroma of cap and wing plumage in blue tits, a well established sexual signal. This is consistent with other findings (Dreiss et al. 2006), and thus, we can unambiguously reject the redundant signal hypothesis as an explanation of the evolution of the song and color signals we measured in this system (Møller and Pomiankowski 1993). To the extent that these different measures are signaling information about individual males, they are signaling different information. Other alternative hypotheses explaining the presence of multiple ornaments (e.g., Møller and Pomiankowski 1993; Schluter and Price 1993; Iwasa and Pomiankowski 1994) cannot be rejected or supported. We found significant relationships between song variables and 2 of the 5 condition variables. These 2 condition variables were mass and tarsus length, and at least one of them was a significant predictor (before correction for multiple comparisons) of each of the following: strophe length, drift, singing intensity, and CV strophe. Although the relationship between strophe length and tarsus length in our field data was not robust to correction for multiple comparisons, when we combined our strophe length–male quality relationship effects with the 17 published relationships involving strophe length, we found a significant, albeit weaker, positive effect. Evidence for condition dependence of drift was mixed. Drift was positively related to one of our body size measurements but negatively related to another. Further, the effect of drift was very small and not significant in meta-analysis. Our results suggested the strongest role for the composite variable singing intensity in predicting male quality: heavier males tended to spend a larger proportion of their time singing, had shorter pauses, and sang more strophes per minute, and Behavioral Ecology 1036 Table 4 Strength of the relationships between blue tit song variables and various hypothesized condition indices derived from meta-analyses combining our findings with those from other studies a b c d e Variable N z 95% CI Condition Wing length Tarsus length Mass Age Survivala Mass condition index Condition manipulationb Testosterone level 10 21 20 10 10 5 10 3 0.032 0.045 0.098 0.049 0.046 0.164 0.087 0.893c 0.041 0.053 0.004 0.075 0.071 0.060 0.240 0.448 to to to to to to to to 0.096 0.157 0.200 0.182 0.183 0.417 0.134 1.293 Song variables (including testosterone data) CV pause 6 CV strophe 6 Drift 9 Within-strophe drift 5 d 16 Singing intensity Repertoire size 8 Strophe length 23 e 16 Strophe turnover 0.082 0.083 0.027 0.125c 0.182c 0.062 0.100c 0.019 0.210 0.205 0.165 0.055 0.059 0.124 0.003 0.042 to to to to to to to to 0.016 0.047 0.170 0.206 0.320 0.323 0.199 0.111 Song variables (excluding testosterone data) CV pause 6 CV strophe 6 Drift 9 Within-strophe drift 5 d 15 Singing intensity Repertoire size 8 Strophe length 22 15 Strophe turnovere 0.082 0.083 0.028 0.125c 0.167c 0.060 0.093 0.015 0.213 0.206 0.167 0.055 0.046 0.118 0.002 0.050 to to to to to to to to 0.013 0.044 0.172 0.206 0.297 0.323 0.194 0.103 Number of available estimates (N), meta-analysis derived average Z (positive values indicate a positive relationship between song production or song consistency and condition), and associated bootstrap generated 95% CIs. Survival as indicated by recapture of marked birds in subsequent years. Manipulation of chick or parent condition through brood size manipulation. Indicates an effect size estimate significantly (P , 0.05) different from zero. Includes variables ‘‘performance time’’ and ‘‘song rate’’ from published studies. Includes variables ‘‘versatility,’’ ‘‘strophe diversity,’’ and ‘‘strophe bout length’’ from published studies (see Appendix 1). this result was highly significant. As with strophe length, metaanalysis results indicate that the proportion of time a male blue tit spends singing provides a signal of some aspect of his quality. Although the presence of publication bias means we must interpret the meta-analysis results with caution, it appears that both strophe length and singing intensity may be condition dependent. These 2 song variables could be linked to body size in multiple ways. Although it is possible that larger males have more energy reserves to devote to song production (Thomas 2002), another possibility is that song production is socially mediated (Mennill et al. 2002) and that only larger males can afford to risk aggression from other males (Parker and Ligon 2002) by singing longer or more frequent strophes. Several other lines of evidence indicate that the components of blue tit song that we measured may not be general indicators of male condition. Although correlated with some body size traits, our measurements of blue tit song appeared completely unrelated to age or the probability of being recaptured after the subsequent winter. Further, in meta-analysis, neither of these condition variables was related to song. Many condition-dependent male sexual signals are age dependent (e.g., Greene et al. 2000; Parker et al. 2003; Griffith and Pryke 2006), presumably because older males have higher dominance rank or are more efficient at performing the tasks that determine the environmental effects on their signal expression. Thus, either age is unrelated to male condition in blue tits or, more likely, the song variables we examined are not particularly sensitive to male condition or the components of male condition that relate to age. In tits, it is well demonstrated that aspects of male quality and condition, including male dominance, influence overwinter survival (e.g., Gosler 1996; e.g., Lambrechts and Dhondt 1986), and so presumably our negative results with regard to recapture in the subsequent year are not due to a lack of correlation between male survival and condition but rather to song traits lacking dependence on the aspects of condition influencing survival. Our effect sizes are within the wide range observed in other bird species (0.6 , r , 0.6) where measures of condition similar to ours have been compared with aspects of song production (e.g., Lampe and Espmark 1994; Galeotti et al. 1997; Otter et al. 1997; Balsby 2000; Rinden et al. 2000; Gil et al. 2001; Forstmeier et al. 2006; Kipper et al. 2006). As with the blue tit data, results vary among and within studies. Certain combinations of condition and song variables appear important in some studies, and other combinations appear important in other studies. One fairly consistent pattern is that of an age effect on song complexity variables (Lampe and Espmark 1994; Balsby 2000; Rinden et al. 2000; Gil et al. 2001; Forstmeier et al. Parker et al. • Blue tit song 1037 2006), something we failed to detect in blue tits. Relationships between other condition and song variables are more varied, and a formal meta-analysis will be required to determine how the condition variables we studied tend to be related to song production across bird species. Our meta-analysis demonstrates the importance of formal synthesis of published results rather than casual review of notable published relationships. Without meta-analysis of the blue tit song literature, any general conclusion we might have drawn would have been dubious because of the heterogeneity in results among studies. Meta-analysis does not eliminate uncertainty concerning interpretation of published results, but it provides a formal framework for assessing these results and thus minimizes biased interpretation. It also allows for identification of potential biases in the published literature and as such promotes caution in interpretation when such bias is identified. Because sexual signal content may vary among populations of a species (Badyaev et al. 2001; Forstmeier and Leisler 2004), an appropriate interpretation of our metaanalysis results is that they represent the distribution of, and average patterns for, song–condition relationships in this species. Thus, regardless of whether the distribution of data in our meta-analyses represents geographically and temporally divergent patterns or simply sampling error, we can still conclude that many aspects of blue tit song are either inconsistent or weak predictor of male condition or both. Even the most consistent blue tit song signal of condition, singing intensity, has a weaker meta-analysis–generated effect size than that detected in our field data and may not be condition dependent in all situations (e.g., Dreiss et al. 2006). Although we found some evidence of blue tit song signal utility, we cannot eliminate the possibility that unmeasured components of blue tit song production are better signals of quality. For instance, although we quantified aspects of song complexity, we did not measure absolute repertoire size, a song variable found to be important in some case (Nowicki et al. 2000; Spencer et al. 2003, 2004, but see Forstmeier and Leisler 2004). We also did not consider the social context of song production. As with previous research on signal content in blue tit song, our song measures did account for neither the proximity of conspecifics such as competing males nor the ongoing vocalizations of other chorusing males. Male songbirds can adjust their singing behavior in response to that of their neighbors (Catchpole and Slater 1995), and patterns of counter singing have been shown in a closely related species to be an important source of information for females assessing male quality (Mennill et al. 2002). Future work with blue tits will need to address these possibilities if the signal content of male song is to be understood. It could also be that we measured appropriate song variables but did not measure the best component of male quality to compare with these variables. One such unmeasured variable is male dominance rank, which can relate to song production and have meaningful fitness implications (Lambrechts and Dhondt 1986; Mennill et al. 2002). In any study with negative results, it always remains possible that important variable(s) went unmeasured. Despite some unmeasured variables, we are still in a position to draw certain robust general conclusions. Many of the song variables we studied have previously been hypothesized to signal male condition (e.g., Eberhardt 1994; Nowicki et al. 1998; Poesel and Kempenaers 2000; Foerster et al. 2002), but for most of these song components, we found at best mixed evidence that they consistently signal male quality in blue tits. Thus, researchers need to continue to test hypotheses concerning condition dependence on a case by case basis and should remain hesitant to assume condition dependence of sexual signals without strong empirical support. We are not yet in position to identify the evolutionary mechanism maintaining variation in blue tit song. However, one aspect of variation, the production of different song types, may be maintained by the necessity of neighbor recognition as seen in many other species (Catchpole and Slater 1995) and more generally for other types of multimodal signals (Dale 2000). Maintenance of variation in other traits is less clear. Although a lack of correlation between signal expression and condition is likely in some models of signal evolution that do not rely on a fitness payoff to female choice, such as sensory bias (Ryan 1998), theory predicts this relationship can be absent even in cases where genetic benefits to mate choice are important, for instance, if males trade-off condition against sexual attractiveness (Kokko et al. 2002). Our results lend support to the hypothesis that certain aspects of sexual traits signal condition but also lead us to reject the simple scenario of a sexual signal providing consistent information about male condition. Given the common perception of the importance of this sort of condition signaling, this is an important conclusion. APPENDIX 1 Comparisons of blue tit song variables to potential condition indices in other studies Study Bijnens Bijnens Bijnens Bijnens Bijnens Bijnens Bijnens Bijnens Bijnens Bijnens Bijnens Bijnens Bijnens Bijnens Bijnens Bijnens Bijnens Site (1988) (1988) (1988) (1988) (1988) (1988) (1988) (1988) (1988) (1988) (1988) (1988) (1988) (1988) (1988) (1988) (1988) Peerdsbos, Peerdsbos, Peerdsbos, Peerdsbos, Peerdsbos, Peerdsbos, Peerdsbos, Peerdsbos, Peerdsbos, Peerdsbos, Peerdsbos, Peerdsbos, Peerdsbos, Peerdsbos, Peerdsbos, Peerdsbos, Peerdsbos, Belgium Belgium Belgium Belgium Belgium Belgium Belgium Belgium Belgium Belgium Belgium Belgium Belgium Belgium Belgium Belgium Belgium Song variable Condition index Sample size Repertoire sizea Repertoire size Repertoire size Repertoire size Repertoire size Strophe length Strophe length Strophe length Strophe length Strophe length Strophe length Strophe length Strophe length Strophe length Strophe length Within-strophe driftb Within-strophe drift Age Survival Tarsus length Mass Wing length Age Survival Tarsus length Mass Wing length Age Survival Tarsus length Mass Wing length Age Survival 42 42 35 34 34 28 28 18 19 19 28 28 23 23 23 29 29 Reported 2-tailed P value Other reported statistics r 0.06 0.2 .0.05 .0.05 .0.05 0.02 0.02 .0.05 .0.05 .0.05 0.03 0.04 .0.05 .0.05 .0.05 .0.1 .0.1 U ¼ 285 U ¼ 166 r ¼ 0.11 r ¼ 0.15 r ¼ 0.05 U ¼ 44 U ¼ 46 r ¼ 0.21 r ¼ 0.15 r ¼ 0.04 U ¼ 50.5 U ¼ 53.5 r ¼ 0.22 r ¼ 0.14 r ¼ 0.02 U ¼ 67 U ¼ 107 0.29 0.20 0.11 0.15 0.05 0.44 0.44 0.21 0.15 0.04 0.41 0.39 0.22 0.14 0.02 0.02 0.06 Behavioral Ecology 1038 Appendix1, Continued Reported 2-tailed P value Other reported statistics r r ¼ 0.23 r ¼ 0.18 r ¼ 0.19 F1,10 ¼ 16.49 F1,18 ¼ 4.06 F1,12 ¼ 1.38 t ¼ 2.08 t ¼ 0.74 U ¼ 39.5 r ¼ 0.48 r ¼ 0.34 r ¼ 0.04 r ¼ 0.39 r ¼ 0.23 r ¼ 0.02 r ¼ 0.05 r ¼ 0.19 r ¼ 0.30 r ¼ 0.86 r ¼ 0.51 r ¼ 0.44 r ¼ 0.51 r ¼ 0.62 r ¼ 0.42 R2 ¼ 0.09 r ¼ 0.102 r ¼ 0.08 R2 ¼ 0.41 r ¼ 0.039 r ¼ 0.067 R2 ¼ 0.01 r ¼ 0.058 r ¼ 0.02 R2 ¼ 0.04 r ¼ 0.058 r ¼ 0.19 t ¼ 0.81 0.23 0.18 0.19 0.79 0.43 0.32 0.09 0.06 0.11 0.48 0.34 0.04 0.39 0.23 0.02 0.05 0.19 0.30 0.86 0.51 0.44 0.51 0.62 0.42 0.30 0.10 0.08 0.64 0.04 0.07 0.10 0.06 0.02 0.20 0.06 0.19 0.20 Study Site Song variable Condition index Sample size Bijnens (1988) Bijnens (1988) Bijnens (1988) Doutrelant et al. (2000) Doutrelant et al. (2000) Doutrelant et al. (2000) Poesel et al. (2001) Poesel et al. (2001) Poesel et al. (2001) Poesel et al. (2001) Poesel et al. (2001) Poesel et al. (2001) Poesel et al. (2001) Poesel et al. (2001) Poesel et al. (2001) Poesel et al. (2001) Poesel et al. (2001) Poesel et al. (2001) Foerster et al. (2002) Foerster et al. (2002) Foerster et al. (2002) Foerster et al. (2002) Foerster et al. (2002) Foerster et al. (2002) Dreiss et al. (2006)g Dreiss et al. (2006)g Dreiss et al. (2006)g Dreiss et al. (2006) Dreiss et al. (2006)g Dreiss et al. (2006)g Dreiss et al. (2006)g Dreiss et al. (2006)g Dreiss et al. (2006)g Dreiss et al. (2006)g Dreiss et al. (2006)g Dreiss et al. (2006)g TH Parker et al. (unpublished data)k TH Parker et al. (unpublished data)k TH Parker et al. (unpublished data)k TH Parker et al. (unpublished data)k TH Parker et al. (unpublished data)k TH Parker et al. (unpublished data)k Peerdsbos, Belgium Peerdsbos, Belgium Peerdsbos, Belgium Pirio, Corsica Muro, Corsica Rouviere, France Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Kolbeterberg, Austria Aube, France Aube, France Aube, France Aube, France Aube, France Aube, France Aube, France Aube, France Aube, France Aube, France Aube, France Aube, France Oxford, UK Within-strophe drift Within-strophe drift Within-strophe drift Repertoire sizec Repertoire size Repertoire size Drift Drift Drift Singing intensitye Singing intensity Singing intensity Strophe length Strophe length Strophe length Strophe turnoverf Strophe turnover Strophe turnover Singing intensitye Singing intensity Singing intensity Singing intensity Strophe length Strophe turnoverf Singing intensityh Singing intensityh Singing intensityh Strophe turnoveri Strophe turnoveri Strophe turnoveri Strophe turnoverj Strophe turnoverj Strophe turnoverj Strophe length Strophe length Strophe length Strophe length Tarsus length Mass Wing length Tarsus length Tarsus length Tarsus length Tarsus length Mass Mass condition indexd Tarsus length Mass Mass condition index Tarsus length Mass Mass condition index Tarsus length Mass Mass condition index Testosterone level Mass Mass Mass condition index Testosterone level Testosterone level Rearing manipulation Tarsus length Mass Rearing manipulation Tarsus length Mass Rearing manipulation Tarsus length Mass Rearing manipulation Tarsus length Mass Effort manipulationl 19 20 20 12 20 14 20 20 20 20 20 20 20 20 20 20 20 20 7 28 25 28 7 5 14 95 93 14 91 89 14 91 89 14 95 93 15 .0.05 .0.05 .0.05 0.002 0.059 0.26 0.69 0.79 0.63 0.03 0.17 0.89 0.09 0.35 0.94 0.83 0.46 0.22 0.03 0.007 0.032 0.007 .0.1 .0.4 0.28 0.32 0.45 0.024 0.71 0.53 0.81 0.59 0.85 0.51 0.58 0.07 0.43 Oxford, UK Drift Effort manipulationl 15 0.91 t ¼ 0.12 l 0.03 Oxford, UK Singing intensity Effort manipulation 15 0.61 t ¼ 0.53 0.10 Oxford, UK Strophe turnover Effort manipulationl 15 0.99 t ¼ 0.00 0.00 l Oxford, UK CV strophe length Effort manipulation 15 0.38 t ¼ 0.92 0.23 Oxford, UK CV pause length Effort manipulationl 5 0.09 t ¼ 1.86 0.44 If no correlation coefficient (r) was presented, we derived one using the statistical calculator function in MetaWin (2.1). Correlation coefficients reported here are positive if consistent with prediction of higher and more consistent song production (larger repertoire size, longer strophes, greater proportion of time spent singing, less drift, greater turnover) from individuals in better condition (larger, older, more likely to survive, higher testosterone). a In this study, the asymptotic number of strophe types sung by an individual based on multiple recordings. b The progressive lengthening of pauses between trill notes within a strophe. c In this study, the number of strophe types sung in a single dawn song. d Mass controlled for tarsus length. e In this study, reported as ‘‘performance time,’’ strophe length divided by the sum of strophe length and pause length. f In this study, reported as ‘‘versatility,’’ the product of number of strophe types sung and the number of switches between strophe types. g Details sufficient for inclusion in meta-analysis obtained from authors. h Represented by the measurement ‘‘song rate,’’ the number of strophes per minute. i Represented by the measurement ‘‘strophe bout length,’’ the average duration between strophe type switches. j Represented by the measurement ‘‘strophe diversity,’’ the number of strophe types divided by the number of strophe bouts. k TH Parker, IR Barr, and SC Griffith. Details of this experiment and these results available from the corresponding author upon request. l Song output of adult males that had reared either enlarged or reduced (by 30%) broods in the previous breeding season. Parker et al. • Blue tit song N. Aspey, J. Higham, K. Jones, J. McMahon, H. Tillin, and R. Whitfield helped record dawn songs and L. Rowe and B. Sheldon contributed to the joint data set of Wytham blue tit reproduction and survival. Oxford University provided access to the study site. A. Dreiss kindly furnished additional details of published results for inclusion in our metaanalysis. Comments from B. Sheldon, W. Forstmeier, and 2 anonymous referees improved the manuscript. T.H.P. was hosted by B. Sheldon at the Edward Grey Institute of Field Ornithology and was supported by a US National Science Foundation International Research Fellowship (INT-0202704), I.R.B. was supported by a Natural Environment Research Council studentship, and S.C.G. by a Natural Environment Research Council Fellowship (NER/I/S/1999/00138). Temperature data were provided by the British Atmospheric Data Centre. REFERENCES Andersson M. 1986. Evolution of condition-dependent sex ornaments and mating preferences: sexual selection based on viability differences. Evolution 40:804–16. Andersson M. 1994. Sexual selection. New Jersey: Princeton University Press. Andersson S, Amundsen T. 1997. Ultraviolet colour vision and ornamentation in bluethroats. Proc R Soc Lond B Biol Sci 264:1587–91. Andersson S, Ornborg J, Andersson M. 1998. Ultraviolet sexual dimorphism and assortative mating in blue tits. Proc R Soc Lond B Biol Sci 265:445–50. Badyaev AV, Hill GE, Dunn PO, Glen JC. 2001. Plumage color as a composite trait: developmental and functional integration of sexual ornamentation. Am Nat 158:221–35. Balsby TJS. 2000. Song activity and variability in relation to male quality and female choice in Whitethroats Sylvia communis. J Avian Biol 31:56–62. Bijnens L. 1988. Blue tit Parus caeruleus song in relation to survival, reproduction and biometry. Bird Study 35:61–7. Bijnens L, Dhondt AA. 1984. Vocalizations in a Belgian blue tit, Parus c. caeruleus, population. Le Gerfaut 74:243–69. Buchanan KL, Leitner S, Spencer KA, Goldsmith AR, Catchpole CK. 2004. Developmental stress selectively affects the song control nucleus HVC in the zebra finch. Proc R Soc Lond B Biol Sci 271:2381–6. Catchpole CK, Slater PJB. 1995. Bird song: biological themes and variations. Cambridge, UK: Cambridge University Press. Dale J. 2000. Ornamental plumage does not signal male quality in red-billed queleas. Proc R Soc Lond B Biol Sci 267:2143–9. Doutrelant C, Blondel J, Perret P, Lambrechts MM. 2000. Blue tit song repertoire size, male quality and interspecific competition. J Avian Biol 31:360–6. Dreiss A, Richard M, Moyen F, White J, Møller AP, Danchin E. 2006. Sex ratio and male sexual characters in a population of blue tits, Parus caeruleus. Behav Ecol 17:13–9. Eberhardt LS. 1994. Oxygen-consumption during singing by male Carolina wrens (Thryothorus ludovicianus). Auk 111:124–30. Foerster K, Poesel A, Kunc H, Kempenaers B. 2002. The natural plasma testosterone profile of male blue tits during the breeding season and its relation to song output. J Avian Biol 33:269–75. Forstmeier W, Hasselquist D, Bensch S, Leisler B. 2006. Does song reflect age and viability? A comparison between two populations of the great reed warbler Acrocephalus arundinaceus. Behav Ecol Sociobiol 59:634–43. Forstmeier W, Kempenaers B, Meyer A, Leisler B. 2002. A novel song parameter correlates with extra-pair paternity and reflects male longevity. Proc R Soc Lond B Biol Sci 269:1479–85. Forstmeier W, Leisler B. 2004. Repertoire size, sexual selection, and offspring viability in the great reed warbler: changing patterns in space and time. Behav Ecol 15:555–63. Galeotti P, Saino N, Sacchi R, Moller AP. 1997. Song correlates with social context, testosterone and body condition in male barn swallows. Anim Behav 53:687–700. Gaunt AS, Bucher TL, Gaunt SLL, Baptista LF. 1996. Is singing costly? Auk 113:718–21. Gil D, Cobb JLS, Slater PJB. 2001. Song characteristics are age dependent in the willow warbler, Phylloscopus trochilus. Anim Behav 62:689–94. Gosler AG. 1996. Environmental and social determinants of winter fat storage in the great tit Parus major. J Anim Ecol 65:1–17. 1039 Gosler AG, Harper D. 2000. Assessing the heritability of body condition in birds: a challenge exemplified by the great tit Parus major L. (Aves). Biol J Linn Soc 71:103–17. Greene E, Lyon BE, Muehter VR, Ratcliffe L, Oliver SJ, Boag PT. 2000. Disruptive sexual selection for plumage coloration in a passerine bird. Nature 407:1000–3. Griffith SC, Ornborg J, Russell AF, Andersson S, Sheldon BC. 2003. Correlations between ultraviolet coloration, overwinter survival and offspring sex ratio in the blue tit. J Evol Biol 16:1045–54. Griffith SC, Pryke SR. 2006. Benefits to female birds of assessing color displays. In: Hill GE, McGraw KJ, editors. Bird coloration, volume 2: function and evolution. Cambridge, MA: Harvard University Press. p 233–79. Gurevitch J, Hedges LV. 1999. Statistical issues in ecological metaanalyses. Ecology 80:1142–49. Iwasa Y, Pomiankowski A. 1994. The evolution of mate preferences for multiple sexual ornaments. Evolution 48:853–67. Kempenaers B, Verheyen GR, Dhondt AA. 1997. Extrapair paternity in the blue tit (Parus caeruleus): female choice, male characteristics, and offspring quality. Behav Ecol 8:481–92. Kipper S, Mundry R, Sommer C, Hultsch H, Todt D. 2006. Song repertoire size is correlated with body measures and arrival date in common nightingales, Luscinia megarhynchos. Anim Behav 71:211–7. Kokko H, Brooks R, McNamara JM, Houston AI. 2002. The sexual selection continuum. Proc R Soc Lond Biol Sci Ser B 269:1331–40. Lambrechts M, Dhondt AA. 1986. Male quality, reproduction, and survival in the great tit (Parus major). Behav Ecol Sociobiol 19:57–63. Lampe HM, Espmark YO. 1994. Song structure reflects male quality in pied flycatchers, Ficedula hypoleuca. Anim Behav 47:869–76. Ligon JD. 1999. The evolution of avian breeding systems. Oxford: Oxford University Press. Ligon JD, Kimball R, Merola-Zwartjes M. 1998. Mate choice by female red junglefowl: the issues of multiple ornaments and fluctuating asymmetry. Anim Behav 55:41–50. Mace R. 1989. The dawn chorus in the great tit Parus major is directly related to female fertility. Nature 330:745–6. Mennill DJ, Ratcliffe LM, Boag PT. 2002. Female eavesdropping on male song contests in songbirds. Science 296:873. Møller AP, Jennions MD. 2002. How much variance can be explained by ecologists and evolutionary biologists? Oecologia 132:492–500. Møller AP, Pomiankowski A. 1993. Why have birds got multiple sexual ornaments. Behav Ecol Sociobiol 32:167–76. Nowicki S, Hasselquist D, Bensch S, Peters S. 2000. Nestling growth and song repertoire size in great reed warblers: evidence for song learning as an indicator mechanism in mate choice. Proc R Soc Lond B Biol Sci 267:2419–24. Nowicki S, Peters S, Podos J. 1998. Song learning, early nutrition and sexual selection in songbirds. Am Zool 38:179–90. Otter K, Chruszcz B, Ratcliffe L. 1997. Honest advertisement and song output during the dawn chorus of black-capped chickadees. Behav Ecol 8:167–73. Palmer AR. 1999. Detecting publication bias in meta-analyses: a case study of fluctuating asymmetry and sexual selection. Am Nat 154:220–33. Parker TH, Ligon JD. 2002. Dominant male red junglefowl test the dominance status of other males. Behav Ecol Sociobiol 53:20–4. Parker TH, Stansberry BM, Becker CD, Gipson PS. 2003. Do melaninor carotenoid-pigmented plumage ornaments signal condition and predict pairing success in the Kentucky warbler? Condor 105: 663–71. Parker TH, Stansberry BM, Becker CD, Gipson PS. 2005. Edge and area effects on the occurrence of migrant forest songbirds. Conserv Biol 19:1157–67. Podos J, Huber SK, Taft B. 2004. Bird song: the interface of evolution and mechanism. Annu Rev Ecol Evol Syst 35:55–87. Poesel A, Foerster K, Kempenaers B. 2001. The dawn song of the blue tit Parus caeruleus and its role in sexual selection. Ethology 107:521–31. Poesel A, Kempenaers B. 2000. When a bird is tired from singing: a study of drift during the dawn chorus. Etologı́a 8:1–7. Pryke SR, Andersson S, Lawes MJ. 2001. Sexual selection of multiple handicaps in the red-collared widowbird: female choice of tail length but not carotenoid display. Evolution 55:1452–63. Rinden H, Lampe HM, Slagsvold T, Espmark YO. 2000. Song quality does not indicate male parental abilities in the pied flycatcher Ficedula hypoleuca. Behaviour 137:809–23. 1040 Rosenberg KV, Adams DC, Gurevitch J. 2000. MetaWin 2.0 user’s manual. Sunderland, MA: Sinauer Associates. Rosenberg MS, Adams DC, Gurevitch J. 1997. MetaWin 2.1 ed. Sunderland, MA: Sinauer Associates. Rosenthal R. 1984. Meta-analytic procedures for social research. Beverly Hills, CA: Sage. Ryan MJ. 1998. Sexual selection, receiver biases, and the evolution of sex differences. Science 281:1999–2003. Schluter D, Price T. 1993. Honesty, perception and population divergence in sexually selected traits. Proc R Soc Lond B Biol Sci 253:117–22. Senar JC, Figuerola J, Pascual J. 2002. Brighter yellow blue tits make better parents. Proc R Soc Lond B Biol Sci 269:257–61. Sheldon BC, Andersson S, Griffith SC, Ornborg J, Sendecka J. 1999. Ultraviolet colour variation influences blue tit sex ratios. Nature 402:874–7. Sheldon BC, West SA. 2004. Maternal dominance, maternal condition, and offspring sex ratio in ungulate mammals. Am Nat 163:40–54. Behavioral Ecology Slagsvold T, Dale S, Saetre GP. 1994. Dawn singing in the great tit (Parus major): mate attraction, mate guarding, or territorial defense. Behaviour 131:115–38. Spencer KA, Buchanan KL, Goldsmith AR, Catchpole CK. 2003. Song as an honest signal of developmental stress in the zebra finch (Taeniopygia guttata). Horm Behav 44:132–9. Spencer KA, Buchanan KL, Goldsmith AR, Catchpole CK. 2004. Developmental stress, social rank and song complexity in the European starling (Stumus vulgaris). Proc R Soc Lond B Biol Sci 271:S121–3. Templeton CN, Greene E, Davis K. 2005. Allometry of alarm calls: black-capped chickadees encode information about predator size. Science 308:1934–7. Thomas RJ. 2002. The costs of singing in nightingales. Anim Behav 63:959–66. Ward S, Lampe HM, Slater PJB. 2004. Singing is not energetically demanding for pied flycatchers, Ficedula hypoleuca. Behav Ecol 15:477–84.
© Copyright 2026 Paperzz