1 Major Histocompatibility Complex (MHC) Class II sequence polymorphism in 2 long-finned pilot whale (Globicephala melas) from the North Atlantic. 3 SÍLVIA S. MONTEIRO 1,2*, JOSÉ V. VINGADA 2,3, ALFREDO LÓPEZ 4,5, GRAHAM J. 4 5 PIERCE 4,6, MARISA FERREIRA 1,2, ANDREW BROWNLOW 7, BJARNI 6 MIKKELSEN 8, MISTY NIEMEYER 9, ROBERT J. DEAVILLE 10, CATARINA EIRA 7 2,4 , STUART PIERTNEY 11. 8 9 1 Departamento de Biologia & CBMA, Universidade de Minho, Campus de Gualtar, 10 Braga, Portugal 11 2 12 Biologia, Campus de Gualtar, Braga, Portugal 13 3 14 Braga, Portugal. 15 4 16 de Santiago,Aveiro, Portugal Sociedade Portuguesa de Vida Selvagem, Universidade de Minho, Departamento de Departamento de Biologia & CESAM, Universidade de Minho, Campus de Gualtar, Departamento de Biologia & CESAM, Universidade de Aveiro, Campus Universitário 17 5 18 6 19 7 20 8 21 9 22 World headquarters, Yarmouth Port, MA, USA 23 10 24 of Zoology, Zoological Society of London, London 25 11 Coordinadora para o Estudio dos Mamíferos Mariños, Gondomar, Pontevedra,Spain Oceanlab, University of Aberdeen, Newburgh, Aberdeenshire, UK Wildlife Unit, SAC Veterinary Science Division, Inverness, UK Museum of Natural History, Tórshavn, Faroe Islands International Fund for Animal Welfare, Marine Mammal Rescue & Research Program, UK Cetacean Strandings Investigation Programme, The Wellcome Building, Institute School of Biological Sciences (Zoology), University of Aberdeen, Aberdeen, UK 26 27 * 28 Universidade de Aveiro, Campus Universitário de Santiago, 3810-193, Aveiro, 29 Portugal. Tel: +351 964192993 30 E-mail: [email protected] 31 Running head title: MHC polymorphism in pilot whales from North Atlantic Corresponding author: Silvia S. Monteiro. Departamento de Biologia & CESAM, 32 33 34 35 36 1 37 38 Abstract 39 Determining how intra-specific genetic diversity is apportioned among natural 40 populations is essential for detecting local adaptation and identifying populations with 41 inherently low levels of extant diversity which may become a conservation concern. 42 Sequence polymorphism at two adaptive loci (MHC DRA and DQB) was investigated 43 in long-finned pilot whales (Globicephala melas) from four regions in the North 44 Atlantic and compared with previous data from New Zealand (South Pacific). Three 45 alleles were resolved at each locus, with trans-species allele sharing and higher 46 levels of non-synonymous to synonymous substitution, especially in DQB locus. 47 Overall nucleotide diversities of 0.49 ± 0.38% and 4.60 ± 2.39% were identified for the 48 DRA and DQB loci, respectively, which are relatively low for MHC loci in the North 49 Atlantic, but comparable to levels previously described in New Zealand (South 50 Pacific). There were significant differences in allele frequencies within the North 51 Atlantic and between the North Atlantic and New Zealand. Patterns of diversity and 52 divergence are consistent with the long-term effects of balancing selection operating 53 on the MHC loci, potentially mediated through the effects of host-parasite coevolution. 54 Differences in allele frequency may reflect variation in pathogen communities, 55 coupled with the effects of differential drift and gene flow. 56 57 Keywords: Adaptive markers, evolution, cetacea 58 59 Introduction 60 61 Determining how intra-specific genetic diversity is spatially and temporally 62 apportioned among extant populations is essential for resolving patterns of population 63 structure, identifying levels of dispersal and gene flow, detecting local adaptation, and 64 identifying populations with inherently low levels of extant diversity which may 65 become a conservation concern (e.g. Piertney & Oliver 2006; Witteveen et al. 2011). 66 Neutral molecular markers such as mitochondrial DNA (mtDNA) and 67 microsatellite polymorphisms have been extensively applied in genetic studies of wild 68 populations (Ballard & Whitlock 2004). However, it is appreciated that neutral genetic 69 diversity is a poor proxy for levels of adaptively important genetic variation within the 70 genome (e.g. Merila & Crnokrak 2001; Holderegger et al. 2006). There is a growing 71 emphasis on resolving levels of adaptive genetic diversity and selective processes 72 that directly affect traits underpinning individual fitness and population viability (e.g 2 73 Piertney & Oliver 2006; Bos et al. 2008; Spurgin & Richardson 2010; Oliver & 74 Piertney 2012). A major focus for examining adaptive genetic diversity has been the 75 genes of the Major Histocompatibility Complex (MHC). MHC genes encode proteins 76 that recognize and present foreign antigens to the immune system to initiate adaptive 77 immune responses (Piertney & Oliver 2006). MHC genes are distributed in two main 78 MHC subfamilies (class I and class II), with class I primarily associated with 79 intracellular pathogens and class II genes with extracellular pathogens, in terms of 80 antigen presentation (Piertney & Oliver 2006; Spurgin & Richardson 2010), although 81 the reverse may also occur less frequently (Neefjes et al. 2011). 82 The functional importance of MHC in the immune system underlies its 83 characteristic high levels of genetic diversity within natural populations, with a large 84 repertoire of extant alleles (Piertney & Oliver 2006). Levels of nucleotide diversity in 85 human MHC are approximately two orders of magnitude higher than the genomic 86 average (Gaudieri et al. 2000). This high diversity is maintained by balancing 87 selection, which is thought to be mediated through host-pathogen interactions and/or 88 mate choice (e.g. Kundu & Faulkes 2004; Cutrera & Lacey 2006; Piertney & Oliver 89 2006; Oliver et al. 2009; Spurgin & Richardson 2010). Almost without exception, MHC 90 studies have observed the classical signatures of balancing selection from extant 91 MHC variation. These include the occurrence of phylogenetic patterns such as trans- 92 species polymorphism where allelic lineages are shared among species and persist 93 over long evolutionary time (Klein et al. 1980), and the occurrence of an excess of 94 nonsynonymous (dN) over synonymous (dS) substitutions, especially in the Peptide 95 Binding Region (PBR) sites (Hughes & Nei 1989). 96 One taxonomic group that can display relatively low levels of MHC diversity are 97 marine mammals. For example, Murray et al. (1995) resolved only five DQB alleles 98 among 233 beluga whales, Delphinapterus leucas (Pallas, 1776), while Munguía- 99 Vega et al. (2008) reported one DQB and two DRB alleles in 25 and 29 vaquitas, 100 Phocoena sinus Norris & McFarland, 1958. This reduced MHC variability has been 101 attributed to weaker balancing selection operating on marine mammal populations, 102 possibly due to lower pathogenic selection pressures (e.g. Slade et al. 1992; 103 Villanueva-Noriega et al. 2013). However, this theory is still under debate (McCallum 104 et al. 2004; Xu et al. 2010), since high levels of MHC diversity have also been 105 described for several cetacean species (Baker et al. 2006; Xu et al. 2007, 2012). 106 Moreover, some studies have resolved high spatial variation of extant adaptive 107 diversity in odontocete populations across different geographical regions, which 108 suggests that single point sampling of MHC variation may provide a biased view of 109 diversity and selection operating over larger geographic scales (e.g. Tursiops 3 110 truncatus (Montagu, 1821) and Orcinus orca (Linnaeus, 1758), Vassilakos et al. 111 2009). 112 Information on macro-scale host and parasite diversity, abundance and 113 consequent interactions in marine ecosystems is still scarce relative to terrestrial 114 systems, and mostly based on teleost fishes (Rohde et al. 1984; Luque & Poulin 115 2008). Parasite diversity has already been associated with variables related to fish 116 habitat, trophic level, temperature and oceanic distribution. As an example, several 117 studies have reported a higher diversity of hosts and parasites in the Indo-Pacific 118 compared to the Atlantic (Rohde 1978,1984; Luque & Poulin 2008; reviewed in 119 Rohde 2010). Additionally, sea temperature also seems to influence the diversity of 120 parasites, with marine ecto- and endoparasites showing a positive and negative 121 correlation with temperature, respectively (Rohde 1984; Rohde & Heap 1998; Luque 122 & Poulin 2008). Given the reported occurrence of latitudinal and longitudinal gradients 123 in host and parasite diversity in different oceanic basins (e.g. Lambshead et al. 2002; 124 reviewed in Rohde 2010), geographical variation in parasite burden may be expected, 125 especially for host species with a large geographical range (Morand 2000) which in 126 turn may affect standing genetic diversity in different areas, especially for genes such 127 as the MHC. 128 The long-finned pilot whale Globicephala melas (Traill, 1809) is a widely 129 distributed, medium-sized odontocete (Reid et al. 2003). There have been few studies 130 examining the patterns of genetic diversity across populations of this species (Fullard 131 et al. 2000; Oremus et al. 2009; Monteiro et al. 2015a). Neutral markers, such as 132 microsatellites and mitochondrial DNA, have shown some genetic differentiation 133 between populations located within the same oceanic basin (e.g. Atlantic Ocean, 134 Fullard et al. 2000; Monteiro et al. 2015a) and between oceanic basins (e.g. Atlantic 135 and Pacific Ocean, Oremus et al. 2009), but such studies have not been extended to 136 examine patterns of adaptive genetic diversity. The only data available are from a 137 mass stranding of 225 long-finned pilot whales from New Zealand, where eight alleles 138 were resolved both at DQA and DQB loci (Heimeier 2009). 139 In the present study, the levels of MHC DRA and DQB diversity across long- 140 finned pilot whales stranded in different locations of the North Atlantic will be analysed 141 and contrasted with previous studies from New Zealand (South Pacific) (Heimeier 142 2009) to assess the extent to which standing genetic variation at the MHC varies 143 within and across ocean basins. It will be tested whether the patterns of diversity 144 identified are consistent with the long-term effects of balancing selection, through the 145 examination of trans-species polymorphism and the analysis of the ratio between 146 nonsynonymous and synonymous substitutions within the PBR of the alleles 4 147 resolved. The present study will comprise the first description of the allelic diversity at 148 MHC loci in long-finned pilot whales from the North Atlantic. 149 150 Materials and Methods 151 Sample collection 152 A total of 119 long-finned pilot whale samples was collected from animals 153 stranded in several locations of the North Atlantic (Northwestern Iberian Peninsula 154 (NI), United Kingdom (UK) and United States of America (USA), Fig. 1). In addition, 155 this study used samples collected from animals taken in drive fisheries in 2010 and 156 archived at the tissue bank of the Museum of Natural History of the Faroe Islands (FI, 157 Fig. 1). Strandings occurred between 2001- 2009, 1992-2011 and 2002-2009 in NI, 158 UK and USA, respectively. All tissue samples were either frozen or preserved in 70% 159 ethanol. 160 MHC data from the North Atlantic was compared to DQB locus data analysed 161 from 224 pilot whales from five mass strandings in New Zealand (South Pacific, 1992 162 – 2004, Heimeier 2009). 163 164 DNA extraction, amplification and genotyping 165 Skin samples were digested in cetyl trimethylammonium bromide (CTAB) 166 extraction buffer and DNA was purified by a standard phenol–chloroform-isoamyl 167 alcohol procedure (modified from Sambrook et al. 1989). 168 The exon 2 region of the MHC DRA locus (188bp) was amplified in a total of 111 169 samples from the North Atlantic (Fig. 1 and Table I), using the primers DRAf (5’- AAT 170 CAT GTG ATC ATC CAA GCT GAG TTC-3’) and DRAr (5’- TGT TTG GGG TGT TGT 171 TGG AGC G -3’) (Xu et al. 2007). For the MHC DQB locus, a 171bp fragment of the 172 exon 2 region was amplified in a total of 96 samples from the North Atlantic (Fig. 1 173 and Table I), using the primers DQB1 (5′-CTGGTAGTTGTGTCTGCACAC-3') and 174 DQB2 5′-CATGTGCTACTTCACCAACGG-3′ (Murray et al. 1995). To allow analysis 175 using Denaturing Gradient Gel Electrophoresis (DGGE), an additional 40bp GC-rich 176 sequence (GC clamp) was added to the 5’ end of the primers DRAf and DQB1. The 177 optimal GC clamp sequence was ascertained using WinMelt software (Bio-Rad). For 178 both MHC loci, PCR reactions were carried out in a 10µl final volume reaction 179 containing 1x PCR Buffer, 1.5 mM MgCl2, 0.2 mM DNTPs, 0.5 units of BIOTAQ DNA 180 Polymerase (Bioline) and 0.4 µM of each primer. Cycling conditions for the DRA locus 181 were: 2 min at 94º C, 19 cycles of 30s at 91ºC, 30s at 60-50ºC (decreasing 0.5ºC per 182 cycle), 19 cycles of 30s at 91ºC, 30s at 50ºC followed by a final extension at 72ºC for 183 1 min, while for the DQB locus conditions were: 3 min at 94º C, 30 cycles of 30s at 5 184 94ºC, 30s at 58ºC and 30s at 72ºC followed by a final extension at 72ºC for 5 min. 185 PCR products were first visualized on a 1.5% agarose gel. 186 DGGE was performed in a Bio-Rad DCode System. One microliter of each PCR 187 product was applied directly onto 1 mm thick 10% polyacrylamide gels 188 (acrylamide:bis-acrylamide at 37:5:1) in 1x TAE (40 mM Tris-acetate and 1 mM 189 EDTA, pH 8.3), mixed with varying concentrations of denaturing agents (according to 190 the desired denaturing gradient) and polymerized by the addition of 0.1% TEMED and 191 0.1% ammonium persulphate. Denaturing gradients consisted of increasing 192 concentrations of urea and formamide in the polyacrylamide solutions (0.07 M urea 193 and 0.4% of formamide per % denaturant) and were formed using the Gradient 194 Delivery System (Bio-Rad). Electrophoresis conditions were optimized for maximum 195 band separation and resolution to 14 h at a constant voltage (50 V) and temperature 196 (60ºC), in a linear 40% to 50% denaturing agent gradient for the DRA locus and 50% 197 to 70% for the DQB locus. After electrophoresis, the gels were silver stained, 198 consisting of a 30 min immersion of the gel in a fixative solution (10 % absolute 199 ethanol/0.5 % acetic acid), followed by a 20 min immersion in 0.1 % silver nitrate 200 solution and a final immersion in a developing solution (3% sodium hydroxide and 201 1.5% of 37% formaldehyde solution), until total development of the bands. DGGE 202 bands were sequenced after excision from the gel and re-amplification. Re- 203 amplification was performed with the original primer set of each locus, as described 204 above, except that for DQB locus the PCR comprised 28 cycles and the annealing 205 temperature was increased to 60ºC. PCR products were purified using QIAquick PCR 206 purification columns (Qiagen), according to the manufacturer’s protocol. DNA 207 sequencing was undertaken using the each forward primer on an ABI3700 automated 208 DNA sequencer (Applied Biosystems, CA, USA), according to the manufacturer’s 209 instructions. Ambiguous sequences were re-sequenced using the reverse primer. 210 A sequence variant was identified as a new allele only when it was in accordance 211 with criteria laid out in Kennedy et al. (2002), namely that when using DNA cloning 212 and sequencing there have to be at least three identical clones, identified in either two 213 independent PCR reactions from the same individual, or from PCRs from at least two 214 different individuals. Therefore, to validate each allelic sequence, at least 12 215 replicates of each putative allelic band (when possible), taken from different 216 individuals across random gels, were sequenced using the forward primer and every 217 allele was re-run with the reverse primer at least three times. To ensure consistency 218 in scoring between runs, alleles found in previously cloned pilot whales, according to 219 the standard pGEM (Promega Ltd protocol), were used as a standard and added to 220 each gel. 6 221 Both DRA and DQB exon 2 alleles of long-finned pilot whale were designated 222 according with the nomenclature described by Klein et al. (1990) for MHC in non- 223 human species. The alleles resulting from the analyses of DRA and DQB loci were 224 phylogenetically compared with previously published sequences of the DRA and DQB 225 exon 2 regions of different cetacean. While for DRA locus, all the cetacean 226 sequences available at Genbank were included (31 sequences), for DQB locus a 227 random selection of 21 sequences was used in order to include several odontocete 228 species in the analysis (see Fig. 2 for Genbank accession numbers). 229 230 Statistical analysis 231 Genetic diversity and differentiation 232 Sequence visualisation, alignment and translation into amino acid sequences 233 were performed using Clustal W (Thompson et al. 1997) and MEGA 6.0 (Tamura et 234 al. 2013). Sequences were confirmed as MHC DRA and DQB sequences by the 235 National Center for Biotechnology Information (NCBI) BLAST comparison. Allelic 236 richness, nucleotide diversity, observed and expected heterozygosity (Nei & Tajima 237 1981), and deviation from Hardy-Weinberg equilibrium (20,000 bootstrap replicates to 238 test the null hypothesis that loci are in Hardy-Weinberg equilibrium) were calculated 239 using FSTAT 2.9 (Goudet 1995) and ARLEQUIN 3.5.1.2 (Excoffier & Lischer 2010). 240 To test for differences in allele frequencies per locus, within the North Atlantic 241 and between the North Atlantic and New Zealand (South Pacific), pairwise analyses 242 were performed in GENEPOP, using Fisher´s exact test (Raymond & Rousset 1995). 243 Differentiation analysis between North Atlantic and South Pacific was based only on 244 DQB locus alleles. Statistical significance was estimated using a Markov chain 245 algorithm with 10,000 dememorization steps, 100 batches and 5,000 iterations. A 246 Bonferroni correction was applied as an adjustment of critical p-values for 247 comparisons across the North Atlantic. 248 249 Indentifying historical signatures of selection 250 The long-term effects of selection on MHC loci were assessed by testing 251 whether dN:dS > 1 in the DQB and DRA loci (including analysis for all sites and 252 separately for PBR and non PBR sites), and confirming retention of allelic lineages 253 across speciation events (trans-species polymorphism). 254 The relative rate of nonsynonymous (dN) to synonymous (dS) mutation, applying 255 the method of Nei & Gojobori (1986), with Jukes–Cantor correction for multiple 256 mutations at single sites, were calculated using MEGA 6.0 (Tamura et al. 2013). The 257 probability of rejecting the null hypothesis of neutrality (dN = dS) in favour of the 7 258 alternative hypothesis of positive selection (dN 259 (Nei & Kumar 2000). Standard errors for these estimates were estimated through 260 100,000 bootstrap replicates. Nucleotides within the Peptide Binding Region (PBR) 261 were determined as predicted by Brown et al. (1993). > dS) was determined using a Z-test 262 Phylogenetic relationships were assessed using Maximum Likelihood as 263 implemented in PAUP 4.0 (Swofford 2003) and Bayesian analysis as implemented in 264 MrBayes 3.2 (Ronquist et al. 2012). Likelihood analysis was performed using 1000 265 bootstrap replicates with tree-bisection-reconnection branch swapping. For Bayesian 266 phylogeny estimation, two independent runs of four Metropolis-coupled MCMC chains 267 (temperature= 0.2) were run for 1000,000 generations (every 1000th tree was 268 sampled). The first 25% of trees were discarded as burn-in, resulting in 750 trees 269 from which parameter values and trees were then summarized and a consensus tree 270 was drawn using the program TREEVIEW 1.6 (Page 1996). The model of sequence 271 evolution recommended by JModeltest 2.1 (Darriba et al. 2012) was the Kimura 2 272 Parameter model (Kimura 1980), with gamma-distributed rate variation across sites 273 for the DRA locus (gamma distribution = 0.295) and HKY + G (A = 0.2234, C = 274 0.2542, G = 0.3835, T = 0.1389 and gamma distribution = 0.1540) for the DQB locus. 275 Bos taurus (U77794) was used as outgroup for the DQB locus. 276 277 Results 278 Genetic diversity and differentiation 279 280 No more than two sequences were resolved for any individual at either the DRA or DQB locus, suggesting that one single locus was amplified in both cases. 281 The DRA exon 2 region (186bp) was sequenced from 111 long-finned pilot 282 whales, across different geographic locations in the North Atlantic. A total of 13 283 variable sites (6.9%) in the nucleotide sequence defined three unique DRA allelic 284 sequences: Glme-DRA*01, Glme-DRA*02, which were previously described by Xu et 285 al. (2009) in other cetacean species, including Grampus griseus (G. Cuvier, 1812) 286 (GenBank accession: EF375601 and EF375602, respectively), and the newly 287 described Glme-DRA*03 (Table I.Ia and Fig. 2a). The three nucleotide alleles 288 translated to two different amino acid sequences (of 62 amino acids in length), with 289 eight variable sites (Table I.Ib). DRA exon 2 diversity is given for each Atlantic region 290 in Table II. Overall, there was an average nucleotide diversity of 0.49 ± 0.38%, 291 ranging between 0.22 ± 0.23% (FI) and 0.86 ± 0.58 (UK) (Table II). Allelic richness 292 was highest in NI and UK and lowest in the FI and USA (Table II). Observed 293 heterozygosity ranged between 0.14 (UK) and 0.70 (USA), having an overall value of 294 0.40. Overall, long-finned pilot whales from the North Atlantic were in Hardy-Weinberg 8 295 equilibrium. When considering each region analysed, the UK and USA presented 296 significant deviations from Hardy-Weinberg expectations, with a significant deficit and 297 excess of heterozygotes, respectively. 298 The DQB exon 2 region (171bp) was sequenced from 96 long-finned pilot 299 whales, from different geographic locations in the North Atlantic. A total of 17 variable 300 sites (9.9%) in the nucleotide sequence defined three unique DQB allelic sequences: 301 Glme-DQB*01 (previously described in long-finned pilot whales in the Pacific; 302 Heimeier 2009); Glme-DQB*02 (previously described in Tursiops truncatus (Genbank 303 accession: AB302053); and Glme-DQB*03 (previously described in long-finned pilot 304 whales in the Pacific; Heimeier 2009) (Table I.IIa). Each of the three nucleotide alleles 305 translated to a different amino acid sequence (57 amino acids), with 10 variable sites 306 (Table I.IIb). For the 14 amino acids corresponding to the PBR, variability was 307 detected at seven sites (50%), while for the remaining 43 amino acids variability was 308 detected at three sites (7.0%) (Table I.IIb). Table II summarizes the data on indicators 309 of DQB exon 2 diversity in each sampling group. Overall, there was an average 310 nucleotide diversity of 4.60 ± 2.39%, ranging between 4.21 ± 2.24% (FI) and 4.54 ± 311 2.44% (NI) (Table II). Allelic richness was highest in NI and lowest in the USA (Table 312 II). Observed heterozygosity ranged between 0.22 (USA) and 0.34 (UK), having an 313 overall value of 0.35. Overall, long-finned pilot whales from the North Atlantic were 314 not in Hardy-Weinberg equilibrium. However, although in all regions analysed the 315 expected heterozygosity exceeded the observed values, none of them presented 316 significant deviations from Hardy-Weinberg expectations. 317 There was some geographic variation in MHC allele frequencies across the North 318 Atlantic, most notably Glme-DQB*02 which shows a higher frequency in NI in 319 comparison to remaining regions, to Glme-DRA*01 in NI and USA which seem to 320 show lower frequencies when compared to UK and FI, and to Glme-DRA*03 which is 321 shared only by pilot whales from the NI and the UK (Fig. 1). The levels of genetic 322 differentiation found within the North Atlantic (Table III) revealed significant 323 differences of allelic frequencies between some of the analysed regions. In particular, 324 for DRA locus there were only significant differences in allele frequencies between 325 the UK and USA, while for DQB locus there were significant differences in allele 326 frequencies between NI and all the remaining analysed locations (Table III). 327 The comparison between North Atlantic and New Zealand (South Pacific) 328 identified the occurrence of significant differences between DQB allele frequencies of 329 pilot whales from both oceanic basins (Table III). 330 331 Analysis of historical signatures of selection 9 332 For the DRA exon 2 region, the proportion of nonsynonymous substitutions (dN = 333 4.70%) was not significantly different from synonymous substitutions (d S = 5.50%) (Z- 334 test P > 0.05). None of the amino acid substitutions was located in a site that has 335 been considered to be involved in peptide binding, as described by Brown et al. 336 (1993) (Table I.Ib). 337 For the DQB exon 2 region, a significantly higher value for the proportion of 338 nonsynonymous substitutions (dN = 9.70 ± 3.30%) was shown when compared to the 339 proportion of synonymous substitutions (dS = 0.80 ± 0.80%) over all sites (Z-test P < 340 0.01) (Table IV). For codons within both the PBR and non-PBR, the rate of 341 nonsynonymous substitutions (dN = 34.30 ± 15.40% and 3.50 ± 2.40%, respectively) 342 exceeded that of synonymous substitutions (dS = 3.60 ± 3.90%, and 0.0 ± 0.0, 343 respectively), although this difference was only significant in PBR (Z-test P < 0.05) 344 (Table IV). 345 Phylogenetic analysis of DRA exon 2 was based on sequences described in this 346 study for long-finned pilot whale and sequences from other cetacean species 347 described in previous studies (Xu et al. 2007, 2008, 2009; Ballingall 2010) (Fig. 2a). 348 In general, no species-specific or even suborder-specific clades were observed, since 349 the three alleles described in this study for long-finned pilot whale were more closely 350 related with alleles from other species or suborders than with alleles belonging to the 351 same genus, revealing a trans-specific sharing of alleles (Fig. 2a). The strong 352 evidence of trans-species polymorphism can be observed in Glme-DRA*01 allele 353 which grouped with Risso’s dolphin (Grampus griseus), striped dolphin Stenella 354 coeruleoalba (Meyen, 1833), pantropical spotted dolphin Stenella attenuata (Gray, 355 1846) (identical alleles) and Indo-Pacific bottlenose dolphin Tursiops aduncus 356 (Ehrenberg, 1832) (> 50% bootstrap values). Likewise, it was also evident between 357 Glme-DRA*03 allele and Omura’s whale (Balaenoptera omurai Wada, Oishi & 358 Yamada, 2003) (> 50% bootstrap values and > 0.95 posterior probability values). As 359 in DRA exon 2, no species-specific clade was observed in the DQB long-finned pilot 360 whale sequences analysed in the present study. Instead, this species DQB 361 sequences grouped with sequences from different species described in previous 362 studies (Fig. 2b), evidencing the occurrence of trans-specific sharing of alleles, 363 specifically 364 AB302053, identical alleles) (> 50% bootstrap values and > 0.95 posterior probability 365 values). between Glme-DQB*02 and Tutr-DQB*10 (Genbank accession: 366 367 Discussion 10 368 The present study described the levels of diversity of the MHC DQB and DRA 369 loci in long-finned pilot whales from different regions of the North Atlantic, for 370 comparison with previous results from the South Pacific (Heimeier 2009). For both 371 DRA and DQB locus, the variation resolved in the North Atlantic is comparable with 372 the low levels of MHC allelic diversity already reported in some cetacean species (e.g. 373 Murray et al. 1995; Munguia-Vega et al. 2007; Xu et al. 2007, 2008, 2009). Clearly, 374 any comparison of levels of genetic diversity must be done with recourse to the 375 potential sources of bias inherent to the samples taken. In this case, three bias 376 sources were considered: limited sample size, failure to resolve all variation using 377 DGGE, and non-random sampling of individuals with a bias caused by the sampling 378 of family groups. In this case, several individuals displaying the same DGGE pattern 379 were sequenced, and no cryptic DNA diversity was resolved. Similarly, the sample 380 sizes used in the present study are directly comparable to those used previously, so 381 the estimates of allelic richness are directly comparable even if all of the extant 382 diversity has not been resolved across the range. The largest concern is whether 383 these samples are random, given that the sampled stranded animals may have a high 384 degree of social structure. However, a recent study, based on some of the samples 385 analysed by Heimeier (2009), revealed that there is evidence that mass strandings of 386 pilot whales involved multiple matrilines and unrelated individuals (Oremus et al. 387 2013), suggesting that the potential for downward bias of genetic diversity is minimal. 388 It may be the case that the observed MHC diversity is related to the frequently 389 suggested weaker pathogenic pressure in marine environments, leading to weaker 390 balancing selection acting on the MHC in marine mammals compared to terrestrial 391 mammals (Slade et al. 1992; Villanueva-Noriega et al. 2013). However, this 392 hypothesis is inconsistent with some data from other species. High levels of adaptive 393 diversity have already been reported in some cetacean species (e.g. Baker et al. 394 2006; Xu et al. 2012; Arbanasic et al. 2014). Recent studies have described high 395 levels of DQB nucleotide and allelic diversities of finless porpoise, Neophocaena 396 phocaenoides (Cuvier, 1829), baiji, Lipotes vexillifer (Miller, 1918) (e.g. Yang et al. 397 2005; Xu et al. 2007, 2009) and baleen whales (Baker et al. 2006). 398 Moreover, some studies have resolved high spatial variation of extant adaptive 399 diversity in dolphin populations across different geographical regions, suggesting that 400 single point sampling of MHC variation may provide a biased view of diversity and 401 selection operating over larger geographic scales (e.g. Vassilakos et al. 2009; Xu et 402 al. 2010). In the present study, within the North Atlantic, significant differences in 403 allelic frequencies were detected between the NI and all the remaining regions 404 analysed across this oceanic basin for DQB locus, and between the UK and the USA, 11 405 for the DRA locus. Additionally, significant differences were also observed in allele 406 frequencies between the North Atlantic and New Zealand. One of the putative 407 ecological factors underpinning the occurrence of geographic variation in adaptive 408 diversity is pathogen-mediated selection (e.g. Piertney & Oliver 2006; Vassilakos et 409 al. 2009; Spurgin & Richardson 2010; Xu et al. 2010), as exemplified by Vassilakos et 410 al. (2009). These authors suggested that dolphin populations with a worldwide 411 distribution across different geographic locations would be under differential pathogen 412 selective pressures. Hence, differential parasite pressures across the distribution 413 range of long-finned pilot whales (e.g. Raga & Balbuena 1993; Abollo et al. 1998) 414 could be an explanation for the significant differences detected in allele frequencies, 415 in the North Atlantic and between the North Atlantic and New Zealand (South Pacific). 416 Indeed, a study based on the analyses of parasites in long-finned pilot whales from 417 the Faroe Islands, the Coast of France and the Mediterranean found some variation 418 in terms of parasite diversity between locations (Raga & Balbuena 1993). 419 The geographic differences in adaptive diversity may be particularly relevant 420 considering the latitudinal (mainly related to water temperature) and longitudinal 421 differences in host and parasites diversity already reported, with warmer waters and 422 the Indo-Pacific being richer than colder waters and the Atlantic (reviewed in Rohde 423 2010). These differences could help explain both the significant differences between 424 the allele frequencies of pilot whales in southern regions of the North Atlantic (warmer 425 waters) and from the North Atlantic and New Zealand (South Pacific), although higher 426 DQB nucleotide and allele diversity would be expected in New Zealand compared to 427 the North Atlantic, based on this theory. However, it is important to highlight that a 428 higher diversity of parasites in warmer waters and in the Pacific does not necessarily 429 predict levels of host-parasite interaction. 430 Associated with the existent latitudinal and longitudinal differences in parasite 431 diversity, the dietary differences exhibited by pilot whales in their distribution range 432 may also alter the exposure levels of this predator to different parasites. Previous 433 dietary studies of pilot whales, based on stomach contents and stable isotope 434 analyses, have shown the occurrence of dietary differences within Atlantic waters and 435 between Atlantic and Pacific basins (e.g. Desportes & Mouritsen 1993; Gannon et al. 436 1997; Beatson et al. 2007; Santos et al. 2014; Monteiro et al. 2015b). However, more 437 information is needed about the dietary habits and parasite burdens of pilot whales 438 and respective prey, in a broad geographic range, to be able to explore this theory. 439 In the present study, the analyses of MHC loci revealed convincing evidence that 440 balancing selection has been acting to maintain functional polymorphism of the DQB 441 locus in the North Atlantic, over long-term scale, due to the occurrence of trans12 442 species polymorphism (Klein 1980), together with higher levels of nonsynonymous 443 (versus synonymous) amino acid substitutions (dN:dS) both across the molecule and 444 also in the peptide binding region. Such phylogenetic patterns and inequality in dN:dS 445 (dN > dS, for PBR), characteristic of balancing selection, have already been reported in 446 several MHC studies in cetaceans (beluga, Murray et al. 1995; minke whale, Hayashi 447 et al. 2003; Hector’s dolphin, Heimeier et al. 2009), including long-finned pilot whales 448 from New Zealand (South Pacific) (Heimeier 2009). However, the occurrence of long- 449 term balancing selection may not reflect the influence of selection in contemporary 450 populations. Therefore, although geographical variation in parasites diversity 451 worldwide may be leading to geographical adaptive structure in pilot whales from the 452 North Atlantic and between the North Atlantic and New Zealand (Pacific), other 453 microevolutionary forces, such as migration or genetic drift, may have an important 454 role in shaping contemporary population genetic diversity or may be masking 455 contemporary signatures of selection at both oceanic basins as already described in 456 several studies (reviewed in Radwan et al. 2010). To disentangle which 457 microevolutionary forces could be contributing to the adaptive diversity on North 458 Atlantic and South Pacific long-finned pilot whales, it would be necessary to compare 459 adaptive and neutral variation (Radwan et al. 2010). 460 For the DRA locus, analysis of the long-term influence of balancing selection at 461 DRA locus showed no bias of dN to dS, suggesting a lack of effect of balancing 462 selection. Similar results were obtained by Xu et al. (2009), where the analysis of 463 several cetacean species revealed purifying historical selection on this locus. This 464 absence of balancing selection together with the low levels of diversity observed in 465 this locus may be due to loss of variation via random drift or it may represent a 466 nonclassical MHC locus, where low variation is due to a functional constraint (Babik et 467 al. 2008). 468 In conclusion, the levels of MHC allelic diversity found in long-finned pilot 469 whales from the North Atlantic were relatively low, and comparable with those 470 previously described in New Zealand (South Pacific) (Heimeier 2009). Inferences 471 drawn from dN:dS and the occurrence of trans-species polymorphism suggest that 472 balancing selection had an important role at maintaining DQB locus diversity in the 473 North Atlantic, historically. If the effect of balancing selection was acting in 474 contemporary populations of long-finned pilot whales in the North Atlantic, the 475 geographic variation found in adaptive allele frequencies across regions could reflect 476 the occurrence of differential pathogen selective forces. Likewise, the reported higher 477 levels of host and parasite diversity in the Pacific compared to the Atlantic Ocean, 478 may have resulted in the significant differences in allele frequencies between the New 13 479 Zealand (Pacific) and North Atlantic, although higher levels of adaptive diversity 480 would be expected in the former. Therefore, it may be that other microevolutionary 481 processes, such as migration and/or genetic drift are operating to shape the 482 distribution of adaptive genetic diversity in the North Atlantic and South Pacific. Future 483 studies should compare genetic variation at neutral and MHC loci in order to discern if 484 adaptive diversity of long-finned pilot whales in the Atlantic and Pacific Oceans is 485 driven by stochastic or deterministic microevolutionary processes. 486 487 Acknowledgments 488 Samples were collected under the auspices of strandings monitoring programs run by 489 Sociedade Portuguesa de Vida Selvagem, Coordinadora para o Estudio dos 490 Mamíferos Mariños (supported by the regional government Xunta de Galicia), the 491 Scottish Agriculture College Veterinary Science Division, the UK Cetacean Strandings 492 Investigation Programme (jointly funded by Defra and the Devolved Governments of 493 Scotland and Wales), the Museum of Natural History of the Faroe Islands and the 494 Marine Mammal Rescue & Research Program of the International Fund for Animal 495 Welfare. We thank all the members of those institutions for their assistance with data 496 and sample collection. Published in a collaboration between Universidade do Minho 497 and University of Aberdeen 498 499 Funding 500 Silvia S. Monteiro and Marisa Ferreira were supported by a Ph.D. grant from 501 Fundação 502 SFRH/BD/30240/2006, respectively). Alfredo López was supported by a postdoctoral 503 grant from Fundação para a Ciência e Tecnologia (ref SFRH/BPD/82407/2011). 504 Catarina Eira is supported by CESAM (UID/AMB/50017), from FCT/MEC through 505 national funds and FEDER (PT2020, Compete 2020). The work related with 506 strandings and tissue collection in Portugal was partially supported by the SafeSea 507 Project EEAGrants PT 0039 (supported by Iceland, Liechtenstein and Norway 508 through 509 NAT/PT/000038 (funded by the European Union–Program Life+) and by the project 510 CetSenti 511 (Funded by the Program COMPETE and Fundação para a Ciência e Tecnologia). para the a EEA FCT Ciência Financial e Tecnologia Mechanism), RECI/AAG-GLO/0470/2012; (ref by SFRH/BD/38735/2007 the Project and MarPro–Life09 FCOMP-01-0124-FEDER-027472 512 513 514 14 515 516 References 517 Abollo E, Lopez A, Gestal C, Benavente P, Pascual S. 1998. Macroparasites in cetaceans 518 stranded on the northwestern Spanish Atlantic coast. Diseases of Aquatic Organisms 519 32: 227-231. 520 Arbanasic H, Đuras M, Podnar M, Gomercic T, Curkovic S, Galov A. 2014. Major 521 histocompatibility complex class II variation in bottlenose dolphin from Adriatic Sea: 522 inferences about the extent of balancing selection. Marine Biology 161: 2407–2422. 523 524 525 526 Amos B, Schlotterer C, Tautz D. 1993. Social-Structure of Pilot Whales Revealed by Analytical DNA Profiling. Science 260: 670-672. Babik W, Pabijan M, Radwan J. 2008. Contrasting patterns of variation in MHC loci in the Alpine newt. Molecular Ecology 17: 2339–2355. 527 Baker CS, Vant MD, Dalebout ML, Lento GM, O’Brien SJ, Yuhki N. 2006. Diversity and 528 duplication of DQB and DRB-like genes of the MHC in baleen whales (suborder: 529 Mysticeti). Immunogenetics 58: 283–296. 530 Balbuena JA, Raga JA.1994. Intestinal helminths as indicators of segregation and social 531 structure of pods of long-finned pilot whales (Globicephala melas) off the Faroe Islands. 532 Cannadian Journal of Zoology 72: 443-448. 533 534 Ballard JWO, Whitlock MC. 2004. The incomplete natural history of mitochondria. Molecular Ecology 13: 729-744. 535 Ballingall KT, Rocchi MS, McKeever DJ, Wright F. 2010. Trans-Species Polymorphism and 536 Selection in the MHC Class II DRA Genes of Domestic Sheep. Plos One 5 (6):e11402. 537 10 pages. 538 Beatson EL, O’Shea S, Stone C, Shortland T. 2007. Notes on New Zealand mammals 6. 539 Second report on the stomach contents of long-finned pilot whales, Globicephala melas. 540 New Zealand Journal of Zoology 34: 359–362. 541 Bos DH, Gopurenko D, Williams RN, DeWoody JA. 2008. Inferring population history and 542 demography using microsatellites, mitochodrial DNA and major Histocompatibility 543 Complex (MHC) genes. Evolution 62: 1458–1468. 544 Brown JH, Jardetzky TS, Gorga JC, Stern LJ, Urban RG, Strominger JL, Wiley DC. 1993. 545 Three-dimensional structure of the human class II histocompatibility antigen HLA-DR1. 546 Nature 364: 33 - 39. 547 Cutrera AP, Lacey EA. 2006. Major Histocompatibility Complex variation in Talas Tuco-Tucos: 548 the influence of demography on selection. Journal of Mammalogy 87: 706–716. 549 Darriba D, Taboada GL, Doallo R, Posada D. 2012. jModelTest 2: more models, new 550 551 552 heuristics and parallel computing. Nature Methods 9: 772. Desportes G, Mouritsen R. 1993. Preliminary results on the diet of long-finned pilot whales off the Faroe Islands. Report of International Whaling Commision, 305-324. 15 553 Duarte S, Cássio F, Pascoal C. 2012. Denaturing Gradient Gel Electrophoresis (DGGE) in 554 Microbial Ecology – 555 Electrophoresis - Principles and Basics. InTech, Rijeka, Croatia p. 346 pp. Insights from Freshwaters in: Magdeldin, S. (Ed.), Gel 556 Excoffier L, Lischer HEL. 2010. Arlequin suite ver 3.5: a new series of programs to perform 557 population genetics analyses under Linux and Windows. Molecular Ecology Resources 558 10: 564-567. Fullard KJ, Early G, Heide-Jorgensen MP, Bloch D, Rosing-Asvid A, Amos 559 W. 2000. Population structure of long-finned pilot whales in the North Atlantic: a 560 correlation with sea surface temperature? Molecular Ecology 9: 949–958. 561 Gannon DP, Read AJ, Craddock JE, Fristrup KM, Nicolas JR. 1997. Feeding ecology of long- 562 finned pilot whales Globicephala melas in the western north Atlantic. Marine Ecology 563 Progress Series 148: 1-10. 564 Gaudieri S, Dawkins RL, Habara K, Kulski JK, Gojobori T. 2000. SNP Profile within the Human 565 Major Histocompatibility Complex Reveals an Extreme and Interrupted Level of 566 Nucleotide Diversity. Genome Research 10: 1579–1586. 567 568 569 570 571 572 573 574 Goudet J. 1995. FSTAT (Version 1.2): A Computer Program to Calculate F-Statistics. Journal of Heredity 86: 485-486. Hayashi K, Nishida S, Yoshida H, Goto M, Pastene LA, Koike H. 2003. Sequence variation of the DQB allele in the cetacean MHC. Mammal Study 28: 89–96. Heimeier D. 2009. Comparative diversity at the major histocompatibility complex in two dolphin species. PhD Thesis. University of Auckland. Holderegger R, Kamm U, Gugerli F. 2006. Adaptive vs. neutral genetic diversity: implications for landscape genetics. Landscape Ecology 21: 797–807. 575 Hughes AL, Nei M. 1989. Nucleotide substitution at major histocompatibility complex class II 576 loci: Evidence for overdominant selection. Proceedings of the National Academy of 577 Sciences USA 86: 958-962. 578 Kennedy LJ, Ryvar R, M.Gaskell R, Addie DD, Willoughby K, Carter SD, Thomson W, Ollier 579 WER, Radford AD. 2002. Sequence analysis of MHC DRB alleles in domestic cats from 580 the United Kingdom. Immunogenetics 54: 348–352. 581 Kimura M. 1980. A Simple Method for Estimating Evolutionary Rates of Base Substitutions 582 Through Comparative Studies of Nucleotide Sequences. Journal of Molecular Evolution 583 16: 111-120. 584 585 Klein J. 1980. Generation of diversity at MHC loci: implications for T-cell receptor repertoires. In: Fougereau M, Dausset J, editors. Immunology. London: Academic Press. 586 Klein J, Bontrop RE, Dawkins RL, Erlich HA, Gyllensten UB, Heise ER, Jones PP, Parham P, 587 Wakeland EK, Watkins DI. 1990. Nomenclature for the major histocompatibility 588 complexes of different species: a proposal. Immunogenetics 31: 217-219. 589 Kundu S, Faulkes CG. 2004. Patterns of MHC selection in African mole-rats, family 590 Bathyergidae: the effects of sociality and habitat. Proceedings of the Royal Society of 591 London B 271: 273–278. 16 592 Lambshead PJD, Brown CJ, Ferrero TJ, Mitchell NJ, Smith CR, Hawkins LE, Tietjen J. 2002. 593 Latitudinal diversity patterns of deep-sea marine nematodes and organic fluxes : a test 594 from the central equatorial Pacific. Marine Ecology Progress Series 236: 129 –135. 595 Luque JL, Poulin R. 2008. Linking ecology with parasite diversity in Neotropical fishes. Journal 596 of Fish Biology 72: 189–204. 597 McCallum HI, Kuris A, Harvell CD, Lafferty KD, Smith GW, Porter J. 2004. Does terrestrial 598 epidemiology apply to marine systems? Trends in Ecology & Evolution 19 (11): 585- 599 591. 600 Merila J, Crnokrak P. 2001. Comparison of genetic differentiation at marker loci and 601 quantitative traits. Journal of Evolutionary Biology 14: 892-903. Monteiro SS, Méndez- 602 Fernandez P, Piertney S, Moffat CF, Ferreira M, Vingada JV, López A, Brownlow A, 603 Jepson P, Mikkelsen B, Niemeyer M, Carvalho JC, Pierce GJ. 2015a. Long-finned pilot 604 whale population diversity and structure in Atlantic waters assessed through 605 biogeochemical and genetic markers. Marine Ecology Progress Series 536: 243–257. 606 Monteiro SS, Méndez-Fernandez P, Piertney S, Moffat CF, Ferreira M, Vingada JV, López A, 607 Brownlow A, Jepson P, Mikkelsen B, Niemeyer M, Carvalho JC, Pierce GJ. 2015a. 608 Long-finned pilot whale population diversity and structure in Atlantic waters assessed 609 through biogeochemical and genetic markers. Marine Ecology Progress Series 536: 610 243–257. 611 Monteiro S, Ferreira M, Vingada JV, López A, Brownlow A, Méndez-Fernandez P. 2015b. 612 Application of stable isotopes to assess the feeding ecology of long-finned pilot whale 613 (Globicephala melas) in the Northeast Atlantic Ocean. Journal of Experimental Marine 614 Biology and Ecology 465: 56–63. 615 Morand S, Cribb TH, Kulbickis M, Rigby MC, C. Chauvet, Dufour V, Faliex E, Galzin R, Lo 616 CM, -Yat AL, Pichelin S, Sasal P. 2000. Endoparasite species richness of New 617 Caledonian butterfly fishes: host density and diet matter. Parasitology 121: 65–73. 618 Munguia-Vega A, Esquer-Garrigos Y, Rojas-Bracho L, Vazquez-Juarez R, Castro-Prieto A, 619 Flores-Ramirez S. 2007. Genetic drift vs. natural selection in a long-term small isolated 620 population: major histocompatibility complex class II variation in the Gulf of California 621 endemic porpoise (Phocoena sinus). Molecular Ecology 16: 4051-4065. 622 Murray BW, Malik S, white BN. 1995. Sequence variation at the Major Histocompatibility 623 Complex Locus DQB in Beluga whales (Delphinapterus leucas). Molecular Biology and 624 Evolution 12: 582-593 625 Neefjes J, L. M. Jongsma M, Paul P, Bakke O. 2011. Towards a systems understanding of 626 MHC class I and MHC class II antigen presentation. Nature Reviews Immunology 11: 627 823-836. 628 629 630 631 Nei M, Gojobori T. 1986. Simple Methods for Estimating the Numbers of Synonymous and Nonsynonymous Nucleotide Substitutions. Molecular Biology and Evolution 3: 418-426. Nei M, Kumar S. 2000. Molecular Evolution and Phylogenetics. New York: Oxford University Press. 17 632 633 Nei M, Tajima F. 1981. DNA polymorphism detectable by restriction endonucleases. Genetics 97: 145-163. 634 Oliver MK, Lambin X, Cornulier T, Piertney SB. 2009. Spatio-temporal variation in the strength 635 and mode of selection acting on major histocompatibility complex diversity in water vole 636 (Arvicola terrestris) metapopulations. Molecular Ecology 18: 80–92. 637 638 639 Oliver MK, Piertney SB. 2012. Selection Maintains MHC Diversity through a Natural Population Bottleneck. Molecular Biology and Evolution 29: 1713-1720. Oremus M, Gales R, Dalebout ML, Funahashi N, Endo T, Kage T, Steel D, Baker SC. 2009. 640 Worldwide 641 (Globicephala spp.). Biological Journal of the Linnean Society 98: 729–744. mitochondrial DNA diversity and phylogeography of pilot whales 642 Oremus M, Gales R, Kettles H, Baker CS. 2013. Genetic Evidence of Multiple Matrilines and 643 Spatial Disruption of Kinship Bonds in Mass Strandings of Long-finned Pilot Whales, 644 Globicephala melas. Journal of Heredity 104: 301-311. 645 646 647 648 649 650 Page RDM. 1996. TREEVIEW: An application to display phylogenetic trees on personal computers. Computer Applications in the Biosciences 12: 357-358. Piertney SB, Oliver MK. 2006. The evolutionary ecology of the major histocompatibility complex. Heredity 96(1): 7-21. Radwan J, Biedrzycka A, Babik W. 2010. Does reduced MHC diversity decrease viability of vertebrate populations? Biological Conservation 143: 537–544. 651 Raga JA, Balbuena JA. 1993. Parasites of the long-finned pilot whale, Globicephala melas 652 (Traill, 1809), in European waters. Report of International Whaling Commission (Special 653 Issue) 14: 391-406. 654 655 656 657 658 659 Raymond M, Rousset F. 1995. GENEPOP (Version 1.2): Population Genetics Software for Exact Tests and Ecumenicism. The Journal of Heredity 86: 248 - 249. Reid JB, Evans PGH, Northridge SP. 2003. Atlas of cetacean distribution in north-west European Waters. Peterborough: Joint Nature Conservation Committee. Rohde K. 1978. Latitudinal Differences in Host-Specificity of Marine Monogenea and Digenea Marine Biology 47: 125-134. 660 Rohde K. 1984. Zoogeography of marine parasites. Helgoländer Meeresunters 37: 35-52. 661 Rohde K. 2010. Marine parasite diversity and environmental gradients. In: Morand S, Krasnov 662 BR, editors. The biogeography of Host–Parasite Interactions. Oxford: Oxford University 663 Press. pages 73-89. 664 Rohde K, Heap M. 1998. Latitudinal differences in species and community richness and in 665 community structure of metazoan endo- and ectoparasites of marine teleost fish. 666 International Journal of Parasitology 28: 461-474. 667 Ronquist F, Teslenko M, Mark PVD, Ayres DL, Darling A, Höhna S, Larget B, Liu L, Suchard 668 MA, Huelsenbeck JP. 2012. MrBayes 3.2: Efficient Bayesian Phylogenetic Inference 669 and Model Choice Across a Large Model Space. Systematic Biology 61: 539-542. 670 Sambrook J, Fritsch EF, Maniatis T. 1989. Molecular Cloning - A Laboratory Manual, 2nd 671 Edition. New York: Cold Spring Habour Laboratory Press. 18 672 Santos MB, Monteiro SS, Vingada JV, Ferreira M, López A, Martinez-Cedeira J, Reid RJ, 673 Brownlow A, Pierce G. 2014. Patterns and trends in the diet of long-finned pilot whales 674 (Globicephala melas) in the northeast Atlantic. Marine Mammal Science 30: 1-19. 675 Slade RW. 1992. Limited MHC Polymorphism in the Southern Elephant Seal: Implications for 676 MHC Evolution and Marine Mammal Population Biology. Proceedings of the Royal 677 Society B: Biological Sciences 249: 163-171. 678 679 680 681 682 683 Spurgin LG, Richardson DS. 2010. How pathogens drive genetic diversity: MHC, mechanisms and misunderstandings. Proceedings of the Royal Society B 277: 979-988. Swofford DL. 2003. PAUP*: Phylogenetic analysis using parsimony (* and other methods), version 4.0b 10. Massachusetts: Sinauer Associates. Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. 2013. MEGA6: Molecular Evolutionary Genetics Analysis Version 6.0. Molecular Biology and Evolution 30: 2725–2729. 684 Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG. 1997. The Clustal X 685 windows interface: flexible strategies for multiple sequence alignment aided by quality 686 analysis tools. Nucleic Acids Research 24: 4876-4882. 687 Vassilakos D, Natoli A, Dahlheim M, Hoelzel AR. 2009. Balancing and Directional Selection at 688 Exon-2 of the MHC DQB1 Locus among Populations of Odontocete Cetaceans. 689 Molecular Biology and Evolution 26: 681-689. 690 691 Villanueva-Noriega MJ, Baker CS, Medrano-González L. 2013. Evolution of the MHC-DQB exon 2 in marine and terrestrial mammals. Immunogenetics 65: 47–61. 692 Witteveen BH, Straley JM, Chenoweth E, Baker S, Barlow J, Matkin C, Gabriele CM, Neilson 693 J, Steel D, Ziegesar Ov, Andrews AG, Hirons A. 2011. Using movements, genetics and 694 trophic ecology to differentiate inshore from offshore aggregations of humpback whales 695 in the Gulf of Alaska. Endangered Species Research 14: 217–225. 696 Xu S, Chen B, Zhou K, Yang G. 2008. High similarity at three MHC loci between the baiji and 697 finless porpoise: Trans-species or convergent evolution? Molecular Phylogenetics and 698 Evolution 47: 36–44. 699 Xu S, Ju J, Zhou X, Wang L, Zhou K, Yang G. 2012. Considerable MHC Diversity Suggests 700 That the Functional Extinction of Baiji Is Not Related to Population Genetic Collapse. 701 Plos One 7: e30423. 702 Xu S, Ren W, Zhou X, Zhou K, Yang G. 2010. Sequence Polymorphism and Geographical 703 Variation at a Positively Selected MHC-DRB Gene in the Finless Porpoise 704 (Neophocaena phocaenoides): Implication for Recent Differentiation of the Yangtze 705 Finless Porpoise? Journal of Molecular Evolution 71: 6-22. 706 707 708 709 710 711 Xu S, Sun P, Zhou K, Yang G. 2007. Sequence variability at three MHC loci of finless porpoises (Neophocaena phocaenoides). Immunogenetics 59: 581-592. Xu SX, Ren WH, Li SZ, Wei FW, Zhou KY, Yang G. 2009. Sequence Polymorphism and Evolution of Three Cetacean MHC Genes. Journal of Molecular Evolution 69: 260–275. Yang G. 2005. Sequence Variation and Gene Duplication at MHC DQB Loci of Baiji (Lipotes vexillifer), a Chinese River Dolphin. Journal of Heredity 96: 310-317. 19 712 713 714 715 716 Figure legends 717 718 Fig.1 Location of the strandings of long-finned pilot whale analysed in this study (n = 719 119). Sample size is indicated for both MHC loci above each pie chart (right: DQB 720 locus; left: DRA locus). Pies charts show allele relative frequencies for the MHC class II 721 DQB and DRA. White: DQB allele 1; Black: DQB allele 2; Green: DQB allele 3; Red: 722 DRA allele 1; Orange: DRA allele 2; Yellow; DRA allele 3. NI: Northwest Iberia; UK: 723 United Kingdom; FI: Faroe Islands; USA: United States of America 724 725 Fig.2 Phylogenetic relationships between MHC alleles of long-finned pilot whales of the 726 North Atlantic, based on Maximum Likelihood and Bayesian analysis. a) MHC DRA 727 exon 2; b) MHC DQB exon 2. * indicate posterior probability support values superior to 728 0.95 and/or likelihood bootstrap values superior to 50%. For the DQB locus, Bos taurus 729 was used as outgroup. Genbank accession numbers are shown within brackets. Dede 730 Delphinus delphis; Stco - Stenella coeruleoalba; Tutr - Tursiops truncatus; Tuad 731 Tursiops aduncus; Glma - Globicephala macrorhynchus; Glme - Globicephala melas; 732 Grgr - Grampus griseus; Phma - Physeter macrocephalus; Laob - Lagenorhynchus 733 obliquidens; Meno - Megaptera novaeangliae; Dele - Delphinapterus leucas; Cehe 734 Cephalorhynchus hectori; Deca - Delphinus capensis; Phph - Phocoena phocoena; 735 Stat - Stenella attenuata; Pobl - Pontoporia blainvillei; Plga - Platanista gangetica; 736 Phda - Phocoenoides dalli; Neph - Neophocaena phocaenoides; Live - Lipotes 737 vexillifer; Baom - Balaenoptera omurai. 738 739 740 741 742 743 744 745 746 747 748 20 749 750 751 752 753 21 754 Tables 755 Table I. Nucleotide (a) and predicted amino acid sequences (b) for: I.I) MHC class II DRA exon 2 alleles and I.II) MHC class II DQB exon 2 756 alleles in long-finned pilot whale. A dot indicates identity with the top sequence. Asterisks and numbers in bold identify sites in the putative 757 peptide-binding region as predicted by Brown et al. (1993). 758 759 Table I.I a) 760 Glme*DRA*01 T C T C T G T C C C C T T A T C A A T C A A A C G A G T T T A T G T T T G A C T T T G A T G G T G A T G A G A T T T T C C A Glme*DRA*02 . . . . . . . . . . Glme*DRA*03 . . . . . . G T . . . . G . C . Glme*DRA*01 C G T G G A T A T G G A A A A G A A G G A G A C A G T G T G G C G G C T T A A A G A A T T T G G A A A T T T T G C C A G T T Glme*DRA*02 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . Glme*DRA*03 . . . . . . . T . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . Glme*DRA*01 T T C A G G C T C A G G G T G C A T T G G C C A A T A T G G C T G T G G G A A G A G C C A A C C T G G A C A T C T T G A T A Glme*DRA*02 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Glme*DRA*03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . A . . . . . . . . . . . . . . . . A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . G G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 761 762 763 764 765 22 766 Table I.I b) 767 Position PBR 2 2 2 4 2 6 3 3 1 2 4 3 5 1 5 5 5 3 4 5 5 8 6 2 6 5 6 6 8 9 7 2 * * * * * * * * * * * * * * * * Glme*DRA*01 S L S P Y Q S N E F M F D F D G D E I F H V D M E K K E T V WR L K E F G N F A S F Q A Q G A L A N M A V G R A N L D I L I Glme*DRA*02 . . Glme*DRA*03 . . V . D . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . L . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . R . . . . . . . . . . . . . . . . . . . . . . . . I . . . K . . . . . M . 768 769 770 771 Table I.II a) 772 Glme*DQB*01 A C G G A G C G G G T G C G G G T C A T G A A C A G A T A C A T C T A T A A C C G G G A G G A G T T C G T G C G C T T C G A Glme*DQB*02 . . . . . . . . . . . . . . . C A . G . . Glme*DQB*03 . . . . . . . . . . . . . . . T C . G . . G . . G . . . . . . T G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Glme*DQB*01 C A G C G A C G T G G G C G A G T A C C G G G C G G T G A C C G A G C T G G G C C G G C C G G A C G C C G A G T A C T G G A Glme*DQB*02 . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T C . Glme*DQB*03 . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Glme*DQB*01 A C A G C C A G G A G G A C A T C C T G G A G G A G G A A C G G G C C G C G C T G G A C A C G Glme*DQB*02 . . . . . . . . A . . . . . . . . . . . . . . C G . . . . . . . . . . C A C G . . . . . . . . Glme*DQB*03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 773 774 Table I.II b) Position PBR 2 8 3 0 3 2 3 3 7 8 4 7 5 6 6 6 0 1 6 5 6 8 7 7 0 1 7 4 * * * * * * * * * * * * * * Glme*DQB*01 T E R V R V M N R Y I Y N R E E F V R F D S D V G E Y R A V T E L G R P D A E Y WN S Q E D I L E E E R A A L D T Glme*DQB*02 . . . . . H V S . L . . . . . . . . . . . . . . . . F . . . . . . . . . . . . . F . . . K . . . . R . . . H V . . Glme*DQB*03 . . . . . S V D . . . . . . . . . . . . . . . . . F . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 775 25 776 Table II. Summary of genetic diversity statistics for the MHC DRA and DQB loci of 777 long-finned pilot whale analysed in the present study. Mean values ± Standard 778 Deviation are shown. n: sample size; π: nucleotide diversity. Ho: observed 779 heterozigoty; He: expected heterozigoty; P: probability of rejecting the null hypothesis 780 that loci are in Hardy-Weinberg equilibrium (ns: non-significant P > 0.0125; sig: 781 significant P ≤ 0.0125, considering bonferroni correction). Abbreviations are described 782 in figure 1. alleles Allelic richness NI 3 UK 3 FI 2 USA 2 2.99 2.99 2.00 2.00 Overall 3 2.72 π (%) DRA 0.86 ± 0.58 0.22 ± 0.23 0.27 ± 0.26 0.49 ± 0.38 0.15 0.36 0.70 0.40 0.33 0.30 0.48 0.42 sig ns sig ns 3 3 3 3 3 3.00 2.98 2.89 2.82 2.82 4.54 ± 2.44 4.50 ± 2.37 4.21 ± 2.24 4.36 ± 2.34 4.60 ± 2.39 Ho 0.39 0.41 0.33 0.27 0.35 He 0.45 0.51 0.44 0.46 0.51 P ns ns ns ns sig Ho He P alleles Allelic richness DQB 0.61 ± 0.45 π (%) 0.46 0.50 ns 783 784 785 786 787 788 789 790 791 792 793 794 795 26 796 Table III. Results from Fisher´s exact test for genetic differentiation within North 797 Atlantic locations and between North Atlantic (NA) and Pacific (PAC). Above the 798 diagonal: DRA locus; Below diagonal: DQB locus. ns: non-significant; sig: significant P 799 ≤ 0.0083 within NA comparisons (considering Bonferroni correction) and P ≤ 0.05 for 800 NA vs. PAC comparison. Abbreviations are described in figure 1. 801 802 803 804 805 NI UK FI USA NI sig sig sig UK ns ns ns FI ns ns ns USA ns sig ns - NA PAC NA sig PAC - 806 807 808 809 810 Table IV. The relative rate of dN and dS substitutions among alleles for codons in the 811 PBR and non-PBR of DQB exon 2 and test of positive selection for North Atlantic long812 finned pilot whale. dN: nonsynonymous substitutions; dS: synonymous substitutions; 813 PBR: peptide binding region; P: probability of rejecting the null hypothesis of neutrality 814 (dN = dS) in favour of the alternative hypothesis of positive selection (dN > dS) (Nei & Kumar 815 2000); N: Number of codons used for the test. Mean value ± standard error based on 816 100,000 replicates. 817 818 N dN (%) dS (%) P 819 Overall 57 9.7 ± 3.3 0.8 ± 0.8 0.002 820 PBR 14 34.3 ± 15.4 3.6 ± 3.9 0.01 Non-PBR 43 3.5 ± 2.4 0.0 ± 0.0 0.07 27 821 822 28
© Copyright 2026 Paperzz