Population genetic structure in gadoid fish with focus on Atlantic cod Gadus morhua Guðni Magnús Eiríksson Faculty of Life and Environmental Sciences University of Iceland 2015 Population genetic structure in gadoid fish with focus on Atlantic cod Gadus morhua Guðni Magnús Eiríksson Dissertation submitted in partial fulfillment of a Philosophiae Doctor degree in Biology Advisor Dr. Einar Árnason PhD Committee Dr. Einar Árnason Dr. Jarle Mork Dr. Kristinn Pétur Magnússon Opponents Dr. Dorte Bekkevold Dr. William Stewart Grant Faculty of Life and Environmental Sciences School of Engineering and Natural Sciences University of Iceland Reykjavík, October 2015 Population genetic structure in gadoid fish with focus on Atlantic cod Gadus morhua Dissertation submitted in partial fulfillment of a Philosophiae Doctor degree in Biology Copyright © Guðni Magnús Eiríksson 2015 All rights reserved Faculty of Life and Environmental Sciences School of Engineering and Natural Sciences University of Iceland Sturlugata 7 101, Reykjavík Iceland Telephone: 525-4000 Bibliographic information: Guðni Magnús Eiríksson, 2015, Population genetic structure in gadoid fish with focus on Atlantic cod Gadus morhua, PhD dissertation, Faculty of Life and Environmental Sciences, University of Iceland, 128 pp. ISBN 978-9935-9266-1-6 Printing: Háskólaprent ehf. Reykjavík, Iceland, October 2015 Abstract In the present study genetic variation and population genetic structure in spawning Atlantic cod, Gadus morhua, around Iceland was examined. Earlier research on population genetic structure in cod has not been conclusive and the use of different molecular methods have shown different patterns. It is important to determine why different methods show different patterns in order to describe the population genetic structure in cod. In the present study both microsatellite DNA variation and mitochondrial DNA sequence variation were estimated. Both methods have commonly been used in population genetic studies in cod. Findings of microsatellite DNA variation analysis showed a genetic difference between cod from the waters south and north of Iceland. However, analysis of the observed difference suggest that it can be explained by natural selection affecting genetic variation at a single microsatellite loci (Gmo34). When disregarding this locus from the analysis no genetic difference was observed. Small but significant genetic difference was found among Atlantic cod sampled at different depths off the south coast of Iceland, indicating that different populations of Atlantic cod may be found at different depths. Analysis of mtDNA sequence variation showed no overall genetic difference among different geographic areas around Iceland. Analysis of temporal mtDNA sequence variation showed rapid changes in allele frequencies, particularly in cod from NE-Iceland, suggesting that the mtDNA sequence variation can be useful for detecting recent population divergence. Thus, the findings of the present study do not suggest that cod around Iceland is geographically divided into distinct historical populations as has been suggested in some earlier studies. On the contrary the findings suggest high levels of gene flow in Atlantic cod around Iceland. Genetic variation and population genetic structure across the North Atlantic ocean was also examined in saithe Pollachius virens, haddock Melanogrammus aeglefinus and whiting Merlangius merlangus using mtDNA sequence variation at the cytochrome c oxidase subunit I locus. The results showed limited trans-Atlantic genetic structure for all the species indicating high levels of gene flow or insufficient time for genetic differentiation to have become established. The observed genetic sequence variation for saithe and haddock suggest sex-biased migration pattern. The results indicate that females may be more philopatric and males more migratory. Such behaviour has rarely been described for marine fish and is worth further research. The observed mitochondrial sequence variation for all the studied species indicate sudden population expansion, reflected in high number of singletons and a shallow genealogy. However, the estimated timing of expansion varies among the examined species, suggesting that the biological, historical or analytical factors resulting in the observed pattern may differ among the species. Útdráttur Meginviðfangsefni þessa verkefnis var rannsókn á erfðabreytileika og stofngerð þorsks, Gadus morhua, umhverfis Ísland. Niðurstöður eldri rannsókna á stofngerð þorsks hafa ekki verið samhljóma, en mismunandi greiningaraðferðir hafa sýnt ólíkar niðurstöður. Mikilvægt er að leita skýringa á breytilegum og misvísandi niðurstöðum til þess að skýra betur stofngerð þorsks. Stofngerð þorsks var rannsökuð með greiningu á örtunglabreytileika og einnig kirnabeytileika í hvatberaerfðaefni, en báðar þessar aðferðir hafa mikið verið notaðar við greiningu á stofngerð þorsks. Niðurstöður rannsóknar á örtunglabreytileika sýna erfðafræðilegan mun á milli þorsks frá hafsvæðum norður og suður af Íslandi, líkt og fyrri rannsókn sýndi einnig. Greining sýnir að sá munur er líklega til kominn vegna áhrifa náttúrulegs vals á eitt örtungl (Gmo34). Þegar sá lókus er fjarlægður úr greiningu kemur enginn erfðamunur fram fyrir þorsk frá mismunandi hafsvæðum. Lítill, en marktækur, erfðafræðilegur munur greindist milli þorsks af mismunandi dýpi við suðurströnd Íslands, sem getur bent til þess að þorskur aðgreinist í mismunandi stofna eftir dýpi. Niðurstaða greiningar á kirnabreytileika í hvatberaerfðaefni sýnir engan heildarmun milli hafsvæða umhverfis Ísland. Breytileiki í tíma var greindur með samanburði milli kynslóða fyrir þorsk umhverfis Ísland. Örar breytingar á tíðni setgerða milli kynslóða, sérstaklega fyrir þorsk frá hafsvæðum norðaustur af Íslandi, benda til þess að hvatberaerfðaefni henti vel til að greina mun milli nýlega aðskildra hópa. Því styðja niðurstöður ekki að þorskur umhverfis Ísland skiptist í æxlunarlega aðskilda stofna eftir hafsvæðum eins og sumar eldri rannsóknir hafa bent til. Þvert á móti benda niðurstöður rannsóknarinnar til mikils genaflæðis milli hafsvæða umhverfis Ísland. Í verkefninu var erfðabreytileiki og stofngerð ufsa, Pollachius virens, ýsu Melanogrammus aeglefinus og lýsu Merlangius merlangus í Norður Atlantshafi einnig rannsökuð, með því að meta kirnabreytileika hvatberaerfðaefnis fyrir cytochrome c oxidase subunit I lókusinn. Niðurstöður sýna afar lítinn erfðamun milli heimshluta fyrir þessar tegundir sem bendir til mikils fars (genaflæðis) eða þess að of skammur tími sé liðinn frá aðskilnaði mögulegra hópa til að greinanlegur erfðamunur hafi myndast. Rannsókn á kirnabreytileikanum fyrir ufsa og ýsu bendir til þess að far kunni að vera ólíkt milli kynja. Hængar kunni að fara víðar en að hrygnur haldi frekar kyrru fyrir. Slíku atferli hefur sjaldan verið lýst fyrir sjávarfiska og gefur þessi vísbending tilefni til frekari rannsókna. Greining á kirnabreytileika fyrir allar tegundirnar sem rannsakaðar voru sýna merki um skyndilega stofnstærðaraukningu, sem kemur fram í hárri tíðni sjaldgæfra setgerða og grunnu ættartré. Mat á tímasetningu stofnstærðaraukningar er breytilegt milli tegunda sem bendir til þess að mismunandi líffræðilegar, sögulegar eða aðferðafræðilegar ástæður séu fyrir séðu mynstri. “Maðurinn finnur það sem hann leitar að, og sá sem trúir á draug finnur draug” Halldór Kiljan Laxness (Sjálfstætt fólk) Table of Contents Abstract iii Útdráttur v Table of Contents ix List of Original Papers xi Acknowledgments xiii 1 General introduction 1 1.1 Genetic variation - Origin and dynamics . . . . . . . . . . . . . . . . . . . . . . . 1 1.1.1 1.1.2 1.1.3 1.1.4 1.1.5 1.2 Origin of genetic variation within a population Population genetic structure . . . . . . . . . . Natural selection . . . . . . . . . . . . . . . . . Demography and molecular clock rate . . . . Life history of marine organisms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 4 5 6 Biology and population genetics of the studied fish species . . . . . . . . . . . 7 1.2.1 1.2.2 1.2.3 1.2.4 1.3 . . . . . Atlantic cod, Gadus morhua . . . . . . Haddock, Melanogrammus aeglefinus Saithe, Pollachius virens . . . . . . . . Whiting, Merlangius merlangus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 11 12 13 The present study . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 1.3.1 1.3.2 1.3.3 1.3.4 1.3.5 Microsatellite variation in Atlantic cod, Gadus morhua, around Iceland . . . . . Mitochondrial DNA sequence variation in Atlantic cod, Gadus morhua . . . . . Mitochondrial DNA sequence variation in saithe, Pollachius virens . . . . . . . Mitochondrial DNA sequence variation in haddock, Melanogrammus aeglefinus Mitochondrial DNA sequence variation in whiting, Merlangius merlangus . . . 15 15 15 15 16 References 16 Paper I 27 Paper II 37 Paper III 55 Paper IV 71 Paper V 99 Appendix I. Supplementary tables 109 Appendix II. Supplementary figures 121 Appendix III 125 List of Original Papers List of Original Papers Paper I: Paper II: Paper III: Paper IV: Paper V: Guðni Magnús Eiríksson and Einar Árnason. 2013. Spatial and Temporal Microsatellite Variation in Spawning Atlantic cod, Gadus morhua, around Iceland. Canadian Journal of Fisheries and Aquatic Sciences. 70: 1151–1158. Guðni Magnús Eiríksson and Einar Árnason. High levels of gene flow and temporal genetic variation in Atlantic cod, Gadus morhua in Icelandic waters — inferred by mitochondrial DNA sequence variation. Unpublished manuscript Guðni Magnús Eiríksson and Einar Árnason. 2015. Gene flow across the N-Atlantic and sex-biased dispersal inferred from mtDNA sequence variation in saithe, Pollachius virens. Environmental Biology of Fishes. 98: 67–79. Guðni Magnús Eiríksson and Einar Árnason. Manuscript. Phylogeography of haddock Melanogrammus aeglefinus in the North Atlantic — Postglacial expansion and male-biased dispersal. Unpublished manuscript Guðni Magnús Eiríksson and Einar Árnason. 2014. Mitochondrial DNA Sequence Variation in Whiting Merlangius merlangus in the North East Atlantic. Environmental Biology of Fishes. 97: 103–110. xi Acknowledgments Acknowledgments I would like to thank my supervisor Professor Einar Árnason for his guidance, enthusiasm and stimulating discussions during this work. I thank Professor Jarle Mork for the hospitality and great company during my visits to his lab in Throndheim. Also I would like to thank Professor Kristinn Pétur Magnússon for his support. I would like to thank Professor Snæbjörn Pálsson for useful discussions during the work and the population genetics group at the University of Iceland, in particular Katrín Halldórsdóttir, Ubaldo Benitez Hernandez and Svava Ingimarsdóttir, for great company and various help during this work. I would like to thank Professor Zophonías Oddur Jónsson for various help in the molecular lab. I would also like to thank Professor Guðrún Marteinsdóttir and Jónas Páll Jónasson for useful fishy discussions and Professor Skúli Skúlason for therapeutic lunch meetings. I am thankful to all those that assisted in obtaining samples for the project. They are listed in the respective chapters. In particular I am thankful to the assistance in sampling by the staff of the Icelandic Marine Research Institute. I would like to thank my family and friends for moral support during this work. And finally I would like to thank my wife Gunnhildur Sveinsdóttir for her support and encouragement throughout this project. I am happy to close this chapter together with you and looking forward to the journey ahead. This project was supported by the Icelandic research fund, the Icelandic research fund for graduate students of The Icelandic Centre for Research and the Icelandic Marine Research Institute. xiii 1 General introduction 1.1 Genetic variation - Origin and dynamics 1.1.1 Origin of genetic variation within a population The origin, maintenance and dynamics of genetic variation is a central topic in population genetics and evolutionary biology (Hedrick, 2005; Hartl and Clark, 2006; Futuyma, 2005). Genetic variation of populations is shaped by four basic factors: Mutations, genetic drift, natural selection and gene flow. These factors are affected by effective population size (Ne ), population subdivision, demography of populations and the life history of organisms (Hartl and Clark, 2006). As mutation rate is generally low it can be assumed that it will not alter the frequency of existing alleles in large populations (assumptions of the infinite alleles model and the infinite sites model, Hartl and Clark, 2006). The allelic configuration at a locus within a population is thus more likely to be the result of genetic drift, gene flow and/or natural selection that can potentially change allele frequencies rapidly. A basic assumption in population genetics is that identical alleles are identical by decent. Therefore information about the state carries information about origin of mutation and thus information about the relationship among individuals carrying the same allele (Wright, 1921). The allelic configuration, the relationship among alleles and frequency of genotypes, can be used to make inference about the relationship among groups, natural selection and migration (Hartl and Clark, 2006). In order to discriminate between alternative factors shaping the genetic variation within a population multiple analytical approaches must be used. 1.1.2 Population genetic structure Population breeding structure is reflected in genetic difference at neutral loci. Divergence at a neutral loci between isolated populations occurs due to random genetic drift. A study of historical divergence among populations is thus an exploration of the effects of random genetic drift. The rate of genetic divergence between two isolated populations, due to genetic drift alone, can be shown to be inversely related to the effective population size (Ne ) (Hartl and Clark, 2006). Inbreeding coefficients (F statistics) originally developed by Wright (1922) are important in describing genetic variation and genetic structure. The F statistic reflects the probability that two alleles at a locus in an inbred individual are identical by decent. It corresponds to the correlation of uniting gametes. The principle for F statistics is that the frequency of heterozygotes Ho is expected to be reduced on average by a factor of 1 − 1/(2N) in each generation in a finite diploid 1 1 General introduction population. The probability that two alleles drawn at random from a population are identical is 1/2N. The fixation index F, the inbreeding coefficient, decreases with time until F equals 1, and heterozygosity is 0. However, for mtDNA the probability of two alleles drawn at random from a population is 1/N, the effective population size Ne is smaller (one quarter) than that of nuclear traits in a diploid system. Since mitochondrial genetic markers have lower effective population size Ne compared to nuclear markers (and thus higher rate of genetic drift) the rate at which isolated populations can be detected due to local inbreeding (using FST ) is expected to be faster for mitochondrial genetic markers on average. The F statistic can be estimated at different levels of population comparison. Traditionally F is used as an indicator of the relative part of the overall variation resulting from variation among correlation of gametes due to inbreeding within subpopulation (FIS ), correlation of gametes within subpopulation (FST ) and correlation of gametes within a sample (FIT ) (Hartl and Clark, 2006). The relationship among the different inbreeding coefficients can be expressed as follows: 1 − FIT = (1 − FST )(1 − FIS ). • due to inbreeding within subpopulation, FIS • within subpopulation, FST • in sample, FIT For subdivided populations, genetic difference among subgroups (estimated by FST ) will thus increase with time at a rate inversely related to population size. Thus, it can take a long time for a genetic difference to become established for subdivided populations if population size is large (Hartl and Clark, 2006). Examination of population subdivision has been the subject of many studies, not least for commercially important fish species as such information can be of great importance for responsible management (for an overview see Reiss et al., 2009). Variable molecular markers are used and results are seen to vary from one study to another. It is important to note that observed genetic differences between different environments occupied by an organism may not reflect population breeding structure if the variation is affected by natural selection (Guinand et al., 2004). Genetic difference can become established between subdivided groups of individuals without reproductive isolation or reduced gene flow (Williams et al., 1973; Guinand et al., 2004). For the identification of a breeding structure of a population it is thus important to use genetic markers that are not subject to natural selection, as the latter may reflect environmental heterogeneity rather than the origin and isolation of the groups involved. The analysis of indications of natural selection should thus be an integral part of the analysis of population sub-structuring (see e.g. Excoffier, 2007). It is sometimes suggested that markers under natural selection can be useful for detecting recent divergence, since not enough time has elapsed to allow divergence at neutral markers (e.g. Reiss et al., 2009). It is true that recent divergence may not be detectable with neutral loci, but it is the effects of random genetic drift that reflect the relationship among groups and allow detection of population breeding structure. Genetic markers that are affected by natural selection are useful for understanding biological phenomena of adaptation in a similar manner as are phenotypic traits, such as 2 1.1 Genetic variation - Origin and dynamics growth rate, meristic characters, otolith shape or behaviour. Indeed such characteristics give important insight into the biology of the species, although they can not be used independently to reflect a breeding structure of a population (Williams et al., 1973). Many studies have demonstrated a lack of spatial genetic structure in marine species indicating high levels of gene flow among groups across long distances (Reiss et al., 2009). This has been attributed to lack of physical barriers in the marine environment and high dispersal rate for eggs and larvae with ocean currents (Waples, 1987). Genetic uniformity can also be maintained if population size is large (large Ne ) as it will reduce the rate of genetic drift. A long time may thus be needed in order for genetic differentiation to become established among populations of large size. Genetic variation may reflect different migration patterns between sexes and such patterns are common for vertebrate species (Purcell et al., 1996; Petit and Excoffier, 2009). In the case of females being more philopatric and males more migratorial, such variation may be reflected in the mitochondrial DNA variation. Females would form more geographically distinct populations but males would be homogeneous over larger geographic areas, due to mixing of males of different geographical origin. Difference among the mitochondrial genome and nuclear loci can also indicate sex difference in migration pattern (Purcell et al., 1996). What defines a population? A biological population is defined in different ways in the literature. Hedrick (2005) defines a population as: “...a population is a group of interbreeding individuals that exist together in space and time” (Hedrick, 2005, p.62) Hartl and Clark (2006) discuss the difficulty in defining a population and that it may vary from one organism to another but that it is reflected in a non random distribution of interbreeding individuals within a species. They define a local population as: “...the fundamental units of population genetics. Local populations are actual, evolving units of a species” (Hartl and Clark, 2006, p.46) The definitions above reflect how populations are defined in population genetics and evolutionary biology (Waples and Gaggiotti, 2006). However, there is a fundamental difference in the way ecologists and evolutionary biologists define populations. Under an ecological paradigm the focus is on the co-occurrence of individuals within a species in space and time but under the evolutionary paradigm the emphasis is on the reproductive interactions between individuals. Waples and Gaggiotti (2006) contrast the views about populations in a useful way by summarizing the following: “Ecological paradigm: A group of individuals of the same species that co-occur in space and time and have an opportunity to interact with each other. Evolutionary paradigm: A group of individuals of the same species living in close enough proximity that any member of the group can potentially mate with any other member” (Waples and Gaggiotti, 2006, p.1421) 3 1 General introduction These different views may cause confusion. Under the ecological paradigm a population can be formed over a short period of time without any restrictions in gene flow among populations, a scenario that under the evolutionary paradigm would be considered panmixia (Waples and Gaggiotti, 2006). Many different molecular methods are used for population genetic structure analysis. Markers vary in nature, level of technology involved and cost. When selecting a molecular marker for a study all these factors need to be considered. Mitochondrial DNA (mtDNA) sequence variation has been extensively used in population genetics and phylogenetic studies due to its characteristics. The effective population size Ne is one quarter of that for nuclear genetic markers. Lower Ne of the mtDNA will result in faster rate of genetic drift compared to nuclear markers. This makes mtDNA well suited for detecting populations of recent divergence. On average it would be expected that divergence would be detected earlier using mtDNA variation compared to nuclear genome markers. Further, studies have shown that the most commonly found polymorphism in mtDNA sequences is synonymous, suggesting weak or no selection (e.g. Árnason, 2004). It has been shown to evolve at a faster rate than the coding nuclear genome (Avise, 2000). Comparing various molecular markers currently available direct sequencing of mtDNA has many favorable characteristics. The obvious weakness of using only mtDNA for population genetic structure analysis is that it is a single locus and different loci can have different histories within a population by chance alone (genetic drift is a random process). Ideally a number of neutral loci should be used (e.g. Hartl and Clark, 2006). However, if a high frequency polymorphism is observed in a mtDNA marker, that can be shown to be synonymous, it can be a useful tool for population genetic studies. In the present study mtDNA sequence variation is used for the examination of population genetic structure in four different fish species. 1.1.3 Natural selection Natural selection may favor different alleles in different environments. A difference in allele frequency for a locus under selection may thus reflect the differences in the environment in which it is found rather than the origin of the individuals or of populations. Direct observation of DNA sequence variation can be used to make inference about observed polymorphism, for example by inspection of the frequency of synonymous and nonsynonymous segregating sites. A number of statistical tests are available for detecting natural selection for example by exploring the allelic distribution or level of variability (Nielsen, 2001). An outlier detection method (Beaumont and Nichols, 1996) is an important statistical framework to detect if natural selection has affected some out of a set of loci. Outlier detection method has been used extensively in population genetics. The distribution of FST for a range of population structures, is strongly related to heterozygosity at a locus. A simple genetic model is used to generate an FST null distribution for the detection of outliers. The observed FST values for group comparison for different loci are compared to the null distribution. Values that are found outside the 95% limits of the FST null distribution are considered to be outliers and thus suspect of being under selection (Beaumont and Nichols, 1996). 4 1.1 Genetic variation - Origin and dynamics It is important to note that observed genetic difference in an organism between different environments may not reflect a population breeding structure if the variation is affected by natural selection. As described by Williams et al. (1973) a selection–caused genetic difference can become established between subdivided group of individuals that share the same origin. Williams (1975) suggested that some of the observed genetic variation in cod might have been shaped by contemporary natural selection and as such a reflection of environmental heterogeneity rather than population breeding structure as had been described for American eels, Anguilla rostrata (Williams et al., 1973). It has since become evident that signs of natural selection are common in the cod genome (e.g. Moen et al., 2008; Nielsen et al., 2009; Hemmer-Hansen et al., 2013, 2014). Moen et al. (2008) found signals of natural selection in about 9% of SNP loci analysed in Atlantic cod. Number of recent sudies have revealed adaptive evolution in Atlantic cod and genetically different groups among geographical areas (e.g. Bradbury et al., 2013; Therkildsen et al., 2013; Hemmer-Hansen et al., 2014; Berg et al., 2015). However, neutral loci have not shown the same kind of population genetic structure (e.g. Hemmer-Hansen et al., 2014). In order to describe the historical population genetic structure for an organism it is important to use neutral molecular markers. Selective neutrality can be statistically tested based on allelic distributions or DNA sequence variation by comparison variation of synonymous and nonsynonymous sites (Nielsen, 2001). However, natural selection can have similar effects to that of demographic events, such as a population bottleneck, on genetic variation. It may be difficult to discriminate between alternative explanations (e.g. Hartl and Clark, 2006; Excoffier, 2007). 1.1.4 Demography and molecular clock rate Genetic variation can be used to make inference about demographic history of a population. Nucleotide mismatch analysis can be used to examine haplotype tree structures and make predictions of demographic history. Deep genealogies will produce bi- or multimodal mismatch distributions but shallow genealogies will be unimodal (Rogers and Harpending, 1992). More recent methods, for example the skyline method introduced by Pybus et al. (2000), have improved our ability to reconstruct past demographic history embedded in the genealogical relationship among individuals of a population (Emerson et al., 2001; Drummond et al., 2005). Whatever method used, the timing of demographic events, such as a population bottleneck, critically relies on the molecular clock rate. The Panamanian Isthmus provides an important system where a known geological event closed the connection between the Pacific Ocean and the Caribbean Ocean roughly 3 million years ago. DNA sequence comparisons between between closely related species of fish (Bermingham et al., 1997) and shallow water shrimp (Knowlton and Weigt, 1998) has provided an estimate of mtDNA coding region substitution rate of 1.4–2%/MY. In this context it is important to note that according to population genetics theory the substitution rate is not related to effective population size, Ne . This is due to the fact that the probability of mutation to occur in a population (for a diploid gene in an ideal population) is: µ, and the frequency of a new allele just after mutation is thus: µ × 2Ne . The probability of fixation of a particular mutation, if neutral, is: 1/2Ne (at the 5 1 General introduction time when the mutation emerges). Thus, the substitution rate is: µ̂ × 2Ne × 1/2Ne = µ̂ (Kimura, 1984). Thus, since substitution rate is not related to effective population size, Ne , a known substitution rate in one system/population can be used to make inference about timing of events in another system/population, although the Ne between the two systems/populations may be different. However, the substitution rate varies among genes and some variation has been found among taxa and thus a universal molecular clock may be an illusive idea (Avise, 2004). Recent studies have indicated a time dependency of molecular clock rates (Ho et al., 2005; Burridge et al., 2008, e.g.) although this is debatable (see Emerson et al., 2001; Ho et al., 2007). A differentiation is made between a ’pedigree rate’ and a ’phylogenetic rate’ (e.g. Burridge et al., 2008; Rocha et al., 2005). The possible mechanism for time dependency of molecular clock rates is not fully understood but purifying selection may be responsible (Burridge et al., 2008). Positive selection has been suggested as common for mtDNA (Bazin et al., 2006) and perhaps such selection results in similar patterns if selection is weak and of long duration. Further, in a recent study it is pointed out how the use molecular clock rate calibration with phylogenetic divergence can result in an overestimation of timing of population events (Grant, 2015). Genetic variation and genealogy of high latitude organisms have been strongly influenced by climatic oscillations during the Pliocene and Pleistocene (Bernatchez and Dodson, 1991; Hewitt, 1996; Avise and Walker, 1998; Wares and Cunningham, 2001; Pálsson et al., 2009). The climatic oscillations have caused population bottlenecks for many species followed by a period of expansion. Genetic variation is lost through extinction of lineages during periods of bottlenecks generating shallow genealogy and reducing the effective population sizes, Ne . During period of population expansion, however, the probability of lineage extinction is reduced and genetic variation increases in a population (Avise, 2000). The expected pattern of genetic variation as a result of repeated demographic changes is a shallow genealogy with many rare types. This is indeed the observed pattern in many marine fish species (e.g. Sigurgíslason and Árnason, 2003; Árnason, 2004; Pálsson et al., 2009; Liu et al., 2010; Eiríksson and Árnason, 2014, 2015). 1.1.5 Life history of marine organisms Life history of organisms can be important in shaping genetic variation. High fecundity and high mortality at early stages (type III survivorship curve) is characteristic of many marine fish species. This can lead to high variation in offspring number (a sweepstake recruitment) and reduction in the Ne /N ratio and to a shallow gene genealogy (Hedgecock, 1994; Árnason, 2004). Such patterns may also be generated by demographic history, such as population bottlenecks or natural selection as mentioned earlier. In order to discriminate among alternative factors that may have shaped genetic variation a comparison can be made among related species. The species may have variable life-histories, vary in geographic distribution and demographic history that can be useful for evaluating the most likely factors shaping the genetic variation. 6 1.2 Biology and population genetics of the studied fish species 1.2 Biology and population genetics of the studied fish species In the present thesis I will report research on genetic variation in four species of the family Gadidea: Atlantic cod (Gadus morhua; Linnaeus, 1758), haddock (Melanogrammus aeglefinus; Linnaeus, 1758), saithe (Pollachius virens; Linnaeus, 1758) and whiting (Merlangius merlangus; Linnaeus, 1758). 1.2.1 Atlantic cod, Gadus morhua Fisheries, reproduction and phenotypic variation. Atlantic cod is among the most valuable commercial fish species in the North Atlantic and has been very important for the fisheries around Iceland. It has a wide distribution and is found in a variety of habitats, from shallow waters close to shore to deeper waters off the continental shelf (Jónsson, 1992). The main spawning grounds for Atlantic cod around Iceland are along the south coast (Jónsson, 1992). Embryos and fish larvae are known to drift with water currents from the main spawning grounds to the West, North and East Iceland. In some years embryos and larvae may drift to the East coast of Greenland (Astthorsson et al., 1994; Jónsson and Valdimarsson, 2005). The extent of embryo and larval drift depends on the strength of the warm North Atlantic water current (Jónsson and Valdimarsson, 2005). In addition to the main spawning grounds cod is known to spawn in different areas all around the country with spatial and temporal variation in the contribution of the various spawning grounds to each cohort (Marteinsdottir et al., 2000; Begg and Marteinsdottir, 2000; Jónsdóttir et al., 2007). Thus, in different locations around Iceland each cohort may be of mixed origin, to a variable extent (e.g. Jónsdóttir et al., 2007; Thorisson et al., 2011), depending on the strength of the North Atlantic water current and spatial and temporal variation in spawning success. Although the fish may be of mixed origin variable phenotypic differences have been observed among cod from different locations. Differences in length at age, reflecting differences in growth rate, between the north and south coast of Iceland are well known (Jónsson, 1992). Growth related morphological variation, such as otolith shape, has been described for cod around Iceland and has been used to argue for separate populations of cod (e.g. Pétursdóttir et al., 2006; Jónsdóttir et al., 2006). A study on the Norwegian coastal cod and the Northeast Arctic cod (that have been considered as two populations) indicated that an observed difference in growth and age at maturation for coastal and Northeast Arctic cod reflect differences in the environment rather than genetic variations (Godø and Moksness, 1987). In addition behavioral variation has been observed in Atlantic cod. Different behavioral types, identified as frontal and coastal behavioural types, have been described (Pálsson and Thorsteinsson, 2003; Thorsteinsson et al., 2012). This difference in behavior has been linked to genetic variation at the Pan I locus (Pampoulie et al., 2008). However, behavioral traits, like any other phenotypic trait, can be environmentally induced. In fact, experiments have shown that the development of cod behavior can be affected by environmental factors acting at an early age (Braithwaite and Salvanes, 7 1 General introduction 2005; Salvanes et al., 2007). The observed behavioral variation can thus be environmentally induced. A group of fish, that may share the same place of origin, may develop variable behaviours if exposed to different environments (as has been reported for other phenotypic traits, e.g. Gupta and Lewontin, 1982). As a model phenotype (P) of an organism is the result of its genotype (G) and the environment (E) in which the genotype develops in and an interaction between the two (IGE ) and can be expressed as: P = G + E + IGE . Thus, phenotypic variation is not always an indicator of genetic variation, but may reflect variable environmental conditions. This was recognized by Tåning (1944), that is that morphological variation would not be a good indicator of population breeding structure in fish. The phenotypic variation observed in cod may thus be environmentally introduced, that is cod of different origin may develop similar phenotypic traits if exposed to the same or similar environmental conditions. Environmental factors such as temperature or food availability or interaction of environmental factors may be important in this respect. Phenotypically distinct groups, as observed in Atlantic cod, can be defined as ecological populations although they may not reflect population breeding structure (Waples and Gaggiotti, 2006). Thus, such groups would not be considered different populations under population genetic or evolutionary paradigm (Waples and Gaggiotti, 2006). Migration pattern around Iceland. Intensive tagging experiments were carried out around Iceland by Jónsson (1996) in the period 1948–1985. The data show various trends among different localities and the results have been interpreted in different ways by scientists. Jónsson (1996) own interpretation was that cod would show clear signs of migration from the North to the main spawning ground in the South of Iceland, where most of the spawning took place. He supported this by data on movements of tagged fish and also by data on age and length. He described an example when in 1949 there was a dramatic reduction in length of 7 year old fish from approximately 85cm to 75cm in between early and late March. His interpretation was that this was a clear support for the hypothesis that the fish from the North had entered the spawning grounds, since the fish from the North have a lower growth rate and are thus smaller at age than are cod from the South. The same data have been reanalysed and said to reflect local populations, with few migratory fish (e.g. Jónsdóttir et al., 2006). There is a problem associated with the use of the results from a tagging experiments. First, there is always an autocorrelation in tagging experiment data. That is, at the time that a tagged fish is released the probability of catching it far from the location of release is negligible since the fish requires time to travel. Data have shown that as time elapses from the release of the tagged fish the more likely it is for it to be found further away from the location of tagging (Jónsson, 1996). Many recent tagging experiment cover a limited time span and can thus be affected by this, (Þorsteinsson and Marteinsdóttir, 1993; Sæmundsson, 2005; Pampoulie et al., 2006). Second, fish at different life-history stages may vary in the migratory pattern, e.g. young immature fish may not migrate but larger mature fish is more likely to migrate (Sæmundsson, 2005; Joensen et al., 2005; Schopka et al., 2006). If a group of tagged fish belong to a given size/age class then the experiment may be limited to reflecting the behaviour of this particular size/age class, but other size classes may show different pattern. Information about the size/age of the tagged fish is sometimes missing, making it hard to evaluate the results (e.g. Pampoulie 8 1.2 Biology and population genetics of the studied fish species et al., 2006). The tagging experiments show to some extent local groups of cod, although when correcting for autocorrelation the fidelity to location is likely to be reduced. Thus, at present it is not clear if (or to what extent) the Atlantic cod around Iceland is subdivided into local groups. Population genetic studies. Considerable research effort has been directed to the analysis of cod population genetic structure for decades, in order to resolve if the observed phenotypic variation may correlate with genetic variation. No commercial fish species has been the subject to as many population genetic studies (for review see Reiss et al., 2009). However, research results have to some extent led to conflicting views. Different molecular genetic markers have shown different patterns, some markers showing great population genetic structure but others little or none (Reiss et al., 2009). It is important to understand what may cause differences among the different markers in order to better understand the population genetic structure of Atlantic cod. Of particular concern is the importance of natural selection in shaping of genetic variation in Atlantic cod. Recent studies have shown that footprints of natural selection are common in the cod genome (Moen et al., 2008) and fine scale population breeding structure may in some cases be inflated due to selection (Nielsen et al., 2006; Eiríksson and Árnason, 2013). This may cause environmental heterogeneity to be reflected in the genetic variation, rather than restrictions in gene flow (Williams et al., 1973). Early studies used hemoglobin polymorphism in order to identify cod populations (e.g. Sick, 1961, 1965). Jamieson and Jónsson (1971) used hemoglobin polymorphism among cod at Iceland and Greenland and in their paper they describe their findings as a "moving mosaic of genetic isolates". Earlier Jamieson and Jones (1967) claimed, based on hemoglobin polymorphism, that there are two races or subspecies of cod around the Faroe Islands. For cod in Norway a gradient in hemoglobin variants as well as serum transferrin variation was observed along the Norwegian coast (Frydenberg et al., 1965; Moller, 1968). This polymorphism lead scientists to suggest that population subdivision of Atlantic cod might explain this pattern (e.g. Sick, 1961, 1965; Frydenberg et al., 1965; Moller, 1968; Jamieson and Jónsson, 1971). However, Williams (1975) pointed out that this pattern could be the result of natural selection, and thus that the observed hemoglobin spatial variation might reflect environmental heterogeneity rather that population substructuring. Karpov and Nivkov (1980) demonstrated the the different hemoglobin forms differe in their capacity to combine oxygen at different temperatures. Further, Petersen and Steffensen (2003) showed differences in temperature preference of different hemoglobin genotypes. The orientation and function of the different hemoglobin genotypes has been addressed in recent studies (Andersen et al., 2008; Halldórsdóttir and Árnason, 2009). When new methods became available allozyme variation studies were to a large extent abandoned as it only revealed but part of the overall genetic variation and concerns had been raised about the effects of natural selection. The use of direct mtDNA sequencing has been important in revealing the population genetic structure in Atlantic cod. The observed genetic variation at mtDNA is mostly at synonymous sites, suggesting weak or no selection (e.g. Árnason, 2004). Thus, recently established population structure should become detectable earlier for mtDNA markers 9 1 General introduction than for nuclear markers, on average. However, the mtDNA genome is a single locus and different loci may reflect different history within a population (Hartl and Clark, 2006). Although an obvious weakness of using mtDNA sequence variation is that it is a single locus, studies have reported important findings. Population genetic studies in Atlantic cod, revealed using mtDNA sequence variation, have indicated limited fine scale population genetic structure and high levels of gene flow among geographic regions (Carr and Marshall, 1991; Árnason et al., 1992; Carr et al., 1995; Árnason and Pálsson, 1996; Árnason et al., 1998, 2000). Summarizing results from many studies of cytochrome b sequence variation across the North Atlantic Árnason (2004) showed no differences among localities within countries, but a trans Atlantic cline in the common haplotype frequencies. The regularity of the cline indicated considerable gene flow among different regions. However, spatial genetic structure has been repeatedly reported for Atlantic cod around Iceland. Jónsdóttir et al. (1999) using Pan I and hemoglobin loci among cod at the south coast of Iceland, found a pattern which they interpreted as a small scale population genetic structure. More recent studies have shown that variation at the Pan I locus and hemoglobin is affected by natural selection (Karlsson and Mork, 2003; Pogson and Mesa, 2004; Árnason et al., 2009). Pampoulie et al. (2006) described population genetic structure in cod separating cod from NE- from SW-Iceland using both the Pan I locus and microsatellite markers. The population differentiation was small when based on the presumed neutral microsatellite markers (FST = 0.003). Such subtle genetic difference may, however, not be biologically meaningful (Waples, 1998). Furthermore, a thorough analysis of temporal stability and natural selection is needed in order to asses the nature of the observed variation. A number of studies have been carried out using single nucleotide polymorphism (SNP) in Atlantic cod (Moen et al., 2008, 2009; Nielsen et al., 2009; Hubert et al., 2010; Bradbury et al., 2011). Nielsen et al. (2009) used 98 SNP for the analysis of natural selection and found clear indications of natural selection acting on number of loci. In their study Moen et al. (2008) found signals of natural selection in ca. 9% of SNP loci analysed. In recent study Bradbury et al. (2011) show the application of SNP markers in population genetic studies in Atlantic cod. They suggested the use of large scale SNP panels in assignment studies for high gene flow marine species as cod. However, their analysis showed that when using a small number of loci ascertainment bias can become a problem (Bradbury et al., 2011). SNP analysis has the potential to improve our understanding of the Atlantic cod population genetic structure. However, research findings need to be interpreted carefully because of the large number of loci whose variation has been suggested to be shaped by natural selection (Moen et al., 2008; Nielsen et al., 2009) and the concern due to ascertainment bias (Bradbury et al., 2011). Future studies. Recently the sequencing of the whole Atlantic cod genome was completed (Star et al., 2011). With rapid development of sequencing technology and the reduced sequencing cost future studies will be able to solve many of the challenges faced by earlier studies. One of the exciting possibility that becomes available with genome wide information for number of individuals is to examine, in more details than previously possible, the footprints of natural selection in the cod genome. Many 10 1.2 Biology and population genetics of the studied fish species studies have suggested that natural selection is important in shaping the observed genetic variation in cod (Williams, 1975; Pogson, 2001; Nielsen et al., 2006; Moen et al., 2008, e.g). Genome wide sequence information will make it possible to reevaluate the data available on population genetic structure of Atlantic cod collected by hundreds of scientists for decades. The major challenge will remain to describe the functional importance of the observed genetic variation and possibly to identify the environmental factors that are likely to affect the variation, that is the agents of natural selection. This is a complex task but if successful it will revolutionize out understanding of the biology of Atlantic cod. 1.2.2 Haddock, Melanogrammus aeglefinus Haddock, Melanogrammus aeglefinus, is a demersal fish species most commonly found at 80–200m depth at variable bottom substrates. Haddock is an omnivore, feeding mainly on small benthic organisms. The age at first maturation varies among geographic locations and is 3–5 years in the North-West Atlantic, but 2–3 years in the warmer waters of the North Sea (Cohen et al., 1990; Jónsson, 1992). Fecundity varies greatly with females size, larger females having higher fecundity (50,000–2,000,000 eggs per female Jónsson, 1992). The eggs are small (1.2–1.7 mm in diameter). They are buoyant and eggs and larvae are pelagic for about three months (Hislop, 1984). During this period eggs and larvae can drift with water currents long distances as has been estimated for Atlantic cod (e.g. Brickman et al., 2006). Timing of spawning varies greatly among geographic regions (January to July). Haddock is a commercially important fish species, the major fishing grounds being in the North-East Atlantic (Cohen et al., 1990; Jónsson, 1992) In recent years some changes in fish communities have been observed in the North Atlantic ocean, both in migratory patterns of some species and in changes in distributions of others. These changes are most likely linked to environmental changes associated with increased water temperature (Rose, 2005). For the last two decades the distribution of haddock around Iceland has changed and haddock is now commonly found at the North coast (Björnsson et al., 2007). In recent years (2004–2007) the haddock captures increased in the Icelandic waters reflecting the increase in population size, followed by reduction in the population size (Anonymous, 2011). Such demographic changes may be reflected in genetic variation of populations. The environmental conditions throughout the distribution range of haddock is variable and different populations of haddock may occupy different habitats. An environmental heterogeneity is found at the Faroe Islands where the Faroe Plateau and the Faroe Bank, separated by 850m deep channel, differ in temperature and in abundance of food for haddock. It has been shown that fish growth rate differs due to differences in temperature and food abundance between the two ecosystems (Magnussen, 2007). A number of studies have been carried out, using different molecular markers, describing genetic variation and population genetic structure of haddock. Their findings are to some extent conflicting. A study using transferrin polymorphism for the analysis of the genetic variation in haddock indicated fine scale population sub structuring within the North-East Atlantic (Jamieson and Birley, 1989). In another study of allozyme variation in haddock (using 8 loci) limited differences were revealed among waters of 11 1 General introduction different countries (Norway, Iceland, Faroe Islands, North Sea) (Giæver and Forthun, 1999). Lage et al. (2001) found temporal variation in allele frequencies in a microsatellite study for haddock and an indication of fine scale population genetic structure in the North-West Atlantic. However, a limited population sub-structuring was indicated by mtDNA RFLP analysis for the same geographic region in another study (Zwanenburg et al., 1992). In Lage et al. (2001) study the variation in FST among the different loci is not reported and the effects of natural selection on shaping the observed pattern was not considered. It is possible that natural selection and environmental heterogeneity may explain the observed pattern (Williams et al., 1973; Guinand et al., 2004; Árnason et al., 2009; Bradbury et al., 2011). In fact one of the microsatellites used in their study (Gmo-132) has been shown to have an outlier FST value in spatial comparison for Atlantic cod, and is suspected to be under natural selection (e.g. Nielsen et al., 2006). In the present study mtDNA genetic sequence variation in haddock will be examined exploring its population genetic structure and the potential effects of natural selection will be tested. A mtDNA study indicates that gene flow among geographic region may be responsible for maintenance of high genetic variation in haddock at the time of collapse in the fisheries at Georges Bank (Purcell et al., 1996). 1.2.3 Saithe, Pollachius virens Saithe, Pollachius virens, is a demersal fish species distributed on both sides of the North Atlantic. It is distributed from Barents sea in the North to the Bay of Biscay in the South, along the coast of Canada in the West and southwards to the coast of North Carolina. It is common around Iceland but rare around Greenland. (Jónsson, 1992). Spawning time varies across the Atlantic ocean. NE-Atlantic saithe spawn in February-March but NW-Atlantic saithe are reported to have a wider spawning season, from October to March (Jónsson, 1992). Eggs and larvae are pelagic and can thus be carried with ocean currents (Brickman et al., 2006). Saithe is known to migrate long distances in search of food but spawning takes place in shallow waters (Jónsson, 1992). Saithe is an important commercial species in many European countries. For fisheries management purposes saithe is divided into several units, e.g. Canada, Northeast Arctic, North Sea and West of Scotland, Iceland and Faroe Islands. However, tagging experiments have shown considerable migration among the different units (Jakobsen and Olsen, 1987; Neilson et al., 2006), making the division questionable biologically. In a long term tagging experiment Jakobsen and Olsen (1987) reported substantial saithe migration from northern Norway to Iceland, frequently to the extent to be characterized as mass movements. This was evident both through the recapture of tagged fish, but also through a change in size at age among saithe in Icelandic waters during a given period indicating immigration of fish that had developed in a different environment, at a different growth rate. Large scale migration from Norway to Faroe Islands was also reported in the same study. In a tagging study carried out on saithe around Iceland, only a limited number of tagged saithe was found to have emigrated from the Icelandic waters, indicating no large scale emigration (Jones and Jonsson, 1971). In a more recent study Armannsson et al. (2007) found very clear inshore/offshore movements during the summer and variation among saithe in migratory routes among tagging areas. This is seen as a possible sign 12 1.2 Biology and population genetics of the studied fish species of more complex population structure around Iceland than previously thought. Further more they found no evidence of large scale emigration of saithe from Icelandic waters. In a tagging–recapture study Neilson et al. (2006) suggested that saithe of the coast of Canada forms three separate populations. They further suggested that this needs to be taken into account for management of saithe fisheries. This is based on phenotypic observations, such as variation in growth rate and behaviour. The different findings among the different studies suggest that migratory behaviour may be associated with some life history stage, that is adult fish may be more likely to migrate as has been reported for Atlantic cod Gadus morhua (Schopka et al., 2006) or that migratory behaviour may vary among different saithe groups. Limited work has been carried out to examine population genetic structure of saithe (see Forthun and Mork, 1997; Reiss et al., 2009; Coucheron et al., 2011). As described earlier saithe has high potential for egg and larval dispersal and is known for long distance migrations of adult fish. This is likely to contribute to gene flow in saithe among different geographic localities. A recent microsatellite study on closely related species, the pollack (Pollachius pollachius), along the European coast revealed very limited genetic variation among geographic localities (Charrier et al., 2006). Given this potential for gene flow, the probability of genetic divergence among saithe groups from different waters must be low. However, there are no reported cases of migrations between NW-Atlantic and the NE-Atlantic, and this might be reflected in the population genetic structure of the species. 1.2.4 Whiting, Merlangius merlangus Whiting Merlangius merlangus is a commercially important fish species in the North Atlantic especially in the North Sea. It is distributed from northern Norway and Iceland in the North to the Bay of Biscay in the South. It is also found in the Mediterranean and the Black sea. Whiting is a shallow water species mostly found at 10–200m depth (Jónsson, 1992). It has a longer pelagic phase early in life compared to related gadoids such as Atlantic cod Gadus morhua, haddock Melanogrammus aeglefinus and saithe Pollachius virens. Potentially this results in a higher dispersal rate (Hislop, 1984). Age at maturation is variable among geographic localities, however, in general maturation is reached at 2–4 years overall, earlier in the southern part of its distribution (Hislop, 1984; Jónsson, 1992; Cohen et al., 1990) Due to its commercial importance many studies have focused on whiting biology in the North East Atlantic (Cohen et al., 1990). Tag and recapture studies and levels of parasite infestation in different localities have been interpreted as reflecting limited migrations among localities (Hislop and MacKenzie, 1976). Thus two distinct populations have been assumed to exist north and south of Dogger Bank in the North Sea (Hislop and MacKenzie, 1976). Pilcher et al. (1989) described spatial and seasonal variation in parasite prevalence and infestation intensity within the North Sea. A latitudinal cline was observed for infestation of the monogenenan parasite Diclidophora merlangi. Infestation prevalence and intensities increased steadily from South to North. Charrier et al. (2007) using seven microsatellite marker for whiting in the North East Atlantic showed limited genetic differentiation among sampling localities indicating high level of gene flow over long geographic distances. However a significant genetic 13 1 General introduction difference was observed among sampling stations within the North Sea, albeit with low FST values. Similar results were obtained in an earlier microsatellite study based on three loci where two loci show significant difference between sampling locations within the North Sea (Rico et al., 1997). 1.3 The present study The main objective of the present study is to examine the population genetic structure of Atlantic cod around Iceland and to test if distinct historically stable populations can be identified, as has been suggested by some earlier studies (e.g. Jónsdóttir et al., 1999). In particular population genetic structure described by Pampoulie et al. (2006), showing small but significant difference between Atlantic cod found in waters NW- and SEIceland, will be examined and tested if their findings can be confirmed. Pampoulie et al. (2006) study is refereed to as providing an evidence of distinct geographic components in Atlantic cod around Iceland (e.g. Pampoulie et al., 2014; Grabowski et al., 2015; Sólmundsson et al., 2015). Thus, one of the objectives of the preset study is to examine the observed pattern, with regards to temporal genetic variation and natural selection. Although the present project is focusing on Atlantic cod around Iceland the findings are relevant to the study of population genetic structure in cod in general and may thus have wider implications. Another objective of the present project is to compared genetic variation and population genetic structure observed in other gadoid fish species to that of Atlantic cod. Comparison among related species may if different biological factors are reflected in the genetic variation. Thus, in the present project genetic variation in Atlantic cod, Gadus morhua, saithe, Pollachius virens, haddock, Melanogrammus aeglefinus and whiting Merlangius merlangus will be examined. In the first paper the microsatellite variation was examined in Atlantic cod. In the other four papers mitochondrial sequence variation was used to examine population genetic structure in the different gadoid species. For an overview of the mithochondrial DNA fragment size and sample size used for the different species see table 1.1. Table 1.1. Mithochondrial DNA fragment size and sample size (N) in the different papers of the present study. cyt b:cytochrome b; COI: cytochrome oxidase c subunit I Common name Atlantic cod Saithe Haddock Whiting 14 Species Gadus morhua Pollachius virens Melanogrammus aeglefinus Merlangius merlangus N 2656 1163 884 138 Locus cyt b COI COI COI bp 328 460 599 621 paper II III IV V 1.3 The present study 1.3.1 Spatial and temporal microsatellite variation in spawning Atlantic cod, Gadus morhua, around Iceland In this study 10 microsatellite loci were used for spatial and temporal genetic analysis in spawning Atlantic cod around Iceland and compared to earlier findings (Árnason et al., 2000; Pampoulie et al., 2006). The markers used included both markers that are likely to be selectively neutral and some that have shown signs of natural selection from outlier behaviour (Nielsen et al., 2006). The loci used here have all been extensively used for cod (Reiss et al., 2009). It was tested if natural selection was likely to have shaped the genetic variation at analyzed loci and the sensitivity of the methods were evaluated. 1.3.2 High levels of gene flow and temporal genetic variation in Atlantic cod, Gadus morhua in Icelandic waters — inferred by mitochondrial DNA sequence variation In this study mtDNA sequence variation was used for the analysis of population genetic structure of spawning Atlantic cod around Iceland. Comparison was made among different regions around the country and a comparison made between NE- and SWIceland, were population differentiation has been suggested (Pampoulie et al., 2006; Jónsdóttir et al., 2006). Temporal genetic variation was evaluated by comparison between sampling years and in particular by comparison among different cohorts. 1.3.3 Gene flow across the N-Atlantic and sex-biased dispersal inferred from mtDNA sequence variation in saithe, Pollachius virens In this study population mtDNA sequence variation was used to examine genetic structure in saithe in the North-Atlantic ocean. Limited work has been done on the population genetics of saithe (Reiss et al., 2009). A comparison is be made between sexes and for saithe among and within geographic localities (Canada, Iceland, Faroe Islands and Norway). 1.3.4 Phylogeography of haddock Melanogrammus aeglefinus in the North Atlantic — Postglacial expansion and male-biased dispersal This study was based on the analysis of sequence variation in a 599 base pair fragment of the cytochrome oxidase c subunit I (COI) mitochondrial gene for haddock sampled in the North Atlantic. The sequence variation was used to assess relationship in haddock within and among different geographic regions. Comparison was made to earlier studies on haddock and comparison made to what had been described for Atlantic cod, Gadus morhua, and related gadoid fish species (Árnason, 2004; Pálsson et al., 2009, and Eiriksson, unpubl data). 15 1 General introduction 1.3.5 Mitochondrial DNA sequence variation in whiting Merlangius merlangus in the North East Atlantic In this study genetic variation at a 621 base pair fragment of the cytochrome c oxidase subunit I (COI) mitochondrial gene in whiting from South Iceland, Northern Norway and the North Sea was examined. Demographic factors and spatial genetic variation were examined. The findings were compared to observed genetic variation in related species (Árnason, 2004; Pálsson et al., 2009; Liu et al., 2010) 16 References References Andersen, O., Wetten, O. F., Rosa, M. C. D., Andre, C., Alinovi, C. C., Colafranceschi, M., Brix, O., and Colosimo, A. (2008). Haemoglobin polymorphisms affect the oxygen-binding properties in Atlantic cod populations. Proceedings of the Royal Society B, 276(1658):833–841. Anonymous (2011). State of marine stocks in Icelandic waters 2010/2011. Prospects for the quota year 2010/2011 (in Icelandic with English abstract). Hafrannsóknastofnunin, Fjölrit 159:1–180. Armannsson, H., Jonsson, S. T., Neilson, J. D., and Marteinsdottir, G. (2007). Distribution and migration of saithe (Pollachius virens) around Iceland inferred from mark-recapture studies. ICES Journal of Marine Science, 64:1006–1016. Árnason, E. (2004). Mitochondrial cytochrome b DNA variation in the highfecundity Atlantic cod: Trans-Atlantic clines and shallow gene genealogy. Genetics, 166(4):1871–1885. Árnason, E., Hernandez, U. B., and Kristinsson, K. (2009). Intense habitat-specific fisheries-induced selection at the molecular Pan I locus predicts imminent collapse of a major cod fishery. PLoS ONE, 4(5):e5529. Árnason, E., Petersen, P. H., Kristinsson, K., Sigurgíslason, H., and Pálsson, S. (2000). Mitochondrial cytochrome b DNA sequence variation of Atlantic cod from Iceland and Greenland. Journal of Fish Biology, 56(2):409–430. Árnason, E., Petersen, P. H., and Pálsson, S. (1998). Mitochondrial cytochrome b DNA sequence variation of Atlantic cod, Gadus morhua, from the Baltic and the White seas. Hereditas, 129(1):37–43. Árnason, E. and Pálsson, S. (1996). Mitochondrial cytochrome b DNA sequence variation of Atlantic cod Gadus morhua, from Norway. Molcular Ecology, 5(6):715– 724. Árnason, E., Pálsson, S., and Arason, A. (1992). Gene flow and lack of population differentiation in Atlantic cod, Gadus morhua L., from Iceland, and comparison of cod from Norway and Newfoundland. Journal of Fish Biology, 40(4):751–770. Astthorsson, O., Gislason, A., and Gudmundsdottir, A. (1994). Distribution, abundance, and length of pelagic juvenile cod in Icelandic waters in relation to environmental conditions. ICES Marine Science Symposia, 198:529–541. Avise, J. C. (2000). Phylogeography: The history and formation of species. Harvard University Press, Cambridge, Massachusetts. Avise, J. C. (2004). Molecular Markers, Natural History, and Evolution. Sinauer Associates, Inc., second edition. Avise, J. C. and Walker, D. (1998). Pleistocene phylogeographic effects on avian populations and the speciation process. Proceedings of the Royal Society B, 265(1395):457– 17 References 463. Bazin, E., Glémin, S., and Galtier, N. (2006). Population size does not influence mitochondrial genetic diversity in animals. Science, 312(5773):570–572. Beaumont, M. and Nichols, R. (1996). Evaluating loci for use in the genetic analysis of population structure. Proceedings of the Royal Society of London B, 263(1377):1619– 1626. Begg, G. A. and Marteinsdottir, G. (2000). Spawning origins of pelagic juvenile cod Gadus morhua inferred from spatially explicit age distributions: potential influences on year-class strength and recruitment. Marine Ecology Progress Series, 202:193– 217. Berg, P. R., Jentoft, S., Star, B., Ring, K. H., Knutsen, H., Lien, S., Jakobsen, K. S., and André, C. (2015). Adaptation to low salinity promotes genomic divergence in atlantic cod (gadus morhua l.). Genome Biology and Evolution. Bermingham, E., McCafferty, S. S., and Martin, A. P. (1997). Fish biogeography and molecular clocks: Perspectives from the Panamanian isthmus. In Kocher, T. D. and Stepien, C. A., editors, Molecular systematics of Fishes, pages 113–128. San Diego: Academic Press. Bernatchez, L. and Dodson, J. J. (1991). Phylogeographic structure in mitochondrial DNA of lake whitefish (Coregonus clupeaformis) and its relation to Pleistocene glaciation. Evolution, 45(4):1016–1035. Björnsson, H., Sólmundsson, J. S., Kristinsson, K., Steinarsson, B. Æ., Hjörleifsson, E., Jónsson, E., Pálsson, J., Pálsson, Ó. K., Bogason, V., and Þorsteinn Sigurðsson (2007). Stofnmælingar botnfiska á Íslandsmiðum (SMB) 1985-2006 og stofnmæling botnfiska að haustlagi (SMH) 1996-2006. undirbúningur, framkæmd og helstu niðurstöður (in Icelandic with English abstract). Fjölrit nr. 131, Hafrannsóknastofnunin, Reykjavík. Bradbury, I. R., Hubert, S., Higgins, B., Bowman, S., Borza, T., Paterson, I. G., Snelgrove, P. V. R., Morris, C. J., Gregory, R. S., Hardie, D., Hutchings, J. A., Ruzzante, D. E., Taggart, C. T., and Bentzen, P. (2013). Genomic islands of divergence and their consequences for the resolution of spatial structure in an exploited marine fish. Evolutionary Applications, 6:450–461. Bradbury, I. R., Hubert, S., Higgins, B., Bowman, S., Paterson, I. G., Snelgrove, P. V. R., Morris, C. J., Gregory, R. S., Hardie, D. C., Borza, T., and Bentzen, P. (2011). Evaluating SNP ascertainment bias and its impact on population assignment in Atlantic cod, Gadus morhua. Molecular Ecology Resources, 11 Suppl 1:218–225. Braithwaite, V. A. and Salvanes, A. G. V. (2005). Environmental variability in the early rearing environment generates behaviourally flexible cod: implications for rehabilitating wild populations. Proceedings of the Royal Society B, 272(1568):1107– 1113. Brickman, D., Marteinsdottir, G., Logemann, K., and Harms, I. H. (2006). Drift probabilities for Icelandic cod larvae. ICES Journal of Marine Science, 64(1):1–11. Brooker, A. L., Cook, D., Bentzen, P., Wright, J. M., , and Doyle, R. W. (1994). Organization of microsatellites differs between mammals and cold-water teleost fishes. Canadian Journal of Fisheries and Aquatic Sciences, 51:1959–1966. Burridge, C. P., Craw, D., Fletcher, D., and Waters, J. M. (2008). Geological dates and molecular rates: fish DNA sheds light on time dependency. Molecular Biology and Evolution, 25(4):624–633. 18 References Carr, S. M. and Marshall, H. D. (1991). Detection of intraspecific DNA sequence variation in the mitochondrial cytochrome b gene of Atlantic cod (Gadus morhua) by the polymerase chain reaction. Canadian Journal of Fisheries and Aquatic Sciences, 48:48–52. Carr, S. M., Snellen, A. J., Howse, K. A., and Wroblewski, J. S. (1995). Mitochondrial DNA sequence variation and genetic stock structure of Atlantic cod (Gadus morhua) from bay and offshore locations on the Newfoundland continental shelf. Molecular Ecology, 4:79–88. Charrier, G., Coombs, S. H., McQuinn, I. H., and Laroche, J. (2007). Genetic structure of whiting Merlangius merlangus in the northeast Atlantic and adjacent waters. Marine Ecology Progress Series, 330:201–211. Charrier, G., Durand, J.-D., Quiniou, L., and Laroche, J. (2006). An investigation of the population genetic structure of pollack (Pollachius pollachius) based on microsatellite markers. ICES Journal of Marine Science, 63(9):1705–1709. Cohen, D., Inada, T., Iwamoto, T., and Scialabba, N. (1990). Gadiform fishes of the world (Order Gadiformes). An annotated and illustrated catalogue of cods, hakes, grenadiers and other gadiform fishes known to date. FAO Fisheries Synopsis. No. 125, Vol. 10. Rome. Coucheron, D. H., Nymark, M., Breines, R., Karlsen, B. O., Andreassen, M., Jørgensen, T. E., Moum, T., and Johansen, S. D. (2011). Characterization of mitochondrial mRNAs in codfish reveals unique features compared to mammals. Curr Genet, 57(3):213–222. Drummond, A. J., Rambaut, A., Shapiro, B., and Pybus, O. G. (2005). Bayesian coalescent inference of past population dynamics from molecular sequences. Molecular Biology and Evolution, 22(5):1185–1192. Eiríksson, G. M. and Árnason, E. (2013). Spatial and temporal microsatellite variation in spawning Atlantic cod, Gadus morhua, around Iceland. Canadian Journal of Fisheries and Aquatic Sciences, 70(8):1151–1158. Eiríksson, G. M. and Árnason, E. (2014). Mitochondrial DNA sequence variation in whiting Merlangius merlangus in the North East Atlantic. Environmental Biology of Fishes, 97:103–110. Eiríksson, G. M. and Árnason, E. (2015). Gene flow across the N-Atlantic and sexbiased dispersal inferred from mtdna sequence variation in saithe, Pollachius virens. Environmental Biology of Fishes, 98:67–79. Emerson, B. C., Paradis, E., and Thébaud, C. (2001). Revealing the demographic histories of species using DNA sequences. Trends in Ecology & Evolution, 16(12):707– 716. Excoffier, L. (2007). Handbook of Statistical Genetics, volume 2, chapter 29. Analysis of Population Subdivision, pages 980–1020. John Wiley and Sons, Ltd., Chichester. Forthun, J. and Mork, J. (1997). Genetic varibility at isozyme loci in two gadoid species; whiting (Merlangius merlangus l.) and saithe (Pollachius virens L.) from norway coast. International Council for the Exploration of the Sea. Frydenberg, O., Møller, D., Nævdal, G., and Sick, K. (1965). Haemoglobin polymorphism in Norwegian cod populations. Hereditas, 53:257–271. Futuyma, D. J. (2005). Evolution. Sinauer Associates, Sunderland, Massachusetts. Giæver, M. and Forthun, J. (1999). A population genetic study of haddock (Melanogram- 19 References mus aeglefinus) in Northeast Atlantic waters based on isozyme data. Sarsia, 84:89–98. Godø, O. R. and Moksness, E. (1987). Growth and maturation of norwegian coastal cod and northeast arctic cod under different conditions. Fisheries Research, 5(2–3):235– 242. Grabowski, T. B., McAdam, B. J., Thorsteinsson, V., and Marteinsdóttir, G. (2015). Evidence from data storage tags for the presence of lunar and semi-lunar behavioral cycles in spawning Atlantic cod. Environmental Biology of Fishes, 98(7). Grant, W. S. (2015). Problems and cautions with sequence mismatch analysis and bayesian skyline plots to infer historical demography. Journal of Heredity, 106(4):333– 346. Guinand, B., Lemaire, C., and Bonhomme, F. (2004). How to detect polymorphisms undergoing selection in marine fishes? A review of methods and case studies, including flatfishes. Journal of Sea Research, 51(3–4):167–182. Gupta, A. P. and Lewontin, R. C. (1982). A study of reaction norms in natural populations of Drosophila pseudoobscura. Evolution, 36(5):934–948. Halldórsdóttir, K. and Árnason, E. (2009). Organization of a β and α globin gene set in the teleost Atlantic cod, Gadus morhua. Biochemical Genetics, 47:817–830. Hartl, D. L. and Clark, A. G. (2006). Principles of Population Genetics. Sinauer Associates, Inc Publishers, Sunderland, Massachusetts, fourth edition. Hedgecock, D. (1994). Does variance in reproductive success limit effective population sizes of marine organisms? In Beaumont, A. R., editor, Genetics and Evolution of Aquatic Organisms, pages 122–134. Chapman & Hall, New York. Hedrick, P. W. (2005). Genetics of Populations. Jones and Bartlett Publishers, Sudbury, Massachusetts, third edition. Hemmer-Hansen, J., Nielsen, E. E., Therkildsen, N. O., Taylor, M. I., Ogden, R., Geffen, A. J., Bekkevold, D., Helyar, S., Pampoulie, C., Johansen, T., Consortium, F., and Carvalho, G. R. (2013). A genomic island linked to ecotype divergence in Atlantic cod. Molecular Ecology, 22:2653–2667. Hemmer-Hansen, J., Therkildsen, N., Meldrup, D., and Nielsen, E. (2014). Conserving marine biodiversity: insights from life-history trait candidate genes in Atlantic cod ((Gadus morhua)). Conservation Genetics, 15(1):213–228. Hewitt, G. M. (1996). Some genetic consequences of ice ages, and their role in divergence and speciation. Biological Journal of the Linnean Society, 58(3):247–276. Hislop, J. R. G. (1984). Fish Reproduction: Strategies and Tactics, chapter 17. A comparison of reproductive tactics and strategies of cod, haddock, whiting and Norway pout in the North Sea, pages 311–328. Academic Press, London. Hislop, J. R. G. and MacKenzie, K. (1976). Population studies of the whiting Merlangius merlangus (L.) of the northern North Sea. Journal du Conseil International pour l’Exploration de la Mer, 37:98–110. Ho, S. Y. W., Phillips, M. J., Cooper, A., and Drummond, A. J. (2005). Time dependency of molecular rate estimates and systematic overestimation of recent divergence times. Molecular Biology and Evolution, 22(7):1561–1568. Ho, S. Y. W., Shapiro, B., Phillips, M. J., Cooper, A., and Drummond, A. J. (2007). Evidence for time dependency of molecular rate estimates. Systematic Biology, 56(3):515–522. Hubert, S., Higgins, B., Borza, T., and Bowman, S. (2010). Development of a SNP 20 References resource and a genetic linkage map for Atlantic cod (Gadus morhua). BMC Genomics, 11:191. Jakobsen, T. and Olsen, S. (1987). Variation in rates of migration of saithe from Norwegian waters to Iceland and Faroe Islands. Fisheries research, 5:217–222. Jamieson, A. and Birley, A. J. (1989). The distribution of transferrin alleles in haddock stocks. ICES Journal of Marine Science, 45(3):248–262. Jamieson, A. and Jones, B. (1967). Two races of cod at Faroe. Heredity, 22:610–612. Jamieson, A. and Jónsson, J. (1971). The Greenland component of spawning cod at Iceland. Conseil International pour l’exploration de la Mer. Rapports et ProcesVerbaux, 161:65–72. Joensen, J. S., Steingrund, P., Hinriksen, A., and Mouritsen, R. (2005). Migration of cod (Gadus morhua): Tagging experiments at the Faroes 1952–1965. Fróðskaparrit, 53:100–135. Jones, B. W. and Jonsson, J. (1971). Coalfish tagging experiments at Iceland. Rit Fiskideildar, 5(1):1–27. Jónsdóttir, I., Marteinsdottir, G., and Campana., S. (2007). Contribution of different spawning components to the mixed stock fishery for cod in Icelandic waters. ICES Journal of Marine Science, 64:1749–1759. Jónsdóttir, I. G., Campana, S. E., and Marteinsdóttir, G. (2006). Otolith shape and temporal stability of spawning groups of Icelandic cod (Gadus morhua L.). ICES Journal of Marine Science, 63(8):1501–1512. Jónsdóttir, Ó. D. B., Imsland, A. K., Daníelsdóttir, A. K., Thorsteinsson, V., and Nævdal, G. (1999). Genetic differentiation among Atlantic cod in south and south-east Icelandic waters: Synaptophysin (Syp I) and haemoglobin (HbI) variation. Journal of Fish Biology, 54:1259–1274. Jónsson, G. (1992). Íslenskir fiskar (in Icelandic). Fjölvaútgáfan, Reykjavík. Jónsson, J. (1996). Tagging of cod (Gadus morhua) in Icelandic waters 1948–1986 and tagging of haddock (Gadus aeglefinus) in Icelandic waters 1953-1965. Rit Fiskideildar, 14:1–82. Jónsson, S. and Valdimarsson, H. (2005). The flow of Atlantic water to the North Icelandic shelf and its relation to the drift of cod larvea. ICES Journal of Marine Science, 62(7):1350–1359. Karlsson, S. and Mork, J. (2003). Selection-induced variation at the pantophysin locus (Pan I) in a Norwegian fjord population of cod (Gadus morhua L.). Molecular Ecology, 12(12):3265–3274. Karpov, A. and Nivkov, G. (1980). Hemoglobin alloforms in cod, Gadus morhua (Gadiformes, Gadidae), their function characteristics and occurrence in populations. Journal of Ichthyology, 6:45–49. Kimura, M. (1984). The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge. Knowlton, N. and Weigt, L. A. (1998). New dates and new rates for divergence across the isthmus of Panama. Proceedings of the Royal Society B, 265(1412):2257–2263. Lage, C., Purcell, M., Fogarty, M., and Kornfield, I. (2001). Microsatellite evaluation of haddock (Melanogrammus aeglefinus) stocks in the northwest Atlantic Ocean. Canadian Journal of Fisheries and Aquatic Sciences, 58(5):982–990. Liu, M., Lu, Z. C., Gao, T. X., Yanagimoto, T., and Sakurai, Y. (2010). Remarkably 21 References low mtDNA control-region diversity and shallow population structure in Pacific cod Gadus macrocephalus. Journal of Fish Biology, 77(5):1071–1082. Magnussen, E. (2007). Interpopulation comparison of growth patterns of 14 fish species on Faroe Bank: Are all fish species on the bank fast-growing? Journal of Fish Biology, 71:453–475. Marteinsdottir, G., Gudmundsdottir, A., Thorsteinsson, V., and Stefansson, G. (2000). Spatial variation in abundance, size composition and viable egg production of spawning cod (Gadus morhua L.) in Icelandic waters. ICES Journal of Marine Science, 57(4):824–830. Miller, K. M., Le, K. D., and Beacham, T. D. (2000). Development of tri- and tetranucleotide repeat microsatellite loci in Atlantic cod (Gadus morhua). Molecular Ecology, 9(2):238–240. Moen, T., Delghandi, M., Wesmajervi, M. S., Westgaard, J.-I., and Fjalestad, K. T. (2009). A SNP/microsatellite genetic linkage map of the Atlantic cod (Gadus morhua). Animal Genetics, 40(6):993–996. Moen, T., Hayes, B., Nilsen, F., Delghandi, M., Fjalestad, K. T., Fevolden, S.-E., Berg, P. R., and Lien, S. (2008). Identification and characterisation of novel SNP markers in Atlantic cod: Evidence for directional selection. BMC Genetics, 9(18):1–9. Moller, D. (1968). Genetic diversity in spawning cod along the Norwegian coast. Hereditas, 60(1):1–32. Neilson, J. D., Stobo, W. T., and Perley, P. (2006). Pollock (Pollachius virens) stock structure in the Canadian Maritimes inferred from mark-recapture studies. ICES Journal of Marine Science, 63(4):749–765. Nielsen, E. E., Hansen, M. M., and Meldrup, D. (2006). Evidence of microsatellite hitch-hiking selection in Atlantic cod (Gadus morhua L.): Implications for inferring population structure in nonmodel organisms. Molecular Ecology, 15(11):3219–3229. Nielsen, E. E., Hemmer-Hansen, J., Poulsen, N. A., Loeschcke, V., Moen, T., Johansen, T., Mittelholzer, C., Taranger, G.-L., Ogden, R., and Carvalho, G. R. (2009). Genomic signatures of local directional selection in a high gene flow marine organism; the Atlantic cod (Gadus morhua). BMC Evolutionary Biology, 9(276):1–11. Nielsen, R. (2001). Statistical tests of selective neutrality in the age of genomics. Heredity, 86(Pt 6):641–647. O’Reilly, P. T., Canino, M. F., Bailey, K. M., and Bentzen, P. (2000). Isolation of twenty low stutter di- and tetranucleotide microsatellites for population analyses of walleye pollock and other gadoids. Journal of Fish Biology, 56(5):1074—-1086. Pálsson, Ó. K. and Thorsteinsson, V. (2003). Migration patterns, ambient temperature and growth of Icelandic cod (Gadus morhua): Evidence from storage tag data. Canadian Journal of Fisheries and Aquatic Sciences, 60(11):1409–1423. Pampoulie, C., Jakobsdóttir, K. B., Marteinsdóttir, G., and Thorsteinsson, V. (2008). Are vertical behaviour patterns related to the pantophysin locus in the Atlantic cod (Gadus morhua L.)? Behavioral Genetics, 38(1):76–81. Pampoulie, C., Ruzzante, D. E., Chosson, V., Jörundsdóttir, Þ. D., Taylor, L., Thorsteinsson, V., Daníelsdóttir, A. K., and Marteinsdóttir, G. (2006). The genetic structure of Atlantic cod (Gadus morhua) around Iceland: Insight from microsatellites, the Pan I locus, and tagging experiments. Canadian Journal of Fisheries and Aquatic Sciences, 63(12):2660–2674. 22 References Pampoulie, C., Skirnisdottir, S., Olafsdottir, G., Helyar, S. J., Thorsteinsson, V., Jónsson, S. Þ., Fréchet, A., Durif, C. M. F., Sherman, S., Lampart-Kałużniacka, M., Hedeholm, R., Ólafsson, H., Daníelsdóttir, A. K., and Kasper, J. M. (2014). Genetic structure of the lumpfish Cyclopterus lumpus across the North Atlantic. Petersen, M. F. and Steffensen, J. F. (2003). Preferred temperature of juvenile Atlantic cod Gadus morhua with different haemoglobin genotypes at normoxia and moderate hypoxia. The Journal of Experimental Biology, 206(Pt 2):359–364. Petit, R. J. and Excoffier, L. (2009). Gene flow and species delimitation. Trends Ecol Evol, 24(7):386–393. Pilcher, M. W., Whitfield, P. J., and Riley, J. D. (1989). Seasonal and regional infestation characteristics of three ectoparasites of whiting, Merlangim merlangus L., in the North Sea. Journal of Fish Biology, 35(1):97–110. Pogson, G. H. (2001). Nucleotide polymorphism and natural selection at the pantophysin (Pan I) locus in the Atlantic cod, Gadus morhua (L.). Genetics, 157(1):317–330. Pogson, G. H. and Mesa, K. A. (2004). Positive Darwinian selection at the pantophysin (Pan I) locus in marine gadid fishes. Molecular Biology and Evolution, 21(1):65–75. Purcell, M. K., Kornfield, I., Fogarty, M., and Parker, A. (1996). Interdecadal heterogeneity in mitochondrial DNA of Atlantic haddock (Melanogrammus aeglefinus) from Georges bank. Molecular Marine Biology and Biotechnology, 5(3):185–192. Pybus, O. G., Rambaut, A., and Harvey, P. H. (2000). An integrated framework for the inference of viral population history from reconstructed genealogies. Genetics, 155(3):1429–1437. Pálsson, S., Källman, T., Paulsen, J., and Árnason, E. (2009). An assessment of mitochondrial variation in Arctic gadoids. Polar Biology, 32:471–479. Pétursdóttir, G., Begg, G. A., and Mareinsdóttir, G. (2006). Discrimination between Icelandic cod (Gadus morhua L.) populations from adjacent spawning areas based on otholith growth and shape. Fisheries Research, 80:182–189. Reiss, H., Hoarau, G., Dickey-Collas, M., and Wolff, W. J. (2009). Genetic population structure of marine fish: Mismatch between biological and fisheries management units. Fish and Fisheries, 10(2):361–395. Rico, C., Ibrahim, K. M., Rico, I., and Hewitt, G. M. (1997). Stock composition in North Atlantic populations of whiting using microsatellite markers. Journal of Fish Biology, 51(3):462–475. Rocha, L. A., Roberson, D. R., Rocha, C. R., van Tassel, J. L., Craig, M. T., and Bowen, B. W. (2005). Recent invasion of the tropical Atlantic by an Indo-Pacific coral reef fish. Molecular Ecology, 14:3921–3928. Rogers, A. and Harpending, H. (1992). Population growth makes waves in the distribution of pairwise genetic differences. Molecular Biology and Evolution, 9(3):552–569. Rose, G. A. (2005). On distributional responses of North Atlantic fish to climate change. ICES Journal of Marine Science, 62:1360–1374. Salvanes, A. G., Moberg, O., and Braithwaite, V. A. (2007). Effects of early experience on group behaviour in fish. Animal Behaviour, 74(4):805–811. Schopka, S. A., Sólmundsson, J., and Þorsteinssson, V. (2006). Áhrif svæðafriðunar á viðgang þorsks (in Icelandic). Technical Report Fjölrit nr. 123, Hafrannsóknastofnunin. Sick, K. (1961). Haemoglobin polymorphism in fishes. Nature, 192:894–896. 23 References Sick, K. (1965). Haemoglobin polymorphism of cod in the North Sea and the North Atlantic Ocean. Hereditas, 54:49–69. Sigurgíslason, H. and Árnason, E. (2003). Extent of mitochondrial DNA sequence variation in Atlantic cod from the Faroe Islands: A resolution of gene genealogy. Heredity, 91(6):557–564. Sólmundsson, J., Jónsdóttir, I. G., Björnsson, B., Ragnarsson, S. Á., Tómasson, G. G., and Thorsteinsson, V. (2015). Home ranges and spatial segregation of cod Gadus morhua spawning components. Marine Ecology Progress Series, 520:217–233. Star, B., Nederbragt, A. J., Jentoft, S., Grimholt, U., Malmstrøm, M., Gregers, T. F., Rounge, T. B., Paulsen, J., Solbakken, M. H., Sharma, A., Wetten, O. F., Lanzén, A., Winer, R., Knight, J., Vogel, J.-H., Aken, B., Andersen, O., Lagesen, K., ToomingKlunderud, A., Edvardsen, R. B., Tina, K. G., Espelund, M., Nepal, C., Previti, C., Karlsen, B. O., Moum, T., Skage, M., Berg, P. R., Gjøen, T., Kuhl, H., Thorsen, J., Malde, K., Reinhardt, R., Du, L., Johansen, S. D., Searle, S., Lien, S., Nilsen, F., Jonassen, I., Omholt, S. W., Stenseth, N. C., and Jakobsen, K. S. (2011). The genome sequence of Atlantic cod reveals a unique immune system. Nature, 477(7363):207– 210. Sæmundsson, K. (2005). Geographical distribution and dispersal of juvenile Icelandic cod (Gadus morhua). Master’s thesis, University of Iceland, Reykjavík, Iceland, Sturlugata 7, 101 Reykjavík, Iceland. Therkildsen, N. O., Hemmer-Hansen, J., Hedeholm, R. B., Wisz, M. S., Pampoulie, C., Meldrup, D., Bonanomi, S., Retzel, A., Olsen, S. M., , and Nielsen, E. E. (2013). Spatiotemporal SNP analysis reveals pronounced biocomplexity at the northern range margin of Atlantic cod Gadus morhua. Evolutionary Applications, 6:690–705. Thorisson, K., Jónsdóttir, I. G., Marteinsdottir, G., and Campana, S. E. (2011). The use of otolith chemistry to determine the juvenile source of spawning cod in Icelandic waters. ICES Journal of Marine Science, 68:98–106. Thorsteinsson, V., Pálsson, Ó. K., Tómasson, G. G., Jónsdóttir, I. G., and Pampoulie, C. (2012). Consistency in the behaviour types of the Atlantic cod: repeatability, timing of migration and geo-location. Marine Ecology Progress Series, 462:251–260. Tåning, Å. S. V. (1944). Experiments on meristic and other characters in fishes. I. on the influence of temperature on some meristic characters in sea-trout and the fixation-period of these characters. Meddelelser fra Kommissionen for Danmarks Fiskeri- og Havundersøgelser, Serie: Fiskeri, 11:1–66. Walsh, P., Metzfer, D., and Higuchi, R. (1991). Chelex 100 as a medium for simple extraction of DNA for PCR-based typing from forensic material. BioTechniques, 10(4):506–513. Waples, R. S. (1987). A multispecies approach to the analysis of gene flow in marine shore fishes. Evolution, 41(2):385–400. Waples, R. S. (1998). Separating the wheat from the chaff: Patterns of genetic differentiation in high gene flow species. Journal of Heredity, 89(5):438–450. Waples, R. S. and Gaggiotti, O. (2006). What is a population? An empirical evaluation of some genetic methods for identifying the number of gene pools and their degree of connectivity. Molecular Ecology, 15(6):1419–1439. Wares, J. P. and Cunningham, C. W. (2001). Phylogeography and historical ecology of the North Atlantic intertidal. Evolution, 55(12):2455–2469. 24 References Williams, G. C. (1975). Sex and evolution. Princeton University Press, Princeton, NJ. Williams, G. C., Koehn, R. K., and Mitton, J. B. (1973). Genetic differentiation without isolation in the American eel, Anguilla rostrata. Evolution, 27(2):192–204. Wright, S. (1921). Systems of mating. II. The effects of inbreeding on the genetic composition of a population. Genetics, 6(2):124–143. Wright, S. (1922). Coefficients of inbreeding and relationships. American Naturalist, 51:636–639. Zwanenburg, K. C. T., Bentzen, P., and Wright, J. M. (1992). Mitochondrial DNA differentiation in western North Atlantic populations of haddock (Melanogrammus aeglefinus). Canadian Journal of Fisheries and Aquatic Sciences, 49(12):2527–2537. Þorsteinsson, V. and Marteinsdóttir, G. (1993). Þorskmerkingar. Ægir, 2:92–100. 25 Paper I Spatial and Temporal Microsatellite Variation in Spawning Atlantic cod, Gadus morhua, around Iceland Guðni Magnús Eiríksson and Einar Árnason, 2013 Canadian Journal of Fisheries and Aquatic Sciences, 70:1151–1158. doi:10.1139/cjfas-2012-0494 © Canadian Journal of Fisheries and Aquatic Sciences Guðni Magnús Eiríksson designed the study, selected samples of spawning fish and did all the laboratory work: DNA isolation, microsatellite amplification and prepated the samples for electrophoresis on an ABI-3100 automatic sequencer. He also did the data analysis and interpretation. Guðni was in charge of writing the manuscript and corresponded to the comments of reviewers and finalized the article. Professor Einar Árnason actively participated in all steps of the work. Einar took part in the experimental design, supervised the molecular work, took part in data analysis, interpretation of the data and participated in writing and finalizing the manuscript. 27 Paper I 28 1151 ARTICLE Spatial and temporal microsatellite variation in spawning Atlantic cod, Gadus morhua, around Iceland Guðni Magnús Eiríksson and Einar Árnason Abstract: The present study suggests that the observed genetic difference between Atlantic cod, Gadus morhua, off the south and the north coast of Iceland may be caused by natural selection affecting genetic variation at a microsatellite loci (Gmo34). When disregarding this locus from the analysis, no genetic difference was observed between northern and southern Iceland. The methods applied here were very sensitive, and differences as small as FST = 0.0005 are unlikely to go unnoticed. The difference between cod off the south and the north coast of Iceland is thus likely to be smaller than that. Such a small difference is negligible and is not likely to have any biological meaning. Genetic drift was detected by allele frequency comparison among different cohorts (FST = 0.0007, P = 0.0209). A small but significant difference was observed among allele frequency for cod grouped by depth at the south coast of Iceland (FST = 0.0017, P = 0.0002). This difference is very subtle and needs to be interpreted with caution. Résumé : La présente étude indique que les différences génétiques observées entre les morues, Gadus morhua, de la côte nord et de la côte sud de l’Islande pourraient être le fait d’une sélection naturelle associée à la variation génétique à un site microsatellite (Gmo34). Si ce site est exclu de l’analyse, aucune différence génétique n’est observée entre le nord et le sud de l’Islande. Étant donné la grande sensibilité des méthodes utilisées, la probabilité que des différences de l’ordre de FST = 0,0005 ne soient pas détectées est faible. La différence entre les morues des côtes sud et nord de l’Islande est donc probablement plus faible que ce seuil. Une différence de cet ordre est négligeable et non susceptible d’avoir une signification biologique. La comparaison de la fréquence allélique entre différentes cohortes a permis de déceler une dérive génétique (FST = 0,0007, P = 0,0209). Une différence significative, bien que faible, a été observée sur le plan de la fréquence allélique pour différents groupes de morues répartis selon la profondeur sur la côte sud de l’Islande (FST = 0,0017, P = 0,0002). Cette différence est très subtile et doit être interprétée avec prudence. [Traduit par la Rédaction] Introduction Genetic variation For the study of population breeding structure, it is important to use neutral molecular markers whose variation is shaped by genetic drift but not natural selection (Williams et al. 1973; Guinand et al. 2004). For Atlantic cod, Gadus morhua, this is particularly important, as it has become evident that signs of natural selection are common in the cod genome (e.g., Moen et al. 2008; Nielsen et al. 2009; Bradbury et al. 2010). Moen et al. (2008) found signals of natural selection in ⬃9% of single nucleotide polymorphism (SNP) loci analyzed in Atlantic cod, based on polymorphism found in expressed sequence tags (EST). Bradbury et al. (2010) also found high number of SNPs showing deviation from neutral expectations, although at lower relative frequency (4.2%). Thus, the analysis of natural selection should be an integrated part of the analysis of Atlantic cod breeding structure. For the past two decades microsatellite markers have been applied in many studies for the analysis of population genetic structure in Atlantic cod (Reiss et al. 2009). Although many microsatellites may be selectively neutral, it has been shown that microsatellites can be found within genes where they are likely to be in linkage disequilibrium with coding regions that are potentially under natural selection (Li et al. 2004), and in some cases microsatellites may have functional importance and may thus be under direct natural selection (Li et al. 2002). For Atlantic cod it has been shown that natural selection can affect microsatellite variation (e.g., Nielsen et al. 2006). Molecular markers, whose variation is affected by natural selection, may reflect the population breeding structure, but may also be the result of contemporary selection in a heterogeneous environment (Williams et al. 1973; Guinand et al. 2004). The prerequisite for such variation to be useful for population breeding structure analysis is evidence for its historical establishment, rather than a reflection of contemporary variation in selective mortality in a heterogeneous environment. For Atlantic cod, the possibility of contemporary natural selection to shape frequencies at a given marker in different ways in different environmental conditions is a real concern, because of cod's high fecundity, high dispersal capability, and high mortality at early life stages (type III survivor ship curve). Although this is important for identification of the breeding structure of populations, markers under selection may be useful for the identification of ecological groups (see e.g., André et al. 2011) that can be defined as populations under the ecological paradigm (Waples and Gaggiotti 2006). Such variation may be important for fisheries management (e.g., Russello et al. 2012). Neutral variation may fail to reflect population structure, in particular for recently formed populations, as genetic differentiation at a neutral marker may evolve slowly. This particularly accounts for large populations, as effective population size (Ne) is inversely related to the rate of genetic drift (Hartl and Clark 2006). When no genetic difference is realized in a study, it is important to assess the statistical power of methods applied to determine Received 15 November 2012. Accepted 26 May 2013. Paper handled by Associate Editor Paloma Morán. G.M. Eiríksson and E. Árnason. Institute of Life and Environmental Sciences, University of Iceland Sturlugata 7, 101 Reykjavík. Corresponding author: Guðni Magnús Eiríksson (e-mail: [email protected]). Can. J. Fish. Aquat. Sci. 70: 1151–1158 (2013) dx.doi.org/10.1139/cjfas-2012-0494 Published at www.nrcresearchpress.com/cjfas on 29 May 2013. 29 Paper I 1152 the magnitude of difference possible to detect (Ryman and Palm 2006). Population structure of Atlantic cod Phenotypic variation in Atlantic cod among geographic localities has been documented in many studies. Variation in life history characteristics and otolith shape has been used to discriminate cod into distinct spawning groups residing north and south of Iceland (Jónsdóttir et al. 2006). Otolith shape has been used to identify different groups of cod at a small scale at adjacent spawning grounds at the south coast of Iceland (Pétursdóttir et al. 2006). Distinct behavioural types, defined as coastal and frontal behavioural types (Thorsteinsson et al. 2012), have been suggested to reflect different populations (Pálsson and Thorsteinsson 2003; Thorsteinsson et al. 2012), and correlation of the different behavioural types to body morphology has been described (McAdam et al. 2012). From these studies emerges the view that the Atlantic cod around Iceland are divided into different populations, although the form and scale may vary. However, the observed phenotypic variation rarely correlates to the findings of population genetic studies (but see Pampoulie et al. 2008). Although the population genetics of Atlantic cod across its distribution range has been studied for decades (for overview see Reiss et al. 2009), a consensus on the population structure is not in place. Various molecular tools have been used, and the results have been to some extent conflicting and the interpretation of research has varied from one study to another. In some cases results have suggested local populations at a fine scale, in particular studies that utilize microsatellite and pantophysin I locus (Pan I) variation for resolving population genetic structure (e.g., Jónsdóttir et al. 2001; Pampoulie et al. 2006; Westgaard and Fevolden 2007; Kovach et al. 2010). However, other studies suggest high levels of gene flow and limited population genetic structure, for example a study based on allozyme variation (e.g., Mork et al. 1985) and in particular studies that are based on mtDNA sequence variation (e.g., Carr et al. 1995; Árnason et al. 2000; Árnason 2004). Molecular methods vary in nature, and the evolution of different molecular markers may vary within the same population (Hartl and Clark 2006), and this may explain variable results. It has been demonstrated that cod population genetic structure may be inflated as a result of natural selection in heterogeneous environments (Nielsen et al. 2006), and this is a possible explanation for the apparent discrepancies among cod population genetic studies in general. The observed apparent population structure for Atlantic cod in Icelandic waters maybe an example of such inflated population genetic structure resulting from natural selection. A number of studies have been carried out to reveal the population genetic structure of Atlantic cod around Iceland. Pampoulie et al. (2006) reported small but highly significant genetic differentiation between Atlantic cod sampled north of Iceland compared with cod sampled south of Iceland (FST = 0.003, P = 0.00013). Such subtle genetic difference may not be biologically meaningful (Waples 1998), although it has been suggested that indeed such a small difference may be important in marine fish (Knutsen et al. 2011). In any case, thorough analysis of temporal stability and natural selection is needed to assess the nature of the observed variation in Atlantic cod around Iceland. Much greater spatial differentiation (FST = 0.261) was revealed for cod around Iceland when using the Pan I locus (Pampoulie et al. 2006), a genetic variation that is most likely shaped by natural selection (e.g., Karlsson and Mork 2003; Pogson and Fevolden 2003; Pogson and Mesa 2004). Long-term temporal stability of allele frequencies at a the Pan I locus has been taken as an evidence for its stability over time, making it useful for population structure analysis at a microgeographic scale (Nielsen et al. 2007). However, if environmental characteristics are stable with respect to factors affecting the frequencies at a locus (e.g., depth), the frequencies can be the result of selective mortality within each cohort, repeatedly arriving at Can. J. Fish. Aquat. Sci. Vol. 70, 2013 similar allele frequencies reflecting the environmental heterogeneity rather than population breeding structure. Some earlier studies on the population structure of Atlantic cod around Iceland commonly used molecular markers that are suspected of being under the effects of natural selection (Jamieson and Jónsson 1971; Jónsdóttir et al. 1999, 2001), and the reported population genetic structure may thus be inflated (Nielsen et al. 2006). However, Árnason et al. (2000), using mitochondrial DNA sequence variation, did not reveal any population genetic structure in Atlantic cod in the waters around Iceland (see also Árnason 2004). These conflicting findings call for further analysis of the population genetic structure of Atlantic cod around Iceland. For an observed spatial genetic structure in a population to be historically meaningful, temporal stability is essential. If a spatial structure is not temporally stable, it may have established because of chance or possibly because of nonrandom sampling (Waples 1998). Nonrandom sampling may cause temporal genetic variation to be mistaken for a spatial structure. This can, for example, be realized when there is a genetic difference among cohorts and when there is a difference in age combination among fish among different geographic regions. It is well known that cod in northern Iceland has a slower growth rate than cod in southern Iceland, and a difference has been reported in age composition between cod sampled in northern and southern Iceland (Jónsdóttir et al. 2006). For cod around Iceland, the above scenario is thus not only a theoretical possibility, but a real concern. When a spatial structure is observed, it is thus important to assess its temporal stability to establish if it is truly reflecting the breeding structure of the population. The present study In the present study 10 microsatellite loci were used that have been extensively used for Atlantic cod (Reiss et al. 2009), including both markers that are likely to be selectively neutral and some that have shown signs of natural selection (Nielsen et al. 2006). Spatial and temporal genetic variation for spawning cod around Iceland were examined and compared with earlier findings (Árnason et al. 2000; Pampoulie et al. 2006). We tested whether natural selection is likely to have shaped the genetic variation at analyzed loci and also evaluated the sensitivity of the methods. This study analyzed temporal variation by comparing among cohorts and age groups and assessed if that may affect the observed spatial genetic variation. To test if there are historically stable populations in the north and the south of Iceland, and that the reported difference (Pampoulie et al. 2006) is not a temporary unstable phenomena, the present study targeted only mature spawners. Although the focus of the study is on spawning cod around Iceland, it may also have general implications for the biology of Atlantic cod and the use of microsatellites for population genetic studies in general. In all microsatellite studies, the quality of the analyzed loci is of great importance. Genotyping errors such as null alleles and stuttering error are commonly encountered. The presence of null alleles can inflate a genetic difference if any difference is found among groups (increased FST), but in case of no difference, the FST is unbiased (Chapuis and Estoup 2007). Materials and methods Fish sampling Atlantic cod was sampled from Iceland in the annual spring survey of the Icelandic Marine Research Institute in 3 consecutive years, 2005–2007. The surveys mainly target spawning cod at shallow waters (<200 m) on breeding grounds at specific locations. Samples were also taken at deeper waters in southern Iceland. Gill tissue of fish was sampled and preserved in 96% ethanol for DNA analysis. Age of sampled fish was determined by otolith inspection by experts at the Marine Research Institute. Published by NRC Research Press 30 Eiríksson and Árnason 1153 Fig. 1. Atlantic cod sampling sites on a map of Iceland. Cod sampling stations in the north (triangles) and south (circles). Total number of cod sampled: N = 845 (Nnorth = 340, Nsouth = 505). Mature fish with ripe gonads were randomly selected for microsatellite analysis from the overall sample (Fig. 1). They represent six cohorts (1996–2001) and four ages (6–9 years old) from northern and southern Iceland (Fig. 1). In total 845 individuals were sampled and analyzed, 340 from the north and 505 from south. Molecular methods DNA isolation DNA was isolated using a Chelex method (Walsh et al. 1991). A stock solution was prepared consisting of 50 mg·mL–1 Chelex 100 (BioRad), 0.2% sodium dodecyl sulfate (SDS), 10 mmol·L–1 Tris (pH 8), and 0.5 mmol·L–1 ethylenediaminetetraacetic acid (EDTA) with proteinase K added to the mix, just prior to DNA isolation, at final concentration of 200 g·mL–1. A small tissue fragment (less than 1 mm in diameter) was cut from the preserved gill tissue, drained on a paper towel, and placed in a tube containing 250 L reaction mix. The mix was then placed in a Thermomixer (Eppendorf) at 700 r·min–1 for 3–5 h (or until no tissue fragments were visible). After this the mix was placed in a 95 °C for 5 min for denaturation of the proteinase K and spun at 3000 r·min–1 for 5 min to precipitate debris. The supernatant was carefully removed. A 1:19 dilution of the supernatant was prepared and used for PCR (DNA concentration approximately 1 ng·L–1). Microsatellite multiplex Ten microsatellite loci were amplified in two multiplex PCR reactions (Table 1) with modification of the method by Westgaard and Fevolden (2007). QIAGEN Multiplex PCR kit, composed of HotStartTaq DNA polymerase, multiplex PCR buffer ([MgCl2] = 6 mmol·L–1), and dNTP mix was used for multiplex reactions. A pigtail (GTTTCTT) was added to the 5=-end of the reverse primer to reduce stuttering in the amplification process (Brownstein et al. 1996). The forward primer had a 5=-end fluorescence label. Applied Biosystems dye set DS-33 (filter set G5) was used for labeling. Primers were made by Applied Biosystems. The same PCR protocol was used for both multiplexes. The DNA was denaturated at 95 °C for 1 min followed by 35 cycles of denaturation at 95 °C for 20 s, annealing at 56 °C for 30 s, and extension at 68 °C for 1 min. Ten minutes extension at 68 °C completed the PCR. DNA precipitation The amplified DNA was precipitated in plates of 96 samples. 50 L of 0.3 mol·L–1 NaOAc was added to each well containing 10 L reaction product followed by 125 L cold (stored at −20 °C) 96% ethanol. The mix was spun at 4000 r·min–1 for 30 min at 2 °C, the precipitation mix was poured off, and the sample and the tray were spun inverted on top of a paper towel at 300 r·min–1 at 2 °C for 2 min. For rinsing of the product, 250 L of cold 70% ethanol was carefully added and spun at 4000 r·min–1 for 5 min, and then the tray was spun inverted as before, but this time for 5 min to remove the rest of the ethanol. The samples were left in a dark and dry place for at least 15 min to allow for evaporation of any remaining ethanol. The DNA was then dissolved in Hi-Di formamide containing GeneScan-500 LIZ size standards (Applied Biosystems) for length determination. An ABI 3100 automatic sequencer was used for the electrophoresis of the reaction product. Published by NRC Research Press 31 Paper I 1154 Can. J. Fish. Aquat. Sci. Vol. 70, 2013 Table 1. Microsatellite loci used in the present study. Locus MP Primer concentration (mol·L–1) Size range (bp) k Hexp Hobs PHWE Reference Gmo8 Gmo19 Gmo35 Gmo37 Tch11 Gmo2 Gmo3 Gmo34 Gmo132 Tch13 1 1 1 1 1 2 2 2 2 2 0.10 0.15 0.20 2.50 0.30 0.25 0.20 0.20 0.30 0.30 119–331 127–235 121–154 233–315 123–244 105–153 165–213 90–123 100–181 84–185 54 28 11 23 27 23 12 10 33 47 0.931 0.922 0.831 0.839 0.935 0.846 0.159 0.385 0.574 0.920 0.928 0.806 0.805 0.728 0.846 0.717 0.155 0.363 0.588 0.903 0.399 <0.001 0.619 <0.001 <0.001 <0.001 0.375 0.064 0.590 0.449 Miller et al. 2000 Miller et al. 2000 Miller et al. 2000 Miller et al. 2000 O'Reilly et al. 2000 Brooker et al. 1994 Miller et al. 2000 Miller et al. 2000 Brooker et al. 1994 O'Reilly et al. 2000 Note: The amplification was carried out in two multiplexes (MP). Listed are the concentration of the primers used, the size range of alleles (base pairs, bp), the number of observed alleles (k), the expected and observed heterozygosity (Hexp and Hobs, respectively), probability for loci to be in Hardy–Weinberg equilibrium (PHWE), and the reference for primers used. Data analysis GeneMapper (version 3.7, Applied Biosystems 2004) was used for data analysis. Bin sets were developed based on the observed peak variation and alleles automatically assigned to bins. Allele determination was manually confirmed by inspection of trace files. The software Micro-Checker (version 2.2.3, Oosterhout et al. 2004) was used to assess the quality of the analysis for each loci. Micro-Checker detects the presence of null alleles and provides an algorithm to adjust the frequency of genotypes to correct for null alleles. Genotype frequency was adjusted for loci that showed signs of null alleles. The effects of not correcting and correcting on the results was assessed. The R package “adegenet” (Jombart 2008) was used to calculate summary statistics for observed and expected heterozygosity (Hobs and Hexp, respectively) and number of alleles (k). Probability test was used for assessing whether allele frequency at the loci were in Hardy–Weinberg equilibrium, implemented in GENEPOP version 4.0.7 (Raymond and Rousset 1995). The method of Weir and Cockerham (1984) was used to calculate the fixation indexes (F) for spatial and temporal comparison as implemented in GENEPOP. The significance of genic and genotypic differences was tested using Fisher' exact test as implemented in GENEPOP. For assessing if natural selection is likely to have affected the genetic variation at the studied loci, the simulation approach of Beaumont and Nichols (1996) was used. An FST null distribution was generated by simulation as implemented in FDIST2 software. The simulated distribution was used for comparison with the observed data. First, all the data were included in the simulation, and the target FST value was set at the overall FST for group differentiation. Upon the detection of an outlier, the outlier locus was removed, the target FST value was recalculated using the remaining loci, and the simulation was repeated as recommended by Beaumont and Nichols (1996). An FST null distribution was simulated for all cases where significant group differentiation was found (contrasts of geographic variation, depth groups at the south coast of Iceland, and cohorts in the overall sample). In the simulation, we assume an island model with 100 islands, the number of populations set at the number of groups in the specific contrasts, and the number of samples in contrast groups set to the number of individuals in study groups. The stepwise mutation model was selected and 100 000 loci were simulated in each simulation. The computer software POWSIM version 4.1 (Ryman and Palm 2006) was used to estimate the statistical power of sampling design used in the study. For the simulations, the empirical data (allele number and frequency) was used to define the base population. However, POWSIM is limited to 50 alleles per locus. As Gmo19 has 54 alleles, the frequencies of the three largest and the three smallest alleles were combined in the base population before the simulation. This is justified because reducing the number of alleles at Gmo19 will reduce the power of the locus to detect genetic differentiation (e.g., Kalinowski 2002). The estimated power of the design will thus be slightly underestimated. Ne was set at 5000, and the number of generations simulated varied (0–9) to generate populations with variable genetic differentiation (FST). The generated populations were then compared using GENEPOP, and statistical power was determined. It can be noted that the conditions for simulation (Ne and generations) are not important for the analysis, but only the resulting genetic differentiation. Results Nature of genetic variation The amplification success rate varied among loci used in this study. The loci were successfully amplified in 91%–96% of individuals for all loci, except for Gmo37, for which alleles could be identified in 61% of individuals only. All loci were included in further analysis of the data. The loci differed in the amount of variation exhibited (Table 1). Micro-Checker analysis of the genetic variation at the loci showed homozygotic excess and possible presence of null alleles at three loci (Gmo2, Gmo19, Gmo37). This is reflected in the difference in Hobs and Hexp in Table 1 as general excess of homozygotes for most allele size classes. Two loci had indications of stuttering error (Gmo2 and Gmo19), but no indication of large allele dropout was observed. Spatial and temporal genetic variation and outlier detection Significant genetic difference was found between cod from northern and southern Iceland (FST = 0.0016; Table 2). The overall pattern was mostly due to Gmo34, an extreme outlier (far outside the 99% limits of the FST null distribution; Fig. 2a). Another locus, Gmo132, showed significant difference between the north and south (Table 2), but the FST was not high enough for the locus to be considered an outlier (Fig. 2a). When Gmo34 was removed from the analysis, there was no overall significant population genetic structure for the remaining loci between northern and southern Iceland (overall FST = 0.0002). Adjusting genotype frequencies for Gmo2, Gmo19, and Gmo37 (accounting for possible null alleles) had very little effect on the estimated difference between cod from the north and south (overall FST = 0.0017 when Gmo34 was included, but FST = 0.0003 when Gmo34 was excluded). Considering depth, a significant difference was found among cod at 50 m depth classes in southern Iceland. Genic difference was significant at five loci, and genotypic difference was signifiPublished by NRC Research Press 32 Eiríksson and Árnason 1155 Table 2. FST for each locus for three contrasts. Contrast Locus FST between the south and north FST among 50 m depth classes in the south FST among cohorts Gmo2 Gmo3 Gmo8 Gmo19 Gmo34 Gmo35 Gmo37 Gmo132 Tch11 Tch13 All −0.0010 −0.0008 −0.0001 −0.0004 0.0250***, • • • 0.0004 −0.0000 0.0051**,• −0.0003 −0.0001 0.0016***,• • • 0.0000 −0.0009 −0.0002 0.0015* 0.0171*, • 0.0024*,• 0.0019* 0.0030*, • −0.0004 0.0012 0.0017**, • • • 0.0001 0.0012 0.0014 0.0008 0.0065 −0.0013 0.0005*, • 0.0035*,• −0.0009 −0.0002 0.0007* Fig. 2. Outlier detection for three contrasts. The observed FST for the different loci plotted against Hobs (circles). The solid lines represent the median of the FST null distribution, and the dashed and dotted lines enclose the 95% and 99% limits of the distribution, respectively. (a) Comparison between northern and southern Iceland. (b) Comparison among depth groups at the south coast of Iceland. (c) Comparison among cohorts. Note: The significance of the exact G test for genotypic differentiation is indicated with *, **, and *** at P values less than 0.05, 0.01, 0.001, respectively. Similarly, the significance of the exact G test for genic differentiation is labelled with • , • • , and • • • . In the first contrast, all the sampled fish are included: N = 845 (Nnorth = 340, Nsouth = 505). In the second contrast of depth classes, all the fish sampled at the south coast of Iceland was included: Nsouth = 504. (Sample size for different depth classes: N0–50 m = 34, N50–100 m = 320, N100–150 m = 75, N150–200 m = 21, N>200 m = 54; depth information was missing for one fish.) In the third contrast, all sampled fish (where age information was not missing) were included: N = 835. (Sample size for different cohorts: N1996 = 67, N1996 = 135, N1996 = 206, N1996 = 216, N1996 = 150, N1996 = 61.) cant at three loci (Table 2). None of the loci were found outside the 95% limits of the FST null distribution; however, Gmo34 also had a considerably higher FST value compared with the other loci (Fig. 2b). Removing Gmo34 from the analysis resulted in a smaller but still significant difference among depth groups (FST = 0.0010*,••; refer to Table 2 for a description of superscript asterisk(s) and bullet(s)). In northern Iceland, cod was mostly sampled at shallow waters, and no significant genetic difference was found at different depths. Considering temporal variation, a significant difference was also found among cohorts (Table 2). No locus was found outside the 95% range of the FST null distribution for cohorts (Fig. 2c), although again Gmo34 showed considerably higher FST value compared with the other loci. Statistical power analysis POWSIM simulations generating groups of variable genetic difference showed that there was a fair chance (80%) of detecting a significant difference at FST = 0.00034 and high probability (95%) when FST = 0.00048 (Fig. 3). Discussion This study shows the same spatial genetic structure in Atlantic cod around Iceland as reported in Pampoulie et al. (2006). However, the present study suggests that the genetic difference between Atlantic cod from the north and south coast of Iceland does not reflect a population breeding structure. The observed genetic difference was attributed to outlier behaviour of Gmo34 most likely due to natural selection at Gmo34 locus or linked loci. When removing this locus from the analysis, no difference was observed between these geographic regions. In fact the Gmo34 locus has shown similar deviation from FST null distribution in spatial comparisons in recent studies (Nielsen et al. 2006; Westgaard and Fevolden 2007; Pampoulie et al. 2011). Gmo34 was one of the loci used in Pampoulie et al. (2006), but it was not tested whether its genetic variation was likely to have been affected by natural selection. It has been shown that Gmo34 is in linkage disequilibrium with the Pan I locus (e.g., Westgaard and Fevolden 2007) and may thus reflect the same pattern as has been described for that locus (e.g., Karlsson and Mork 2003; Pampoulie et al. 2006; Árnason et al. Published by NRC Research Press 33 Paper I 1156 Can. J. Fish. Aquat. Sci. Vol. 70, 2013 Proportion of significance (Fisher’s exact tests) Fig. 3. Proportion of significance (Fisher's exact tests) against FST of simulated populations showing variable genetic differentiation. 2009). In the present study, we do not have data to resolve whether the genetic variation at the Gmo34 locus is a contemporary phenomena or the observed frequencies at the locus is the result of historical natural selection. Therefore, there is no basis to use the locus for identification of a population breeding structure. Further, the analysis of statistical power (after removing Gmo34) showed that genetic differentiation as low as FST = 0.0003 can be detected with a fair confidence (80% probability), and the probability of a difference higher than FST = 0.0005 is unlikely to go unnoticed by the analytical tools applied here. Therefore, it can be assumed that if there is a genetic difference between cod from northern and southern Iceland due to breeding structure alone, it is unlikely to be greater than FST = 0.0005. Even if such a difference was observed, it is doubtful that such a negligible difference has any biological meaning (Waples 1998), although it can not be excluded (Knutsen et al. 2011). This result is in an agreement with some earlier findings for cod around Iceland (e.g., Árnason et al. 1992, 2000; Árnason 2004), but may seem to be at variance with studies reporting phenotypic variation in Atlantic cod around Iceland (Pálsson and Thorsteinsson 2003; Jónsdóttir et al. 2006; McAdam et al. 2012). However, the observed phenotypic variation can be environmentally induced, as noted in early fish research (see e.g., Tåning 1944). Phenotypic plasticity may cause a group of fish to develop variable phenotypic traits if exposed to different environments (as in Gupta and Lewontin 1982). In fact a recent study showed that Atlantic cod originating from the north and south coast of Iceland, but reared in common experimental and aquaculture facilities, showed no quantitative difference in growth, proportion of maturation, or length (Kristjánsson 2013). These findings suggest that the observed differences in these characteristics between northern and southern Iceland (Jónsdóttir et al. 2006) can be environmentally induced. Also, experiments have shown that the development of cod behaviour can be affected by environmental factors at early age (Braithwaite and Salvanes 2005; Salvanes et al. 2007). Thus, it is obvious that phenotypic variation (including behaviour) can not be used, on its own, as an indicator of population breeding structure. Further, environmental heterogeneity in the waters around Iceland seems to favor variable phenotypes, most likely through selective mortality at early stages. In cases where phenotypic variation is correlated with genotypic variation, this will affect allele frequencies at loci, as may be the case with Gmo34. Water temperature may be an important environmental factor in this respect, as temperature varies between northern and southern parts of Iceland (Jónsson and Valdimarsson 2005). Although the present study does not support the view of separate cod populations around Iceland (as suggested in Pampoulie et al. 2006), the variation at Gmo34 may suggest that different groups of cod may occupy different geographical areas in the waters around Iceland. Such groups or ecotypes can be defined as different populations under the ecological paradigm (Waples and Gaggiotti 2006). However, it is important to note that such patterns may persist although extensive gene flow may take place between such groups, even to the extent that they can be considered to be in panmixia. The present study shows the importance of considering the effects of natural selection when analyzing population genetic structure, as has been demonstrated by Nielsen et al. (2006). This particularly applies to Atlantic cod, as recent studies have shown that a high proportion of analyzed genetic loci show indications that natural selection may have shaped their variation (e.g., Nielsen et al. 2006; Moen et al. 2008; Pampoulie et al. 2011). Genetic difference was observed when comparing samples from different depths at the south coast of Iceland. Although the overall genetic difference is very small (FST = 0.0017), 5 out of 10 loci show significant difference with depth. The loci showing the highest FST value was Gmo34 (FST = 0.0171). Gmo34 is not a significant outlier when compared with the FST null distribution and can thus not be regarded as being under natural selection with respect to depth. However, knowing its relationship with Pan I (Westgaard and Fevolden 2007), where a strong relationship with depth is also found (Pampoulie et al. 2006; Árnason et al. 2009), it may be that observed variation at Gmo34 is in fact due to this. Removing Gmo34 from the analysis shows a smaller, but significant, difference between fish samples at different depths (FST = 0.0010*,••). This variation may indicate that different populations of Atlantic cod are at different depths and may be related to the behavioral variation that has been described for Atlantic cod in annual migration patterns (Pálsson and Thorsteinsson 2003; Thorsteinsson et al. 2012) and behaviour at spawning grounds (Grabowski et al. 2011). However, it must be pointed out that the grouping of samples into depth groups is not based on any known environmental heterogeneity and may not properly reflect reproductively sepaPublished by NRC Research Press 34 Eiríksson and Árnason rated groups, if present. It must also be pointed out that the observed difference is very small, and it can be questioned if it is biologically meaningful (Waples 1998). Temporal changes in cod length at age on the spawning grounds at the south coast of Iceland may suggest migration of cod that have developed in a different environment (Jónsson 1996). Jónsson (1996) suggested that reduced length at age was an indication of cod from the north coast was entering the spawning ground at a particular time. However, this change may also be caused by cod migrating from deeper waters to the shallow spawning grounds, as studies using data storage tags have revealed (Pálsson and Thorsteinsson 2003). The genetic variation among cohorts observed in the present study showed the effects of genetic drift, as natural selection seems not to have affected the allele frequencies at the loci with respect to cohorts. However, the difference was very subtle, and although a difference in age composition is known between the north and south coast of Iceland (Jónsdóttir et al. 2006), such a small genetic difference is not sufficient to generate an apparent spatial structure, as discussed earlier. Further studies on genetic drift can be used to estimate temporal effective population size, Ne, in cod (Waples 1989). Genotyping errors are of concern in all microsatellite studies, particularly the presence of null alleles. Null alleles result from PCR amplification failure of a particular allele that may be due to primer site sequence mutation. Inspection of genotyping error using Micro-Checker showed that three of the loci used in the present study may contain null alleles, indicated by general homozygotic excess. This is a concern and suggests that the genetic analysis may not represent the true allele and genotype frequencies. It has been shown that in populations where null alleles are present and where significant population differentiation is found, FST and genetic distance may be overestimated (Chapuis and Estoup 2007). However, FST was shown to be unbiased in the absence of population structure (Chapuis and Estoup 2007). Although the present study was focused on Atlantic cod in Icelandic waters, the results have implications for the study of cod in general. It suggests that the reported genetic structure (Pampoulie et al. 2006) may in fact be due to natural selection acting on Gmo34. The same may be true in some other studies where population genetic structure has been proposed, as Gmo34 has been commonly used in the study of population genetics in Atlantic cod (for overview Reiss et al. 2009) and revision may be needed. This is important because footprints of natural selection seem to be common in the cod genome (e.g., Nielsen et al. 2006; Moen et al. 2008). Acknowledgements We thank Snæbjörn Pálsson for useful discussions, particularly on lab work and data analysis. We also thank Juha-Pekka Vähä and Paula Lehtonen for advice on lab work. We thank Guðrún Marteinsdóttir and Jónas P. Jónasson for stimulating discussions and valuable inputs during the work. We thank Kristján Kristinsson at the Icelandic Marine Reseach Institute for cooperation with sampling. We also thank staff members of the population genetics lab at the University of Iceland for various lab assistance and moral support. We thank two anonymous reviewers for their comments on the paper. This project was supported by the Icelandic research fund and the Icelandic research fund for graduate students of The Icelandic Centre for Research. References André, C., Larsson, L.C., Laikre, L., Bekkevold, D., Brigham, J., Carvalho, G.R., Dahlgren, T.G., Hutchinson, W.F., Mariani, S., Mudde, K., Ruzzante, D.E., and Ryman, N. 2011. Detecting population structure in a high gene-flow species, Atlantic herring (Clupea harengus): direct, simultaneous evaluation of neutral vs putatively selected loci. Heredity, 106: 270–280. doi:10.1038/hdy.2010.71. PMID:20551979. Applied Biosystems. 2004. GeneMapper Software Version 3.7 user guide. Tech. Rep. Applied Biosystems. 1157 Árnason, E. 2004. Mitochondrial cytochrome b DNA variation in the highfecundity Atlantic cod: trans-Atlantic clines and shallow gene genealogy. Genetics, 166(4): 1871–1885. doi:10.1534/genetics.166.4.1871. PMID:15126405. Árnason, E., Pálsson, S., and Arason, A. 1992. Gene flow and lack of population differentiation in Atlantic cod, Gadus morhua L., from Iceland, and comparison of cod from Norway and Newfoundland. J. Fish Biol. 40(4): 751–770. doi:10.1111/j.1095-8649.1992.tb02622.x. Árnason, E., Petersen, P.H., Kristinsson, K., Sigurgíslason, H., and Pálsson, S. 2000. Mitochondrial cytochrome b DNA sequence variation of Atlantic cod from Iceland and Greenland. J. Fish Biol. 56(2): 409–430. doi:10.1111/j.10958649.2000.tb02115.x. Árnason, E., Hernandez, U.B., and Kristinsson, K. 2009. Intense habitat-specific fisheries-induced selection at the molecular PanI locus predicts imminent collapse of a major cod fishery. PLoS ONE, 4(5): e5529. doi:10.1371/journal. pone.0005529. PMID:19479037. Beaumont, M., and Nichols, R. 1996. Evaluating loci for use in the genetic analysis of population structure. Proc. R. Soc. B Biol. Sci. 263(1377): 1619–1626. doi:10.1098/rspb.1996.0237. Bradbury, I.R., Hubert, S., Higgins, B., Borza, T., Bowman, S., Paterson, I.G., Snelgrove, P.V.R., Morris, C.J., Gregory, R.S., Hardie, D.C., Hutchings, J.A., Ruzzante, D.E., Taggart, C.T., and Bentzen, P. 2010. Parallel adaptive evolution of Atlantic cod on both sides of the Atlantic Ocean in response to temperature. Proc. R. Soc. B Biol. Sci. 277(1701): 3725–3734. doi:10.1098/rspb.2010. 0985. Braithwaite, V.A., and Salvanes, A.G.V. 2005. Environmental variability in the early rearing environment generates behaviourally flexible cod: implications for rehabilitating wild populations. Proc. R. Soc. B Biol. Sci. 272(1568): 1107– 1113. doi:10.1098/rspb.2005.3062. Brooker, A.L., Cook, D., Bentzen, P., Wright, J.M., and Doyle, R.W. 1994. Organization of microsatellites differs between mammals and cold-water teleost fishes. Can. J. Fish. Aquat. Sci. 51(9): 1959–1966. doi:10.1139/f94-198. Brownstein, M.J., Carpten, J.D., and Smith, J.R. 1996. Modulation of nontemplated nucleotide addition by TaqDNA polymerase: primer modifications that facilitate genotyping. BioTechniques, 20(6): 1004–1010. PMID:8780871. Carr, S.M., Snellen, A.J., Howse, K.A., and Wroblewski, J.S. 1995. Mitochondrial DNA sequence variation and genetic stock structure of Atlantic cod (Gadus morhua) from bay and offshore locations on the Newfoundland continental shelf. Mol. Ecol. 4(1): 79–88. doi:10.1111/j.1365-294X.1995.tb00194.x. PMID: 7711956. Chapuis, M.-P., and Estoup, A. 2007. Microsatellite null alleles and estimation of population differentiation. Mol. Biol. Evol. 24(3): 621–631. doi:10.1093/molbev/ msl191. PMID:17150975. Grabowski, T.B., Thorsteinsson, V., McAdam, B.J., and Marteinsdóttir, G. 2011. Evidence of segregated spawning in a single marine fish stock: sympatric divergence of ecotypes in Icelandic cod? PLoS ONE, 6(3): e17,528. doi:10.1371/ journal.pone.0017528. Guinand, B., Lemaire, C., and Bonhomme, F. 2004. How to detect polymorphisms undergoing selection in marine fishes? A review of methods and case studies, including flatfishes. J. Sea Res. 51(3–4): 167–182. doi:10.1016/j.seares. 2003.10.002. Gupta, A.P., and Lewontin, R.C. 1982. A study of reaction norms in natural populations of Drosophila pseudoobscura. Evolution, 36(5): 934–948. doi:10.2307/ 2408073. Hartl, D.L., and Clark, A.G. 2006. Principles of population genetics. 4th ed. Sinauer Associates, Inc. Publishers, Sunderland, Mass. Jamieson, A., and Jónsson, J. 1971. The Greenland component of spawning cod at Iceland. Rapp. P.-V. Réun. Cons. Int. Explor. Mer, 161: 65–72. Jombart, T. 2008. adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics, 24(11): 1403–1405. doi:10.1093/bioinformatics/btn129. PMID:18397895. Jónsdóttir, I.G., Campana, S.E., and Marteinsdóttir, G. 2006. Otolith shape and temporal stability of spawning groups of Icelandic cod (Gadus morhua L.). ICES J. Mar. Sci. 63(8): 1501–1512. doi:10.1016/j.icesjms.2006.05.006. Jónsdóttir, Ó.D.B., Imsland, A.K., Daníelsdóttir, A.K., Thorsteinsson, V., and Nævdal, G. 1999. Genetic differentiation among Atlantic cod in south and south-east Icelandic waters: synaptophysin (Syp I) and haemoglobin (HbI) variation. J. Fish Biol. 54(6): 1259–1274. doi:10.1111/j.1095-8649.1999.tb02053.x. Jónsdóttir, Ó.D.B., Daníelsdóttir, A.K., and Nævdal, G. 2001. Genetic differentiation among Atlantic cod (Gadus morhua L.) in Icelandic waters: temporal stability. ICES J. Mar. Sci. 58(1): 114–122. doi:10.1006/jmsc.2000.0995. Jónsson, J. 1996. Tagging of cod (Gadus morhua) in Icelandic waters 1948–1986 and tagging of haddock (Gadus aeglefinus) in Icelandic waters 1953–1965. Rit Fiskideildar, 14: 1–82. Jónsson, S., and Valdimarsson, H. 2005. The flow of Atlantic water to the North Icelandic shelf and its relation to the drift of cod larvae. ICES J. Mar. Sci. 62(7): 1350–1359. doi:10.1016/j.icesjms.2005.05.003. Kalinowski, S.T. 2002. How many alleles per locus should be used to estimate genetic distances? Heredity, 88(1): 62–65. doi:10.1038/sj.hdy.6800009. PMID: 11813108. Karlsson, S., and Mork, J. 2003. Selection-induced variation at the pantophysin locus (PanI) in a Norwegian fjord population of cod (Gadus morhua L.). Mol. Ecol. 12(12): 3265–3274. doi:10.1046/j.1365-294X.2003.01993.x. PMID:14629344. Knutsen, H., Olsen, E.M., Jorde, P.E., Espeland, S.H., André, C., and Stenseth, N.C. Published by NRC Research Press 35 Paper I 1158 2011. Are low but statistically significant levels of genetic differentiation in marine fishes ‘biologically meaningful’? A case study of coastal Atlantic cod. Mol. Ecol. 20(4): 768–783. doi:10.1111/j.1365-294X. 2010.04979.x. Kovach, A.I., Breton, T.S., Berlinsky, D.L., Maceda, L., and Wirgin, I. 2010. Finescale spatial and temporal genetic structure of Atlantic cod off the Atlantic coast of the USA. Mar. Ecol. Prog. Ser. 410: 177–195. doi:10.3354/meps08612. Kristjánsson, T. 2013. Comparison of growth in Atlantic cod (Gadus morhua) originating from the northern and southern coast of Iceland reared under common conditions. Fish. Res. 139: 105–109. doi:10.1016/j.fishres.2012.10.005. Li, Y.-C., Korol, A.B., Fahima, T., Beiles, A., and Nevo, E. 2002. Microsatellites: genomic distribution, putative functions and mutational mechanisms: a review. Mol. Ecol. 11(12): 2453–2465. doi:10.1046/j.1365-294X.2002.01643.x. PMID:12453231. Li, Y.-C., Korol, A.B., Fahima, T., and Nevo, E. 2004. Microsatellites within genes: Structure, function, and evolution. Mol. Biol. Evol. 21(6): 991–1007. doi:10. 1093/molbev/msh073. PMID:14963101. McAdam, B.J., Grabowski, T.B., and Marteinsdóttir, G. 2012. Identification of stock components using morphological markers. J. Fish Biol. [ISSN: 1095– 8649.] doi:10.1111/j.1095-8649.2012.03384.x. Miller, K.M., Le, K.D., and Beacham, T.D. 2000. Development of tri- and tetranucleotide repeat microsatellite loci in Atlantic cod (Gadus morhua). Mol. Ecol. 9(2): 238–239. doi:10.1046/j.1365-294x.2000.00804-2.x. PMID:10672170. Moen, T., Hayes, B., Nilsen, F., Delghandi, M., Fjalestad, K.T., Fevolden, S.-E., Berg, P.R., and Lien, S. 2008. Identification and characterisation of novel SNP markers in Atlantic cod: evidence for directional selection. BMC Genet. 9(18): 1–9. doi:10.1186/1471-2156-9-18. Mork, J., Ryman, N., Ståhl, G., Utter, F., and Sundnes, G. 1985. Genetic variation in Atlantic cod (Gadus morhua) throughout its range. Can. J. Fish. Aquat. Sci. 42(10): 1580–1587. doi:10.1139/f85-198. Nielsen, E.E., Hansen, M.M., and Meldrup, D. 2006. Evidence of microsatellite hitch-hiking selection in Atlantic cod (Gadus morhua L.): implications for inferring population structure in nonmodel organisms. Mol. Ecol. 15(11): 3219– 3229. doi:10.1111/j.1365-294X.2006.03025.x. PMID:16968266. Nielsen, E.E., MacKenzie, B.R., Magnussen, E., and Meldrup, D. 2007. Historical analysis of Pan I in Atlantic cod (Gadus morhua): temporal stability of allele frequencies in the southeastern part of the species distribution. Can. J. Fish. Aquat. Sci. 64(10): 1448–1455. doi:10.1139/f07-104. Nielsen, E.E., Hemmer-Hansen, J., Poulsen, N.A., Loeschcke, V., Moen, T., Johansen, T., Mittelholzer, C., Taranger, G.-L., Ogden, R., and Carvalho, G.R. 2009. Genomic signatures of local directional selection in a high gene flow marine organism; the Atlantic cod (Gadus morhua). BMC Evol. Biol. 9(276): 1–11. doi:10.1186/1471-2148-9-276. Oosterhout, C.V., Hutchinson, W.F., Wills, D.P.M., and Shipley, P. 2004. MICROCHECKER: Software for identifying and correcting genotyping errors in microsatellite data. Mol. Ecol. 4(3): 535–538. doi:10.1111/j.1471-8286.2004. 00684.x. O'Reilly, P.T., Canino, M.F., Bailey, K.M., and Bentzen, P. 2000. Isolation of twenty low stutter di- and tetranucleotide microsatellites for population analyses of walleye pollock and other gadoids. J. Fish Biol. 56(5): 1074–1086. doi:10.1111/ j.1095-8649.2000.tb02124.x. Pálsson, Ó.K., and Thorsteinsson, V. 2003. Migration patterns, ambient temperature, and growth of Icelandic cod (Gadus morhua): evidence from storage tag data. Can. J. Fish. Aquat. Sci. 60(11): 1409–1423. doi:10.1139/f03-117. Pampoulie, C., Ruzzante, D.E., Chosson, V., Jörundsdóttir, T.D., Taylor, L., Thorsteinsson, V., Daníelsdóttir, A.K., and Marteinsdóttir, G. 2006. The genetic structure of Atlantic cod (Gadus morhua) around Iceland: insight from microsatellites, the Pan I locus, and tagging experiments. Can. J. Fish. Aquat. Sci. 63(12): 2660–2674. doi:10.1139/f06-150. Pampoulie, C., Jakobsdóttir, K.B., Marteinsdóttir, G., and Thorsteinsson, V. 2008. Are vertical behaviour patterns related to the pantophysin locus in the Atlantic cod (Gadus morhua L.)? Behav. Genet. 38(1): 76–81. doi:10.1007/s10519007-9175-y. PMID:17978867. Pampoulie, C., Daníelsdóttir, A.K., Storr-Paulsen, M., Hovgård, H., Can. J. Fish. Aquat. Sci. Vol. 70, 2013 Hjörleifsson, E., and Steinarsson, B.Æ . 2011. Neutral and nonneutral genetic markers revealed the presence of inshore and offshore stock components of Atlantic cod in Greenland waters. Trans. Am. Fish. Soc. 140(2): 307–319. doi: 10.1080/00028487.2011.567850. Pétursdóttir, G., Begg, G.A., and Mareinsdóttir, G. 2006. Discrimination between Icelandic cod (Gadus morhua L.) populations from adjacent spawning areas based on otolith growth and shape. Fish. Res. 80: 182–189. doi:10.1016/j.fishres. 2006.05.002. Pogson, G.H., and Fevolden, S.-E. 2003. Natural selection and the genetic differentiation of coastal and Arctic populations of the Atlantic cod in northern Norway: A test involving nucleotide sequence variation at the pantophysin (PanI) locus. Mol. Ecol. 12(1): 63–74. doi:10.1046/j.1365-294X.2003.01713.x. PMID:12492878. Pogson, G.H., and Mesa, K.A. 2004. Positive Darwinian selection at the pantophysin (PanI) locus in marine gadid fishes. Mol. Biol. Evol. 21(1): 65–75. doi:10. 1093/molbev/msg237. PMID:12949133. Raymond, M., and Rousset, F. 1995. An exact test for population differentiation. Evolution, 49(6): 1280–1283. doi:10.2307/2410454. Reiss, H., Hoarau, G., Dickey-Collas, M., and Wolff, W.J. 2009. Genetic population structure of marine fish: mismatch between biological and fisheries management units. Fish Fish. 10(4): 361–395. doi:10.1111/j.1467-2979.2008.00324.x. Russello, M.A., Kirk, S.L., Frazer, K.K., and Askey, P.J. 2012. Detection of outlier loci and their utility for fisheries management. Evol. Appl. 5(1): 39–52. doi: 10.1111/j.1752-4571.2011.00206.x. Ryman, N., and Palm, S. 2006. POWSIM: a computer program for assessing statistical power when testing for genetic differentiation. Mol. Ecol. 6(3): 600–602. doi:10.1111/j.1471-8286.2006.01378.x. Salvanes, A.G., Moberg, O., and Braithwaite, V.A. 2007. Effects of early experience on group behaviour in fish. Anim. Behav. 74(4): 805–811. doi:10.1016/j. anbehav.2007.02.007. Tåning, Å.V. 1944. Experiments on meristic and other characters in fishes. I. On the influence of temperature on some meristic characters in sea-trout and the fixation-period of these characters. Medd. Komm. Dan. Fisk. Havunders., Fisk. 11: 1–66. Thorsteinsson, V., Pálsson, Ó.K., Tómasson, G.G., Jónsdóttir, I.G., and Pampoulie, C. 2012. Consistency in the behaviour types of the Atlantic cod: repeatability, timing of migration and geo-location. Mar. Ecol. Progr. Ser. 462: 251–260. doi:10.3354/meps09852. Walsh, P.S., Metzger, D.A., and Higuchi, R. 1991. Chelex 100 as a medium for simple extraction of DNA for PCR-based typing from forensic material. BioTechniques, 10(4): 506–513. PMID:1867860. Waples, R.S. 1989. A generalized approach for estimating effective population size from temporal changes in allele frequency. Genetics, 121(2): 379–391. PMID:2731727. Waples, R.S. 1998. Separating the wheat from the chaff: Patterns of genetic differentiation in high gene flow species. J. Hered. 89(5): 438–450. doi:10.1093/ jhered/89.5.438. Waples, R.S., and Gaggiotti, O. 2006. What is a population? An empirical evaluation of some genetic methods for identifying the number of gene pools and their degree of connectivity. Mol. Ecol. 15(6): 1419–1439. doi:10.1111/j.1365294X.2006.02890.x. Weir, B.S., and Cockerham, C.C. 1984. Estimating F-statistics for the analysis of population structure. Evolution, 38(6): 1358–1370. doi:10.2307/2408641. Westgaard, J.-I., and Fevolden, S.-E. 2007. Atlantic cod (Gadus morhua L.) in inner and outer coastal zones of northern Norway display divergent genetic signature at non-neutral loci. Fish. Res. 85(3): 306–315. doi:10.1016/j.fishres.2007. 04.001. Williams, G.C., Koehn, R.K., and Mitton, J.B. 1973. Genetic differentiation without isolation in the American eel, Anguilla rostrata. Evolution, 27(2): 192–204. doi:10.2307/2406960. Published by NRC Research Press 36 Paper II High levels of gene flow and temporal genetic variation in Atlantic cod, Gadus morhua in Icelandic waters — inferred by mitochondrial DNA sequence variation Guðni Magnús Eiríksson and Einar Árnason Unpublished manuscript © Guðni Magnús Eiríksson and Einar Árnason Guðni Magnús Eiríksson was in charge of genetic sampling and organization, on behalve of the Population Genetics laboratory at the University of Iceland, in cooperation with the Marine Research institute of Iceland. Guðni did all the laboratory work: DNA isolation, mtDNA amplification, purification and prepared the samples for electrophoresis on an ABI-3100 automatic sequencer. He also did the data analysis and interpretation. Guðni was in charge of writing the manuscript. Professor Einar Árnason actively participated in all steps of the work. Einar took part in the experimental design, supervised the molecular work, took part in data analysis, interpretation of the data and participated in writing the manuscript. 37 Paper II 38 1 Abstract Genetic variation in spawning Atlantic cod, Gadus morhua, around Iceland was examined using mitochondrial DNA sequence information. A 328 base pair fragment of the cytochrome b gene in 2656 individuals sampled all around Iceland was examined. Variation was found at 70 sites defining 128 haplotypes. The observed polymorphism was mostly synonymous, suggesting weak or no natural selection acting on it. Demographic analysis of the genetic variation suggest sudden expansion of the Atlantic cod population estimated to have taken place 96 Kyr ago. Rapid changes in allele frequency observed in the present study, particularly in cod from NE-Iceland, suggest that the mtDNA sequence variation is an ideal marker for detecting recent population divergence. Temporal genetic analysis showed heterogenity in haplotype frequency change between NE- and SW-Iceland. This is in accordance with earlier findings that have shown temporal and spatial variation in the contribution of different spawning grounds to each cohort. The variation is likely to be attributed to strength of the warm Atlantic ocean current, but temporal variation in spawning success in different spawning grounds may also be important. Our analysis does not suggest the Atlantic cod around Iceland is divided into distinct historical populations as has been suggested in earlier studies. On the contrary it supports the view of high levels of gene flow in Atlantic cod around Iceland. We suggest that the mixing of cod at times when ocean currents allow high levels of embryo/larvae drift, prevents the evolution of distinct populations. Keywords: Atlantic cod, mtDNA, cytb, natural selection, temporal genetic variation 39 Paper II 2 Introduction 2.1 General biology and phenotypic variation Atlantic cod is among the most valuable commercial fish species in the North Atlantic and has been very important for the fisheries around Iceland. It has a wide distribution and is found in a variety of habitats, from shallow waters close to shore to deeper waters off the continental shelf (Jónsson, 1992). The main spawning grounds for Atlantic cod around Iceland are along the South coast (Jónsson, 1992). Embryos and fish larvae are known to drift with water currents from the main spawning grounds to the West, North and East Iceland. Some years embryos and larvae may drift to the East coast of Greenland (Astthorsson et al., 1994; Jónsson and Valdimarsson, 2005). The extent of embryo and larval drift depends on the strength of the warm North Atlantic water current (Jónsson and Valdimarsson, 2005). In addition to the main spawning grounds cod is known to spawn in different areas all around the country with spatial and temporal variation in their contribution to each cohort (Marteinsdottir et al., 2000; Begg and Marteinsdottir, 2000; Jónsdóttir et al., 2007). Thus, in different locations around Iceland each cohort may be of mixed origin, to a variable extent, depending on the strength of the North Atlantic water current and spatial and temporal variation in spawning success. Indeed, studies have shown cohorts to be of mixed origin, and temporal variation in the contribution of different spawning grounds to the cohorts (e.g. Jónsdóttir et al., 2007; Thorisson et al., 2011). Although the fish may be of mixed origin variable phenotypic differences have been observed in cod from different locations. Difference in length at age, reflecting difference in growth rate, between cod from the North coast of Iceland to that of the South is well known (Jónsson, 1992). Growth related morphological variation, including otolith shape, has been described for cod around Iceland and has been used to argue for separate populations of cod (e.g. Pétursdóttir et al., 2006; Jónsdóttir et al., 2006). In addition behavioral variation has also been observed. Different behavioral types have been described, identified as frontal and coastal behavioural types (Pálsson and Thorsteinsson, 2003; Thorsteinsson et al., 2012). This migration pattern has been linked to genetic variation at the Pan I locus (Pampoulie et al., 2008). However, behavioral traits, like any other phenotypic trait, can be affected by the environment that the organism is exposed to (phenotypic plasticity). In fact, experiments have shown that the development of cod behavior can be affected by environmental factors at early age (Braithwaite and Salvanes, 2005; Salvanes et al., 2007). The observed behavioral variation can thus be environmentally induced. A group of fish, that may share the same place of origin, may develop variable behaviours if exposed to different environments during their development (c.f. Gupta and Lewontin, 1982). The phenotypic variation observed in cod may thus be environmentally introduced, that is cod of different origin may develop similar phenotypic traits if exposed to the same/similar environmental conditions. Environmental factors such as temperature or food availability or interaction of environmental factors may be important in this respect. Such phenotypically distinct groups can be defined as ecological populations although they may not reflect population breeding structure (Waples and Gaggiotti, 2006). Thus, such groups would not be considered different populations under population genetic or evolutionary paradigm (Waples and Gaggiotti, 2006). 2.2 Population genetic studies Intense research effort, for decades, has been directed to the analysis of cod population genetic structure, not least to resolve if the observed phenotypic variation correlates with genetic variation. No commercial fish species has been the subject to as many population genetic studies (for review see Reiss et al., 2009). 40 However, research findings are to some extent conflicting. Different molecular genetic markers have shown different patterns, some markers showing great population genetic structure but others showing little or none (Reiss et al., 2009). It is important to understand what may cause the differences among the different markers in order to better understand the population genetic structure of Atlantic cod. Of particular concern is the importance of natural selection in the shaping of genetic variation in Atlantic cod. Recent studies have shown that footprints of natural selection are common in the cod genome (Moen et al., 2008) and fine scale population breeding structure may in some cases be inflated due to selection (Nielsen et al., 2006; Eiríksson and Árnason, 2013). This may cause environmental heterogeneity to be reflected in the genetic variation, rather than restrictions in gene flow (Williams et al., 1973). A spatial genetic structure has been repeatedly reported for Atlantic cod around Iceland. Jónsdóttir et al. (1999) showed an interesting variation of the Pan I and hemoglobin loci among cod at the South coast of Iceland, a pattern interpreted as a small scale population genetic structure. However, variation at the Pan I locus and the hemoglobin has been shown to be be affected by natural selection (Karlsson and Mork, 2003; Pogson and Mesa, 2004; Andersen et al., 2008; Árnason et al., 2009). Pampoulie et al. (2006) described population genetic structure in cod separating cod from NE- from SW-Iceland using both the Pan I locus and microsatellite markers. The population differentiation was small when based on the presumed neutral microsatellite markers (FST = 0.003). A recent study indicates that the observed difference between cod from North and South Iceland may in fact be caused by natural selection connected to one particular locus (Gmo34) (Eiríksson and Árnason, 2013). However, in the present study this reported population structure will be reviewed and examined further. The use of direct mtDNA sequencing has been important in revealing the population genetic structure in Atlantic cod. The observed genetic variation at mtDNA is mostly at synonymous sites, suggesting weak or no selection (e.g. Árnason, 2004). The effective population size (Ne ) of the mtDNA genome is a quarter of that of nuclear markers making the rate of genetic drift faster (as genetic drift rate is inversely related to Ne ) (Hartl and Clark, 2006). Thus, recently established population structure should become detectable earlier for mtDNA markers than for nuclear markers, on average. However, the mtDNA genome is a single locus and different loci may reflect different history within a population (Hartl and Clark, 2006). Although an obvious weakness of using mtDNA sequence variation is that it is a single locus, studies have reported important findings that differ from patterns reveald from loci that are likely to be under selection. Population genetic studies in Atlantic cod, using mtDNA sequence variation, have indicated limited fine scale population genetic structure and high levels of gene flow among geographic regions (Carr and Marshall, 1991; Árnason et al., 1992; Carr et al., 1995; Árnason and Pálsson, 1996; Árnason et al., 1998, 2000). Summarizing results from many studies of cytochrome b sequence variation across the North Atlantic Árnason (2004) showed no difference among localities within countries, but a trans-Atlantic cline in the common haplotype frequencies. The regularity of the cline indicated considerable gene flow among different regions. 2.3 The present study In the present study mtDNA sequence variation will be used for the analysis of population genetic structure of spawning Atlantic cod around Iceland. Comparison will be made among different regions around the country and a comparison made between NE- and SW-Iceland, were population differentiation has been suggested (Pampoulie et al., 2006; Jónsdóttir et al., 2006). Temporal genetic variation will be evaluated by comparison between sampling years and in particular by comparison among different cohorts. 41 Paper II Table 1: Sample size in different areas around Iceland (Figure 1) and different depth groups. Depth information was missing (DIM) for 160 fish . Area 0-50m 50-100m 100-150m 150-200m >200m DIM 1 39 189 22 0 0 50 2 31 279 34 0 0 10 3 53 31 0 0 0 0 4 62 120 31 19 0 0 5 173 0 0 0 0 35 7 12 40 26 0 0 0 8 46 319 63 0 26 23 9 86 534 145 53 63 42 Total: 502 1512 321 72 89 160 Total 300 354 84 232 208 78 477 923 2656 3 Materials and methods 3.1 Fish samples Atlantic cod was sampled around Iceland in the annual spring survey of the Icelandic Marine Research Institute (MRI) in two consecutive years, 2005 and 2006 (Figure 1). The surveys mainly target spawning cod at shallow waters (< 200m), at known breeding grounds. Samples are also taken at deeper waters in South of Iceland. Gill tissue samples were preserved in 96% ethanol. In total 2656 fish were sampled and successfully sequenced. Sample size varied among the different sampling areas and the depth range at which the fish were caught (Table 1). The waters around Iceland were divided into nine different areas (Figure 1), the same divisions as used in recent studies (e.g. Jónsdóttir et al., 2006; Pampoulie et al., 2006). All areas except number 6 were sampled. Age of fish were determined from otoliths at the Icelandic Marine Research Institute. 3.2 Molecular methods Three different fragments of the cod mitochondrial genome were amplified and sequenced. The fragments are parts of cox1, cyt b and the noncoding TP-spacer and upstream, corresponding to base pairs 5787-6017, 14310-14711 and 15556-15616 in the cod complete mitochondrial genome, respectively (Johansen and Bakke, 1996, GenBank accession number: X99772.1). All three fragments of the mtDNA (992 base pairs) was successfully sequenced in 875 individuals and used for demographic analysis. The cytochrome b was sequenced for a larger sample, 2656 individuals in total. Due to poor quality for parts of the cytochrome b sequences (base pairs 14662-14676) the base pairs 14352-14661 and 14677-14694 (328 base pairs) were used for spatial and temporal population genetic analysis. 3.3 Data analysis Base calling, data assembly and sequence alignment was carried out in Phred (Ewing et al., 1998) and Phrap (Gordon et al., 1998). Sequence data was inspected using Consed (Gordon et al., 1998) and Seaview (Galtier et al., 1996). Trace files for all observed haplotypes were inspected manually, the sequence integrity confirmed and sequences of low quality rejected. Different neutrality tests, Tajima’s D test, Chakraborty’s test and Fu’s FS test, as implemented in Arlequin 3.5.1.2 (Excoffier and Lischer, 2010) were applied. The neutrality tests were applied to the total 42 Figure 1: Atlantic cod sampling locations around Iceland. The numbers refer to divisions used in earlier studies (e.g. Jónsdóttir et al., 2006; Pampoulie et al., 2006). 43 Paper II sample of cytochrome b (N = 2656) and the combined fragment of cytochrome b, spacer and cytochrome oxidase I (N = 875). Genetic differentiation among various groups was analyzed using analysis of molecular variation (AMOVA) (Excoffier et al., 1992), pairwise comparison of ΦST and FST between groups and pairwise comparison using exact test, as implemented in Arlequin. Spatial genetic variation was examined both using genetic distance based on Tamura and Nei (1993) (Φ) and conventional F -statistics (based on frequency of different haplotypes only) but temporal genetic variation was based on conventional F -statistics only (as temporal difference, over a short period of time, will be reflected as change in frequency rather than as a change in genetic distance among haplotypes). The significance of fixation indices in pairwise group comparison was tested using 10,000 permutations (Excoffier et al., 1992). Spatial genetic comparison was made among fish sampled in the defined areas (Figure 1). For further spatial comparison fish sampled in NE-Iceland (areas 3, 4, 5 and 7; N = 602) were compared to fish sampled SW-Iceland (areas 1, 2, 8 and 9; N = 2054; Figure 1) as Pampoulie et al. (2006) suggest that there is a barrier to gene flow among the two components. However, fish sampled in area 7 and the sampled fish in area 8 North of 64◦ N are at the margin of the two suggested areas. Therefore, similar comparisons were first made by grouping all fish from area 8 and 7 with SW-Iceland (moving the potential boundary between the areas further North) and second by leaving out the marginal samples (samples from area 7 and area 8 North of 64◦ N). Comparison was also made among fish sampled at different depth. Depth at which fish were sampled were available for most of the sampled fish (N = 2496, Table 1). Fish were grouped into five 50m depth groups (Table 1). Comparison was made including all sampling areas and also for sampling areas at the South coast of Iceland separately (areas 8 and 9, Table 1). A comparison was also made grouping into shallow (< 200m) and deep (> 200m) for cod sampled at the South coast of Iceland. Bonferroni correction (Sokal and Rohlf, 1995) was used to account for repeated comparisons in order to further examine the significance of observed differences. Analysis of pairwise nucleotide mismatch (mismatch distribution Rogers and Harpending, 1992) was carried out for the overall sample and 10,000 simulated mismatch distributions used to test if the observed distribution differed to the expectations of a population expansion model (Schneider and Excoffier, 1999). The mismatch analysis was implemented in Arlequin. If a mismatch distribution does not deviate from a model predicting population sudden expansion τ can be used to estimate the time from expansion: t = τ /2u, where t is time in years, τ is mode of the mismatch distribution and u is mutation rate in substitutions/year (Rogers and Harpending, 1992). A substitution rate µ̂ = 2 × 10−8 /site/year was used (Avise, 2004). As this is the mutation rate estimated for coding region of the animal mtDNA, the mutation rate for the fragment used here may be slightly higher, as it includes part of the noncoding spacer and tRNA. 4 Results 4.1 Genetic variation Variation was found at 70 sites in the cytochrome b fragment (328 base pairs) defining 128 haplotypes (Appendix I: Tables 1.4–1.7). Number of sites had more than one substitution and in total 83 substitutions were observed, 69 transitions and 14 transversions. Most of the observed polymorphism was synonymous. However, 14 nonsynonymous mutations were found among 15 individuals (Appendix I: Table 1.8). Four different mutations were detected in one codon (at sites 14587–14589, site 82 of the protein), three of which lead to amino acid substitution. Thus, at this site in the cytochrome b four different amino acids were 44 Figure 2: Atlantic cod cyt b haplotype frequencies in North East (NE) and South West (SW) Iceland. Different haplotypes are represented with different shades. The most common haplotype is indicated with black color and the less common once with gradually decreasing intensity of gray. White represents a pool of rare haplotypes. observed (M,V, I and T). These amino acids all have hydrophobic side chains apart from threonine (T), which is uncharged. One site had three different amino acids: E, G and D (E and D are negatively charged but glycine (G), the smallest amino acid, has no side chain) at sites 14521–14523, site 60 of the protein. One haplotype was found at high frequency (> 40%) and another four haplotypes were found at relatively high frequency (three at > 5% and one at 4.7%). Other haplotypes were found at lower frequencies and more than half (69 of the 128) were singletons in the sample. The overall nucleotide diversity was π̂ = 0.0042±0.0029 and haplotype diversity was ĥ = 0.7608 ± 0.0063. Tests of selective neutrality showed deviation from neutral expectations (P value for all neutrality tests applied were < 1 × 10−5). Both Chakraborty’s and Fu’s neutrality tests showed that the number of observed haplotypes exceeded expected number more than ten times. The high number of observed haplotypes resulted in a large deviation between FExp and FObs. in the Ewens-Watterson test and Tajima’s D = −2.17. 4.2 Spatial genetic variation Spatial genetic variation analysis, using AMOVA, showed no overall differences among different areas along the coast of Iceland. Pairwise comparison among different areas around Iceland showed a single significant difference, between area 4 and 8 (FST = 0.0035, P = 0.0450). However, this is not different from what might be expected by chance alone and after Bonferroni correction, no comparison was statistically significant. The frequencies of different haplotypes were similar between NE-Iceland and SW-Iceland (Figure 2) and no significant difference was detected when using conventional F -statistics or when using Tamura and Nei’s genetic distance. Including fish from area 7 together with the SW-Iceland group did not change the result. Neither did excluding the samples from area 7 and area 8 North of latitude 64◦ N (samples that were at the margin of the two areas). 45 Paper II The sampled cod was also divided by depth at which it had been caught but no overall difference was observed. A pairwise difference was observed between cod at 50–100m and 100–150m at the South coast of Iceland, when using conventional FST statistics (FST = 0.0046, P = 0.045). However, when correcting for repeated sampling this is not different from what would be expected from chance alone. No significant difference was observed between fish at deep waters (< 200m) compared to fish at shallow waters (> 200m) at the South coast of Iceland (area 8 and 9). For both AMOVA using conventional F statistics and Tamura and Nei pairwise difference, the genetic differentiation (FST ) was negative, interpreted as nill. 4.3 Temporal genetic variation Comparison using AMOVA did not show significant differences between sampling years. Using AMOVA for comparison among the most abundant cohorts (1994–2002) did not show overall significant differences (FST [8,2520] = 0.0001N S , P = 0.409). However, a pairwise FST comparison showed that the 1995 cohort differed significantly from four of the other eight cohorts and that overall it deviated from most of the other cohorts (Figure 3). However, correcting for repeated testing showed that this pairwise difference was not significant. The nucleotide diversity, π̂, was low for the 1995 cohort compared to other cohorts and the haplotype diversity the lowest among the cohorts (Table 2). Other cohorts did not differ significantly amongst each other (Figure 3). Figure 3: Atlantic cod cyt b haplotype frequencies among cohorts. Different haplotypes are represented with different shades. The most common haplotype is indicated with black color and the less common once with gradually decreasing intensity of gray. White represents a pool of rare haplotypes. Using AMOVA to compare cohorts separately for NE-Iceland showed overall significant difference (FST [8,559] = 0.0102, P = 0.0164, Figure 4a). Pairwise FST comparison revealed that in particular the 1998 cohort deviated from other cohorts in NE-Iceland, being significantly different from four of the other eight cohorts. Using AMOVA to compare cohorts separately for SW-Iceland showed no overall difference (FST [8,1952] = −0.0004, P = 0.613, Figure 4b). Only two significant pairwise differences were observed (1994:1995, P = 0.027 and 1997:2001, P = 0.045). Taking repeated testing into account, this difference was not significant. Comparison of the deviations of the most common haplotype relative frequency from its overall mean frequency in NE-Iceland and SW-Iceland show that there is a significantly greater deviations in the NE- 46 Table 2: Molecular diversity among different Atlantic cod cohorts. π̂ is nucleotide diversity; ĥ is haplotype diversity; SD is standard deviation. Cohort π̂ ± SD(×100) 1994 0.466± 0.315 1995 0.397± 0.277 1996 0.474± 0.315 1997 0.430± 0.292 1998 0.395± 0.274 1999 0.417± 0.285 2000 0.424± 0.289 2001 0.442± 0.301 2002 0.447± 0.306 ĥ ± SD 0.799 ± 0.0306 0.709 ± 0.0424 0.785 ± 0.0244 0.769 ± 0.0155 0.739 ± 0.0147 0.766 ± 0.0125 0.766 ± 0.0144 0.774 ± 0.0401 0.806 ± 0.0406 Iceland (mean: 8.9%) than in SW-Iceland (mean: 4.3%) (Pairwise t-test: t = 2.64, df = 16, P = 0.018; figure 4c). Heterogenity tests were used for further comparison of different cohorts between regions around Iceland. A comparison of haplotype frequencies between cohorts showed significant differences between the NE- and SW-Iceland (Figure 4). Comparing the frequency of the two most common haplotypes and the pooled frequency of other haplotypes showed that the 1998 cohort was significantly different between NEand SW-Iceland (G = 11.56, df = 2, P < 0.01) and that there was an overall difference in haplotype frequencies between the cohorts in NE- and SW-Iceland (G = 27.60, df = 16, P < 0.05). Using only the most common haplotype and the others pooled showed that the 1998 and 1999 cohorts were significantly different between NE- and SW-Iceland (1998: G = 7.86, df = 1, P < 0.01; 1999: G = 3.56, df = 1, P < 0.05) and that there was also an overall difference in haplotype frequencies between the cohorts in NE- and SW-Iceland (G = 17.46, df = 8, P < 0.05). 4.4 Historical demograpic analysis The mismatch distribution for the 328 bp cytochrome b fragment did not deviate from prediction of a population sudden expansion model (PSSD = 0.067) (Rogers and Harpending, 1992). The mismatch distribution of the combined 992 bp mitochondrial DNA fragment did not deviate from the prediction of a population sudden expansion model (PSSD = 0.885, Figure 5). The distribution of pairwise mismatches is indicative of sudden population expansion event estimated to have taken place 96 Kyr ago (CI: 37-142 Kyr). 5 Discussion The observed polymorphism in the present study is primarily found at synonymous sites, therefore it is not likely to be affected by natural selection. However, neutrality tests indicate that natural selection may have shaped the observed polymorphism. In particular the high number of rare haplotypes generate deviation from neutral expectations in these tests. Such deviation may also be caused by demographic effects of population sudden expansion. A mismatch analysis suggests that this may in fact be the case, that a sudden expansion of the Atlantic cod population may have occurred around 96 Kyr ago. If this estimated timing of expansion is correct, it may suggest that expansion of the cod population may by correlated with the earths cooling climate that initiated ca. 100 Kyr ago (Andersen et al., 2004). The estimate of the timing 47 Paper II Figure 4: Atlantic cod cytochrome b haplotype frequencies among cohorts. Different haplotypes are represented with different shades. The most common haplotype is indicated with black color and the less common one’s with gradually decreasing intensity of gray. White represents a pool of rare haplotypes. (a) NE-Iceland, (b) SW-Iceland. (c) The deviation of the most common haplotype relative frequency from its overall mean frequency in NE-Iceland (in blue) and SW-Iceland (in red). 48 Figure 5: Pairwise nucleotide mismatch distribution for a 992 base pair mtDNA fragment in Atlantic cod and prediction of a sudden expansion model expressed with line and dots. of population sudden expansion is based on the mtDNA mutation rate (Avise, 2004). Recent studies have suggested that mutation rate may be much higher than previously estimated (Burridge et al., 2008). If that is the case, the timing of population expansion may be different. Also, Árnason and Halldórsdóttir (2014) suggest that time–scales may be mush faster in high fecundity organisms like Atlatic cod. However, it is likely that population sudden expansion may explain the deviation from neutral expectations, although natural selection cannot be ruled out (e.g. see Bazin et al., 2006). Our analysis suggests that Atlantic cod around Iceland is not divided into distinct historical populations as has been suggested in earlier studies (e.g. Jamieson and Jónsson, 1971; Jónsdóttir et al., 1999; Pampoulie et al., 2006; Jónsdóttir et al., 2006). The variation among cohorts, in NE-Iceland in particular, shows how rapidly haplotype frequencies can change, implying small effective population size (Ne ). Such rapid frequency changes make the observed mtDNA sequence variation useful for detecting recent population divergence. However, no such population structure is observed which supports the view of high levels of gene flow in Atlantic cod around Iceland (Árnason, 2004; Eiríksson and Árnason, 2013). Although high levels of gene flow is evident from the present study it is interesting, and may at first seem contradicting, to find genetic differences within cohorts between NE- and SW-Iceland. However, this may reflect temporal variation in the strength of the North Atlantic water current (Jónsson and Valdimarsson, 2005) and/or temporal variation in spawning success at different spawning grounds. Given the large allele frequency changes one would expect large genetic difference to accumulate among different localities around Iceland in the absence of gene flow. For example if changes like those observed in 1998 would persist over prolonged period of time large differences would accumulate. However, the changes seem to be random with the same allele increasing or decreasing in frequency and therefore any such structure is expected to be temporally unstable and to break down easily. Overall difference between the areas would most likely have established already, given the rapid changes in allele frequencies observed in the present study (particularly in NE-Iceland). It is thus possible that the reported difference between NE-Iceland and SW-Iceland (c.f. Pampoulie et al., 2006) does reflect a temporally unstable phenomena rather than histor- 49 Paper II ically stable population genetic structure. The present study suggests that high levels of gene flow to be more appropriate description of Atlantic cod breeding structure around Iceland. Natal homing has been suggested for cod to variable extent in different regions (Robichaud and Rose, 2004). The results of the present study suggests that homing is not likely to apply to Atlantic cod around Iceland. However, Atlantic cod is known to be philopatric, particularly at juvenile stage, larger/older fish being more likely to undertake long distance migrations (Jónsson, 1992, 1996). This may result in phenotypic variation due to phenotypic plasticity in cod developing in heterogenous environment. Different groups of Atlantic cod may develop distinct phenotypes due to exposure to different environmental conditions. However, high levels of gene flow, even panmixia, may exist between phenotypically distinct groups. Such phenotypic variation can be used to define ecological populations (Waples and Gaggiotti, 2006). However, they would not be considered separate populations under a population genetic or evolutionary paradigm as limited genetic differentiation will establish due to high levels of gene flow (Waples and Gaggiotti, 2006). The present study supports earlier findings on temporal and spatial variation in the contribution of different spawning grounds to each cohort (e.g. Marteinsdottir et al., 2000; Begg and Marteinsdottir, 2000, 2003; Jónsdóttir et al., 2006, 2007; Thorisson et al., 2011), reflected in the observed temporal genetic variation. The variation is likely to be attributed to strength of the warm Atlantic ocean current (Jónsson and Valdimarsson, 2005), but temporal variation in spawning success in different spawning grounds may also be important. We suggest that mixing of cod at times when ocean currents allow high levels of embryo/larvae drift, and thus high levels of gene flow, prevents the evolution of genetically distinct populations. 6 Acknowledgement We would like to thank Dr Snæbjörn Pálsson for useful discussions particularly on lab work and data analysis. We thank Dr Guðrún Marteinsdóttir and Jónas P. Jónasson for stimulating discussions and valuable inputs during the work. We thank Kristján Kristinsson at the Icelandic Marine Reseach Institute for cooperation in sampling. We also like to thank staff members of the population genetics lab in the University of Iceland for various lab assistance and moral support. This project was supported by The Icelandic research fund and the Icelandic research fund for graduate students of The Icelandic Centre for Research. References Andersen, K. K., Azuma, N., Barnola, J.-M., Bigler, M., Biscaye, P., Caillon, N., Chappellaz, J., Clausen, H. B., Dahl-Jensen, D., Fischer, H., Flückiger, J., Fritzsche, D., Fujii, Y., Goto-Azuma, K., Grønvold, K., Gundestrup, N. S., Hansson, M., Huber, C., Hvidberg, C. S., Johnsen, S. J., Jonsell, U., Jouzel, J., Kipfstuhl, S., Landais, A., Leuenberger, M., Lorrain, R., Masson-Delmotte, V., Miller, H., Motoyama, H., Narita, H., Popp, T., Rasmussen, S. O., Raynaud, D., Rothlisberger, R., Ruth, U., Samyn, D., Schwander, J., Shoji, H., Siggard-Andersen, M.-L., Steffensen, J. P., Stocker, T., Sveinbjörnsdóttir, A. E., Svensson, A., Takata, M., Tison, J.-L., Thorsteinsson, T., Watanabe, O., Wilhelms, F., White, J. W. C., and members, N. G. I. C. P., 2004. High-resolution record of northern hemisphere climate extending into the last interglacial period. Nature, 431: 147–151. Andersen, O., Wetten, O. F., Rosa, M. C. D., Andre, C., Alinovi, C. C., Colafranceschi, M., Brix, O., and Colosimo, A., 2008. Haemoglobin polymorphisms affect the oxygen-binding properties in Atlantic cod populations. Proceedings of the Royal Society B, 276: 833–841. Árnason, E., 2004. Mitochondrial cytochrome b DNA variation in the high-fecundity Atlantic cod: TransAtlantic clines and shallow gene genealogy. Genetics, 166: 1871–1885. Árnason, E. and Halldórsdóttir, K., 2014. Nucleotide variation and balancing natural selection at the Ckma gene in Atlantic cod: Analysis with multiple merger coalescent models. PeerJ PrePrints, 2: e528v1. 50 Árnason, E., Hernandez, U. B., and Kristinsson, K., 2009. Intense habitat-specific fisheries-induced selection at the molecular Pan I locus predicts imminent collapse of a major cod fishery. PLoS ONE, 4: e5529. Árnason, E., Petersen, P. H., Kristinsson, K., Sigurgíslason, H., and Pálsson, S., 2000. Mitochondrial cytochrome b DNA sequence variation of Atlantic cod from Iceland and Greenland. Journal of Fish Biology, 56: 409–430. Árnason, E., Petersen, P. H., and Pálsson, S., 1998. Mitochondrial cytochrome b DNA sequence variation of Atlantic cod, Gadus morhua, from the Baltic and the White seas. Hereditas, 129: 37–43. Árnason, E. and Pálsson, S., 1996. Mitochondrial cytochrome b DNA sequence variation of Atlantic cod Gadus morhua, from Norway. Molcular Ecology, 5(6): 715–724. Árnason, E., Pálsson, S., and Arason, A., 1992. Gene flow and lack of population differentiation in Atlantic cod, Gadus morhua L., from Iceland, and comparison of cod from Norway and Newfoundland. Journal of Fish Biology, 40: 751–770. Astthorsson, O., Gislason, A., and Gudmundsdottir, A., 1994. Distribution, abundance, and length of pelagic juvenile cod in Icelandic waters in relation to environmental conditions. ICES Marine Science Symposia, 198: 529–541. Avise, J. C., 2004. Molecular Markers, Natural History, and Evolution. Sinauer Associates, Inc., second ed. Bazin, E., Glémin, S., and Galtier, N., 2006. Population size does not influence mitochondrial genetic diversity in animals. Science, 312: 570–572. Begg, G. A. and Marteinsdottir, G., 2000. Spawning origins of pelagic juvenile cod Gadus morhua inferred from spatially explicit age distributions: potential influences on year-class strength and recruitment. Marine Ecology Progress Series, 202: 193–217. Begg, G. A. and Marteinsdottir, G., 2003. Spatial partitioning of relative fishing mortality and spawning stock biomass of Icelandic cod. Fisheries Reseach, 59: 343–362. Braithwaite, V. A. and Salvanes, A. G. V., 2005. Environmental variability in the early rearing environment generates behaviourally flexible cod: implications for rehabilitating wild populations. Proceedings of the Royal Society B, 272: 1107–1113. Burridge, C. P., Craw, D., Fletcher, D., and Waters, J. M., 2008. Geological dates and molecular rates: fish DNA sheds light on time dependency. Molecular Biology and Evolution, 25: 624–633. Carr, S. M. and Marshall, H. D., 1991. Detection of intraspecific DNA sequence variation in the mitochondrial cytochrome b gene of Atlantic cod (Gadus morhua) by the polymerase chain reaction. Canadian Journal of Fisheries and Aquatic Sciences, 48: 48–52. Carr, S. M., Snellen, A. J., Howse, K. A., and Wroblewski, J. S., 1995. Mitochondrial DNA sequence variation and genetic stock structure of Atlantic cod (Gadus morhua) from bay and offshore locations on the Newfoundland continental shelf. Molecular Ecology, 4: 79–88. Eiríksson, G. M. and Árnason, E., 2013. Spatial and temporal microsatellite variation in spawning Atlantic cod, Gadus morhua, around Iceland. Canadian Journal of Fisheries and Aquatic Sciences, 70: 1151– 1158. Ewing, B., Hillier, L., Wendl, M. C., and Green, P., 1998. Base-calling of automated sequencer traces using Phred. I. Accuracy assessment. Genome Research, 8: 175–185. Excoffier, L. and Lischer, H. E. L., 2010. Arlequin suite ver 3.5: A new series of programs to perform population genetics analyses under linux and windows. Molecular Ecology Resources, 10: 564–567. 51 Paper II Excoffier, L., Smouse, P. E., and Quattro, J. M., 1992. Analysis of molecular variance inferred from metric distances among DNA haplotypes: Application to human mitochondrial DNA restriction data. Genetics, 131: 479–491. Galtier, N., Gouy, M., and Gautier, C., 1996. SEAVIEW and PHYLO_WIN: two graphic tools for sequence alignment and molecular phylogeny. CABIOS, Computer Applications in the Biosciences, 12: 543–548. Gordon, D., Abajian, C., and Green, P., 1998. Consed: a graphical tool for sequence finishing. Genome Research, 8: 195–202. Gupta, A. P. and Lewontin, R. C., 1982. A study of reaction norms in natural populations of Drosophila pseudoobscura. Evolution, 36: 934–948. Hartl, D. L. and Clark, A. G., 2006. Principles of Population Genetics. Sinauer Associates, Inc Publishers, Sunderland, Massachusetts, fourth ed., 652 pp. Jamieson, A. and Jónsson, J., 1971. The Greenland component of spawning cod at Iceland. Conseil International pour l’exploration de la Mer. Rapports et Proces-Verbaux, 161: 65–72. Johansen, S. and Bakke, I., 1996. The complete mitochondrial DNA sequence of Atlantic cod (Gadus morhua): Relevance to taxonomic studies among codfishes. Molecular Marine Biology and Biotechnology, 5: 203–214. Jónsdóttir, I., Marteinsdottir, G., and Campana., S., 2007. Contribution of different spawning components to the mixed stock fishery for cod in Icelandic waters. ICES Journal of Marine Science, 64: 1749–1759. Jónsdóttir, I. G., Campana, S. E., and Marteinsdóttir, G., 2006. Otolith shape and temporal stability of spawning groups of Icelandic cod (Gadus morhua L.). ICES Journal of Marine Science, 63: 1501–1512. Jónsdóttir, Ó. D. B., Imsland, A. K., Daníelsdóttir, A. K., Thorsteinsson, V., and Nævdal, G., 1999. Genetic differentiation among Atlantic cod in south and south-east Icelandic waters: Synaptophysin (Syp I) and haemoglobin (HbI) variation. Journal of Fish Biology, 54: 1259–1274. Jónsson, G., 1992. Íslenskir fiskar (in Icelandic). Fjölvaútgáfan, Reykjavík, 568 pp. Jónsson, J., 1996. Tagging of cod (Gadus morhua) in Icelandic waters 1948–1986 and tagging of haddock (Gadus aeglefinus) in Icelandic waters 1953-1965. Rit Fiskideildar, 14: 1–82. Jónsson, S. and Valdimarsson, H., 2005. The flow of Atlantic water to the North Icelandic shelf and its relation to the drift of cod larvea. ICES Journal of Marine Science, 62: 1350–1359. Karlsson, S. and Mork, J., 2003. Selection-induced variation at the pantophysin locus (Pan I) in a Norwegian fjord population of cod (Gadus morhua L.). Molecular Ecology, 12: 3265–3274. Marteinsdottir, G., Gudmundsdottir, A., Thorsteinsson, V., and Stefansson, G., 2000. Spatial variation in abundance, size composition and viable egg production of spawning cod (Gadus morhua L.) in Icelandic waters. ICES Journal of Marine Science, 57: 824–830. Moen, T., Hayes, B., Nilsen, F., Delghandi, M., Fjalestad, K. T., Fevolden, S.-E., Berg, P. R., and Lien, S., 2008. Identification and characterisation of novel SNP markers in Atlantic cod: Evidence for directional selection. BMC Genetics, 9: 1–9. Nielsen, E. E., Hansen, M. M., and Meldrup, D., 2006. Evidence of microsatellite hitch-hiking selection in Atlantic cod (Gadus morhua L.): Implications for inferring population structure in nonmodel organisms. Molecular Ecology, 15: 3219–3229. Pálsson, Ó. K. and Thorsteinsson, V., 2003. Migration patterns, ambient temperature and growth of Icelandic cod (Gadus morhua): Evidence from storage tag data. Canadian Journal of Fisheries and Aquatic Sciences, 60: 1409–1423. 52 Pampoulie, C., Jakobsdóttir, K. B., Marteinsdóttir, G., and Thorsteinsson, V., 2008. Are vertical behaviour patterns related to the pantophysin locus in the Atlantic cod (Gadus morhua L.)? Behavioral Genetics, 38: 76–81. Pampoulie, C., Ruzzante, D. E., Chosson, V., Jörundsdóttir, Þ. D., Taylor, L., Thorsteinsson, V., Daníelsdóttir, A. K., and Marteinsdóttir, G., 2006. The genetic structure of Atlantic cod (Gadus morhua) around Iceland: Insight from microsatellites, the Pan I locus, and tagging experiments. Canadian Journal of Fisheries and Aquatic Sciences, 63: 2660–2674. Pogson, G. H. and Mesa, K. A., 2004. Positive Darwinian selection at the pantophysin (Pan I) locus in marine gadid fishes. Molecular Biology and Evolution, 21: 65–75. Pétursdóttir, G., Begg, G. A., and Mareinsdóttir, G., 2006. Discrimination between Icelandic cod (Gadus morhua L.) populations from adjacent spawning areas based on otolith growth and shape. Fisheries Research, 80: 182–189. Reiss, H., Hoarau, G., Dickey-Collas, M., and Wolff, W. J., 2009. Genetic population structure of marine fish: Mismatch between biological and fisheries management units. Fish and Fisheries, 10: 361–395. Robichaud, D. and Rose, G. A., 2004. Migratory behaviour and range in Atlantic cod: inference from a century of tagging. Fish and Fisheries, 5: 185–214. Rogers, A. and Harpending, H., 1992. Population growth makes waves in the distribution of pairwise genetic differences. Molecular Biology and Evolution, 9: 552–569. Salvanes, A. G., Moberg, O., and Braithwaite, V. A., 2007. Effects of early experience on group behaviour in fish. Animal Behaviour, 74: 805–811. Schneider, S. and Excoffier, L., 1999. Estimation of past demographic parameters from the distribution of pairwise differences when the mutation rates vary among sites: Application to human mitochondrial DNA. Genetics, 152: 1079–1089. Sokal, R. R. and Rohlf, F. J., 1995. Biometry. The Principles and Practices of Statistics in Biological Research. W. H. Freeman. New York, 3 ed., 887 pp. Tamura, K. and Nei, M., 1993. Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Molecular Biology and Evolution, 10: 512–526. Thorisson, K., Jónsdóttir, I. G., Marteinsdottir, G., and Campana, S. E., 2011. The use of otolith chemistry to determine the juvenile source of spawning cod in Icelandic waters. ICES Journal of Marine Science, 68: 98–106. Thorsteinsson, V., Pálsson, Ó. K., Tómasson, G. G., Jónsdóttir, I. G., and Pampoulie, C., 2012. Consistency in the behaviour types of the Atlantic cod: repeatability, timing of migration and geo-location. Marine Ecology Progress Series, 462: 251–260. Waples, R. S. and Gaggiotti, O., 2006. What is a population? An empirical evaluation of some genetic methods for identifying the number of gene pools and their degree of connectivity. Molecular Ecology, 15: 1419–1439. Williams, G. C., Koehn, R. K., and Mitton, J. B., 1973. Genetic differentiation without isolation in the American eel, Anguilla rostrata. Evolution, 27: 192–204. 53 Paper III Gene flow across the N-Atlantic and sex-biased dispersal inferred from mtDNA sequence variation in saithe, Pollachius virens Guðni Magnús Eiríksson and Einar Árnason, 2015 Environmental Biology of Fishes, 98: 67–79. doi:10.1007/s10641-014-0237-8 © Environmental Biology of Fishes Guðni Magnús Eiríksson actively particiated in assembling samples from different parts of the world, took part in sampling trips around Iceland in 2006 and around Faroe Islands the same year. Guðni did all the molecular laboratory work: DNA isolation, mtDNA amplification, purification and prepared the samples for electrophoresis on an ABI-3100 automatic sequencer. Guðni was also in charge of data analysis and interpretation. Guðni was in charge of writing the manuscript, corresponded to the comments of reviewers and finalized the article. Professor Einar Árnason actively participated in all steps of the work. Einar took part in the experimental design, supervised the molecular work, took part in data analysis, interpretation of the data and participated in writing the manuscript and finalizing the article. 55 Paper III 56 Environ Biol Fish (2015) 98:67–79 DOI 10.1007/s10641-014-0237-8 Gene flow across the N-Atlantic and sex-biased dispersal inferred from mtDNA sequence variation in saithe, Pollachius virens Guni Magnús Eirı́ksson · Einar Árnason Received: 12 May 2013 / Accepted: 20 January 2014 / Published online: 19 March 2014 © Springer Science+Business Media Dordrecht 2014 Abstract Genetic variation in saithe Pollachius virens was examined using a 460 base pair fragment of the cytochrome c oxidase subunit I mitochondrial gene in 1163 individuals sampled in Canada, Iceland, Faroe Islands and Norway. In all, 43 segregating sites were observed, almost all synonymous, defining 51 haplotypes. High frequency polymorphism was present at the analyzed fragment making it suitable for population genetic studies. The results showed limited trans Atlantic genetic structure indicating high levels of gene flow. However, a spatial genetic structure was observed when considering the sexes separately. This may suggest sex-biased migration pattern. The data indicate that females may be more philopatric and males more migratory. Such behaviour has rarely been described for marine fish and is worth further research. The observed genetic variation also indicates that saithe has undergone sudden population expansion, reflected in high number of singletons and a shallow genealogy. Keywords mtDNA · COI · Gene flow · Sex-biased dispersal · Demography G. M. Eirı́ksson () · E. Árnason Institute of Life and Environmental Sciences, University of Iceland, Sturlugata 7, 101 Reykjavı́k, Iceland e-mail: [email protected] Introduction The life history of many marine organisms is characterized by high fecundity, planktonic eggs and larvae and high dispersal potential with oceanic currents (Hedgecock 1994; Hedgecock et al. 2007; Lee and Boulding 2009). Genetic differentiation among populations, FST , has been shown to be negatively correlated to estimated dispersal capability (Waples 1987) although recent study showed that this may not always be the case (Weersing and Toonen 2009). Studies have shown low population differentiation among marine fish species compared to freshwater species. This pattern might be explained by lack of physical barriers in the marine environment, migration of adult fish as well as passive dispersal of eggs and larvae with ocean currents (Ward et al. 1994). It is also clear that it will take time for genetic difference to become established between isolated groups. Separate populations may thus be similar due to insufficient time for divergence. This is particularly true for large populations (large Ne ) as rate of genetic drift is inversely related to effective population size, and thus more time will be needed for genetic difference to become established in large populations (e.g. Kimura 1984). Many factors may contribute to the generation of population genetic structure, even in the face of high gene flow. For example natal homing has been suggested for Atlantic cod, Gadus morhua, to variable extent in different regions (Robichaud and Rose 2004). Such 57 Paper III 68 behavioural patterns may contribute to geographic population genetic structure although larvae may drift long distances with water currents. Genetic variation may also reflect different migration patterns between sexes. Such sexual variation is common for vertebrate species (Prugnolle and de Meeus 2002; Petit and Excoffier 2009), although few cases have been reported for marine fish (but see Schrey and Heist 2003; Shaw et al. 2004). If females are more philopatric and males more migratorial, mtDNA variation is expected to show geographic population structure in the females, but less so for males. Different migration patterns between sexes may also be revealed by comparison between mitochondrial and nuclear loci (Prugnolle and de Meeus 2002). A study of population genetic structure of organisms is important for basic understanding of relationships among different groups, their origin and gene flow. Stock structure is fundamental information for fisheries management and knowledge of the dynamics of the resource in space and time is essential for reliable assessment and successful management (Cadrin et al. 2004). Genetic variation and genealogy of high latitude organism have been strongly influenced by climatic oscillations during the Pliocene and Pleistocene (Bernatchez and Dodson 1991; Hewitt 1996; Avise and Walker 1998; Wares and Cunningham 2001; Pálsson et al. 2009). Climatic oscillations have caused population bottlenecks for many species followed by a period of expansion. Genetic variation is lost through extinction of lineages during bottleneck periods generating shallow gene genealogies and reducing effective population sizes, Ne . During periods of population expansion, however, the probability of lineage extinction is reduced and genetic variation increases in a population (Avise 2000). The expected pattern of genetic variation as a result of repeated demographic changes is a shallow genealogy with many rare types. This is indeed the observed pattern in many marine fish species (e.g. Sigurgı́slason and Árnason 2003; Árnason 2004; Pálsson et al. 2009; Liu et al. 2010; Eirı́ksson and Árnason 2014). Thus, genetic variation can be used to make inference about demographic history of a population. Nucleotide mismatch analysis has been used extensively to make predictions of demographic history (Rogers and Harpending 1992) but more recent methods, for example the skyline method introduced by Pybus et al. (2000), have 58 Environ Biol Fish (2015) 98:67–79 improved our ability to reconstruct past demographic history embedded in the genealogical relationship among individuals of a population (Emerson et al. 2001; Drummond et al. 2005). Saithe, Pollachius virens, is a commercially important demersal fish species distributed on both sides of the North Atlantic. It is distributed from Barents sea (Spitsbergen) in the North to the Bay of Biscay (France) in the South, along the coast of Canada in the West and Southwards to the coast of North Carolina. It is common around Iceland but rare around Greenland (Jónsson 1992). Spawning time varies across the Atlantic ocean. NE–Atlantic saithe spawn in February–March but NW–Atlantic saithe are reported to have a wider spawning season, from October to March (Jónsson 1992). Egg and larvae are pelagic and can thus be carried with ocean currents (Brickman et al. 2006). Saithe is known to migrate long distances in search of food but spawning takes place in shallow waters (Jónsson 1992). Results of a long term tagging experiment (Jakobsen and Olsen 1987) reported substantial saithe migration from northern Norway to Iceland, frequently to an extent characterized as mass movements. In a more recent study clear inshore/offshore movements during summer was observed for saithe around Iceland and variation in migratory routes among tagging areas (Armannsson et al. 2007). Neilson et al. (2006) suggested that saithe off the coast of Canada forms three separate populations based on findings in a tagging recapture study. The different findings among the different studies suggest that migratory behaviour may be associated with some life history stage (e.g. adult fish may be more likely to migrate as has been reported for Atlantic cod Gadus morhua, Schopka et al. 2006) or that migratory behaviour may vary among different saithe groups. As described earlier saithe has high potential for egg and larval dispersal and is known for long distance migrations of adult fish. This is likely to contribute to gene flow in saithe among different geographic localities. A recent microsatellite study on closely related species, the pollack (Pollachius pollachius), along the European coast revealed very limited genetic variation among geographic localities (Charrier et al. 2006). Given this potential for gene flow, little genetic divergence among saithe groups from different waters is predicted. However, there are no reported cases of Environ Biol Fish (2015) 98:67–79 migrations between NW-Atlantic and the NE-Atlantic, and this might be reflected in the population genetic structure of the species. Limited work has been done on the population genetics of saithe (Reiss et al. 2009). In the present study we examine population genetic structure in saithe in the North-Atlantic ocean. We compare differences between sexes and compare differences among and within geographic localities for the waters around Canada, Iceland, Faroe Islands and Norway. Materials and methods Fish samples Fish samples were obtained in co-operation with various marine research institutions (Table 1, Fig. 1). In all cases gill tissue samples were taken and preserved in 96 % ethanol. No individual biological information was available for Norway samples. Sex, length and weight of individual fish was recorded for the other countries and age was determined by otolith inspection for fish sampled in Iceland and Canada. Molecular methods DNA was isolated using a chelex method with modifications (Walsh et al. 1991). A 460 base pair fragment was amplified from the 5 region of the cytochrome c oxidase subunit I (COI) mitochondrial gene using pvL6082-5CCTGCTGGAGGAGGTGATCC3 and pvH6580-5CCTGCTGGAGGAGGTGATCC3 . The fragment corresponds to sites 6082–6542 in the saithe complete mtDNA genome sequence (Coucheron et al. 2011). A 19μl PCR reaction mix contained 0.17mM dNTP, 0.11mg/μl BSA, 0.36pM of each primer (L6082, H6580), and 0.05U Taq Polymerase buffered with 10× ThermoPol buffer (New England BioLabs - NEB) together with DNA template (median concentration 0.26ng/μl in reaction mix). The DNA was denatured at 94 ◦ C for 5 minutes followed by 35 cycles of denaturing for 1 minute, annealing at 56 ◦ C for 30 seconds extension at 72 ◦ C for 1 minute, followed by 7 minutes extension at 72 ◦ C. A 5 μl of the product was run for 10 min at 90 Volts on an 1.5 % agarose gel containing ethidium bromide. The gel was then inspected under UV light for amplified DNA. A sample of 5μl succesfully 69 amplified DNA was purified using 2U Exonuclease I (NEB) and 1U Antarctic phosphatase (NEB). Sequencing was carried out using ABI BigDye Terminator v3.1 using 0.1 pM of the pvH6580 primer and 5μl of the purified DNA in a total reaction volume of 15μl using standard protocol with modifications. The reaction product was analyzed on ABI-3100 automatic sequencer. Data analysis Base calling, data assembly and sequence alignment were done using Phred and Phrap (Ewing et al. 1998) and sequence data were viewed using Consed (Gordon et al. 1998) and Seaview (Galtier et al. 1996). Trace files for all observed haplotypes were inspected manually, sequence integrity confirmed and bad sequences rejected based on Phred quality scores. The software DNAsp (Librado and Rozas 2009) and Network 4.5.1.6 (Bandelt et al. 1999) was used to produce median joining network among haplotypes. The tree is based on the most parsimonious relationship among the observed DNA sequences and haplotype frequency used as additional criteria for resolving homoplasy. MEGA 4 (Tamura et al. 2007) was used for translation of nucleotide sequence to amino acid sequence for the observed haplotypes and for the counting of synonymous and nonsynonymous sites. Analysis of Molecular variation (AMOVA) was carried out using Arlequin 3.5.1.2 (Excoffier and Lischer 2010). Pairwise comparison was made among countries for ST , analog to FST based on Weir and Cockerham (1984) and gene flow M estimated among countries. Selective neutrality was tested using Chakraborty’s test, Fu’s FS tests, Ewens-Watterson neutrality test and Tajimas’s D carried out in Arlequin. Method of false discovery rate was used to adjust for multiple testing (Benjamini and Hochberg 1995). Geographic distance between sampling locations was calculated as minimum distance between the sampling localities based on mean values for coordinates for each group using the fields package (Furrer et al. 2011) in R (R development core team 2008). In reality the distance separating saithe in the different sampling locations is longer, since the effects of suitable environments and ocean currents is not taken into account. Mantel tests (with 10,000 permutations) of relationship between genetic and geographical distance was 59 Paper III 70 Environ Biol Fish (2015) 98:67–79 Table 1 Sampling year and sampling sites for saithe in different countries Country Area Month Year Lat. Lon. N Canada Canada Iceland Iceland Iceland Iceland Iceland Faroe Islands Faroe Islands Norway Norway Norway Norway Norway Norway Norway Norway Canada-E Canada-W Iceland-N Iceland-SE Iceland-SE Iceland-SW Iceland-SW Faroe Bank Faroe Plateau Malangsgrunnen Solbergfjorden Vågsfjorden Outer Lofoten Vistenfjorden Sklinnabanken Frøyabanken Romsdalsfjorden July July April October March October March March March September August August August September September October October 2006 2006 2005 2004 2005 2004 2005 2006 2006 1992 1994 1993 1993 1993 1994 1994 1994 43.41 42.94 65.24 63.80 63.80 63.62 63.62 60.94 62.22 69.53 69.08 68.54 67.49 65.43 64.57 63.47 62.35 −61.46 −66.29 −22.59 −14.36 −14.36 −22.50 −22.50 −8.44 −6.59 17.42 17.37 16.56 14.57 12.34 8.28 7.06 7.25 42 91 32 78 91 21 154 57 53 44 46 81 79 82 83 60 69 In total 1163 fish were sampled and successfully analyzed. Position is given as an average of the latitude (Lat.) and the longitude (Lon.) of the sampling stations within areas, in degrees and decimal minutes. N sample size carried out separately for each sex, using the ape package (Paradis et al. 2004) in R. We used Bayesian skyline plot method (BSP) in BEAST v1.6.1 (Drummond et al. 2005) to estimate effective population size with time using MCMC to Fig. 1 Sampling sites are marked with black dots 60 average over tree space, each tree weighted against its posterior probability. Strict molecular clock was set at 2 % substitutions/site/Myr, as commonly used for mtDNA sequence analysis (Avise 2004). Two independent chains were run for 300 million iterations 61 c . . . . . . . . . . . . . . . . . . . . . . . . t . . . . . . . . . . . . . . . . . . . . . . . . . Hapl. COI-01 COI-02 COI-03 COI-04 COI-05 COI-06 COI-07 COI-08 COI-09 COI-10 COI-11 COI-12 COI-13 COI-14 COI-15 COI-16 COI-17 COI-18 COI-19 COI-20 COI-21 COI-22 COI-23 COI-24 COI-25 COI-26 COI-27 COI-28 COI-29 COI-30 COI-31 COI-32 COI-33 COI-34 COI-35 COI-36 COI-37 COI-38 COI-39 COI-40 COI-41 COI-42 COI-43 COI-44 COI-45 COI-46 COI-47 COI-48 COI-49 COI-50 COI-51 a . . . . . . . . g . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 9 5 t . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . c . . . . . . . . . . . 6 0 9 8 c . . . . . . . . . . . . . . . . . . . . g . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 1 1 6 t . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . c . . . . . . . . . . . . . . . 6 1 1 9 t . . . . . . . . . . . . . . . . . . . c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 1 3 1 a . . . . . . . . . . . . . . . . . . . . . . . . . . . . . g . . . . . . . . . . . . . . . . . . . . 6 1 4 3 a . . . . . . . . . . . . . . . . . . . . . c g . . . . . . . g . . . . . . c . . . . . . . . . . . . 6 1 4 9 c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . t . . . . . . . . . . . . . . . . . . 6 1 7 0 c . . . . . . . . . . . . . . . . . . . . . . . t t . . . . . . . t . . . . . . . . . . . . . . . . . 6 1 8 2 t . . . . . . . . . . . . . . . . . . c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 2 0 9 g . . . . . . . a . . . . . . . . . . . . . . . . . . . . a . . . . . . . . . . . . . . . . . . . . . 6 2 1 8 c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . t . . . . . . . . . . . . . . . . 6 2 2 1 g . . . . . . . . . . . . . . . . . a . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 2 3 0 t . . . . . . c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 2 3 9 c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . t . . . . . . . . . . 6 2 4 2 t . . . . . . . . . . . . . . . . c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 2 4 8 a . . . g g . . . . . . . . . . . . . . . . . . . . . . . . . . . . . g . . . . . g . . . . . . . . . 6 2 6 3 c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . t . . . . . . . . . . . . . . 6 2 7 2 a . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . g . . . . . . . . . . . . . 6 2 8 7 c . . . . . . . . . . . . . . . . . . . . . . . . . t . . . . . . . . . . . . . . . . . . . . . . . . 6 2 9 9 t . . . . . . . . . . . . . . . . . . . . . . . . . . . c . . . . . . . . . . . . . . . . . . . . . . 6 3 3 8 c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . t . . . . . . . . 6 3 4 1 t . . . . . . . . . . . . . . . g . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 3 5 0 a . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . g . . . . . . . . . . . . 6 3 5 3 a . . g . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . g g g g . . . . . . . . 6 3 6 8 t . c . . . . c c c . . . . c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 3 6 9 a . . . . . . . . . . g . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 3 8 6 c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . t . . . . . . . 6 3 8 9 a . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . g . . . . . . 6 3 9 8 a . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . g . . . . . 6 4 0 7 c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . g . . . . 6 4 2 2 a . . . . . . . . . . . g . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 4 2 5 t . . . . . . . . . . . . . . c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 4 2 8 c . . . . . . . . . t . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 4 4 9 c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . t . . . 6 4 5 5 t . . . . . . . . . . . . . c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 4 5 6 c . . . . . . . . . . . . . . . . . . . . . . . . . . g . . . . . . . . . . . . . . . . . . . . g . . 6 4 7 0 t . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 4 9 1 c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . g . 6 5 2 3 t c c . . c c c c c c . . . c c c c c c c c c c c c c c . . . . . . . . . . . . . . . . . . . . . . . 6 5 2 4 g . . . . . . . . . . . . c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 5 3 7 c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . g 6 5 3 9 N 612 325 70 61 21 12 7 5 3 3 2 2 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 The site numbers refer to the equivalent site in the complete mtDNA sequence (Coucheron et al. 2011). The combined frequency among sampling localities is presented, N. The dots represent identity of a sequence to the most common haplotype COI-01 6 0 8 3 Table 2 Segregating sites of 51 haplotypes (Hapl.) of the 460 base pairs fragments of cytochrome c oxidase subunit I found among 1163 individual saithe, Pollachius virens, sampled in the North Atlantic Environ Biol Fish (2015) 98:67–79 71 Paper III 72 Environ Biol Fish (2015) 98:67–79 Fig. 2 A median joining network for the 51 observed COI haplotypes. The sizes of circles are proportional to the frequencies of the given haplotype (smallest, N = 1). Different haplotypes are represented with different shades. Black: COI01, dark gray represents C0I-02, intermediate gray: C0I-03 and light gray: C0I-04. Rare haplotypes are represented with open circles. The red circle represents an unobserved but inferred haplotype. Lines between haplotypes reflect single mutational steps between alleles each and the results combined using LogCombiner (Drummond et al. 2005). The first 10 million iterations were discarded in each run to allow for burn-in. jModelTest (Posada 2008) was used to determine that a suitable substitution model for the analysis was Hasegawa-Kishino-Yano with empirical base frequencies and site heterogeneity set to Gamma (4 categories) and invariant sites (HKY+G+I). Tree prior was set at: Coalescent, Bayesian skyline, and piecewiseconstant skyline model selected, with group size m = 10 and auto optimize. The BEAST input file was produced using BEAUTi v.1.6.1 and the results were inspected using TRACER v.1.6.1 (Drummond et al. 2005). A generation time of 4.5 years was used to calculate the Ne after the run (Jónsson 1992). The results were compared to changes in δO 18 from the North Greenland ice core Project as it is indicative of the earth’s climate (δO 18 data available at: www.gfy. ku.dk/∼www-glac/ngrip, (Andersen et al. 2004)). Results Observed genetic variation at COI In total a 460 base pair fragment of the cytochrome c oxidase subunit I (COI) was successfully sequenced in 1163 individual saithe from Canada, Iceland, Faroe Table 3 AMOVA between sexes of saithe in Canada using Tamura and Nei (1993) genetic distance (upper) and based on haplotype frequency only (lower) Source of variation d.f. Variance component Percentage of variation Between sexes Within sexes 1 112 0.010 0.403 Between sexes Within sexes 1 112 0.019 0.299 /F are the various intraclass correlation coefficients 62 /F statistic P 2.44 97.56 ST = 0.024 0.075 5.96 94.04 FST = 0.060 0.009 Environ Biol Fish (2015) 98:67–79 Islands and Norway (Table 1 and Fig. 1). Altogether 43 segregating sites were observed defining 51 haplotypes, at variable frequencies (Table 2). The overall nucleotide diversity was π̂ = 0.0019 ± 0.0015 and the overall haplotype diversity was ĥ = 0.64 ± 0.01. The relationship among haplotypes showed a shallow genealogy (Fig. 2). Two haplotypes (COI-01 and COI-02) were found at high frequencies (>25 %) and a 73 another two (C0I-03 and C0I-04) were found at relatively high frequencies (>5 %). Additionally many rare types different by one mutation from the more common haplotypes were observed (Fig. 2). All but two segregating sites were synonymous. An Alanine to Glycine change (haplotype COI-50, site 6522) and a Valine to Leucine change (haplotype COI14, site 6536). Both haplotypes having the amino acid substitutions were found as singletons in the sample (Table 2). The observed transition/transversion ratio was 36/8 with 23 C-T and 13 A-G transitions vs. 6 C-G, one A-C and one G-T. Genetic differences between sexes 1.0 b AMOVA between sexes in the overall sample showed no difference. However, comparing sexes within countries revealed a significant difference in Canada when using haplotype frequency only (conventional F statistics) but not significant when using (Tamura and Nei 1993) genetic distance (Table 3). Comparison between sexes in Iceland and Faroe Islands did not show a significant difference. Molecular diversity was higher in females compared to males in Canada. The nucleotide diversity was π̂ = 0.0020 ± 0.0016 and π̂ = 0.0015 ± 0.0013, and the haplotype diversity was ĥ = 0.68±0.04 and ĥ = 0.53±0.07 for females and males, respectively. Relative frequency of the most common haplotype 0.6 0.4 0.0 0.2 Relative frequency 0.8 c Canada Fig. 3 Comparison of COI haplotype frequency between sexes in a Canada, b Iceland and c Faroe Islands. Different haplotypes are represented with different shades. Black: COI-01, dark gray: COI-02, intermediate gray: COI-03 and light gray: COI-04. White represents a pool of rare haplotypes Iceland Faroe Isl. Norway Fig. 4 Comparison of COI haplotype frequency among countries. Different haplotypes are represented with different shades. Black: COI-01, dark gray: COI-02, intermediate gray: COI03 and light gray: COI-04. White represents a pool of rare haplotypes 63 Paper III 74 Environ Biol Fish (2015) 98:67–79 Table 4 AMOVA among countries using Tamura and Nei (1993) genetic distance (upper) and based on haplotype frequency only (lower) Source of variation d.f. Variance component Percentage of variation /F statistic P Among countries Among areas within countries Within areas 3 11 1148 −0.0006 −0.0004 0.4405 −0.14 −0.08 100.22 CT = −0.0014 SC = −0.0008 ST = −0.0022 0.80 0.53 0.71 Among countries Among areas within countries Within areas 3 11 1148 −0.0012 0.0015 0.3187 −0.37 0.48 99.89 FCT = −0.0037 FSC = 0.0048 FST = 0.0011 0.93 0.10 0.22 /F are the various intraclass fixation indices (COI-01) was lower in females compared to males (Fig. 4). The results of Mantel test did not indicate isolation by distance for either males or females (Z = 495.29, P = 0.480 for females and Z = 403.14, P = 0.082 for males). In a pairwise FST comparison between countries, separately for each sex (where sample sex information was available), a significant difference was observed between females sampled in Canada and Faroe Island (FST = 0.051; P = 0.017), and also for males (FST = 0.033; P = 0.037) (Fig. 3). The difference between females, but not males, remained significant after accounting for repeated testing. The frequencies of different haplotypes for females sampled in Iceland-SW and Iceland-SE were different (pairwise comparison: FST = 0.022 and P = 0.038). However, taking repeated testing into account this was not significant. Genetic variation among ocean regions The frequency of the most common haplotypes (COI01, COI-02, COI-03 and COI-04) was similar among waters around countries (Fig. 4). One rare haplotype (COI-08) is shared among waters around Iceland, Norway and Canada. AMOVA results showed no difference among waters around countries (Table 4). Only a small part, non significant, of the observed variation was due to variation among areas within countries (conventional F comparison, Table 4, lower). Pairwise comparison between countries showed very low ST values and the estimated gene flow was thus high, in most cases unlimited (Table 5). Genetic variation among areas within countries An AMOVA analysis revealed a genetic difference between sampling stations within Canada (FST = 0.040, P = 0.028). However, the sample from Canada East was dominated by males (61 %) whereas in Canada West the sex ratio was equal (males, 51 %). Comparison between sampling stations in Canada independently for sexes did not reflect a spatial genetic structure. No significant differences were observed among sampling stations within the other countries. Table 5 Gene flow, M = Ne m (upper part) and ST (lower part) for saithe, pairwise comparison between countries 64 Country Canada Iceland Faroe Isl. Norway Canada Iceland Faroe Islands Norway * −0.00294 −0.00373 −0.00245 ∞ * −0.00349 −0.00110 ∞ ∞ * 0.00011 ∞ ∞ 4370.014 * Environ Biol Fish (2015) 98:67–79 75 Temporal genetic variation Temporal genetic variation was examined around Iceland (between saithe samples from October and April) and haplotype frequency was not significantly different between sampling events (AMOVA, haplotype frequencies only: FST = −0.005, P = 0.93). No difference was observed among the most abundant cohorts (1993–2002), for the pooled data for Canada and Iceland (AMOVA, haplotype frequencies only: FST = −0.001, P = 0.51). Demographic analysis The Bayesian Skyline analysis indicated a recent population expansion in saithe (Fig. 5a). This event is estimated to have started 7–8 Kyr ago. Overall, the Fig. 5 a Bayesian skyline plot for saithe reflecting changes in female effective population size (Ne ) against time in thousands of years (Kyr) before present. b Changes in the δO 18 concentration in the North Greenland ice core effective population size of saithe is estimated to have increased 330 fold, during the last 10Kyr. The warming of the earth’s climate after the last Ice age is estimated to have occurred earlier, about 10–15Kyr ago (climate change indicated by increased δ018 in the Greenland Ice core (NGRIP data: www.gfy.ku. dk/∼www-glac/ngrip) (Fig. 5b). Tests of neutrality The observed number of haplotypes by far exceeds the expected number in the overall samples according to Chakraborty’s test (Chakraborty 1990) in the overall sample (kExp. = 6.9, kObs. = 51, P < 1 × 10−5 ). The same is true for the Fu’s FS (FS = −28.27, P < 1 × 10−5 ) (Fu 1997). The frequency distribution of the observed haplotypes a b 65 Paper III 76 differ from expectation of Ewen’s sampling formula for a population at mutation drift equilibrium (FExp. = 0.084, FObs. = 0.362, P = 8 × 10−5 ) (Ewens 1972). Tajima’s neutrality test also reflected the high number of rare haplotypes (Tajima’s D = −2.18, P < 1 × 10−5 ) (Tajima 1989). In general the same applies to the analysis when carried out for samples within the different countries. Discussion Spatial and temporal genetic variation The results of the present study show limited overall mtDNA sequence differences among geographic locations throughout the North Atlantic ocean for saithe. However, when considering genetic variation between the sexes a certain pattern is revealed. Differences between sexes in Canadian waters indicate that the sexes originate from groups differing in haplotype frequencies, and that there might be a sex-biased migration pattern between these groups (Prugnolle and de Meeus 2002). Alternatively, a nuclear and mtDNA incompatibility might be reflected in haplotype specific mortality in either sex, changing the haplotype frequency in one sex through sex selective mortality. However, since the observed genetic variation is mostly synonymous this is an unlikely explanation for the observed sex differences. A more likely explanation are differences in migratory patterns of adult individuals between the sexes. Spatial genetic structure is only revealed when comparing the sexes separately, females showing slightly more differences than the males in a pairwise comparison between Canada and the Faroe Islands (Fig. 3). This may suggest that females are more philopatric than males. Similar indications have been observed for haddock (Eirı́ksson, unpubl. data). Sex-biased dispersal is well known in a wide range of taxa (for overview see Petit and Excoffier 2009) and a number of cases have been reported for freshwater fish (e.g. Fraser et al. 2004; Cano et al. 2008; Brunelli et al. 2010). However, there are not many reported cases of sex-biased dispersal in marine fish (but see Shaw et al. 2004), therefore, these indications in saithe are worth further research. Although there is an indication of spatial genetic structure across the N-Atlantic, when considering the 66 Environ Biol Fish (2015) 98:67–79 sexes separately, the results suggest high levels of gene flow among geographic regions. This is not surprising for saithe geographic localities in the NE-Atlantic, where tag recapture studies have shown migration among countries (Jakobsen and Olsen 1987). However, saithe migration between the NE-Atlantic and NW-Atlantic, as suggested by the present study, has not been reported to our knowledge. Not only is the relative frequency of the most common haplotypes similar among different waters, but a rare haplotype (COI-08) is found in saithe from the waters around Canada, Iceland and Norway (Table 2). These findings, and the finding of sex differences, suggest regular saithe migrations and/or passive egg or larval drift with water currents between the West and the East of the North Atlantic as well as among areas within the NE-Atlantic. The present study does not support the idea of population structure around Iceland as Armannsson et al. (2007) suggested based on a tagging and migration study. The same is true for saithe sampled at different localities around Faroe Islands and Norway. No significant differences were observed. The observed variation among sampling areas in Canada (Canada West and Canada East) might be considered spatial variation as the groups have been suggested to belong to different populations based on phenotypic difference (Neilson et al. 2006). However, the observed difference is more likely to reflect the observed variation between sexes and skewed sex ratio rather than spatial variation. The observed variation in saithe in Canada is thus worth further investigation. Temporal variation is small as indicated by AMOVA among cohorts. This may be due to large effective population size, Ne , which reduces the rate of genetic drift. Large population size will thus mean that a long time is necessary for genetic drift to alter the allelic configuration of neutral genetic variation among groups. The marine ecosystems of the North Atlantic underwent a dramatic change at the end of last Ice age 10–15 Kyr ago. This may be too short a time for population genetic structure to be established by drift alone. Comparing the observed variation in saithe to transAtlantic cline seen in Atlantic cod, Gadus morhua, (Árnason 2004), suggests different migration pattern with a more limited trans Atlantic gene flow in cod compared to saithe. Environ Biol Fish (2015) 98:67–79 Demography and natural selection The observed DNA sequence variation is almost exclusively synonymous and is thus likely to be either neutral or under weak purifying natural selection. However, the neutrality test results showed that the observed genetic variation deviates from expectations of selective neutrality. Factors other than natural selection may produce similar pattern. The Fu’s FS test is sensitive to recent population expansion, resulting in high negative FS values as observed in the present study. Tajima’s test result might indicate purifying selection, but as for Fu’s test the same result is expected for a population that has undergone recent expansion. It is possible that natural selection is responsible for the observed pattern (see e.g. Bazin et al. 2006). However, given that the observed polymorphism is almost exclusively synonymous we assume that the apparent deviation from neutrality, suggested by the other neutrality tests, are indicative of a recent population expansion rather than natural selection. Indeed the demographic analysis of genetic variation in saithe suggests a recent population expansion (Bayesian skyline analysis). The expansion event does not correlate with changes in Earth’s climate. However, the timing of the expansion event is dependent on the mutation rate used for the analysis (2 % sub./site/Myr in the present study). Other estimates suggest that the mutation rate may be slightly lower (Bermingham et al. 1997; Knowlton and Weigt 1998) and a recent study suggests that mutation rates may be much higer (Burridge et al. 2008) and also time dependent. Using a different mutation rate would alter the assessment of the timing of population expansion. Also, as different loci may have different history within a population, analysing more loci will improve the prediction and might change the estimated timing of expansion. Indication of a sudden population expansion has been observed in related species, but the estimated timing varies (e.g. Pálsson et al. 2009; Liu et al. 2010; Eirı́ksson and Árnason 2014). The varible estimated timing may suggest multible historical and/or biological explanatations for the observed genetic vartiation in these fish species. Acknowledgment This project was supported by the Icelandic research fund and the Icelandic research fund for graduate students of The Icelandic Centre for Research. We thank Dr. 77 Snæbjörn Pálsson for stimulating discussions during the work and Prof. Jarle Mork for providing us with access to his samples and his hospitality and stimulating discussions during lab work at NTNU. We thank people at the Faroese Fisheries laboratories for cooperation during their annual survey in 2006. We also thank Hlynur Ármannsson for assisting in providing samples from Canadian waters collected by Canadian Department of Fisheries and Oceans, Maritimes Region. We thank Kristján Kristinsson and workers at Icelandic Marine Research Institute for providing saithe samples from Iceland. Finally we thank the people at the population genetics lab at the University of Iceland for discussions and help during the work. References Andersen KK, Azuma N, Barnola JM, Bigler M, Biscaye P, Caillon N, Chappellaz J, Clausen HB, Dahl-Jensen, D, Fischer, H, Flückiger J, Fritzsche D, Fujii Y, Goto-Azuma K, Grønvold K, Gundestrup NS, Hansson M, Huber C, Hvidberg CS, Johnsen SJ, Jonsell U, Jouzel J, Kipfstuhl S, Landais A, Leuenberger M, Lorrain R, Masson-Delmotte V, Miller H, Motoyama H, Narita H, Popp T, Rasmussen SO, Raynaud D, Rothlisberger R, Ruth U, Samyn D, Schwander J, Shoji H, Siggard-Andersen ML, Steffensen JP, Stocker T, Sveinbjörnsdóttir AE, Svensson A, Takata M, Tison JL, Thorsteinsson T, Watanabe O, Wilhelms F, White JWC, members NGICP (2004) High-resolution record of northern hemisphere climate extending into the last interglacial period. Nature 431:147–151 Armannsson H, Jonsson ST, Neilson JD, Marteinsdottir G (2007) Distribution and migration of saithe (Pollachius virens) around Iceland inferred from markrecapture studies. ICES J Mar Sci 64:1006–1016. doi:10.1093/icesjms/fsm076 Árnason E (2004) Mitochondrial cytochrome b DNA variation in the high-fecundity Atlantic cod: Trans-Atlantic clines and shallow gene genealogy. Genetics 166:1871– 1885. doi:10.1534/genetics.166.4.1871 Avise JC (2000) Phylogeography: the history and formation of species. Harvard University Press, Cambridge, Massachusetts Avise JC (2004) Molecular markers, natural history and evolution, 2nd edn. Sinauer Associates, Inc Avise JC, Walker D (1998) Pleistocene phylogeographic effects on avian populations and the speciation process. Proc R Soc Lond B 265:457–463. doi:10.1098/rspb.1998.0317 Bandelt HJ, Forster P, Röhl A (1999) Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol 16:37–48 Bazin E, Glémin S, Galtier N (2006) Population size does not influence mitochondrial genetic diversity in animals. Science 312:570–572. doi:10.1126/science.1122033 Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Statist Soc B 57:289–300 Bermingham E, McCafferty SS, Martin AP (1997) Fish biogeography and molecular clocks: perspectives from the Panamanian Isthmus. In: Kocher TD, Stepien CA (eds) 67 Paper III 78 Molecular systematics of fishes. Academic Press, San Diego, pp 113–128 Bernatchez L, Dodson JJ (1991) Phylogeographic structure in mitochondrial DNA of lake whitefish (Coregonus clupeaformis) and its relation to Pleistocene glaciation. Evolution 45:1016–1035 Brickman D, Marteinsdottir G, Logemann K, Harms IH (2006) Drift probabilities for Icelandic cod larvae. ICES J Mar Sci 64:1–11. doi:10.1093/icesjms/fsl019 Brunelli JP, Steele CA, Thorgaard GH (2010) Deep divergence and apparent sex-biased dispersal revealed by a Y-linked marker in rainbow trout. Mol Phylogenet Evol 56:983–990. doi:10.1016/j.ympev.2010.05.016 Burridge CP, Craw D, Fletcher D, Waters JM (2008) Geological dates and molecular rates: fish DNA sheds light on time dependency. Mol Biol Evol 25:624–633. doi:10.1093/mol bev/msm271 Cadrin SX, Friedland KD, Waldman JR (2004) Stock identification methods: applications in fishery science, 2nd edn. Elsevier Academic Press, London Cano JM, Mäkinen HS, Merilä J (2008) Genetic evidence for male-biased dispersal in the three-spined stickleback (Gasterosteus aculeatus). Mol Ecol 17:3234–3242. doi:10.1111/j.1365-294X.2008.03837.x Chakraborty R (1990) Mitochondrial DNA polymorphism reveals hidden heterogeneity within some Asian populations. Am J Hum Genet 47:87–94 Charrier G, Durand JD, Quiniou L, Laroche J (2006) An investigation of the population genetic structure of pollack (Pollachius pollachius) based on microsatellite markers. ICES J Mar Sci 63:1705–1709. doi:10.1016/j.icesjms.2006.07.006 Coucheron DH, Nymark M, Breines R, Karlsen BO, Andreassen M, Jørgensen TE, Moum T, Johansen SD (2011) Characterization of mitochondrial mRNAs in codfish reveals unique features compared to mammals. Curr Genet 57:213–222. doi:10.1007/s00294-011-0338-2 Drummond AJ, Rambaut A, Shapiro B, Pybus OG (2005) Bayesian coalescent inference of past population dynamics from molecular sequences. Mol Biol Evol 22:1185–1192. doi:10.1093/molbev/msi103 Eirı́ksson GM, Árnason E (2014) Mitochondrial DNA sequence variation in whiting Merlangius merlangus in the North East Atlantic. Environ Biol Fish 97:103–110. doi:10.1007/s10641-013-0143-5 Emerson BC, Paradis E, Thébaud C (2001) Revealing the demographic histories of species using DNA sequences. TREE 16:707–716. doi:10.1016/S0169-5347(01)02305-9 Ewens WJ (1972) The sampling theory of selectively neutral alleles. Theor Popul Biol 3:87–112 Ewing B, Hillier L, Wendl MC, Green P (1998) Basecalling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res 8:175–185. doi:10.1101/gr.8.3.175 Excoffier L, Lischer HEL (2010) Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under linux and windows. Mol Ecol Res 10:564–567. doi:10.1111/j.1755-0998.2010.02847.x Fraser DJ, Lippé C, Bernatchez L (2004) Consequences of unequal population size, asymmetric gene flow and sex-biased dispersal on population structure in 68 Environ Biol Fish (2015) 98:67–79 brook charr (Salvelinus fontinalis). Mol Ecol 13:67–80. doi:10.1046/j.1365-294X.2003.02038.x Fu YX (1997) Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics 147:915–925 Furrer R, Nychka D, Sain S (2011) Fields: tools for spatial data. R package version 6.5.2 Galtier N, Gouy M, Gautier C (1996) SEAVIEW and PHYLO WIN: two graphic tools for sequence alignment and molecular phylogeny. CABIOS 12:543–548. doi:10.1093/bioinformatics/12.6.543 Gordon D, Abajian C, Green P (1998) Consed: a graphical tool for sequence finishing. Genome Res 8:195–202. doi:10.1101/gr.8.3.195 Hedgecock D (1994) Does variance in reproductive success limit effective population sizes of marine organisms? In: Beaumont AR (ed) Genetics and evolution of aquatic organisms. Chapman & Hall, New York, pp 122–134 Hedgecock D, Launey S, Pudovkin AI, Naciri Y, Lapegue S, Bonhomme F (2007) Small effective number of parents (Nb ) inferred for a naturally spawned cohort of juvenile European flat oysters Ostrea edulis. Mar Biol 150:1173– 1182. doi:10.1007/s00227-006-0441-y Hewitt GM (1996) Some genetic consequences of Ice ages, and their role in divergence and speciation. Biol J Linn Soc 58:247–276. doi:10.1111/j.1095-8312.1996.tb01434.x Jakobsen T, Olsen S (1987) Variation in rates of migration of saithe from Norwegian waters to Iceland and Faroe Islands. Fish Res 5:217–222. doi:10.1016/0165-7836(87)90041-5 Jónsson G (1992) Íslenskir fiskar (in Icelandic). Fjölvaútgáfan, Reykjavı́k Kimura M (1984) The neutral theory of molecular evolution. Cambridge University Press Knowlton N, Weigt LA (1998) New dates and new rates for divergence across the Isthmus of Panama. Proc R Soc B 265:2257–2263. doi:10.1098/rspb.1998.0568 Lee HJ, Boulding EG (2009) Spatial and temporal population genetic structure of four northeastern Pacific littorinid gastropods: The effect of mode of larval development on variation at one mitochondrial and two nuclear DNA markers. Mol Ecol 18:2165–2184. doi:10.1111/j.1365-294X.2009.04169.x Librado P, Rozas J (2009) DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics 25:1451–1452. doi:10.1093/bioinformatics/btp187 Liu M, Lu ZC, Gao1 TX, Yanagimoto T, Sakurai Y (2010) Remarkably low mtDNA control-region diversity and shallow population structure in Pacific cod Gadus macrocephalus. J Fish Biol 77(5):1071–1082. doi:10.1111/j.1095-8649.2010.02743.x Neilson JD, Stobo WT, Perley P (2006) Pollock (Pollachius virens) stock structure in the Canadian Maritimes inferred from mark-recapture studies. ICES J Mar Sci 63:749–765. doi:10.1016/j.icesjms.2005.12.006 Pálsson S, Källman T, Paulsen J, Árnason E (2009) An assessment of mitochondrial variation in Arctic gadoids. Polar Biol 32:471–479. doi:10.1007/s00300-008-0542-9 Paradis E, Claude J, Strimmer K (2004) APE: analyses of phylogenetics and evolution in R language. Bioinformatics 20:289–290 Environ Biol Fish (2015) 98:67–79 Petit RJ, Excoffier L (2009) Gene flow and species delimitation. TREE 24:386–393. doi:10.1016/j.tree.2009.02.011 Posada D (2008) jModelTest: phylogenetic model averaging. Mol Biol Evol 25:1253–1256. doi:10.1093/molbev/msn083 Prugnolle F, de Meeus T (2002) Inferring sex-biased dispersal from population genetic tools: a review. Heredity 88:161– 165. doi:10.1038/sj.hdy.6800060 Pybus OG, Rambaut A, Harvey PH (2000) An integrated framework for the inference of viral population history from reconstructed genealogies. Genetics 155:1429–1437 R development core team (2008) Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna Reiss H, Hoarau G, Dickey-Collas M, Wolff WJ (2009) Genetic population structure of marine fish: Mismatch between biological and fisheries management units. Fish Fish 10:361– 395. doi:10.1111/j.1467-2979.2008.00324.x Robichaud D, Rose GA (2004) Migratory behaviour and range in Atlantic cod: inference from a century of tagging. Fish Fish 5(3):185–214. doi:10.1111/j.1467-2679.2004.00141.x Rogers A, Harpending H (1992) Population growth makes waves in the distribution of pairwise genetic differences. Mol Biol Evol 9:552–569 Schopka SA, Sólmundsson J, Porsteinssson V (2006) Áhrif svæafriunar á vigang porsks (in Icelandic). Tech. rep., Hafrannsóknastofnunin Schrey AW, Heist EJ (2003) Microsatellite analysis of population structure in the shortfin mako (Isurus oxyrinchus). Can J Fish Aquat Sci 60:670–675 Shaw PW, Arkhipkin AI, Al-Khairulla H (2004) Genetic structuring of Patagonian toothfish populations in the Southwest Atlantic ocean: the effect of the Antarctic polar front and deep-water troughs as barriers to genetic exchange. Mol Ecol 13:3293–3303. doi:10.1111/j.1365-294X.2004. 02327.x 79 Sigurgı́slason H, Árnason E (2003) Extent of mitochondrial DNA sequence variation in Atlantic cod from the Faroe Islands: A resolution of gene genealogy. Heredity 91:557– 564. doi:10.1038/sj.hdy.6800361 Tajima F (1989) Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585– 595 Tamura K, Nei M (1993) Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol Biol Evol 10:512–526 Tamura K, Dudley J, Nei M, Kumar S (2007) MEGA4: Molecular evolutionary genetics analysis (MEGA) software version 4.0. Mol Biol Evol 24:1596–1599. doi:10.1093/molbev/ msm092 Walsh P, Metzfer D, Higuchi R (1991) Chelex 100 as a medium for simple extraction of DNA for PCR-based typing from forensic material. BioTechniques 10:506– 513 Waples RS (1987) A multispecies approach to the analysis of gene flow in marine shore fishes. Evolution 41:385– 400 Ward RD, Woodwark M, Skibinski DOF (1994) A comparison of genetic diversity levels in marine, freshwater, and anadromous fishes. J Fish Biol 44:213–232. doi:10.1111/j.1095-8649.1994.tb01200.x Wares JP, Cunningham CW (2001) Phylogeography and historical ecology of the North Atlantic intertidal. Evolution 55:2455–2469. doi:10.1111/j.0014-3820.2001.tb00760.x Weersing K, Toonen RJ (2009) Population genetics, larval dispersal, and connectivity in marine systems. Mar Ecol Prog Ser 393:1–12. doi:10.3354/meps08287 Weir BS, Cockerham CC (1984) Estimating F -statistics for the analysis of population structure. Evolution 38:1358– 1370 69 Paper IV Phylogeography of haddock Melanogrammus aeglefinus in the North Atlantic — Postglacial expansion and male-biased dispersal Guðni Magnús Eiríksson and Einar Árnason Unpublished manuscript © Guðni Magnús Eiríksson and Einar Árnason Guðni Magnús Eiríksson actively particiated in assembling samples from different parts of the world, took part in sampling trips around Iceland in 2006 and around Faroe Islands the same year. Guðni did all the molecular laboratory work: DNA isolation, mtDNA amplification, purification and prepared the samples for electrophoresis on an ABI-3100 automatic sequencer. Guðni was also in charge of data analysis and interpretation. Guðni was in charge of writing the manuscript. Professor Einar Árnason actively participated in all steps of the work. Einar took part in the experimental design, supervised the molecular work, took part in data analysis, interpretation of the data and participated in writing the manuscript. 71 Paper IV 72 1 Abstract Genetic variation in haddock, Melanogrammus aeglefinus, was examined using the mitochondrial DNA sequence information. A 599 base pair fragment of the cytochrome oxidase c subunit I gene in 884 individuals sampled across the North Atlantic was used. The overall nucleotide diversity was π̂ = 0.0026 and haplotype diversity was ĥ = 0.81. The observed genetic variation suggests weak or no natural selection acting on the observed polymorphism. Analysis of genetic variation revealed difference between sexes in the Faroe Islands and Norway, suggesting male-biased dispersal. Such examples are very rare for marine fish. However, genetic variation was almost uniform across the North Atlantic suggesting high levels of gene flow. Indications of population structure within Iceland and Faroe Islands are worth further examination. Bayesian skyline analysis reconstructing the past demographic history indicate post-glacial population expansion that correlate with the change in climate at the end of the last glacial maximum. Keywords: mtDNA, COI, male-biased dispersal, postglacial expansion, demography 73 Paper IV 2 Introduction 2.1 Genetic variation and population subdivision Genetic difference among groups, for traits that is not affeced by natural selection, will establish over time due to genetic drift. Thus using a molecular markers, that are not affected by natural selection, can be used to identify reproductively isolated groups. The rate of drift is inversely related to the effective population size (Ne ). Examination of population subdivision has been the subject of many studies, not least for commercially important fish species as such information can be of great importance for responsible management (for overview see Reiss et al, 2009) Variable molecular markers are used and results are seen to vary from one study to another. It is important to note that observed genetic differences between different environments occupied by an organism may not reflect population breeding structure if the variation is affected by natural selection (Guinand et al, 2004). Genetic difference can become established between subdivided group of individuals that share the same origin (Williams et al, 1973; Guinand et al, 2004). For the identification of a breeding structure of a population it is thus important to use genetic markers that are not subject to natural selection, as the latter may reflect environmental heterogeneity rather than the origin of groups involved. The analysis of indications of natural selection should thus be an integrated part of the analysis of population sub-structuring (see e.g. Excoffier, 2007). In the analysis of population sub-structuring the a priori definition of groups or populations, for example using analysis of molecular variation (AMOVA) (Excoffier et al, 1992) will affect the results. If sub-structuring is not properly recognized it can greatly affect our interpretation of the genetic diversity (Excoffier, 2007). The SAMOVA method (Dupanloup et al, 2002) maximizes genetic difference between a specified number of groups without a priori assignment of group membership. It can be used to identify populations without prior grouping of samples. However, the results for different grouping can only be indicative of the true number of populations (Dupanloup et al, 2002). Population sub-structuring may follow environmental variation not considered by the observer. For example different populations may occupy different depth as has been suggested for Atlantic cod, Gadus morhua (Pampoulie et al, 2008). Such variation would go unnoticed in a SAMOVA analysis were genetic difference between groups from different geographic locations is maximized. Genetic variation may reflect different migration patterns between sexes and such patterns are common for vertebrate species (Purcell et al, 1996; Petit and Excoffier, 2009). In the case of females being more philopatric and males more migratorial, such variation may be reflected in the mitochondrial DNA variation. Females would form more geographically distinct populations but males would be homogeneous over larger geographic areas, due to mixing of males of different geographical origin. Difference among the mitochondrial genome and nuclear loci can also indicate sex difference in migration pattern (Purcell et al, 1996). Many studies have demonstrated a lack of spatial genetic structure in marine species indicating high levels of gene flow attributed to lack of physical barriers in the marine environment and high dispersal rate for eggs and larvae with ocean currents (Waples, 1987). Large effective population size (Ne ) may also account for the genetic uniformity as it may take a long time for a genetic difference to become established as rate of genetic drift will be low. This may be the case for many marine species (Waples, 1987). Direct observation of DNA sequence variation can be used to make inference about observed polymorphism, for example by inspection of the frequency of synonymous and nonsynonymous segregating sites. A number of statistical tests are available for detecting natural selection for example by exploring the allelic distribution or level of variability (Nielsen, 2001). However, natural selection can be similar to the effects that demographic events, such as populations bottleneck, can have on genetic variation and it may be dif- 74 ficult to discriminate between the alternative explanations. Nucleotide mismatch analysis can be used to examine the haplotype tree structure and make predictions of demographic history (Rogers and Harpending, 1992) but more recent methods, for example the skyline method introduced by Pybus et al (2000), have improved our ability to reconstruct past demographic history embedded in the genealogical relationship among individuals of a population (Emerson et al, 2001; Drummond et al, 2005). 2.2 Haddock — Biology and environment Haddock, Melanogrammus aeglefinus, is a demersal fish species most commonly found at 80–200m depth at variable bottom substrates. Haddock is an omnivore, feeding mainly on small benthic organisms. The age at first maturation varies among geographic locations and in the North-West Atlantic the age of first maturation is 3–5 years, but maturation occurs at 2–3 years in the warmer waters of the North Sea (Cohen et al, 1990; Jónsson, 1992). Fecundity varies greatly with females size, larger females having higher fecundity (50,000– 2,000,000 eggs per female Jónsson, 1992). The eggs are small (1.2-1.7 mm in diameter). They are buoyant and eggs and larvae are pelagic for about three months (Hislop, 1984). During this period eggs and larvae can drift with water currents long distances as has been shown for Atlantic cod (e.g. Brickman et al, 2006). Timing of spawning varies greatly among geographic regions (January to July). Haddock is a commercially important fish species, the major fishing grounds being in the North-East Atlantic (Cohen et al, 1990; Jónsson, 1992) In recent years some changes in fish communities have been observed in North Atlantic ocean, both in migratory patterns of some species and in changes in the distribution of others. These changes are most likely linked to environmental changes associated with increased water temperature (Rose, 2005). For the last two decades the distribution of haddock around Iceland has changed and haddock is now commonly found at the North coast (Björnsson et al, 2007). In recent years (2004–2007) the haddock captures increased in the Icelandic waters reflecting the increase in population size, followed by reduction in the population size (Anonymous, 2011). Such demographic changes may be reflected in genetic variation of populations. The environmental conditions throughout the distribution range of haddock are variable and different populations of haddock may occupy different habitats. An environmental heterogeneity is found at the Faroe Islands where the Faroe Plateau and the Faroe Bank are separated by 850m deep channel. It has been shown that fish growth rate is different due to difference in temperature and food abundance between the two ecosystems (Magnussen, 2007). A number of studies have been carried out, using different molecular markers, in order to describe the genetic variation and the population genetic structure of haddock and the findings are to some extend conflicting. A study using transferrin polymorphism for the analysis of the genetic variation in haddock indicate fine scale population sub structuring within the North-East Atlantic (Jamieson and Birley, 1989). In another study of allozyme variation in haddock (using 8 loci) limited difference was revealed among countries (Norway, Iceland, Faroe Islands, North Sea) (Giæver and Forthun, 1999). Lage et al (2001) found a temporal variation in allele frequencies in a microsatellite study for haddock and an indication of fine scale population genetic structure in the North-West Atlantic. However, a limited population sub-structuring was indicated by mtDNA RFLP analysis for the same geographic region in another study (Zwanenburg et al, 1992). In Lage et al (2001) study the variation in FST among the different loci is not reported and the effects of natural selection on shaping the observed pattern was not considered. It is possible that natural selection and environmental heterogeneity may explain the observed pattern (Williams et al, 1973; Guinand et al, 2004). In fact one of the microsatellites used in the study (Gmo-132.1) has been shown to have an outlier 75 Paper IV FST value in spatial comparison for Atlantic cod, and is suspected to be under natural selection (e.g. Nielsen et al, 2006). A mtDNA study indicates that gene flow among geographic region may be responsible for maintenance of high genetic variation in haddock at the time of collapse in the fisheries at Georges Bank (Purcell et al, 1996). 2.3 The present study Here we analyse mtDNA sequence variation using a 599 base pair fragment of the cytochrome oxidase c subunit I (COI) gene for haddock sampled in the North Atlantic. The sequence variation will be used to assess relationship in haddock within and among countries. An important characteristic of mtDNA is that its effective population size (Ne ) is quarter of that for a nuclear marker, due to maternal inheritance and a single allele carried by each individual, making the rate of genetic drift potentially faster (Hartl and Clark, 2006). Comparison will be made to earlier studies on haddock and comparison made to what has been described for Atlantic cod, Gadus morhua, and related gadiod fish species (Árnason, 2004; Pálsson et al, 2009; Eiríksson and Árnason, 2014, 2015) 3 Materials and methods 3.1 Fish samples In total 884 haddock samples were obtained from five geographic regions from the North-East Atlantic. Gill tissue samples were preserved in 96% ethanol. Samples were grouped in 14 different areas within countries: Greenland NW, SW and E, Iceland S, W, N and E, Faroe Bank and Faroe Plateau, four sampling stations along the coast of Norway (79◦ N, 69◦ N, 66◦ N, 63◦ N) and the North Sea (Table 1 and Figure 1). The weight and length of sampled fish was recorded and sex determined for most fish, except for fish sampled in Greenland. The age of the haddock sampled at the Faroe Bank and in Iceland was determined by otolith inspection at the Faroese Fisheries Laboratory and the Icelandic Marine Research Institute, respectively. Three individuals where sampled at the east coast of Greenland (Table 1). These samples are not included when comparing different groups within countries, but pooled together with other samples from Greenland in the analysis of variation among countries. 3.2 Molecular methods DNA was isolated using a chelex method (Walsh et al, 1991) with some modifications. A pair of primers FishF1-5′ TCAACCAACCACAAAGACATTGGCAC3′ and FishR2-5′ ACTTCAGGGTGACCGAAGAATCAGAA3′ (Ward et al, 2005) was used to amplify a 599 base pair fragment of the haddock COI mitochondrial gene, corresponding to base pairs 5529–6127 in the haddock complete mitochondrial genome (Ursvik et al, 2007, GenBank accession number: AM489717.1). A 19µ l PCR reaction mix contained 0.17mM dNTP, 11µ g/µ l Bovine serum albumin (BSA), 0.36pM of each primer (FishF1, FishR2), and 0.05U Taq Polymerase buffered with 10× Taq buffer (New England BioLabs (NEB)), DNA template (median concentration 0.26ng/µ l in reaction mix). The reaction mix was prepared on ice and placed in a preheated thermocycler when ready. The DNA was denatured at 94◦ C for 5 minutes in the first cycle followed by 35 cycles of denaturing for 1 min, annealing at 54◦ C for 30 seconds and extension at 72◦ C for 1 minute followed by 7 minutes extension period at 72◦ C. A 5µ l of the product was run for 10 min at 90 Volts on an 1.5% agarose gel containing ethidium bromide and then inspected under UV light to determine amplification quality. PCR were then enzymatically purified using Exonuclease I (NEB) and Antarctic Phosphatase 76 (NEB). Products were sequenced using Applied Biosystems-BigDye Terminator v3.1 kit (using the FishF1 primer) applying standard protocol with some modifications and ran on an ABI-3100 automatic sequencer. 3.3 Data analysis Base calling, data assembly and sequence alignment was carried out in Phred (Ewing et al, 1998) and Phrap (Gordon et al, 1998). Sequence data was viewed using Consed (Gordon et al, 1998) and Seaview (Galtier et al, 1996). Trace files for all observed haplotypes were inspected manually, the sequence integrity confirmed and sequences of low quality rejected. Haplotype tree was produced using DNAsp (Librado and Rozas, 2009) and Network 4.5.1.6 (Bandelt et al, 1999). The tree is based on the most parsimonious relationship among the observed DNA sequences and haplotype frequency used as an additional criterion for resolving homoplasy. MEGA 4 (Tamura et al, 2007) was used for translation of nucleotide sequence to amino acid sequence for the observed haplotypes (vertebrate mtDNA) for identification of amino acid substitutions. The genetic variation was examined by application of different neutrality tests: Tajima’s D test, Chakraborty’s test and Fu’s FS test. The neutrality tests were applied to the total sample and separately for the different geographic regions sampled (by countries) and carried out in Arlequin 3.5.1.2 (Excoffier and Lischer, 2010). Analysis of pairwise nucleotide mismatch (Rogers and Harpending, 1992) was carried out for the overall sample and 1000 simulated mismatch distributions used to test if the observed distribution differed to the expectations of a population expansion model (Schneider and Excoffier, 1999). We also used the Bayesian skyline plot method (BSP) in BEAST v1.6.1 (Drummond et al, 2005) to estimate effective population size with time and to reconstruct demographic history. Estimations were based on Markov chain Monte Carlo technique to average over tree space, each tree weighted against its posterior probability. Strict molecular clock was set at 2%substitutions/site/Myr (Avise, 2004). The chain was run for 500 million iterations. After the run the first 10 million iterations were discarded to allow for burn-in (Drummond et al, 2005). Substitution model used was Hasegwa-Kishino-Yano setting base frequencies at empirical and site heterogeneity set to Gamma (4 categories) and invariant sites (HKY+G+I). jModelTest (Posada, 2008) was used to determine the suitable substitution model for the analysis. Tree prior was set at: Coalescent, Bayesian skyline, and piecewise-constant skyline model selected, setting group size (m) at 10. Auto optimize was selected for the analysis operators. The BEAST input file was produced using BEAUTi v.1.6.1 and the results were inspected using TRACER v.1.6.1 (Drummond et al, 2005). A generation time of 4 years was used to calculate the Ne after the run. There is considerable variation in age at maturation for haddock, increasing with higher latitude, but estimated age at maturation usually ranges between 3-5 years (e.g. Hislop, 1984; Jónsson, 1992). The results were compared to changes in δ O18 from the North Greenland ice core Project as it is indicative of the earths climate (Andersen et al, 2004). Genetic differentiation among groups was analyzed using Analysis of Molecular variation (AMOVA) based on the method developed by Excoffier et al (1992). Genetic variation among groups was examined, both using genetic distance based on Tamura and Nei (1993) (Φ) and conventional F-statistics (frequency of different haplotypes only) to compare genetic variation among groups. Pairwise comparisons using ΦST and FST were carried to further analyze any possible differences among groups, as implemented in Arlequin. The significance of fixation indices were tested using 10,000 permutations (Excoffier et al, 1992). In the present study temporally spaced samples were not obtained for any sampling location. However, genetic variation among cohorts, age composition and body length (length used as an indicator of age were age information was not available) was examined among groups where significant genetic difference was found. This was done in order to assess if temporal genetic variation could be responsible for the observed genetic variation. 77 Paper IV Pairwise genetic difference (ΦST ) was compared among countries and between areas within countries. The significance of the pairwise ΦST values was tested by a generation of ΦST null distribution based on the hypothesis of no difference between groups by 10,000 permutations of haplotypes between groups. The Pvalue was obtained as the proportion of permutations leading to an ΦST value larger or equal to the observed one (Excoffier et al, 1992). Assessment of false discovery rate (Benjamini and Hochberg, 1995) was used to account for repeated comparisons in order to further examine the significance of observed difference. Spatial AMOVA (SAMOVA) was carried out for the samples obtained at 13 geographic localities (all but Greenland E, Table 1) as implemented in the SAMOVA software (Dupanloup et al, 2002). The analysis was carried out for 2–12 groups and the variance components and fixation indices inspected. The significance of the fixation indices was assessed using 1000 permutations. The geographic distance between sampling locations was calculated as minimum distance between the sampling localities based on mean values for coordinates for each group using the fields package (Furrer et al, 2011) in R (R development core team, 2008). In reality the distance separating haddock in the different sampling locations is longer, since the effects of suitable environments and ocean currents is not taken into account. Mantel tests (with 10,000 permutations) of relationship of genetic and geographical distance was carried out using the ape package (Paradis et al, 2004) in R. Gene flow M (absolute number of migrants) was estimated among the countries based on its relationship with FST , as M = 1 − FST /2FST as calculated in Arlequin. 4 Results 4.1 Genetic variation Variation was found at 75 sites defining 102 haplotypes (GenBank accession numbers: JQ340919–JQ341020) (Appendix I: Tables 1.9-1.12). All but five mutations were synonymous. Four of the nonsynonymous mutations were observed as singletons in the sample and one was found in two individuals (Table 2). Four haplotypes (COI-001 – COI-004) were found at relatively high frequencies (>5%) and formed a shallow base of the geneaology (Figure 2). The rare types mostly deviated by one mutation from the main haplotypes, but deeper branches were also present, particularly branches from COI-003 (Figure 2). The overall nucleotide diversity was π̂ ± σ̂π (×100) = 0.26 ± 0.17 and haplotype diversity was ĥ ± σ̂h = 0.805 ± 0.009. The diversity measures differed slightly among area and countries, π̂ ranging from 0.20 ± 0.14 in Iceland N to 0.37 ± 0.24 in Norway 69◦ N and ĥ ranging from 0.74 ± 0.04 in Iceland W and 0.89 ± 0.04 in Greenland NW (Table 3). 4.2 Population genetic structure — Variation among countries The frequency of different COI haplotypes was similar in the different countries, although haplotype combination of haddock from Greenland was slightly different from the other countries sampled (Figure 3). An AMOVA showed no overall difference among countries. However, part of the observed genetic variation was explained by variation among areas within countries (Table 4). Pairwise comparison among countries suggest panmixia (unrestricted gene flow) for haddock among Iceland, Faroe Islands and Norway and no genetic differentiation (negative ΦST values) (Table 5). Significant differences were observed in a pairwise comparison between Greenland on one hand and Iceland, Norway and North Sea at an α -level of 0.05 (Table 5). However, after adjusting for repeated sampling using false discovery method (Benjamini and Hochberg, 1995) this difference was not significant. 78 4.3 Population genetic structure — Variation among areas within countries Analysis of genetic variance among the 13 defined areas within countries showed nine significant pairwise differences at an α -level of 0.05. Greenland-NW showed genetic differences to four other areas (Faroe Plateau, Iceland-N, North Sea and Norway 70◦ N). The haddock sampled in Iceland-N differed to both sampling areas in Faroe Islands and to Iceland-W. Differences were observed between Faroe Plateau and Faroe Bank as well as between Iceland-W and the North Sea. Using false discovery method (Benjamini and Hochberg, 1995) to correct for repeated sampling none of the observed differences remained significant. A trend towards increased genetic differentiation (ΦST ) was observed with increased geographical distance between groups (Figure 4). However, the relationship was not significant (Mantel test, Z = 591, P = 0.149). AMOVA results showed no overall difference between sampling stations around Iceland (using Tamura and Nei (1993) genetic distance: ΦST (3,171) = 0.009, P = 0.16, and conventional haplotype frequencies: FST (3,171) = 0.013, P = 0.08). However, a pairwise difference was observed between haddock sampled at the West coast and North coast of Iceland (ΦST = 0.063, P = 0.018, Figure 5a). AMOVA results comparing Faroe Bank and Faroe Plateau showed a significant difference when considering Tamura and Nei (1993) genetic distance but not when considering haplotype frequency only (Table 6). None of the observed variance was explained by a comparison among stations within areas. Pairwise ΦST comparison between the two areas showed significant genetic differentiation (ΦST = 0.011, P = 0.009, Figure 5b). The SAMOVA results showed that for grouping into two groups the maximum genetic difference was obtained by comparing Greenland NW against all the other samples pooled, although not significant (FCT (1,868) = 0.0314, P = 0.070). As the number of groups was increased the fixation indices FSC (among populations within groups) and FST (within population) decreased, as would be expected when groups become less similar, but the FCT (among groups) had an initial drop but was not reduced when more than four groups were formed (Figure 8). For grouping of sampling locations into 3 groups a maximum genetic difference was observed when groups consisted of: Greenland NW; Iceland N and North sea; all other pooled (FCT (3,868) = 0.0200, P = 0.001). Highly significant difference among groups was observed when grouping in 4–11 groups with variable combinations but no clear geographical structure could be identified. 4.4 Population genetic structure — Variation between sexes There was no difference between sexes in the overall sample. However, AMOVA results revealed a significant overall difference between males (N = 103) and females (N = 78) within Norway (FST (1,179) = 0.022, P = 0.015, Figure 9). Genetic difference was not observed for sexes in the other countries sampled. AMOVA was carried out comparing genetic variation among countries for the two sexes separately for Iceland, Faroe Islands and Norway (excluding haddock sampled in Greenland as sex information was not available and North Sea due to small sample size). No difference was observed among countries for either sex. However, comparing the sexes separately between Faroe Bank and Faroe Plateau using AMOVA revealed significant difference between females when considering Tamura and Nei (1993) genetic distance (ΦST (1,183) = 0.020, P = 0.029 ,Figure 10a), but the same was not true for males (ΦST (1,205) = 0.004, P = 0.233, Figure 10b). The sample sex ratio was different between sampled haddock in areas within Faroe Islands (female ratio 0.36 and 0.66 in the Faroe Bank and the Faroe Plateau respectively, χ 2 = 31.03, d f = 1, P = 2.5 × 10−8 ). 79 Paper IV 4.5 Population genetic structure — Temporal variation Haplotype frequencies were not different among the most abundant cohorts (fish of age: 4-7 years, cohorts:19992002) within the Faroe Bank (AMOVA: FST (3,234) = −0.0004, P = 0.45). The age was not determined for the haddock sampled at the Faroe Plateau but length of the haddock sampled at the Faroe Plateau and the Faroe Bank did not differ significantly (ȲFP = 48.1cm and ȲFB = 49.0cm; t = 1.34, d f = 242, P = 0.18). The length of female and male haddock from the Faroe Islands did not differ (Y♀ = 48.9cm and Y♂ = 48.81cm; t = −0.13, d f = 360, P = 0.90) and the same was true for female and male comparison within the Faroe Plateau and the Faroe Bank separately. Also, the age composition of male and female haddock sampled at the Faroe Bank did not differ (χ 2 = 3.78, d f = 5, P = 0.58). The frequency of haplotypes did not differ among cohorts around Iceland for the most abundant cohorts (fish of age: 1–5 years, cohorts:1999– 2003)(AMOVA: FST (4,155) = 0.004, P = 0.31) and age composition of haddock was not significantly different between Iceland-W and Iceland-N (χ 2 = 8.24, d f = 4, P = 0.083). The age was not determined for haddock sampled in Norway, but total length of individuals did not differ significantly between sex (Ȳ♀ = 38.5cm and Ȳ♂ = 39.6cm; t = 0.71, d f = 170, P = 0.48). 4.6 Genetic variation and demographic history Neutrality tests showed that allele frequency distributions deviated from predictions of selective neutrality for the pooled data. Charkraborty’s test showed that the number of haplotypes in the observed sample by far exceeded the expected number (kobs = 102, kexp = 10.3, P < 1 × 10−5 ). The same was true for Fu’s FS test (FS = −27.0, P < 1 × 10−5 ). Similarly Tajima’s D reflected excess of low frequency polymorphisms (Tajima’s D = −2.34, P < 1 × 10−5 ). As population sub-structuring may affect this result, the tests were also carried out for haddock sampled in the various countries. In general the same results were obtained as for the pooled samples. The haplotype numbers exceeded expectations and Tajima’s D was negative. The observed mismatch distribution did not fit the prediction of the sudden expansion model (Figure 6, τ = 1.55, P < 1 × 10−5 ). The most obvious difference was that the observed distribution had more sequences that differed by two nucleotides than expected for a population having undergone a sudden expansion (Figure 6). The Bayesian skyline plot analysis indicated population expansion event in the last 25 kyr. Expansion was estimated to have started approximately 22 kyr ago (Figure 7a). The haddock population was predicted to have increased more than ten fold 18-19 kyr ago. After this the population expansion continues, but at a faster rate, extending almost to the present day. The overall effective population size was estimated to have increased about 360 fold since approximately 22 kyr ago (Figure 7a). The haddock population expansion is estimated to have started towards the end of the last glacial maximum (Figure 7b). 5 Discussion The present study reveals sex related genetic variation that may suggest male-biased dispersal pattern in haddock. The difference between female haddock from Faroe Bank and Faroe Plateau suggests historical separation between the two ecosystems, evidenced by genetic distances among of female haplotypes between the two regions. The same does not apply to males. This may suggest that females are more philophatric but males more migratory. Difference in migration patterns between the sexes is also suggested by a skewed sex ratio in from these localities. The well known ecological difference between the systems are likely to play a role in generating this pattern (e.g. Magnussen, 2007). However, there is no overall difference between sexes in Faroe Islands. Also, the observed difference between males and females 80 in the Faroe Islands is not likely to be of a temporal origin, as no difference was observed in body length or age between the two sexes. Similarly, no difference was observed between sexes overall for haddock sampled around Iceland and neither was sex difference found among sampling stations within Iceland. Difference in the physical environments between Iceland and Faroe Islands (e.g. the channel between the Faroe Bank and Faroe Plateau) and/or sampling at different time of the year may explain the difference between the countries. The genetic difference between sexes in Norway is difficult to account for. A possible mechanism for such patter to arise is a male-biased migration among genetically different groups (Prugnolle and de Meeus, 2002). However, no such groups were identified in the present study. We have observed similar pattern for saithe, Pollachius virens (Eiríksson and Árnason, 2015). Sex-biased migration pattern is well known in freshwater fish species (e.g. Fraser et al, 2004; Cano et al, 2008; Brunelli et al, 2010) and in many other different vertebrate taxa (for overview see Petit and Excoffier, 2009). However, only few cases of such pattern have been described for marine fish (but see Schrey and Heist, 2003; Shaw et al, 2004). Based on a game-theoretic analysis (Perrin and Mazalov, 2000) this pattern may suggest intense local mate competition and polygyny in haddock. The present study was not designed to address the issue of sex-biased migration in haddock in particular. A strategy for further research on sex-biased migration in haddock would be to compare nuclear and mitochondrial molecular markers (Prugnolle and de Meeus, 2002). Also, future studies on haddock migration and behaviour (e.g. tag recapture studies) should take sex variation into account. Alternative explanation for the observed difference between sexes is that the difference was due to nuclear and mitochondrial genetic incompatibility. However, as the observed variation is almost exclusively synonymous this is not a likely explanation. The observation of apparently fine scale structure for females in Faroe Islands would suggest that more difference might be expected among geographic localities further apart. Such pattern is not found. The present study shows weak population genetic structure in haddock across the North Atlantic, although Greenland was seen as an outlier (Figure 3, Table 5). A weak geographic population structure is common for marine organisms, attributed to lack of physical barriers in the marine environment and high dispersal rate for eggs and larvae with ocean currents (Waples, 1987). Extensive gene flow is predicted in the present study that may partly be due to such processes. The weak spatial genetic structure differs from what has been shown for Atlantic cod, that shows more trans Atlantic genetic differentiation (Árnason, 2004). The similarity of genetic variation observed in different geographic regions for haddock may have different explanations that can be difficult to disentangle. It may suggest enough gene flow among geographic regions to act against differentiation due to genetic drift. Theoretically it has been shown that a low level of gene flow may be sufficient to maintain genetic homogeneity among neutral loci (Wright, 1931). Egg and larvae drifting with water currents can account for such gene flow (Brickman et al, 2006). Although juvenile and adult haddock is mostly found at shallow waters, extensive migrations are known to take place (Cohen et al, 1990), which also may contribute as homogenizing effect. In fact sex differences indicate that such migrations are likely, at least for adult males. Gene flow for haddock among Iceland, Faroe Islands and Norwegian waters was estimated to be high enough that these localities may be considered panmictic (Table 5). However, it must be acknowledged that ΦST may not be a valid indicator of migration rate (Whitlock and McCauley, 1999). Alternative explanation for the observed pattern is that the haddock effective population size (Ne ) is large, making genetic drift slow, and that the observed genetic similarity among geographic regions is maintained due to short time since the current population structure has become established. Indeed the marine ecosystem in the North Atlantic, in its present state, can be considered young as the end of the last ice age occurred only about fifteen thousand years ago (Andersen et al, 2004). Overall a small part of the observed genetic variation was explained by variation within countries (AMOVA, Table 4). Looking more closely at the within countries comparisons showed that genetic dif- 81 Paper IV ference between Western and Northern Iceland (ΦST = 0.063) is more than twice that found between Greenland and the North Sea (ΦST = 0.025). The difference between Greenland and the North Sea was the largest genetic pairwise difference observed between countries. This genetic variation within Iceland might be associated with variation in population density and dynamics around Iceland (Anonymous, 2011; Björnsson et al, 2007). However, the observed variation between sampling locations in Iceland could be a temporally unstable phenomena although the present study shows that there is no difference in genetic variation among different haddock cohorts. Also the age composition between the samples for N-Iceland and W-Iceland is not significantly different. Further studies are needed to determine the temporal stability and the significance of the observed variation. No genetic difference was observed among sampling locations in Norway. This is in accordance with the findings of Giæver and Forthun (1999). No genetic difference was observed within Greenland and the difference found within Faroe Islands was related to sex difference. Significant differences among groups within countries contrasts with limited differences among countries. Perhaps grouping by countries does not reflect population genetic structure for haddock. However, an attempt to identify spatial genetic structure using SAMOVA failed to reflect any obvious geographical pattern for the observed genetic variation. The data may suggest a fine scale population genetic structure in haddock as pooling of groups over larger geographic areas (as done for countries) may level out differences among groups, as the mean haplotype frequencies can be the same when different groups are pooled (Excoffier, 2007). Although there are indications of possible fine scale/hidden spatial structure the present study is not conclusive on this. The limited trans Atlantic population genetic structure may be due to high levels of gene flow. This is in contrast to some earlier findings for haddock, where spatial genetic structure was described (Jamieson and Birley, 1989; Lage et al, 2001). However, as addressed earlier the effects of natural selection can explain the difference (Williams et al, 1973; Guinand et al, 2004). High levels of gene flow, as other studies have also indicated (Purcell et al, 1996; Giæver and Forthun, 1999), is more likely to explain the present data. The fact that nonsynonymous substitutions are rare suggests weak or no natural selection acting on the observed polymorphism in haddock. This makes the COI fragment used in the present study suitable for population genetic studies in haddock. However, neutrality tests used in the present study showed deviations from neutral expectations. The tests are sensitive to demographic events and the observed pattern could be the result of recent population expansion (Tajima, 1993; Excoffier, 2007). Similar pattern has been found in many related taxa and (Pálsson et al, 2009; Eiríksson and Árnason, 2014, 2015). In fact the observed genetic variation indicates two phases of variable population expansion rate for haddock (Figure 7). The earlier phase estimated to have taken place towards the end of the last glacial maximum (as defined by Clark et al, 2009) but before the sudden warming of the earths climate ca. 14 kyr ago (Figure 7). The timing of these events are based on the molecular clock rate used. Although 2% sub./site/Myr is commonly used for animal mtDNA (Avise, 2004) variable mutation rates have been estimated (Bermingham et al, 1997; Knowlton and Weigt, 1998; Avise, 2004; Burridge et al, 2008). Using strict molecular clock rate and mutation of 2% sub./site/Myr may be too low, in particular when considering the ’pedigree rate’ (ca. 6% sub./site/Myr) in the study by Burridge et al (2008). Underestimating the mutation rate will lead to overestimation in time to the event. Thus, the observed expansion events in the present study may have taken place more recently, if mutation rate is higher. If mutation rate is as high as suggested for ’pedigree rate’ in Burridge et al (2008), the timing of the expansion event would be much more recent, or ca. 5 kyr ago. Using the ’phylogenetic rate’ the observed pattern of population size change fits quite well to the ecological change occurring in the marine environment at the end of last ice age. However, if ’pedigree rate’ is used other explanations will be needed for the observed genetic variation in haddock, such as effects of natural selection (Bazin et al, 2006). This finding shows the importance of better understanding of molec- 82 ular clock rate for timing of demographic events, for proper interpretation of observed genetic variation in general. However, the overall distribution of nucleotide mismatch differed slightly form expectations of population sudden expansion model. The deviation may be the result of extended and variable expansion event and/or weak population genetic structure (Figure 6). The present study shows the shortcoming of the mismatch analysis in reconstructing demographic history. Although useful for identifying genetic pattern resulting from a single demographic expansion (Rogers and Harpending, 1992), more complicated expansion event may explain the poor fit to the sudden expansion model. The Bayesian skyline analysis makes better use of the genealogical information, allowing for better reconstructing of the demographic history (see e.g. Emerson et al, 2001; Ho and Shapiro, 2011). Here it can be noted that Bazin et al (2006) suggested that repeated adaptive selection events was the dominant factor shaping the variation in animal mitochondrial genome. However, later studies have not supported the generality of this finding (e.g. McCusker and Bentzen, 2010, and references therein). The observed genetic variation in haddock may thus be shaped by adaptive selection dating back to the predicted expansion event. However, population expansion correlating with environmental change must be regarded as more likely explanation for the observed genetic variation in haddock. 6 Acknowledgements This project would not have been possible without assistance in obtaining samples from different geographic regions. We are thankful for assistance in obtaining samples to Bo Bergstrøm at the Greenland Nature Institute, Greenland, Kristján Kristinsson at the Icelandic Marine Research Institute (MRI), Iceland, Petur Steingrund at Faroe Marine Research Institute (FAMRI), Faroe Islands, Jarle Mork at the Norwegian University of Science and Technology (NTNU), Norway, and Remment ter Hofstede at Institute for Marine Resources and Ecosystem Studies (IMARES) in the Netherlands. We also would like to thank Snæbjörn Pálsson for useful discussions during the work and the population genetics group at the University of Iceland for various help during laboratory work. This project was supported by the Icelandic research fund and the Icelandic research fund for graduate students of The Icelandic Centre for Research and the Icelandic Marine Research Institute. References Andersen KK, Azuma N, Barnola JM, Bigler M, Biscaye P, Caillon N, Chappellaz J, Clausen HB, DahlJensen D, Fischer H, Flückiger J, Fritzsche D, Fujii Y, Goto-Azuma K, Grønvold K, Gundestrup NS, Hansson M, Huber C, Hvidberg CS, Johnsen SJ, Jonsell U, Jouzel J, Kipfstuhl S, Landais A, Leuenberger M, Lorrain R, Masson-Delmotte V, Miller H, Motoyama H, Narita H, Popp T, Rasmussen SO, Raynaud D, Rothlisberger R, Ruth U, Samyn D, Schwander J, Shoji H, Siggard-Andersen ML, Steffensen JP, Stocker T, Sveinbjörnsdóttir AE, Svensson A, Takata M, Tison JL, Thorsteinsson T, Watanabe O, Wilhelms F, White JWC, members NGICP (2004) High-resolution record of northern hemisphere climate extending into the last interglacial period. Nature 431:147–151. Anonymous (2011) State of marine stocks in Icelandic waters 2010/2011. Prospects for the quota year 2010/2011 (in Icelandic with English abstract). Hafrannsóknastofnunin Fjölrit 159:1–180. Árnason E (2004) Mitochondrial cytochrome b DNA variation in the high-fecundity Atlantic cod: TransAtlantic clines and shallow gene genealogy. Genetics 166:1871–1885. 83 Paper IV Avise JC (2004) Molecular Markers, Natural History, and Evolution, second edition edn. Sinauer Associates, Inc. Bandelt HJ, Forster P, Röhl A (1999) Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol 16:37–48. Bazin E, Glémin S, Galtier N (2006) Population size does not influence mitochondrial genetic diversity in animals. Science 312:570–572. Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: A practical and powerful approach to multiple testing. J R Statist Soc B 57:pp. 289–300. Bermingham E, McCafferty SS, Martin AP (1997) Fish biogeography and molecular clocks: Perspectives from the Panamanian isthmus. In: Kocher TD, Stepien CA (eds) Molecular systematics of Fishes, San Diego: Academic Press, pp 113–128. Björnsson H, Sólmundsson JS, Kristinsson K, Steinarsson BÆ, Hjörleifsson E, Jónsson E, Pálsson J, Pálsson ÓK, Bogason V, Þorsteinn Sigurðsson (2007) Stofnmælingar botnfiska á Íslandsmiðum (SMB) 19852006 og stofnmæling botnfiska að haustlagi (SMH) 1996-2006. undirbúningur, framkæmd og helstu niðurstöður (in Icelandic with English abstract). Fjölrit nr.131, Hafrannsóknastofnunin, Reykjavík. Brickman D, Marteinsdottir G, Logemann K, Harms IH (2006) Drift probabilities for Icelandic cod larvae. ICES J Mar Sci 64:1–11. Brunelli JP, Steele CA, Thorgaard GH (2010) Deep divergence and apparent sex-biased dispersal revealed by a Y-linked marker in rainbow trout. Mol Phylogenet Evol 56:983–990. Burridge CP, Craw D, Fletcher D, Waters JM (2008) Geological dates and molecular rates: fish DNA sheds light on time dependency. Mol Biol Evol 25:624–633. Cano JM, Mäkinen HS, Merilä J (2008) Genetic evidence for male-biased dispersal in the three-spined stickleback (Gasterosteus aculeatus). Mol Ecol 17:3234–3242. Clark PU, Dyke AS, Shakun JD, Carlson AE, Clark J, Wohlfarth B, Mitrovica JX, Hostetler SW, McCabe AM (2009) The last glacial maximum. Science 325:710–714. Cohen D, Inada T, Iwamoto T, Scialabba N (1990) Gadiform fishes of the world (Order Gadiformes). An annotated and illustrated catalogue of cods, hakes, grenadiers and other gadiform fishes known to date. FAO Fisheries Synopsis. No. 125, Vol. 10. Rome. Drummond AJ, Rambaut A, Shapiro B, Pybus OG (2005) Bayesian coalescent inference of past population dynamics from molecular sequences. Mol Biol Evol 22:1185–1192. Dupanloup I, Schneider S, Excoffier L (2002) A simulated annealing approach to define the genetic structure of populations. Mol Ecol 11:2571–2581. Eiríksson GM, Árnason E (2014) Mitochondrial DNA sequence variation in whiting Merlangius merlangus in the North East Atlantic. Environ Biol Fish 97:103–110. Eiríksson GM, Árnason E (2015) Gene flow across the N-Atlantic and sex-biased dispersal inferred from mtDNA sequence variation in saithe, Pollachius virens. Environ Biol Fish 98:67–79. 84 Emerson BC, Paradis E, Thébaud C (2001) Revealing the demographic histories of species using DNA sequences. Trends Ecol Evol 16:707 – 716. Ewing B, Hillier L, Wendl MC, Green P (1998) Base-calling of automated sequencer traces using phred. I. accuracy assessment. Genome Res 8:175–185. Excoffier L (2007) Analysis of population subdivision. In: DJBalding, MBishop, CCannings (eds) Handbook of Statistical Genetics, John Wiley and Sons, Ltd., Chichester, vol 2, pp 980–1020. Excoffier L, Lischer HEL (2010) Arlequin suite ver 3.5: A new series of programs to perform population genetics analyses under linux and windows. Mol Ecol Resour 10:564–567. Excoffier L, Smouse PE, Quattro JM (1992) Analysis of molecular variance inferred from metric distances among DNA haplotypes: Application to human mitochondrial DNA restriction data. Genetics 131:479– 491. Fraser DJ, Lippé C, Bernatchez L (2004) Consequences of unequal population size, asymmetric gene flow and sex-biased dispersal on population structure in brook charr (Salvelinus fontinalis). Mol Ecol 13:67– 80. Furrer R, Nychka D, Sain S (2011) fields: Tools for spatial data. R package version 6.5.2. Galtier N, Gouy M, Gautier C (1996) SEAVIEW and PHYLO_WIN: two graphic tools for sequence alignment and molecular phylogeny. CABIOS, Comput Appl Biosci 12:543–548. Giæver M, Forthun J (1999) A population genetic study of haddock (Melanogrammus aeglefinus) in Northeast Atlantic waters based on isozyme data. Sarsia 84:89–98. Gordon D, Abajian C, Green P (1998) Consed: a graphical tool for sequence finishing. Genome Research 8:195–202. Guinand B, Lemaire C, Bonhomme F (2004) How to detect polymorphisms undergoing selection in marine fishes? a review of methods and case studies, including flatfishes. J Sea Res 51:167–182. Hartl DL, Clark AG (2006) Principles of Population Genetics, 4th edn. Sinauer Associates, Inc Publishers, Sunderland, Massachusetts. Hislop JRG (1984) A comparison of reproductive tactics and strategies of cod, haddock, whiting and Norway pout in the North Sea. In: Potts GW, Wootton RJ (eds) Fish Reproduction: Strategies and Tactics, Academic Press, London, pp 311–328. Ho SYW, Shapiro B (2011) Skyline-plot methods for estimating demographic history from nucleotide sequences. Mol Ecol Resour 11:423–434. Jamieson A, Birley AJ (1989) The distribution of transferrin alleles in haddock stocks. ICES J Mar Sci 45:248–262. Jónsson G (1992) Íslenskir fiskar (in Icelandic). Fjölvaútgáfan, Reykjavík. Knowlton N, Weigt LA (1998) New dates and new rates for divergence across the Isthmus of Panama. Proc R Soc B 265:2257–2263. 85 Paper IV Lage C, Purcell M, Fogarty M, Kornfield I (2001) Microsatellite evaluation of haddock (Melanogrammus aeglefinus) stocks in the Northwest Atlantic Ocean. Can J Fish Aquat Sci 58:982–990. Librado P, Rozas J (2009) Dnasp v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics 25:1451–1452. Magnussen E (2007) Interpopulation comparison of growth patterns of 14 fish species on Faroe Bank: Are all fish species on the bank fast-growing? J Fish Biol 71:453–475. McCusker MR, Bentzen P (2010) Positive relationships between genetic diversity and abundance in fishes. Mol Ecol 19:4852–4862. Nielsen EE, Hansen MM, Meldrup D (2006) Evidence of microsatellite hitch-hiking selection in Atlantic cod (Gadus morhua L.): Implications for inferring population structure in nonmodel organisms. Mol Ecol 15:3219–3229. Nielsen R (2001) Statistical tests of selective neutrality in the age of genomics. Heredity 86:641–647. Pampoulie C, Jakobsdóttir KB, Marteinsdóttir G, Thorsteinsson V (2008) Are vertical behaviour patterns related to the pantophysin locus in the Atlantic cod (Gadus morhua L.)? Behav Genet 38:76–81. Paradis E, Claude J, Strimmer K (2004) APE: analyses of phylogenetics and evolution in R language. Bioinformatics 20:289–290. Perrin N, Mazalov V (2000) Local competition, inbreeding, and the evolution of sex-biased dispersal. Am Nat 155:116–127. Petit RJ, Excoffier L (2009) Gene flow and species delimitation. Trends Ecol Evol 24:386–393. Posada D (2008) jmodeltest: phylogenetic model averaging. Mol Biol Evol 25:1253–1256. Prugnolle F, de Meeus T (2002) Inferring sex-biased dispersal from population genetic tools: a review. Heredity 88:161–165. Purcell MK, Kornfield I, Fogarty M, Parker A (1996) Interdecadal heterogeneity in mitochondrial DNA of Atlantic haddock (Melanogrammus aeglefinus) from Georges bank. Mol Mar Biol Biotechnol 5:185–192. Pybus OG, Rambaut A, Harvey PH (2000) An integrated framework for the inference of viral population history from reconstructed genealogies. Genetics 155:1429–1437. Pálsson S, Källman T, Paulsen J, Árnason E (2009) An assessment of mitochondrial variation in Arctic gadoids. Polar Biol 32:471–479. R development core team (2008) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. Reiss H, Hoarau G, Dickey-Collas M, Wolff WJ (2009) Genetic population structure of marine fish: Mismatch between biological and fisheries management units. Fish Fish 10:361–395. Rogers A, Harpending H (1992) Population growth makes waves in the distribution of pairwise genetic differences. Mol Biol Evol 9:552–569. Rose GA (2005) On distributional responses of North Atlantic fish to climate change. ICES J Mar Sci 62:1360–1374. 86 Schneider S, Excoffier L (1999) Estimation of past demographic parameters from the distribution of pairwise differences when the mutation rates vary among sites: Application to human mitochondrial DNA. Genetics 152:1079–1089. Schrey AW, Heist EJ (2003) Microsatellite analysis of population structure in the shortfin mako (Isurus oxyrinchus). Can J Fish Aquat Sci 60:670–675. Shaw PW, Arkhipkin AI, Al-Khairulla H (2004) Genetic structuring of Patagonian toothfish populations in the Southwest Atlantic ocean: the effect of the Antarctic polar front and deep-water troughs as barriers to genetic exchange. Mol Ecol 13:3293–3303. Tajima F (1993) Statistical analysis of DNA polymorphism. Jpn J Genet 68:567–595. Tamura K, Nei M (1993) Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol Biol Evol 10:512–526. Tamura K, Dudley J, Nei M, Kumar S (2007) MEGA4: Molecular evolutionary genetics analysis (MEGA) software version 4.0. Mol Biol Evol 24:1596–1599. Ursvik A, Breines R, Christiansen JS, Fevolden SE, Coucheron DH, Johansen SD (2007) A mitogenomic approach to the taxonomy of pollocks: Theragra chalcogramma and T. finnmarchica represent one single species. BMC Evol Biol 7:86. Walsh P, Metzfer D, Higuchi R (1991) Chelex 100 as a medium for simple extraction of DNA for PCRbased typing from forensic material. BioTechniques 10:506–513. Waples RS (1987) A multispecies approach to the analysis of gene flow in marine shore fishes. Evolution 41:385–400. Ward RD, Zemlak TS, Innes BH, Last PR, Hebert PDN (2005) DNA barcoding Australia’s fish species. Philos Trans R Soc, B 360:1847–1857. Whitlock MC, McCauley DE (1999) Indirect measures of gene flow and migration: FST 6= 1/(4Nm+1). Heredity 82:117–125. Williams GC, Koehn RK, Mitton JB (1973) Genetic differentiation without isolation in the American eel, Anguilla rostrata. Evolution 27:192–204. Wright S (1931) Evolution in Mendelian populations. Genetics 16:97–159. Zwanenburg KCT, Bentzen P, Wright JM (1992) Mitochondrial DNA differentiation in western North Atlantic populations of haddock (Melanogrammus aeglefinus). Can J Fish Aquat Sci 49:2527–2537. 87 Paper IV 7 Tables Table 1: Sampling year and sampling sites for haddock in different countries. In total 884 fish were sampled and successfully analyzed. Position is given as an average of the latitude (Lat.) and the longitude (Lon.) of the sampling region. Position is given in degrees and decimal minutes. N sample size. Locality Month Year Lat. Lon. N Greenland NW Greenland SW Greenland E Iceland N Iceland E Iceland S Iceland W Faroe Plateau Faroe Bank Norway 70◦ N Norway 69◦ N Norway 66◦ N Norway 63◦ N North Sea July July July October October October October March March March September October August February 2006 2006 2006 2004 2004 2004 2004 2006 2006 1994 1993 1992 1992 2006 63◦ 81 52◦ 81 60◦ 27 46◦ 87 65◦ 06 33◦ 08 65◦ 94 19◦ 64 65◦ 74 13◦ 89 63◦ 61 15◦ 16 64◦ 83 24◦ 04 61◦ 05 8◦ 66 61◦ 93 7◦ 89 70◦ 15 −28◦ 81 69◦ 34 −18◦ 65 66◦ 27 −13◦ 58 63◦ 11 −6◦ 78 56◦ 13 −2◦ 24 22 38 3 41 46 45 43 144 279 75 17 38 51 42 Table 2: Amino acid substitutions of various haplotypes and their frequencies (N). Haplotype Substitution N C0I-026 C0I-043 C0I-064 C0I-079 C0I-100 Valine (V) Isoleucine (I) Isoleucine (I) Aspartic acid (D) Aspartic acid (D) ↔ ↔ ↔ ↔ ↔ Methionine (M) Valine (V) Serine (S) Glycine (G) Asparagine (N) 2 1 1 1 1 Table 3: Molecular diversity in haddock from the North Atlantic.S number of segregating sites; k number of haplotypes; ĥ haplotype diversity; π̂ nucleotide diversity; σ̂ standard error. Locality S k ĥ ± σ̂ĥ π̂ ± σ̂π̂ (×100) Greenland NW Greenland SW Iceland N Iceland E Iceland S Iceland W Faroe Plateau Faroe Bank Norway 70◦ N Norway 69◦ N Norway 66◦ N Norway 63◦ N North Sea 88 11 8 9 16 13 8 26 38 19 10 11 9 14 10 9 10 17 12 8 25 51 18 9 10 9 13 0.89 ± 0.04 0.80 ± 0.03 0.74 ± 0.05 0.84 ± 0.04 0.79 ± 0.05 0.72 ± 0.04 0.78 ± 0.03 0.83 ± 0.01 0.81 ± 0.03 0.83 ± 0.09 0.81 ± 0.04 0.76 ± 0.04 0.81 ± 0.04 0.33 ± 0.21 0.22 ± 0.16 0.20 ± 0.15 0.27 ± 0.18 0.25 ± 0.17 0.23 ± 0.16 0.24 ± 0.16 0.27 ± 0.18 0.27 ± 0.18 0.37 ± 0.24 0.26 ± 0.18 0.25 ± 0.17 0.24 ± 0.17 Table 4: AMOVA among countries using Tamura and Nei (1993) genetic distance (upper) and based on haplotype frequency only (lower). Φ/F are the various intraclass fixation indices. Source of Variance Percentage variation d.f. component of variation Φ/F statistic P Among countries 4 Among areas within countries 8 Within areas 868 −0.001 0.004 0.772 −0.12 0.53 99.60 ΦCT = −0.001 0.57 ΦSC = 0.005 0.11 ΦST = 0.004 0.10 Among countries 4 Among areas within countries 8 Within areas 868 -0.0003 0.001 0.401 −0.07 0.35 99.73 FCT = −0.001 FSC = 0.004 FST = 0.003 0.44 0.16 0.16 Table 5: Gene flow M = Ne m (upper part) and ΦST (lower part in bold) for haddock, pairwise comparison between countries. Where significant genetic difference (P<0.05) was observed, ΦST value is labelled with *. Country Greenl. Iceland Faroe Isl. Norway N. Sea Greenl. Iceland Faroe Isl. Norway N. Sea 0.016* 0.009 0.016* 0.025* 32 -0.002 -0.002 0.002 56 ∞ -0.002 0.006 30 ∞ ∞ 0.006 20 271 91 79 - Table 6: AMOVA between areas within Faroe Islands using Tamura and Nei (1993) genetic distance (upper) and based on haplotype frequency only (lower). Φ/F are the various intraclass fixation indices. Source of Variance Percentage variation d.f. component of variation Φ/F statistic P Among areas Among stations within areas Within stations 1 18 378 0.011 -0.002 0.782 1.34 -0.19 98.85 ΦCT = 0.013 0.02 ΦSC = −0.002 0.47 ΦST = 0.012 0.28 Among areas Among stations within areas Within stations 1 18 378 0.001 -0.002 0.408 0.28 -0.60 100.32 FCT = 0.003 FSC = −0.006 FST = −0.003 0.13 0.83 0.73 89 Paper IV 8 Figures Figure 1: Sampling stations for haddock are indicated with red dots. 90 Figure 2: The overall relationship among the 102 observed COI haplotypes. The size of circles are proportional to the frequency of the given haplotype (smallest, N = 1). Different haplotypes are represented with different shades. Black: COI-001, dark gray: C0I-002, intermediate gray: COI-003 and light gray: COI-004. Rare haplotypes are represented with open circles. The red circles represent an unobserved but inferred haplotype. Lines between haplotypes reflect mutational steps between alleles. Figure 3: Comparison of haddock COI haplotype frequency among countries. Different haplotypes are represented with different shades. Black: COI-001, dark gray: COI-002, intermediate gray: COI-003 and light gray:COI-004. White represents rare haplotypes pooled. 91 Paper IV Figure 4: The relationship among pairwise genetic differentiation (ΦST ) and geographic distance (km) for the defined groups. The filled circles represent ΦST values that are significant at α -level 0.05, open circles show non significant values. The trend line shows small, non-significant, increase in ΦST with geographic distance. Figure 5: Comparison of haddock COI haplotype frequency among different localities around Iceland (a) (S: South; W: West; N: North; E: East) and areas within the Faroe Islands (b) (FB: Faroe Bank; FP: Faroe Plateau). Different haplotypes are represented with different shades. Black: COI-001, dark gray: COI-002, intermediate gray: COI-003 and light gray:COI-004. White represents rare haplotypes, pooled in (b) 92 Figure 6: The pairwise nucleotide mismatch distribution for the 599 base pair COI fragment in haddock and the prediction of a sudden expansion model is expressed with line and dots. 93 Paper IV Figure 7: (a) Bayesian skyline plot for haddock of changes in female effective population size (Ne ) against time in kilo years (kyr) before present. (b) Changes in the δ O18 concentration in the North Greenland Ice core. The shady interval covers the last glacial maximum (LGM)(Clark et al, 2009) (δ O18 data from: www.gfy.ku.dk\~www-glac\ngrip) 94 Figure 8: Fixation indexes for the SAMOVA results as samples are assigned into variable number of groups (2-12). 95 Paper IV Figure 9: Comparions of haddock COI haplotype frequency between males (a) and females (b) sampled in the Nortern (N) part (>67◦ N)and Southern (S) part (>67◦ N) of along the Norwegian Norwegina waters. (Females N, N = 45; Females S, N = 33; Males N, N = 47; Males S, N = 56). Different haplotypes are represented with different shades. Black: COI-001, dark gray: COI-002, intermediate gray: COI-003 and light gray:COI-004. White represents rare haplotypes pooled. 96 Figure 10: Comparison of haddock COI haplotype frequency between males (a) and females (b) sampled in waters within different areas around Faroe Islands (FB: Faroe Bank, FP: Faroe Plateau). (Females FB, N = 90; Females FP, N = 94; Males FB, N = 158; Males FP, N = 49). Different haplotypes are represented with different shades. Black: COI-001, dark gray: COI-002, intermediate gray: COI-003 and light gray:COI-004. White represents rare haplotypes pooled. 97 Paper V Mitochondrial DNA Sequence Variation in Whiting Merlangius merlangus in the North East Atlantic Guðni Magnús Eiríksson and Einar Árnason, 2014 Environmental Biology of Fishes, 97: 103–110. doi:10.1007/s10641-013-0143-5 © Environmental Biology of Fishes Guðni Magnús Eiríksson actively particiated in assembling samples from different parts of the world. Guðni did all the molecular laboratory work: DNA isolation, mtDNA amplification, purification and prepared the samples for electrophoresis on an ABI-3100 automatic sequencer. Guðni was also in charge of data analysis and interpretation. Guðni was in charge of writing the manuscript, corresponded to the comments of reviewers and finalized the article. Professor Einar Árnason actively participated in all steps of the work. Einar took part in the experimental design, supervised the molecular work, took part in data analysis, interpretation of the data and participated in writing the manuscript and finalizing the article. 99 Paper V 100 Environ Biol Fish (2014) 97:103–110 DOI 10.1007/s10641-013-0143-5 Mitochondrial DNA sequence variation in whiting Merlangius merlangus in the North East Atlantic Guðni Magnús Eiríksson & Einar Árnason Received: 14 August 2012 / Accepted: 17 April 2013 / Published online: 12 May 2013 # Springer Science+Business Media Dordrecht 2013 Abstract Genetic variation in whiting Merlangius merlangus was examined using a 621 base pair fragment of the cytochrome c oxidase subunit I mitochondrial gene in 138 individuals sampled from Iceland, Norway and the North Sea. In total 10 segregating sites were observed defining 12 haplotypes. Three of the haplotypes were found at high frequencies (>5 %). All but one mutations were synonymous and the nonsynonymous mutation was found as a singleton. This suggests weak or no natural selection acting on the observed polymorphism making it useful for examination of population breeding structure. The genetic variation suggests that the whiting population has undergone sudden expansion in the past, estimated to have started 70 Kyr ago, during the last glacial period. Spatial genetic analysis reveals genetic uniformity across long geographic distances suggesting high level of gene flow. The long pelagic phase at early age, allowing for high dispersal rate, may partly explain the observed pattern. Keywords mtDNA . COI . Demography . Gene flow G. M. Eiríksson (*) : E. Árnason Institute of Life and Environmental Sciences, University of Iceland, Sturlugata 7, 101 Reykjavík, Iceland e-mail: [email protected] Introduction Genetic variation and genealogy of high latitude organism have been strongly influenced by climatic oscillations during the Pliocene and Pleistocene (Bernatchez and Dodson 1991; Hewitt 1996; Avise and Walker 1998; Wares and Cunningham 2001; Pálsson et al. 2009). The climatic oscillations have caused population bottlenecks for many species followed by a period of expansion. Genetic variation is lost through extinction of lineages during bottleneck period generating shallow genealogy and reducing the effective population size, Ne. During period of population expansion however the probability of lineage extinction is reduced and genetic variation increases in a population (Avise 2000). The expected pattern of genetic variation as a result of repeated demographic changes is a shallow genealogy with many rare types. This is indeed the observed pattern in many marine fish species (e.g. Sigurgíslason and Árnason 2003; Árnason 2004; Pálsson et al. 2009; Liu et al. 2010, Eiríksson unpubl. data). Many studies have demonstrated a lack of spatial genetic structure in marine species indicating high level of gene flow among groups across long distances (for overview see Reiss et al. 2009). This has been attributed to lack of physical barriers in the marine environment and high dispersal rate for eggs and larvae with ocean currents (Waples 1987). Genetic uniformity can also be maintained if population size is large (large Ne) as it will reduce the rate of genetic drift. A long time may 101 Paper V 104 thus be needed in order for genetic differentiation to become established among populations of large size. Knowledge of the population genetic structure of harvested fish species is important for responsible management (Reiss et al. 2009). Fish distribution in the North Atlantic is affected by environmental conditions and changes have been observed, in particular in the last few decades, most likely due to climate change (Rose 2005; ter Hofstede et al. 2010). Understanding of gene flow among geographic regions may be useful for predicting the effects of shifts in distributions. Whiting Merlangius merlangus is a commercially important fish species in the North Atlantic especially in the North Sea. It is distributed from northern Norway and Iceland in the north to the Bay of Biscay in the south. Whiting is a shallow water species mostly found at 10–200 m depth (Jónsson 1992). It has a long pelagic phase early in life compared to related gadoids such as Atlantic cod Gadus morhua, haddock Melanogrammus aeglefinus and saithe Pollachius virens. Potentially this results in a higher dispersal rate (Hislop 1984). Age at maturation is variable among geographic localities, however, in general maturation is reached at 2–4 years overall, earlier in the southern part of its distribution (Hislop 1984; Jónsson 1992; Cohen et al. 1990). Due to its commercial importance many studies have focused on whiting biology in the North East Atlantic (Cohen et al. 1990). Tag and recapture studies and levels of parasite infestation in different localities have been interpreted as reflecting limited migrations among localities (Hislop and MacKenzie 1976). Thus two distinct populations have been assumed to exist north and south of Dogger Bank in the North Sea (Hislop and MacKenzie 1976). Pilcher et al. (1989) described spatial and seasonal variation in parasite prevalence and infestation intensity within the North Sea. A latitudinal cline was observed for infestation of Diclidophora merlangi’s infections with infestation prevalence and intensity increasing steadily from south to north. A population genetic study on whiting, using three microsatellite loci, showed significant difference between sampling locations within the North Sea for two loci (Rico et al. 1997). Charrier et al. (2007) used seven microsatellite markers for whiting in the North East Atlantic but limited genetic differentiation among sampling localities was observed, indicating high level of gene flow over long geographic distances. However, significant genetic difference was also observed among sampling stations within the North Sea, albeit with low FST values. 102 Environ Biol Fish (2014) 97:103–110 In the present study genetic variation at a 621 base pair fragment of the cytochrome c oxidase subunit I (COI) mitochondrial gene in whiting from south Iceland, north Norway and the North Sea was examined. Demographic factors and spatial genetic variation were examined. The findings were compared to observed genetic variation in related species (Árnason 2004; Pálsson et al. 2009; Liu et al. 2010, Eiríksson unpubl. data). Materials and methods Fish samples Whiting was sampled from Icelandic and Norwegian waters and from the North sea (Table 1). In all cases gill tissue samples were preserved in 96 % ethanol. It can be noted that samples from Norway were collected in 1992, 14 years earlier than those collected in North sea. However, this should not be a concern when analyzing historical population genetic breeding structure. Molecular methods DNA was isolated using chelex method (Walsh et al. 1991) with some modifications. A pair of primers, FishF1-5′TCAACCAACCACAAAGACATTGGCA C3′ and FishR1-5′TAGACTTCTGGGTGGCCA AAGAATCA3′, was used to amplify a 621 base pair fragment corresponding to positions 5530–6150 in whiting complete mitochondrial genome (Roques et al. 2006; GenBank accession number: NC_007395.1). The primers used were developed by Ward et al. (2005) and have been used for many different fish species. A 19 μl PCR reaction mix contained 0.17 mM dNTP, 11 μg/μl Table 1 Whiting samples from different areas. The whiting from the Icelandic and Norwegian waters were sampled in a single sampling location. Whiting from North Sea were sampled in seven different locations (all south of Dogger Bank), the position is given as an average of the latitude and the longitude of the sampling locations. Location is given in degrees and decimal minutes Area N Month Year Location Iceland 31 October 2004 64°39N,23°04W Norway 78 September 1992 70°6N,21°43E North Sea 30 January 2006 53°88N,6°39E Environ Biol Fish (2014) 97:103–110 105 and sequence data was viewed using Consed (Gordon et al. 1998) and Seaview (Galtier et al. 1996). Trace files for all observed haplotypes were inspected manually, the sequence integrity confirmed and bad sequences were rejected. The software Network 4.5.1.6 was used to produce median joining network among haplotypes (Bandelt et al. 1999). MEGA 4 (Tamura et al. 2007) was used for translation of nucleotide sequence to amino acid sequence for the observed haplotypes and for the counting of synonymous and nonsynonymous sites. Analysis of Molecular variation (AMOVA), based on the method developed by Excoffier et al. (1992), was carried out using Arlequin 3.5.1.2 (Excoffier and Lischer 2010) both using genetic distance (Tamura and Nei 1993) and the haplotype frequency only. In both cases 10000 permutations were used to test the genetic structure of the observed sample. Pairwise FST comparison was made among countries. In order to test the significance of the FST between countries a null distribution of FST was generated by 10000 permutations. Gene flow, M=Nem, was estimated among the countries (where m is migration rate and M is the absolute number of migrants per subpopulation per generation). The migration M is derived directly from its relationship with the FST as M=1− FST /2FST , as calculated in Arlequin. Selective neutrality was tested using Fu’s FS test (number of simulations=1000), Chakraborty’s Bovine serum albumin (BSA), 0.36 pM of each primer (FishF1, FishR1), and 0.05 U Taq Polymerase buffered with 10×ThermoPol buffer (New England BioLabs (NEB), #M0267) together with DNA template (concentration ca. 0.26 ng/μl in reaction mix) and distilled and deionized water. The reaction mix was prepared on ice and immediately placed in a thermocycler when ready. The DNA was denatured at 94 °C for 5 min in the first cycle followed by 35 cycles of denaturation, annealing at 54 °C for 30 s and extension at 72 °C for 1 min. The cycles were followed by 7 min extension period at 72 °C. A 5 μl of the product was run for 10 min at 90 Volts on an 1.5 % agarose gel containing ethidium bromide. The gel was then inspected under UV light for amplified DNA. PCR products were enzymatically purified using Exonuclease I (NEB, #M0293L) and Antarctic Phosphatase (NEB, #M0289L). The purification was followed by direct sequencing using Applied Biosystems-BigDye Terminator v3.1 kit (and FishF1 primer) using standard protocol with modifications. The reaction product was analyzed on ABI-3100 automatic sequencer. Data analysis Base calling, data assembling and sequence alignment was carried out in Phred and Phrap (Ewing et al. 1998) Table 2 Segregating sites observed in a 621 base pair fragment of the COI for whiting. The number above each segregating site is equivalent to the site number as recorded for whiting complete mitochondrial genome (Roques et al. 2006). N, frequency of each haplotype in the overall sample Haplotype 5 5 6 7 5 7 0 3 5 7 4 8 5 7 6 0 5 8 6 2 5 8 8 9 5 9 9 1 6 0 0 6 6 1 3 8 6 1 4 2 N H01 G C C C G C C C G C 83 H02 . . . . . . . G . . 31 H03 . T . . . . . G . . 16 H04 . T . . . . . . . . 1 H05 . . T . . . . G . . 1 H06 . . . . . T . . . . 1 H07 . . . . . . T . . . 1 H08 . . . . . . . G . T 1 H09 . . . . . . . . A . 1 H10 . . . . C . . G . . 1 H11 . . . A . . . . . . 1 H12 A T . . . . . G . . 1 103 Paper V 106 Environ Biol Fish (2014) 97:103–110 Fig. 1 The overall relationship among the 12 observed COI haplotypes. The size of circles are proportional to the frequency of the given haplotype (smallest, N=1). Different haplotypes are represented with different shades. Black: H01, dark gray: H02, light gray: H03. Rare haplotypes are represented with white colour. Lines between haplotypes reflect mutational steps between alleles test and Tajima’s D test (number of simulations= 1000), implemented in Arlequin. Demographic population expansion was studied using the distribution of pairwise nucleotide mismatch (Rogers and Harpending 1992). The observed data was tested against simulated distributions predicting sudden population expansion as described in Schneider and Excoffier (1999). The mismatch calculations were carried out in Arlequin. If a mismatch distribution does not deviate from a model predicting population sudden expansion τ can be used to estimate the time from expansion: t=τ/2μ, where t is time in years, τ is mode of the mismatch distribution and μ is mutation rate in substitutions year−1(Rogers and Harpending 1992). In the analysis synonymous substitution rate μ=3.86×10−8 site−1year−1 was used, estimated for Atlantic cod (Árnason 2004), as it is likely to reflect neutral genetic variation. The population parameter θ was estimated using two different methods: θH based on observed homozygosity (or the probability of two randomly chosen sequences to be identical) and θπ is based on the pairwise nucleotide mismatches. Estimations of θ were carried out in Arlequin. θ is the effective population size Ne scaled with mutation rate. For mtDNA θ=2Neμ, where Ne is the effective population size and the μ is the overall mutation rate. Effective population size was calculated using the above relationship between θ and Ne. JN168869–JN168880). Three of the haplotypes were found at high frequencies (>5 %). The haplotypes form a shallow geneaology (Fig. 1). All but one mutations were synonymous. The nonsynonymous mutation was found as a singleton (H12) at site 5567: Serine ↔ Asparagine. Similar molecular diversity was observed in the samples from the different sampling sites (Table 3). Estimation of θ did not vary greatly among countries (Table 4) and reflect small effective population size (mean Ne =4.23×104 and 3.20×104 when based on θH and θπ respectively). Fu’s FS test for the pooled sample showed departure from mutation-drift equilibrium (FS =−7.15, P=0.005) were observed number of alleles is higher that expected. The same is true for Chakraborty’s test (P=0.006). Tajima’s D was negative but not significantly different from neutral expectation (D=−1.353, P=0.067). Population genetic structure —spatial variation Comparison of genetic variation among countries showed no genetic differentiation, tested both using genetic distance and haplotype frequency only (Table 5, Fig. 2). Pairwise FST between countries were all negative Table 3 Whiting molecular diversity at 621bp fragment of the COI gene. N, Sample size; h, haplotype diversity; π, nucleotide diversity Results Genetic variation In total 10 segregating sites were observed defining 12 haplotypes (Table 2; GenBank accession numbers: 104 π±S.E.(×100) Sampling area N h±S.E. Iceland 30 0.58±0.08 0.12±0.10 Norway 78 0.60±0.05 0.13±0.11 North Sea 31 0.59±0.09 0.15±0.12 Environ Biol Fish (2014) 97:103–110 107 Table 4 Estimates of θ for whiting from Iceland, Norway and the North Sea Statistic Iceland Norway North Sea θH 1.02 1.10 1.08 σθH 0.33 0.24 0.39 θπ 0.72 0.81 0.92 σθπ 0.62 0.66 0.73 (interpreted as close to zero) reflecting no genetic differentiation among countries. Therefore gene flow, M, among countries was estimated at infinity. The observed mismatch distribution for whiting is unimodal and does not deviate significantly from the sudden expansion model (PSSD =0.24, Fig. 3). The mode of the distribution τ is 0.891 (95 % CI=0.609– 1.312). Based on this the time since sudden expansion was estimated to be 70 Kyr (95 % CI=48–103 Kyr). Discussion Observed genetic variation in whiting indicate population sudden expansion estimated to originate within the middle of the most recent glaciation period, extending from approximately 110–10 Kyr before present (Petit et al. 1999; Andersen et al. 2004). This is similar to what has been observed in Atlantic cod (Bigg et al. 2008; Carr and Marshall 2008), Greenland cod Gadus ogac (Pálsson et al. 2009) and Pacific cod Gadus macrocephalus (Liu et al. 2010) and other related species (Eiríksson, unpubl. data). If the estimated timing of sudden expansion is correct it seems that factors other than the climatic oscillations alone may be responsible for the demographic history in whiting as well as for other related species. This may be due to complicated effects of water temperature and water level variation as well as interaction among different fish species. The observed polymorphism in the present project is mostly at synonymous sites and as such natural selection is not likely to explain the observed pattern. Although Fu’s FS and Chakraborty’s test indicate departure from selective neutrality other factors, such as demographic history, may be responsible for this outcome. Both tests use θ estimators to predict the total number of haplotypes in a sample. High number of alleles (haplotypes) are expected in expanding populations as the probability of lineage extinction is reduced (Avise 2000). The tests are known to be sensitive to sudden population expansion and as the data show good fit to the sudden expansion model this is a more likely explanation for the high number of observed haplotypes. The same has been observed in many other related species (Árnason 2004; Pálsson et al. 2009, Eiríksson unpubl. data). The present study indicates high level of gene flow for whiting among geographic localities in the North East Atlantic. No indication of population genetic substructure was revealed. The observed effective population size, Ne, is similar among the countries and much smaller than the actual population size as has been observed for other fish species (e.g., Árnason 2004). The high level of gene flow might be attributed to the long pelagic phase of whiting at young age (Hislop 1984). Genetic uniformity among groups can be maintained by regular migrations among geographic regions and the number of migrants does not need to be large. In fact, Wright (1931) theoretically showed that an exchange of a single individual per subpopulation every second generation could be sufficient to prevent genetic differentiation of neutral variation between groups. The evidence of genetic uniformity among samples across long geographical distances throughout the distribution range of whiting is at variance with a potential small scale spatial structure as has been suggested in the North Sea (Hislop and MacKenzie 1976; Pilcher Table 5 Whiting AMOVA among countries. Based on Tamura and Nei’s genetic distance (Tamura and Nei 1993) (upper) and based on haploype frequency only (lower) Source of variation d.f. Sum of squares Percentage of variation Ф/F statistic P ФST =−0.0067 0.901 Among countries 2 0.28 −1.65 Within countries 136 55.75 101.65 ФIT =0.4100 FST =−0.0048 Among countries 2 0.19 −1.67 Within countries 136 40.11 101.67 FIT =0.2950 105 0.921 Paper V 108 Environ Biol Fish (2014) 97:103–110 Fig. 2 Comparison of whiting COI haplotype frequency among countries. Different haplotypes are represented with different shades. Black: H01, dark gray: H02, light gray: H03. White represents rare haplotypes et al. 1989). The present study is, however, not conclusive on this since sampling was not carried out in different parts of the North Sea. Parasite composition of whiting in the North Sea may suggest separate populations but seasonal variation shows that the parasite composition can change over relatively short periods of time (Pilcher et al. 1989). Therefore a considerable migration among different sites within the North Sea may take place, followed by change in parasite infestation pattern due to different environments, without changing the observed pattern. Results of microsatellite studies for whiting indicate high level of gene flow as the present project also does (Rico et al. 1997; Charrier et al. 2007). However, in contrast to the present study a small scale spatial structure was indicated in earlier studies (Rico et al. 1997; Charrier et al. 2007). In both studies natural selection is considered unlikely for explaining the observed genetic variation, but selective neutrality was not tested (Rico et al. 1997; Charrier et al. 2007). Charrier et al. (2007) reported that only one of the seven microsatellite loci that they used (Pop18a) showed significant global genetic differentiation among samples (FST =0.0052, P< 0.05). Although this is a very low FST value it is nevertheless considerably higher than found for the other loci, making it a possible outlier and thus potentially under natural selection (Beaumont and Nichols 1996). Rico et al. (1997) provided the allele frequency of the three microsatellite loci used in their study. The frequency distribution of alleles showed clear deviation from Ewen’s prediction for allele frequency distribution for selective neutrality for two of the loci (Mmer-UEAW02 and Gmo2). Highly significant deviation from HWE was observed for all the loci in the study (Rico et al. 1997). Both of these phenomena may be due to natural selection 106 affecting the observed genetic variation. Although selective neutrality is very common for a polymorphism at a molecular level (as predicted by Kimura 1968) natural selection can shape genetic variation (Hartl and Clark 2006). It has been shown that in many cases microsatellites may have functional importance and may thus be under direct natural selection (Li et al. 2002). It is also common to find microsatellites within genes where they are likely to be in linkage disequilibrium with coding regions that are potentially under natural selection (Li et al. 2004). For Atlantic cod it has been shown that hitch-hiking selection can affect microsatellite variation (Nielsen et al. 2006). It is thus possible that a spatial genetic structure observed in the microsatellites studies (Rico et al. 1997; Charrier et al. 2007) was shaped Fig. 3 Pairwise nucleotide mismatch distribution for a 621 base pair COI fragment in whiting and prediction of a sudden expansion model expressed with line and dots Environ Biol Fish (2014) 97:103–110 by heterogeneous environment and reflect different habitats rather than the breeding structure of whiting. The genetic variation examined in the present study is almost exclusively synonymous and it is thus unlikely that the observed polymorphismis affected by natural selection. The microsatellite results for whiting need to be interpreted with caution as recognized by Rico et al. (1997). The present study and the main findings of the microsatellite studies (Rico et al. 1997; Charrier et al. 2007) indicate that high levels of gene flow, over long geographic distances, is a likely explanation for the observed low levels of genetic differentiation in whiting. Acknowledgments We are grateful to Kristján Kristinsson at Icelandic Marine Research Institute (MRI), Remment ter Hofstede at Institute forMarine Resources and Ecosystem Studies (IMARES) in the Netherlands and Jarle Mork at the Norwegian University of Science and Technology (NTNU) in Trondheim, Norway, for assisting in providing samples for the present project. We are thankful to Snæbjörn Pálsson for useful discussions during this work and members of the population genetics laboratory at the University of Iceland for help with molecular analysis during the work. We also thank two anonymous reviewers for their comments on the paper. This project was supported by the Icelandic research fund, the Icelandic research fund for graduate students of The Icelandic Centre for Research and the Icelandic Marine Research Institute. References Andersen KK, Azuma N, Barnola JM et al (2004) Highresolution record of northern hemisphere climate extending into the last interglacial period. Nature 431:147–151 Árnason E (2004) Mitochondrial cytochrome b DNA variation in the high-fecundity Atlantic cod: trans-Atlantic clines and shallow gene genealogy. Genetics 166:1871–1885 Avise JC (2000) Phylogeography: The history and formation of species. Harvard University Press, Cambridge Avise JC, Walker D (1998) Pleistocene phylogeographic effects on avian populations and the speciation process. Proc R Soc B 265:457–463 Bandelt HJ, Forster P, Röhl A (1999) Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol 16:37–48 Beaumont M, Nichols R (1996) Evaluating loci for use in the genetic analysis of population structure. Proc R Soc B 263:1619–1626 Bernatchez L, Dodson JJ (1991) Phylogeographic structure in mitochondrial DNA of lake whitefish (Coregonus clupeaformis) and its relation to Pleistocene glaciation. Evolution 45:1016–1035 Bigg GR, Cunningham CW, Ottersen G, Pogson GH, Wadley MR, Williamson P (2008) Ice-age survival of Atlantic cod: 109 agreement between palaeoecology models and genetics. Proc R Soc B 275:163–172 Carr SM, Marshall HD (2008) Intraspecific phylogeographic genomics from multiple complete mtDNA genomes in Atlantic cod (Gadus morhua): origins of the ”codmother”, transatlantic vicariance and midglacial population expansion. Genetics 180:381–389 Charrier G, Coombs SH, McQuinn IH, Laroche J (2007) Genetic structure of whiting Merlangius merlangus in the northeast Atlantic and adjacent waters. Mar Ecol Prog Ser 330:201–211 Cohen D, Inada T, Iwamoto T, Scialabba N (1990) Gadiform fishes of the world (Order Gadiformes). An annotated and illustrated catalogue of cods, hakes, grenadiers and other gadiform fishes known to date. FAO Fisheries Synopsis. No. 125, Vol. 10. Rome Ewing B, Hillier L, Wendl MC, Green P (1998) Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res 8:175–185 Excoffier L, Lischer HEL (2010) Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under linux and windows. Mol Ecol Res 10:564–567 Excoffier L, Smouse PE, Quattro JM (1992) Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics 131:479–491 Galtier N, Gouy M, Gautier C (1996) SEAVIEW and PHYLO WIN: two graphic tools for sequence alignment and molecular phylogeny. CABIOS 12:543–548 Gordon D, Abajian C, Green P (1998) Consed: a graphical tool for sequence finishing. Genome Res 8:195–202 Hartl DL, Clark AG (2006) Principles of population genetics, 4th edn. Sinauer Associates, Inc Publishers, Sunderland Hewitt GM (1996) Some genetic consequences of ice ages, and their role, in divergence and speciation. Biol J Linn Soc 58:247–276 Hislop JRG (1984) A comparison of reproductive tactics and strategies of cod, haddock, whiting and Norway pout in the North Sea. In: Potts GW, Wootton RJ (eds) Fish reproduction: Strategies and tactics. Academic, London, pp 311–328 Hislop JRG, MacKenzie K (1976) Population studies of the whiting Merlangius merlangus (L.) of the northern North Sea. J Cons int Explor Mer 37:98–110 Jónsson G (1992) Íslenskir fiskar (in Icelandic). Fjölvaútgáfan, Reykjavík Kimura M (1968) Evolutionary rate at the molecular level. Nature 217:624–626 Li YC, Korol AB, Fahima T, Beiles A, Nevo E (2002) Microsatellites: genomic distribution, putative functions and mutational mechanisms: a review. Mol Ecol 11:2453–2465 Li YC, Korol AB, Fahima T, Nevo E (2004) Microsatellites within genes: structure, function, and evolution. Mol Biol Evol 21:991–1007 Liu M, Lu ZC, Gao TX, Yanagimoto T, Sakurai Y (2010) Remarkably low mtDNA control-region diversity and shallow population structure in Pacific cod Gadus macrocephalus. J Fish Biol 77:1071–1082 107 Paper V 110 Nielsen EE, Hansen MM, Meldrup D (2006) Evidence of microsatellite hitchhiking selection in Atlantic cod (Gadus morhua L.): implications for inferring population structure in nonmodel organisms. Mol Ecol 15:3219–3229 Pálsson S, Källman T, Paulsen J, Árnason E (2009) An assessment of mitochondrial variation in Arctic gadoids. Polar Biol 32:471–479 Petit JR, Jouzel J, Raynaud D, Barkov NI, Barnola JM, Basile I, Bender M, Chappellaz J, Davis M, Delaygue G, Delmotte M, Kotlyakov VM, Legrand M, Lipenkov VY, Lorius C, Pépin L, Ritz C, Saltzman E, Stievenard M (1999) Climate and atmospheric history of the past 420,000 years from the Vostok ice core, Antarctica. Nature 399:429–436 Pilcher MW, Whitfield PJ, Riley JD (1989) Seasonal and regional infestation characteristic of three ectoparasites of whiting, Merlangim merlangus L., in the North Sea. J Fish Biol 35:97–110 Reiss H, Hoarau G, Dickey-Collas M, Wolff WJ (2009) Genetic population structure of marine fish: mismatch between biological and fisheries management units. Fish Fish 10:361–395 Rico C, Ibrahim KM, Rico I, Hewitt GM (1997) Stock composition in North Atlantic populations of whiting using microsatellite markers. J Fish Biol 51:462–475 Rogers A, Harpending H (1992) Population growth makes waves in the distribution of pairwise genetic differences. Mol Biol Evol 9:552–569 Roques S, Fox CJ, Villasana MI, Rico C (2006) The complete mitochondrial genome of the whiting, Merlangius merlangus and the haddock, Melanogrammus aeglefinus: a detailed genomic comparison among closely related species of the Gadidae family. Gene 383:12–23 108 Environ Biol Fish (2014) 97:103–110 Rose GA (2005) On distributional responses of North Atlantic fish to climate change. ICES J Mar Sci 62:1360–1374 Schneider S, Excoffier L (1999) Estimation of past demographic parameters from the distribution of pairwise differences when the mutation rates vary among sites: application to human mitochondrial DNA. Genetics 152:1079–1089 Sigurgíslason H, Árnason E (2003) Extent of mitochondrial DNA sequence variation in Atlantic cod from the Faroe Islands: a resolution of gene genealogy. Heredity 91:557–564 Tamura K, Nei M (1993) Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol Biol Evol 10:512–526 Tamura K, Dudley J, Nei M, Kumar S (2007) MEGA4: Molecular evolutionary genetics analysis (MEGA) software version 4.0. Mol Biol Evol 24:1596–1599 ter Hofstede R, Hiddink JG, Rijnsdorp AD (2010) Regional warming changes fish species richness in the eastern North Atlantic ocean. Mar Ecol Prog Ser 414:1–9 Walsh P, Metzfer D, Higuchi R (1991) Chelex 100 as a medium for simple extraction of DNA for PCR-based typing from forensic material. Biotechniques 10:506–513 Waples RS (1987) A multispecies approach to the analysis of gene flow in marine shore fishes. Evolution 41:385–400 Ward RD, Zemlak TS, Innes BH, Last PR, Hebert PDN (2005) DNA barcoding Australia’s fish species. Phil Trans R Soc B 360:1847–1857 Wares JP, Cunningham CW (2001) Phylogeography and historical ecology of the North Atlantic intertidal. Evolution 55:2455–2469 Wright S (1931) Evolution in mendelian populations. Genetics 16:97–159 Appendix I. Supplementary tables Appendix I. Supplementary tables 109 Appendix I. Supplementary tables Table 1.2. Paper I. Microsatellite loci used in the present study. The amplification was carried out in two multiplexes (MP). Listed are the primer sequences and fluorescence label, concentration of the primers used [Primer], the size range of alleles, the number of observed alleles (k), the expected and observed heterozygosity (Hexp and Hobs , respectively), and reference for primers. Locus Gmo8 MP 1 Gmo19 1 Gmo35 1 Gmo37 1 Tch11 1 Gmo2 2 Gmo3 2 Gmo34 2 Gmo132 2 Tch13 2 Primer seq.(50 –30 ) R: GGGGGAGGCATCTGTCATTCA F: GCAAAACGAGATGCACAGACACC R: GTCTTGCCTGTAAGTCAGCTTG F: CACAGTGAAGTGAACCCACTG R: CCTTATCATGTACGTTGTTAAC F: GGAGGTGCTTTGAAGATG R: CGTGGGATACATGGGTACT F: GGCCAATGTTTCATAACTCT R: TCGAGTTCAGGTGGACAA F: ATCCATTGGTGTTTCAAC R: GTGTGAGATGACTGTGTCG F: CCCTCAGATTCAAATGAAGGA R: GCAGCACGAGAGAGCTATTCCTC F: AGGCACGCAGGTGGACAGGAAC R: GGTTGGACCTCATGGTGAA F: TCCACAGAAGGTCTCCTAA R: CGAAAGGACGAGCCAATAAC F: GGAACCCATTGGATTCAGGC R: AATCCACTGGTGCAGACC F: TTTCCGATGAGGTCATGG 50 -Label NED VIC 6-FAM 6-FAM PET NED 6-FAM 6-FAM VIC PET [Primer] 0.10µM Size range(bp) 119-331 k 54 Hexp 0.931 Hobs 0.928 Reference Miller et al. (2000) 0.15µM 127-235 28 0.922 0.806 Miller et al. (2000) 0.20µM 121-154 11 0.831 0.805 Miller et al. (2000) 2.50µM 233-315 23 0.839 0.728 Miller et al. (2000) 0.30µM 123-244 27 0.935 0.846 O’Reilly et al. (2000) 0.25µM 105-153 23 0.846 0.717 Brooker et al. (1994) 0.20µM 165-213 12 0.159 0.155 Miller et al. (2000) 0.20µM 90-123 10 0.385 0.363 Miller et al. (2000) 0.30µM 100-181 33 0.574 0.588 Brooker et al. (1994) 0.30µM 84-185 47 0.920 0.903 O’Reilly et al. (2000) Table 1.3. Paper I. Statistical power for detecting genetic differentiation FST using 9 microsatellite loci. The power is expressed as the proportion of 1000 simulations that result in a statistically significant difference between two populations at an α-level of 0.05. The Gmo34 locus was excluded from base population in the analysis. True FST 0.0000 0.0003 0.0004 0.0005 0.0006 0.0007 0.0008 0.0009 X2 0.046 0.743 0.883 0.966 0.992 0.998 0.998 1.000 Fisher’s exact test 0.061 0.753 0.882 0.962 0.990 0.996 0.999 1.000 111 Haplotype cytb24.hap.001 cytb24.hap.002 cytb24.hap.003 cytb24.hap.004 cytb24.hap.005 cytb24.hap.006 cytb24.hap.007 cytb24.hap.008 cytb24.hap.009 cytb24.hap.010 cytb24.hap.011 cytb24.hap.012 cytb24.hap.013 cytb24.hap.014 cytb24.hap.015 cytb24.hap.016 cytb24.hap.017 cytb24.hap.018 cytb24.hap.019 cytb24.hap.020 cytb24.hap.021 cytb24.hap.022 cytb24.hap.023 cytb24.hap.024 cytb24.hap.025 cytb24.hap.026 cytb24.hap.027 cytb24.hap.028 cytb24.hap.029 cytb24.hap.030 cytb24.hap.031 cytb24.hap.032 cytb24.hap.033 cytb24.hap.034 cytb24.hap.035 cytb24.hap.036 cytb24.hap.037 cytb24.hap.051 cytb24.hap.038 cytb24.hap.039 cytb24.hap.040 cytb24.hap.041 cytb24.hap.042 cytb24.hap.043 cytb24.hap.044 cytb24.hap.045 cytb24.hap.046 cytb24.hap.047 cytb24.hap.048 cytb24.hap.049 cytb24.hap.050 cytb24.hap.052 cytb24.hap.053 cytb24.hap.054 cytb24.hap.055 cytb24.hap.056 cytb24.hap.057 cytb24.hap.058 cytb24.hap.059 cytb24.hap.060 cytb24.hap.061 cytb24.hap.062 cytb24.hap.063 cytb24.hap.064 3 6 7 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 7 3 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 7 9 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . 3 8 5 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 8 8 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 9 0 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 9 1 C . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 9 2 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 9 8 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 0 9 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 1 0 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 1 8 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . A . T . . . . . . . . . . . . . . . . 4 2 7 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 3 0 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 3 6 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 4 5 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 5 1 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 5 4 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 5 7 C . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 6 0 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 6 3 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 6 6 A . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 7 5 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . 4 8 1 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 8 7 A . . . . . G . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . 4 8 8 C . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 9 0 A . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . 4 9 6 A . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . 5 0 2 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . G . . . . . . . . . . . . 5 0 8 T C . . . . . . . C . . . . . . . . . . C . . . . . . C C . . . . . . . . . . . C . C C C . . . . . C . . . . . . . . . . . . . 5 1 4 A . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 1 7 C . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2 2 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2 3 G . A . . . . . . . . A . . A . . . . A A . . . . . . . . . A . A A A . . . . . . . A . . . . . . . . . A A . A . . . . . . . . 5 2 6 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . N 1122 431 376 296 124 22 18 16 11 10 9 8 8 7 7 6 5 5 4 4 4 4 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 1 1 1 1 1 Appendix I. Supplementary tables 112 Table 1.4. 1A. Paper II. Atlantic cod segregating sites table, part 1 of 4 (part 1: Table 1.4; part 2: Table 1.5; part 3: Table 1.6; part 4: Table 1.7).Segregating sites of the cytochrome b fragment (328 base pairs) defining 128 haplotypes found among 2656 individual Atlantic cod, Gadus morhua, sampled around Iceland. The site numbers (above the horizontal line) refer to the equivalent site, plus 14000, in the complete mtDNA sequence (GenBank accession number: X99772.1). The combined frequency among sampling localities is presented, N. The dots represent identity of a sequence to the most common haplotype cytb24_hap_001. Table 1.5. 1B. Paper II. Atlantic cod segregating sites table, part 2 of 4 (part 1: Table 1.4; part 2: Table 1.5; part 3: Table 1.6; part 4: Table 1.7). Segregating sites of the cytochrome b fragment (328 base pairs) defining 128 haplotypes found among 2656 individual Atlantic cod, Gadus morhua, sampled around Iceland. The site numbers (above the horizontal line) refer to the equivalent site, plus 14000, in the complete mtDNA sequence (GenBank accession number: X99772.1). The combined frequency among sampling localities is presented, N. The dots represent identity of a sequence to the most common haplotype cytb24_hap_001. 5 3 5 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . 5 3 8 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 4 1 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 4 2 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A . . . . . . . . . . . . . . . . . . . . . . 5 4 7 C . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 5 6 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 6 2 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . G . . . G . . . . . . . . . 5 6 5 C . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 6 8 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 1 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 4 A . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 3 G . . . . . . . . . . . . . . . . . A . . . . . A . . . . . . . . . . . . . . . A . . . . . . . . . . . . A . . . . . . . . . . 5 8 6 T . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 7 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 8 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 9 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 2 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 5 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 8 T . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . 6 0 7 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . 6 1 9 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 2 8 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . C . . . . . . . . . . . 6 3 1 G . . . . A . . A . . . . . . . . . . A . A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 3 4 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . 6 3 7 T . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 4 3 A . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 4 6 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C 6 4 9 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . A . 6 5 2 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . 6 5 8 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` 6 8 2 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 8 5 C . T . T . . . . . . T . . T . . . . T T . . . . . . . . . T T T T T . . . . . . . . . . . . . . . . . T T T T . . . T . . . . 6 8 6 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 8 8 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . 6 9 1 G . . T . T T . . . . A . A . . . . . . . T . . . . . . . . . . . . . T T . . . . . . . . . . . . . . . . . . . T T T C . . . . N 1122 431 376 296 124 22 18 16 11 10 9 8 8 7 7 6 5 5 4 4 4 4 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 1 1 1 1 1 Appendix I. Supplementary tables 113 Haplotype cytb24.hap.001 cytb24.hap.002 cytb24.hap.003 cytb24.hap.004 cytb24.hap.005 cytb24.hap.006 cytb24.hap.007 cytb24.hap.008 cytb24.hap.009 cytb24.hap.010 cytb24.hap.011 cytb24.hap.012 cytb24.hap.013 cytb24.hap.014 cytb24.hap.015 cytb24.hap.016 cytb24.hap.017 cytb24.hap.018 cytb24.hap.019 cytb24.hap.020 cytb24.hap.021 cytb24.hap.022 cytb24.hap.023 cytb24.hap.024 cytb24.hap.025 cytb24.hap.026 cytb24.hap.027 cytb24.hap.028 cytb24.hap.029 cytb24.hap.030 cytb24.hap.031 cytb24.hap.032 cytb24.hap.033 cytb24.hap.034 cytb24.hap.035 cytb24.hap.036 cytb24.hap.037 cytb24.hap.051 cytb24.hap.038 cytb24.hap.039 cytb24.hap.040 cytb24.hap.041 cytb24.hap.042 cytb24.hap.043 cytb24.hap.044 cytb24.hap.045 cytb24.hap.046 cytb24.hap.047 cytb24.hap.048 cytb24.hap.049 cytb24.hap.050 cytb24.hap.052 cytb24.hap.053 cytb24.hap.054 cytb24.hap.055 cytb24.hap.056 cytb24.hap.057 cytb24.hap.058 cytb24.hap.059 cytb24.hap.060 cytb24.hap.061 cytb24.hap.062 cytb24.hap.063 cytb24.hap.064 Haplotype cytb24.hap.001 cytb24.hap.065 cytb24.hap.066 cytb24.hap.067 cytb24.hap.068 cytb24.hap.069 cytb24.hap.070 cytb24.hap.071 cytb24.hap.072 cytb24.hap.073 cytb24.hap.074 cytb24.hap.075 cytb24.hap.076 cytb24.hap.077 cytb24.hap.079 cytb24.hap.080 cytb24.hap.081 cytb24.hap.082 cytb24.hap.083 cytb24.hap.084 cytb24.hap.085 cytb24.hap.086 cytb24.hap.087 cytb24.hap.088 cytb24.hap.089 cytb24.hap.090 cytb24.hap.091 cytb24.hap.092 cytb24.hap.093 cytb24.hap.094 cytb24.hap.095 cytb24.hap.096 cytb24.hap.097 cytb24.hap.098 cytb24.hap.099 cytb24.hap.100 cytb24.hap.101 cytb24.hap.102 cytb24.hap.103 cytb24.hap.104 cytb24.hap.105 cytb24.hap.106 cytb24.hap.107 cytb24.hap.108 cytb24.hap.109 cytb24.hap.110 cytb24.hap.111 cytb24.hap.113 cytb24.hap.114 cytb24.hap.115 cytb24.hap.116 cytb24.hap.117 cytb24.hap.118 cytb24.hap.119 cytb24.hap.120 cytb24.hap.121 cytb24.hap.122 cytb24.hap.124 cytb24.hap.125 cytb24.hap.126 cytb24.hap.127 cytb24.hap.128 cytb24.hap.129 cytb24.hap.130 cytb24.hap.131 3 6 7 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . . 3 7 3 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . 3 7 9 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 8 5 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . 3 8 8 T . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 9 0 G . . . . . . . . . . . A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A . . . . . . . . . . . . . 3 9 1 C . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 9 2 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A . . . . . . . . . . . . . . . . . . . . 3 9 8 G . . . . . . . . . . . . . . . . . A . . . . . . . . . . . . . . . . . . . . . . . . A . . . . . . . . . . . . . . . . . . . . . 4 0 9 C . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 1 0 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A . . . . . . . . . . . . . . . A . . . . . . . . . . . . . . 4 1 8 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . G . . . 4 2 7 A . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 3 0 A . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 3 6 A . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 4 5 C . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 5 1 T . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 5 4 A . . . . . . . . . . . . . . . . . . . . . . G T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 5 7 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 6 0 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . 4 6 3 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . 4 6 6 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 7 5 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 8 1 A . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . 4 8 7 A . . . . . . . . . . . . . . . G . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . G . . . . . 4 8 8 C . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . 4 9 0 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 9 6 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 0 2 A . . . G . . . . . . . . . . . . . . . . . . . . . . G G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 0 8 T C . C . . C . C . . C C C C C C C . . . . . . . . . . . C C . C . C . C C C . . . . . . . . . . . . . . . . . . C . . . . . . . 5 1 4 A . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 1 7 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2 2 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2 3 G . . . . . . . . . A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A . A A A A A A A C . . A A A A . . A . . . . . . 5 2 6 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . N 1122 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Appendix I. Supplementary tables 114 Table 1.6. 2A. Paper II. Atlantic cod segregating sites table, part 3 of 4 (part 1: Table 1.4; part 2: Table 1.5; part 3: Table 1.6; part 4: Table 1.7). Segregating sites of the cytochrome b fragment (328 base pairs) defining 128 haplotypes found among 2656 individual Atlantic cod, Gadus morhua, sampled around Iceland. The site numbers (above the horizontal line) refer to the equivalent site, plus 14000, in the complete mtDNA sequence (GenBank accession number: X99772.1). The combined frequency among sampling localities is presented, N. The dots represent identity of a sequence to the most common haplotype cytb24_hap_001. Table 1.7. Paper II. Atlantic cod segregating sites table, part 4 of 4 (part 1: Table 1.4; part 2: Table 1.5; part 3: Table 1.6; part 4: Table 1.7). Segregating sites of the cytochrome b fragment (328 base pairs) defining 128 haplotypes found among 2656 individual Atlantic cod, Gadus morhua, sampled around Iceland. The site numbers (above the horizontalline) refer to the equivalent site, plus 14000, in the complete mtDNA sequence (GenBank accession number: X99772.1). The combined frequency among sampling localities is presented, N. The dots represent identity of a sequence to the most common haplotype cytb24_hap_001. 5 3 5 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . 5 3 8 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 4 1 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 4 2 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 4 7 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 5 6 T . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 6 2 A . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . 5 6 5 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 6 8 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 1 C . . . . . . . A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 4 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 3 G . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A . . . . . . . . . . . . . . . . . . . . . . . 5 8 6 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 7 A . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 8 T . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 9 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . G G . . . . . . . G . 5 9 2 T . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 5 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A C . . . . . . . . . . . . . . . . . . . . . . . . 5 9 8 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 7 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 1 9 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . 6 2 8 T . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 3 1 G A A A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A . . . . . 6 3 4 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T 6 3 7 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 4 3 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 4 6 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . 6 4 9 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 5 2 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 5 8 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` a` 6 8 2 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . 6 8 5 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T T T T T T T T T T T T T T T T T . . T . . . . . . 6 8 6 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . 6 8 8 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T T . . . . . . 6 9 1 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T T T T T T N 1122 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Appendix I. Supplementary tables 115 Haplotype cytb24.hap.001 cytb24.hap.065 cytb24.hap.066 cytb24.hap.067 cytb24.hap.068 cytb24.hap.069 cytb24.hap.070 cytb24.hap.071 cytb24.hap.072 cytb24.hap.073 cytb24.hap.074 cytb24.hap.075 cytb24.hap.076 cytb24.hap.077 cytb24.hap.079 cytb24.hap.080 cytb24.hap.081 cytb24.hap.082 cytb24.hap.083 cytb24.hap.084 cytb24.hap.085 cytb24.hap.086 cytb24.hap.087 cytb24.hap.088 cytb24.hap.089 cytb24.hap.090 cytb24.hap.091 cytb24.hap.092 cytb24.hap.093 cytb24.hap.094 cytb24.hap.095 cytb24.hap.096 cytb24.hap.097 cytb24.hap.098 cytb24.hap.099 cytb24.hap.100 cytb24.hap.101 cytb24.hap.102 cytb24.hap.103 cytb24.hap.104 cytb24.hap.105 cytb24.hap.106 cytb24.hap.107 cytb24.hap.108 cytb24.hap.109 cytb24.hap.110 cytb24.hap.111 cytb24.hap.113 cytb24.hap.114 cytb24.hap.115 cytb24.hap.116 cytb24.hap.117 cytb24.hap.118 cytb24.hap.119 cytb24.hap.120 cytb24.hap.121 cytb24.hap.122 cytb24.hap.124 cytb24.hap.125 cytb24.hap.126 cytb24.hap.127 cytb24.hap.128 cytb24.hap.129 cytb24.hap.130 cytb24.hap.131 Appendix I. Supplementary tables Table 1.8. Paper II. Amino acid substitutions observed in the analysis of Atlantic cod mtDNA data of various haplotypes and their frequencies (N). Haplotype cytb24.hap.041 cytb24.hap.067 cytb24.hap.070 cytb24.hap.076 cytb24.hap.083 cytb24.hap.094 cytb24.hap.099 cytb24.hap.100 cytb24.hap.108 cytb24.hap.109 cytb24.hap.115 cytb24.hap.116 cytb24.hap.117 cytb24.hap.122 116 Site 14542 14587 14588 14390 14398 14522 14410 14589 14398 14392 14523 14410 14390 14686 DNA sub. G A T G G A G A G G G G G A ↔ ↔ ↔ ↔ ↔ ↔ ↔ ↔ ↔ ↔ ↔ ↔ ↔ ↔ AA Sub. A G C A A G A T A A C A A G V M M S V E A M V A E A S I ↔ ↔ ↔ ↔ ↔ ↔ ↔ ↔ ↔ ↔ ↔ ↔ ↔ ↔ N I V T N I G T I I T D T N V 2 1 1 1 1 1 1 1 1 1 1 1 1 1 Table 1.9. Paper IV. Haddock segregating sites table, part 1 of 4 (part 1: Table 1.9; part 2: Table 1.10; part 3: Table 1.11; part 4: Table 1.12). Segregating sites of the cytochrome oxidase subunit I fragment (599 base pairs) defining 102 haplotypes found among 884 individual haddock, Melanogrammus aeglefinus, sampled in the NE-Atlantic ocean. The site numbers (above the horizontal line) refer to the equivalent site in the complete mtDNA sequence (Genebank accession number: AM489717.1). The combined frequency among sampling localities is presented, N. The dots represent identity of a sequence to the most common haplotype mel.aeg.CO1-001. 5 5 4 8 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . 5 5 5 3 C . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 5 5 6 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 5 6 5 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 5 6 8 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . 5 5 7 4 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . 5 5 9 3 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 5 9 5 T . . C . . . . . . . . . . . . . . . . . . . . . . . . . . C C . . . . . . . . . . . . . . . . . . 5 6 0 7 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 6 1 3 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 6 1 9 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . 5 6 2 5 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . 5 6 2 8 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . 5 6 3 4 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . 5 6 4 3 T . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 6 5 8 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . 5 6 7 3 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . 5 6 8 8 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . 5 6 9 2 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . 5 6 9 7 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 0 9 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 2 4 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . 5 7 4 5 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G G . . . . . 5 7 5 1 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . 5 7 5 4 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . 5 7 6 0 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 6 3 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 7 5 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 9 0 A C . . . . C C . . . . . C C C C C C . C C C C C C . . . . . . C . C C C . . C . . . . . . C C C C 5 7 9 3 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 9 6 A . . . . . G . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 9 7 G . . . . . . . . . . . . . . . . . . . . . . . . A . . . . . . . . . . . . . . . . . . . . . . . . 5 8 0 2 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 0 5 T . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . 5 8 0 8 A . . . . . . . . G . . . . . . . . . . G G . . . . . . . . . . . . G . . . . . . . . . . . . . . . 5 8 1 1 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 1 4 C T T T . . T T . T . T . T T T T T T T T T T T T T T . . . . T T T T T T . . T T . . T . . T T T T 5 8 1 7 A . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . N 306 182 150 53 29 10 8 7 6 5 5 4 4 4 3 3 3 3 3 3 2 2 2 2 2 2 2 2 2 2 2 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Appendix I. Supplementary tables 117 Haplotype mel.aeg.CO1-001 mel.aeg.CO1-002 mel.aeg.CO1-003 mel.aeg.CO1-004 mel.aeg.CO1-005 mel.aeg.CO1-006 mel.aeg.CO1-007 mel.aeg.CO1-008 mel.aeg.CO1-009 mel.aeg.CO1-010 mel.aeg.CO1-011 mel.aeg.CO1-012 mel.aeg.CO1-013 mel.aeg.CO1-014 mel.aeg.CO1-015 mel.aeg.CO1-016 mel.aeg.CO1-017 mel.aeg.CO1-018 mel.aeg.CO1-019 mel.aeg.CO1-020 mel.aeg.CO1-021 mel.aeg.CO1-022 mel.aeg.CO1-023 mel.aeg.CO1-024 mel.aeg.CO1-025 mel.aeg.CO1-026 mel.aeg.CO1-027 mel.aeg.CO1-028 mel.aeg.CO1-029 mel.aeg.CO1-030 mel.aeg.CO1-031 mel.aeg.CO1-032 mel.aeg.CO1-033 mel.aeg.CO1-034 mel.aeg.CO1-035 mel.aeg.CO1-036 mel.aeg.CO1-037 mel.aeg.CO1-038 mel.aeg.CO1-039 mel.aeg.CO1-040 mel.aeg.CO1-041 mel.aeg.CO1-042 mel.aeg.CO1-043 mel.aeg.CO1-044 mel.aeg.CO1-045 mel.aeg.CO1-046 mel.aeg.CO1-047 mel.aeg.CO1-048 mel.aeg.CO1-049 mel.aeg.CO1-050 Haplotype mel.aeg.CO1-001 mel.aeg.CO1-002 mel.aeg.CO1-003 mel.aeg.CO1-004 mel.aeg.CO1-005 mel.aeg.CO1-006 mel.aeg.CO1-007 mel.aeg.CO1-008 mel.aeg.CO1-009 mel.aeg.CO1-010 mel.aeg.CO1-011 mel.aeg.CO1-012 mel.aeg.CO1-013 mel.aeg.CO1-014 mel.aeg.CO1-015 mel.aeg.CO1-016 mel.aeg.CO1-017 mel.aeg.CO1-018 mel.aeg.CO1-019 mel.aeg.CO1-020 mel.aeg.CO1-021 mel.aeg.CO1-022 mel.aeg.CO1-023 mel.aeg.CO1-024 mel.aeg.CO1-025 mel.aeg.CO1-026 mel.aeg.CO1-027 mel.aeg.CO1-028 mel.aeg.CO1-029 mel.aeg.CO1-030 mel.aeg.CO1-031 mel.aeg.CO1-032 mel.aeg.CO1-033 mel.aeg.CO1-034 mel.aeg.CO1-035 mel.aeg.CO1-036 mel.aeg.CO1-037 mel.aeg.CO1-038 mel.aeg.CO1-039 mel.aeg.CO1-040 mel.aeg.CO1-041 mel.aeg.CO1-042 mel.aeg.CO1-043 mel.aeg.CO1-044 mel.aeg.CO1-045 mel.aeg.CO1-046 mel.aeg.CO1-047 mel.aeg.CO1-048 mel.aeg.CO1-049 mel.aeg.CO1-050 5 8 2 0 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 2 6 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 3 2 T . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 3 8 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 3 9 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 4 7 A . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 5 3 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 7 7 C . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . T . . . . . 5 8 9 9 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 1 0 T . . . . . . . . . . . . . C . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 2 5 A . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 3 1 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 4 0 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 7 3 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 7 9 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 8 5 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . 6 0 1 5 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 2 1 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 2 4 G . . . . . . A . . . . . . . . . . . . . A . . . . . . . . . . . . . A . . . . . . . . . . . . . . 6 0 3 0 G . . . . . . . . . A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 3 6 A . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 3 9 T . . . . . . . . . . . A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 4 2 T . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 6 0 T . . . . . . . . . . . . . . . C . . . . . . . . . C . . C . . . . . . . . . . . . C . . . . . . G 6 0 6 2 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 6 3 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 6 6 A . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . 6 0 7 5 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 8 0 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 8 1 T . . . . . . . C . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 8 4 T . . . . . . A . . . . . . . . . . . . . A . . . . . . . . . . . . . A . . . . . . . . . . . . . . 6 0 8 7 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 9 0 C . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . T . 6 1 0 8 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . 6 1 1 1 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 1 2 0 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 1 2 3 T . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . N 306 182 150 53 29 10 8 7 6 5 5 4 4 4 3 3 3 3 3 3 2 2 2 2 2 2 2 2 2 2 2 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Appendix I. Supplementary tables 118 Table 1.10. Paper IV. Haddock segregating sites table, part 2 of 4 (part 1: Table 1.9; part 2: Table 1.10; part 3: Table 1.11; part 4: Table 1.12). Segregating sites table for the cytochrome oxidase subunit I fragment (599 base pairs) defining 102 haplotypes found among 884 individual haddock, Melanogrammus aeglefinus, sampled in the NE-Atlantic ocean. The site numbers (above the horizontal line) refer to the equivalent site in the complete mtDNA sequence (Genebank accession number: AM489717.1). The combined frequency among sampling localities is presented, N. The dots represent identity of a sequence to the most common haplotype mel.aeg.CO1-001. Table 1.11. Paper IV. Haddock segregating sites table, part 3 of 4 (part 1: Table 1.9; part 2: Table 1.10; part 3: Table 1.11; part 4: Table 1.12). Segregating sites of the cytochrome oxidase subunit I fragment (599 base pairs) defining 102 haplotypes found among 884 individual haddock, Melanogrammus aeglefinus, sampled in the NE-Atlantic ocean. The site numbers (above the horizontal line) refer to the equivalent site in the complete mtDNA sequence (Genebank accession number: AM489717.1). The combined frequency among sampling localities is presented, N. The dots represent identity of a sequence to the most common haplotype mel.aeg.CO1-001. 5 5 4 8 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 5 5 3 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 5 5 6 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C 5 5 6 5 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A . 5 5 6 8 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 5 7 4 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 5 9 3 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A . . 5 5 9 5 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C C . . . 5 6 0 7 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . 5 6 1 3 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A . . . . . . 5 6 1 9 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 6 2 5 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 6 2 8 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 6 3 4 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 6 4 3 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . C 5 6 5 8 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . 5 6 7 3 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 6 8 8 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 6 9 2 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 6 9 7 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . 5 7 0 9 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . 5 7 2 4 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 4 5 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 5 1 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 5 4 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 6 0 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . 5 7 6 3 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C C . . . . . . . . . . 5 7 7 5 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . 5 7 9 0 A C C C C C C C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . C C C . . . . C 5 7 9 3 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . 5 7 9 6 A . . . . . . . G G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 9 7 G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 0 2 A . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 0 5 T . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C C . . . . . . . . . . . . . . 5 8 0 8 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 1 1 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . C . . . . . . . . . . . 5 8 1 4 C T T T T T . T . . . T T T T T T T T T T T . . . . . . . . . . . . . . . T . T . T . . T T T T T T T . T 5 8 1 7 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . N 306 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Appendix I. Supplementary tables 119 Haplotype mel.aeg.CO1-001 mel.aeg.CO1-051 mel.aeg.CO1-052 mel.aeg.CO1-053 mel.aeg.CO1-054 mel.aeg.CO1-055 mel.aeg.CO1-056 mel.aeg.CO1-057 mel.aeg.CO1-058 mel.aeg.CO1-059 mel.aeg.CO1-060 mel.aeg.CO1-061 mel.aeg.CO1-062 mel.aeg.CO1-063 mel.aeg.CO1-064 mel.aeg.CO1-065 mel.aeg.CO1-066 mel.aeg.CO1-067 mel.aeg.CO1-068 mel.aeg.CO1-069 mel.aeg.CO1-070 mel.aeg.CO1-071 mel.aeg.CO1-072 mel.aeg.CO1-073 mel.aeg.CO1-074 mel.aeg.CO1-075 mel.aeg.CO1-076 mel.aeg.CO1-077 mel.aeg.CO1-078 mel.aeg.CO1-079 mel.aeg.CO1-080 mel.aeg.CO1-081 mel.aeg.CO1-082 mel.aeg.CO1-083 mel.aeg.CO1-084 mel.aeg.CO1-085 mel.aeg.CO1-086 mel.aeg.CO1-087 mel.aeg.CO1-088 mel.aeg.CO1-089 mel.aeg.CO1-090 mel.aeg.CO1-091 mel.aeg.CO1-092 mel.aeg.CO1-093 mel.aeg.CO1-094 mel.aeg.CO1-095 mel.aeg.CO1-096 mel.aeg.CO1-097 mel.aeg.CO1-098 mel.aeg.CO1-099 mel.aeg.CO1-100 mel.aeg.CO1-101 mel.aeg.CO1-102 Haplotype mel.aeg.CO1-001 mel.aeg.CO1-051 mel.aeg.CO1-052 mel.aeg.CO1-053 mel.aeg.CO1-054 mel.aeg.CO1-055 mel.aeg.CO1-056 mel.aeg.CO1-057 mel.aeg.CO1-058 mel.aeg.CO1-059 mel.aeg.CO1-060 mel.aeg.CO1-061 mel.aeg.CO1-062 mel.aeg.CO1-063 mel.aeg.CO1-064 mel.aeg.CO1-065 mel.aeg.CO1-066 mel.aeg.CO1-067 mel.aeg.CO1-068 mel.aeg.CO1-069 mel.aeg.CO1-070 mel.aeg.CO1-071 mel.aeg.CO1-072 mel.aeg.CO1-073 mel.aeg.CO1-074 mel.aeg.CO1-075 mel.aeg.CO1-076 mel.aeg.CO1-077 mel.aeg.CO1-078 mel.aeg.CO1-079 mel.aeg.CO1-080 mel.aeg.CO1-081 mel.aeg.CO1-082 mel.aeg.CO1-083 mel.aeg.CO1-084 mel.aeg.CO1-085 mel.aeg.CO1-086 mel.aeg.CO1-087 mel.aeg.CO1-088 mel.aeg.CO1-089 mel.aeg.CO1-090 mel.aeg.CO1-091 mel.aeg.CO1-092 mel.aeg.CO1-093 mel.aeg.CO1-094 mel.aeg.CO1-095 mel.aeg.CO1-096 mel.aeg.CO1-097 mel.aeg.CO1-098 mel.aeg.CO1-099 mel.aeg.CO1-100 mel.aeg.CO1-101 mel.aeg.CO1-102 5 8 2 0 T . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 2 6 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . 5 8 3 2 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 3 8 T . . . . . . . . . . . . . . . . . . . G C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 3 9 T . . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 4 7 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 5 3 C . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 8 7 7 C . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . T . . . . . . . . . . . . 5 8 9 9 C . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 1 0 T . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 2 5 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 3 1 T . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 4 0 T . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 7 3 T . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 7 9 A . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 9 8 5 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 1 5 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . 6 0 2 1 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . 6 0 2 4 G . . . . . . . . . . . . . . . . . A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 3 0 G . A . A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 3 6 A . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 3 9 T . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 4 2 T C . . . . . . . . . . . . . . C . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 6 0 T . . . . . . . . . . . . . . G . . . C . . . . . . . . . . . G G . . . . . . . . . . . . . . . . . . . . 6 0 6 2 T . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 6 3 C . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 6 6 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 7 5 A . . . . . . . . . . G . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . 6 0 8 0 A . . . . . . . . . . . . . . . . . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . 6 0 8 1 T . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 8 4 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 0 8 7 C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . . 6 0 9 0 C . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . T . . . . . . . . . . . . . . . . . . . . . 6 1 0 8 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 1 1 1 T . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 1 2 0 A . . . . . . . . . . . . G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 1 2 3 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . N 306 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Appendix I. Supplementary tables 120 Table 1.12. Paper IV. Haddock segregating sites table, part 4 of 4 (part 1: Table 1.9; part 2: Table 1.10; part 3: Table 1.11; part 4: Table 1.12). Segregating sites of the cytochrome oxidase subunit I fragment (599 base pairs) defining 102 haplotypes found among 884 individual haddock, Melanogrammus aeglefinus, sampled in the NE-Atlantic ocean. The site numbers (above the horizontal line) refer to the equivalent site in the complete mtDNA sequence (Genebank accession number: AM489717.1). The combined frequency among sampling localities is presented, N. The dots represent identity of a sequence to the most common haplotype mel.aeg.CO1-001. Appendix II. Supplementary figures Appendix II. Supplementary figures 121 150 250 300 (h) Gmo132 Allele size (bp) 150 1200 800 90 100 110 120 Allele size (bp) (j) Tch13 Frequency 150 Frequency 800 Frequency 260 Allele size (bp) 300 100 120 140 160 Allele size (bp) 180 0 50 220 0 0 0 50 140 50 (i) Tch11 400 250 150 Frequency 300 200 Frequency 100 0 130 30 Allele size (bp) 250 (g) Gmo37 10 120 160 200 Allele size (bp) 240 80 100 140 180 Allele size (bp) Figure 1.1. Paper I. Bargraph of allele sizes of the microsatellite loci used. (a) Gmo2, (b) Gmo3, (c) Gmo8, (d) Gmo19, (e) Gmo34, (f) Gmo35, (g) Gmo37, (h) Gmo132, (i) Tch11, (j) Tch13. 123 Appendix II. Supplementary figures 120 Frequency 0 Allele size (bp) 400 (f) Gmo35 200 0 0 210 Allele size (bp) 150 190 100 Frequency 150 200 Frequency 0 170 100 150 Allele size (bp) (e) Gmo34 50 130 50 100 1000 Frequency 0 500 300 Frequency 100 0 110 (d) Gmo19 400 (c) Gmo8 50 (b) Gmo3 1500 (a) Gmo2 ● ● ● ● ● ● ● ● 1.0 (e) Gmo34 ● ● ● ● ● ● ● ● 0.8 ● 0.8 ● 0.8 0.8 ● ● ● ● ● 0.8 ● ● (d) Gmo19 1.0 1.0 (c) Gmo8 1.0 1.0 (b) Gmo3 ● ● ● 0.6 0.6 0.6 0.6 0.4 0.4 0.4 0.2 0.4 0.4 ● 0.2 Heterozygosity 0.6 ● ● ● ● ● 3 4 5 1 2 3 4 5 0.0 1 (h) Gmo132 1.0 (g) Gmo37 1.0 (f) Gmo35 2 0.0 0.0 1 2 3 4 ● ● ● ● ● ● ● ● ● ● 3 4 5 ● ● ● 4 5 ● ● ● ● ● 1 2 ● ● ● 0.8 ● ● ● 0.8 0.8 0.8 ● ● ● ● ● 2 (j) Tch13 ● ● ● 1 (i) Tch11 ● ● 5 1.0 5 ● 0.8 4 0.2 ● ● 1.0 3 ● ● ● ● 1.0 2 ● ● ● 0.0 1 0.2 ● ● ● ● 0.0 0.2 ● ● ● ● ● ● ● ● ● ● ● ● ● 0.4 0.4 ● 0.4 0.4 0.6 0.6 0.6 0.6 0.6 ● ● 0.4 Heterozygosity ● 1 2 3 4 Depth group 5 1 2 3 4 Depth group 5 1 2 3 4 Depth group 5 0.2 0.0 0.2 0.0 0.2 0.0 0.2 0.0 0.0 0.2 ● 1 2 3 4 Depth group 5 3 Depth group Figure 1.2. Paper I. The observed heterozygosity HObs (open circles and dashed trend line) and the expected heterozygosity HExp (filled circles dotted trend line) for the the different microsatellite loci for Atlantic cod sampled at the south coast of Iceland grouped at different depth intervals (1:0-50m, 2:50-100m, 3:100-150m, 4:150-200m, 5:>200m). (a)Gmo2, (b)Gmo3, (c)Gmo8, (d)Gmo19, (e)Gmo34, (f)Gmo35, (g)Gmo37, (h)Gmo132, (i)Tch11, (j)Tch13. Appendix II. Supplementary figures 124 (a) Gmo2 Appendix III Appendix III 125 Appendix III Molecular methods In the present project I used mainly analysis of mtDNA sequence variation. The molecular methods used are basically the same in all studies. Following is a detailed description of the molecular methods used for the study on saithe. This is for my own reference and might be of help to others. DNA isolation. DNA was isolated using a chelex method (Walsh et al., 1991). A stock solution was prepared consisting of 50mg/mL Chelex 100, 0.2% SDS, 10mM Tris (pH 8) and 0.5mM EDTA. Just before DNA isolation Proteinase K was added to the mix at final concentration of 200 µg/mL. A small tissue fragment (less than 1 mm) was cut of the preserved gill tissue, drained on a paper towel and placed in a tube with the 250 µl reaction mix. The mix was then placed at Thermomixer (Eppendorf, Thermomixer Comfort) at 700 rpm for 3–5 hours (or until no tissue fragments were visible). After this the mix was placed in a 95◦ C for 5 minutes for denaturation of the Proteinase K followed by spinning at 3000 rpm for 5 minutes. The supernatant was carefully removed, leaving the chelex on the bottom of the tube. A 1:19 dilution of the isolated DNA was prepared and used for PCR (DNA concentration ca. 1 ng/µl). PCR. A 460 bp fragment was amplified from the 50 region of the cytochrome c oxidase subunit I (COI) mitochondrial gene using pvL6082-50 CCTGCTGGAGGAGGTGATCC30 and pvH6580-50 CCTGCTGGAGGAGGTGATCC30 . The fragment is corresponding to sites number 6082-6542 in the saithe complete mtDNA genome sequence (GenBank accession number: FR751399, Coucheron et al., 2011). Primer numbers refer to the first fragment sites upstream and downstream of the L and H primers respectively. A 19µl PCR reaction mix contained 0.17mM dNTP), 0.11mg/µl BSA, 0.36pM of each primer (L6082, H6580), and 0.05U Taq Polymerase buffered with 10× ThermoPol buffer (New England BioLabs - NEB) together with DNA template (median concentration 0.26ng/µl in reaction mix). The reaction mix was prepared on ice and when ready it was immediately placed in a thermocycler (DNA Engine Tetrad 2, Bio-Rad Laboratories). The DNA was denatured at 94◦ C for 5 minutes followed by 35 cycles of denaturing for 1 minute, annealing at 56◦ C for 30 seconds and extension at 72◦ C for 1 minute. This was followed by 7 minutes extension at 72◦ C. After the PCR 5µl of the product was run for 10 minutes at 90 Volts on an 1.5% agarose gel containing ethidium bromide. The gel was inspected under UV light for amplified DNA. Successfully amplified DNA was then taken for direct DNA sequencing. Purification of amplified DNA. A sample of 5µl amplified DNA was purified using 2U Exonuclease I (NEB) for the removal of single stranded DNA and 1U Antarctic phosphatase (NEB) for the removal of 50 phosphoryl groups, dNTPs and pyrophosphate in the PCR reaction mix (preventing fragments from self-ligating). The reaction was buffered with Antarctic phosphatase buffer (NEB). The reaction mix was incubated at 38◦ C for 35 minutes followed by 80◦ C denaturation step for 20 minutes. The sample was then cooled down to room temperature. Sequencing reaction. Sequencing was carried out using 0.5 µl of the ABI BigDye Terminator v3.1, 2.75 µl of the ABI BigDye Terminator v3.1 kit 5×buffer, using 0.1 pM of the pvH6580 primer and 5µl of the purified DNA in a total reaction volume of 15µl. The reaction mix was prepared on ice and when ready it was immediately placed in a thermocycler at an initial 96◦ C for 10 seconds followed by 25 cycles of denaturing 127 Appendix III at 96◦ C for 10 seconds, annealing at 50◦ C for 5 seconds and extension at 60◦ C for 2 minutes. DNA precipitation. The sequencing product was precipitated with ethanol using glycogen as carrier in 96 well plates. A stock solution of 0.3 M NaOAc and 0.1µg/ml glycogen was prepared. 50µl stock solution was added to the 10µl sequencing product followed by 125µl cold (stored at −20◦ C) 96% ethanol. The mix was spun at 4000 rpm for 30 minutes at 2◦ C, the precipitation mix poured off and the sample tray spun inverted on top of a paper towel at 300 rpm at 2◦ C. For rinsing the product 250µl of cold 70% ethanol was carefully added to each sample and spun at 4000 rpm for 5 minutes for rinsing of the product. After this the ethanol was poured off and the tray spun inverted to remove the rest of the ethanol. The 70% ethanol rinse was done twice. The samples were left in a dark and dry place for at least 15 minutes to allow for evaporation of any remaining ethanol. Each sample was then dissolved in 10µl HiDi formamide and mixed by vortexing for 1 minutes. A quick spin down was made before the samples were placed in an ABI-3100 automatic sequencer for analysis. Reference: Walsh, P., Metzfer, D., and Higuchi, R. (1991). Chelex 100 as a medium for simple extraction of DNA for PCR-based typing from forensic material.BioTechniques, 10(4):506–513. 128
© Copyright 2026 Paperzz