Original Article Rare Variant APOC3 R19X Is Associated With Cardio-Protective Profiles in a Diverse Population-Based Survey as Part of the Epidemiologic Architecture for Genes Linked to Environment Study Dana C. Crawford, PhD; Logan Dumitrescu, PhD, MS; Robert Goodloe, MS; Kristin Brown-Gentry, MS; Jonathan Boston, BS; Bob McClellan Jr, BS; Cara Sutcliffe, MS; Rachel Wiseman, BS; Paxton Baker, MS; Margaret A. Pericak-Vance, PhD; William K. Scott, PhD; Melissa Allen, BS; Ping Mayo, BS; Nathalie Schnetz-Boutaud, PhD; Holli H. Dilks, PhD; Jonathan L. Haines, PhD; Toni I. Pollin, PhD Downloaded from http://circgenetics.ahajournals.org/ by guest on June 17, 2017 Background—A founder mutation was recently discovered and described as conferring favorable lipid profiles and reduced subclinical atherosclerotic disease in a Pennsylvania Amish population. Preliminary data have suggested that this null mutation APOC3 R19X (rs76353203) is rare in the general population. Methods and Results—To better describe the frequency and lipid profile in the general population, we as part of the Population Architecture using Genomics and Epidemiology I Study and the Epidemiological Architecture for Genes Linked to Environment Study genotyped rs76353203 in 1113 Amish participants from Ohio and Indiana and 19 613 participants from the National Health and Nutrition Examination Surveys (NHANES III, 1999 to 2002, and 2007 to 2008). We found no carriers among the Ohio and Indiana Amish. Of the 19 613 NHANES participants, we identified 31 participants carrying the 19X allele, for an overall allele frequency of 0.08%. Among fasting adults, the 19X allele was associated with lower triglycerides (n=7603; β=−71.20; P=0.007) and higher high-density lipoprotein cholesterol (n=8891; β=15.65; P=0.0002) and, although not significant, lower low-density lipoprotein cholesterol (n=6502; β= -4.85; P=0.68) after adjustment for age, sex, and race/ethnicity. On average, 19X allele participants had approximately half the triglyceride levels (geometric means, 51.3 to 69.7 versus 134.6 to 141.3 mg/dL), >20% higher high-density lipoprotein cholesterol levels (geometric means, 56.8 to 74.4 versus 50.38 to 53.36 mg/dL), and lower low-density lipoprotein cholesterol levels (geometric means, 104.5 to 128.6 versus 116.1 to 125.7 mg/dL) compared with noncarrier participants. Conclusions—These data demonstrate that APOC3 19X exists in the general US population in multiple racial/ethnic groups and is associated with cardio-protective lipid profiles. (Circ Cardiovasc Genet. 2014;7:848-853.) Key Words: genetics ◼ genetic association studies ◼ high-density lipoprotein cholesterol ◼ molecular epidemiology ◼ triglycerides B oth common and rare genetic variation is associated with lipid trait distributions. Candidate gene and genome-wide association studies in populations of mostly European descent have identified >150 common genetic variants associated with high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), triglyceride, and total cholesterol levels.1,2 Early linkage and family-based studies have identified rare mutations linked to extreme lipid trait profiles associated with dyslipidemias.3 More recent population-based sequencing studies are bridging the gap between common genetic variation and disease-causing mutations with the discovery and catalog of additional rare and less common variation (frequency <1% in the general population) impacting lipid trait profiles in humans.4–10 Clinical Perspective on p 853 One such rare variant discovered in a Pennsylvania Old Order Amish population sample11 and recently described in the National Heart, Lung, and Blood Institute Exome Sequencing Project12,13 is APOC3 R19X (rs76353203). This Received October 1, 2013; accepted September 16, 2014. From the Institute for Computational Biology (D.C.C., P.M., J.L.H.), Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH (D.C.C., J.L.H.); Center for Human Genetics Research (L.D., R.G., K.B.-G., J.B., B.M., M.A., N.S.-B.), Department of Molecular Physiology and Biophysics (L.D.), Vanderbilt Technologies for Advanced Genomics Core Facility, Vanderbilt University, Nashville, TN (C.S., R.W., P.B., H.H.D.); Hussman Institute for Human Genomics, University of Miami, FL (M.A.P.-V., W.K.S.); and Division of Endocrinology, Diabetes, and Nutrition, Department of Medicine, University of Maryland School of Medicine, Baltimore (T.I.P.). The Data Supplement is available at http://circgenetics.ahajournals.org/lookup/suppl/doi:10.1161/CIRCGENETICS.113.000369/-/DC1. Correspondence to Dana C. Crawford, PhD, Case Western Reserve University, Wolstein Research Building, 2013 Cornell Rd, Suite 2527, Cleveland, OH 44106. E-mail [email protected] © 2014 American Heart Association, Inc. Circ Cardiovasc Genet is available at http://circgenetics.ahajournals.org 848 DOI: 10.1161/CIRCGENETICS.113.000369 Crawford et al APOC3 R19X, Cardio-Protective Profiles, and NHANES 849 Downloaded from http://circgenetics.ahajournals.org/ by guest on June 17, 2017 null mutation was associated with cardio-protective profiles in the Amish, including significantly higher levels of HDL-C and lower levels of triglycerides and total cholesterol levels compared with noncarriers.11 Pennsylvania Amish carriers of 19X were also less likely to have detectable coronary artery calcification compared with noncarriers, which is consistent with their cardio-protective profiles.11 In contrast to the Pennsylvania Amish population, no 19X carriers were identified in a preliminary study of 214 European American adults from Baltimore, Maryland, suggesting that this variant is rare in the general population.11 To date, the rare APOC3 R19X has not yet been characterized in a large and diverse general population or other Amish populations. Therefore, to further characterize this variant in other populations, we as part of the Epidemiological Architecture for Genes Linked to Environment Study, a member of the Population Architecture using Genomics and Epidemiology (PAGE) I Study,14 genotyped APOC3 R19X in 19 613 Americans (including non-Hispanic whites, nonHispanic blacks, and Mexican Americans) ascertained for the National Health and Nutrition Examination Surveys (NHANES). We also genotyped 1113 Old Order Amish from Ohio and Indiana ascertained for studies of aging and dementia. Overall, we found the 19X variant to be present but rare in the general American population and on the same single haplotype as in the Lancaster Amish; 19X was absent in this sample of Old Order Amish from the Midwestern United States. These data confirm the association between APOC3 R19X and cardio-protective lipid profiles and provide the first glimpse of carrier rates in a general population of Americans. Methods Study Population The National Health and Nutrition Examination Surveys are conducted by the National Center for Health Statistics at the Centers for Disease Control and Prevention. The NHANES participants included in this study were ascertained as part of NHANES III phase 2 (between 1991 and 1994), NHANES 1999 to 2000, NHANES 2001 to 2002, and NHANES 2007 to 2008. NHANES is a national representative survey of noninstitutionalized Americans ascertained regardless of health status. NHANES collects data on health and lifestyle of participants via questionnaires, laboratory measures, and a physical examination administered by Centers for Disease Control (CDC) health professionals. We accessed data for serum HDL-C, triglycerides, and total cholesterol, which were all measured using standard enzymatic methods. LDL-C was calculated using the Friedewald equation, with missing values assigned for samples with triglyceride levels >400 mg/ dL. Body mass index (kg/m2) was calculated from measured height and weight as part of the physical examination in the CDC Mobile Examination Center. A total of 19 613 samples were available for study from consenting participants. All procedures were approved by the CDC Ethics Review Board and written informed consent was obtained from all participants. Because no identifying information is available to the investigators, Vanderbilt University’s Institutional Review Board determined that this study met the criteria of nonhuman subjects. The Old Order Amish participants included in this study were originally ascertained as part of a population-based study of aging and dementia conducted between 1998 and present in the Amish communities of Adams, Elkhart, and LaGrange Counties in Indiana and Holmes County in Ohio. Study population characteristics of the Old Order Amish have been previously described.15 Briefly, the Amish immigrated to the United States from Europe in 2 waves. The first wave arrived and settled in Pennsylvania in the early 1700s and some of these families proceeded to migrate west to Ohio in the 18th and 19th centuries.16 A second wave of European Amish with distinct surnames arrived in Pennsylvania in the 1800s but continued westward to Indiana and Ohio.17,18 A third wave of immigrants from Switzerland settled in Adams County, Indiana in the mid-1800s. Written informed consent was obtained for all Old Order Amish participants or their legal guardians. A total of 1113 unique DNA samples were available for genotyping in this study, of which 143 were cases of dementia, 620 were controls, and 350 were of unknown dementia status. Biomarker data were not available on the Midwestern Amish samples. Genotyping Genotyping was conducted by the Vanderbilt DNA Resources Core using Applied Biosystems’ custom TaqMan assay with the following primers and probes: 5′-CCTCCTGGCGCTCCTG-3′ (forward), 5′-CCAAGTTGCCTCCACCCT-3′ (reverse), 5′-CAAGTGC TTACGGGCAGA-3′ (G allele probe), and 5′-CAAGTGCTTACA GGCAGA-3′ (A allele probe). To evaluate the assay and to assist in clustering the rare variant, 94 deidentified samples consisting of 39 Pennsylvania Amish individuals heterozygous for R19X and 55 relatives without the mutation11 were genotyped blinded by mutation status. In addition to experimental NHANES DNA, we genotyped blinded duplicates provided by CDC for concordance checks and quality control. The genotyping call rate for rs76353203 in NHANES DNA samples and the Old Order Amish samples was ≈95% and 98%, respectively. For haplotype inference in the Pennsylvania Amish subjects, 211 individuals were genotyped for both rs7635320311 and the Illumina Omni 2.5 mol/L Beadchip. Originally 1472 individuals were genotyped for 2 443 179 single nucleotide polymorphisms (SNPs). The 6 Beadchip SNPs used in the haplotype inference (using Haploview)19 and comparison with NHANES III were among the 2 391 559 passing quality control filters, which comprised exclusion of SNPs with >2% duplicate inconsistency, >5% missing data, >5 Mendelian inconsistencies, Hardy–Weinberg Equilibrium P<10−6, mitochondrial location, minor allele frequency <0.01, as well as duplicated and nonuniquely mapped SNPs. Statistical Analysis Allele frequencies were calculated for 19X. Tests of association were limited to fasting (≥8 hours since last meal) adults (≥18 years of age), and study population characteristics are given in Table 1. We performed an SNP test of association assuming a dominant genetic model using linear regression. HDL-C, LDL-C, triglycerides each Table 1. NHANES Study Population Characteristics for Fasting Adults by Survey NHANES III (n=3501) % female NHANES 1999 to 2002 (n=4699) NHANES 2007 to 2008 (n=1854) 56.4 51.2 49.4 Mean age (SD) 44.78 (19.24) 48.82 (18.58) 50.93 (17.78) % non-Hispanic white 39.5 54.1 56.3 % non-Hispanic black 29.6 18.5 22.4 % Mexican American 30.9 27.4 21.3 Mean HDL-C (SD) 50.36 (15.23) 51.60 (15.68) 53.10 (15.78) Mean LDL-C (SD) 124.79 (37.57) 123.70 (34.99) 115.35 (35.96) Mean TG (SD) 134.85 (94.47) 145.57 (92.71) 135.06 (92.63) HDL-C indicates high-density lipoprotein cholesterol; LDL-C, low-density lipoprotein cholesterol; NHANES, National Health and Nutrition Examination Surveys; and TG, triglyceride. 850 Circ Cardiovasc Genet December 2014 Table 2. Meta-Analysis Results for APOC3 R19X and Lipid Traits in NHANES Sample Size β (SE) P Value HDL-C, mg/dL 8891 +15.65 (4.23) 0.0002 LDL-C, mg/dL 6502 −4.85 (11.65) 0.68 TG, mg/dL 7603 −71.20 (26.45) 0.007 HDL-C indicates high-density lipoprotein cholesterol; LDL-C, low-density lipoprotein cholesterol; NHANES, National Health and Nutrition Examination Surveys; and TG, triglyceride. Downloaded from http://circgenetics.ahajournals.org/ by guest on June 17, 2017 were the dependent variables as continuous traits. Models were adjusted for age, sex, and self-reported race/ethnicity. Logistic regressions were performed for each NHANES (III, 1999 to 2002, and 2007 to 2008) with SAS v9.2 (SAS Institute, Cary, NC) using the Analytic Data Research by E-mail portal of the CDC Research Data Center in Hyattsville, MD. Meta-analyses were conducted using a fixed-effects inverse-variance weighted approach using METAL.20 METAL also implements Cochran Q-test for heterogeneity. To facilitate comparisons of the genetic effect of 19X across lipid traits across studies, we also calculated the geometric means by carrier status. Haplotypes were inferred using PHASE v2.1.121,22 for 35 genetic variants in the APOA5/A4C3/A1 gene cluster on chromosome 11. The 35 genetic variants included APOC3 R19X and genetic variants selected as tagSNPs (based on data from Fullerton et al)23 and genotyped in NHANES III for previous lipid trait genetic association studies24,25 (Table I in the Data Supplement). These Genetic NHANES data are available for secondary analysis via the CDC. Results General Population To estimate the prevalence of the APOC3 19X allele in a general population from the United States, we genotyped rs76353203 in a total of 19 613 DNA samples from participants ascertained as part of NHANES III, NHANES 1999 to 2002, and NHANES 2007 to 2008. We identified a total of 31 carriers of the 19X allele among all participants genotyped. The overall frequency of the 19X allele was 0.08% in this US population. Among the 3 major NHANES racial/ethnic groups, the 19X allele was observed at a higher frequency among Mexican Americans (0.23% allele frequency) and nonHispanic whites (0.20% allele frequency) compared with nonHispanic blacks (0.0124% allele frequency). We next characterized the haplotype background containing APOC3 19X in the 10 carriers identified in NHANES III to establish the number of haplotype backgrounds associated with the mutation event. A total of 35 genetic variants spanning the APOA5/A4/C3/A1 gene cluster on chromosome 11 were available for haplotype inference in NHANES III only (Table I in the Data Supplement). All 10 19X alleles in NHANES III were inferred on a single haplotype background, suggesting that this mutation occurred once (Table II in the Data Supplement). This haplotype background was identical to the second most common haplotype in the population at all loci except R19X. Using data from 211 individuals in the Pennsylvania Amish genotyped for both APOC3 R19X and the Illumina Omni 2.5 mol/L Beadchip, we identified 6 polymorphic markers genotyped in both the Amish and NHANES samples. These 6 markers along with APOC3 R19X uniquely tagged 8 of the 10 common haplotypes and collapsed 2 pairs of haplotypes. The Amish haplotype containing the APOC3 R19X variant was identical to the NHANES III R19X haplotype, but found at a frequency of 0.028 versus 0.0007 in NHANES III consistent with a founder effect and genetic drift (Table III in the Data Supplement; also shows frequency of haplotypes in the 1000 Genomes Project). To assess the relationship between APOC3 R19X and lipid profiles in the general population, we performed a test of association between the rare variant and HDL-C, LDLC, and triglycerides among fasting adults (Table 1). Overall, the 19X allele was associated with lower triglycerides (β=−71.20±26.45 [SEM] ln mg/dL; P=0.007) and higher HDL-C (β=+15.65±4.23 mg/dL; P=2.1×10−4) after adjustment for age, sex, and race/ethnicity (Table 2). The 19X allele was not significantly associated with LDL-C in adjusted analyses (β=−4.85±11.65 mg/dL; P=0.68). There was no evidence for heterogeneity for any of the 3 meta-analyzed lipids traits (pheterogeneity=0.226, 0.794, and 0.997 for HDL-C, LDLC, and triglycerides, respectively). On average, participants with the 19X allele had approximately half the triglyceride levels, >20% higher HDL-C levels, and, although not statistically significant, had lower LDL-C levels compared with RR homozygotes (Table 3). In contrast, there was no association between the 19X allele and body mass index (β=2.16±1.82; P=0.234), as was the case in the Pennsylvania Amish.11 Midwestern US Amish Population The original discovery of APOC3 R19X involved Amish participants from Pennsylvania.11 Given the migration patterns and history of the Amish in the Americans, we sought to estimate the prevalence of 19X in a non-Pennsylvania Amish population to determine if the cardio-protective variant is common in the Amish that settled in Ohio and Indiana. Among the 1113 Amish participants from Indiana and Ohio genotyped for APOC3 R19X, no carriers were detected. Detailed examination of the pedigree connections between the sampled Ohio/ Table 3. Geometric Means and SD of HDL-C, LDL-C, and TG Levels Among Fasting Adults by APOC3 19X Carrier Status and NHANES NHANES III Noncarriers Carriers NHANES 1999 to 2002 Noncarriers Carriers NHANES 2007 to 2008 Noncarriers Carriers HDL-C, mg/dL (SD) 50.38 (0.27) 59.51 (7.31) 51.92 (0.25) 74.38 (5.47) 53.36 (0.37) 56.78 (14.47) LDL-C, mg/dL (SD) 125.68 (0.80) 104.46 (36.26) 122.95 (0.74) 117.73 (13.09) 116.05 (0.98) 128.60 (36.07) TG, mg/dL (SD) 135.67 (1.65) 51.30 (45.32) 141.33 (1.90) 63.68 (34.03) 134.62 (2.44) 69.67 (90.70) HDL-C indicates high-density lipoprotein cholesterol; LDL-C, low-density lipoprotein cholesterol; NHANES, National Health and Nutrition Examination Surveys; and TG, triglyceride. Crawford et al APOC3 R19X, Cardio-Protective Profiles, and NHANES 851 Indiana Amish and the Pennsylvania Amish indicate that only 2 founders of the sampled Ohio/Indiana Amish were directly descended from the most recent common ancestor of all Pennsylvania Amish individuals possessing the R19X variant.11 This suggests that the 19X allele was either lost in this sample because of chance nontransmission, the variant is much less frequent compared with the Pennsylvania Amish, or the variant was not detected because of genotyping error. Discussion Downloaded from http://circgenetics.ahajournals.org/ by guest on June 17, 2017 We estimated the prevalence of the APOC3 19X allele to be 0.08% in a general population from the United States ascertained regardless of health status, with allele frequencies highest among Mexican Americans (0.23%) and non-Hispanic whites (0.20%) and lowest among non-Hispanic blacks (0.0124%). We also replicated the association between APOC3 rs76353203 and cardio-protective lipid profiles first described in a Pennsylvania Amish population. APOC3 19X carriers in NHANES on average had significantly higher HDL-C (geometric means, 64 versus 52 mg/dL) and lower triglycerides (geometric means, 62 versus 137 mg/dL) compared with noncarriers, similar to the effects reported in the Pennsylvania Amish carriers of 19X (median HDL-C, 67 versus 55 mg/dL; median triglycerides, 31 versus 57 mg/dL).11 We also observed a trend for lower LDL-C levels among APOC3 19X carriers versus noncarriers (117 versus 122 mg/dL), which, although not statistically significant, was in the same direction as was seen in the Pennsylvania Amish (116 versus 140 mg/dL).11 We did not identify any APOC3 19X carriers among 1113 Amish from Ohio and Indiana. These data coupled with NHANES data of an overall frequency of 0.08% suggest that APOC3 19X is a rare variant in most populations. Interestingly, among the 31 carriers identified in NHANES, the overwhelming majority were either self-identified non-Hispanic white or Mexican American. Indeed, frequency of 19X in non-Hispanic blacks is >15 times less than that for the other 2 groups. Furthermore, the mutation was observed on a single haplotype background in all NHANES III participants and the Amish subjects. In the National Heart, Lung, and Blood Institute Exome Sequencing Project, the 19X allele was less common in European Americans than in NHANES (allele frequency 0.03% [3/8588 alleles] in 4294 individuals) but nonexistent (0/4402 alleles) in blacks.12,13 Collectively, these data suggest that the founding mutation event occurred once. Since the writing of this article, Tachmazidou and colleagues26 published frequency estimates of APOC3 R19X in a Greek population isolate. Among 1267 individuals, 3.8% were identified as carriers of the 19X allele, with an overall allele frequency of 1.9% in this isolated population.26 In contrast, the 19X allele was found only among 4 of the 3621 participants whole genome sequenced from the UK10K project,27 for an overall allele frequency of 0.05%. The allele frequency estimated by the UK10K project (0.05%) is similar to the overall frequency estimated in NHANES (0.08%), reflecting the fact that both surveys ascertained participants without regard to disease unlike the Exome Sequencing Project, which ascertained cases and controls of various diseases and extremes of quantitative trait distributions for ≥7 studies. In the case of the UK10K project, the whole genome sequence data were generated on well-phenotyped controls from the TwinsUK and Avon Longitudinal Study of Parents and Children Study.27 Also like NHANES, Tachmazidou et al26 found the 19X allele on a single haplotype background in both the Greek population isolate and the UK10K data sets, suggesting a single origin of the mutation. And, Tachmazidou et al26 replicated the association between 19X and lower triglyceride levels and higher HDL-C levels compared with noncarriers. Together, these data establish the association between 19X and cardioprotective lipid profiles in both isolate and outbred populations. This study has many strengths, including sample size (≈20 000 total DNA samples) and diversity (3 racial/ethnic groups). Despite these strengths, this study also has several weaknesses. First, given that the allele frequency in Africandescent populations is low, the sample size for non-Hispanic blacks in NHANES is not sufficient for accurate estimates especially given the error rate of the assay is nearly equal to the frequency estimate of the overall population. Second, the Ohio and Indiana Amish population genotyped here was ascertained for studies related to aging and dementia, and it is possible that ascertainment bias may be an explanation for the lack of 19X carriers in the non-Pennsylvania Amish. However, given that most of the genotyped Amish were free of dementia and were older than a general population, the sample was likely biased toward more 19X carriers. These data coupled with examination of the pedigrees suggest that the founder mutation described in the Pennsylvania Amish was lost by chance nontransmission in the Ohio and Indiana Amish (although differences in frequency of the mutation between Amish samples or genotyping error cannot yet be ruled out). Third, while we replicated the genetic association between APOC3 19X and cardio-protective profiles in NHANES, we were limited to HDL-C, LDL-C, and triglyceride levels. NHANES does not regularly measure other lipid traits such as non-HDL or very low-density lipoprotein. NHANES also does not determine coronary artery calcification in participants. Finally, given the wide age range of NHANES (children to older adults), NHANES has few cases of myocardial infarction or other clinical outcomes. Therefore, this study was limited in exploring the extent of cardio-protection afforded by APOC3 19X in NHANES. There are other limitations worth noting that may have affected the allele frequency estimations presented here. The frequencies presented here were not weighted to account for the complex survey design used to ascertain participants for NHANES. The sample design differs by survey, and NHANES provides sampling weights by survey (and by variable, if applicable). NHANES does not yet provide sampling weights for analyses combining all Genetic NHANES (NHANES III, 1999 to 2002, and 2007 to 2008). It is, in fact, unclear if sampling weights can be calculated for the Genetic NHANES data set given the original surveys were conducted during different census years. Given the lack of sampling weights, we have analyzed the data unweighted, and this may have slightly overestimated the point estimate of the allele frequencies for all racial/ethnic groups presented here. It is difficult, however, to assess the extent of overestimation given that the genotyping performed here may have also introduced both false positives and false negatives. TaqMan genotyping assays are generally associated with low error rates of 0.1%,28 with published error rates as 852 Circ Cardiovasc Genet December 2014 Downloaded from http://circgenetics.ahajournals.org/ by guest on June 17, 2017 low as <1 in 2000 genotypes or 0.05%.29 On the basis of these error rates, we would expect 10 to 20 genotyping errors among 19 613 NHANES participants genotyped for rs76353203 using this assay. Given the PAGE I Study only supported genotyping, we were unable to verify the 31 identified carriers of the 19X via resequencing. Comparisons of the frequencies estimated here to the UK10K Project described above and replication of the associations between 19X and HDL-C and triglycerides as reported in several populations, however, suggest that the effect of the absence of sample weights or genotyping error is minimal. Despite these limitations, these data establish APOC3 R19X as a variant present in the general outbred population at an appreciable, albeit low, frequency with favorable effects on lipid profiles similar to that observed in the Pennsylvania Amish. Recent Phase I clinical trials have shown that inhibition of apoC-III leads to reductions in plasma apoC-III and triglycerides in humans,30 both of which are associated with risk of cardiovascular disease. Further studies are needed to establish the relevance of APOC3 in cardiovascular disease prediction, drug therapy, and other possible clinical applications. Acknowledgments We at Epidemiological Architecture for Genes Linked to Environment would like to thank Dr Geraldine McQuillan and Jody McLean for their help in accessing the Genetic NHANES data and Keith Tanner for technical assistance in assembling the Pennsylvania Amish positive control samples. The Vanderbilt University Center for Human Genetics Research, Computational Genomics Core provided computational or analytic support for this work. The findings and conclusions in this report are those of the authors and do not necessarily represent the views of the National Institutes for Health or the Centers for Disease Control and Prevention. Sources of Funding Genotyping in National Health and Nutrition Examination Surveys (NHANES) was supported in part by the Epidemiological Architecture for Genes Linked to Environment Study (U01HG004798 and its ARRA supplements) as part of the Population Architecture using Genomics and Epidemiology Study established by the National Human Genome Research Institute. Select NHANES III data presented here were genotyped under funding provided by the University of Washington’s Center for Ecogenetics and Environmental Health supported by the National Institute of Environmental Sciences (5 P30 ES007033-12). Also, genotyping services for select NHANES III single nucleotide polymorphism (SNPs) presented here were provided by the Johns Hopkins University under federal contract number (N01-HV-48195) from National Heart, Lung, and Blood Institute. Genotyping and analysis in Pennsylvania Amish samples supported by National Institutes of Health (NIH) R01 HL088119, R01 AR046838, U01 HL72515, R01 AG18728, U01 HL084756, R01 HL104193, and R01 CA122844; General Clinical Research Centers Program, National Center for Research Resources, NIH; the University of Maryland General Clinical Research Center, grant M01 RR 16500; University of Maryland Nutrition and Obesity Research Center grant P30 DK072488. The collection and genotyping of the Indiana/Ohio Amish samples was supported by the NIH grants AG019085 (to Dr Haines and Dr Pericak-Vance) and AG019726 (to Dr Scott). We thank Dr Julie Douglas for kindly sharing the Illumina Omni 2.5 mol/L Beadchip SNP genotypes obtained in the Old Order Amish as part of R01 CA122844. Disclosures None. References 1. Global Lipids Genetics Consortium. Discovery and refinement of loci associated with lipid levels. Nat Genet. 2013;45:1274–1283. 2. Teslovich TM, Musunuru K, Smith AV, Edmondson AC, Stylianou IM, Koseki M, et al. Biological, clinical and population relevance of 95 loci for blood lipids. Nature. 2010;466:707–713. 3. Hegele RA. Plasma lipoproteins: genetic influences and clinical implications. Nat Rev Genet. 2009;10:109–121. 4. Cohen JC, Kiss RS, Pertsemlidis A, Marcel YL, McPherson R, Hobbs HH. Multiple rare alleles contribute to low plasma levels of HDL cholesterol. Science. 2004;305:869–872. 5. Cohen J, Pertsemlidis A, Kotowski IK, Graham R, Garcia CK, Hobbs HH. Low LDL cholesterol in individuals of African descent resulting from frequent nonsense mutations in PCSK9. Nat Genet. 2005;37: 161–165. 6. Kotowski IK, Pertsemlidis A, Luke A, Cooper RS, Vega GL, Cohen JC, et al. A spectrum of PCSK9 alleles contributes to plasma levels of lowdensity lipoprotein cholesterol. Am J Hum Genet. 2006;78:410–422. 7. Musunuru K, Pirruccello JP, Do R, Peloso GM, Guiducci C, Sougnez C, et al. Exome sequencing, ANGPTL3 mutations, and familial combined hypolipidemia. N Engl J Med. 2010;363:2220–2227. 8. Rios J, Stein E, Shendure J, Hobbs HH, Cohen JC. Identification by whole-genome resequencing of gene defect responsible for severe hypercholesterolemia. Hum Mol Genet. 2010;19:4313–4318. 9.Romeo S, Pennacchio LA, Fu Y, Boerwinkle E, Tybjaerg-Hansen A, Hobbs HH, et al. Population-based resequencing of ANGPTL4 uncovers variations that reduce triglycerides and increase HDL. Nat Genet. 2007;39:513–516. 10. Lange L, Hu Y, Zhang H, Xue C, Schmidt E, Tang ZZ, et al. Whole-exome sequencing identifies rare and low-frequency coding variants associated with LDL cholesterol. Am J Hum Genet. 2014;94:233–245. 11. Pollin TI, Damcott CM, Shen H, Ott SH, Shelton J, Horenstein RB, et al. A null mutation in human APOC3 confers a favorable plasma lipid profile and apparent cardioprotection. Science. 2008;322:1702–1705. 12. Tennessen JA, Bigham AW, O’Connor TD, Fu W, Kenny EE, Gravel S, et al. Evolution and functional impact of rare coding variation from deep sequencing of human exomes. Science. 2012;337:64–69. 13. Fu W, ‘Connor TD, Jun G, Kang HM, Abecasis G, Leal SM, et al. Analysis of 6,515 exomes reveals the recent origin of most human protein-coding variants. Nature. 2013;493:216–220. 14. Matise TC, Ambite JL, Buyske S, Carlson CS, Cole SA, Crawford DC, et al; PAGE Study. The next PAGE in understanding complex traits: design for the analysis of Population Architecture Using Genetics and Epidemiology (PAGE) Study. Am J Epidemiol. 2011;174:849–859. 15. Edwards DR, Gilbert JR, Jiang L, Gallins PJ, Caywood L, Creason M, et al. Successful aging shows linkage to chromosomes 6, 7, and 14 in the Amish. Ann Hum Genet. 2011;75:516–528. 16. Walt J, Scott W, Slifer S, Gaskell PC, Martin E, Welsh-Bohmer K et al. Maternal lineages and Alzheimer disease risk in the Old Order Amish. Hum Genet. 2005;118:115–122. 17. Hostetler JA. Amish Society. Baltimore, MD: Johns Hopkins University Press; 1993. 18. Hostetler JA. Hutterite Society. Baltimore, MD: Johns Hopkins University Press; 1997. 19. Barrett JC, Fry B, Maller J, Daly MJ. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005;21:263–265. 20. Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics. 2010;26:2190–2191. 21.Stephens M, Smith NJ, Donnelly P. A new statistical method for haplotype reconstruction from population data. Am J Hum Genet. 2001;68:978–989. 22. Stephens M, Scheet P. Accounting for decay of linkage disequilibrium in haplotype inference and missing-data imputation. Am J Hum Genet. 2005;76:449–462. 23. Fullerton SM, Buchanan AV, Sonpar VA, Taylor SL, Smith JD, Carlson CS, et al. The effects of scale: variation in the APOA1/C3/A4/A5 gene cluster. Hum Genet. 2004;115:36–56. 24. Dumitrescu L, Carty CL, Taylor K, Schumacher FR, Hindorff LA, Ambite JL, et al. Genetic determinants of lipid traits in diverse populations from the population architecture using genomics and epidemiology (PAGE) study. PLoS Genet. 2011;7:e1002138. 25. Keebler ME, Sanders CL, Surti A, Guiducci C, Burtt NP, Kathiresan S. Association of blood lipids with common DNA sequence variants at 19 genetic loci in the multiethnic United States National Health and Nutrition Crawford et al APOC3 R19X, Cardio-Protective Profiles, and NHANES 853 Examination Survey III/CLINICAL PERSPECTIVE. Circ Cardiovasc Genet. 2009;2:238–243. 26. Tachmazidou I, Dedoussis G, Southam L, Farmaki AE, Ritchie GRS, Xifara DK, et al. A rare functional cardioprotective APOC3 variant has risen in frequency in distinct population isolates. Nat Commun. 2013;4:2872. 27. Muddyman D, Smee C, Griffin H, Kaye J. Implementing a successful data-management framework: the UK10K managed access model. Genome Med. 2013;5:100. 28. De La Vega FM, Isaac H, Collins A, Scafe CR, Halldorsson BV, Su X, et al. The linkage disequilibrium maps of three human chromosomes across four populations reflect their demographic history and a common underlying recombination pattern. Genome Res. 2005;15:454–462. 29. Ranade K, Chang MS, Ting CT, Pei D, Hsiao CF, Olivier M, et al. Highthroughput genotyping with single nucleotide polymorphisms. Genome Res. 2001;11:1262–1268. 30. Graham MJ, Lee RG, Bell TA III, Fu W, Mullick AE, Alexander VJ, et al. Antisense oligonucleotide inhibition of apolipoprotein C-III reduces plasma triglycerides in rodents, nonhuman primates, and humans. Circ Res. 2013;112:1479–1490. CLINICAL PERSPECTIVE Downloaded from http://circgenetics.ahajournals.org/ by guest on June 17, 2017 Decades of genetic studies spanning linkage studies, candidate gene and genome-wide association studies, and contemporary sequencing studies have identified hundreds of rare and common genetic variants associated with lipid profiles within the normal range as well as extreme lipid profiles associated with dyslipidemia. One recently identified variant in a Pennsylvania Old Order Amish population is rs76353203 (R19X), a rare null variant in apolipoprotein C III (APOC3) that was associated with cardio-protective profiles including significantly higher levels of high density lipoprotein (HDL) cholesterol and lower levels of triglycerides and total cholesterol levels compared with noncarriers of the mutation. Preliminary data in European-descent outbred populations suggested that APOC3 19X is rare in the general population. In this study, we sought to further characterize the frequency of APOC3 rs76353203 and replicate the reported associations in 19 613 participants from 3 National Health and Nutrition Examination Surveys (NHANES III, NHANES 1999 to 2002, and NHANES 2007 to 2008). We estimated the prevalence of the APOC3 19X allele to be 0.08% in a general population from the United States ascertained regardless of health status, with allele frequencies highest among Mexican Americans (0.23%) and non-Hispanic whites (0.20%) and lowest among non-Hispanic blacks (0.0124%). We also replicated the association between APOC3 rs76353203 and cardio-protective lipid profiles. These data establish APOC3 19X is present in the general outbred population at an appreciable, albeit low, frequency with favorable effects on lipid profiles. Further studies are needed to establish the relevance of APOC3 in cardiovascular disease prediction, drug therapy, and other possible clinical applications. Downloaded from http://circgenetics.ahajournals.org/ by guest on June 17, 2017 Rare Variant APOC3 R19X Is Associated With Cardio-Protective Profiles in a Diverse Population-Based Survey as Part of the Epidemiologic Architecture for Genes Linked to Environment Study Dana C. Crawford, Logan Dumitrescu, Robert Goodloe, Kristin Brown-Gentry, Jonathan Boston, Bob McClellan, Jr, Cara Sutcliffe, Rachel Wiseman, Paxton Baker, Margaret A. Pericak-Vance, William K. Scott, Melissa Allen, Ping Mayo, Nathalie Schnetz-Boutaud, Holli H. Dilks, Jonathan L. Haines and Toni I. Pollin Circ Cardiovasc Genet. 2014;7:848-853; originally published online November 1, 2014; doi: 10.1161/CIRCGENETICS.113.000369 Circulation: Cardiovascular Genetics is published by the American Heart Association, 7272 Greenville Avenue, Dallas, TX 75231 Copyright © 2014 American Heart Association, Inc. All rights reserved. Print ISSN: 1942-325X. Online ISSN: 1942-3268 The online version of this article, along with updated information and services, is located on the World Wide Web at: http://circgenetics.ahajournals.org/content/7/6/848 Data Supplement (unedited) at: http://circgenetics.ahajournals.org/content/suppl/2014/11/01/CIRCGENETICS.113.000369.DC1 Permissions: Requests for permissions to reproduce figures, tables, or portions of articles originally published in Circulation: Cardiovascular Genetics can be obtained via RightsLink, a service of the Copyright Clearance Center, not the Editorial Office. Once the online version of the published article for which permission is being requested is located, click Request Permissions in the middle column of the Web page under Services. Further information about this process is available in the Permissions and Rights Question and Answer document. Reprints: Information about reprints can be found online at: http://www.lww.com/reprints Subscriptions: Information about subscribing to Circulation: Cardiovascular Genetics is online at: http://circgenetics.ahajournals.org//subscriptions/ SUPPLEMENTAL MATERIAL Supplementary Table 1: APOA5/A4/C3/A1 gene cluster SNPs available in NHANES III for haplotype inference. SNP ID (if available), chromosome 11 position (Genome Build 19), alleles, nearest gene, and SNP location within or near gene are given for each variant. The minor allele in CEU or other European-descent populations (such as those in Fullerton et al21) is bolded italicized. Names of SNPs used to compare haplotype backgrounds with Pennsylvania Amish subjects are bolded. SNP Position Alleles Nearest Gene Location rs28927680 116619073 C/G APOA5 3′ UTR rs964184 116648917 C/G APOA5 3′ near gene rs12286037 116652207 C/T APOA5 Intronic rs2075294 116658122 A/C APOA5 Intronic rs33984246 116660450 A/G APOA5 5′ near gene rs34089864 116660768 A/G APOA5 3′ UTR rs619054 116660813 C/T APOA5 3′ UTR rs2072560 116661826 C/T APOA5 Intronic rs12287066 116662331 A/C APOA5 Synonymous rs17120035 116663851 C/T APOA5 5′ near gene rs10750097 116664040 C/T APOA5 5′ near gene rs12721040 116691480 A/G APOA4 3′ UTR rs5110 116691634 A/C APOA4 Nonsynonymous rs675 116691675 A/T APOA4 Nonsynonymous rs5106 116691928 C/T APOA4 Synonymous rs12721043 116692293 A/C APOA4 Nonsynonymous rs5104 116692334 A/G APOA4 Nonsynonymous rs5100 116692694 A/G APOA4 Intronic rs5096 116693095 C/T APOA4 Intronic rs5094 116693213 A/G APOA4 Intronic rs5092 116693464 A/G APOA4 Synonymous rs5091 116694005 A/G APOA4 5′ UTR 1 rs5090 116694055 C/G APOA4 5′ near gene rs12721083 116699336 G/T APOC3 5′ near gene rs12721092 116700194 C/T APOC3 5′ near gene rs17257761a 116700860 A/G APOC3 Intronic rs76353203 116701353 A/G APOC3 Nonsense rs4520 116701535 C/T APOC3 Synonymous rs5142 116701850 C/T APOC3 Intronic rs34635405b 116703671 G/T APOC3 5′ UTR rs11216153 116705100 G/T APOA1 3′ near gene rs12721028 116705590 A/G APOA1 3′ near gene rs12718463 116706343 A/G APOA1 3′ near gene rs7116797 116707338 C/T APOA1 Intronic rs5071 a merged with rs734104 b merged with rs4225 116707734 A/G APOA1 Synonymous 2 Supplementary Table 2. APOA5/A4/C3/A1 gene cluster inferred haplotypes in NHANES III. Haplotypes were inferred in NHANES III using 35 SNPs from Supplementary Table 1 and PHASE v2.1.119, 20. Major alleles for all SNPs are shown in the first row of the table and are ordered 5′ to 3′ according to the genomic positions given in Supplementary Table 1. A total of 475 haplotypes were inferred in NHANES III for this gene region. Ten of the most common inferred haplotypes in NHANES III are displayed (along with counts and percentage) followed by the haplotype containing APOC3 19X. The 19X allele is bolded and shown in white text in a dark box. Inferred haplotype # chromosomes (%) N/a Major alleles GCCCAGCCCCTGCTCCAATAAGCGTAGCCGGAATG Haplotype #1 GCCCAGCCCCTGCTCCAATAAGCGTAGCCTGAATG 3,558 (23.58) Haplotype #2 GCCCAGTCCCTGCACCAGCAAGCGTAGCCTGAATG 961 (6.37) Haplotype #3 CGTCAGCCACCGCTCCAATAAGCGTAGTCGTGATG 736 (4.88) Haplotype #4 GGTCAGCTCCCGCTCCGGCAGGCGTGGTTGGAACG 651 (4.31) Haplotype #5 GCCCAGCCCCTGATCCAATAAGCGTAGCCTGAATG 631 (4.18) Haplotype #6 GCCCAGCCCCTGCTCCAATAAGCGTAGCCGGAATG 420 (2.78) Haplotype #7 GCCCAGCCCCTGCTCCAATAAGCGTAGTCGTGATG 379 (2.51) Haplotype #8 GCCCAGCCCTCGCTCCAATAAGCGTAGCCGGAATG 349 (2.31) Haplotype #9 GCCCAGCCCCCGCTCCAATAAGCGTAGTCGTGATG 306 (2.03) Haplotype #10 GCCCAGTCCCTGCTCCGGCAGGCGTAGTCGTGATG 301 (2.0) Haplotype 19X GCCCAGTCCCTGCACCAGCAAGCGTAACCTGAATG 10 (0.07) 3 Supplementary Table 3. Comparison of APOA5/A4/C3/A1 haplotype frequency among NHANES III, Amish, and 1000 Genomes using polymorphic tagging SNPs genotyped in both NHANES III and Amish showing that APOC3 R19X is found on a haplotype background otherwise identical to haplotype #2 in both NHANES III and Amish. Alleles departing from the major haplotype are shown in dark boxes. SNPs are (1) rs619054; (2) rs12287066; (3) rs10750097; (4) rs5110; (5) rs34635405/ rs734104; (6) rs76353203 (R19X) and (7) rs34635405/ rs4225. The number of individuals is denoted by “n” for each sample. 7 % in NHANES III (n = 3,980) % in Amish (n = 211) %1000G EUR (n = 379) 1000G AFR (n = 246) %1000G MEX (n = 66) %1000G COMBINED (n = 691) 1 2 3 4 5 6 R19X Hap #1 C C T C A G T 23.6 38.4 38.4 3.4 29.1 23.9 Hap #2 T C T C A G T 6.4 17.9 14.6 3.4 11.3 10.8 Hap #3 C A C C A G G 4.9 4.4 4.7 12.0 10.4 8.0 Hap #4 C C C C G G G 4.3 2.1 8.9 3.7 11.7 7.5 Hap #5 C C T A A G T 4.2 6.4 6.0 0.0 3.0 3.9 Hap #6-7 C C T C A G G 5.3 10.9 9.3 33.5 10.2 18.2 Hap #8-9 C C C C A G G 4.3 0.0 4.7 21.6 15.8 11.6 Hap #10 T C T C A G G 2.0 5.8 0.0 5.6 0.0 0.0 Hap 19X T C T C A A T 0.07 2.8 0.0 0.0 0.0 0.0 4
© Copyright 2026 Paperzz