From Genotype to Phenotype: Nonsense Variants in SLC13A1 Are

G3: Genes|Genomes|Genetics Early Online, published on July 13, 2016 as doi:10.1534/g3.116.032979
1
Title Page
2
3
From Genotype to Phenotype: Nonsense Variants in SLC13A1 are Associated with Decreased
4
Serum Sulfate and Increased Serum Aminotransferases
5
6
*Christina G. Tise, 1033 West Barre Street, Baltimore, MD 21230;
7
[email protected]
8
James A. Perry1, 685 West Baltimore Street, MSTF Room 357, Baltimore, MD 21201;
9
[email protected]
10
Leslie E. Anforth1, 135 Strummer Lane Gaithersburg, MD 20878; [email protected]
11
Mary A. Pavlovich1, 685 West Baltimore Street, MSTF Room 357, Baltimore, MD 21201;
12
[email protected]
13
Joshua D. Backman1, 660 West Redwood Street, Howard Hall 460, Baltimore, MD 21201;
14
[email protected]
15
Kathleen A. Ryan1, 685 West Baltimore Street, MSTF Room 357, Baltimore, MD 21201;
16
[email protected]
17
Joshua P. Lewis1, 660 West Redwood Street, Howard Hall 498, Baltimore, MD 21201;
18
[email protected]
19
Jeffrey R. O’Connell1, 685 West Baltimore Street, MSTF Room 300 B, Baltimore, MD 21201;
20
[email protected]
21
Laura M. Yerges-Armstrong1¶, 709 Swedeland Road, UW2230, King of Prussia, PA 19406;
22
[email protected]
1
© The Author(s) 2013. Published by the Genetics Society of America.
23
Alan R. Shuldiner1¶, 685 West Baltimore Street, MSTF Room 379, Baltimore, MD 21201;
24
[email protected]
25
26
1
27
Nutrition, and, University of Maryland School of Medicine, Baltimore, Maryland, United States of
28
America
Program for Personalized and Genomic Medicine and Division of Endocrinology, Diabetes and
29
30
* Corresponding author
31
¶
LMY and ARS are Joint Senior Authors.
32
33
34
35
36
37
38
39
40
41
42
43
44
45
2
46
Abstract
47
Using genomic applications to glean insights into human biology, we systematically
48
searched for nonsense, single nucleotide variants (SNVs) that are rare in the general population but
49
enriched in the Old Order Amish (Amish) due to founder effect. We identified two non-linked,
50
nonsense SNVs (R12X and W48X) in SLC13A1 (allele frequencies 0.29% and 0.74% in the Amish;
51
enriched 1.2-fold and 3.7-fold, compared to the outbred Caucasian population, respectively).
52
SLC13A1 encodes the apical, sodium-sulfate cotransporter (NaS1) responsible for sulfate
53
(re)absorption in the kidneys and intestine. Both SLC13A1 R12X and W48X were independently
54
associated with a 27.6% (P=2.7x10-8) and 27.3% (P=6.9x10-14) decrease in serum sulfate,
55
respectively (P=8.8x10-20 for carriers of either SLC13A1 nonsense SNV). We further performed the
56
first exome- and genome-wide association study (ExWAS/GWAS) of serum sulfate and identified a
57
missense variant (L348P) in SLC26A1, which encodes the basolateral sulfate-anion transporter
58
(Sat1), that was associated with decreased serum sulfate (P=4.4x10-12). Consistent with sulfate’s
59
role in xenobiotic detoxification and protection against acetaminophen-induced hepatotoxicity,
60
SLC13A1 nonsense SNV carriers had higher aminotransferase levels compared to non-carriers.
61
Furthermore, SLC26A1 L348P was associated with lower whole-body bone mineral density (BMD)
62
and higher serum calcium, consistent with the osteochondrodysplasia exhibited by dogs and sheep
63
with naturally-occurring, homozygous, loss-of-function mutations in Slc13a1. This study
64
demonstrates the power and translational potential of systematic identification and characterization
65
of rare, loss-of-function variants and warrants additional studies to better understand the importance
66
of sulfate in human physiology, disease, and drug toxicity.
67
68
3
69
70
Article Summary
While less genetically diverse, founder populations are more likely to have multiple
71
individuals who share the same ancestral variant. The authors take advantage of this concept by
72
systematically identifying nonsense variants enriched in a cohort of Old Order Amish individuals.
73
Upon selecting two nonsense variants in the sulfate transporter gene, SLC13A1, to study further, the
74
authors investigate the consequences of these variants on serum sulfate, a micronutrient
75
infrequently measured. In addition, the authors examine the effect of these variants on clinical
76
phenotypes, results of which are particularly relevant given sulfate’s known role in acetaminophen
77
toxicity and osteoarthritis.
78
79
80
Keywords
81
SLC13A1; serum sulfate; loss-of-function variants; Old Order Amish; GWAS/ExWAS
82
83
84
85
86
87
88
89
90
4
91
Background
92
Population isolates have been critical in elucidating the genetic basis of both classical
93
Mendelian and complex diseases. While many loss-of-function variants associated with disease are
94
rare in the general population, genetic drift in founder populations like the Old Order Amish
95
(Amish) increases the probability that many individuals will share the same rare, disease-causing
96
variant. Recent advances in genome-wide genotyping and sequencing technologies have made
97
possible the systematic identification of these variants and their subsequent investigation.
98
In this study, we systematically interrogated nonsense single nucleotide variants (SNVs)
99
from the Illumina Human Exome BeadChip for minor allele frequency (MAF) enrichment in the
100
Amish compared to outbred, European-derived populations. Using this approach, we observed an
101
enrichment of two nonsense SNVs (c.34G>A, p.R12X and c.144C>T, p.W48X) in SLC13A1, which
102
encodes an apical membrane, sodium-sulfate cotransporter (NaS1) responsible for sulfate
103
(re)absorption in the intestines and kidneys.
104
Inorganic sulfate (SO42-) is an important micronutrient vital for numerous cellular and
105
metabolic processes in human development and physiology (Dawson 2011, 2013; Markovich
106
2014). Sulfate is critical in the biotransformation of multiple compounds via sulfotransferase-
107
mediated sulfate conjugation (sulfation) (Klaassen and Boles 1997). These compounds include
108
hormones, neurotransmitters, proteoglycans and xenobiotics during phase II metabolism (Dawson
109
2011; Markovich 2014, 2011a; Council 2005; Markovich 2011b). Impaired sulfation capacity
110
substantially alters the metabolism and activities of these compounds and has been implicated in
111
several human pathologies including reduced xenobiotic clearance, skeletal dysplasia, premature
112
pubarche, autism spectrum disorder (ASD), and neurological disease (Alberti et al. 1999; Bowling
113
et al. 2013; Heafield et al. 1990; Ahmad et al. 1998; Noordam et al. 2009; Dawson 2013; Dawson
5
114
and Markovich 2005). Despite the importance of sulfate in these physiological processes, little is
115
known regarding sulfate homeostasis in humans nor is it routinely measured in clinical settings.
116
Given the potential importance of sulfate in the development of several disorders, we
117
comprehensively characterized serum sulfate in 977 Amish individuals in order to 1) determine the
118
heritability of serum sulfate, 2) examine the relationship between SLC13A1 SNVs and serum
119
sulfate, 3) identify additional novel genetic contributors through completion of the first exome-wide
120
association study (ExWAS) and genome-wide association study (GWAS) of serum sulfate, and 4)
121
investigate associations between sulfate-altering genotypes and relevant, clinical phenotypes
122
including aminotransferase levels, bone mineral density (BMD), and serum calcium.
123
124
125
Methods
126
Study Population
127
This report is based on the Amish community living in Lancaster County, PA, whom our
128
research group has been studying since 1993. This community was founded by several hundreds of
129
individuals who immigrated to Lancaster County, PA from central Europe during the early 18th
130
century, with the present day Lancaster County Amish community comprised of their descendants
131
(Lee et al. 2010). Cultural and religious beliefs have maintained the Amish as distinct from the
132
general population. Due to the availability of extensive genealogical records (Agarwala et al. 2003),
133
virtually all present-day Amish can be linked into a single, 14-generation pedigree. To date, we
134
have screened approximately 6,500 Amish adults for a variety of risk factors related to
135
cardiovascular disease (Mitchell et al. 2008), diabetes (Hsueh et al. 2000), and osteoporosis
136
(Streeten et al. 2006) as part of the Amish Complex Disease Research Program (ACDRP).
6
137
Subjects included in this report are members of the Amish community in Lancaster County,
138
PA who were at least 18 years old and have previously participated in one or more Institutional
139
Review Board (IRB)-approved studies conducted at the University of Maryland School of Medicine
140
Amish Research Clinic (ARC). Written informed consent was obtained from each participant.
141
142
143
Genotyping
Exome-wide genotyping, including four SLC13A1 SNVs, R12X (rs28364172), W48X
144
(rs138275989), N174S (rs2140516) and R237C (rs139376972), was performed on 1,725 Amish
145
subjects using the Illumina Human Exome BeadChip (Illumina Inc., San Diego, California).
146
Genotyping was performed according to the manufacturer’s instructions and genotypes were
147
assigned using Illumina GenomeStudio software (Illumina Inc., San Diego, California).
148
In subsequent follow-up investigations, genotyping of SLC13A1 nonsense SNVs, R12X and
149
W48X was performed on additional Amish subjects using TaqMan® SNP genotyping assays (Life
150
Technologies, Foster City, California). In total, SLC13A1 R12X and W48X genotypes were
151
available in 3,924 and 3,782 unique individuals, respectively. For both SNPs, the TaqMan genotype
152
concordance was greater than 99.8% in a subset of duplicate samples, and for subjects genotyped
153
using the Illumina Human Exome BeadChip and TaqMan® SNP genotyping assays, genotype
154
concordance between platforms was 100% for both SLC13A1 R12X and W48X.
155
For 917 of the 977 subjects for which serum sulfate was measured, genome-wide
156
genotyping was performed using the Affymetrix GeneChip Human Mapping 500K or 1M (version
157
6.0) arrays according to the manufacturer's instructions (Affymetrix Inc., Santa Clara, California).
158
Genotype calls were performed using BRLMM (500K array) or Birdseed version 2 (1 M array).
7
159
SNVs present on both arrays with a minor allele frequency greater than 1% were included in the
160
analyses.
161
162
163
Serum Sulfate Measurements
All known Amish carriers of the rare SLC13A1 SNVs, R12X, W48X and R237C, for whom
164
fasting, serum aliquots were available, were selected for sulfate measurement. Such subjects
165
consisted of participants from the Amish Pharmacogenomics of Anti-Platelet Intervention (PAPI)
166
Study (Shuldiner et al. 2009; Bozzi et al. 2015), the Amish Heredity and Phenotype Intervention
167
(HAPI) Heart Study (Mitchell et al. 2008), the Amish Family Calcification Study (AFCS) (Post et
168
al. 2007), the Amish Family Longevity Study (AFLS) (Mitchell et al. 2001; Sorkin et al. 2005), and
169
the Amish Wellness Study (AWS) (File S1). All other subjects consisted of randomly selected
170
participants of the Amish PAPI (Shuldiner et al. 2009; Bozzi et al. 2015) or HAPI Heart (Mitchell
171
et al. 2008) studies for whom fasting serum aliquots were available.
172
Sulfate measurements were completed on frozen, fasting serum aliquots stored at -80◦C.
173
Sulfate concentration was determined by turbidimetry according to Dodgson & Price (Dodgson and
174
Price 1962) using a Quantichrom™ Sulfate Assay Kit (BioAssay Systems, Hayward, CA). In order
175
to improve accuracy, a quadratic least squares fit was used instead of a linear fit to generate the
176
standard curve (NIST/SEMATECH e-Handbook of Statistical Methods). All standards and samples
177
were measured in duplicate. For each sample, sulfate concentration was calculated as the mean of
178
the duplicate measurements. Samples with an absolute difference greater than 20% were considered
179
discordant duplicate measurements and were not included in the analysis. Of the 1,003 samples in
180
which serum sulfate was measured, 26 subjects’ samples were excluded due to discordant duplicate
181
measurements, resulting in valid sulfate concentration measurements for 977 subjects’ samples.
8
182
183
Serum Alanine Aminotransferase (ALT), Aspartate Aminotransferase (AST), Calcium, and
184
Albumin Measurements
185
Serum ALT, AST, calcium, and albumin measurements were collected as part of the Amish
186
PAPI (Shuldiner et al. 2009; Bozzi et al. 2015) and Amish Family Longevity (Sorkin et al. 2005;
187
Mitchell et al. 2001) studies and completed on fresh, fasting serum aliquots by Quest Diagnostics
188
(Horsham, Pennsylvania) at the time of subject enrollment.
189
190
191
Bone Mineral Density (BMD) Measurements
BMD was measured as part of the Amish PAPI Study (Shuldiner et al. 2009; Bozzi et al.
192
2015) by dual-energy x-ray absorptiometry at the proximal femur, lumbar spine, one-third radius,
193
and whole-body using a Hologic 4500W (Hologic). Daily phantom measurements were obtained to
194
detect measurement shifts over time. Coefficients of variation for the repeat measures of the sites in
195
normal volunteers were between 1.5% and 0.8%.
196
197
198
Statistical analysis
The MAF for each nonsense SNV was obtained from the Total European Ancestry
199
population from 1000 Genomes (1000g-EUR) and the European American population from the
200
National Heart, Lung, and Blood Institute (NHLBI) Exome Sequencing Project (ESP) (ESP-EA).
201
For each SNV, the total allele count was assumed to be 1338 in 1000g-EUR (n=669) and 6728 in
202
ESP-EA (n=3,364). For each nonsense SNV, the MAF in the Amish was compared to the MAF in
203
1000g-EUR and ESP-EA (when available) using a chi-squared test.
9
204
Association analyses between genotypes and serum sulfate, and other phenotypic measures,
205
were conducted using a regression-based method that models variation of the trait of interest as a
206
function of measured covariates, measured genotypes, and a polygenic component that accounts for
207
phenotypic correlation due to relatedness. This method was implemented using the Mixed Models
208
Analysis for Pedigrees and Populations (MMAP) program (O'Connell). For each association
209
analysis performed, individuals with a missing covariate, genotype and/or trait of interest were
210
excluded from the analysis. All analyses of serum sulfate included gender, study, and study-age as
211
covariates. All analyses of clinical phenotypes included age and gender as covariates. All analyses
212
of clinical phenotypes obtained in Amish PAPI Study (Shuldiner et al. 2009; Bozzi et al. 2015)
213
participants containing serum sulfate as a covariate were limited to subjects for whom serum sulfate
214
was measured using a serum aliquot collected at the time of participation in that study.
215
216
217
Results
218
SLC13A1 Variants are Enriched in the Amish
219
In this Amish cohort (n=1,725), 43,459 of the >240,000 markers on the Illumina Human
220
Exome BeadChip were polymorphic, 266 of which were nonsense SNVs. For 45 of the 266
221
nonsense SNVs, no MAF was reported in 1000g-EUR nor ESP-EA. For 131 of the 266 nonsense
222
SNVs, the MAF was greater in this Amish cohort compared to the MAF in 1000g-EUR averaged
223
with the MAF in ESP-EA. Of these 131 SNVs, 67 and 103 were significantly enriched (P<0.05)
224
compared to 1000G-EUR and ESP-EA, respectively (Table S1). With an intent to unveil new
225
biology, we filtered out variants in genes with an associated Online Mendelian Inheritance in Man
226
(OMIM®) (Online Mendelian Inheritance in Man, OMIM®) phenotype, leaving us with 118
10
227
nonsense SNVs. Two nonsense SNVs (c.34G>A, p.R12X and c.144C>T, p.W48X) in SLC13A1
228
(Figure 1) were enriched 1.9-fold (0.43% vs. 0.23%) and 4.4-fold (0.87% vs. 0.20%), respectively,
229
compared to ESP-EA allele frequencies. In addition, two missense SNVs (c.521T>C, p.N174S and
230
c.877G>A, p.R237C) in SLC13A1 were also present on the Illumina Exome BeadChip and enriched
231
1.2-fold (39% vs. 32%) and 10.7-fold (1.7% vs. 0.16%), respectively, compared to ESP-EA allele
232
frequencies (Table S2).
233
Upon further genotyping of SLC13A1 R12X and W48X in additional Amish subjects
234
(nR12X=3,924; nW48X=3,782), the allele frequencies were enriched 1.2-fold (0.29% vs. 0.023) and
235
3.7-fold (0.74% vs. 0.20%), respectively (Table S2). All four SLC13A1 SNVs conformed to Hardy-
236
Weinberg expectations (Table S2); no R12X homozygotes, W48X homozygotes, or R12X/W48X
237
compound heterozygotes were observed. None of the four SNVs were in significant linkage
238
disequilibrium (LD) with one another with the exception of N174S and R237C (r2=0.02 and
239
|D’|=1.00, LOD≥2) (Figure S1).
240
241
242
SLC13A1 Variants are Associated with Serum Sulfate Levels
Given the role of SLC13A1 in sulfate (re)absorption, we measured sulfate levels in 977
243
individuals. Amish research subjects selected for sulfate measurement consisted of 481 males
244
(49.2%) and 496 females (50.8%), with a mean age of 46.0 ± 14.4 years, and a mean BMI of 27.1 ±
245
4.8 kg/m2 (Table 1). Serum sulfate was normally distributed in this population with an unadjusted
246
mean of 0.36 mM (Figure S2). Serum sulfate was associated with increasing age (β=0.0007,
247
P=4.9x10-6), but not with gender (β=-0.0049, P=0.36). The total variance in serum sulfate
248
explained by age, gender, and study was 13% (Table S3a). The residual heritability of serum sulfate
249
after adjustment for age, gender, and study was 0.40 (P=1.8x10-16).
11
250
In our primary model, we assessed associations between each of the four SLC13A1 SNVs
251
and serum sulfate, independently (Table S3a-b). Specifically, we observed strong associations
252
between R12X and W48X, and lower serum sulfate. SLC13A1 R12X was associated with a 26.7%
253
lower serum sulfate (β=-0.10 mM, P=2.9x10-7) (Figure S3) and SLC13A1 W48X was associated
254
with a 26.5% lower serum sulfate (β=-0.10 mM, P=1.3x10-12) (Figure S4). We evaluated the
255
influence of N174S and R237C on serum sulfate levels in 900 of these individuals. In contrast to
256
R12X and W48X, N174S was associated with a 3% higher serum sulfate per allele (β=0.01 mM,
257
P=1.6x10-4) (Figure 2a). No association was observed between R237C and serum sulfate (P=0.15)
258
(Figure 2b). The estimated heritability after adjustment for each of these genotypes, along with the
259
additive and total variance explained by each variant is shown in Table S3.
260
While LD data suggested the SLC13A1 R12X and W48X variants are not correlated with
261
each other, we performed conditional association analyses to evaluate whether these variants
262
represent independent association signals (Table S2a). Indeed, both R12X and W48X were
263
independently associated with a 27.6% (β=-0.10 mM, P=2.7x10-8) and 27.3% (β=-0.10 mM,
264
P=6.9x10-14) decrease in serum sulfate, respectively (Figure 2c). SLC13A1 R12X and W48X
265
together, explained approximately 31% of the additive variance and 9% of the total variance in
266
serum sulfate. The increased significance of these two SNVs, along with the increase in additive
267
and total variance explained, compared to our primary models, is consistent with these two loci
268
being independent genetic determinants of serum sulfate.
269
Finally, we assessed the association between SLC13A1 nonsense carriers and serum sulfate
270
by including carrier status of an SLC13A1 nonsense SNV (R12X or W48X) as a covariate (Table
271
S3a). We observed a 27% decrease in serum sulfate amongst SLC13A1 nonsense SNV carriers
272
compared to subjects who do not carry R12X or W48X (β=-0.10 mM, P=8.8x10-20) (Figure 2c).
12
273
The estimated residual heritability after adjustment for SLC13A1 nonsense SNV carrier status was
274
0.28 with carrier status of an SLC13A1 nonsense SNV explaining 31% of the additive variance and
275
9% of the total variance in serum sulfate.
276
277
278
SLC26A1 L348P is Associated with Serum Sulfate Levels
An ExWAS of serum sulfate, performed using genotype data from the Illumina Human
279
Exome BeadChip (n=900) (Figure S5a), demonstrated genome-wide significance (P<5.0x10-8) for
280
W48X (β=-0.08 mM, P=2.7×10-8), and suggestive-significance for R12X (β=-0.09 mM,
281
P=7.5×10-6) in SLC13A1 on chromosome 7q31.32 (Figure 3 and Table S3b). The ExWAS also
282
revealed a novel cluster of four SNVs spanning approximately 3 megabases on chromosome 4p16.3
283
demonstrating genome-wide significance (Figure 3). These SNVs were found to be in strong LD
284
with one another, i.e. pairwise r2=0.49 to 0.97 and |D’|=0.79 to 1.00, LOD≥2 (Figure S6), with
285
L348P in SLC26A1 (rs148386572; MAF=0.06) being the most significant, associated with a 12%
286
decrease in serum sulfate per allele (β=0.05 mM, P=4.4x10-12). SLC26A1 encodes the basolateral
287
membrane, sulfate-anion transporter 1 (Sat1), consistent with this locus as a true determinant of
288
serum sulfate (Figure 1), and indicative of L348P causing decreased or loss of function in the Sat1
289
protein. SLC26A1 L348P explained 3% of the additive variance and 5% of the total variance in
290
serum sulfate (Table S3b). No other genomic region revealed association signals that met or
291
exceeded genome-wide significance (Table S3).
292
A secondary ExWAS of serum sulfate, adjusting for SLC13A1 R12X and W48X, and
293
SLC26A1 L348P genotypes, was also performed (n=900) (Figure S5b). No genomic region revealed
294
association signals that met or exceeded genome-wide significance (Figure S7 and Table S4).
13
295
A GWAS of serum sulfate, performed using genotype data from the Affymetrix GeneChip
296
(n=917) (Figure S8), demonstrated genome-wide significance for a variant on chromosome 7
297
(rs17684427) and genome-wide suggestive-significance for a variant on chromosome 4 (rs362272)
298
in strong LD with SLC13A1 W48X and SLC26A1 L348P, respectively (Figure S9-S11 and Table
299
S6). A secondary GWAS of serum sulfate, adjusting for rs17684427 and rs362272 genotypes was
300
also performed (n=917) (Figure S12). No genomic region revealed association signals that met or
301
exceeded genome-wide significance (Figure S13).
302
A model including all five SNV genotypes (SLC13A1 R12X, W48X, N174S, and R237C,
303
and SLC26A1 L348P) into the model were used to further estimate residual heritability (Table S3b).
304
In this model, SLC13A1 R12X, W48X, and N174S, and SLC26A1 L348P were all independently
305
associated with decreased serum sulfate while SLC13A1 R237C was not. The estimated residual
306
heritability after adjustment for SLC13A1 R12X, W48X, N174S, and R237C, and SLC26A1 L348P
307
genotype was 0.31 with these variants, together, explaining approximately 29% of the additive
308
variance and 13% of the total variance in serum sulfate.
309
310
311
SLC13A1 Nonsense SNVs are Associated with Increased Aminotransferase Levels
Given sulfate’s role in drug metabolism, we assessed the association between SLC13A1
312
nonsense SNV carriers and levels of alanine aminotransferase (ALT) and aspartate
313
aminotransferase (AST) in 684 participants from the Amish PAPI Study (Shuldiner et al. 2009;
314
Bozzi et al. 2015) (Table 1). ALT levels were 20% higher in SLC13A1 nonsense SNV carriers
315
compared to non-carriers (β=3.7 U/L, P=0.03). Similarly, a trending association was observed
316
between SLC13A1 nonsense carriers and increased AST (β=2.1 U/L, P=0.09) (Figure 4a). No
14
317
association was observed between SLC26A1 L348P, and ALT or AST levels (P=0.92 and 0.37,
318
respectively) (Table 2a).
319
We were able to replicate these associations in an independent Amish cohort consisting of
320
264 participants from the Amish Family Longevity Study (AFLS) (Mitchell et al. 2001; Sorkin et
321
al. 2005) who were, on average, 20.5 years older than the 684 participants from the Amish PAPI
322
Study (Shuldiner et al. 2009; Bozzi et al. 2015) (Table 1). ALT and AST levels were found to be
323
42% and 55% higher, respectively, in SLC13A1 nonsense SNV carriers than in non-carriers (β=7.3
324
U/L, P=2.0x10-3 and β=10.4 U/L, P=1.3x10-7, respectively) (Figure 4b). Additionally, an
325
association between SLC26A1 L348P and a 13% increase in AST per allele was observed in this
326
cohort (β=2.4 U/L, P=6.8x10-3) (Table 2b).
327
328
SLC26A1 Sulfate-lowering SNVs are Associated with Altered Serum Calcium and BMD
329
Given the role of sulfate in cartilage and bone development, we assessed the association
330
between SLC13A1 nonsense SNV carriers and whole-body BMD. No association was observed
331
between SLC13A1 nonsense carriers and whole-body BMD (P=0.19); however, SLC26A1 L348P
332
was significantly associated with decreased whole-body BMD (β=-0.03 g/cm2, P=6.8x10-3) (Figure
333
5 and Table 2c). Upon further investigation, SLC26A1 L348P was also associated with several other
334
BMD measurements while SLC13A1 nonsense SNV carrier status was not (Table 2d and S7).
335
Furthermore, SLC26A1 L348P was significantly associated with increased total serum calcium
336
(β=0.11 mg/dL, P=1.4x10-3), and showed a nearly-significant association with albumin-adjusted
337
serum calcium (β=0.06, P=0.06) (Figure 5 and Table 2c). No association was observed between
338
SLC13A1 nonsense SNV carriers, and serum calcium or albumin-adjusted serum calcium (P=0.98
339
and 0.68, respectively) (Table 2c).
15
340
341
342
Clinical Phenotypes are Not Associated with Serum Sulfate Concentration
Serum sulfate concentration by itself was not associated with any of the aforementioned
343
clinical phenotypes with the exception of trochanter and intertrochanter BMD. Lower serum sulfate
344
was marginally associated with lower trochanter (β=0.13 g/cm2, P=0.03) and lower intertrochanter
345
BMD (β=0.18 g/cm2, P=0.04) (Table S8).
346
347
348
349
Discussion
Using genomic approaches to unveil new human biology, we systematically identified
350
nonsense SNVs that are rare in the general population but enriched in the Amish due to founder
351
effect. Two nonsense SNVs (R12X and W48X), as well as two missense SNVs (N174S and
352
R237C), in SLC13A1 were not only enriched in the Amish but likely functional, thus we
353
hypothesized that these SNVs would be associated with altered serum sulfate due to altered sulfate
354
(re)absorption and perhaps other traits for which sulfate may play a role. We further conducted the
355
first ExWAS and GWAS of serum sulfate in an attempt to identify other genetic contributors of
356
serum sulfate, a trait not measured clinically despite its important physiological role in human
357
development, disease, and drug toxicity. To the best of our knowledge, our group is the first to
358
establish that serum sulfate is heritable (h2=0.40, P=1.8x10-16). In addition, these studies have
359
identified novel associations between R12X, W48X, and N174S in SLC13A1, and L348P in
360
SLC26A1, and serum sulfate levels. Further clinical characterization of the three sulfate-lowering
361
variants implicates sulfate wasting in liver function, and perhaps BMD and calcium homeostasis.
16
362
In regards to SLC13A1 R12X, our findings are consistent with work performed by Bowling
363
et al., which demonstrated an association between R12X and increased urinary fractional excretion
364
index (FEI) of sulfate in a cohort with autism spectrum disorder (ASD) (Bowling et al. 2013).
365
Furthermore, the R12X allele has been shown by Lee et al. to abolish 100% of NaS1 sulfate
366
transport in Xenopus oocytes (Lee et al. 2006). It is likely that this complete loss of NaS1 function
367
occurs through a nonsense-mediated decay (NMD) mechanism. While SLC13A1 W48X has not
368
been described in the literature to date, the similar effect size of R12X and W48X on serum sulfate
369
and their lack of LD with one another suggest similar loss-of-functionality. Together, these two
370
SNVs alone explain approximately 31% of the additive variance and 9% of the total variance of
371
serum sulfate in the Amish. Based on previous experience, these estimates are likely to be lower in
372
outbred populations. Nevertheless, these estimates clearly validate our approach of systematically
373
identifying and studying nonsense SNVs to glean insights into human biology.
374
Unexpectedly, we observed an association between the more common SLC13A1 N174S
375
variant and increased serum sulfate, albeit an effect size of approximately 1/10th the magnitude of
376
SLC13A1 R12X or W48X, consistent with a gain-of-function. In contrast, Bowling et al. found
377
N174S to be associated with increased FEI of sulfate (Bowling et al. 2013) in a cohort with ASD
378
while Lee et al. found the N174S allele to abolish greater than 60% of NaS1 sulfate transport (Lee
379
et al. 2006). With the exception of R12X, other loss-of-function variants, such as W48X, were not
380
accounted for by Bowling et al.
381
Our ExWAS and GWAS findings support the results of our candidate gene study regarding
382
the sulfate-lowering effect of R12X and W48X in SLC13A1, while also identifying L348P in
383
SLC26A1 as an additional genetic contributor of serum sulfate. SLC26A1 encodes Sat1, the
384
basolateral membrane, sulfate-anion transporter, consistent with this locus as a true determinant of
17
385
serum sulfate. While SLC26A1 L348P has not been described in the literature to our knowledge, at
386
half the effect size of SLC13A1 R12X or W48X, each copy of the L348P allele results in an additive
387
decrease in serum sulfate. This association is suggestive of SLC26A1 L348P causing decreased or
388
loss of function in the Sat1 protein; however functional studies are needed to fully understand the
389
extent to which this occurs. It should be noted that all three of the sulfate-lowering SNVs identified
390
in this report are enriched in the Amish, with SLC13A1 W48X and SLC26A1 L348P being
391
markedly enriched. Therefore, our study is likely better powered to detect these variants compared
392
to other ExWAS or GWAS of serum sulfate in outbred populations of comparable size.
393
Lastly, our results indicate that serum sulfate is a heritable trait, and while our results
394
explain a large portion of the genetic component of serum sulfate, there is an equally large portion
395
yet to be explained. It is possible that much of this heritability can be explained by additional
396
variants in SLC13A1, SLC26A1, and/or other sulfate transporters that were missed by the
397
genotyping arrays used in this study. Such variants have the potential to be identified through other
398
genotyping arrays, exome sequencing, and/or copy number analysis, thus studies aiming to identify
399
additional genetic variants in genes associated with serum sulfate in the Amish, as well as outbred
400
populations, are warranted.
401
402
403
SLC13A1 and Aminotransferase Levels
Serum AST and ALT are biomarkers of hepatocyte integrity and thus elevations in these
404
serum aminotransferases suggest liver cell injury (Giannini et al. 2005; Gowda et al. 2009). Given
405
sulfate’s role in drug metabolism and increased sensitivity to acetaminophen-induced hepatotoxicity
406
exhibited by Slc13a1- and Slc26a1-knockout mice (Lee et al. 2006; Dawson et al. 2010; Markovich
407
2012), we hypothesized that individuals with sulfate-lowering variants would have higher liver
18
408
aminotransferase levels. The association between SLC13A1 nonsense SNV carriers and higher
409
aminotransferase levels in participants of the Amish PAPI Study (Shuldiner et al. 2009; Bozzi et al.
410
2015), support our hypothesis, with even greater effects seen in participants of the Amish Family
411
Longevity Study (Mitchell et al. 2001; Sorkin et al. 2005). The increase in aminotransferase levels
412
seen in SLC13A1 nonsense SNV carriers is consistent with increased susceptibility to drugs or
413
exogenous substrates metabolized through sulfation. This is potentially due to decreased inward
414
transport of sulfate into cells, independent of serum sulfate concentration, resulting in decreased
415
ability to detoxify xenobiotics. In contrast to the Amish PAPI Study (Shuldiner et al. 2009; Bozzi et
416
al. 2015), participants of the Amish Family Longevity Study (Mitchell et al. 2001; Sorkin et al.
417
2005) were not asked to discontinue prescription medications and/or supplements 7 days prior to
418
their initial clinic visit (File S1). We attribute the increased effect of sulfate-lowering variants in
419
participants of the Amish Family Longevity Study to routine medication/supplement use, as well as
420
increased participant age which may render hepatocytes more susceptible to damage from
421
xenobiotics. This phenomenon may partially explain the lack of an association between SLC26A1
422
L348P and elevated aminotransferase levels in the participants of the younger Amish PAPI Study
423
(Shuldiner et al. 2009; Bozzi et al. 2015), along with differential tissue expression of SLC13A1 and
424
SLC26A1, and/or L348P causing decreased, but sufficient, Sat1 function, as opposed to a complete
425
loss of function. Moreover, the associations observed between sulfate-lowering SNVs and
426
aminotransferase level in this study may be under-estimated given the relative drug-naïve lifestyle
427
practice by the Amish community. Additional studies are warranted to better understand the
428
importance of sulfate in human physiology and its potential role in disease and drug toxicity.
429
430
SLC26A1, and BMD and Serum Calcium
19
431
Sulfate’s physiological role in bone and cartilage is well known. Sulfated proteoglycans are
432
a critical component of extracellular matrices in cells throughout the body, specifically in
433
connective tissues and are required for maintaining normal bone and cartilage structure
434
(Yanagishita 1993; Fernandes et al. 1997). The essentiality of sulfate for proper bone and cartilage
435
formation is demonstrated by the spectrum of osteochondrodysplasias in humans with homozygous,
436
loss-of-function mutations in SLC26A2 (Hastbacka et al. 1996; Hastbacka et al. 1999; Online
437
Mendelian Inheritance in Man, OMIM®), as well as dog and sheep with homozygous loss-of-
438
function mutations in Slc13a1 (Neff et al. 2012; Zhao et al. 2012). We thus hypothesized that
439
individuals with sulfate-lowering SNVs would have altered BMD. We observed several significant
440
associations between SLC26A1 L348P and decreased BMD measurements which were not
441
associated with SLC13A1 nonsense carrier status, nor serum sulfate concentration. However,
442
compared to SLC26A1 L348P (allele count ~70, 6 homozygotes), there is far less power to detect
443
such associations with SLC13A1 nonsense SNVs due to the lower allele frequency (allele count=20)
444
and the lack of homozygotes. The lack of an association with SLC13A1 nonsense variants may also
445
be explained by differential tissue expression of SLC13A1 and SLC26A1.
446
The association between increased serum calcium and SLC26A1 L348P may be specific to
447
decreased Sat1 (SLC26A1) function and is seemingly independent of serum sulfate. These findings
448
are consistent with mouse models in which hyposulfatemic, Slc26a1-knockout mice exhibit
449
nephrocalcinosis and increased calcium oxalate kidney stone formation while hyposulfatemic,
450
Slc13a1-knockout mice do not exhibit these manifestations (Dawson et al. 2010; Markovich 2011b,
451
2011a, 2012). Additionally, SLC26A1 has been has been suggested to play a role in humans with
452
recurrent calcium oxalate kidney stones (Dawson et al. 2013). While the mechanism for
453
nephrocalcinosis, calcium oxalate urolithiasis, and increased serum calcium is not totally clear,
20
454
these phenotypes associate with decreased Sat1 function, suggesting they may be specific to anion
455
transport rather than sulfate transport. We hypothesize that the SLC26A1 L348P allele decreases
456
Sat1 function, decreasing sulfate transport into the blood as well as decreasing anion transport into
457
cells. As a result, serum oxalate and thus urine oxalate concentrations may increase resulting in the
458
formation of calcium oxalate crystals which either accumulate to form stones and/or increase serum
459
calcium levels. We do not have an adequate number of subjects with renal stones to address this
460
hypothesis further.
461
462
463
464
Conclusions
Sulfate is an essential micronutrient, as impaired sulfation capacity has been reported in a
465
number of human conditions including reduced xenobiotic clearance, skeletal dysplasias, premature
466
pubarche, ASD, and neurological disease (Alberti et al. 1999; Bowling et al. 2013; Heafield et al.
467
1990; Ahmad et al. 1998; Noordam et al. 2009; Dawson 2013; Dawson and Markovich 2005).
468
Despite sulfate’s known role in the biotransformation of hormones, neurotransmitters,
469
glycosaminoglycans (GAGs), cerebrosides, and xenobiotics (Dawson 2011; Markovich 2014,
470
2011a; Council 2005), sulfate is not routinely measured clinically (Dawson 2011, 2013). In this
471
study, we measured and comprehensively characterized serum sulfate in 977 Amish subjects. We
472
are the first to estimate the heritability of serum sulfate and the first to perform an ExWAS and a
473
GWAS of serum sulfate. We identified novel associations between serum sulfate and three SNVs in
474
the gene, SLC13A1, encoding the apical membrane, sodium-sulfate cotransporter, NaS1. We further
475
identified a novel association between L348P in the gene, SLC26A1, encoding the basolateral
476
membrane, sulfate-anion transporter, Sat1, and decreased serum sulfate. We also investigated
21
477
associations between sulfate-lowering SNVs, aminotransferase levels, BMD, and serum calcium,
478
results of which implicate sulfate and these SNVs in important human metabolism, physiology, and
479
disease processes.
480
In conclusion, this study demonstrates the power and translational potential of leveraging
481
genetic data in founder populations like the Old Order Amish to unveil and better understand the
482
phenotypic consequences of rare, deleterious variants enriched by genetic drift. Further, our clinical
483
findings warrant additional studies to better understand the importance of sulfate in human
484
physiology, disease, and drug toxicity.
485
486
487
488
489
490
491
Declarations
492
List of abbreviations
493
ACDRP: Amish Complex Disease Research Program
494
AFCS: Amish Family Calcification Study
495
AFLS: Amish Family Longevity Study
496
ALT: alanine aminotransferase
497
Amish: Old Order Amish
498
ARC: Amish Research Clinic
22
499
ASD: autism spectrum disorder
500
AST: aspartate aminotransferase
501
AWS: Amish Wellness Study
502
BMD: bone mineral density
503
ESP: Exome Sequencing Project
504
ExWAS: exome-wide association study
505
FEI: fractional excretion index
506
GAG(s): glycosaminoglycan(s)
507
GWAS: genome-wide association study
508
HAPI: Heredity and Phenotype Intervention
509
IRB: Institutional Review Board
510
LD: linkage disequilibrium
511
MAF: minor allele frequency
512
MMAP: Mixed Models Analysis for Pedigrees and Populations
513
NaS1: Sodium-sulfate cotransporter 1 protein
514
NHLBI: National Heart, Lung, and Blood Institute
515
NMD: nonsense-mediated decay
516
OMIM: Online Mendelian Inheritance in Man
517
PAPI: Pharmacogenomics of Anti-Platelet Intervention
518
Sat1: Sulfate-anion transporter 1 protein
519
SNV(s): single nucleotide variant(s)
520
521
Ethics approval and consent to participate
23
522
Subjects included in this report are members of the Old Order Amish community in
523
Lancaster County, PA who were at least 18 years old and have previously participated in one or
524
more Institutional Review Board (IRB)-approved studies conducted at the University of Maryland
525
School of Medicine Amish Research Clinic (ARC). Written informed consent was obtained from
526
each participant.
527
528
Availability of data and material
529
In accordance to our human subjects research protocols, we are unable to release the
530
individual-level data set. We continue to deposit aggregate-level data into the National Institutes of
531
Health (NIH) database of Genotypes and Phenotypes (dbGaP); however, given the Old Order
532
Amish are a founder population with publically available pedigree data, depositing individual-level
533
data on our subjects creates the potential for personal identification of our research subjects. In the
534
past, the NIH has provided exemptions for this reason. Furthermore, all of our analyses were
535
performed using the Mixed Models Analysis for Pedigrees and Populations (MMAP) program
536
which models variation of the trait of interest as a function of measured covariates, measured
537
genotypes, and a polygenic component that accounts for phenotypic correlation due to relatedness.
538
For this reason, outside investigators attempting to replicate our analyses will get different results if
539
the family structure of the Old Order Amish is not accounted for in their analyses. We would be
540
glad to assist any investigators who have questions or would like to perform related analyses
541
regarding this data. Authors will certainly entertain requests for individual-level data and pedigree
542
information with qualified investigators and collaborators after necessary modifications have been
543
made to our study protocols.
544
24
545
Competing interests
546
Christina G. Tise: No conflict of interest to disclose.
547
James A. Perry: No conflict of interest to disclose.
548
Leslie E. Anforth: No conflict of interest to disclose.
549
Mary A. Pavlovich: No conflict of interest to disclose.
550
Joshua D. Backman: No conflict of interest to disclose.
551
Kathleen A. Ryan: No conflict of interest to disclose.
552
Joshua P. Lewis: No conflict of interest to disclose.
553
Jeffrey R. O’Connell: No conflict of interest to disclose.
554
Laura M. Yerges-Armstrong: Dr. Yerges-Armstrong is employed by GlaxoSmithKline but her
555
contribution to this work was completed as an assistant professor at the University of Maryland
556
School of Medicine.
557
Alan R. Shuldiner: In addition to his part-time appointment at the University of Maryland School of
558
Medicine, Dr. Shuldiner is Vice President and Co-Head of the Regeneron Genetics Center, LLC, a
559
fully owned subsidiary of Regeneron Pharmaceuticals, Inc. The Regeneron Genetics Center focuses
560
on early discovery research, applying human genomics to identify novel drug targets.
561
562
Funding
563
National Institute on Aging
564
Award Number: R01 AG018728
565
Recipient: Alan R. Shuldiner
566
567
National Institute of Arthritis and Musculoskeletal and Skin Diseases
Award Number: R01 AR046838
25
568
569
Recipient: Alan R. Shuldiner
National Heart, Lung, and Blood Institute
570
Award Number: U01 HL072515
571
Recipient: Alan R. Shuldiner
572
National Institute of General Medical Sciences
573
Award Number: U01 GM074518
574
Recipient: Alan R. Shuldiner
575
National Heart, Lung, and Blood Institute
576
Award Number: U01 HL084756
577
Recipient: Jeffrey R. O’Connell
578
National Institute of Diabetes and Digestive and Kidney Diseases
579
Award Number: P30 DK072488
580
Recipient: Simeon I. Taylor (not applicable)
581
National Heart, Lung, and Blood Institute
582
Award Number: K01 HL116770
583
Recipient: Laura M. Yerges-Armstrong
584
585
Authors’ contributions
586
CGT designed the genetic filtering approach and subsequent experiments, performed sulfate
587
measurements, genotype assays, and genetic analyses, and wrote the manuscript. JAP performed
588
genetic analyses and produced figures. LEA performed sulfate measurements and genotype assays.
589
MAP retrieved clinical phenotype data and performed genetic analyses. JDB performed genetic
590
analyses. KAR retrieved clinical phenotype data and performed genetic analyses. JPL aided in
26
591
designing the genetic filtering approach and wrote the manuscript. JRO provided genotype data,
592
aided in genetic analyses, and oversaw the project. LMY provided genotype data, performed
593
genetic analyses, oversaw the project, and wrote the manuscript. ARS aided in designing the genetic
594
filtering approach, provided genotype data and clinical phenotype data, oversaw the project, and
595
wrote the manuscript. All authors read and approved the final manuscript.
596
597
598
Acknowledgements
We gratefully acknowledge and thank the Amish Research Clinic staff, as well as the Amish
599
community for their extraordinary cooperation and support, without which these studies would not
600
have been possible.
601
602
603
604
605
Tables
606
Table 1. Characteristics of Old Order Amish research participants by cohort.
607
Characteristic
Sulfate
AFLS
PAPI Cohort
(Units)
Cohort
Cohort
Number (n)
977
684
264
Male (%)
49.2
49.8
47.3
Age ± SD (years) 46.0 ± 14.4 45.0 ± 13.4 65.5 ± 10.4
BMI ± SD (kg/m2) 27.1 ± 4.8 27.1 ± 4.7 27.9 ± 5.2
Abbreviations: SD, standard deviation; BMI, body mass index; PAPI, Pharmacogenomics of Anti-
608
Platelet Intervention; ALFS, Amish Family Longevity Study.
609
610
27
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
Table 2. Associations between sulfate-lowering SNVs and clinical phenotypes.
626
a.
Trait
(PAPI)
ALT
AST
627
628
Sulfate-lowering SNV(s)
n
Het.
Hom.
SLC13A1 R12X or W48X
SLC26A1 L348P
SLC13A1 R12X or W48X
SLC26A1 L348P
684
650
684
650
20
71
20
71
0
6
0
6
SNV covariate(s)
n
Het.
Hom.
SLC13A1 R12X or W48X
SLC26A1 L348P
SLC13A1 R12X or W48X
SLC26A1 L348P
264
232
264
232
7
21
7
21
0
2
0
2
βSNV ± SE
βSNV/
(U/L)
MeanWT
3.73 ± 1.74
0.20
-0.08 ± 0.84
0.00
2.13 ± 1.26
0.11
0.53 ± 0.59
0.03
PSNV
0.03
0.92
0.09
0.37
b.
Trait
(AFLS)
ALT
AST
629
28
βSNV ± SE
βSNV/
(U/L)
MeanWT
7.25 ± 2.33
0.42
0.81 ± 1.20
0.04
10.42 ± 1.92
0.55
2.42 ± 0.89
0.13
PSNV
2.0x10-3
0.50
1.3x10-7
6.8x10-3
630
c.
n
Het.
Hom.
βSNV ± SE
βSNV/
MeanWT
PSNV
679
20
0
-0.03 ± 0.02
-0.03
0.19
645
69
6
-0.03 ± 0.01
-0.03
6.8x10-3
684
20
0
0.00 ± 0.07
0.00
0.98
650
71
6
0.11 ± 0.03
0.01
1.4x10-3
Corrected SLC13A1 R12X or W48X 684
Calcium
SLC26A1 L348P
650
(mg/dL)
20
0
-0.03 ± 0.06
0.00
0.68
71
0
0.06 ± 0.03
0.01
0.06
Trait
Sulfate-lowering SNV(s)
Whole- SLC13A1 R12X or W48X
body BMD
SLC26A1 L348P
(g/cm2)
Calcium SLC13A1 R12X or W48X
(mg/dL)
SLC26A1 L348P
631
632
633
634
635
636
637
638
639
d.
BMD
Trait
Mid-arm
1/3-arm
Total-arm
Ultradistal-arm
Femoral
neck
Intertrochanter
Total-hip
Trochanter
Sulfate-lowering SNV(s)
n
Het.
Hom.
SLC13A1 R12X or W48X
SLC26A1 L348P
SLC13A1 R12X or W48X
SLC26A1 L348P
SLC13A1 R12X or W48X
SLC26A1 L348P
SLC13A1 R12X or W48X
SLC26A1 L348P
SLC13A1 R12X or W48X
SLC26A1 L348P
SLC13A1 R12X or W48X
SLC26A1 L348P
SLC13A1 R12X or W48X
SLC26A1 L348P
SLC13A1 R12X or W48X
680
646
680
646
680
646
680
646
681
647
680
646
681
647
680
20
70
20
70
20
70
20
70
20
70
20
70
20
70
20
0
6
0
6
0
6
0
6
0
6
0
6
0
6
0
29
βSNV ± SE
βSNV/
PSNV
2
(g/cm )
MeanWT
-0.02 ± 0.01
-0.04
0.08
-0.01 ± 0.01
-0.02
0.03
-0.03 ± 0.02
-0.04
0.08
-0.02 ± 0.01
-0.02
0.03
-0.02 ± 0.01
-0.04
0.06
-0.02 ± 0.01
-0.02
0.01
-0.02 ± 0.01
-0.03
0.26
-0.01 ± 0.01
-0.03
0.03
-0.02 ± 0.03
-0.02
0.46
-0.02 ± 0.01
-0.02
0.26
-0.04 ± 0.04
-0.03
0.35
-0.05 ± 0.02
-0.04 5.1x10-3
-0.03 ± 0.03
-0.03
0.28
-0.04 ± 0.01
-0.04
0.01
-0.03 ± 0.03
-0.04
0.23
640
SLC26A1 L348P
646 70
6
-0.03 ± 0.01
-0.03
0.04
SLC13A1 R12X or W48X 679 20
0
-0.03 ± 0.03
-0.03
0.39
Total-spine
SLC26A1 L348P
645 70
6
-0.03 ± 0.02
-0.03
0.10
Adjusted for age and gender. The P-value of significant associations are highlighted in bold text.
641
Abbreviations: SE, standard error. a) Serum alanine aminotransferase (ALT) and aspartate
642
aminotransferase (AST) levels from the Amish Pharmacogenomics of Anti-Platelet Intervention
643
(PAPI) Study. b) Serum ALT and AST levels from the Amish Family Longevity Study (AFLS). c)
644
Whole-body bone mineral density (BMD) and serum calcium from the Amish PAPI Study. d)
645
Additional BMD measurements from the Amish PAPI Study. Abbreviations: Het., heterozygotes;
646
Hom., homozygotes; WT, wild type.
647
648
649
650
651
652
653
References
654
Agarwala, R., L.G. Biesecker, and A.A. Schaffer, 2003 Anabaptist genealogy database. Am J Med
655
656
Genet C Semin Med Genet 121c (1):32-37.
Ahmad, M., M. Faiyaz Ul Haque, W. Ahmad, H. Abbas, S. Haque et al., 1998 Distinct, autosomal
657
recessive form of spondyloepimetaphyseal dysplasia segregating in an inbred Pakistani
658
kindred. Am J Med Genet 78 (5):468-473.
30
659
660
661
662
663
Alberti, A., P. Pirrone, M. Elia, R.H. Waring, and C. Romano, 1999 Sulphation deficit in "lowfunctioning" autistic children: a pilot study. Biol Psychiatry 46 (3):420-424.
Bowling, F.G., H.S. Heussler, A. McWhinney, and P.A. Dawson, 2013 Plasma and urinary sulfate
determination in a cohort with autism. Biochem Genet 51 (1-2):147-153.
Bozzi, L.M., B.D. Mitchell, J.P. Lewis, K.A. Ryan, W.R. Herzog et al., 2015 The
664
Pharmacogenomics of Anti-Platelet Intervention (PAPI) Study: Variation in Platelet
665
Response to Clopidogrel and Aspirin. Curr Vasc Pharmacol.
666
Council, N.R., 2005 Sulfate, pp. 424--448 in National Research Council. Dietary Reference Intakes
667
for Water, Potassium, Sodium, Chloride, and Sulfate. The National Academies Press,
668
Washington, DC.
669
Dawson, P.A., 2011 Sulfate in fetal development. Semin Cell Dev Biol 22 (6):653-659.
670
Dawson, P.A., 2013 Role of sulphate in development. Reproduction 146 (3):R81-89.
671
Dawson, P.A., and D. Markovich, 2005 Pathogenetics of the human SLC26 transporters. Curr Med
672
Chem 12 (4):385-396.
673
Dawson, P.A., C.S. Russell, S. Lee, S.C. McLeay, J.M. van Dongen et al., 2010 Urolithiasis and
674
hepatotoxicity are linked to the anion transporter Sat1 in mice. J Clin Invest 120 (3):706-
675
712.
31
676
677
678
679
680
681
682
683
684
685
686
687
688
Dawson, P.A., P. Sim, D.W. Mudge, and D. Cowley, 2013 Human SLC26A1 gene variants: a pilot
study. ScientificWorldJournal 2013:541710.
Dodgson, K.S., and R.G. Price, 1962 A note on the determination of the ester sulphate content of
sulphated polysaccharides. Biochem J 84:106-110.
Fernandes, I., G. Hampson, X. Cahours, P. Morin, C. Coureau et al., 1997 Abnormal sulfate
metabolism in vitamin D-deficient rats. J Clin Invest 100 (9):2196-2203.
Giannini, E.G., R. Testa, and V. Savarino, 2005 Liver enzyme alteration: a guide for clinicians.
Cmaj 172 (3):367-379.
Gowda, S., P.B. Desai, V.V. Hull, A.A. Math, S.N. Vernekar et al., 2009 A review on laboratory
liver function tests. Pan Afr Med J 3:17.
Hastbacka, J., A. Kerrebrock, K. Mokkala, G. Clines, M. Lovett et al., 1999 Identification of the
Finnish founder mutation for diastrophic dysplasia (DTD). Eur J Hum Genet 7 (6):664-670.
Hastbacka, J., A. Superti-Furga, W.R. Wilcox, D.L. Rimoin, D.H. Cohn et al., 1996
689
Atelosteogenesis type II is caused by mutations in the diastrophic dysplasia sulfate-
690
transporter gene (DTDST): evidence for a phenotypic series involving three
691
chondrodysplasias. Am J Hum Genet 58 (2):255-262.
32
692
Heafield, M.T., S. Fearn, G.B. Steventon, R.H. Waring, A.C. Williams et al., 1990 Plasma cysteine
693
and sulphate levels in patients with motor neurone, Parkinson's and Alzheimer's disease.
694
Neurosci Lett 110 (1-2):216-220.
695
Hsueh, W.C., B.D. Mitchell, R. Aburomia, T. Pollin, H. Sakul et al., 2000 Diabetes in the Old
696
Order Amish: characterization and heritability analysis of the Amish Family Diabetes Study.
697
Diabetes Care 23 (5):595-601.
698
Klaassen, C.D., and J.W. Boles, 1997 Sulfation and sulfotransferases 5: the importance of 3'-
699
phosphoadenosine 5'-phosphosulfate (PAPS) in the regulation of sulfation. Faseb j 11
700
(6):404-418.
701
Lee, S., P.A. Dawson, A.K. Hewavitharana, P.N. Shaw, and D. Markovich, 2006 Disruption of
702
NaS1 sulfate transport function in mice leads to enhanced acetaminophen-induced
703
hepatotoxicity. Hepatology 43 (6):1241-1247.
704
Lee, W.J., T.I. Pollin, J.R. O'Connell, R. Agarwala, and A.A. Schaffer, 2010 PedHunter 2.0 and its
705
usage to characterize the founder structure of the Old Order Amish of Lancaster County.
706
BMC Med Genet 11:68.
707
708
Markovich, D., 2011a Physiological roles of mammalian sulfate transporters NaS1 and Sat1. Arch
Immunol Ther Exp (Warsz) 59 (2):113-116.
33
709
710
711
712
Markovich, D., 2011b Physiological roles of renal anion transporters NaS1 and Sat1. Am J Physiol
Renal Physiol 300 (6):F1267-1270.
Markovich, D., 2012 Slc13a1 and Slc26a1 KO models reveal physiological roles of anion
transporters. Physiology (Bethesda) 27 (1):7-14.
713
Markovich, D., 2014 Na+-sulfate cotransporter SLC13A1. Pflugers Arch 466 (1):131-137.
714
Mitchell, B.D., W.C. Hsueh, T.M. King, T.I. Pollin, J. Sorkin et al., 2001 Heritability of life span in
715
716
the Old Order Amish. Am J Med Genet 102 (4):346-352.
Mitchell, B.D., P.F. McArdle, H. Shen, E. Rampersaud, T.I. Pollin et al., 2008 The genetic response
717
to short-term interventions affecting cardiovascular function: rationale and design of the
718
Heredity and Phenotype Intervention (HAPI) Heart Study. Am Heart J 155 (5):823-828.
719
Neff, M.W., J.S. Beck, J.M. Koeman, E. Boguslawski, L. Kefene et al., 2012 Partial deletion of the
720
sulfate transporter SLC13A1 is associated with an osteochondrodysplasia in the Miniature
721
Poodle breed. PLoS One 7 (12):e51917.
722
NIST/SEMATECH e-Handbook of Statistical Methods.
723
Noordam, C., V. Dhir, J.C. McNelis, F. Schlereth, N.A. Hanley et al., 2009 Inactivating PAPSS2
724
mutations in a patient with premature pubarche. N Engl J Med 360 (22):2310-2318.
725
O'Connell, J.R., MMAP Documentation.
34
726
Online Mendelian Inheritance in Man, OMIM®, pp. An Online Catalog of Human Genes and
727
Genetic Disorders, edited by J.H.U. McKusick-Nathans Institute of Genetic Medicine,
728
Baltimore, MD.
729
730
731
Online Mendelian Inheritance in Man, OMIM®, pp. MIM Number: 606718, edited by B. Johns
Hopkins University, MD.
Post, W., L.F. Bielak, K.A. Ryan, Y.C. Cheng, H. Shen et al., 2007 Determinants of coronary artery
732
and aortic calcification in the Old Order Amish. Circulation 115 (6):717-724.
733
Shuldiner, A.R., J.R. O'Connell, K.P. Bliden, A. Gandhi, K. Ryan et al., 2009 Association of
734
cytochrome P450 2C19 genotype with the antiplatelet effect and clinical efficacy of
735
clopidogrel therapy. Jama 302 (8):849-857.
736
737
738
Sorkin, J., W. Post, T.I. Pollin, J.R. O'Connell, B.D. Mitchell et al., 2005 Exploring the genetics of
longevity in the Old Order Amish. Mech Ageing Dev 126 (2):347-350.
Streeten, E.A., D.J. McBride, T.I. Pollin, K. Ryan, J. Shapiro et al., 2006 Quantitative trait loci for
739
BMD identified by autosome-wide linkage scan to chromosomes 7q and 21q in men from
740
the Amish Family Osteoporosis Study. J Bone Miner Res 21 (9):1433-1442.
741
742
Yanagishita, M., 1993 Function of proteoglycans in the extracellular matrix. Acta Pathol Jpn 43
(6):283-293.
35
743
Zhao, X., S.K. Onteru, S. Piripi, K.G. Thompson, H.T. Blair et al., 2012 In a shake of a lamb's tail:
744
using genomics to unravel a cause of chondrodysplasia in Texel sheep. Anim Genet 43
745
Suppl 1:9-18.
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
36
764
765
Figure Legends
766
Figure 1. Sulfate transporters in the renal and small intestinal epithelial cell. Modified from
767
Markovich (Markovich 2012). Permission to reuse figure obtained from copyright holder.
768
769
Figure 2. Serum sulfate concentration by SLC13A1 genotype. Box and whisker plots: box
770
represents 2nd and 3rd quartiles (IQR, interquartile range); horizontal band represents median value;
771
whiskers represent 1st and 4th quartiles; ends of whiskers represent minimum and maximum values,
772
excluding outliers; open circles represents outliers (data point more than 1.5*IQR below the 1st
773
quartile or above the 3rd quartile). Association analyses between genotypes and serum sulfate were
774
conducted using a regression-based method that models variation of the trait of interest as a
775
function of measured covariates, measured genotypes, and a polygenic component that accounts for
776
phenotypic correlation due to relatedness. a) Serum sulfate by SLC13A1 N174S genotype. b) Serum
777
sulfate by SLC13A1 R237C genotype. c) Serum sulfate by SLC13A1 nonsense SNV genotype. The
778
P-value for the R12X or W48X Heterozygotes (dark green box and whisker plot) results from the
779
model including carrier status of an SLC13A1 nonsense SNV (R12X or W48X) as a covariate. The
780
P-values for the R12X Heterozygotes and the W48X Heterozygotes (light green box and whisker
781
plots) result from the conditional model including both R12X and W48X as covariates.
782
783
Figure 3. ExWAS of serum sulfate enables discovery of SLC26A1 L348P. a) Manhattan plot
784
depicting association results (plotted as –log10 P-value on y-axis) of individual single-nucleotide
785
polymorphisms distributed across the 22 autosomes (x-axis) from the serum sulfate ExWAS
786
performed using the Illumina Human Exome BeadChip platform (n=900). Red horizontal dotted
37
787
line indicates P=5.0×10−8. Blue horizontal dotted line indicates P=5.0x10-6. b) Serum sulfate by
788
SLC26A1 L348P genotype. Box and whisker plots: box represents 2nd and 3rd quartiles (IQR,
789
interquartile range); horizontal band represents median value; whiskers represent 1st and 4th
790
quartiles; ends of whiskers represent minimum and maximum values, excluding outliers; open
791
circles represents outliers (data point more than 1.5*IQR below the 1st quartile or above the 3rd
792
quartile). Association analyses between genotypes and serum sulfate were conducted using a
793
regression-based method that models variation of the trait of interest as a function of measured
794
covariates, measured genotypes, and a polygenic component that accounts for phenotypic
795
correlation due to relatedness.
796
797
Figure 4. Serum alanine aminotransferase (ALT) and aspartate aminotransferase (AST) by
798
SLC13A1 nonsense SNV carrier status. Box and whisker plots: box represents 2nd and 3rd
799
quartiles (IQR, interquartile range); horizontal band represents median value; whiskers represent 1st
800
and 4th quartiles; ends of whiskers represent minimum and maximum values, excluding outliers;
801
open circles represents outliers (data point more than 1.5*IQR below the 1st quartile or above the
802
3rd quartile). Association analyses between genotypes and serum sulfate were conducted using a
803
regression-based method that models variation of the trait of interest as a function of measured
804
covariates, measured genotypes, and a polygenic component that accounts for phenotypic
805
correlation due to relatedness. a) Amish PAPI Study subjects. b) Amish Family Longevity Study
806
(AFLS) subjects.
807
808
Figure 5. Whole-body bone mineral density (BMD) and corrected serum calcium by SLC26A1
809
L348P genotype. Box and whisker plots: box represents 2nd and 3rd quartiles (IQR, interquartile
38
810
range); horizontal band represents median value; whiskers represent 1st and 4th quartiles; ends of
811
whiskers represent minimum and maximum values, excluding outliers; open circles represents
812
outliers (data point more than 1.5*IQR below the 1st quartile or above the 3rd quartile). Association
813
analyses between genotypes and serum sulfate were conducted using a regression-based method
814
that models variation of the trait of interest as a function of measured covariates, measured
815
genotypes, and a polygenic component that accounts for phenotypic correlation due to relatedness.
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
39
831
832
Supplemental Figure Legends
833
Figure S1. Linkage disequilibrium (LD) plots depicting r2 and D’ amongst SLC13A1 SNVs
834
within the entire Amish cohort (n=1,725). The LD plots were created using Haploview. The
835
numbers in the squares on the LD plots are the r2 and D’ values (only the decimals are shown) - r2
836
and D’ values of 1 are not listed. The color code on the r2 plot follow the r2 color scheme for
837
Haploview: white=(r2=0), shades of grey=(0<r2<1), black=(r2=1). The color code on the D’ plot
838
follow the standard color scheme for Haploview: white=(|D′|<1, LOD<2), shades of
839
pink/red=(|D′|<1, LOD≥2), blue=(|D′|=1, LOD<2), bright red=(|D′|=1, LOD≥ 2). Each SNV is
840
labeled with gene name, amino acid change and position, and the reference sequence id (rs-
841
number). The distance between each SNV is indicated on the white bar above.
842
843
Figure S2. Descriptive statistics and distribution of serum sulfate in 977 Old Order Amish
844
research participants.
845
846
Figure S3. Serum sulfate by SLC13A1 R12X genotype. Box and whisker plots: box represents 2nd
847
and 3rd quartiles (IQR, interquartile range); horizontal band represents median value; whiskers
848
represent 1st and 4th quartiles; ends of whiskers represent minimum and maximum values, excluding
849
outliers; open circles represents outliers (data point more than 1.5*IQR below the 1st quartile or
850
above the 3rd quartile). Association analyses between genotypes and serum sulfate were conducted
851
using a regression-based method that models variation of the trait of interest as a function of
852
measured covariates, measured genotypes, and a polygenic component that accounts for phenotypic
853
correlation due to relatedness.
40
854
855
Figure S4. Serum sulfate by SLC13A1 W48X genotype. Box and whisker plots: box represents
856
2nd and 3rd quartiles (IQR, interquartile range); horizontal band represents median value; whiskers
857
represent 1st and 4th quartiles; ends of whiskers represent minimum and maximum values, excluding
858
outliers; open circles represents outliers (data point more than 1.5*IQR below the 1st quartile or
859
above the 3rd quartile). Association analyses between genotypes and serum sulfate were conducted
860
using a regression-based method that models variation of the trait of interest as a function of
861
measured covariates, measured genotypes, and a polygenic component that accounts for phenotypic
862
correlation due to relatedness.
863
864
Figure S5. QQ plot for the serum sulfate ExWAS performed using the Illumina Human
865
Exome BeadChip platform (n=900). a) Unadjusted ExWAS. b) ExWAS adjusted for SLC13A1
866
R12X, SLC13A1 W48X, and SLC26A1 L348P. Note about horizontal line seen at Y=3.4: There are
867
29 SNVs on chromosome 11 in perfect LD with each other and thus all have the same P value.
868
869
Figure S6. Linkage disequilibrium plots depicting r2 and D’ amongst the top serum sulfate
870
ExWAS hits on chromosome 4 from the Illumina Human Exome BeadChip platform (n=900).
871
The LD plots were created using Haploview. The numbers in the squares on the LD plots are the r2
872
and D’ values (only the decimals are shown) - r2 and D’ values of 1 are not listed. The color code
873
on the r2 plot follow the r2 color scheme for Haploview: white=(r2=0), shades of grey=(0<r2<1),
874
black=(r2=1). The color code on the D’ plot follow the standard color scheme for Haploview:
875
white=(|D′|<1, LOD<2), shades of pink/red=(|D′|<1, LOD≥2), blue=(|D′|=1, LOD<2), bright
876
red=(|D′|=1, LOD≥ 2). Each SNV is labeled with gene name, amino acid change and position, and
41
877
the reference sequence id (rs-number). The distance between each SNV is indicated on the white
878
bar above.
879
880
Figure S7. Serum sulfate ExWAS results adjusted for SLC13A1 R12X, SLC13A1 W48X, and
881
SLC26A1 L348P (n=900).. Manhattan plot depicting association results (plotted as –log10 P-value
882
on y-axis) of individual single-nucleotide polymorphisms distributed across the 22 autosomes (x-
883
axis) from the serum sulfate ExWAS performed using the Illumina Human Exome BeadChip
884
platform. Red horizontal dotted line indicates P=5.0×10−8. Blue horizontal dotted line indicates
885
P=5.0x10-6.
886
887
Figure S8. QQ plot for serum sulfate GWAS using Affymetrix GeneChip platform (n=917).
888
889
Figure S9. Serum sulfate GWAS results (n=917). Manhattan plot depicting association results
890
(plotted as –log10 P-value on y-axis) of individual single-nucleotide polymorphisms distributed
891
across the 22 autosomes (x-axis) from the serum sulfate GWAS performed using the Affymetrix
892
GeneChip platform. Red horizontal dotted line indicates P=5.0×10−8. Blue horizontal dotted line
893
indicates P=5.0x10-6. Affymetrix GeneChip does not contain SLC13A1 R12X, SLC13A1 W48X, or
894
SLC26A1 L348P.
895
896
Figure S10. Linkage disequilibrium plots depicting r2 and D’ amongst SLC26A1 L348P (from
897
the Illumina Human Exome BeadChip platform) and the top serum sulfate GWAS hits on
898
chromosome 7 from the Affymetrix GeneChip platform (n=879, subject overlap between
899
Illumina Human Exome BeadChip and Affymetrix GeneChip platforms). The LD plots were
42
900
created using Haploview. The numbers in the squares on the LD plots are the r2 and D’ values (only
901
the decimals are shown) - r2 and D’ values of 1 are not listed. The color code on the r2 plot follow
902
the r2 color scheme for Haploview: white=(r2=0), shades of grey=(0<r2<1), black=(r2=1). The color
903
code on the D’ plot follow the standard color scheme for Haploview: white=(|D′|<1, LOD<2),
904
shades of pink/red=(|D′|<1, LOD≥2), blue=(|D′|=1, LOD<2), bright red=(|D′|=1, LOD≥ 2). Each
905
SNV is labeled with gene name, amino acid change and position, and the reference sequence id (rs-
906
number). The distance between each SNV is indicated on the white bar above.
907
908
Figure S11. Linkage disequilibrium plots depicting r2 and D’ amongst SLC13A1 W48X (from
909
the Illumina Human Exome BeadChip platform) and the top serum sulfate GWAS hits on
910
chromosome 4 from the Affymetrix GeneChip platform (n=879, subject overlap between
911
Illumina Human Exome BeadChip and Affymetrix GeneChip platforms). The LD plots were
912
created using Haploview. The numbers in the squares on the LD plots are the r2 and D’ values (only
913
the decimals are shown) - r2 and D’ values of 1 are not listed. The color code on the r2 plot follow
914
the r2 color scheme for Haploview: white=(r2=0), shades of grey=(0<r2<1), black=(r2=1). The color
915
code on the D’ plot follow the standard color scheme for Haploview: white=(|D′|<1, LOD<2),
916
shades of pink/red=(|D′|<1, LOD≥2), blue=(|D′|=1, LOD<2), bright red=(|D′|=1, LOD≥ 2). Each
917
SNV is labeled with gene name, amino acid change and position, and the reference sequence id (rs-
918
number). The distance between each SNV is indicated on the white bar above.
919
920
Figure S12. QQ plot for serum sulfate GWAS adjusted for rs17684427 and rs362272 using
921
Affymetrix GeneChip platform (n=917).
922
43
923
Figure S13. Serum sulfate GWAS results adjusted for rs17684427 and rs362272 (n=917).
924
Manhattan plot depicting association results (plotted as –log10 P-value on y-axis) of individual
925
single-nucleotide polymorphisms distributed across the 22 autosomes (x-axis) from the serum
926
sulfate GWAS performed using the Affymetrix GeneChip platform. Red horizontal dotted line
927
indicates P=5.0×10−8. Blue horizontal dotted line indicates P=5.0x10-6. Affymetrix GeneChip does
928
not contain SLC13A1 R12X, SLC13A1 W48X, or SLC26A1 L348P.
44
a.
b.