Large-Scale Generation and Analysis of Expressed Sequence Tags

BIOLOGY OF REPRODUCTION 71, 1991–2002 (2004)
Published online before print 11 August 2004.
DOI 10.1095/biolreprod.104.031831
Large-Scale Generation and Analysis of Expressed Sequence Tags from
Porcine Ovary1
Honglin Jiang,3,4 Kristin M. Whitworth,4 Nathan J. Bivens,4 James E. Ries,4 Rami J. Woods,4
Lawrence J. Forrester,4 Gordon K. Springer,4 Nagappan Mathialagan,5 Cansu Agca,4 Randall S. Prather,4
and Matthew C. Lucy2,4
Department of Animal Science,4 University of Missouri, Columbia, Missouri 65211
Monsanto Company,5 St. Louis, Missouri 63198
ABSTRACT
One method to identify the factors that control ovarian function is to characterize the genes that are expressed in ovary. In
the present study, cDNA libraries from fetal, neonatal, and prepubertal porcine ovaries, pubertal ovaries on different days of
the estrous cycle (Days 0 [follicle], 5, and 12 [follicle and corpus
luteum]), and follicles isolated from weaned sows (diameter, 2,
4, 6, and 8 mm) were constructed and sequenced. A total of
22 176 cDNAs were sequenced, of which 15 613 were of sufficient quality for clustering. Clustering of cDNAs resulted in
8507 contigs, 6294 (74%) of which were comprised of a single
sequence. Sixty-eight percent of the contigs had consensus sequences that were homologous to existing Tentative Consensus
(TC) sequences or mature transcripts (ET) in The Institute for
Genomic Research Porcine Gene Index. The consensus sequences were classified according to the Gene Ontology Index. Most
cDNA-encoded proteins were components of the nucleus, ribosome, or mitochondrion. The proteins primarily functioned in
binding, catalysis, and transport. Nearly 75% of the proteins
were involved in metabolism and cell growth and/or maintenance. Analysis of the cDNA frequency across different libraries
demonstrated differential gene expression within different-size
follicles, between follicles and corpora lutea, and across developmental time-points. The expression of selected genes (analyzed by ribonuclease protection assay and Northern blotting)
was consistent with the frequency of their respective cDNA in
the individual libraries. This porcine ovary unigene set will be
useful for identifying factors and mechanisms controlling ovarian follicular development in a variety of species.
corpus luteum, follicle, gene regulation, ovary
INTRODUCTION
The ovarian follicle, consisting of an oocyte and surrounding layers of granulosa and theca cells, fulfills the
essential ovarian functions of steroidogenesis and gamete
production. During reproductive life, ovarian follicles undergo dynamic changes in morphology and function as they
progress through the primordial, primary, secondary, antral,
Supported in part by a grant from the Monsanto Company (St. Louis,
MO) and the University of Missouri (Food for the 21st Century Program).
2
Correspondence: Matthew C. Lucy, 164 Animal Science Research Center,
University of Missouri, Columbia, MO 65211. FAX: 573 882 6827;
e-mail: [email protected]
3
Current address: Department of Animal and Poultry Sciences, Virginia
Tech, Blacksburg, VA.
1
Received: 10 May 2004.
First decision: 16 June 2004.
Accepted: 23 July 2004.
Q 2004 by the Society for the Study of Reproduction, Inc.
ISSN: 0006-3363. http://www.biolreprod.org
and preovulatory follicular stages [1]. An even greater morphological and functional change occurs when follicular
cells differentiate into the corpus luteum following the LH
surge and ovulation [2]. Few follicles actually progress
through all of the successive stages, because a high rate of
follicular atresia leads to follicular death before ovulation
[3, 4].
The mechanisms underlying ovarian follicular development have been intensely studied. It is generally accepted
that antral and preovulatory follicular development is primarily controlled by pituitary gonadotropins [1]. Early
stages of follicular development (preantral and small antral
stages) are controlled by a variety of locally produced hormones and growth factors [5, 6]. Despite continuing progress in the area of ovarian biology, the factors controlling
some of the key events in follicular development, including
the initiation of primordial follicle growth, antrum formation, follicular dominance, and follicular atresia, remain to
be identified.
Large-scale sequencing of cDNA libraries is a rapid and
efficient means of discovering new genes and providing
both quantitative and qualitative information regarding
gene expression within specific tissues or cell types [7–12].
We carried out large-scale sequencing of cDNA from the
pig ovary to identify novel intraovarian factors and mechanisms involved in ovarian development. Here, we report
a catalog of genes expressed in the porcine ovary (expressed sequence tags [ESTs]) as well as a preliminary view
of their expression patterns throughout ovarian development, follicular growth, and ovulation.
MATERIALS AND METHODS
Library Construction and Sequencing
Ovarian tissue (whole ovary, dissected follicles, or corpora lutea) (Table 1) was collected from cross-bred pigs (Sus scrofa domestica), frozen
in liquid nitrogen shortly after collection, and stored at 2808C until RNA
extraction. The tissues from several individual pigs were pooled for RNA
extraction. The procedures used for tissue collection were approved by the
Animal Care and Use Committee of the University of Missouri-Columbia.
The procedures for constructing the Day 0 follicle (pd0fol), Day 5
ovary (pd5ov), Day 12 follicle (pd12fol), and Day 12 corpus luteum
(pd12CL) cDNA libraries were reported previously [12]. The methods
described herein were used for production of the 2-mm follicle (p2mm),
4-mm follicle (p4mm), 6-mm follicle (p6mm), 8-mm follicle (p8mm), fetal
ovary (pfeto), neonatal ovary (pnatal), and prepubertal ovary (pputal) libraries. Total cellular RNA was extracted using the Trizol reagent (Life
Technologies, Gaithersburg, MD) according to the manufacturer’s instructions. Poly(A) RNA was isolated from total cellular RNA using the Oligotex mRNA kit (Qiagen, Chatsworth, CA). The integrity and purity of
extracted RNA were verified by measuring the ratio of absorbance at 260
and 280 nm and by electrophoresis of an RNA aliquot on formaldehydeagarose gels.
1991
1992
JIANG ET AL.
TABLE 1. Sources of tissues used to construct cDNA libraries.
Library
Library code
Animal type
Tissue type
2-mm follicle
4-mm follicle
6-mm follicle
8-mm follicle
Day 0 follicle
Day 5 ovary
p2mm
p4mm
p6mm
p8mm
pd0fol
pd5ov
Sow
Sow
Sow
Sow
Gilt
Gilt
Follicle
Follicle
Follicle
Follicle
Follicle
Whole ovary
Day 12 corpus luteum
pd12cl
Gilt
Corpus luteum
Day 12 follicle
pd12fol
Gilt
Follicle
Fetal ovary
pfeto
Fetus
Whole ovary
Neonatal ovary
pnatal
Gilt
Whole ovary
Prepubertal ovary
pputal
Gilt
Whole ovary
The cDNA libraries were constructed by using the SuperScript Plasmid
System for cDNA Synthesis and Plasmid Cloning (Life Technologies) according to the manufacturer’s instructions with a modified reverse transcription reaction [13]. The reverse transcription was performed by annealing 1 mg of poly(A) RNA with 10 mg of an oligo(dT) primer
(GCTGCTCGCGGCCGC-tag-T18, where tag 5 library-specific five-base
sequence identifier) at 428C or by annealing 1 mg of poly(A) RNA with
4 mg of an anchored oligo(dT) primer (GCTGCTCGCGGCCGC-tagT18VN, where V 5 A, G, or C and N 5 A, G, C, or T) at 508C. An
excessive amount of oligo(dT) primer or an anchored oligo(dT) primer
was used to reduce the number of clones with long poly(A) tails in the
cDNA libraries [13]. The cDNAs were ligated into either the pT7T3 vector
(pd0fol, pd5ov, pd12fol, and pd12CL cDNA libraries) or the pSport1 vector (p2mm, p4mm, p6mm, p8mm, pfeto, pnatal, and pputal libraries; Life
Technologies) at the NotI and SalI sites. Ligated vector-cDNAs were transformed into DH10B Escherichia coli-competent cells (Life Technologies)
by electroporation. The complexity of the library was estimated according
to the manufacturer’s instructions (Life Technologies).
The quality of each cDNA library was assessed before large-scale
cDNA sequencing was performed. Bacteria clones from each library were
picked and individually grown overnight at 378C in deep-well, 96-well
plates containing 1.4 ml of Terrific Broth with ampicillin [14]. Plasmid
extractions from bacterial cultures were performed by using the QIAprep
96 Turbo Miniprep kit (Qiagen) according to the manufacturer’s instructions. The average cDNA insert size and the percentage of clones without
an insert were determined by EcoRI and HindIII restriction-enzyme digestion of plasmid isolated from 96 randomly picked clones. Libraries with
an average insert size of greater than 1 kilobase (kb) and more than 95%
of clones having a cDNA insert were subjected to preliminary cDNA
sequencing. Then, 288 randomly picked clones were sequenced. The ABI
Prism Big Dye Terminator cycle sequencing chemistry (Applied Biosystems, Foster City, CA) with a T3 (cDNA ligated into the pT7T3 vector)
or Sp6 (cDNA ligated into the pSport1 vector) sequencing primer was
used for the sequencing reactions. The sequencing reaction products (39
cDNA sequence) were analyzed on an ABI 377 Automated DNA sequencer (Applied Biosystems). The cDNA sequence data from the preliminary
sequencing were evaluated. A cDNA library was subjected to large-scale
sequencing if it met the following criteria: 1) More than 95% of the clones
contained inserts, 2) the average insert size was greater than 1 kb, 3) less
than 5% of the clones contained ribosomal RNA, 4) less than 10% of the
clones contained long poly(A) tails (.40 adenines), and 5) the complexity
of the library was at least 106 colony-forming units. The large-scale sequencing was performed as described for the preliminary sequence analysis above.
Sequence Clustering and Annotation
The cDNA sequences from the 11 libraries were clustered into contigs
using the SeqMan II software system (DNASTAR, Inc., Madison, WI).
The original trace files from the automated DNA sequencing were loaded
into the software system. The trace files were evaluated for quality using
the Trace Quality Evaluation system within SeqMan II. Poor-quality sequence, vector sequence, and poly(A) tails were trimmed from the sequence. The trimmed cDNA sequences were assembled into contigs using
Description
Dissected from sow ovary on Days 1–5 after weaning.
Dissected from sow ovary on Days 1–5 after weaning.
Dissected from sow ovary on Days 1–5 after weaning.
Dissected from sow ovary on Days 1–5 after weaning.
Dissected from gilt ovary on the day of estrus.
Ovary collected on day 5 of the estrous cycle. Includes
small antral follicles and corpora lutea.
Dissected from gilt ovary on Day 12 of the estrous cycle.
Dissected from gilt ovary on Day 12 of the estrous cycle (3–5-mm antral follicles).
Ovary collected on approximately Day 60 of gestation.
Contains primordial germ cells.
Ovary collected on approximately Day 22 of age. Contains primordial and primary follicles.
Ovary collected on approximately Day 155 of age. Primordial through antral-stage follicles.
the assembly process within SeqMan II. A 90% minimum match percentage was used. The actual average match percentage for entry into the
contig was 97.1%. A consensus sequence for each contig was exported
from SeqMan II.
The consensus cDNA sequence for each cluster was compared with
The Institute for Genomic Research (TIGR) Porcine Gene Index nucleotide sequence databases (Porcine Index, release 8.0; TIGR, Rockville, MD;
http://www.tigr.org/tdb/tgi) using the BLAST program [15]. The contig
consensus sequence was considered to be homologous to a TIGR Tentative
Consensus (TC) sequence or mature transcript (ET) sequence when the
BLAST score was greater than 200. Contigs that were homologous to the
TIGR TC sequence were annotated with a gene name by using the TIGR
EST Annotator.
Functional classifications from Gene Ontology (GO) Consortium [16]
were assigned to each contig using the homologous TIGR TC number and
the functional classifications assigned by TIGR. A set of higher-level parent classifications (GO Slim) [17] were used for the GO assignments. The
GO Slim assignments were performed manually by visually inspecting the
Tree view and identifying the appropriate GO Slim category. Genes whose
component, process, or function fell into multiple GO Slim categories were
given equal weighting across categories.
Electronic Northern Blot Analyses
Contigs were comprised of cDNA clones arising from the original
cDNA libraries. An ‘‘electronic Northern’’ was performed by analyzing
the frequency of library clones within each contig. Three collective libraries were selected for analyses. A collective follicle library (combined
p2mm, p4mm, p6mm, and p8mm libraries) was compiled and analyzed,
because the individual follicle libraries represented follicles at different
stages of preovulatory development [18]. The pd12fol and pd12CL libraries were constructed from follicles and corpora lutea collected from
the same ovaries on the same day of the estrous cycle. Therefore, a collective Day 12 ovary library (pd12fol and pd12CL) also was complied
and analyzed for cDNAs that were differentially expressed in follicles and
corpora lutea. Finally, a collective library representing developmental stages (fetal, neonatal, and prepubertal) was analyzed to identify cDNAs
whose presence was specific to developmental stages of the ovary. Independence was examined using the chi-square test. The observed value was
the number of clones arising from each library. Each library should contribute a proportional number of clones to the contig if the frequency of
clones was independent of library. The expected value (assuming independence) therefore was the total number of clones in the contig multiplied
by the proportion of accepted sequences in the subject library (relative to
the total number of sequence in the collective libraries).
Comparison of mRNA Amount in Tissues with cDNA
Frequency in Libraries
Ribonuclease protection (RP) and Northern blot assays were used to
determine the mRNA expression levels of selected genes in ovaries, follicles, and corpora lutea. The expression levels of seven mRNAs were
measured. These mRNAs were selected because their cDNAs appeared to
be differentially represented within closely related libraries. The expres-
1993
EXPRESSED SEQUENCE TAGS FROM PORCINE OVARY
TABLE 2. Characteristics of individual cDNA sequences evaluated for quality and clustered by using the SeqMan II program.a
Library
2 mm follicle (p2mm)
4 mm follicle (p4mm)
6 mm follicle (p6mm)
8 mm follicle (p8mm)
Day 0 follicle (pd0fol)
Day 5 ovary (pd5ov)
Day 12 corpus luteum (pd12cl)
Day 12 follicle (pd12fol)
Fetal ovary (pfeto)
Neonatal ovary (pnatal)
Prepubertal ovary (pputal)
Total
a
b
Sequence
attempts (n)
2592
2592
2592
2592
1152
384
2016
1632
2016
2304
2304
22 176
Accepted
sequences (%)
1805
1721
1983
2047
725
89
1219
945
1505
1851
1723
15 613
(70)
(66)
(77)
(79)
(63)
(23)
(60)
(58)
(75)
(80)
(75)
(70)
Contigs (n)
1358
1350
1404
1469
614
79
901
781
1153
1529
1333
8507
Singleton contigs
(n [%])
1160
1160
1171
1266
540
72
798
683
1034
1375
1151
6294
(85)
(86)
(83)
(86)
(88)
(91)
(89)
(87)
(90)
(90)
(86)
(74)
Contigs in TIGR
(n [%])b
1096
1017
1179
1094
484
63
666
602
826
1046
1070
5797
(81)
(75)
(84)
(74)
(79)
(80)
(74)
(77)
(72)
(68)
(80)
(68)
The consensus cDNA sequence for each cluster was compared against with the TIGR Porcine Gene Index.
Blast score greater than or equal to 200 when blasted against the TIGR Porcine Gene Index 8.0 (January 6, 2004; http://www.tigr.org/tdb/tgi).
sions of three mRNAs were compared for fetal, neonatal, and prepubertal
tissues, and the expression of three mRNAs were compared for 2-, 4-, 6-,
and 8-mm follicle tissues. The expression of one mRNA was compared
between follicle and corpus luteum. Total cellular RNA was extracted from
tissues as described above. Radiolabeled probes were synthesized from
selected cDNA plasmids. The RP and Northern blot assays were performed as single experiments using previously described procedures [19,
20]. The mRNA expression level was subjectively compared to the number
of cDNA isolated from each library.
for individual libraries ranged from 23% (Day 5 ovary library) to 80% (neonatal ovary library). The average
trimmed sequence length across all libraries was 348 6 1
(mean 6 SEM) base pairs (bp; range, 99 to 659 bp) (Fig.
1). Clustering of the cDNA sequences resulted in 8507 contigs (Table 2). Single-sequence contigs comprised 74% of
the total, and 99% of the contigs had 20 or fewer members
(Fig. 2).
Data and cDNA Clones Access
Annotation of cDNA and Contigs
The sequence for each cDNA library clone reported herein has been
deposited in GenBank. A list of cDNA library clones within each contig,
an annotated list of the contigs (contig number, gene name, frequency of
clones from each library, etc.), and the consensus sequence for each contig
are available at http://www.swine.rnet.missouri.edu.
Sixty-eight percent of the contigs (n 5 5797) were defined as homologous (BLAST score, $200) to existing TC
or ET in the TIGR Porcine Gene Index (Fig. 3). The average BLAST score for homologous contigs was 601 6 3.
Forty-eight contigs has 20 or more members (Table 3).
Nearly half of the largest contigs had consensus sequences
that were homologous to genes involved in protein synthesis (initiation factors, elongation factors, and ribosomal proteins). Other large contigs had consensus sequences for enzymes involved in intracellular energy production (e.g.,
components of the electron-transport chain), for structural
components of the cell, for proteins involved in protein
turnover, or for proteins involved in transport.
The consensus sequences for contigs were homologous
to a number of genes that function in the endocrinology of
ovarian cells (Table 4). The sequenced cDNA encoded insulin-like growth factor (IGF) system genes (including IGFbinding proteins), inhibin/activin system genes, relaxin and
RESULTS
cDNA Sequencing and Clustering
A total of 22 176 cDNA were sequenced and subjected
to the Trace Quality Evaluation system within SeqMan II.
Seventy percent of the cDNA sequences (n 5 15 613) contained sufficient cDNA sequences of acceptable quality for
clustering (Table 2). The percentage of accepted sequences
FIG. 1. The length of trimmed cDNA sequence (cDNA length after removal of poor-quality sequence, vector sequence, and poly[A] tail) for
cDNA clones submitted for clustering. The number of cDNA clones (frequency) within different categories of trimmed sequence length is presented. A trimmed sequence length of 100 represents clones with 99–
100 bp of trimmed sequence. Trimmed sequence lengths of 150, 200,
and 250 bp (etc.) on the abscissa represent ranges of trimmed sequence
lengths (101–150, 151–200, 201–250 bp, etc., respectively).
FIG. 2. The number of contigs with 20 or fewer members. The frequency
of contigs is represented on the ordinate (logarithmic scale) and the size
of the contig (1, 2, 3, or 4 cDNA members, etc.) is represented on the
abscissa.
1994
JIANG ET AL.
FIG. 3. The number of contigs with different BLAST scores from a nucleotide homology search of the TIGR Porcine Gene Index. A BLAST score
of 50, 100, 150, etc. on the abscissa represents a range of BLAST scores
(,50, 51–100, 101–150, etc., respectively).
relaxin-like protein genes, and transforming growth factor
b family genes. The inhibin a chain (contig 756) represented 0.29% of the sequenced cDNA. The cDNAs for
three separate prostaglandin synthesis enzymes and four
separate steroidogenic enzymes were sequenced. Prostaglandin F synthase 1 (contig 273) was one of the most
abundant cDNAs and represented 0.30% of cDNA sequences. Steroid acute regulatory protein (StAR; contig 3043)
represented 0.12% of the sequenced cDNAs. The cDNAs
for hormone receptors and components of hormone secondmessenger systems (cAMP, inositol triphosphate, phosphatidylinositol 3-kinase, mitogen-associated protein kinase,
Janus kinase, and sma/mad) were also sequenced.
The cDNAs were classified according to the GO Index
(cellular component, molecular function, and biological
process). We found that 797 contigs (3242 individual
cDNA clones) had GO cellular component annotations in
the TIGR Porcine Gene Index. Each contig contained one
or more cDNA clones. Based on the number of cDNA
clones (an indirect indicator of mRNA species within the
cell), the majority of cellular mRNA encoded components
of the ribosome, mitochondrion, and nucleus (Fig. 4). We
found that 996 contigs (3370 individual cDNA clones) had
a GO function annotation. The consensus sequences for
most contigs were homologous to genes whose products
were involved in binding or catalytic activity (enzymes).
We found 949 contigs (2948 individual cDNA clones) with
a GO process annotation. Nearly three-quarters of the
cDNAs sequenced encoded genes involved in cellular metabolism and cell growth and/or maintenance.
Frequency of cDNAs in Different Libraries (Electronic
Northern Blot Analysis)
The frequency of library cDNA clones within the largest
individual contigs is shown for follicle libraries (p2mm,
p4mm, p6mm, and p8mm) (Table 5), Day 12 ovary libraries
(p12fol and p12CL) (Table 6), and whole-ovary libraries
(pfeto, pnatal, and pputal) (Table 7). Independence of
cDNA distribution within the libraries was tested by chisquare (statistical significance inferred at minimum threshold of P , 0.10). A lack of independence (statistically significant result) indicated that within a contig, specific
cDNAs were found in a higher percentage in some libraries
relative to others.
We found 18 large contigs ($10 members) whose cDNA
clones were not equally distributed across follicle libraries
(Table 5). Nine contigs (contig 2 [elongation factor 1a],
contig 1 [cytochrome c oxidase], contig 160 [cytochrome
b], contig 18 [identity unknown], contig 222 [actin a], contig 1711 [translation elongation factor eEF-2], contig 5141
[retinoic acid receptor-responder protein 1], contig 163
[ADP ATP carrier protein isoform T2], and contig 5152
[glutathione S-transferase]) had cDNA clones that were predominantly found in the collective p6mm and p8mm libraries. Three contigs (contig 288 [ferritin, heavy polypeptide], contig 273 [prostaglandin F synthase 1], and contig
756 [inhibin a]) had cDNA predominately found in the
p6mm library. Four contigs (contigs 36 and 37 [glutathione
S-transferase], contig 651 [cathepsin L], and contig 1448
[ATP synthase g]) had cDNA clones predominately found
in the p8mm library. Two contigs (contig 161 [cytochrome
c oxidase III] and contig 370 [thymosin b4]) had cDNA
clones primarily found in p2mm and p4mm libraries. The
remaining contigs with more than 10 members had cDNA
equally distributed across follicle libraries.
Eight contigs had cDNA clones that were not equally
distributed within the collective p12fol and p12cl libraries
(contigs with four or more members) (Table 6). Eleven contigs had cDNAs that were found primarily in pd12CL, and
seven contigs had cDNAs that were found primarily in
pd12fol. The greatest difference in cDNA distribution toward pd12CL was found for contigs 1 (cytochrome c oxidase), 12 (prostate-secreted seminal plasma protein [PSP94 protein]), 33 (ATP synthase A chain), 3043 (StAR),
5146 (tissue metalloproteinase inhibitor [TIMP]-1), and
5207 (CYP1B1). The cDNAs for a variety of proteins and
enzymes involved in progesterone synthesis were found in
higher proportions within pd12CL (contigs 3043 and 5325
[StAR], contig 2308 [CYPXIA1; side chain-cleavage enzyme], and contig 282 [3-b-hydroxysteroid dehydrogenase]). The greatest difference in cDNA distribution toward
pd12fol was found for contigs 278 (nexin-1), 18 (unknown), 368 (translationally controlled tumor protein
[TCTP]), 7 (collagen a2[I] chain), 255 (unknown), and
2326 (alanine glyoxylate aminotransferase). The cDNAs
within contigs 7, 18, and 368 (found in high abundance in
pd12fol) were also abundant in the individual follicle libraries (Table 5).
Eleven contigs had cDNA clones that were not equally
distributed within the collective pfeto, pnatal, and pputal
libraries (contigs with six or more members) (Table 7).
Three contigs (contig 33 [ATP synthase A chain], contig
161 [cytochrome c oxidase III], and contig 2285 [guanine
nucleotide-binding protein b subunit-like protein]) contained cDNAs that were predominately found in the pfeto
library. Three contigs contained cDNAs that were predominately found in the pnatal library (contig 412 [collagen,
type III, a1], contig 533 [eukaryotic translation initiation
factor 4A], and contig 5141 [retinoic acid receptor-responder protein 1]). Five contigs contained cDNAs that were predominately found in the pputal library (contig 288 [ferritin,
heavy polypeptide], contig 430 [monooxygenase], contig
64 [selenoprotein P-like protein], contig 756 [inhibin a
chain], and contig 34 [vimentin]). The five contigs whose
cDNAs were predominately found in the pputal library
(created from tissue-containing antral follicles) were among
the largest contigs for the individual follicle libraries (Table
5).
Comparison of mRNA Amount in Tissues with cDNA
Frequency in Libraries
Seven mRNAs were independently examined for expression level within ovarian tissues using RP assay (Fig.
1995
EXPRESSED SEQUENCE TAGS FROM PORCINE OVARY
TABLE 3. Contig number, TIGR TC number, gene name, total number of isolated clones, and percentage of all sequences for contigs with 20 or more
members (all libraries).a
Contig
number
1
2
33
160
161
434
36
274
288
1308
7
368
273
1788
756
12
34
67
412
1309
64
2036
414
435
104
651
1133
13
1712
18
162
430
5122
164
167
413
2515
3770
2281
748
1313
1787
5130
5123
533
741
1732
5125
a
TIGR TC
TC116362
TC103875
TC116169
TC116172
TC103937
TC116175
TC116463
TC103916
TC103870
TC103959
TC116269
TC116297
TC104571
TC116311
TC104824
TC117722
TC116272
TC116295
TC103911
TC103988
TC104140
TC103960
TC116176
TC103503
TC104062
TC116298
TC115698
TC103942
TC104193
TC107052
TC116338
TC103694
TC103919
TC116277
TC103994
TC103287
TC103878
TC116268
TC116335
TC116555
TC103159
TC116317
TC116409
TC116192
TC116418
TC116332
TC116301
TC103354
Gene name
Cytochrome c oxidase (EC 1.9.3.1) chain I—pig mitochondrion
Elongation factor 1-a 1
ATP synthase A chain (EC 3.6.1.34) (Protein 6)
Cytochrome b
Cytochrome c oxidase III
60S acidic ribosomal protein PO (L10E)
Glutathione S-transferase
40S ribosomal protein S2
Ferritin, heavy polypeptide
Ribosomal protein S3a
Collagen a2(I)
Translationally controlled tumor protein (TCTP) (p23)
Prostaglandin-F synthase 1 (EC 1.1.1.188)
Ribosomal protein L7a
Inhibin a
Prostate secreted seminal plasma protein (PSP-94 protein)
Vimentim
40S ribosomal protein SA (P40)
Collagen, type III, a1
40S ribosomal protein S7 (S8)
Selenoprotein P-like protein
60S ribosomal protein L6
Ribosomal protein L10
60S Ribosomal protein L4
Ribosomal protein S8
Cathepsin L (EC 3.4.22.15)
Tumor-specific transplantation antigen P198 homolog p23
Polyubiquitin 3
Bone proteoglycan II (PG-S2) (Decorin)
Unknown
60S ribosomal protein L9
Monooxygenase, nonspecific
Heat shock cognate 71-kDa protein
Ribosomal protein L11
60S ribosomal protein L17 (L23)
Nonhistone chromosomal protein HMG-17
Ferritin, light polypeptide
Ribosomal protein S9
Ribosomal protein L5
Cytochrome b5
40S ribosomal protein S6 (Phosphoprotein NP33)
Integral membrane protein 2B (Transmembrane protein BR1)
Eukaryotic translation elongation factor 1 gamma
Glyceraldehyde 3-phosphate dehydrogenase (EC 1.2.1.12)
Eukaryotic translation initiation factor 4A
Diamine acetyltranferase (EC 23.1.57)
40S ribosomal protein S3
60S ribosomal protein L8
Total clones (n) % of sequences
222
215
114
89
86
71
63
63
63
56
53
49
47
47
46
44
44
42
42
39
37
37
34
33
32
32
32
31
31
30
29
29
28
27
27
27
25
24
23
22
22
22
22
21
20
20
20
20
1.42
1.38
0.73
0.57
0.55
0.45
0.40
0.40
0.40
0.36
0.34
0.31
0.30
0.30
0.29
0.28
0.28
0.27
0.27
0.25
0.24
0.24
0.22
0.21
0.20
0.20
0.20
0.20
0.20
0.19
0.19
0.19
0.18
0.17
0.17
0.17
0.16
0.15
0.15
0.14
0.14
0.14
0.14
0.13
0.13
0.13
0.13
0.13
A complete list of contigs can be found at http://www.swine.rnet.missouri.edu.
5, A and B) or Northern blot analysis (Fig. 5C). In most
cases, the level of mRNA expression within a tissue reflected the relative number of cDNA clones isolated from the
respective libraries. For example, the RP assay did not detect the inhibin a subunit mRNA in fetal or neonatal ovary,
but a radiographic signal was readily detected for inhibin
a subunit in prepubertal ovary (Fig. 5A). The corresponding clone frequency within contig 756 (inhibin a) was one,
zero, and seven for pfeto, pnatal, and pputal, respectively.
Likewise, the mRNA expression level within fetal, neonatal, and prepubertal tissues for selenoprotein P (contig 64)
and an unknown gene (contig 2542) (Fig. 5A) as well as
the mRNA expression level for 2-, 4-, 6-, or 8-mm follicles
for inhibin b (contig 3428) and LH-receptor mRNA (contig
1894) (Fig. 5B) were visually correlated with the number
of clones isolated from the respective libraries. The 17ahydroxylase mRNA was highest in 6-mm follicles (Fig.
5B), but the corresponding number of cDNA clones for
p2mm, p4mm, p6mm, and p8mm was one, zero, four, and
five, respectively. The apparent decrease in 17a-hydroxylase mRNA in 8-mm follicles was not reflected in the
cDNA frequency across libraries. The PSP-94 cDNA was
found almost exclusively in the corpus luteum (one cDNA
from pd12fol compared to 41 clones from pd12CL), and
Northern blot analyses demonstrated expression of PSP-94
(;600 bp) in corpus luteum but not in the follicle or nonluteal ovary (Fig. 5C).
DISCUSSION
Large-scale cDNA sequencing projects have generated
millions of ESTs from numerous species and a variety of
tissues (including porcine ovary) [7–12]. The present EST
project is unique from others, because whole ovary repre-
1996
JIANG ET AL.
TABLE 4. Contig number, TIGR TC number, gene name, total number of isolated clones, and percentage of all sequences for representative contigs
whose cDNAs encode genes involved in the endocrine regulation of ovarian cells.
Contig
number
TIGR TC
Gene name
Total clones (n)
% of sequences
756
3775
2120
156
1887
6872
4489
5090
5255
2738
4322
5955
TC104824
TC117675
TC104817
TC117324
TC104032
TC117989
TC119778
TC118084
TC120386
TC103983
TC117225
TC117725
Hormones and hormone-binding proteins
Inhibin a chain
Follistatin (activin-binding protein)
Relaxin
Relaxin-like factor (RLF/INSL3)
Monocyte chemotactic protein 1 (MCP-1)
Interleukin-18
Transforming growth factor b-1
Transforming growth factor b-3
Insulin-like growth factor-I
Insulin-like growth factor-II
Insulin-like growth factor binding protein-2
Insulin-like growth factor binding protein-5
46
1
4
2
2
1
1
1
1
3
2
2
0.29
0.01
0.03
0.01
0.01
0.01
0.01
0.01
0.01
0.02
0.01
0.01
273
1714
6549
TC104571
TC104909
TC105381
Prostaglandin synthesis
Prostaglandin-F synthase 1
Prostaglandin-E synthase
Prostaglandin-D synthase
47
2
1
0.30
0.01
0.01
1953
3043
2308
281
5149
1576
TC116550
TC116845
TC104489
TC117802
TC104115
TC108675
Steroidogenesis
Sterol carrier protein-2
Steroidogenic acute regulatory protein
Cytochrome P450 side-chain cleavage
3-b-Hydroxysteroid dehydrogenase
Cytochrome P450 steroid 17 a-hydroxylase/17 20 lyase
Cytrochrome P450 Aromatase
1
19
5
8
11
1
0.01
0.12
0.03
0.05
0.07
0.01
5888
1894
452
3323
5800
6335
TC106004
NP276344
TC104214
TC108406
TC105022
TC119247
Hormone receptors
Atrial natriuretic peptide receptor B (guanylate cyclase)
Luteinizing hormone receptor
Progesterone membrane-binding protein
GnRH receptor (type II)
Interleukin-11-receptor a chain
Steroidogenic factor 1
2
5
4
1
1
1
0.01
0.03
0.03
0.01
0.01
0.01
5542
5568
6096
2751
5808
3144
2038
3430
7751
6123
353
5536
1502
2599
2801
1244
TC104369
TC124885
TC105650
TC104116
TC119549
TC104440
TC106516
TC117154
TC107962
TC104637
TC105997
TC116679
TC105469
TC116574
TC117952
TC105621
Second-messenger systems
cAMP-dependent protein kinase type I-a regulatory chain
cAMP-dependent protein kinase type II-b regulatory chain
Phospholipase C b-2
Calmodulin
Phosphatidylinositol 3-kinase regulatory subunit
RhoC
Growth factor receptor-bound protein 14
Mitogen-activated protein kinase 1a
Mitogen-activated protein kinase kinase 1
Mitogen-activated protein kinase 3 (ERK-1)
Mitogen-activated protein kinase phosphatase 6
Janus kinase 1
SMAD 4
Mitogen-activated protein/microtubule affinity-regulating kinase 3
Serine/threonine protein phosphatase 2A regulatory subunit A
Serine/threonine protein phosphatase 2A regulatory subunit B
3
3
2
11
1
4
3
1
1
2
1
3
2
6
1
1
0.02
0.02
0.01
0.07
0.01
0.03
0.02
0.01
0.01
0.01
0.01
0.02
0.01
0.04
0.01
0.01
senting specific stages of development (fetal, neonatal, and
prepubertal) and preovulatory ovarian follicles sampled at
different diameters were used to generate individual libraries for cDNA sequencing. Each clone was sequenced from
its original library so that the frequency of cDNA within
each tissue could be determined. Oligonucleotide tags were
used during library construction to confirm the origin of
each cDNA clone. The 15 613 sequences clustered into
8507 contigs whose sequences were blasted against the
TIGR Porcine Gene Index. Sixty-eight percent of the contigs were homologous to existing sequences within the
TIGR Porcine Gene Index. The genes were associated with
common cell functions, such as energy metabolism, protein
synthesis, signal transduction, cell communication, transport, development, and cell-cycle regulation (Fig. 4). Genes
associated with ovary-specific functions, such as steroido-
genesis, were also identified (Table 4). The remaining 30%
of the consensus sequences were not homologous to porcine sequences at TIGR and may represent novel mRNA
species. Given the large number of unique sequences in the
TIGR Porcine Gene Index, our observed frequency of novel
consensus sequences (32%) suggests a high rate of new
gene discovery in porcine ovary. The original cDNA sequences and their respective library clusters represent a scientific resource for ovarian biologists who study ovarian
gene expression (http://www.swine.rnet.missouri.edu).
As revealed by the cDNA frequency, cytochrome c oxidase cDNA was the most abundant cDNA, representing
1.42% of the 15 613 clones (Table 3). Cytochrome c oxidase is an enzyme in the mitochondrial electron-transport
chain (intracellular energy production). Other highly expressed genes in porcine ovary were also involved in intra-
EXPRESSED SEQUENCE TAGS FROM PORCINE OVARY
FIG. 4. The percentage of cDNA within different categories of the GO
Index. The cDNAs were classified according to GO Slim categories for
cellular component (top), molecular function (middle), and biological
process (bottom).
cellular energy production (ATP synthase and cytochrome
c oxidase III). Elongation factor 1a had the second-highest
cDNA frequency (1.38% of the cDNA clones), and other
genes involved in protein synthesis (including ribosomal
proteins) were also highly expressed. More than 50% of the
cDNA sequences encoded proteins involved in metabolism
(GO Slim process) (Fig. 4), and nearly 50% of the genes
encoded components of the ribosome or mitochondria (GO
Slim component) (Fig. 4). Thus, the bulk of the mRNA
within an ovarian cell encodes genes that are involved in
the intracellular activities that are not unique to ovarian
cells (metabolism, protein synthesis, etc.), and only a small
1997
FIG. 5. RP and Northern blot analyses for selected mRNA representing
cDNA sequenced in the project. The analyses were performed on mRNA
from A) whole porcine ovary (fetal, neonatal, or prepubertal; ribonuclease
protection assay); B) ovarian follicles collected at different diameters after
weaning (2, 4, 6, or 8 mm; ribonuclease protection assay); or C) porcine
ovary (whole nonluteal ovary [Ovary], corpus luteum [CL], or follicles
[Fol]; Northern blot analyses for PSP-94). In A and B, the contig number
and the annotated gene name are presented to the right of the blot and
the number of cDNA isolated from each library is shown below the individual lanes. For PSP-94 (C), 41 cDNAs were isolated from pd12cl, and
one cDNA was isolated from pd12fol. In general, the number of library
cDNAs within the respective contig reflected the level of mRNA expression within the tissue.
percentage of the major cDNA species appeared to be specific to reproductive tissues. Reproductive genes with a
high expression level (high frequency of sequenced cDNA)
included inhibin a and PSP-94. The abundance of cDNA
for these two reproductive genes in ovary is unusual, because most other genes previously associated with ovarian
function were found in relatively low numbers. For exam-
1998
JIANG ET AL.
TABLE 5. The number of sequenced cDNAs within individual follicle libraries (p2mm, p4mm, p6mm, and p8mm).a
Contig
2
1
160
36
288
273
756
161
34
651
33
7
368
18
1133
430
2515
5122
372
13
413
221
412
1041
5130
5123
370
222
37
1311
64
1712
1711
476
5141
163
997
1448
5152
416
5149
a
b
TIGR TC
TC103875
TC116362
TC116172
TC116463
TC103870
TC104571
TC104824
TC103937
TC116272
TC116298
TC116169
TC116269
TC116297
TC107052
TC115698
TC103694
TC103878
TC103919
TC118187
TC103942
TC103287
TC116199
TC103911
TC104001
TC116409
TC116192
TC116291
TC116201
TC116465
TC104054
TC104140
TC104193
TC116425
TC116368
TC116449
TC116313
TC116411
TC104248
TC117178
TC116566
TC104115
Name
Elongation factor 1-a 1
Cytochrome c oxidase (EC 1.9.3.1) chain 1
Cytochrome b
Glutathione S-transferase
Ferritin, heavy polypeptide
Prostaglandin-F synthase 1 (EC 1.1.1.188)
Inhibin a chain
Cytochrome c oxidase III
Vimentin
Cathepsin L (EC 3.4.22.15)
ATP synthase A chain (EC 3.6.I.34) (Protein 6)
Collagen a 2(I)
Translationally controlled tumor protein (TCTP) (p23)
Unknown
Tumor-specific transplantation antigen P198 homolog p23
Monooxygenase, nonspecific
Ferritin, light polypeptide
Heat shock cognate 71-kDa protein
Smfn protein
Polyubiquitin 3—human
Nonhistone chromosomal protein HMG-17
Actin, g
Collagen, type III, a1
Apolipoprotein E
Eukaryotic translation elongation factor 1 g
Glyceraldehyde 3-phosphate dehydrogenase (EC 1.2.1.12)
Thymosin b 4
Actin a
Glutathione S-transferase
Apolipoprotein AI
Selenoprotein P-like protein
Bone proteoglycan II precursor (PG-S2)
Translation elongation factor eEF-2
Polyadenylate-binding protein 1
Retinoic acid receptor-responder protein 1
ADP ATP carrier protein isoform T2
Omithine decarboxylase antizyme
ATP synthase g chain mitochondrial precursor
Glutathione S-transferase
Mitochondrial SP-22 protein
Steroid 17-alpha-hydroxylase/17 20 lyase
p2mm
p4mm
p6mm
p8mm
P,
22
16
9
2
6
4
3
16
12
6
10
6
8
2
6
8
6
2
1
6
3
6
2
2
2
2
8
1
0
2
3
2
1
3
0
1
1
2
0
2
1
15
4
2
3
5
4
1
7
6
1
8
5
7
0
6
4
2
7
3
1
5
0
1
3
3
0
4
0
2
0
2
1
0
1
1
0
3
0
1
1
0
40
21
17
10
29
25
27
7
6
9
3
7
6
7
5
4
5
4
5
4
4
5
6
3
4
6
1
4
2
4
4
3
6
3
3
6
3
1
6
2
4
50
34
28
37
8
7
7
6
10
14
7
7
4
15
3
3
5
3
7
4
3
4
5
6
4
5
0
7
8
6
2
5
3
3
6
3
3
7
3
5
5
0.001
0.001
0.001
0.001
0.001
0.001
0.001
0.050
NSb
0.050
NS
NS
NS
0.001
NS
NS
NS
NS
NS
NS
NS
NS
NS
NS
NS
NS
0.010
0.050
0.050
NS
NS
NS
0.100
NS
0.100
0.100
NS
0.050
0.100
NS
NS
The largest contigs ($ 10 total members) are shown. Ribosomal subunit genes are not shown.
NS, not significant.
ple, only a single copy of aromatase (contig 1576), FSH
receptor (contig 7993), zona pellucida protein 3 (contig
7528), and IGF-I (contig 5255) were sequenced. Fewer than
five cDNAs were sequenced for several other genes that
play physiologically important roles within the ovary (IGFII, IGF-binding protein-2, LH receptor) (Table 4). Tissuespecific ovarian genes therefore may be found only in contigs with a small number of members.
The cDNA libraries sequenced in the present study were
not subtracted or normalized. The frequency of cDNA sequences therefore may approximate the expression level of
the mRNA in the tissue from which the cDNA library was
made [21–24]. For this reason, the frequency of each cDNA
from each library was counted. Analysis of the cDNA frequency across different libraries suggested differential patterns of expression within different-size follicles (diameter,
2–8 mm) (Table 5), between follicles and corpora lutea (Table 6), and across developmental time-points (fetal, neonatal, and prepubertal ovary) (Table 7). The expression of
selected genes was consistent with the frequencies of their
respective cDNA in the individual libraries (analyzed by
RP assay or Northern blot) (Fig. 5).
Distinct biochemical processes may underlie the morphological and physiological changes in follicles as they
grow from 2 to 8 mm in diameter. Follicles of 2 to 4 mm
in diameter either may continue growth or may undergo
atresia [25]. Follicles of 6 mm in diameter are highly estrogenic, and in our previous analyses, follicles of 6 mm in
diameter had more aromatase mRNA than those of 2, 4, or
8 mm in diameter [18]. Follicles of 8 mm in diameter had
less aromatase, because they had been exposed to the LH
surge and had undergone luteinization before ovulation.
Eight contigs had cDNA clone frequency that was highly
disproportionate (P ,0.001) across follicle libraries (Table
5). Five contigs had cDNA clone frequencies that increased
in the collective p6mm and p8mm libraries (contig 2 [elongation factor 1a], contig 1 [cytochrome c oxidase], contig
160 [cytochrome b], contig 36 [glutathione S-transferase],
and contig 18 [unknown]). The physiological basis for the
observed changes in gene frequency is open to further scientific investigation. The increase in elongation factor 1a
mRNA and cytochrome c oxidase mRNA may reflect an
increase in protein synthesis and energy metabolism, respectively, in large follicles. The increase in glutathione Stransferase that we observed in the p8mm library is supported by previous research showing an increase in glutathione S-transferase in large follicles [26]. The increase in
glutathione S-transferase is believed to support progester-
1999
EXPRESSED SEQUENCE TAGS FROM PORCINE OVARY
TABLE 6. The number of sequenced cDNAs within individual day 12 libraries (pd12fol and pd12CL).a
Contig
1
12
33
3043
161
748
64
5139
36
1531
2943
2
741
5146
278
2331
18
368
5207
412
2252
7
255
2326
160
282
581
2308
5325
1914
2453
245
273
1998
2120
5158
1031
1496
a
b
TC
TC116362
TC117722
TC116169
TC116845
TC103937
TC116555
TC104140
TC117113
TC116463
TC105082
TC117132
TC103875
TC116332
TC116432
TC117595
TC104136
TC107052
TC116297
TC117961
TC103911
TC104215
TC116269
TC120237
TC117678
TC116172
TC117803
TC106109
TC104489
TC116845
TC110813
TC103991
TC117806
TC104571
TC119205
TC104817
TC103355
TC104176
TC108301
Name
Cytochrome c oxidase (EC 1.9.3.1) chain I
Prostate secreted seminal plasma protein precursor (PSP-94 protein)
ATP synthase A chain (EC 3.6.1.34) (Protein 6)
Steroidogenic acute regulatory protein
Cytochrome c oxidase III
Cytochrome b5
Selenoprotein P-like protein
NADH-ubiquinone oxidoreductase chain 6 (EC 1.6.5.3)
Glutathione S-transferase
NADH-ubiquinone oxidoreductase chain 2 (EC 1.6.5.3)
3-Hydroxy-3-methylglutaryl-coenzyme A synthase
Elongation factor 1 a 1
Diamine acetyltransferase (EC2.3.1.57)
TIMP-1
Nexin-I
PTH-responsive osteosarcoma D1 protein
Unknown
TCTP (23)
CYP1B1 cytochrome P450 enzyme
Collagen, type III, a1
Cytochrome oxidase subunit II
Collagen a 2(I)
Unknown
Alanine glyoxylate aminotransferase
Cytochrome b
3-b-Hydroxysteroid dehydrogenase
Nuclear receptor NR5A2
P450scc cytochrome P450 11A1 (EC 1.14.15.6) (CYPXIA1)
Steroidogenic acute regulatory protein mitochondrial
IQ motif containing GTPase-activating protein 2
Prosaposin
UDP glucose pyrophosphorylase
Prostaglindin F synthase 1 (EC 1.1.1.188)
Solute carrier family 35, member F5
Relaxin
Glutathione peroxidase
Heat shock protein 84
Karyopherin (importin) b
pd12fol
pc12cl
P,
21
1
7
1
4
3
7
4
6
3
1
5
2
0
5
1
5
5
0
1
3
4
4
4
0
0
0
0
0
3
3
1
1
1
1
1
2
2
67
41
34
15
10
9
4
7
2
5
6
2
5
6
1
5
0
0
5
4
2
0
0
0
4
4
4
4
4
1
1
3
3
3
3
3
2
2
0.001
0.001
0.001
0.010
NSb
NS
NS
NS
0.100
NS
NS
NS
NS
0.050
0.050
NS
0.010
0.010
0.050
NS
NS
0.050
0.050
0.050
0.100
0.100
0.100
0.100
0.100
NS
NS
NS
NS
NS
NS
NS
NS
NS
The largest contigs ($ 4 total members) are shown. Ribosomal subunit genes are not shown.
NS, Not significant.
one synthesis in preovulatory follicles that have been exposed to the LH surge and are undergoing luteinization
[27]. The cytochrome b cDNA was increased in large follicles as well. Cytochrome b is a component of the oxidase
system of neutrophils, and neutrophils are attracted to the
follicle during ovulation [28]. The increase in cytochrome
b therefore may reflect an infiltration of neutrophils into the
preovulatory follicle. Contig 18 represents a cDNA that encodes an unknown protein. The cDNA sequence is found
within the TIGR database but, nonetheless, cannot be tied
to any known gene. The identity and function of this unknown protein that is increased in large porcine follicles is
a potential subject for future investigations.
Contig 288 (ferritin, heavy polypeptide), contig 273
(prostaglandin F synthase 1), and contig 756 (inhibin a)
were specifically increased in the p6mm library at P ,
0.001. The high expression of inhibin a is consistent with
the role of inhibin as an endocrine factor secreted from
large follicles [29]. Others have reported an increase in inhibin a mRNA in midstage preovulatory follicles, with a
subsequent decrease in inhibin a mRNA is late-stage preovulatory follicles [30]. Ovarian follicles synthesize prostaglandins E, D, and F, as evidenced by our sequencing of
synthase enzymes for the respective enzymes (Table 4) and
previously published literature [31]. Prostaglandin synthesis
increases in preovulatory follicles, particularly around the
time of the LH surge [32], and an increase in prostaglandin
synthesis is required for ovulation in pigs [33]. A clear
increase in ferritin cDNA was noted within the p6mm library, and to our knowledge, changes in ferritin mRNA
have not been previously reported in preovulatory follicles.
Iron is required for coordinating the heme within P450 enzymes, which is a class of enzymes that includes those
involved in steroidogenesis [34]. Ferritin sequesters and
stores intracellular iron in a nontoxic form [35]. The increase in ferritin in preovulatory follicles may play an important role in iron homeostasis during the preovulatory
period.
Twelve contigs had cDNA frequencies that differed at P
, 0.05 across the pd12fol and pd12CL libraries (Table 6).
Six of the contigs (contig 1 [cytochrome c oxidase], contig
12 [PSP-94], contig 33 [ATP synthase], contig 3043
[StAR], contig 5146 [TIMP-1], and contig 5207 [CYP1B1])
contained cDNAs that were more abundant in the pd12CL
library. An increase in cytochrome c oxidase and ATP synthase activity in corpora lutea relative to follicles has been
demonstrated in previous studies [36, 37]. Cytochrome c
oxidase and ATP synthase are mitochondrial enzymes, and
the tissue differences may simply reflect the greater density
of mitochondria in corpus luteum relative to follicle. Greater StAR mRNA within corpora lutea relative to follicles
has also been found in porcine ovary [38]. The difference
in cDNA frequency may reflect the greater mitochondrial
density as well as greater steroidogenic capacity of corpus
2000
JIANG ET AL.
TABLE 7. The number of sequenced cDNA within individual ovary libraries (pfeto, pnatal, and pputal).a
Contig
2
1
33
161
160
412
7
368
1712
13
288
69
430
64
1133
5130
756
34
1787
1433
1893
413
5122
2546
2285
533
1764
552
2515
1892
5141
5123
760
222
109
1711
383
a
b
TC
TC103875
TC116362
TC116169
TC103937
TC116172
TC103911
TC116269
TC116297
TC104193
TC103942
TC103870
TC116177
TC103694
TC104140
TC115698
TC116409
TC104824
TC116272
TC116317
TC103884
TC104425
TC103287
TC103919
TC116429
TC104004
TC116418
TC104011
TC104069
TC103878
TC103911
TC116449
TC116192
TC105289
TC116201
TC116634
TC116425
TC118023
Name
Elongation factor 1-a I
Cytochrome c oxidase (EC 1.9.3.1) chain I
ATP synthase A chain (EC 3.6.1.34)
Cytochrome c oxidase III
Cytochrome b
Collagen, type III, aI
Collagen a 2(I)
TCTP (p23)
Bone proteoglycan II precursor (PG-S2) (Decorin)
Polyubiquitin 3
Ferritin, heavy polypeptude
Tubulin a-1 chain
Monooxygenase, nonspecific
Selenoprotein P-like protein
Tumor-specific transplantation antigen P198 homolog p23
Eukaryotic translation elongation factor 1 g
Inhibin a chain
Vimentin
Integral membrane protein 2B (Transmembrane protein BRI)
Chain B of porcine hemoglobin
Unknown
Nonhistone chromosomal protein HMG-17
Heat shock cognate 71-kDa protein
p97
Guanine nucleotide-binding protein b subunit-like protein
Eukaryotic initiation factor 4A-II (eIF-4A-II)
Smooth muscle protein SM22
Heterogeneous nuclear ribonucleoprotein H (hnRNP H)
Ferritin, light polypeptide
Collagen, type III, aI
Retinoic acid receptor responder protein 1
Glyceraldehyde 3-phosphate dehydrogenase
Elongation factor 1 d
Actin a
Translation elongation factor eEF-I b chain
Translation elongation factor eEF-2
Unknown
pfeto
pnatal
pputal
P,
17
13
26
20
9
2
3
7
1
4
2
5
1
0
3
2
1
2
0
2
4
1
1
3
5
2
1
0
2
2
0
2
0
0
3
3
1
34
18
6
3
7
17
8
7
8
3
2
4
0
1
2
4
0
0
3
1
2
3
3
3
2
5
1
4
4
2
5
0
2
4
1
1
3
28
21
10
9
13
3
8
4
7
7
9
3
9
9
5
3
7
6
5
5
2
4
4
2
0
0
5
3
1
3
1
4
4
2
2
2
2
NSb
NS
0.001
0.001
NS
0.001
NS
NS
NS
NS
0.050
NS
0.001
0.001
NS
NS
0.010
0.050
NS
NS
NS
NS
NS
NS
0.050
0.100
NS
NS
NS
NS
0.050
NS
NS
NS
NS
NS
NS
The largest contigs ($ 6 total numbers) are shown. Ribosomal subunit genes are not shown.
NS, Not significant.
luteum relative to follicle. Tissue inhibitors of metalloproteinases play a role in remodeling of the extracellular matrix [39]. The expression of TIMP-1 mRNA increases during late preovulatory development and increases further
within the corpus luteum [40]. To our knowledge, specific
expression of CYP1B1 (a P450 enzyme involved in steroid
metabolism) [41] in corpus luteum has not been previously
reported.
The abundance of prostate-secreted plasma protein (PSP94 or b-microseminoprotein) in corpus luteum relative to
follicle was unexpected (Table 6). The PSP-94 protein is a
major component of seminal fluid from a variety of species.
Although PSP-94 has been used as a marker of prostatic
development in males [42], a specific function for PSP-94
in prostate physiology has not been identified. The PSP-94
protein will bind immunoglobulin and may influence the
activity of immune cells [43]. A recent study found an inhibitory effect of PSP-94 in prostate tumor growth [44].
Thus, PSP-94 may control cell growth and development
either through an indirect effect on immune cells or through
a direct effect on luteal cells themselves. We examined
PSP-94 in cow and pig corpus luteum and found that the
mRNA was only expressed in pig corpus luteum (unpublished observations). Furthermore, the mRNA was maximally expressed in midcycle corpora lutea. Thus, the porcine corpus luteum may be unique from the corpus luteum
of other species in that it specifically expresses PSP-94
(Fig. 5). The highly specific expression of PSP-94 mRNA
suggests a unique role for this protein in porcine corpus
luteum.
Six contigs had cDNAs that were more abundant in
pd12fol compared to pd12cl (P , 0.05; contig 278 [nexin1], contig 18 [unknown], contig 368 [TCTP], contig 7 [collagen a2 (I) chain], contig 255 [unknown], and contig 2326
[alanine glyoxylate aminotransferase]) (Table 6). The
cDNAs within contigs 7, 18, and 368 were also abundant
in the individual follicle libraries (Table 5). The relative
abundance of nexin-1 (a serine protease inhibitor) in follicles compared to corpus luteum confirms a recent report
showing a high level of nexin-1 expression in preovulatory
follicle [45]. Nexin-1 mRNA decreases after the LH surge.
Thus, the decrease in a specific protease inhibitor may facilitate ovulation. Translationally controlled tumor protein
was recently implicated as a protein that binds elongation
factor eEF1A during the elongation step of protein synthesis [46]. The abundance of TCTP cDNA in follicle libraries
(Tables 5 and 6) highlights the importance of protein synthesis during follicular development. The presence of collagen a2 (I) in pd12fol may reflect the relative abundance
of specific collagen subunits in follicles versus corpora lutea [47]. The two contigs classified as unknown should be
investigated further as genes controlling unique aspects of
follicular development.
Several genes were differentially expressed when the
pfetal, pnatal, and pputal libraries were examined. Ovarian
follicular development in pigs is different from that in hu-
EXPRESSED SEQUENCE TAGS FROM PORCINE OVARY
mans and other farm animals, because the neonatal ovary
(sampled herein) is compact, with a dense population of
primordial and primary follicles. Oocytes may still reside
in egg nests at birth, and a period of 1–2 wk after birth
may be required before follicular populations are fully established and primordial follicles enter the growth phase
[48]. The ovary then undergoes a protracted period of follicular growth before puberty. Three genes had contigs that
were highly represented in either fetal or neonatal ovary (P
, 0.001; contig 33 [ATP synthase A chain], contig 161
[cytochrome c oxidase III], and contig 412 [collagen, type
III, a1]) (Table 7). The fetal ovary is densely packed with
oocytes, and oocytes have a large number of mitochondria.
Thus, the high frequency of ATP synthase and cytochrome
c oxidase in the pfeto libraries may reflect the preponderance of mitochondria in developing oocytes. Ovarian follicles change the collagen composition of their basal lamina
when they commence growth [49]. The shift in expression
toward collagen type III a1 within the pnatal library may
be indicative of an active phase of follicular growth in the
neonatal ovary. Additional cDNAs had a frequency that
appeared to be greater in either the pfeto (contig 2285 [guanine nucleotide-binding protein b subunit-like protein]) or
pnatal (contig 5141 [retinoic acid-responder protein 1] and
contig 533 [eukaryotic translation elongation factor 1g]) libraries. Five contigs (contig 288 [ferritin, heavy polypeptide], contig 430 [monooxygenase], contig 64 [selenoprotein P-like protein], contig 756 [inhibin a chain], and contig
34 [vimentin]) had cDNAs that were predominately derived
from prepubertal ovaries (pputal library). The prepubertal
ovaries used to construct the pputal library contained antral
follicles. Each of the five contigs was among the largest
contigs in the collective follicle libraries (Table 5). The observed frequencies of cDNA in the pputal library therefore
supported the sequencing data from the follicle libraries
(Table 5).
In summary, the present study has provided a catalog of
8507 contigs derived from 15 613 cDNA sequences obtained in porcine ovarian tissue. The list of ovarian cDNAs
and their frequencies in each ovarian library will be a useful
index of ovarian gene expression. The observed differences
in cDNA frequency across libraries appeared to be logical
given existing knowledge of ovarian biology. For most contigs, the frequency of the sequenced genes was too few to
reliably study gene expression across tissues. Greater sensitivity for gene expression analysis will be achieved using
solid-phase microarrays constructed from the cDNAs generated herein. This second phase of the current project is
underway.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
ACKNOWLEDGMENTS
The authors thank B. Blaue, C. K. Boyd, and P. M. Roozen for technical assistance.
REFERENCES
1. Webb R, Nicholas B, Gong JG, Campbell BK, Gutierrez CG, Garverick HA, Armstrong DG. Mechanisms regulating follicular development and selection of the dominant follicle. Reprod Suppl 2003;
61:71–90.
2. Christenson LK, Devoto L. Cholesterol transport and steroidogenesis
by the corpus luteum. Reprod Biol Endocrinol 2003; 1:90.
3. Johnson AL. Intracellular mechanisms regulating cell survival in ovarian follicles. Anim Reprod Sci 2003; 78:185–201.
4. McGee EA, Hsueh AJ. Initial and cyclic recruitment of ovarian follicles. Endocr Rev 2000; 21:200–214.
5. Fortune JE. The early stages of follicular development: activation of
17.
18.
19.
20.
2001
primordial follicles and growth of preantral follicles. Anim Reprod
Sci 2003; 78:135–163.
Matzuk MM. Revelations of ovarian follicle biology from gene
knockout mice. Mol Cell Endocrinol 2000; 163:61–66.
Adams MD, Kerlavage AR, Fleischmann RD, Fuldner RA, Bult CJ,
Lee NH, Kirkness EF, Weinstock KG, Gocayne JD, White O, Sutton
G, Blake JA, Brandon RC, Chiu MW, Clayton RA, Cline RT, Cotton
MD, Earle-Hughes J, Fine LD, FitzGerald LM, FitzHugh WM, Fritchman JL, Geoghagen NSM, Glodek A, Gnehm CL, Hanna MC, Hedblom E, Hinkle PS Jr, Kelley JM, Klimek KM, Kelley JC, Liu L-I,
Marmaros SM, Merrick JM, Moreno-Palanques RF, McDonald LA,
Nguyen DT, Pellegrino SM, Phillips CA, Ryder SE, Scott JL, Saudek
DM, Shirley R, Small KV, Spriggs TA, Utterback TR, Weidman JF,
Li Y, Barthlow R, Bednarik DP, Cao L, Cepeda MA, Coleman TA,
Collins E-J, Dimke D, Feng P, Ferrie A, Fischer C, Hastings GA, He
W-W, Hu J-S, Huddleston KA, Greene JM, Gruber J, Hudson P, Kim
A, Kozak DL, Kunsch C, Ji H, Li H, Meissner PS, Olsen H, Raymond
L, Wei Y-F, Wing J, Xu C, Yu G-L, Ruben SM, Dillon PJ, Fannon
MR, Rosen CA, Haseltine WA, Fields C, Fraser CM, Venter JC. Initial
assessment of human gene diversity and expression patterns based
upon 83 million nucleotides of cDNA sequence. Nature 1995;
377(suppl):3–174.
Hillier LD, Lennon G, Becker M, Bonaldo MF, Chiapelli B, Chissoe
S, Dietrich N, DuBuque T, Favello A, Gish W, Hawkins M, Hultman
M, Kucaba T, Lacy M, Le M, Le N, Mardis E, Moore B, Morris M,
Parsons J, Prange C, Rifkin L, Rohlfing T, Schellenberg K, Marra M.
Generation and analysis of 280 000 human expressed sequence tags.
Genome Res 1996; 6:807–828.
Marra M, Hillier L, Kucaba T, Allen M, Barstead R, Beck C, Blistain
A, Bonaldo M, Bowers Y, Bowles L, Cardenas M, Chamberlain A,
Chappell J, Clifton S, Favello A, Geisel S, Gibbons M, Harvey N,
Hill F, Jackson Y, Kohn S, Lennon G, Mardis E, Martin J, Waterston
R. An encyclopedia of mouse genes. Nat Genet 1999; 21:191–194.
Tirunagaru VG, Sofer L, Cui J, Burnside J. An expressed sequence
tag database of T-cell-enriched activated chicken splenocytes: sequence analysis of 5251 clones. Genomics 2000; 66:144–151.
Caetano AR, Johnson RK, Pomp D. Generation and sequence characterization of a normalized cDNA library from swine ovarian follicles. Mamm Genome 2003; 14:65–70.
Tuggle CK, Green JA, Fitzsimmons C, Woods R, Prather RS, Malchenko S, Soares BM, Kucaba T, Crouch K, Smith C, Tack D, Robinson N, O’Leary B, Scheetz T, Casavant T, Pomp D, Edeal BJ, Zhang
Y, Rothschild MF, Garwood K, Beavis W. EST-based gene discovery
in pig: virtual expression patterns and comparative mapping to human.
Mamm Genome 2003; 14:565–579.
Jiang H, Bivens NJ, Ries JE, Whitworth KM, Green JA, Forrester LJ,
Springer GK, Didion BA, Mathialagan N, Prather RS, Lucy MC. Constructing cDNA libraries with fewer clones that contain long poly(dA)
tails. BioTechniques 2001; 31:38–40.
Sambrook J, Fritch EF, Maniatis T. Molecular Cloning: A Laboratory
Manual, 2nd ed. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press; 1989:A.1–B.25.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local
alignment search tool. J Mol Biol 1990; 215:403–410.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM,
Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP,
Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE,
Ringwald M, Rubin GM, Sherlock G. Gene ontology: tool for the
unification of biology. The Gene Ontology Consortium. Nat Genet
2000; 25:25–29.
Pruess M, Fleischmann W, Kanapin A, Karavidopoulou Y, Kersey P,
Kriventseva E, Mittard V, Mulder N, Phan I, Servant F, Apweiler R.
The proteome analysis database: a tool for the in silico analysis of
whole proteomes. Nucleic Acids Res 2003; 31:414–417.
Liu J, Koenigsfeld AT, Cantley TC, Boyd CK, Kobayashi Y, Lucy
MC. Growth and the initiation of steroidogenesis in porcine follicles
are associated with unique patterns of gene expression for individual
components of the ovarian insulin-like growth factor system. Biol Reprod 2000; 63:942–952.
Sterle JA, Boyd C, Peacock JT, Koenigsfeld AT, Lamberson WR, Gerrard DE, Lucy MC. Insulin-like growth factor (IGF)-I, IGF-II, IGFbinding protein-2, and pregnancy-associated glycoprotein mRNA in
pigs with somatotropin-enhanced fetal growth. J Endocrinol 1998;
159:441–450.
Jiang H, Lucy MC. Involvement of hepatocyte nuclear factor-4 in the
expression of the growth hormone receptor 1a messenger ribonucleic
acid in bovine liver. Mol Endocrinol 2001; 15:1023–1034.
2002
JIANG ET AL.
21. Okubo K, Hori N, Matoba R, Niiyama T, Fukushima A, Kojima Y,
Matsubara K. Large scale cDNA sequencing for analysis of quantitative and qualitative aspects of gene expression. Nat Genet 1992; 2:
173–179.
22. Graber JH, Cantor CR, Mohr SC, Smith TF. In silico detection of
control signals: mRNA 39-end-processing sequences in diverse species. Proc Natl Acad Sci U S A 1999; 96:14055–14060.
23. Hu RM, Han ZG, Song HD, Peng YD, Huang QH, Ren SX, Gu YJ,
Huang CH, Li YB, Jiang CL, Fu G, Zhang QH, Gu BW, Dai M, Mao
YF, Gao GF, Rong R, Ye M, Zhou J, Xu SH, Gu J, Shi JX, Jin WR,
Zhang CK, Wu TM, Huang GY, Chen Z, Chen MD, Chen JL. Gene
expression profiling in the human hypothalamus-pituitary-adrenal axis
and full-length cDNA cloning. Proc Natl Acad Sci U S A 2000; 97:
9543–9548.
24. Ko MS, Kitchen JR, Wang X, Threat TA, Wang X, Hasegawa A, Sun
T, Grahovac MJ, Kargul GJ, Lim MK, Cui Y, Sano Y, Tanaka T, Liang
Y, Mason S, Paonessa PD, Sauls AD, DePalma GE, Sharara R, Rowe
LB, Eppig J, Morrell C, Doi H. Large-scale cDNA analysis reveals
phased gene expression patterns during preimplantation mouse development. Development 2000; 127:1737–1749.
25. Guthrie HD, Grimes RW, Cooper BS, Hammond JM. Follicular atresia
in pigs: measurement and physiology. J Anim Sci 1995; 73:2834–
2844.
26. Sesh PS, Singh D, Sharma MK, Pandey RS. Activity of glutathione
related enzymes and ovarian steroid hormones in different sizes of
follicles from goat and sheep ovary of different reproductive stages.
J Exp Biol 2001; 39:1156–1159.
27. Keira M, Nishihira J, Ishibashi T, Tanaka T, Fujimoto S. Identification
of a molecular species in porcine ovarian luteal glutathione S-transferase and its hormonal regulation by pituitary gonadotropins. Arch
Biochem Biophys 1994; 308:126–132.
28. Ushigoe K, Irahara M, Fukumochi M, Kamada M, Aono T. Production
and regulation of cytokine-induced neutrophil chemoattractant in rat
ovulation. Biol Reprod 2000; 63:121–126.
29. Knight PG. Roles of inhibins, activins, and follistatin in the female
reproductive system. Front Neuroendocrinol 1996; 17:476–509.
30. Li MD, DePaolo LV, Ford JJ. Expression of follistatin and inhibin/
activin subunit genes in porcine follicles. Biol Reprod 1997; 57:112–
118.
31. Padmavathi O, Reddy GP, Reddy GR, Reddanna P. Biosynthesis of
prostaglandins in ovarian follicles and corpus luteum of sheep ovary.
Biochem Int 1990; 20:455–461.
32. Ainsworth L, Tsang BK, Downey BR, Marcus GJ. The synthesis and
actions of steroids and prostaglandins during follicular maturation in
the pig. J Reprod Fertil Suppl 1990; 40:137–150.
33. Downey BR, Mootoo JE, Doyle SE. A role for lipoxygenase metabolites of arachidonic acid in porcine ovulation. Anim Reprod Sci
1998; 49:269–279.
34. Rodgers RJ. Steroidogenic cytochrome P450 enzymes and ovarian
steroidogenesis. Reprod Fertil Dev 1990; 2:153–163.
35. Theil EC. Ferritin: at the crossroads of iron and oxygen metabolism.
J Nutr 2003; 133(suppl 1):1549S–1553S.
36. Naumoff PA, Stevenson PM. The differential development of mitochondrial cytochrome P-450 and the respiratory cytochromes in rat
ovary. Biochim Biophys Acta 1981; 673:359–365.
37. Rodgers RJ, Waterman MR, Simpson ER. Cytochromes P-450scc, P450(17)a, adrenodoxin, and reduced nicotinamide adenine dinucleotide phosphate-cytochrome P-450 reductase in bovine follicles and
corpora lutea. Changes in specific contents during the ovarian cycle.
Endocrinology 1986; 118:1366–1374.
38. LaVoie HA, Benoit AM, Garmey JC, Dailey RA, Wright DJ, Veldhuis
JD. Coordinate developmental expression of genes regulating sterol
economy and cholesterol side-chain cleavage in the porcine ovary.
Biol Reprod 1997; 57:402–407.
39. Smith MF, McIntush EW, Ricke WA, Kojima FN, Smith GW. Regulation of ovarian extracellular matrix remodeling by metalloproteinases and their tissue inhibitors: effects on follicular development, ovulation and luteal function. J Reprod Fertil Suppl 1999; 54:367–381.
40. Smith GW, Juengel JL, Mclntush EW, Youngquist RS, Garverick HA,
Smith MF. Ontogenies of messenger RNA encoding tissue inhibitor
of metalloproteinases 1 and 2 within bovine periovulatory follicles
and luteal tissue. Domest Anim Endocrinol 1996; 13:151–160.
41. Murray GI, Melvin WT, Greenlee WF, Burke MD. Regulation, function, and tissue-specific expression of cytochrome P450 CYP1B1.
Annu Rev Pharmacol Toxicol 2001; 41:297–316.
42. Hyakutake H, Sakai H, Yogi Y, Tsuda R, Minami Y, Yushita Y, Kanetake H, Nakazono I, Saito Y. Beta-microseminoprotein immunoreactivity as a new prognostic indicator of prostatic carcinoma. Prostate
1993; 22:347–355.
43. Yang WC, Kwok SC, Leshin S, Bollo E, Li WI. Purified porcine
seminal plasma protein enhances in vitro immune activities of porcine
peripheral lymphocytes. Biol Reprod 1998; 59:202–207.
44. Shukeir N, Arakelian A, Kadhim S, Garde S, Rabbani SA. Prostate
secretory protein PSP-94 decreases tumor growth and hypercalcemia
of malignancy in a syngeneic in vivo model of prostate cancer. Cancer
Res 2003; 63:2072–2078.
45. Bedard J, Brule S, Price CA, Silversides DW, Lussier JG. Serine protease inhibitor-E 2 (SERPINE2) is differentially expressed in granulosa
cells of dominant follicle in cattle. Mol Reprod Dev 2003; 64:152–
165.
46. Cans C, Passer BJ, Shalak V, Nancy-Portebois V, Crible V, Amzallag
N, Allanic D, Tufino R, Argentini M, Moras D, Fiucci G, Goud B,
Mirande M, Amson R, Telerman A. Translationally controlled tumor
protein acts as a guanine nucleotide dissociation inhibitor on the translation elongation factor eEF1A. Proc Natl Acad Sci U S A 2003; 100:
13892–13897.
47. Zhao Y, Luck MR. Gene expression and protein distribution of collagen, fibronectin and laminin in bovine follicles and corpora lutea. J
Reprod Fertil 1995; 104:115–123.
48. Black JL, Erickson BH. Oogenesis and ovarian development in the
prenatal pig. Anat Rec 1968; 161:45–55.
49. Rodgers RJ, Irving-Rodgers HF, Russell DL. Extracellular matrix of
the developing ovarian follicle. Reproduction 2003; 126:415–424.