Hormonal Genomics

0163-769X/02/$20.00/0
Printed in U.S.A.
Endocrine Reviews 23(3):369 –381
Copyright © 2002 by The Endocrine Society
Hormonal Genomics
CHANDRA P. LEO, SHEAU YU HSU,
AND
AARON J. W. HSUEH
Division of Reproductive Biology, Department of Gynecology and Obstetrics, Stanford University School of Medicine,
Stanford, California 94305-5317
The availability of the human genomic sequence is changing
the way in which biological questions are addressed. Based on
the prediction of genes from nucleotide sequences, homologies among their encoded amino acids can be analyzed and
used to place them in distinct families. This serves as a first
step in building hypotheses for testing the structural and
functional properties of previously uncharacterized paralogous genes. As genomic information from more organisms
becomes available, these hypotheses can be refined through
comparative genomics and phylogenetic studies. Instead of
the traditional single-gene approach in endocrine research,
we are beginning to gain an understanding of entire mammalian genomes, thus providing the basis to reveal subfami-
lies and pathways for genes involved in ligand signaling. The
present review provides selective examples of postgenomic
approaches in the analysis of novel genes involved in hormonal signaling and their chromosomal locations, polymorphisms, splicing variants, differential expression, and physiological function. In the postgenomic era, scientists will be
able to move from a gene-by-gene approach to a reconstructionistic one by reading the encyclopedia of life from a global
perspective. Eventually, a community-based approach will
yield new insights into the complexity of intercellular communications, thereby offering us an understanding of hormonal physiology and pathophysiology. (Endocrine Reviews
23: 369 –381, 2002)
I.
II.
III.
IV.
V.
VI.
VII.
VIII.
IX.
needed for all functional assignments, it is becoming clear
that the emerging global perspective of all genes, as well as
the pathways and networks in which they are arranged, will
provide insights of a quality that was formerly unattainable
using the previous local view of a limited number of genes.
In the postgenomic era, the availability of all human gene
sequences allows us to study transcription profiles of genes
for specific cell types and developmental stages in response
to hormonal signaling. The introduction of new tools for the
parallel analysis of all gene transcripts and, in the future, the
expression of all proteins heralds the dawn of the new fields
of transcriptomics and proteomics, respectively.
A major impact of hormonal genomics that is evident
today is the method used for identifying new ligands and
receptors. In the classic endocrine approach, we first defined
a biological endpoint (such as body growth induced by pituitary extracts in hypophysectomized animals) to allow the
isolation of the potential ligand (in this case, GH) based on
bioassays. Only after the cloning of ligand genes could cognate receptors be identified. Theoretically, in the postgenomic era, all receptors could be predicted based on their
unique sequence features (e.g., the seven-transmembrane
stretches of hydrophobic amino acids characteristic for G
protein-coupled plasma membrane receptors). This reverse
endocrinology approach starts with orphan receptors to
eventually identify cognate ligands and physiological functions. Taking advantage of the sequence relatedness among
family genes, we can predict the functions of novel uncharacterized hormonal ligands, receptors, and intracellular signaling molecules.
Another important conceptual change for traditional endocrinologists is the realization that the global perspective of
biological systems will continue to blur established boundaries currently separating endocrinology, growth factor research, immunology, extracellular matrix research, and de-
Introduction
Gene Identification
Sequence Homology and Phylogenetic Relationships
Chromosomal Location of Genes
Gene Polymorphism
Alternative Splicing
Identification of the Regulatory Elements of Genes
mRNA Expression Profiling
The Postgenomic Challenges in Hormonal Research
I. Introduction
T
HE SEQUENCING OF the human genome has triggered
a revolution encompassing all fields of biomedicine (1,
2). Whereas traditional methods in genetics and molecular
biology focused on the characterization of individual genes
and their products, the new genomic approach takes advantage of knowledge concerning the totality of genes in an
organism. In the area of hormonal genomics, the functional
subgenomes of secreted extracellular signaling molecules
(peptide and protein ligands), transmembrane receptors, intracellular signaling molecules, and transcriptional factors
(including steroid receptors and related genes) can now be
analyzed in an integrated manner. With a global perspective
on the limited “parts list” of our body, we are able to predict
the structure of genes based on their sequence relationships,
to understand the functional redundancy of genes based on
their common position in gene networks, and to trace the
evolutionary roots of gene families based on comparative
genomic analyses of sequence conservation and chromosomal gene order. Although experimental evidence is
Abbreviations: CCR, C-C chemokine receptor; EST, expressed sequence tag; GDF, growth differentiation factor; LGR, leucine-rich repeatcontaining G protein-coupled receptor; ORF, open reading frame; SAGE,
serial analysis of gene expression.
369
370
Endocrine Reviews, June 2002, 23(3):369 –381
velopmental biology. With the availability of a handful of
animal genomes for comparison, we can trace the evolutionary roots of all human endocrine genes. Also, increasing
evidence indicates that the endocrine mechanism is merely
a special case of long-range intercellular communication
mechanisms that developed during the course of evolution.
Indeed, one can find many cases in which endocrine hormones and receptors have paracrine roots; for example, homologs for insulin-signaling genes or glycoprotein hormone
receptors are already present in nematodes that are lacking
an enclosed circulatory system (3, 4).
Over the past decade, whole-genome sequencing projects
for numerous organisms have been completed, beginning
with viruses and prokaryotes such as Escherichia coli (5) and
continuing with the eukaryotic model organism’s baker’s
yeast, Saccharomyces cerevisiae (6), the nematode Caenorhabditis elegans (7), and the fruit fly Drosophila melanogaster (8).
Recently, working draft versions of the human genome have
been published (1, 2). Moreover, a private draft sequence of
the mouse genome (Mus musculus) and public draft sequences for two species of puffer fish (Fugu rubripes and
Tetraodon nigroviridis) have been completed. Additional sequencing projects for rat (Rattus norvegicus), zebra fish (Danio
rerio), and a number of other organisms are in progress.
Given the explosive growth of the genomic literature, we
have deliberately chosen to limit the scope of the present
review, highlighting basic principles in genomics and providing selected examples of their application to the field of
hormonal research. In contrast, the evolving field of hormonal proteomics and the impact of genomics on clinical
endocrinology will not be covered here.
II. Gene Identification
In small microbial genomes, the identification of genes is
relatively straightforward, as the gene-coding regions are not
interrupted by introns. However, as organisms and their
genomes increase in complexity, the task of gene finding
becomes more difficult. One main reason for the difficulty is
that the signal-to-noise ratio—and thus the accuracy of gene
predictions— drops as the proportion of noncoding DNA
increases. Whereas in prokaryotes, open reading frames
(ORFs) that could encode a stretch of uninterrupted amino
acids represent approximately 85% of the whole genome, this
number decreases to around 70% in yeast and to less than
25% in the worm and fly (9). The human genome is estimated
to have a total size of about 3.2 gigabases (i.e., 3.2 ⫻ 109 bp)
of DNA, of which less than 1.5% are thought to encode
proteins (10). Gene prediction in the human genome is further complicated by the presence of comparatively long and
variable intron sequences (1).
The transcription units, consisting of exons, introns, and
the regulatory region, are thought to constitute more than
20% of the entire human genome. To increase the sensitivity
and specificity of gene predictions in the draft sequence of
the human genome, both public and private groups are using
a combination of approaches based on: 1) identification of
matching mRNA-derived sequences, 2) searches for similarity to previously known genes and proteins, and 3) prediction based on features common to all genes.
Leo et al. • Hormonal Genomics
The first strategy relies on identifying regions of similarity
between genomic sequences and sequences that are known
to be transcribed, such as expressed sequence tags (ESTs)
derived from fragments of cDNA, and full-length cDNA
sequences (11). Even though this approach is based on direct
experimental evidence, it is still subject to errors resulting
from spurious EST data, e.g., from unspliced mRNAs, or
genomic DNA contamination. More importantly, genes expressed at low levels or those transcribed selectively in tissues or cell types underrepresented in EST databases may be
overlooked. Likewise, single-exon genes encoding small proteins may not be detected because the ESTs representing
them are sometimes indistinguishable from genomic DNA
contamination (12).
The second strategy identifies similarities between the
genomic sequence investigated and known gene or protein
sequences in human or other species. Although this approach can correctly predict genes belonging to larger gene
families or having orthologs in different species, it is not
capable of identifying genes having no sequence similarity to
known genes and is prone to picking up pseudogenes. Moreover, it is difficult to detect homologs of genes with small
ORFs because sequence similarities over short stretches may
not achieve the necessary level of statistical significance (12).
The third strategy, ab initio gene prediction, uses software
algorithms combining statistical information on splicing
sites, coding bias, and exon/intron lengths to detect putative
exons and genes (13). In a recent study involving the fruit fly
genome, such algorithms correctly predicted all exons of a
gene in about 40% of the cases, but entirely missed 5–10% of
the known genes (14). Due to the limitations discussed above,
the application of ab initio gene prediction methods to the
human genome is expected to yield an even lower sensitivity
and specificity (15).
Combining all three strategies, the total number of human
genes was estimated to be between 30,000 and 40,000. It is
important to note, however, that we are still some way from
having a complete human gene index. Indeed, initially, a
complete high-stringency alignment with the draft sequence
could only be achieved for half of the more than 10,000
known human genes stored in the curated RefSeq database
(1). This discrepancy is due to the remaining errors and gaps
in the draft sequence that are being addressed during the
ongoing process.
With the completion of the sequencing of several vertebrate genomes, analytical approaches to identify functionally
similar homologous genes have become feasible. The best
examples for hormonal research are the discoveries of more
than 400 G protein-coupled receptors in the human genome
with a potential hormonal ligand (16), and of diverse ligands
belonging to the cytokine (17), IL (18, 19), and cysteine-knot
gene families. New ligand/receptor systems have also been
discovered with the aid of genomic information. For example, a family of more than 10 Toll-like receptors has been
identified (20), and the crucial roles of these receptors in the
recognition of microbial components and innate immunity
have been elucidated (21).
As discussed above, the task of gene identification is not
trivial. Moreover, multiple gene messages can be derived
from a single stretch of DNA based on alternative uses of
Leo et al. • Hormonal Genomics
promoters, exons, and termination sites. Adding to these
overlapping transcription units, somatic recombination
events such as those found in some of the immune recognition loci, and the existence of highly similar gene families
and pseudogenes, render it difficult to clearly identify and
categorize the genes. Based on the classic definition that
genes are distinct transcription units that are translated to
generate one or a set of related proteins, many “genes” in the
genome are pseudogenes. The human genome is estimated
to encode approximately 900 olfactory receptor genes; however, 60% of them appear to be pseudogenes with disrupted
ORFs (22, 23). Likewise, the entire mouse olfactory receptor
gene repertoire comprising about 1300 genes may include
20% pseudogenes (24). Because not all genes with disruptions are pseudogenes, due to the possible splicing out of
regions with a termination codon, the exact number of functional genes remains to be determined using more advanced
experimental procedures.
Comparative genomic analyses of gene families across
different species can provide insights into the evolution and
associated adaptation of specific hormonal regulatory circuits among closely related species. For example, multiple
GnRHs and GnRH receptors have been identified in vertebrates from teleosts to primates (25); however, humans appear to have only one functional GnRH receptor due to the
apparent degeneration of the type II GnRH receptor gene
into a pseudogene (26 –28). Because the ortholog for the human type II GnRH receptor is functional in nonhuman primates (29), future investigation on the role of two types of
GnRH receptors in primates will have to address the evolutionary adaptation of these GnRH signaling systems.
III. Sequence Homology and Phylogenetic
Relationships
Although the human genome has been sequenced, only a
small fraction of its genes have been studied experimentally
to reveal their physiological roles. Consequently, a major
challenge for functional genomics is to predict and then
elucidate the function of all genes in the human body. For
hormonal genomics, we can take advantage of the unique
sequence features of known ligands, receptors, and intracellular signaling molecules. Using sequence relatedness as a
criterion, one can define subgenomes encompassing specific
groups of ligand-signaling genes. Although we do not have
knowledge of all hormones and receptors in the human genome, most of the encoded G protein-coupled receptors have
been identified. Likewise, attempts were made to predict all
cystine knot-containing ligands and related extracellular signaling molecules among the human ORFs by searching for
their unique pattern of cysteine signatures (30). Although a
large number of secreted polypeptide hormones can potentially be identified based on the presence of a signal peptide
for secretion and the lack of transmembrane regions, this
group of predicted genes also includes adhesion molecules,
secreted enzymes, and other uncharacterized extracellular
proteins.
Despite the lack of a convenient global approach to understand genes involved in ligand signaling, one can define
Endocrine Reviews, June 2002, 23(3):369 –381
371
subgroups of these genes based on their evolutionary relationships. Two types of relationships between genes deserve
specific attention. The first type is the orthologous gene that
did not significantly change in either structure or function
during speciation. For instance, it has been estimated that
40% and 60% of human genes responsible for genetic diseases have orthologs in yeast and fly, respectively. The second group of human genes derived as the result of geneduplication events during evolution are called paralogs.
Although these genes belong to the same family and usually
have similar structural motifs, they can serve related but
nonidentical physiological functions. It has been hypothesized that during early chordate evolution, before the Cambrian explosion and during the early Devonian period, two
entire genome duplications took place (31, 32), leading to the
generation of sets of multiple mammalian paralogs, concomitant with the opportunity to develop complex regulatory
mechanisms and functions. Thus, most human genes belong
to a family as the result of gene duplication and domain
shuffling. These evolutionary mechanisms allow for the amplification and diversification of existing domains, the recruitment of existing domains for new functions, and the
development of new domains through domain combination
and shuffling.
Comparative genomic analysis reveals the evolution of
polypeptide ligands, receptors, and intracellular signaling
molecules. Only four orthologs of human insulin-like genes
have been found in D. melanogaster, whereas C. elegans possesses a large number of insulin gene homologs, including
one that was shown to determine the life span of the worm
(33). In contrast, three insulin paralogs are present in the
human genome, including insulin controlling carbohydrate
metabolism, and IGF-I and IGF-II regulating organ and body
growth. A related group of hormonal ligands that also consist of B and A subunits connected by a long-C-domain
peptide is constituted by the relaxin-related proteins. The
human paralogs of both insulin and relaxin all have highly
conserved cysteine residues essential for maintaining a similar secondary structure. In the relaxin group, at least seven
human paralogs have been identified, including three relaxin
genes, Leydig cell relaxin (INSL3), INSL4, INSL5/RIF2, and
INSL6/RIF1 involved in reproductive tract maintenance and
other uncharacterized functions (34, 35).
Genomic studies on the presence of genes with sequence
similarities to the human gonadotropin and TSH receptors
also enhanced our understanding of their evolution. All of
these genes encode proteins with a large N-terminal extracellular region important for ligand binding followed by a
seven-transmembrane region known to be essential for G
protein coupling. Based on the evolutionary conservation of
these genes, five orphan receptors were identified in humans
and named LGRs (leucine-rich repeat-containing, G proteincoupled receptors) (36, 37). Based on the structural comparison of LGRs from diverse species, and the evolutionary
relationship of these animals, a putative evolutionary tree for
the LGR family of proteins can be proposed. The primitive
LGR gene likely replicated before the emergence of the cniderian (sea anemone) to form three LGRs (named as LGRA,
LGRB, and LGRC), each with homologs in modern vertebrates. Based on their structural similarity and putative evo-
372
Endocrine Reviews, June 2002, 23(3):369 –381
lutionary origins, mammalian LGRs can be divided into
three subgroups: LGRA (LH, FSH, and TSH receptors),
LGRB (LGR4, LGR5, and LGR6), and LGRC (LGR7 and
LGR8) (38). In fly, there are at least three LGRs, each belonging to one of these subgroups. Because only one LGR
with similarities to the LGRA subgroup could be identified
in C. elegans (4), it is likely that a gene loss occurred during
evolution. Interestingly, recent studies indicated that LGR7
and LGR8 are receptors for relaxin (39).
Initially discovered based on their sequence similarity to
a fly ortholog, cyclic nucleotide phosphodiesterases represent a large group of genes encoding enzymes that hydrolyze
and inactivate cAMP and cGMP (40). Animal cyclic nucleotide phosphodiesterases comprise at least seven subtypes as
well as orthologs corresponding to four subtypes that can be
found even in freshwater sponges. Accordingly, phylogenetic analysis revealed that most gene duplications and domain shuffling that gave rise to different subtypes had been
completed in the early evolution of animals before the separation of sponges and eumetazoans (41).
Another example of the use of comparative genomics for
hormone discovery is the recent identification of CRHrelated peptides (42– 44). Using phylogenetic and structure
profile analyses, two novel peptides, stresscopin/urocortin II
and stresscopin-related peptide/urocortin III, with limited
sequence relatedness to frog sauvagine and mammalian
CRH, were isolated from the human and puffer fish genomic
databases and shown to function as selective type 2 CRH
receptor agonists. These studies suggest that the two novel
peptides are important in stress-coping responses in both
central and peripheral tissues for maintaining homeostasis
after the initial fight-or-flight responses induced by stress.
Evolutionary genomic approaches have facilitated the
analysis of large families of polypeptide ligands. A recent
review summarizes the origin and function of the pituitary
adenylate cyclase-activating polypeptide/glucagon superfamily in vertebrates and shows that the ancestral superfamily members can be traced to a common origin with the
invertebrates. It also suggests that the superfamily began
with gene or exon duplication and then continued to diverge
with some gene duplications in vertebrates (45). Likewise,
recent duplication of the LH␤ gene in ancestral primates led
to the derivation of multiple human chorionic gonadotropin-␤ genes (46). Furthermore, the coevolution of GH and its
receptor in primates has been analyzed. These studies demonstrated that GH evolution is very conservative among
most mammals but not in primates. The human GH receptor
displays species specificity and interacts only with human or
rhesus monkey GH, but not with nonprimate GH. Sequence
analysis revealed that species specificity of the human GH
receptor emerged in the common ancestor of Old World
primates in a relatively short period (47). Phylogenetic analysis also suggested the evolution of vertebrate steroid receptors from an ancestral estrogen receptor, followed by the
progesterone receptor and other related proteins (48).
As these examples illustrate, the availability of genomic
sequences from an increasing number of organisms will enable hormonal researchers to decipher the evolutionary origin of diverse human genes important for ligand signaling.
In turn, these findings will significantly facilitate the exper-
Leo et al. • Hormonal Genomics
imental verification of functions for novel genes based on
their relationship to characterized genes as well as studies in
model organisms.
IV. Chromosomal Location of Genes
For decades, genetic mapping of the chromosomal positions of genes has been performed based on patients’ family
histories or cross-breeding in animals. Linkage analyses utilize the frequency of meiotic recombination between genetic
markers such as microsatellite DNAs or known genes to
estimate the position of genes with a unique phenotype. The
determination of gene positions in chromosomes has also
been assisted by physical mapping based on fluorescence in
situ hybridization, restriction enzyme length polymorphism,
and sequence-tagged sites. In the postgenomic era, chromosomal location is known for all human genes with known
sequences.
Investigations of gene order in chromosomes of closely
related species indicate that chromosomes underwent duplication, fusion, translocation, and inversion events during
evolution. In syntenic regions of related species, a partial or
complete conservation of gene order during chromosome
evolution is evident. With the availability of complete genomes of increasing numbers of vertebrate species, the
genomic analysis of syntenic regions becomes a valuable tool
for understanding the function of human orthologs based on
the conservation of gene locations. Recently, syntenic mapping of human and mouse chromosomes was performed by
the National Center for Biotechnology Information (http://
www.ncbi.nlm.nih.gov/Homology/). With the well established technology of gene engineering in the mouse, comparisons of human and mouse phenotypes and the
elucidation of syntenic genes will become an increasingly
useful approach in the future. Similar syntenic mapping
projects are being performed between human and rat (49) as
well as between other species.
One example of the impact of synteny studies on endocrine research is the role of certain members of the TGF␤
family in ovarian follicle development. Mutant mice lacking
the oocyte hormone growth differentiation factor (GDF)-9
showed an arrest of ovarian follicles at the secondary stage
(50). Furthermore, another oocyte gene with high homology
to GDF-9, named GDF-9B or bone morphogenetic protein-15
(51, 52), was also found to stimulate granulosa cell proliferation (53) and mapped to human chromosome Xp11.2–11.4.
In sheep, a naturally occurring X-linked mutation was
known to cause an increased ovulation rate and twin and
triplet births in heterozygotes but primary ovarian failure in
homozygotes. Further studies identified the genetic locus of
this Inverdale sheep mutation as syntenic to the human Xp
region containing the GDF-9B/bone morphogenetic protein-15 gene. Thus, this oocyte gene is essential for female
fertility, and its natural mutations can cause both an increased ovulation rate and infertility phenotypes in a dosagesensitive manner (54).
In addition to studies of syntenic chromosome regions,
analyses of unique phenotypes in patients have also confirmed the disruption of linked genes in the human genome.
Leo et al. • Hormonal Genomics
Kallmann syndrome is characterized by hypogonadotropic
hypogonadism and an inability to smell as the result of a
defect in the migration of olfactory and GnRH neurons (55).
Although this disease has been associated with both X-linked
and autosomal inheritance, some infertile patients also
showed skin lesions of scalp, ears, and neck, representing a
contiguous gene syndrome involving two adjacent genes in
chromosome Xp22.3 (56). In this region, the KAL1 gene responsible for Kallmann syndrome is situated adjacent to the
steroid sulfatase gene responsible for X-linked ichthyosis.
Therefore, the analysis of patients with microdeletions of
neighboring chromosomal regions containing linked genes
can reveal the location of disease genes.
V. Gene Polymorphism
The out-of-Africa hypothesis regards all modern human
populations as descended from a small group of approximately 100,000 individuals that migrated from Africa less
than 150,000 yr ago and replaced archaic populations (57).
Assuming the common origin of modern humans, nucleotide
variations are shared at specific positions in the genome. Base
pair changes that lead to minimal alterations in the property
of encoded amino acids usually do not significantly affect the
overall function of the protein. Consequently, variations in
nucleotide sequences that are associated with silent mutations or minimal phenotypic changes are tolerated during
evolution and inherited over generations. These single nucleotide polymorphisms represent minor alterations in DNA
that occur with varying frequencies in different ethnic populations and are a central focus of pharmacogenomic studies
(58). A random pair of human haploid genomes differed at
an average rate of 1 per 1250 bp, but there was marked
heterogeneity in the occurrence of polymorphisms across the
genome. It has been estimated that there are 300,000 potential
single nucleotide polymorphisms in the human genome
(59, 60).
Polymorphisms of individual genes can account for differences in an individual’s responsiveness to hormonal and
other ligands. For example, polymorphisms in the coding
region of the C-C chemokine receptor (CCR)-5 gene, and in
the promoter region of its ligand RANTES, are associated
with aberrant ligand-receptor interactions. Patients with an
altered CCR-5 receptor, the coreceptor for the viral infection
of immune cells, or patients who exhibit overproduction of
RANTES, can be exposed to HIV with reduced infection risk
or delayed disease progression (61, 62). In both cases, the
viral particles fail to infect cells either because they cannot
enter using the defective receptor or because the functional
receptor is already occupied by high levels of the endogenous ligand. Likewise, a polymorphism of the ␤-adrenergic
receptor is associated with altered sensitivity to ␤-agonists in
asthmatics (63). Moreover, a single-nucleotide polymorphism of the IL-1B promoter has been found to be associated
with a chronic hypochlorhydric response to Helicobacter pylori infection and the risk of gastric cancer, presumably by
altering IL-1 levels in the stomach (64). Recently, two linked
polymorphisms of the FSH receptor gene were found to alter
FSH responsiveness of gonadal cells as the amount of FSH
Endocrine Reviews, June 2002, 23(3):369 –381
373
required for ovarian stimulation in patients with different
polymorphic alleles differed (65). Estrogen receptor variants
in normal and neoplastic mammary tissues and other target
cells have also been identified (66, 67). These transcripts
could be responsible for aberrant ligand-signaling events. A
recent study analyzed polymorphisms and genetic variations of multiple G protein-coupled receptors and documented the functional importance of specific residues for the
function of these receptors (68).
Studies on gene polymorphisms could also facilitate the
mapping of disease genes based on their chromosomal location. Studies on polymorphic differences in coding and
noncoding regions in large populations with a particular
phenotype facilitate genome-wide linkage scans of polygenic
diseases such as type 1 and type 2 diabetes. This approach
is complicated by the dependence of such complex diseases
on a combination of genetic variations that may be common
in the general population, and environmental factors. However, increasing documentation of single-nucleotide polymorphism variation in humans could allow genome-wide
linkage and association scans for complex endocrine diseases
(69). Recently, advanced analytical tools have been applied
to delineate the relationship between a common polymorphism in the peroxisome proliferator- activated receptor-␥
gene and type 2 diabetes (70). Because the chromosomal
locations of human genes in the known ligand-signaling
pathways are clear, efforts to identify genetic loci predisposing for hypertension have also focused on candidate gene
strategies. Genetic linkage and association methods have
correlated hypertension with polymorphisms in the genes of
the renin-angiotensin-aldosterone system, the catecholaminergic/adrenergic receptors, and other hormonal factors.
These studies have yielded promising leads by suggesting a
number of candidate genes for hypertension susceptibility
(71).
VI. Alternative Splicing
To deduce the complete set of proteins encoded by a genome, more information than the sequence of all its genes is
required. In higher eukaryotes, the primary transcripts of
individual genes are processed in the nucleus before being
transported to the cytoplasm as mRNAs that direct ribosomal protein synthesis. Alternative mRNA processing is a
mechanism for creating protein diversity through selective
inclusion or exclusion of nucleotide sequences during posttranscriptional processing. Although human, fruit fly, and
worm appear to have similar numbers of alternative splicing
forms per gene, it is clear that alternative splicing results in
a significant increase in coding capacity and protein diversity
(72). Different studies imply that alternative splicing occurs
in about 35–50% of human genes (73–76) and may be more
frequent in certain types of genes, e.g., those encoding cell
surface receptors (77). Although it is not unusual for individual genes to have as many as 12 different mRNA species,
a small subset of genes (such as adhesion molecules or ion
channels) have been found to possess more than 100 splicing
variants (78).
Sequence alignment of multiple ESTs with the correspond-
374
Endocrine Reviews, June 2002, 23(3):369 –381
ing genomic sequence is the most common method to predict
the presence of different mRNA species (76, 79). Intronic
sequences at splice junctions are highly conserved, and 99%
of introns have a GT-AG at their 5⬘- and 3⬘-ends, respectively.
These characteristic patterns can be used to verify candidate
splices. Other approaches for the prediction of alternative
splicing variants include interspecies comparisons of
genomic sequences (80) and the computational analysis of
mRNA expression data measured with oligonucleotide microarrays (81).
The phenomenon of alternative splicing affects hormonal
systems in two ways. In one case, the components of the
machinery—polypeptide ligands, receptors (plasma membrane and nuclear), intracellular mediators of signaling pathways, etc.—are themselves subject to alternative splicing. On
the other hand, the processing of pre-mRNAs in the nucleus
may be subject to hormonal regulation (82). For example,
hypophysectomy in the rat results in the expression of alternative splice variants of a calcium-dependent potassium
channel, an effect that can be prevented by injection of ACTH
(83).
A number of hormone and growth factor isoforms are
derived from alternative gene splicing (84). One prominent
example is vascular endothelial growth factor, a factor that
induces microvascular permeability and plays a central role
in both angiogenesis and vasculogenesis. Through alternative mRNA splicing, the vascular endothelial growth factor
gene gives rise to several distinct isoforms differing in their
expression patterns as well as their biochemical and biological properties (85– 87).
There are also many examples of receptor gene splicing.
For instance, inclusion or exclusion of exon 11 of the insulin
receptor gene results in two different mRNA isoforms encoding receptor proteins with differing biological properties
(88). Similarly, there are at least eight human CRH-receptor
type 1 mRNA variants, and the splicing pattern varies in
response to environmental stress. Functional analysis indicated that some of these CRH-receptor type 1 variants differ
in their ability to transduce ligand signals in the cAMPmediated pathways, whereas others could function as binding proteins for CRH and related ligands (89). The alternative
splicing mechanism also underlies the generation of diverse
decoy receptors belong to the TNF receptor, the chemokine
receptor, and cytokine receptor families. These soluble or
membrane-anchored decoy receptors bind and sequester
their respective ligands with high affinity and specificity but
are incapable of signaling or presenting the agonist to the
functional receptor complexes, thus acting as a molecular
trap for endogenous agonists (90).
For intracellular signaling proteins, gene splicing can also
lead to multiple gene products with diverse functions. For
genes in the apoptosis pathway, alternative splicing can affect both the functional activity and the intracellular distribution of protein isoforms (91). For example, a splicing variant of the antiapoptotic Mcl-1 protein has proapoptotic
activity and antagonizes the prosurvival action of the known
Mcl-1 protein by forming noncovalent dimers (92). In this
example, proteins with diametrically opposed functions are
generated under different physiological conditions through
alternative splicing mechanisms.
Leo et al. • Hormonal Genomics
Splicing of mRNA constitutes an essential molecular
mechanism in the ongoing evolution of eukaryotes and is
pivotal to the understanding of hormonal genomics. Currently available information on alternative splice variants for
numerous genes can be found in databases such as ASDB
(93), AsMamDB (94), or PALS db (95). Such resources will
allow us to gain a better understanding of the complexity of
alternative splicing and its role in hormonal research. Because approximately 15% of the point mutations that cause
diseases in humans are thought to alter the normal splicing
pattern, studies on gene splicing mechanisms could also
provide important insights into human pathophysiology.
VII. Identification of the Regulatory Elements
of Genes
Hormones often effect changes in their target tissues by
altering the expression of specific sets of genes. At the DNA
level, these temporal, spatial, and quantitative changes in
gene expression are controlled by various regulatory DNA
sequences such as promoters, enhancers, insulators, and silencers (96, 97). The identification and characterization of
regulatory DNA sequences that mediate the actions of hormones and their signaling pathway genes therefore constitute an important focus of hormonal research.
Traditionally, regulatory DNA sequences have been investigated on a gene-by-gene basis, using a variety of approaches including deletion constructs, DNA footprinting,
and gel shift assays. With the availability of genome sequences for human, rodent, and other species, novel strategies for the large-scale, computer-based in silico identification
of regulatory sites are becoming feasible (98).
cis-Regulatory elements in the promoters and enhancers of
genes can be defined based on their DNA sequences, position
relative to coding sequences, and the potential to interact
with transcription factors. Because an exhaustive experimental characterization of all potential regulatory sites in a given
genome is difficult to generate, a variety of computational
approaches to this problem have been developed. These
methods include 1) analyses using databases of experimentally defined transcription factor binding sites, 2) approaches
based on comparative genomics, and 3) sequence analyses of
genes with similar expression profiles.
Experimentally derived data on transcription factors, their
consensus DNA binding sites, and DNA-binding profiles
have been compiled in a number of publicly accessible databases (99 –102). By comparing a given genomic DNA sequence to the consensus target sites for transcription factors
stored in the database, potential regulatory sites can be identified. Because many of these sites are short in length and
contain degenerate consensus sequences, a relatively large
number of false-positive predictions are generated.
Cross-species comparisons, also known as phylogenetic
profiling, provide another approach for identifying regulatory DNA sequences. These methods align noncoding sequences of orthologous genes from different species, taking
advantage of the high degree of conservation for regulatory
sequences. On a functional level, this finding is matched by
the observation that genomic transgenes largely retain their
Leo et al. • Hormonal Genomics
natural expression patterns even when introduced into mice
from other mammalian species (103). With the availability of
working drafts of the human and mouse genome, large-scale
interspecies comparisons to identify conserved noncoding
sequences have become possible (104, 105). Moreover, alignments of large genomic regions can be easily performed and
visualized using Internet-based tools such as PipMaker (106).
The last common ancestor of humans and mice lived about
80 –100 million years ago, and it appears that in many regions
of the human genome, this evolutionary distance will allow
the identification of conserved regulatory sites through comparative genomics (98, 107).
Another strategy to predict regulatory elements of genes
is based on the use of expression profiling to identify coregulated genes (cf. Section VIII). Transcription factors often
coordinate the expression of whole sets of genes such as those
induced by a given hormonal ligand. This coordinated regulation results from the presence of common transcription
factor binding sites in these genes and, hence, similar or
identical sequence motifs in their noncoding regions. For
hormonally regulated genes, large-scale identification of coexpressed mRNAs after hormonal treatment can be used to
identify genes that are coregulated by the same transcription
factors. A detailed analysis of the upstream regions of these
functionally clustered genes for shared sequence motifs can
then serve as a starting point for the characterization of novel
regulatory sequences. This approach has already been successfully applied to infer regulatory sites in yeast from microarray- derived expression data (108 –110). Because the
expression of a single gene can be controlled by various
synergistic and/or antagonistic factors acting on different
binding sites in its regulatory region, additional algorithms
are needed to model relationships between multiple transcription factors and their binding sites to unravel complex
transcriptional networks (111–113).
The combination of bioinformatics and novel experimental
approaches allows new insights into the function of regulatory elements. For example, nuclear receptor signaling is
transduced by homodimeric or heterodimeric receptor complexes that bind to specific sequences of DNA, termed response elements, in the promoter regions of target genes.
Because considerable flexibility exists in the types of binding
sites the nuclear receptors are capable of recognizing, the
precision of predicting regulatory elements based on a bioinformatic approach is limited. Screening methods involving
immunoselection and PCR amplification or microarray binding assays in combination with bioinformatic analyses now
can be used to examine genomic binding sites for different
nuclear receptors. The integrated approach for isolating
functional binding sites for nuclear receptors from genomic
DNA should aid in the discovery of genes regulated by
steroid hormones (114). The large-scale prediction of regulatory regions in mammalian genomes will likely require a
combination of all the approaches mentioned above. Importantly, in silico prediction of regulatory DNA sequences will
need to be verified by functional data from high-throughput
experimental methods. Although the area of regulatory element analysis remains a major challenge for hormonal researchers, recent studies on the promoter of the human
RANTES/CCR5 ligand gene have allowed the identification
Endocrine Reviews, June 2002, 23(3):369 –381
375
of promoter “modules” that are important for cell type-specific expression of this gene (115, 116) and will serve as a
model for future bioinformatic approaches in this field.
VIII. mRNA Expression Profiling
The majority of cells in an individual organism (with exceptions such as haploid germ cells or neoplastic cells) share
an identical set of genes. Nevertheless, these cells display an
astonishing phenotypic diversity that is further enhanced by
a multiplicity of developmental stages and physiological or
pathological states. This diversity of cellular phenotypes encountered within an individual is almost entirely due to
differences in the patterns of gene expression. Consequently,
the study of changes in gene expression over time and
space—as well as in response to hormones—represents a
challenging area of hormonal research.
Traditional experimental methods, such as Northern blot
analysis, differential display, and subtractive hybridization,
were the standard tools that researchers used to identify
differentially expressed genes. These approaches produced
either lists of differentially expressed genes or, at most, semiquantitative estimates of transcript frequencies in different
biological samples. This situation changed with highthroughput sequencing of cDNAs, including studies of ESTs
and the serial analysis of gene expression (SAGE). These
methods allowed scientists to quantify the occurrence of
thousands of transcripts in libraries prepared from different
cells or tissues. Based on the resulting data, the relative
expression of different genes within a biological sample
could be assessed. Databases containing information on ESTs
derived from different tissues or cell lines can be analyzed to
estimate the relative abundance of transcripts (117, 118). Using digital differential display or electronic subtraction, a
computational assessment of transcript frequencies in different samples (e.g., with or without nerve growth factor
treatment) can be used to explore differential gene expression (119). Digital differential display has been used to identify genes differentially expressed in diverse hormoneproducing cells and tissues, including murine oocytes (120),
the human prostate (121, 122), and the human hypothalamus-pituitary-adrenal axis (79). By comparing the genomic
localizations of the genes encoding testis-specific transcripts
with known translocation breakpoints from infertile males,
a set of candidate genes for male infertility was identified
(123).
In SAGE, short diagnostic sequence tags are isolated from
multiple cDNAs in the sample of interest, concatenated, and
cloned. In this manner, the sequencing of thousands of socalled SAGE tags can be performed with relative ease. By
calculating the frequencies of tags in a given sample, the
expression patterns of the corresponding genes can be determined (124, 125). A SAGE analysis of the molecular phenotype of the human oocyte identified a variety of surface
receptors, components of second messenger systems, and
secreted proteins, many of which were not previously known
to be expressed in mammalian oocytes (126). Other applications of SAGE technology to the field of endocrinology
include the identification of corticosteroid-responsive genes
376
Endocrine Reviews, June 2002, 23(3):369 –381
in the rat hippocampus (127), the discovery of human thyroid-specific transcripts (128, 129), and the characterization
of gene expression in PRL receptor knockout mice (130).
With the introduction of DNA array technologies, powerful tools for global studies of gene expression have become
available to individual laboratories (131). In this approach,
gene-specific sequences (probes) are immobilized on solid
substrates in ordered arrays. Subsequently, mRNA from the
cells or tissues to be studied is used to generate labeled
samples (targets) that are hybridized to the arrays. By quantifying the amount of labeled target hybridized to the array,
large-scale analyses of gene expression can be performed.
The available DNA array platforms differ in a number of
ways including the substrate on which the array is produced
(glass slides or silicon wafers), the type of arrayed material
(cDNAs or oligonucleotides), the way in which the probe is
fixed onto the array (printing or in situ synthesis), and the
type of labeling used [fluorescent dyes or radioactive tracers
(132)].
As a single DNA array experiment yields thousands of
data points, new computational tools have been developed
for their management and analysis (133). In particular, a
variety of clustering algorithms have been designed to classify and group genes on the basis of similarities or differences
in their expression across different samples and thus help
users to recognize global gene expression patterns (134). The
popular hierarchical clustering approach uses data from a set
of array experiments to place genes on a single hierarchical
tree. Genes with similar expression profiles become neighbors on the tree, whereas genes with larger differences in
their expression patterns appear on more distant branches
(135).
One basic assumption in interpreting gene expression profiles is that genes with similar transcription patterns are
likely to share aspects of their regulation and/or biological
functions. As a consequence, previously uncharacterized
genes have been attributed putative functions based on the
characteristics of known genes that are found in the same
cluster. This “guilt-by-association” approach has been successfully used to predict the function of yeast genes (131) and
is now being applied to the study of mammalian genomes,
e.g., the identification of prostate cancer marker genes (136).
Microarray technologies have also been applied to investigate a number of endocrine tissues and hormone-regulated
processes. Studies performed to date have addressed—
among others— questions involving gene expression in the
human adrenal (137), the effect of hypophysectomy in rats
(138), development of the mouse pituitary gland (139), adipose tissue in obese and diabetic mice (140), preovulatory
gene expression in the rat ovary (141), implantation in the
mouse (142), and gene expression in the murine placenta and
embryo (143). Another important area of DNA array studies
is the classification of disease phenotypes based on expression profiling of genes in normal and pathological samples
(144 –147). In the future, transcript profiling of patient specimens could be used to subclassify hormonal disorders allowing for more precise therapeutic interventions.
As gene expression patterns may vary widely between
adjacent but biologically distinct tissues and cell types, a
meaningful microarray analysis often depends on the precise
Leo et al. • Hormonal Genomics
selection of defined cell populations from a sample. The
investigation of gene expression patterns can be refined by
laser capture microdissection, often in combination with
methods for target amplification (148, 149). In the future,
global analyses of gene expression may be performed even
in single cells (150).
So far, most of these DNA array studies have taken a local
approach to the study of gene expression and have focused
on identifying individual genes connected with certain biological phenomena. Because gene products in a given cell
usually function in a concerted manner and belong to specific
pathways for carrying out unique cellular functions, future
studies will focus on a global analysis of gene expression. In
a recent study, the results of more than 500 DNA microarray
experiments were analyzed to group coregulated genes in C.
elegans. To visualize the relationships between more than
17,000 genes in the nematode genome, clusters of coexpressed transcripts were displayed as mountains in a threedimensional expression map that corresponds to established,
as well as previously unknown, biological relationships
(151). In addition, a number of projects focus on the analysis
and presentation of gene pathway maps. For example, GenMapp provides a graphic representation of the relative expression of selected groups of genes categorized by their
known positions in physiological pathways (152) (http://
www.genmapp.org). Thus, RNA transcripts for thousands of
genes from defined cell populations can be probed for their
relative abundance based on prearranged maps representing
functional pathways.
DNA array experiments generate data sets of sizes that are
unprecedented in most areas of biological research (153). In
the future, it will become increasingly useful for these data
to be available in electronic form to the scientific community
at large. On the one hand, these data cannot be published by
traditional methods because of their unwieldy quantity and
because they can be more easily visualized and manipulated
using a computer. On the other hand, the constant improvement of algorithms for the analysis of microarray data means
that data sets will be frequently revisited to reinterpret their
meaning or to formulate new hypotheses and relationships.
Most importantly, however, the true power of gene expression data can only be unleashed when the expression of a
given set of transcripts can be compared over as many different tissues and cell types, developmental and physiological states, and diseases and experimental conditions as possible. Therefore, public repositories and common data
formats for gene expression data have been proposed (154).
IX. The Postgenomic Challenges in
Hormonal Research
The application of postgenomic approaches to the field of
hormone research will likely result in major changes in the
way endocrinologists perform their work. In the traditional
endocrine laboratory, the single-gene/single-protein approach dominates, providing an integrated view of sequence, mRNA, and protein expression, as well as cellular
and organismal function for a limited number of genes. This
type of research often starts from functional studies where
Leo et al. • Hormonal Genomics
individual laboratories function independently with a limited degree of collaboration. In the current early phase of the
postgenomic era, new methods provide massive amounts of
data, but these are still largely interpreted using a classic
paradigm. For example, DNA microarrays are often used as
a sophisticated substitute for differential display or subtractive hybridization. Consequently, out of thousands analyzed, only a few genes with the most pronounced changes
are selected and studied using the single-gene approach. In
spite of the accelerated pace of discovery, the independent
laboratory approach is still most common.
In the next phase, the large data sets created by the application of a single high-throughput method will be analyzed and interpreted with regard to the complete underlying set of biological entities (e.g., genes). The characterization
of gene function will also expand from the single-gene
knockout approach to random or targeted gene-trapping
approaches (155, 156) (http://socrates.berkeley.edu/⬃skarnes/resource.html; http://www.lexgen.com/). A secretory
trap method has allowed the generation of mice with a deletion of genes encoding membrane and secreted proteins,
including those for polypeptide ligands and plasma membrane receptors. After the validation of functions for individual genes, comprehensive gene functional maps will be
integrated into a biological atlas (157). As a result, relationships between separate entities in this set are recognized,
patterns are discovered, and new testable hypotheses regarding gene pathways and networks are created. This
global phase will be reached when a large number of data
sets become available in a compatible electronic format and
new computational methods for their analysis, visualization,
and interpretation become available (e.g., the use of microarray data in the yeast research community). For understanding hormonal signaling, data on all ligands and receptors will
be deposited and categorized in searchable databases with
links to biomedical literature and gene sequences, protein
structures, and other resources. These resources will be complemented by databases for specific endocrine organs [e.g.,
the Ovarian Kaleidoscope database, (158)] to provide an integrated view of organ physiology and pathophysiology.
We are moving from a gene-by-gene approach to a reconstructionistic one. Genes in the hormonal signaling system
can be analyzed from the combined perspectives of traditional endocrinologists, molecular biologists, developmental
biologists, cell biologists, evolution biologists, structural biologists, and geneticists. In the postgenomic era, the scientific
community as a whole, as opposed to an individual laboratory, plays an increasingly important role in biomedical research. Eventually, the massive genomic data gathered with
different methods (e.g., DNA arrays and proteomics) will be
integrated to allow a global view of the system (157). Recently, steroid receptors and related orphan receptors have
been grouped in the context of all transcription factors (159),
whereas diverse plasma membrane receptors have been classified based on their evolutionary origin and relationship to
other plasma membrane signaling molecules (I. Ben-Shlomo,
S. Y. Hsu, and A. J. W. Hsueh, submitted). In this way, new
metapatterns can be discovered, and pathways spanning the
scope of different methods can be understood ranging from
regulatory DNA sequences to gene transcript expression to
Endocrine Reviews, June 2002, 23(3):369 –381
377
protein function. This phase relies heavily on collaboration
within the biomedical community because no single laboratory is likely to have the expertise, time, and resources to
generate such large datasets using different high-throughput
methods. This transition of hormonal research from a single
laboratory approach to an integrated community effort will
pose an important challenge for the next generation of endocrine researchers.
Acknowledgments
Address all correspondence and requests for reprints to: Aaron J. W.
Hseuh, Ph.D., Division of Reproductive Biology, Department of Gynecology and Obstetrics, Stanford University Medical Center, 300 Pasteur
Drive, Room A344, Stanford, California 94305-5317. E-mail: aaron.
[email protected]
This work was supported by NIH Grants HD-23273, HD-31398, and
DK-58534.
* Current address for C.P.L.: MBA Programme, Institute Europeen
d’Administration des Affaires, Boulevard de Constance, 77305 Fontainebleau Cedex, France.
References
1. Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin
J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D,
Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R,
McEwan P, McKernan K, et al. 2001 Initial sequencing and analysis
of the human genome. Nature 409:860 –921
2. Venter JC, Adams MD, Myers EW, Li PW, Mural, RJ, Sutton GG,
Smith HO, Yandell M, Evans CA, Holt RA, Gocayne JD, Amanatides, P, Ballew RM, Huson DH, Wortman JR, Zhang Q, Kodira CD,
Zheng XH, Chen L, Skupski M, et al. 2001 The sequence of the
human genome. Science 291:1304 –1351
3. Gerisch B, Weitzel C, Kober-Eisermann C, Rottiers V, Antebi A
2001 A hormonal signaling pathway influencing C. elegans metabolism, reproductive development, and life span. Dev Cell 1:841– 851
4. Kudo M, Chen T, Nakabayashi K, Hsu SY, Hsueh AJ 2000 The
nematode leucine-rich repeat-containing, G protein-coupled receptor (LGR) protein homologous to vertebrate gonadotropin and
thyrotropin receptors is constitutively active in mammalian cells.
Mol Endocrinol 14:272–284
5. Blattner FR, Plunkett III G, Bloch CA, Perna NT, Burland V, Riley
M, Collado-Vides J, Glasner JD, Rode CK, Mayhew GF, Gregor
J, Davis NW, Kirkpatrick HA, Goeden MA, Rose DJ, Mau B, Shao
Y 1997 The complete genome sequence of Escherichia coli K-12.
Science 277:1453–1474
6. Goffeau A, Barrell BG, Bussey H, Davis RW, Dujon B, Feldmann
H, Galibert F, Hoheisel JD, Jacq C, Johnston M, Louis EJ, Mewes
HW, Murakami Y, Philippsen P, Tettelin H, Oliver SG 1996 Life
with 6000 genes. Science 274:546, 563–547
7. The C. elegans Sequencing Consortium 1998 Genome sequence of
the nematode C. elegans: a platform for investigating biology. Science 282:2012–2018
8. Adams MD, Celniker SE, Holt RA, Evans CA, Gocayne JD,
Amanatides PG, Scherer SE, Li PW, Hoskins RA, Galle RF,
George RA, Lewis SE, Richards S, Ashburner M, Henderson SN,
Sutton GG, Wortman JR, Yandell MD, Zhang Q, Chen LX, et al.
2000 The genome sequence of Drosophila melanogaster. Science 287:
2185–2195
9. Stein L 2001 Genome annotation: from sequence to biology. Nat
Rev Genet 2:493–503
10. Baltimore D 2001 Our genome unveiled. Nature 409:814 – 816
11. Bailey Jr LC, Searls DB, Overton GC 1998 Analysis of EST-driven
gene annotation in human genomic sequence. Genome Res 8:
362–376
12. Basrai MA, Hieter P, Boeke JD 1997 Small open reading frames:
beautiful needles in the haystack. Genome Res 7:768 –771
378
Endocrine Reviews, June 2002, 23(3):369 –381
13. Burge C, Karlin S 1997 Prediction of complete gene structures in
human genomic DNA. J Mol Biol 268:78 –94
14. Reese MG, Kulp D, Tammana H, Haussler D 2000 Genie– gene
finding in Drosophila melanogaster. Genome Res 10:529 –538
15. Guigo R, Agarwal P, Abril JF, Burset M, Fickett JW 2000 An
assessment of gene prediction accuracy in large DNA sequences.
Genome Res 10:1631–1642
16. Howard AD, McAllister G, Feighner SD, Liu Q, Nargund RP, Van
der Ploeg LH, Patchett AA 2001 Orphan G-protein-coupled receptors and natural ligand discovery. Trends Pharmacol Sci 22:
132–140
17. Cascieri MA, Springer MS 2000 The chemokine/chemokinereceptor family: potential and progress for therapeutic intervention. Curr Opin Chem Biol 4:420 – 427
18. Busfield SJ, Comrack CA, Yu G, Chickering TW, Smutko JS,
Zhou H, Leiby KR, Holmgren LM, Gearing DP, Pan Y 2000 Identification and gene organization of three novel members of the IL-1
family on human chromosome 2. Genomics 66:213–216
19. Debets R, Timans JC, Homey B, Zurawski S, Sana TR, Lo S,
Wagner J, Edwards G, Clifford T, Menon S, Bazan JF, Kastelein
RA 2001 Two novel IL-1 family members, IL-1␦ and IL-1⑀, function
as an antagonist and agonist of NF-␬B activation through the orphan IL-1 receptor-related protein 2. J Immunol 167:1440 –1446
20. Rock FL, Hardiman G, Timans JC, Kastelein RA, Bazan JF 1998
A family of human receptors structurally related to Drosophila Toll.
Proc Natl Acad Sci USA 95:588 –593
21. O’Neill LA 2000 The interleukin-1 receptor/Toll-like receptor superfamily: signal transduction during inflammation and host defense. Sci STKE 44:RE1; 1–11 (http://stke.sciencemag.org)
22. Glusman G, Yanai I, Rubin I, Lancet D 2001 The complete human
olfactory subgenome. Genome Res 11:685–702
23. Zozulya S, Echeverri F, Nguyen T 2001 The human olfactory
receptor repertoire. Genome Biol 2(6):research0018.1– 0018.12
(http://genomebiology.com)
24. Zhang X, Firestein S 2002 The olfactory receptor gene superfamily
of the mouse. Nat Neurosci 5:124 –133
25. Neill JD 2002 GnRH and GnRH receptor genes in the human
genome. Endocrinology 143:737–743
26. Okubo K, Nagata S, Ko R, Kataoka H, Yoshiura Y, Mitani H,
Kondo M, Naruse K, Shima A, Aida K 2001 Identification and
characterization of two distinct GnRH receptor subtypes in a
teleost, the medaka Oryzias latipes. Endocrinology 142:4729 – 4739
27. Millar R, Lowe S, Conklin D, Pawson A, Maudsley S, Troskie B,
Ott T, Millar M, Lincoln G, Sellar R, Faurholm B, Scobie G,
Kuestner R, Terasawa E, Katz A 2001 A novel mammalian receptor
for the evolutionarily conserved type II GnRH. Proc Natl Acad Sci
USA 98:9636 –9641
28. Faurholm B, Millar RP, Katz AA 2001 The genes encoding the type
II gonadotropin-releasing hormone receptor and the ribonucleoprotein RBM8A in humans overlap in two genomic loci. Genomics
78:15–18
29. Neill JD, Duck LW, Sellers JC, Musgrove LC 2001 A gonadotropin-releasing hormone (GnRH) receptor specific for GnRH II in
primates. Biochem Biophys Res Commun 282:1012–1018
30. Vitt UA, Hsu SY, Hsueh AJ 2001 Evolution and classification of
cystine knot-containing hormones and related extracellular signaling molecules. Mol Endocrinol 15:681– 694
31. Meyer A, Schartl M 1999 Gene and genome duplications in vertebrates: the one-to-four (-to-eight in fish) rule and the evolution of
novel gene functions. Curr Opin Cell Biol 11:699 –704
32. Pebusque MJ, Coulier F, Birnbaum D, Pontarotti P 1998 Ancient
large-scale genome duplications: phylogenetic and linkage analyses shed light on chordate genome evolution. Mol Biol Evol 15:
1145–1159
33. Pierce SB, Costa M, Wisotzkey R, Devadhar S, Homburger SA,
Buchman AR, Ferguson KC, Heller J, Platt DM, Pasquinelli AA,
Liu LX, Doberstein SK, Ruvkun G 2001 Regulation of DAF-2
receptor signaling by human insulin and ins-1, a member of the
unusually large and diverse C. elegans insulin gene family. Genes
Dev 15:672– 686
34. Hsu SY 1999 Cloning of two novel mammalian paralogs of relaxin/
insulin family proteins and their expression in testis and kidney.
Mol Endocrinol 13:2163–2174
Leo et al. • Hormonal Genomics
35. Bathgate RA, Samuel CS, Burazin TC, Layfield S, Claasz AA,
Reytomas IG, Dawson NF, Zhao C, Bond C, Summers RJ, Parry
LJ, Wade JD, Tregear GW 2002 Human relaxin gene 3 (H3) and the
equivalent mouse relaxin (M3) gene. Novel members of the relaxin
peptide family. J Biol Chem 277:1148 –1157
36. Hsu SY, Kudo M, Chen T, Nakabayashi K, Bhalla A, van der Spek
PJ, van Duin M, Hsueh AJ 2000 The three subfamilies of leucinerich repeat-containing G protein-coupled receptors (LGR): identification of LGR6 and LGR7 and the signaling mechanism for LGR7.
Mol Endocrinol 14:1257–1271
37. Hsu SY, Liang SG, Hsueh AJ 1998 Characterization of two LGR
genes homologous to gonadotropin and thyrotropin receptors with
extracellular leucine-rich repeats and a G protein-coupled, seventransmembrane region. Mol Endocrinol 12:1830 –1845
38. Nishi S, Hsu SY, Zell K, Hsueh AJ 2000 Characterization of two
fly LGR (leucine-rich repeat-containing, G protein-coupled receptor) proteins homologous to vertebrate glycoprotein hormone receptors: constitutive activation of wild-type fly LGR1 but not LGR2
in transfected mammalian cells. Endocrinology 141:4081– 4090
39. Hsu SY, Nakabayashi K, Nishi S, Kumagai J, Kudo M, Sherwood
OD, Hsueh AJ 2002 Activation of orphan receptors by the hormone
relaxin. Science 295:671– 674
40. Conti M, Jin SL 1999 The molecular biology of cyclic nucleotide
phosphodiesterases. Prog Nucleic Acid Res Mol Biol 63:1–38
41. Koyanagi M, Suga H, Hoshiyama D, Ono K, Iwabe N, Kuma K,
Miyata T 1998 Ancient gene duplication and domain shuffling in
the animal cyclic nucleotide phosphodiesterase family. FEBS Lett
436:323–328
42. Hsu SY, Hsueh AJ 2001 Human stresscopin and stresscopinrelated peptide are selective ligands for the type 2 corticotropinreleasing hormone receptor. Nat Med 7:605– 611
43. Reyes TM, Lewis K, Perrin MH, Kunitake KS, Vaughan J, Arias
CA, Hogenesch JB, Gulyas J, Rivier J, Vale WW, Sawchenko PE
2001 Urocortin II: a member of the corticotropin-releasing factor
(CRF) neuropeptide family that is selectively bound by type 2 CRF
receptors. Proc Natl Acad Sci USA 98:2843–2848
44. Lewis K, Li C, Perrin MH, Blount A, Kunitake K, Donaldson C,
Vaughan J, Reyes TM, Gulyas J, Fischer W, Bilezikjian L, Rivier
J, Sawchenko PE, Vale WW 2001 Identification of urocortin III, an
additional member of the corticotropin-releasing factor (CRF) family with high affinity for the CRF2 receptor. Proc Natl Acad Sci USA
98:7570 –7575
45. Sherwood NM, Krueckl SL, McRory JE 2000 The origin and function of the pituitary adenylate cyclase-activating polypeptide
(PACAP)/glucagon superfamily. Endocr Rev 21:619 – 670
46. Maston GA, Ruvolo M 2002 Chorionic gonadotropin has a recent
origin within primates and an evolutionary history of selection.
Mol Biol Evol 19:320 –335
47. Liu JC, Makova KD, Adkins RM, Gibson S, Li WH 2001 Episodic
evolution of growth hormone in primates and emergence of the
species specificity of human growth hormone receptor. Mol Biol
Evol 18:945–953
48. Thornton JW 2001 Evolution of vertebrate steroid receptors from
an ancestral estrogen receptor by ligand exploitation and serial
genome expansions. Proc Natl Acad Sci USA 98:5671–5676
49. Kwitek AE, Tonellato PJ, Chen D, Gullings-Handley J, Cheng YS,
Twigger S, Scheetz TE, Casavant TL, Stoll M, Nobrega MA,
Shiozawa M, Soares MB, Sheffield VC, Jacob HJ 2001 Automated
construction of high-density comparative maps between rat, human, and mouse. Genome Res 11:1935–1943
50. Dong J, Albertini DF, Nishimori K, Kumar TR, Lu N, Matzuk
MM 1996 Growth differentiation factor-9 is required during early
ovarian folliculogenesis. Nature 383:531–535
51. Dube JL, Wang P, Elvin J, Lyons KM, Celeste AJ, Matzuk MM
1998 The bone morphogenetic protein 15 gene is X-linked and
expressed in oocytes. Mol Endocrinol 12:1809 –1817
52. Laitinen M, Vuojolainen K, Jaatinen R, Ketola I, Aaltonen J,
Lehtonen E, Heikinheimo M, Ritvos O 1998 A novel growth
differentiation factor-9 (GDF-9) related factor is co-expressed with
GDF-9 in mouse oocytes during folliculogenesis. Mech Dev 78:
135–140
53. Otsuka F, Yamamoto S, Erickson GF, Shimasaki S 2001 Bone
morphogenetic protein-15 inhibits follicle-stimulating hormone
Leo et al. • Hormonal Genomics
54.
55.
56.
57.
58.
59.
60.
61.
62.
63.
64.
65.
66.
67.
68.
69.
70.
71.
72.
(FSH) action by suppressing FSH receptor expression. J Biol Chem
276:11387–11392
Galloway SM, McNatty KP, Cambridge LM, Laitinen MP, Juengel JL, Jokiranta TS, McLaren RJ, Luiro K, Dodds KG, Montgomery GW, Beattie AE, Davis GH, Ritvos O 2000 Mutations in
an oocyte-derived growth factor gene (BMP15) cause increased
ovulation rate and infertility in a dosage-sensitive manner. Nat
Genet 25:279 –283
Seminara SB, Hayes FJ, Crowley Jr WF 1998 Gonadotropinreleasing hormone deficiency in the human (idiopathic hypogonadotropic hypogonadism and Kallmann’s syndrome): pathophysiological and genetic considerations. Endocr Rev 19:521–539
Martul P, Pineda J, Levilliers J, Vazquez JA, Rodriguez-Soriano
J, Loridan L, Diaz-Perez JL 1995 Hypogonadotrophic hypogonadism with hyposmia, X-linked ichthyosis, and renal malformation
syndrome. Clin Endocrinol (Oxf) 42:121–128
Stringer CB, Andrews P 1988 Genetic and fossil evidence for the
origin of modern humans. Science 239:1263–1268
Wang DG, Fan J-B, Siao C-J, Berno A, Young P, Sapolsky R,
Ghandour G, Perkins N, Winchester E, Spencer J, Kruglyak L,
Stein L, Hsie L, Topaloglou T, Hubbell E, Robinson E, Mittmann
M, Morris MS, Shen N, Kilburn D, Rioux J, Nusbaum C, Rozen S,
Hudson TJ, Lipshutz R, Chee M, Lander ES 1998 Large-scale
identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. Science 280:1077–1082
Smigielski EM, Sirotkin K, Ward M, Sherry ST 2000 dbSNP: a
database of single nucleotide polymorphisms. Nucleic Acids Res
28:352–355
Evans WE, Relling MV 1999 Pharmacogenomics: translating functional genomics into rational therapeutics. Science 286:487– 491
Michael NL, Louie LG, Rohrbaugh AL, Schultz KA, Dayhoff DE,
Wang CE, Sheppard HW 1997 The role of CCR5 and CCR2 polymorphisms in HIV-1 transmission and disease progression. Nat
Med 3:1160 –1162
Liu H, Chao D, Nakayama EE, Taguchi H, Goto M, Xin X, Takamatsu JK, Saito H, Ishikawa Y, Akaza T, Juji T, Takebe Y, Ohishi
T, Fukutake K, Maruyama Y, Yashiki S, Sonoda S, Nakamura T,
Nagai Y, Iwamoto A, Shioda T 1999 Polymorphism in RANTES
chemokine promoter affects HIV-1 disease progression. Proc Natl
Acad Sci USA 96:4581– 4585
Buscher R, Herrmann V, Insel PA 1999 Human adrenoceptor
polymorphisms: evolving recognition of clinical importance.
Trends Pharmacol Sci 20:94 –99
El-Omar EM, Carrington M, Chow WH, McColl KE, Bream JH,
Young HA, Herrera J, Lissowska J, Yuan CC, Rothman N, Lanyon
G, Martin M, Fraumeni Jr JF, Rabkin CS 2000 Interleukin-1 polymorphisms associated with increased risk of gastric cancer. Nature
404:398 – 402
Perez Mayorga M, Gromoll J, Behre HM, Gassner C, Nieschlag
E, Simoni M 2000 Ovarian response to follicle-stimulating hormone (FSH) stimulation depends on the FSH receptor genotype.
J Clin Endocrinol Metab 85:3365–3369
Hopp TA, Fuqua SA 1998 Estrogen receptor variants. J Mammary
Gland Biol Neoplasia 3:73– 83
Dowsett M, Daffada A, Chan CM, Johnston SR 1997 Oestrogen
receptor mutants and variants in breast cancer. Eur J Cancer 33:
1177–1183
Rana BK, Shiina T, Insel PA 2001 Genetic variations and polymorphisms of G protein-coupled receptors: functional and therapeutic implications. Annu Rev Pharmacol Toxicol 41:593– 624
Thomson G 2001 Significance levels in genome scans. Adv Genet
42:475– 486
Altshuler D, Hirschhorn JN, Klannemark M, Lindgren CM, Vohl
MC, Nemesh J, Lane CR, Schaffner SF, Bolk S, Brewer C, Tuomi
T, Gaudet D, Hudson TJ, Daly M, Groop L, Lander ES 2000 The
common PPAR␥ Pro12Ala polymorphism is associated with decreased risk of type 2 diabetes. Nat Genet 26:76 – 80
Timberlake DS, O’Connor DT, Parmer RJ 2001 Molecular genetics
of essential hypertension: recent results and emerging strategies.
Curr Opin Nephrol Hypertens 10:71–79
Brett D, Hanke J, Lehmann G, Haase S, Delbruck S, Krueger S,
Reich J, Bork P 2000 EST comparison indicates 38% of human
Endocrine Reviews, June 2002, 23(3):369 –381
73.
74.
75.
76.
77.
78.
79.
80.
81.
82.
83.
84.
85.
86.
87.
88.
89.
90.
91.
92.
93.
94.
95.
96.
379
mRNAs contain possible alternative splice forms. FEBS Lett 474:
83– 86
Hanke J, Brett D, Zastrow I, Aydin A, Delbruck S, Lehmann G,
Luft F, Reich J, Bork P 1999 Alternative splicing of human genes:
more the rule than the exception? Trends Genet 15:389 –390
Mironov AA, Fickett JW, Gelfand MS 1999 Frequent alternative
splicing of human genes. Genome Res 9:1288 –1293
Brett D, Pospisil H, Valcarcel J, Reich J, Bork P 2001 Alternative
splicing and genome complexity. Nat Genet 30:29 –30
Kan Z, Rouchka EC, Gish WR, States DJ 2001 Gene structure
prediction and alternative splicing analysis using genomically
aligned ESTs. Genome Res 11:889 –900
Modrek B, Lee C 2001 A genomic view of alternative splicing. Nat
Genet 30:13–19
Black DL 2000 Protein diversity from alternative splicing: a challenge for bioinformatics and post-genome biology. Cell 103:
367–370
Hu RM, Han ZG, Song HD, Peng YD, Huang QH, Ren SX, Gu YJ,
Huang CH, Li YB, Jiang CL, Fu G, Zhang QH, Gu BW, Dai M, Mao
YF, Gao GF, Rong R, Ye M, Zhou J, Xu SH, Gu J, Shi JX, Jin WR,
Zhang CK, Wu TM, Huang GY, Chen Z, Chen MD, Chen JL 2000
Gene expression profiling in the human hypothalamus-pituitaryadrenal axis and full-length cDNA cloning. Proc Natl Acad Sci USA
97:9543–9548
Korf I, Flicek P, Duan D, Brent MR 2001 Integrating genomic
homology into gene structure prediction. Bioinformatics 17(Suppl
1):S140 –148
Hu GK, Madore SJ, Moldover B, Jatkoe T, Balaban D, Thomas J,
Wang Y 2001 Predicting splice variant from DNA chip expression
data. Genome Res 11:1237–1245
Akker SA, Smith PJ, Chew SL 2001 Nuclear post-transcriptional
control of gene expression. J Mol Endocrinol 27:123–131
Xie J, McCobb DP 1998 Control of alternative splicing of potassium
channels by stress hormones. Science 280:443– 446
Lou H, Gagel RF 2001 Alternative ribonucleic acid processing in
endocrine systems. Endocr Rev 22:205–225
He Y, Smith SK, Day KA, Clark DE, Licence DR, Charnock-Jones
DS 1999 Alternative splicing of vascular endothelial growth factor
(VEGF)-R1 (FLT-1) pre-mRNA is important for the regulation of
VEGF activity. Mol Endocrinol 13:537–545
Grunstein J, Masbad JJ, Hickey R, Giordano F, Johnson RS 2000
Isoforms of vascular endothelial growth factor act in a coordinate
fashion To recruit and expand tumor vasculature. Mol Cell Biol
20:7282–7291
Robinson CJ, Stringer SE 2001 The splice variants of vascular
endothelial growth factor (VEGF) and their receptors. J Cell Sci
114:853– 865
Norgren S, Zierath J, Wedell A, Wallberg-Henriksson H, Luthman H 1994 Regulation of human insulin receptor RNA splicing in
vivo. Proc Natl Acad Sci USA 91:1465–1469
Pisarchik A, Slominski AT 2001 Alternative splicing of CRH-R1
receptors in human and mouse skin: identification of new variants
and their differential expression. FASEB J 15:2754 –2756
Mantovani A, Locati M, Vecchi A, Sozzani S, Allavena P 2001
Decoy receptors: a strategy to regulate inflammatory cytokines and
chemokines. Trends Immunol 22:328 –336
Jiang ZH, Wu JY 1999 Alternative splicing and programmed cell
death. Proc Soc Exp Biol Med 220:64 –72
Bae J, Leo CP, Hsu SY, Hsueh AJ 2000 MCL-1S, a splicing variant
of the antiapoptotic BCL-2 family member MCL-1, encodes a proapoptotic protein possessing only the BH3 domain. J Biol Chem
275:25255–25261
Dralyuk I, Brudno M, Gelfand MS, Zorn M, Dubchak I 2000
ASDB: database of alternatively spliced genes. Nucleic Acids Res
28:296 –297
Ji H, Zhou Q, Wen F, Xia H, Lu X, Li Y 2001 AsMamDB: an
alternative splice database of mammals. Nucleic Acids Res 29:
260 –263
Huang YH, Chen YT, Lai JJ, Yang ST, Yang UC 2002 PALS db:
putative alternative splicing database. Nucleic Acids Res 30:
186 –190
Zhan HC, Liu DP, Liang CC 2001 Insulator: from chromatin domain boundary to gene regulation. Hum Genet 109:471– 478
380
Endocrine Reviews, June 2002, 23(3):369 –381
97. Burke LJ, Baniahmad A 2000 Co-repressors 2000. FASEB J 14:
1876 –1888
98. Pennacchio LA, Rubin EM 2001 Genomic strategies to identify
mammalian regulatory sequences. Nat Rev Genet 2:100 –109
99. Heinemeyer T, Wingender E, Reuter I, Hermjakob H, Kel AE, Kel
OV, Ignatieva EV, Ananko EA, Podkolodnaya OA, Kolpakov FA,
Podkolodny NL, Kolchanov NA 1998 Databases on transcriptional
regulation: TRANSFAC, TRRD and COMPEL. Nucleic Acids Res
26:362–367
100. Ghosh D 2000 Object-oriented transcription factors database
(ooTFD). Nucleic Acids Res 28:308 –310
101. Praz V, Perier R, Bonnard C, Bucher P 2002 The eukaryotic promoter database, EPD: new entry types and links to gene expression
data. Nucleic Acids Res 30:322–324
102. Wingender E, Chen X, Hehl R, Karas H, Liebich I, Matys V,
Meinhardt T, Pruss M, Reuter I, Schacherer F 2000 TRANSFAC:
an integrated system for gene expression regulation. Nucleic Acids
Res 28:316 –319
103. Lacy DA, Wang ZE, Symula DJ, McArthur CJ, Rubin EM, Frazer
KA, Locksley RM 2000 Faithful expression of the human 5q31
cytokine cluster in transgenic mice. J Immunol 164:4569 – 4574
104. Hardison RC, Oeltjen J, Miller W 1997 Long human-mouse sequence alignments reveal novel regulatory elements: a reason to
sequence the mouse genome. Genome Res 7:959 –966
105. Hardison RC 2000 Conserved noncoding sequences are reliable
guides to regulatory elements. Trends Genet 16:369 –372
106. Schwartz S, Zhang Z, Frazer KA, Smit A, Riemer C, Bouck J,
Gibbs R, Hardison R, Miller W 2000 PipMaker—a web server for
aligning two genomic DNA sequences. Genome Res 10:577–586
107. Wasserman WW, Palumbo M, Thompson W, Fickett JW, Lawrence CE 2000 Human-mouse genome comparisons to locate regulatory sites. Nat Genet 26:225–228
108. Chu S, DeRisi J, Eisen M, Mulholland J, Botstein D, Brown PO,
Herskowitz I 1998 The transcriptional program of sporulation in
budding yeast. Science 282:699 –705
109. Wolfsberg TG, Gabrielian AE, Campbell MJ, Cho RJ, Spouge JL,
Landsman D 1999 Candidate regulatory sequence elements for cell
cycle-dependent transcription in Saccharomyces cerevisiae. Genome
Res 9:775–792
110. Tavazoie S, Hughes JD, Campbell MJ, Cho RJ, Church GM 1999
Systematic determination of genetic network architecture. Nat
Genet 22:281–285
111. Bussemaker HJ, Li H, Siggia ED 2001 Regulatory element detection using correlation with expression. Nat Genet 27:167–171
112. Pilpel Y, Sudarsanam P, Church GM 2001 Identifying regulatory
networks by combinatorial analysis of promoter elements. Nat
Genet 29:153–159
113. Birnbaum K, Benfey PN, Shasha DE 2001 cis Element/transcription factor analysis (cis/TF): a method for discovering transcription
factor/cis element relationships. Genome Res 11:1567–1573
114. Koszewski NJ, Kiessling S, Malluche HH 2001 Isolation of
genomic DNA sequences that bind vitamin D receptor complexes.
Biochem Biophys Res Commun 283:188 –194
115. Fessele S, Boehlk S, Mojaat A, Miyamoto NG, Werner T, Nelson
EL, Schlondorff D, Nelson PJ 2001 Molecular and in silico characterization of a promoter module and C/EBP element that mediate LPS-induced RANTES/CCL5 expression in monocytic cells.
FASEB J 15:577–579
116. Fessele S, Maier H, Zischek C, Nelson PJ, Werner T 2002 Regulatory context is a crucial part of gene function. Trends Genet
18:60 – 63
117. Adams MD, Kelley JM, Gocayne JD, Dubnick M, Polymeropoulos
MH, Xiao H, Merril CR, Wu A, Olde B, Moreno RF, Kerlavage AR,
McCombie WR, Venter JC 1991 Complementary DNA sequencing:
expressed sequence tags and human genome project. Science 252:
1651–1656
118. Adams MD, Kerlavage AR, Fleischmann RD, Fuldner RA, Bult
CJ, Lee NH, Kirkness EF, Weinstock KG, Gocayne JD, White O,
Suttor G, Blake JA, Brandon RC, Chiu M, Clayton RA, Cline RT,
Cotton MD, Earle-Hughes J, Fine LD, FitzGerald LM,
FitzHugh WM, Fritchman JL, Geoghagen NSM, Glodek A,
Gnehm CL, Hanna MC, Hedblom E, Hinkler Jr PS, Kelley JM,
Klimek KM, Kelley JC, Liu L, Marmaros SM, Merrick JM,
Leo et al. • Hormonal Genomics
119.
120.
121.
122.
123.
124.
125.
126.
127.
128.
129.
130.
131.
132.
133.
134.
135.
136.
Moreno-Palanques RF, McDonald LA, Nguyen DT, Pellegrino SM, Phillips CA, Ryder SE, Scott JL, Saudek DM, Shirley R,
Small KV, Spriggs TA, Utterback TR, Weidman JF, Li Y, Barthlow R, Bednarik DP, Cao L, Cepeda MA, Coleman TA, Collins E,
Dimke D, Feng P, Ferrie A, Fischer C, Hastings GA, He W, Hu J,
Huddleston KA, Greene JM, Gruber J, Hudson P, Kim A,
Kozak DL, Kunsch C, Ji H, Li H, Meissner PS, Olsen H, Raymond L, Wei Y, Wing J, Xu C, Yu G, Ruben SM, Dillon PJ,
Fannon MR, Rosen CA, Haseltine WA, Fields C, Fraser CM,
Venter JC 1995 Initial assessment of human gene diversity and
expression patterns based upon 83 million nucleotides of cDNA
sequence. Nature 377:3–174
Lee NH, Weinstock KG, Kirkness EF, Earle-Hughes JA, Fuldner
RA, Marmaros S, Glodek A, Gocayne JD, Adams MD, Kerlavage
AR, Fraser CM, Venter JC 1995 Comparative expressed-sequencetag analysis of differential gene expression profiles in PC-12 cells
before and after nerve growth factor treatment. Proc Natl Acad Sci
USA 92:8303– 8307
Rajkovic A, Yan MSC, Klysik M, Matzuk M 2001 Discovery of
germ cell-specific transcripts by expressed sequence tag database
analysis. Fertil Steril 76:550 –554
Vasmatzis G, Essand M, Brinkmann U, Lee B, Pastan I 1998
Discovery of three genes specifically expressed in human prostate
by expressed sequence tag database analysis. Proc Natl Acad Sci
USA 95:300 –304
Olsson P, Bera TK, Essand M, Kumar V, Duray P, Vincent J, Lee
B, Pastan I 2001 GDEP, a new gene differentially expressed in
normal prostate and prostate cancer. Prostate 48:231–241
Olesen C, Hansen C, Bendsen E, Byskov AG, Schwinger E, LopezPajares I, Jensen PK, Kristoffersson U, Schubert R, Van Assche
E, Wahlstroem J, Lespinasse J, Tommerup N 2001 Identification of
human candidate genes for male infertility by digital differential
display. Mol Hum Reprod 7:11–20
Velculescu VE, Zhang L, Vogelstein B, Kinzler KW 1995 Serial
analysis of gene expression. Science 270:484 – 487
Lal A, Lash AE, Altschul SF, Velculescu V, Zhang L, McLendon
RE, Marra MA, Prange C, Morin PJ, Polyak K, Papadopoulos N,
Vogelstein B, Kinzler KW, Strausberg RL, Riggins GJ 1999 A
public database for gene expression in human cancers. Cancer Res
59:5403–5407
Neilson L, Andalibi A, Kang D, Coutifaris C, Strauss III JF,
Stanton JA, Green DP 2000 Molecular phenotype of the human
oocyte by PCR-SAGE. Genomics 63:13–24
Datson NA, van der Perk, J, de Kloet ER, Vreugdenhil E 2001
Identification of corticosteroid-responsive genes in rat hippocampus using serial analysis of gene expression. Eur J Neurosci 14:
675– 689
Pauws E, Moreno JC, Tijssen M, Baas F, de Vijlder JJ, Ris-Stalpers
C 2000 Serial analysis of gene expression as a tool to assess the
human thyroid expression profile and to identify novel thyroidal
genes. J Clin Endocrinol Metab 85:1923–1927
Moreno JC, Pauws E, van Kampen AH, Jedlickova M, de Vijlder
JJ, Ris-Stalpers C 2001 Cloning of tissue-specific genes using serial
analysis of gene expression and a novel computational subtraction
approach. Genomics 75:70 –76
Goffin V, Binart N, Clement-Lacroix P, Bouchard B, Bole-Feysot
C, Edery M, Lucas BK, Touraine P, Pezet A, Maaskant R, Pichard
C, Helloco C, Baran N, Favre H, Bernichtein S, Allamando A,
Ormandy C, Kelly PA 1999 From the molecular biology of prolactin and its receptor to the lessons learned from knockout mice
models. Genet Anal 15:189 –201
Ferea TL, Brown PO 1999 Observing the living genome. Curr Opin
Genet Dev 9:715–722
Schulze A, Downward J 2001 Navigating gene expression using
microarrays—a technology review. Nat Cell Biol 3:E190 –195
Young RA 2000 Biomedical discovery with DNA arrays. Cell 102:
9 –15
Quackenbush J 2001 Computational analysis of microarray data.
Nat Rev Genet 2:418 – 427
Eisen MB, Spellman PT, Brown PO, Botstein D 1998 Cluster
analysis and display of genome-wide expression patterns. Proc
Natl Acad Sci USA 95:14863–14868
Walker MG, Volkmuth W, Sprinzak E, Hodgson D, Klingler T
Leo et al. • Hormonal Genomics
137.
138.
139.
140.
141.
142.
143.
144.
145.
146.
147.
1999 Prediction of gene function by genome-scale expression analysis: prostate cancer-associated genes. Genome Res 9:1198 –1203
Rainey WE, Carr BR, Wang ZN, Parker Jr CR 2001 Gene profiling
of human fetal and adult adrenals. J Endocrinol 171:209 –215
Flores-Morales A, Stahlberg N, Tollet-Egnell P, Lundeberg J,
Malek RL, Quackenbush J, Lee NH, Norstedt G 2001 Microarray
analysis of the in vivo effects of hypophysectomy and growth hormone treatment on gene expression in the rat. Endocrinology 142:
3163–3176
Douglas KR, Camper SA 2000 Partial transcriptome of the developing pituitary gland. Genomics 70:335–346
Nadler ST, Stoehr JP, Schueler KL, Tanimoto G, Yandell BS, Attie
AD 2000 The expression of adipogenic genes is decreased in obesity
and diabetes mellitus. Proc Natl Acad Sci USA 97:11371–11376
Leo CP, Pisarska MD, Hsueh AJ 2001 DNA array analysis of
changes in preovulatory gene expression in the rat ovary. Biol
Reprod 65:269 –276
Yoshioka K, Matsuda F, Takakura K, Noda Y, Imakawa K, Sakai
S 2000 Determination of genes involved in the process of implantation: application of GeneChip to scan 6500 genes. Biochem Biophys Res Commun 272:531–538
Tanaka TS, Jaradat SA, Lim MK, Kargul GJ, Wang X, Grahovac
MJ, Pantano S, Sano Y, Piao Y, Nagaraja R, Doi H, Wood III WH,
Becker KG, Ko MS 2000 Genome-wide expression profiling of
mid-gestation placenta and embryo using a 15,000 mouse developmental cDNA microarray. Proc Natl Acad Sci USA 97:9127–9132
Perou CM, Jeffrey SS, van de Rijn M, Rees CA, Eisen MB, Ross
DT, Pergamenschikov A, Williams CF, Zhu SX, Lee JC, Lashkari
D, Shalon D, Brown PO, Botstein D 1999 Distinctive gene expression patterns in human mammary epithelial cells and breast cancers. Proc Natl Acad Sci USA 96:9212–9217
Perou CM, Sorlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees
CA, Pollack JR, Ross DT, Johnsen H, Akslen LA, Fluge O, Pergamenschikov A, Williams C, Zhu SX, Lonning PE, BorresenDale AL, Brown PO, Botstein D 2000 Molecular portraits of human
breast tumours. Nature 406:747–752
Khan J, Wei JS, Ringner M, Saal LH, Ladanyi M, Westermann F,
Berthold F, Schwab M, Antonescu CR, Peterson C, Meltzer PS
2001 Classification and diagnostic prediction of cancers using gene
expression profiling and artificial neural networks. Nat Med 7:
673– 679
Notterman DA, Alon U, Sierk AJ, Levine AJ 2001 Transcriptional
Endocrine Reviews, June 2002, 23(3):369 –381
148.
149.
150.
151.
152.
153.
154.
155.
156.
157.
158.
159.
gene expression profiles of colorectal adenoma, adenocarcinoma,
and normal tissue examined by oligonucleotide arrays. Cancer Res
61:3124 –3130
Leethanakul C, Patel V, Gillespie J, Pallente M, Ensley JF, Koontongkaew S, Liotta LA, Emmert-Buck M, Gutkind JS 2000 Distinct
pattern of expression of differentiation and growth-related genes
in squamous cell carcinomas of the head and neck revealed by the
use of laser capture microdissection and cDNA arrays. Oncogene
19:3220 –3224
Luzzi V, Holtschlag V, Watson MA 2001 Expression profiling of
ductal carcinoma in situ by laser capture microdissection and highdensity oligonucleotide arrays. Am J Pathol 158:2005–2010
Brady G 2000 Expression profiling of single mammalian cells—
small is beautiful. Yeast 17:211–217
Kim SK, Lund J, Kiraly M, Duke K, Jiang M, Stuart JM, Eizinger
A, Wylie BN, Davidson GS 2001 A gene expression map for Caenorhabditis elegans. Science 293:2087–2092
Redfern CH, Degtyarev MY, Kwa AT, Salomonis N, Cotte N,
Nanevicz T, Fidelman N, Desai K, Vranizan K, Lee EK, Coward
P, Shah N, Warrington JA, Fishman GI, Bernstein D, Baker AJ,
Conklin BR 2000 Conditional expression of a Gi-coupled receptor
causes ventricular conduction delay and a lethal cardiomyopathy.
Proc Natl Acad Sci USA 97:4826 – 4831
Reichhardt T 1999 It’s sink or swim as a tidal wave of data approaches. Nature 399:517–520
Edgar R, Domrachev M, Lash AE 2002 Gene expression omnibus:
NCBI gene expression and hybridization array data repository.
Nucleic Acids Res 30:207–210
Zambrowicz BP, Friedrich GA, Buxton EC, Lilleberg SL, Person
C, Sands AT 1998 Disruption and sequence identification of 2,000
genes in mouse embryonic stem cells. Nature 392:608 – 611
Skarnes WC, Moss JE, Hurtley SM, Beddington RS 1995 Capturing genes encoding membrane and secreted proteins important
for mouse development. Proc Natl Acad Sci USA 92:6592– 6596
Vidal M 2001 A biological atlas of functional maps. Cell 104:
333–339
Leo CP, Vitt UA, Hsueh AJ 2000 The Ovarian Kaleidoscope database: an online resource for the ovarian research community.
Endocrinology 141:3052–3054
Brivanlou AH, Darnell Jr JE 2002 Signal transduction and the
control of gene expression. Science 295:813– 818
STADY III—International Symposium on Signal Transduction in Health and Disease
1– 4 October, 2002
Tel Aviv, Israel
For further information, contact Professor Zvi Naor at the following address:
Prof. Zvi Naor
Department of Biochemistry
Tel Aviv University
Ramat Aviv 69978
Israel
Telephone: ⫹972-3-6409032/6417057
Fax: ⫹972-3-6406834
E-mail: [email protected] or [email protected]
381