Combinatorial gene regulation by eukaryotic transcription

sb9103.qxd
12/16/1999 9:19 AM
Page 48
48
Combinatorial gene regulation by eukaryotic transcription factors
Lin Chen
Recent structure determinations of high order complexes of
eukaryotic transcription factors bound to DNA have revealed
that residues from their DNA-binding domains are involved in
protein–protein interactions between distinct factors.
Protein–protein interactions between transactivation domains
and coactivators have also been characterized in a number of
recently determined structures. These studies support the
combinatorial mechanism of transcription regulation in
eukaryotic cells and multicellular organisms.
Addresses
Department of Chemistry and Biochemistry, University of Colorado at
Boulder, Boulder, CO 80309-0215, USA;
e-mail: [email protected]
Figure 1
(a)
MCM1 MADS box dimer
C
C
N
Flexible linker
β hairpin
C
C
Modeled half-site
N
MATα2 homeodomain
N
Current Opinion in Structural Biology 1999, 9:48–55
http://biomednet.com/elecref/0959440X00900048
N
(b)
Flexible linker
© Elsevier Science Ltd ISSN 0959-440X
Abbreviations
AF-2
activation function-2
AP-1
activating protein-1
CBP
CREB-binding protein
CRE
cAMP-response element
CREB CRE-binding protein
GABP GA-binding protein
GTF
general transcription factor
IFN
interferon
IL
interleukin
KIX
CREB-binding domain of CBP
LBD
ligand-binding domain
NFAT
nuclear factor of activated T cells
κB nuclear factor κB
NF-κ
pKID
phosphorylated kinase-inducible domain
PPAR-γγ peroxisome proliferator-activated receptor-γ
RHR
Rel homology region
SRC
steroid receptor coactivator
STAT
signal transducer and activator of transcription
TAF
TBP-associated factor
TBP
TATA-box-binding protein
TCR
T-cell receptor
TF
transcription factor
VP-16 herpes simplex virus protein-16
Introduction
Regulated eukaryotic gene transcription involves the
assembly of an initiation complex at the core promoter
region and regulatory complexes at promoterenhancer/operator regions [1]. The core promoter complex
of RNA polymerase II contains multiple protein factors
(referred to as general transcription factors, GTFs), including the TATA-box-binding protein (TBP) and its
associated factors (TAFs). The structures of a number of
binary and ternary GTF–DNA complexes have been
determined in previous years [2–4]. More GTF complexes
will probably appear soon. Structural studies of eukaryotic
transcription factors continue to reveal novel DNA-binding motifs and variants of known motifs [5•,6•,7,8•]. Many
of these studies from the past year will not be discussed
'Arm'
β hairpin
MATα2
homeodomain
MCM1
MADS box dimer
Modeled half-site
Current Opinion in Structural Biology
The structure of MCM1 (residues 1–100) and MATα2 (residues
103–189) bound to DNA. On a natural a-specific operator, the MCM1
MADS box DNA-binding site is flanked on both sides by a MATα2 site.
The MADS box dimer and the left hand MATα2 homeodomain bound
to DNA is the crystallographically observed complex [9••]. The
MATα2–DNA complex on the right hand (light shading and thin stick
DNA drawing) was modeled manually to represent a
MATα2–MCM1–MATα2 complex on an a-specific operator. Each
subunit is shaded differently. Note that the β hairpin of MATα2 extends
the central β sheet of MCM1, which forms the major protein–protein
interaction interface. The β-strand interaction is probably conserved in
the modeled right-hand half-site, but the rest of the linker and the DNA
conformation may be different with different DNA spacing. (a) View
perpendicular to the MCM1 dimer dyad axis. (b) View from the top,
with a rotation 90° from (a). Note the conserved minor groove DNA
binding by the MATα2 homeodomain ‘arm’. All figures drawn using
MOLSCRIPT [52].
here because of the limited space. This review will focus
on recent structural studies of high order regulatory complexes [9••–11••] and activation domain–coactivator
complexes [12••–15••].
The activation of eukaryotic genes in vivo often requires
the coordinated binding of multiple transcription factors
to the promoter-enhancer region; many of these factors
are regulated by distinct signal transduction pathways. In
several cases, it has been shown that the binding of multiple transcription factors to a specific promoter-enhancer
sb9103.qxd
12/16/1999 9:19 AM
Page 49
Combinatorial gene regulation by eukaryotic transcription factors Chen
region is cooperative and requires a unique composition
and spatial arrangement of transcription-factor-binding
sites [16•,17,18]. The assembly of these enhancer complexes (also referred to as the enhanceosome [19]) is
facilitated by protein–protein interactions between
DNA-bound factors and protein-induced DNA bending.
Two features of enhanceosome assembly are especially
important for combinatorial transcription regulation: combinations of multiple transcription factors generate
diverse patterns of regulation [20,21] and highly cooperative binding ensures the specificity of transcriptional
control [22].
The enhanceosome may serve as a template for the
assembly of the core promoter complex by either direct or
coactivator-mediated interactions. Most eukaryotic transcription factors contain one or more transactivation
domain involved in interactions with downstream coactivators and GTFs [23]. These transactivation domains are
usually rich in proline and glutamine, or acidic amino
acids, and appear to be very flexible before binding to
their respective targets. This flexibility is probably the
main reason that the structural characterization of these
domains lagged behind that of the DNA-binding
domains. The recent identification of various targets of
transactivation domains, including TAFs, cAMPresponse element (CRE) binding protein (CREB)
binding proteins (CBPs/p300) and steroid receptor coactivators (SRCs), however, has allowed the structural
characterization of several transactivation domains in
complex with their respective target proteins [12••–15••].
Structures of eukaryotic transcription factor
complexes bound to DNA
In Saccharomyces cerevisiae, three transcription factors, the
homeodomain proteins MATα2 and MATaa1, and the
MADS box protein MCM1, play crucial roles in cell-type
specification. Various combinations of these three factors
and additional factors are involved in gene activation and
repression in different yeast cell types, providing the best
example of the combinatorial control of eukaryotic gene
regulation [24]. These transcription factors bind various
DNA sites cooperatively to form higher order complexes
that have distinct regulatory functions. The crystal structure of a MATα2–MATaa1–DNA ternary complex revealed
that cooperative DNA binding is induced by a C-terminal
amphipathic α helix of MATα2 bound to a hydrophobic
groove on the MATaa1 homeodomain [25]. In another complex, that of MATα2–MCM1–MATα2–DNA, an
N-terminal flexible linker of MATα2 has been suggested
by biochemical studies to interact with the DNA-binding
domain (referred to as the MADS box) of MCM1 to mediate cooperative DNA binding [26]. Indeed, the recent
crystal structure of a DNA-bound ternary complex of
MCM1 and MATα2 [9••] shows that part of this flexible
linker forms a β hairpin that binds to the outer strand of the
central β sheet of the MCM1 MADS box (Figure 1). The
binding interactions involve parallel β-strand hydrogen
49
Figure 2
Jun
Fos
NFAT1
RHR-N
Loop 1
NFAT1
RHR-C
Loop 2
Loop 3
DNA
Current Opinion in Structural Biology
The structure of NFAT1 (residues 399–678), Fos (residues 138–200)
and Jun (residues 254–315) bound to DNA [10••]. Multiple loops (loops
1, 2 and 3 are shown in this orientation) presented by the NFAT Nterminal RHR (RHR-N) contact Fos and Jun. Each component (Fos, Jun
and the two NFAT domains [RHR-N and RHR-C]) is shaded differently.
bonding of mainchain atoms and hydrophobic packing of
sidechains. β-strand-mediated interactions have also been
observed between TFIIA and TBP in TFIIA–TBP–DNA
complexes [3,4] and may be involved in a wide range of
physiological and pathological protein–protein interactions.
Combinatorial control of transcription activation in higher eukaryotic cells has been characterized biochemically
for several promoter-enhancers, including that of the
TCR α gene [18], the IFN-β gene [16•] and the interleukin (IL)-2 gene [27]. A crystallographic study of this
last was reported recently [10••]. The nuclear factor of
activated T cells (NFAT) and members of the activating
protein-1 (AP-1) transcription factor family (including
Fos and Jun) bind cooperatively to their target DNA sites
in the promoter-enhancer and participate in the transcriptional regulation of IL-2 and other immune response
genes [28]. Although NFAT is activated by calcium signals through calcineurin, the AP-1 transcription factors
are induced by agents that activate protein kinase C. The
DNA-binding domain of NFAT is distantly related to
that of the Rel family of transcription factors, including
the nuclear factor-KB (NF-KB) proteins. The DNA-binding domains of AP-1 proteins consist of the basic region,
leucine zipper (bZIP) motif. The DNA-binding domains
of NFAT, Fos and Jun are necessary and sufficient for
cooperative DNA binding. The recently determined
sb9103.qxd
12/16/1999 9:19 AM
50
Page 50
Protein–nucleic acid interactions
enhances GABPα DNA binding both by displacing the
C-terminal α helix from its inhibitory orientation and by
directly stabilizing the DNA-binding residues of
GABPα. GABPβ is also thought to mediate the higher
order assembly of a GABPβ/α complex on DNA (αβ–αβ
dimer) through its C-terminal leucine zipper motif.
Figure 3
GABPβ
GABPα
Ankyrin repeats
C-terminal
helix
Ets domain
DNA
Current Opinion in Structural Biology
Complex of GABPβ (residues 1–157) and GABPα (residues
311–430) bound to DNA [11••]. Each subunit is shaded differently
(GABPβ dark and GABPα light). Note that the GABPβ ankyrin repeats
bind to the GABPα Ets domain and the C-terminal α helix using the
extended loop tips. The ankyrin repeats of GABPβ do not contact the
DNA in this structure.
crystal structure of this complex [10••] shows
protein–protein interactions between NFAT and Fos–Jun
that involve multiple patches of surface residues on the
three proteins and that are facilitated by the bending of
the DNA and the Fos α helix (Figure 2). Previous biochemical studies had suggested that many of these
interacting residues were important for complex assembly [29–31]. In the ternary complex, the NFAT
DNA-binding domain seems to have moved closer to
Fos–Jun from its orientation in the binary NFAT–DNA
complex [32•]. The interaction surfaces are mostly
hydrophilic, with a small hydrophobic center.
DNA binding by a transcription factor can also be modulated by interactions with a protein that does not bind
DNA itself. One example is the GA-binding protein
(GABP) that binds DNA with a conserved GA sequence
motif, whose core DNA-binding complex structure was
recently determined [11••] (Figure 3). The two subunit
GABP complex is involved in the transcriptional regulation of mitochondrial protein genes and some viral
genes. GABPα belongs to the Ets family of transcription
factors, which have a ‘winged helix-turn-helix’ DNAbinding motif. GABPβ is a large protein that contains a
leucine zipper-like motif and a number of ankyrin
repeats, but it does not bind DNA directly. GABPβ binds
to the GABPα Ets motif and a C-terminal α helix using
the extended loop tips of its ankyrin repeats. GABPβ
Complexes of transactivation domains and
their target proteins
The complex structure of a CREB activation domain (the
phosphorylated kinase-inducible domain, pKID)
bound to its target (the KIX domain) on CBP has been
determined by NMR spectroscopy [12••]. Complex interactions between the herpes simplex virus protein-16
(VP16) activation domain and human TAF31 were also
characterized by an NMR study [13••]. More recently, the
crystal structure of the ligand-binding domain (LBD) of a
nuclear receptor, the peroxisome proliferator-activated
receptor-γ (PPAR-γ), which contains the activation function-2 (AF-2) motif, was determined in a ternary complex
with SRC-1 and the anti-diabetic ligand rosiglitazone
[14••,15••]. CBP, TAFs and SRC represent the major targets of transcription activators bound to upstream
enhancers. In both the pKID–KIX and the VP16–TAF
complexes, the transactivation domain presents an amphipathic α helix to a hydrophobic surface on their respective
target. These helices are flexible and largely unstructured
before binding. This flexibility may be important for the
binding of the same region in different conformations to
other regulatory proteins. In the pKID–KIX complex, a
phosphoserine at one end of the amphipathic helix plays
an important regulatory role, conveying the phosphorylation signal through a hydrogen bond interaction. In the
ternary complex of PPAR-γ LBD, rosiglitazone and SRC1, the coactivator SRC-1 displays an amphipathic helix to
a surface formed by AF-2 and other structural elements
from the nuclear receptor LBD. SRC-1 binding is ligand
dependent, providing another means of conveying a signal
to the transcription machinery.
Implications for the mechanisms of eukaryotic
transcription regulation
Together with biological studies on transcriptional synergy
or ‘cross talk’, what do the above complex structures tell us
about the combinatorial gene regulation mechanisms in
eukaryotic cells?
Members of a transcription factor family may be
differentiated in higher order transcription
factor complexes
In the higher order complex structures discussed above, the
residues of a DNA-binding domain that do not contact the
DNA mediate protein–protein interactions. Transcription factors from the same family usually have highly homologous
DNA-binding surfaces, but different surface residues outside
the DNA-binding region. There is some biological evidence
that these variable residues may be important determinants
of each family member’s distinct in vivo functions [33,34].
sb9103.qxd
12/16/1999 9:19 AM
Page 51
Combinatorial gene regulation by eukaryotic transcription factors Chen
A sequence comparison of NFAT (NFATs 1–4) and Fos (cFos, FosB, Fra-1, Fra-2) family members indicated that
different NFAT–Fos pairs should have different interaction
interfaces [10••]. A similar specificity is also seen in the
MATα2–MATaa1–DNA complex, in which the MATα2 Cterminal α helix specifically binds to the MATaa1
homeodomain, but not its own homeodomain [25].
51
Figure 4
(a)
b
e
a
NFAT
RHR-C
The assembly of specific transcription factor complexes
contributes to the specificity of transcriptional control
For each individual transcription factor in the ternary
complexes discussed above, the homeodomain–DNA,
MADS box–DNA and Ets–DNA interactions are all similar to the interactions seen in their respective binary
protein–DNA complexes. This is true even when the
binding site deviates significantly from the consensus
sequence, as does DNA binding by Fos–Jun in the
NFAT–Fos–Jun complex. The recognition of nonconsensus sites by a transcription factor often involves only local
changes or adjustments of DNA-binding residues. Such
apparent ‘relaxation’ of specificity seems very common in
in vivo DNA binding by eukaryotic transcription factors,
especially as part of larger complexes. This may allow the
assembly of enhancer complexes on various composite
sites while maintaining the specificity by cooperative protein–protein interactions.
In higher order complexes, protein–protein interactions
add specificity to the combined DNA-binding specificity
of each component. In the MATα2–MATaa1–DNA and
MATα2–MCM1–DNA complexes, protein–protein interactions are mediated by structural modules that are
tethered to DNA-binding domains through a peptide linker. These linkers, though flexible in the crystal structures
(as evident from high B factors), still seem to impose specificity on complex formation by restricting the arrangement
of each component’s binding site [35]. In contrast, the
interactions between NFAT and Fos–Jun are directly
mediated by residues of their respective DNA-binding
domains and are continuous with the DNA-binding surfaces, similar to a nuclear receptor heterodimer–DNA
complex [36]. Thus, the whole complex has a continuous
DNA-binding groove, explaining why the spacing
between the NFAT and AP-1 DNA-binding sites is highly
conserved.
Significant conformational changes of the protein and
DNA are observed in the above higher order complexes
and they may play important roles in the specificity
(‘indirect read out’) and diversity of higher order complex
assembly. In the MATα2–MCM1–DNA complex, the
MATα2 linker contacting MCM1 shows flexibility by
being able to assume either a β strand or an α-helical conformation. Such flexibility may be important for MATα2
binding to a MCM1 dimer on both sides, with a two or
three base pair spacing on a natural a-specific operator
[9••]. The Fos α helix in the NFAT–Fos–Jun–DNA complex has a significant bend in its fork region. The fork
NFAT
RHR-N
(b)
NF-κB
RHR-C
a
b
e
NF-κB
RHR-N
Current Opinion in Structural Biology
Comparison of NFAT in the (a) NFAT–Fos–Jun–DNA complex and (b)
NF-κB p50 in the p50 dimer–DNA complex. The N-terminal DNAbinding domains of NFAT and p50 are similarly anchored on the DNA,
but their C-terminal domains are in very different orientations. The Cterminal domain of NFAT is similar to that of p50 (root mean square
deviation of 0.84 Å for 65 β strand Cα atoms, and most of the
dimerization residues at the abe sheet are conserved in NFAT).
region of DNA-bound bZIP proteins is usually flexible,
which may favor interactions with other proteins on
DNA, although it is not clear if members of the Fos family have different flexibility in this region or not. DNA
conformational changes in the NFAT–Fos–Jun complex
may explain why the DNA spacer between the NFAT
and AP-1 sites shows a strong preference for AT-rich
sequences [28].
Another level of specificity that is achieved through higher
order complex assembly is the specific orientation of heterodimeric DNA-binding proteins on DNA. Fos and Jun
sb9103.qxd
12/16/1999 9:19 AM
52
Page 52
Protein–nucleic acid interactions
Figure 5
(b)
(a)
(c)
N
A
B
B
E
E
N
A
A
N
B
E
Minor groove
contact
C
ab loop
C
Minor groove
contact
Minor groove
contact
ab loop
ab loop
C
(d)
(e)
N
N
A
B
E
A
B
E
C
ab loop
ab loop
C
Minor groove contact
Current Opinion in Structural Biology
A comparison of the immunoglobulin DNA-binding domains of (a) p53,
(b) NFAT, (c) STAT, (d) NF-κB and (e) the T domain. The β barrel is
oriented similarly, with the ABE sheet shown in front. The conserved ab
loop bound in the DNA major groove is indicated. Note that in p53,
NFAT and STAT, the C-terminal α helix is positioned in the major
groove, whereas a β-strand loop is inserted in the adjacent minor
groove. The orientation of p53 on DNA is significantly different from
that of the others. NF-κB does not contact the DNA minor groove. A
C-terminal α helix of the T domain binds deeply in the minor groove,
opposite the groove contacted by p53, NFAT and STAT.
have almost identical DNA-binding surfaces. Their bZIP
heterodimer was found to bind DNA in two orientations in
the binary (Fos–Jun)–DNA complexes [37,38], but adopted
a unique orientation in the ternary NFAT–Fos–Jun–DNA
complex [37]. As seen in the ternary crystal structure [10••],
the NFAT–Fos and NFAT–Jun interaction interfaces are
different, leading to orientation specificity on the DNA.
Such a specific orientation of heterodimeric transcription
factors on DNA, either through their own asymmetric DNA
binding or through interactions with partner proteins, may
have functional importance.
without binding to DNA. As seen in the crystal structures,
the protein–protein interaction interfaces observed in
these complexes are either limited or highly hydrophilic.
This feature is also observed in other complexes, including
those of the signal transducer and activator of transcription
(STAT) proteins. Crystal structures of two STAT DNAbinding complexes and a STAT N-terminal domain have
been determined recently [39••,40••,41•]. DNA-bound
STAT dimers are suggested to form higher order complexes through the conserved N-terminal protein interaction
domain that probably only dimerizes after DNA binding.
The potential dimer interface of this STAT N-terminal
domain is highly hydrophilic, as seen in the crystallographic dimer [41•]. Complex formation between NFAT and
Fos–Jun, and between MATα2 and MATaa1 is also DNA-
Higher order complexes form upon DNA binding
Unlike many tightly associated transcription factor dimers,
most of the ternary complexes discussed above do not form
sb9103.qxd
12/16/1999 9:19 AM
Page 53
Combinatorial gene regulation by eukaryotic transcription factors Chen
binding-dependent. This DNA-dependent complex formation may enable the combinatorial use of transcription
factors [25].
Combinatorial diversity is achieved at multiple levels
In addition to complex formation between distinct and
related transcription factors, the same DNA-binding
domain of a given transcription factor may adopt different
conformations in different promoter contexts, while
maintaining its specific DNA-binding interactions [42].
NFAT may be one example. On composite NFAT and
AP-1 DNA-binding sites, such as those found in the IL-2
promoter-enhancer, NFAT binds the DNA cooperatively
in complex with AP-1 transcription factors. On some
other DNA sites that have two NFAT DNA-binding sites
arranged with dyad symmetry and appropriate spacing,
NFAT binds the DNA cooperatively as a dimer. The latter mode of DNA binding by NFAT may be similar to
that of the Rel NF-KB transcription factors, which are
involved in a wide range of transcription regulation in the
immune system and in viral gene expression. The DNAbinding sites of the NFAT dimer resemble that of the
NF-KB proteins (the Kb DNA site), raising the possibility that NFAT may regulate gene transcription through
DNA sites (KB or KB-like DNA sites) that are also responsive towards NF-KB regulation. The structures of the
DNA-binding domains of NFAT and NF-KB are also
remarkably similar, both consisting of two Ig modules. In
the NFAT–Fos–Jun-DNA complex, each of NFAT’s two
Ig modules, Rel homology regions (RHRs) N and C,
resembles the corresponding parts of Rel NF-κB p50, but
the relative orientation of the two Ig modules in NFAT is
significantly different from that seen in p50–DNA complexes [43,44]. The C-terminal Ig module in p50
mediates dimer formation, whereas NFAT is a monomer
in solution [37,45] (Figure 4). Strikingly, residues
involved in p50 dimerization are largely conserved in the
NFAT C-terminal Ig module, suggesting that NFAT can
probably dimerize upon binding to DNA using the same
surface. There is evidence that NFAT may bind κB-like
DNA sites as a dimer under certain physiological situations [28], probably by adopting a conformation that is
very much like a typical NF-κB dimer (Figure 4).
Crystals of a NFAT dimer bound cooperatively to a κB
DNA fragment have been obtained and the structure
determination is underway (L Chen, A Breier, B Tasic,
SC Harrison, unpublished data). Alternatively, the
exposed NFAT C-terminal dimer interface can mediate
higher order complex formation between multiple copies
of DNA-bound NFAT–Fos–Jun complexes on an
enhancer [46], similar to the higher order assembly of
DNA-bound GABPβ/α and STAT dimers (see above).
The Ig DNA-binding domain is also found in a novel class
of transcription regulators, referred to as the T-domain
proteins. The T-domain proteins are the products of the
so-called T-box gene, which plays important roles in transcription regulation in embryonic development. The
53
recent structure determinations of NFAT, STATs and the
T-domain [47•] reveal that these proteins, together with
p53 [48] and NF-κB [43,44], form a superfamily that uses
the Ig module as a scaffold for presenting various secondary structural elements for DNA recognition
(Figure 5). Whether the above proteins are evolutionarily
related or not, however, is unclear at present. The recent
structure determination of a DNA-binding complex of
Skn-1 illustrated another way of generating diverse DNAbinding functions, combining various DNA-binding
motifs and scaffolds to form a novel DNA-binding domain
[49•]. This combination of DNA-binding modules is related to but is different from those seen in other
DNA-binding proteins, including zinc-finger proteins and
the POU domain proteins.
Transcriptional coactivators play important roles in the
combinatorial control of transcription
Considering the complexes between transactivation
domains and coactivators discussed above and similar
studies on p53 [50], it seems that the binding of transactivation domains to their target proteins generally involves
an induced amphipathic helix. This mode of protein–protein interaction is also seen in the MATα2–MATaa1–DNA
complex. Amphipathic helices have one or more φxxφφ
motifs, where φ represents a hydrophobic residue. In the
case of pKID, VP16 and other transactivation domains
(p53 and p65), the motif shows a preference for phenylalanine and tyrosine at the first φ, imposing some
specificity on coactivator selection. In SRC-1 and other
nuclear receptor coactivators, the motif is more restricted
to LXXLL (L represents leucine) [51]. The specificity of
coactivator binding also involves hydrogen bonding and
electrostatic interactions.
Thus, interactions between transactivation domains and coactivators are mediated by small, relatively independent
structural modules, each making limited but specific contacts.
There are three LXXLL motifs in SRC-1, two of which can
interact with a nuclear receptor dimer. The presence of a
third LXXLL may imply that these motifs can be used
either combinatorially with different nuclear receptor dimers
or simultaneously in forming higher order complexes.
Similarly, coactivators such as CBP and p300 also contain
separated binding sites for the transactivation domains of various transcription factors. These general features of
transactivation domain–coactivator interactions are consistent
with the combinatorial gene regulation mechanism.
Conclusions
In the past year, we have seen more structural characterizations of the protein–protein interactions involved in the
function of eukaryotic transcription factors. These include
the interactions between DNA-bound transcription factors and interactions between transactivation domains and
their respective targets. These structural studies support
the combinatorial transcription regulation mechanism in
eukaryotic cells.
sb9103.qxd
12/16/1999 9:20 AM
54
Page 54
Protein–nucleic acid interactions
Structural studies of eukaryotic transcription factors with
higher and higher complexity continue. These structures
will allow the analysis of the molecular details of
enhancer complexes (enhanceosome), the core promoter
complex and the interactions between them. These molecular pictures will provide an important framework for
studying and understanding the in vivo mechanisms of
eukaryotic gene regulation.
Acknowledgements
The author is grateful to Ernest Fraenkel, Rachelle Gaudet, Susanne
Swalley and Marc Jacobs for critically reading this review and for their many
helpful comments. The author would like to thank Stephen C Harrison for
a wonderful postdoctoral experience in his laboratory. A postdoctoral
fellowship from the Medical Foundation is acknowledged.
References and recommended reading
Papers of particular interest, published within the annual period of review,
have been highlighted as:
• of special interest
•• of outstanding interest
1.
Tjian R, Maniatis T: Transcriptional activation: a complex puzzle
with few easy pieces. Cell 1994, 77:5-8.
2.
Tan S, Richmond TJ: Eukaryotic transcription factors. Curr Opin
Struct Biol 1998, 8:41-48.
3.
Tan S, Hunziker Y, Sargent DF, Richmond TJ: Crystal structure of a
yeast TFIIA/TBP/DNA complex. Nature 1996, 381:127-151.
4.
Geiger JH, Hahn S, Lee S, Sigler PB: Crystal structure of the yeast
TFIIA/TBP/DNA complex. Science 1996, 272:830-836.
5.
•
Shi Y, Wang YF, Jayaraman L, Yang H, Massague J, Pavletich NP:
Crystal structure of a Smad MH1 domain bound to DNA:
β signaling. Cell 1998,
insights on DNA binding in TGF-β
94:585-594.
This paper describes the first crystal structure of the DNA-binding domain
from a Smad family protein in complex with DNA, showing a novel
DNA-binding mode involving a β hairpin bound in the major groove.
6.
•
Escalante CR, Yie J, Thanos D, Aggarwal AK: Structure of IRF-1 with
bound DNA reveals determinants of interferon regulation. Nature
1998, 391:103-106.
The first crystal structure determination of a DNA-binding domain from the
interferon regulatory factor (IRF) family of proteins in complex with DNA,
showing a variant of the ‘helix-turn-helix’ DNA-binding motif.
7.
Chen FE, Huang DB, Chen YQ, Ghosh G: Crystal structure of
κB bound to DNA.
p50/p65 heterodimer of transcription factor NF-κ
Nature 1998, 391:410-413.
12. Radhakrishnan I, Perez AG, Parker D, Dyson HJ, Montminy MR,
•• Wright PE: Solution structure of the KIX domain of CBP bound to
the transactivation domain of CREB: a model for
activator–coactivator interactions. Cell 1997, 91:741-752.
The first detailed structural characterization of the interactions between an
activation domain (CREB) and a coactivator (CBP).
13. Uesugi M, Nyanguile O, Lu H, Levine AJ, Verdine GL: Induced α helix
•• in the VP16 activation domain upon binding to a human TAF.
Science 1997, 277:1310-1313.
The first structural characterization of the interactions between VP16 and
TAF31, using NMR spectroscopy and biochemical methods.
14. Westin S, Kurokawa R, Nolte RT, Wisely GB, McInerney EM,
•• Rose DW, Milburn MV, Rosenfeld MG, Glass CK: Interactions
controlling the assembly of nuclear-receptor heterodimers and
co-activators. Nature 1998, 395:199-202.
See annotation to [15••].
15. Nolte RT, Wisely GB, Westin S, Cobb JE, Lambert MH, Kurokawa R,
•• Rosenfeld MG, Willson TM, Glass CK, Milburn MV: Ligand binding
and coactivator assembly of the peroxisome proliferator-activated
receptor-γγ. Nature 1998, 395:137-143.
This paper, together with [14••], describes the first crystallographic study of a
nuclear receptor ligand-binding domain and its coactivator in a ternary complex.
16. Kim TK, Maniatis T: The mechanism of transcriptional synergy of
β enhanceosome. Mol Cell 1997,
•
an in vitro assembled interferon-β
1:119-129.
A detailed biochemical study on enhanceosome assembly, demonstrating a
direct correlation between cooperative DNA binding and transcriptional synergy in the context of an assembled enhancer.
17.
Garrity PA, Chen D, Rothenberg EV, Wold BJ: Interleukin-2
transcription is regulated in vivo at the level of coordinated
binding of both constitutive and regulated factors. Mol Cell Biol
1994, 14:2159-2169.
18. Grosschedl R: Higher-order nucleoprotein complexes in
transcription: analogies with site-specific recombination.
Curr Opin Cell Biol 1995, 7:362-370.
β gene
19. Thanos D, Maniatis T: Virus induction of human IFNβ
expression requires the assembly of an enhanceosome. Cell
1995, 83:1091-1100.
20. Yamamoto KR, Pearce D, Thomas J, Miner JN: Combinatorial
regulation at a mammalian composite response element. In
Transcriptional Regulation. Edited by McKnight SL, Yamamoto KR.
New York: Cold Spring Harbor Laboratory Press;
1992:1169-1192.
21. Johnson A: A combinatorial regulatory circuit in budding yeast. In
Transcriptional Regulation, vol 2. Edited by McKnight SL, Yamamoto
KR. New York: Cold Spring Harbor Laboratory Press;
1992:975-1006.
22. Hochschild A, Ptashne M: Cooperative binding of lambda
repressors to sites separated by integral turns of the DNA helix.
Cell 1986, 44:681-687.
Parraga A, Bellsolell L, Ferre-D’Amare AR, Burley SK: Co-crystal
structure of sterol regulatory element binding protein 1a at 2.3 Å
resolution. Structure 1998, 6:661-672.
A recent example showing that binding to a nonconsensus DNA site by a transcription factor involves only localized changes at the protein–DNA interface.
23. Ptashne M, Gann A: Transcriptional activation by recruitment.
Nature 1997, 386:569-577.
α2/MCM1/
9. Tan S, Richmond TJ: Crystal structure of the yeast MATα
•• DNA ternary complex. Nature 1998, 391:660-666.
A higher order eukaryotic transcription factor complex bound to DNA, demonstrating the importance of protein–protein interactions in the assembly.
25. Li T, Stark MR, Johnson AD, Wolberger C: Crystal structure of the
MATa1/MAT α2 homeodomain heterodimer bound to DNA.
Science 1995, 270:262-269.
8.
•
10. Chen L, Glover JN, Hogan PG, Rao A, Harrison SC: Structure of the
•• DNA-binding domains from NFAT, Fos and Jun bound specifically
to DNA. Nature 1998, 392:42-48.
A higher order eukaryotic transcription factor complex bound to DNA, demonstrating the importance of protein–protein interactions in the assembly.
11. Batchelor AH, Piper DE, de la Brousse FC, McKnight SL,
α/β
β: an ETS domain-ankyrin
•• Wolberger C: The structure of GABPα
repeat heterodimer bound to DNA. Science 1998,
279:1037-1041.
The structure determined shows a different kind of higher order transcription
factor complex, as compared with [9••] and [10••], in that one component
does not bind DNA directly; however, the protein–protein interactions are
equally important for the assembly and the enhanced DNA binding. It is also
one of the first two structures to show the detailed protein–protein interactions mediated by ankyrin repeats that play important roles in a wide range
of physiologically important protein complexes.
24. Johnson AD: Molecular mechanisms of cell-type determination in
budding yeast. Curr Opin Genet Dev 1995, 5:552-558.
26. Vershon AK, Johnson AD: A short, disordered protein region
mediates interactions between the homeodomain of the yeast α2
protein and the MCM1 protein. Cell 1993, 72:105-112.
27.
Crabtree GR, Clipstone NA: Signal transmission between the
plasma membrane and nucleus of T lymphocytes. Annu Rev
Biochem 1994, 63:1045-1083.
28. Rao A, Luo C, Hogan PG: Transcription factors of the NFAT family:
regulation and function. Annu Rev Immunol 1997, 15:707-747.
29. Erlanson DA, Chytil M, Verdine GL: The leucine zipper domain
controls the orientation of AP-1 in the NFAT–AP-1–DNA complex.
Chem Biol 1996, 3:981-991.
30. Peterson BR, Sun LJ, Verdine GL: A critical arginine residue
mediates cooperativity in the contact interface between
transcription factors NFAT and AP-1. Proc Natl Acad Sci USA
1996, 93:13671-13676.
sb9103.qxd
12/16/1999 9:20 AM
Page 55
Combinatorial gene regulation by eukaryotic transcription factors Chen
55
31. Sun LJ, Peterson BR, Verdine GL: Dual role of the nuclear factor of
activated T cells insert region in DNA recognition and cooperative
contacts to activator protein 1. Proc Natl Acad Sci USA 1997,
94:4919-4924.
for the DNA-binding-dependent, higher order complex assembly of transcription factors.
32. Zhou P, Sun LJ, Dotsch V, Wagner G, Verdine GL: Solution structure
•
of the core NFATC1/DNA complex. Cell 1998, 92:687-696.
An NMR study of a binary NFAT–DNA complex, showing the different orientation of NFAT on DNA compared with [10••].
κB
43. Ghosh G, van Duyne G, Ghosh S, Sigler PB: Structure of NF-κ
p50 homodimer bound to a κB site. Nature 1995, 373:303-310.
33. Diamond MI, Miner JN, Yoshinaga SK, Yamamoto KR: Transcription
factor interactions: selectors of positive or negative regulation
from a single DNA element. Science 1990, 249:1266-1272.
34. Davis RL, Cheng PF, Lassar AB, Weintraub H: The MyoD DNA
binding domain contains a recognition code for muscle-specific
gene activation. Cell 1990, 60:733-746.
35. Jin Y, Mead J, Li T, Wolberger C, Vershon AK: Altered DNA
recognition and bending by insertions in the α2 tail of the yeast
α1/α
α2 homeodomain heterodimer. Science 1995, 270:290-293.
36. Rastinejad F, Perlmann T, Evans RM, Sigler PB: Structural
determinants of nuclear receptor assembly on DNA direct
repeats. Nature 1995, 375:203-211.
37.
Chen L, Oakley MG, Glover JN, Jain J, Dervan PB, Hogan PG, Rao A,
Verdine GL: Only one of the two DNA-bound orientations of AP-1
found in solution cooperates with NFATp. Curr Biol 1995,
5:882-889.
38. Glover JN, Harrison SC: Crystal structure of the heterodimeric
bZIP transcription factor c-Fos-c-Jun bound to DNA. Nature 1995,
373:257-261.
39. Chen X, Vinkemeier U, Zhao Y, Jeruzalmi D, Darnell JJ, Kuriyan J:
•• Crystal structure of a tyrosine phosphorylated STAT-1 dimer
bound to DNA. Cell 1998, 93:827-839.
See annotation to [40••].
40. Becker S, Groner B, Muller CW: Three-dimensional structure of the
β homodimer bound to DNA. Nature 1998, 394:145-151.
•• Stat3β
The structures described in this paper and [39••], the first for the STAT family, show the unique features of the STAT family proteins, whose multiple
functional domains are highly integrated at the structural level.
41. Vinkemeier U, Moarefi I, Darnell J Jr, Kuriyan J: Structure of the
•
amino-terminal protein interaction domain of STAT-4. Science
1998, 279:1048-1052.
This is the first crystallographic study of potential protein-binding surfaces
from the conserved STAT N-terminal domain. The structural features of the
dimer interface observed in the crystal structure have general implications
42. Lefstin JA, Yamamoto KR: Allosteric effects of DNA on
transcriptional regulators. Nature 1998, 392:885-888.
44. Muller CW, Rey FA, Sodeoka M, Verdine GL, Harrison SC: Structure
κB p50 homodimer bound to DNA. Nature 1995,
of the NF-κ
373:311-317.
45. Hoey T, Sun YL, Williamson K, Xu X: Isolation of two new members
of the NF-AT gene family and functional characterization of the
NF-AT proteins. Immunity 1995, 2:461-472.
46. Rooney JW, Sun YL, Glimcher LH, Hoey T: Novel NFAT sites that
mediate activation of the interleukin-2 promoter in response to
T-cell receptor stimulation. Mol Cell Biol 1995, 15:6299-6310.
47.
•
Muller CW, Herrmann BG: Crystallographic structure of the T
domain-DNA complex of the Brachyury transcription factor. Nature
1997, 389:884-888.
The structure not only expands the Ig DNA-binding protein family, but it also
reveals unusual minor groove DNA binding by an α helix. This observation
is important for understanding the general mechanisms of DNA recognition
by proteins.
48. Cho Y, Gorina S, Jeffrey PD, Pavletich NP: Crystal structure of a p53
tumor suppressor-DNA complex: understanding tumorigenic
mutations. Science 1994, 265:346-355.
49. Rupert PB, Daughdrill GW, Bowerman B, Matthews BW: A new
•
DNA-binding motif in the Skn-1 binding domain-DNA complex.
Nat Struct Biol 1998, 5:484-491.
This structure shows a novel DNA-binding domain that can be regarded as
a hybridization of parts of other known DNA-binding motifs, illustrating a
unique way of varying DNA-binding motifs in order to generate new DNAbinding functions by nature.
50. Kussie PH, Gorina S, Marechal V, Elenbaas B, Moreau J, Levine AJ,
Pavletich NP: Structure of the MDM2 oncoprotein bound to the
p53 tumor suppressor transactivation domain. Science 1996,
274:948-953.
51. Heery DM, Kalkhoven E, Hoare S, Parker MG: A signature motif in
transcriptional co-activators mediates binding to nuclear
receptors. Nature 1997, 387:733-736.
52. Kraulis PJ: MOLSCRIPT: a program to produce both detailed and
schematic plots of protein structures. J Appl Crystallogr 1991,
24:946-950.