DNA binding by the plant-specific NAC transcription factors in crystal

Biochem. J. (2012) 444, 395–404 (Printed in Great Britain)
395
doi:10.1042/BJ20111742
DNA binding by the plant-specific NAC transcription factors in crystal
and solution: a firm link to WRKY and GCM transcription factors
Ditte H. WELNER*, Søren LINDEMOSE†, J. Günter GROSSMANN‡, Niels Erik MØLLEGAARD§, Addie N. OLSEN†,
Charlotte HELGSTRAND*, Karen SKRIVER† and Leila LO LEGGIO*1
*Biophysical Chemistry Group, Department of Chemistry, University of Copenhagen, Universitetsparken 5, 2100 Copenhagen Ø, Denmark, †Section for Biomolecular Sciences,
Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen N, Denmark, ‡Molecular Biophysics Group, Institute of Integrative Biology, Faculty of Health
and Life Sciences, The University of Liverpool, Crown Street, Liverpool L69 7ZB, U.K., and §Department of Cellular and Molecular Medicine, Panum Institute, Faculty of Health
Sciences, University of Copenhagen, Blegdamsvej 3, 2200 Copenhagen N, Denmark
NAC (NAM/ATAF/CUC) plant transcription factors regulate
essential processes in development, stress responses and nutrient
distribution in important crop and model plants (rice, Populus,
Arabidopsis), which makes them highly relevant in the context
of crop optimization and bioenergy production. The structure of
the DNA-binding NAC domain of ANAC019 has previously
been determined by X-ray crystallography, revealing a dimeric
and predominantly β-fold structure, but the mode of binding
to cognate DNA has remained elusive. In the present study,
information from low resolution X-ray structures and small
angle X-ray scattering on complexes with oligonucleotides,
mutagenesis and (DNase I and uranyl photo-) footprinting,
is combined to form a structural view of DNA-binding,
and for the first time provide experimental evidence for the
speculated relationship between plant-specific NAC proteins,
WRKY transcription factors and the mammalian GCM (Glial
cell missing) transcription factors, which all use a β-strand motif
for DNA-binding. The structure shows that the NAC domain
inserts the edge of its core β-sheet into the major groove, while
leaving the DNA largely undistorted. The structure of the NAC–
DNA complex and a new crystal form of the unbound NAC also
indicate limited flexibility of the NAC dimer arrangement, which
could be important in recognizing suboptimal binding sites.
INTRODUCTION
NAC proteins consist of a conserved N-terminal region (NAC
domain) with DNA-binding and oligomerization abilities, and
a diverse C-terminal region which functions as a transcription
regulatory domain. Initial biophysical analysis suggested that the
NAC domain was dominated by β-strand structures [11]. This was
later confirmed by our determination of the first crystal structure
of a NAC domain, that of A. thaliana ANAC019, revealing a
novel dimeric transcription factor fold consisting of mainly βsheets and containing no well-known DNA-binding motif [12].
We have also shown by EMSA (electrophoretic mobility-shift
assay) that the ANAC019 NAC domain binds DNA as a dimer
and characterized the dimer interface [12] as well as identified
a consensus DNA sequence (CGT[GA]) to which ANAC019
and other relatively distant NAC TFs bind [13–16]. Despite the
considerable interest in these proteins since the publication of
the structure [e.g. 2,3,7–10,17], no experimental information
on their molecular interactions with DNA has been published,
besides mutational studies [13].
In the present study, interaction of the ANAC019 NAC domain
with DNA is probed structurally by crystallography and in
solution, providing for the first time experimental evidence for
a DNA-binding mode similar to that of the plant-specific WRKY
TF family [18,19] and the mammalian family of GCM (Glial
cell missing) TFs [20]. The elucidation of the interaction between
ANAC019 and a cognate dsON (double-stranded oligonucleotide)
is thus a key in consolidating the relationship between these three
TF families, which have previously been noted to have structural
similarities [5,21].
Transcription is a fundamental biological process, and a large
part of the molecular machinery is conserved among kingdoms.
However some TF (transcription factor) families, such as NAC
(NAM/ATAF/CUC) proteins, have only been identified in plants.
NAC proteins constitute a highly prolific group of plant-specific
TFs, with representatives in monocots, dicots, conifers and
mosses. Many plants of commercial and scientific interest, such
as Arabidopsis thaliana [1], tobacco [2] and rice [3] have more
than 100 different NAC proteins. Early findings indicated an
important role of NAC proteins in plant development [4]. This has
later been confirmed, and a variety of other important processes
such as biotic and abiotic stress responses and senescence have
been associated with NAC proteins, as reviewed in [5]. A recent
example is the work of Carvallo et al. [6] that suggested that NAC
proteins are part of an evolutionarily conserved cold response,
since NAC019 was up-regulated in response to cold in distantly
related plant species.
Research has also suggested a useful role for NAC proteins in
bioenergy production from lignocellulosic material, since a subset
of NAC proteins, now termed the Secondary Wall NACs, were
shown to influence cell wall thickness [7,8]. Applications in crop
plant engineering have also been demonstrated, e.g. improvement
of rice drought tolerance and yield on expression of a NAC protein
in the roots [9], and increase of wheat protein, zinc and iron
content by replacement of a non-functional NAC gene allele in
modern wheat with a functional ancestral counterpart [10].
Key words: crystal, DNA-binding, small angle X-ray scattering
(SAXS), transcription factor, uranyl photo-probing.
Abbreviations used: DIG, digoxigenin; dsON, double-stranded oligonucleotide; EMSA, electrophoretic mobility-shift assay; GCM, Glial cell missing;
MR, molecular replacement; NAC, NAM/ATAF/CUC; NACBS, NAC-binding site; PEG, poly(ethylene glycol); SAXS, small-angle X-ray scattering; SEC,
size-exclusion chromatography; SNAC1, stress-responsive NAC 1; TF, transcription factor.
Co-ordinates and structure factors have been deposited into PDB with accession codes 3SWP (Native_complex), 3SWM (Au_complex) and 4DUL
(Form IV).
1
To whom correspondence should be addressed (email [email protected]).
c The Authors Journal compilation c 2012 Biochemical Society
396
Table 1
D. H. Welner and others
Data, refinement and validation statistics
Data were collected at 100 K. The complex data were processed with XDS [25], and the form IV data with Denzo/Scalepack [24]. R int is the redundancy independent R -factor on the intensities [47] for
the complex data or the conventional R sym for form IV. Ramachandran statistics are calculated with Rampage [48]. Parentheses denote the values for the highest resolution shell. n.a., not applicable;
RMSD, root mean square deviation.
Parameter
Native_complex
Au_complex
Form IV
Data collection wavelength
Space group
Unit cell
Resolution
R int
I /σ
Completeness
Redundancy
R -factor
R -free
RMSD bonds
RMSD angles
PDB code
Protein atoms
DNA atoms
Gold atoms
Ramachandran statistics (favoured, allowed, outliers)
RMSD of DNA atoms from ideal B-DNA
1.038 Å
P 21 2 1 2 1
a = 69.1 Å, b = 105.5 Å, c = 175.2 Å
20–4.40 Å (4.53–4.40 Å)
10 % (63 %)
17.1 (4.0)
99.4 % (100.0 %)
8.7 (9.0)
26.0%
34.9%
0.015 Å
2.484 ◦
3SWP
4639 with average B = 245.3 Å2
1066 with average B = 330.3 Å2
n.a.
97.0 %, 2.8 %, 0.2%
1.514 Å
1.038 Å
P 21 21 21
a = 68.5 Å, b = 109.1 Å, c = 173.9 Å
20–4.45 Å (4.58–4.45 Å)
11 % (64 %)
10.9 (2.6)
99.4 % (99.6 %)
3.8 (3.9)
25.0%
30.9%
0.015 Å
2.285 ◦
3SWM
4639 with average B = 236.6 Å2
1066 with average B = 275.0 Å2
9 with average B = 415.6 Å2
96.6 %, 3.2 %, 0.2%
1.304 Å
1.087 Å
P 21
a = 40.1 Å, b = 69.1 Å, c = 75.9 Å, β = 97.4 ◦
30–3.00 Å (3.11–3.00 Å)
5.3 % (39.2 %)
25.2 (2.3)
98.9 % (91.3 %)
3.7 (3.2)
25.7%
34.2%
0.012 Å
1.820 ◦
4DUL
2097 with average B = 99.0 Å2
n.a.
n.a.
90.4 %, 7 %, 2.6%
n.a.
EXPERIMENTAL
Protein and oligonucleotide samples
The NAC domain of ANAC019 (At1g52890, residues 1–168,
NCBI Accession number NP_175697) with an N-terminal
His6 tag was expressed from vector pET-15b, purified as
described previously [13] and stored in 20 mM Tris/HCl
(pH 7.5), which is the buffer used for all experiments in the
present study unless otherwise stated. The 26 bp oligo2 (5 gtcttgcgtgttggaacacgcaacag-3 ) was designed with 5 overhangs
to facilitate oligomerization and crystal growth [22], and contains
two palindromically oriented NACBSs (NAC-binding sites; as
defined in [13], sequence TTGCGTG, in bold) separated by
a 6 bp spacing. ssONs (single-stranded oligonucleotides) were
purchased from TAG Copenhagen and annealed to a final
concentration of 250 μM dsON in 10 mM Tris/HCl (pH 7.5)
and 50 mM NaCl. The sequence of the 30 bp blunt-ended
dsON used in EMSA was similar to the sequence of oligo2
(5 -cagtcttgcgtgttggaacacgcaacagtc-3 ; NABCS in bold).
Crystallization and data collection
A new crystal form of the free ANAC019 NAC domain (form
IV) was obtained under similar conditions to the ones reported
previously [23], using 12.5 % PEG [poly(ethylene glycol)] 4000,
0.1 M malic acid/imidazole buffer (pH 7.0) and 5 % glycerol, and
a new batch of protein (4.0–5.8 mg/ml). Data were collected at
beamline 911-2, MAXLAB (Lund, Sweden) and processed with
Denzo/Scalepack [24] (statistics in Table 1).
For the complexed structures, NAC dimer and oligo2 were
mixed in a 1:1 molar ratio. After extensive screening (including
different crystallization conditions and a set of ten dsON), the
conditions leading to crystallization were identified. Crystals
grew within two weeks in manually set up 24-well VDXTM
trays (Hampton Research) using 0.01 M MgSO4 ·7H2 0, 0.05 M
Mes (pH 6.0) and 0.5 % PEG 400 as main precipitant. The
hanging drop was composed of equal amounts (2 μl) of
reservoir solution and protein–DNA mixture (with protein at a
1 mg/ml concentration). Crystals routinely diffracted to 8–9 Å (1
Å = 0.1 nm), and two were found to diffract to ∼ 4.45 Å. One
c The Authors Journal compilation c 2012 Biochemical Society
of them (Native_complex) was grown in the presence of 0.3 M
glycyl–glycyl–glycine as an additive. A dataset (Native_complex)
was collected at beamline I911-2 of MAXLAB at cryogenic
conditions using 22 % glycerol as a cryoprotectant. Additives did
not consistently improve diffraction quality, thus the differences
in diffraction power must be due to variation in individual
crystal quality and handling. Dehydration, annealing and other
postcrystallization improvement techniques did not improve on
the resolution limit. Later in the study, attempts were made
to obtain derivatives with gold-, platinum-, mercury- and leadcontaining compounds. Data were collected at beamline ID14-4
of the ESRF (Grenoble, France) for a trimethyl lead acetate and
two potassium dicyanoaurate derivative crystals, but finally only
one (Au_complex, crystal was soaked for 20 h in 5 mM potassium
dicyanoaurate) was deemed of sufficient quality for structure
determination. Data collection statistics for processing with XDS
[25] for the two complex data sets are reported in Table 1.
Structure determination
The structure of the free form IV was determined by MR
(molecular replacement) using Phaser within the PHENIX suite
[26,27] and an edited NAC monomer (PDB code 1UT7) as
the search model. The MR solution was deemed correct, as it
showed a similar dimer interface as the one identified in the high
resolution crystal structure. Refinement was carried out initially
in Phenix.refine [27] and later in Refmac 5 [28] using jelly-body
restraints, NCS (non-crystallographic symmetry) restraints and
group TLS (Translation/Libration/Screw) refinement, alternated
with manual rebuilding in Coot [29]. This strategy resulted in
acceptable geometry and R-factors (R-factor of 25.7 % and Rfree of 34.2 %) for a model comprising 75 % of the amino
acids in the dimer. The B molecule makes fewer crystal contacts
and was particularly difficult to build in some regions. Further
details on structure determination and validation are given in
the Supplementary online data (at http://www.BiochemJ.org/bj/
444/bj4440395add.htm) and the refinement statistics are given in
Table 1.
The Native_complex data were phased with MR using
Phaser v1.3.1 [26]. The structure is shown in Figure 1 and
Supplementary Movie S1 (at http://www.BiochemJ.org/bj/444/
DNA binding by NAC plant transcription factors
Figure 1
397
Crystal structure of the NAC–DNA complex (Native_complex)
Binding sites according to the assigned DNA sequence are in yellow.
bj4440395add.htm). There were several high-ranking solutions,
which only differed by translation of the DNA along its major axis.
Thus oligo2 packs in the crystal structure as long fibres in a headto-tail arrangement, basically as expected from the design strategy.
Based on MR alone, it was difficult to assign unambiguously the
correct sequence to the DNA. Since a model lacking the bases
would be problematic both in terms of crystallographic refinement
and SAXS (small-angle X-ray scattering) profile simulation,
potentially leading to wrong refinement and conclusions on the
basis of the χ-values, the sequence was instead assigned by
combining the information of protected phosphates in uranyl
photo-probing (Figure 2A) with the crystal structure, to obtain the
best possible match between phosphate protection in solution and
reduction in phosphate solvent accessibility on complex formation
with the AB dimer, as derived from the crystal structure. The
match is also shown in Figure 2(A). All of the nucleotide bases
for oligo2 are thus included in the final model, which is highly
plausible in view of the biochemical data obtained in solution.
Refinement was carried out in phenix.refine [27] and minor
manual rebuilding and inspection of maps were done in Coot v0.6
[29]. The CCP4 package suite v6.1.3 [30] was used for various
tasks in phasing, refinement, validation and analysis. Many
refinement protocols were tested. The best compromise between
R-factors and geometry was obtained using the protocol described
in further detail in the Supplementary online data, which resulted
in R-factors of 26.0 % and 34.9 % for Native_complex, and
25.0 % and 30.9 % for Au_complex. Validation is summarized
in the results section, with further details in the Supplementary
online data. The statistics for both models are shown in Table 1.
Figures showing structures and electron density were prepared
with PyMOL v1.3 (http://www.pymol.org). Analysis of surface
accessibility on the basis of the crystal structures was carried out
with AreaImol from the CCP4 package [30,31], using the default
probe radius of 1.4 Å. Surface area differences upon NAC–DNA
complex formation were calculated per protein amino acid residue
(to identify residues listed in Table 2) or DNA phosphate group
(for the phosphate accessibility analysis in Figure 2A).
Uranyl photo-cleavage analysis and DNase I footprinting
The uranyl photo-cleavage analysis and DNase I footprinting were
performed as described previously using a sample volume of 30 μl
Figure 2 Backbone phosphate accessibility of ANAC019–DNA complex
according to uranyl photo-probing assay and the complex crystal structure
(A) Model of phosphate contacts in the ANAC019–DNA complex. The model represented by
arrows is built from four independent uranyl photo-footprinting experiments on both strands
of the oligonucleotide. The data reflect phosphate interactions in the presence of 200 ng of
ANAC019. The palindromic core NACBS is underlined and the length of the arrows denotes the
relative protection level (i.e. semi-quantitative footprints). Black bold letters indicate nucleotides
whose phosphate group loses more surface area than the threshold value on complex formation
with the AB dimer in the Native_complex crystal structure. The threshold value was chosen
as 40 % of lost accessible area compared with the phosphate losing most surface area on
complex formation, according to the AreaImol analysis, which was set to 100 %. Open bold
letters represent nucleotides whose phosphate groups were judged as likely to be protected by
residues in the loops/termini missing in the crystal structure, on the basis of their approximate
spatial orientation. (B) Differential cleavage plots. The differential cleavage plots compare
the susceptibility of the individual DNA nucleotides to uranyl photo-cleavage in the absence
and presence of 200 ng of ANAC019. Negative values (threshold = 0.1) correspond to a
protein-protected phosphate, and positive values represent enhanced cleavage (more accessible
phosphates upon ANAC019 binding). The vertical scale is in units of ln(f a ) − ln(f c ), where f a is
the fractional cleavage at any bond in the presence of 200 ng of ANAC019 and f c is the fractional
cleavage of the same band in the control (0 ng). The NACBSs are underlined and black boxes
indicate phosphate footprints on both strands.
and up to 200 ng ANAC019 per sample [32–34]. In brief, the
purified [32 P]-labelled DNA fragments were subjected to either
DNase I or uranyl photo-cleavage in binding buffer (see [13]). The
concentration of uranyl nitrate was 1 mM and the concentration
of ANAC019 is indicated in the text. Since uranyl photo-cleavage
is absent above pH 7, the pH of the buffers were adjusted to
6.0–6.5. The samples were incubated for 30 min at 23 ◦ C and
then irradiated in open tubes placed just below a 40W/03 Phillips
fluorescent light tube with maximum emission at 420 nm. After
25 min of irradiation, sodium acetate at pH 4.5 was added to a final
concentration of 0.2 M together with 2.5 vol. of 96 % ethanol on
c The Authors Journal compilation c 2012 Biochemical Society
398
Table 2
D. H. Welner and others
DNA-binding residues of ANAC019
Overview of the residues close to DNA as detected by an analysis of surface and contact areas upon DNA binding performed with AreaImol [30,31]. For simplicity, the residues have been placed in
six groups according the secondary structure element they are part of. The sequence of each group is displayed, with non-conserved residues in regular type and conserved in bold. Parentheses
indicate less likely, but possible, interactions.
Group number
Chain A
Chain B
Chain C
Chain D
Sequence
Location in NAC
1
86–88
86–87
–
86–87
P86 NR
2
3
4
5
6
94–101
113–118, 120
123–125, 129–130
133, 135, 138
160, 162–163
95–101
114–116, 118, 120
123–125, 129–131
133, 135, 138
160, 162–163
–
–
–
–
–
95–101
115, 117–118, 120
123–125, 129
133
162
Y94 WKATGTD
I113 KKALVFY
K123 APKGTKTNW
I133 MHEYR
Y160 KKQ
Poorly defined loop
between β1 and β2
β3
β4
Loop between β4 and β5
β5
C-terminal
ice for 15 min, and the mixtures were centrifuged at 15 000 g for
20 min. The dried pellet was dissolved in 6 μl formamide, 90 mM
Tris/borate and 1 mM EDTA (pH 8.3), containing xylene cyanol
and Bromophenol Blue, and the samples were heated at 90 ◦ C
before loading on to a 8 or 10 % denaturing polyacrylamide gel
(19:1 acrylamide/methylenebisacrylamide). The autoradiograms
were obtained by overnight exposure using intensifying screens.
A Molecular Dynamics STORM 860 PhosphorImager was used
to collect data from the phosphor storage screens and baseline corrected scans were obtained by using the software SAFA
package [35]. Differential cleavage plots were calculated from the
expression ln(f a ) − ln(f c ) representing the differential cleavage
at each bond relative to the control (where f a is the fractional
cleavage at any bond in the presence of the protein and f c
is the fractional cleavage of the same bond in the control).
Using this expression, positive values indicate enhanced cleavage
(hypersensitivity), whereas negative values indicate cleavage
inhibition (footprints).
DNA-interaction
region
Probable function in
DNA binding
Backbone
Increasing affinity
Major groove
(Backbone)
Backbone
(Backbone)
Backbone
Specific recognition
None/increasing affinity
None/Increasing affinity
None
Increasing affinity
ctcctaaaggtactgcaaacaattggatca-3 ; and K162A, 5 -gggttctatgtcgaatatacaaggcgcaatcaagtgcacaaaaacaa-3 .
EMSA with DIG (digoxigenin) labelling (Roche) was
performed as described previously [12], except for the
DNA sequence which was as described under protein and
oligonucleotide samples.
In order to determine the K d value of the interaction between
wt (wild-type) ANAC019 and oligo2, three independent EMSAs
(n = 3) with full titrations of ANAC019 were carried out with
32
P-labelled oligo2 instead of DIG-labelling. The EMSAs were
performed as described in [13], but the DNA concentration was
kept at 75 pM, which is roughly 1000-fold lower than the expected
K d value (results not shown). Autoradiograms were obtained
by exposure using intensifying screens. A Molecular Dynamics
STORM 860 PhosphorImager was used to collect data from
the phosphor storage screens and ImageQuant software version
5.2 (Molecular Dynamics) was used to calculate the fractional
saturation. The data was used to establish a binding isotherm,
which were fitted to the ‘one site binding’ (hyperbola) equation:
Y = (Bmax ×X)/(K d + X) using GraphPad Prism version 5.00.
SAXS analysis
The free ANAC019 domain alone was investigated by SAXS in
order to confirm that the previously determined X-ray structure
(PDB code 1UT4) was compatible with the solution structure.
Data for a concentration series (1–8 mg/ml) in 20 mM Tris/HCl
(pH 7.5) were measured at beamline 2.1, SRS (Daresbury,
England). Additional information on data analysis and modelling
are given in the Supplementary online data.
ANAC019(1–168) (1 mg/ml) and oligo2 (250 μM) were mixed
in equimolar amounts and incubated at room temperature (23 ◦ C)
for 15 min. The sample was then fractionated on a pre-packed
24 ml Superdex 75 column (GE Healthcare) with a flow rate of
0.5 ml/min with 20 mM Tris/HCl (pH 7.5) as the running buffer.
The resulting peak fractions were analysed separately with SAXS
at beamline 711, MAXLAB. Non-peak fractions were used as
background subtraction. Exposure time was 20 s. A concentration
series was recorded on each fraction after concentration with
a Microcon Ultracel YM-10. The data were analysed with the
programs Raw [36] and the ATSAS program package [37].
Site-directed mutagenesis and EMSA
ANAC019(1–168), produced as described above, and mutated
variants thereof, were used for EMSA. Site-directed mutagenesis
was performed with QuikChangeTM II Site-Directed Mutagenesis
kit (Stratagene). The mutagenic primers were the following (sense
strand): R85A, 5 -aatatccaaacgggtcagcacctaaccgggttgcc-3 ;
R88A, 5 -gggtcaagacctaacgcggttgccggatcgg-3 ; K129A, 5 -aaag
c The Authors Journal compilation c 2012 Biochemical Society
Dimer angle calculations
Dimer closure was quantified by defining a vector for each NAC
monomer going through the dimerization interface as well as the
protein core and then calculating the angle between the vector pair
of the dimer. The vector was constructed through the Cα-positions
of residues 18 (part of the dimerization β-sheet) and 135 (situated
in the centre of the core β-sheet). The angle between the vectors
was calculated using the basic formula: cos(α) = a·b/|a|x|b|.
RESULTS AND DISCUSSION
Crystal structure of the NAC–DNA complex
Lack of detailed information on the DNA-binding mode prompted
us to initiate co-crystallization of the ANAC019 NAC domain and
dsONs containing a palindrome of the 7 bp ANAC019-binding
site (NACBS) identified previously [13]. Although the literature
indicates that some NAC TFs target single NACBSs in vivo
[13,38], palindromic NACBSs bind with higher affinity [13] and
were thus chosen. After extensive optimization of crystals, a native
data set to 4.40 Å (Native_complex) and one derivatized with gold
(Au_complex) to 4.45 Å were collected for a complex with oligo2
(data collection statistics are shown in Table 1).
The MR solutions obtained with Native_complex and
Au_complex are very similar and show a ternary complex
consisting of two NAC dimers (chains AB and CD) bound to
DNA binding by NAC plant transcription factors
399
one DNA duplex constituted by chains EF (Figure 1). Combined
with the low resolution, the unusual stochiometry prompted
thorough validation. In short, crystal packing, a biologically
sensible DNA–protein interface, omit maps and similar gold sites
as those originally used to solve the NAC structure alone all
confirmed the MR solution (more details in Supplementary online
data and Supplementary Figure S1 at http://www.BiochemJ.
org/bj/444/bj4440395add.htm). The DNA sequence was assigned
to agree with the pattern of phosphate protection in the uranyl
photo-probing assay described below. The agreement is shown in
Figure 2(A), which shows the phosphate protection in solution
as well the phosphates losing the largest accessible surface
on complexation with the AB dimer in the Native_complex
crystal structure, according to the sequence as assigned in the
deposited PDB file. Additionally, phosphates potentially protected
by protein residues missing in the crystallographic model are
marked, on the basis of the vicinity with missing loops or extended
termini. As it is not possible to completely exclude a possible
small register error, in our analysis we refrain from discussing
interactions with individual bases. As a final validation that the
overall structure is fundamentally correct, the structures could
be refined as well as expected with the limited resolution, to
Rwork = 26.0 % and Rfree = 34.9 % for the Native_complex model
(Table 1).
Each monomer inserts into adjacent major grooves of the DNA
as is common for TFs (Figure 1), and a change in dimer opening
with respect to the previously determined free structure [12] can
be observed, as will be discussed below.
DNase I footprinting and uranyl photo-cleavage analysis of
ANAC019–DNA interactions
DNase I and uranyl photo-footprinting analysis were used to
map the binding in solution. Using binding conditions described
previously [13], the DNase I results clearly demonstrate protection
of the palindromic NACBSs in oligo2 by ANAC019 in a
concentration-dependent manner (Supplementary Figure S2 at
http://www.BiochemJ.org/bj/444/bj4440395add.htm).
Uranyl photo-cleavage can be used to study phosphate contacts
involved in protein– and ligand–DNA complexes [32,33]. The
technique exploits the fact that the uranyl-(VI) cation (UO2 2 + )
forms strong complexes with accessible phosphates of the DNA
backbone and upon irradiation is excited, which initiates a
complex photo-oxidation process eventually leading to strand
break. Binding of a protein to DNA protects phosphates near the
binding site from oxidation. Using this method, a dsON model of
specific protein–phosphate contacts was built using full titrations
(0, 10–200 ng) of ANAC019 (Figure 2A). On the basis of this
model, the DNA sequence in the crystal structure was assigned.
The differential cleavage plot (Figure 2B) shows a nearly perfect
symmetrical pattern when both strands are compared. We thus
conclude that the two monomers in the dimer interact similarly
with the backbone phosphates in each of the individual NACBS
in the palindrome, and no additional binding site was detected
that could support a ternary complex in solution.
Solution structures of the free NAC domain and the
NAC–DNA complex
SAXS gives low resolution information on the structure of
macromolecules and macromolecular complexes in solution.
The data obtained (see Figure 3A, the Supplementary online
data and Supplementary Table S2 at http://www.BiochemJ.org/
bj/444/bj4440395add.htm for further details) show that the free
Figure 3
SAXS analysis
(A) SAXS data for the free ANAC019 NAC domain in solution is shown alongside the CRYSOL
theoretical scattering curve for one of the published crystal structures (PDB code 1UT7) and for
a ‘completed’ model where loops and termini missing in the crystal structure have been built
and the termini perturbed using elNémo [49] (more details in the Supplementary online data
at http://www.BiochemJ.org/bj/444/bj4440395add.htm). χ-Values for the fits are shown on the
Figure. (B) Ab initio DAMFILT model (P1) based on the SAXS data shown in (A) and superposed
with the NAC dimer from the crystal structure. (C) Ab initio DAMFILT models based on SAXS
data (shown in D) from SEC fractions B1 and B3 compared with the Native_complex crystal
structure. The ternary complex is shown in cartoon with the AB dimer in magenta, the CD dimer
in green and the DNA in black. It is superposed using DAMSUP with the DAMFILT models
from B1 (blue mesh) and B3 (yellow spheres). The ab initio reconstructions are consistent
with fraction B1 containing DNA and fraction B3 containing binary complex. (D) SAXS data
from SEC fractions B1–B3 are shown together with the theoretical scattering curves for selected
models derived from the complex crystal structure. For fraction B2 a fit for a model obtained
with OLIGOMER containing 82 % free DNA and 18 % binary complex is additionally shown.
χ-Values for the fits are shown on the Figure. Residual curves highlight the goodness of the
fits at different values of S.
NAC dimer in solution is primarily symmetrical within the
resolution of SAXS (Figure 3B). Evidence for this comes
from ab initio shape modelling imposing 2-fold symmetry,
giving χ-values very similar to the ones obtained without
symmetry imposed (see the Supplementary online data). The
newly published crystal structure of the SNAC1 (stress-responsive
NAC 1) NAC domain [39] also supports the notion that NAC
domains in general form symmetrical dimers, since in the SNAC1
structure the two monomers in the NAC dimer are related by
crystallographic symmetry and are thus identical, although local
asymmetry has been noted previously for the ANAC019 NAC
domain crystal structure [12]. The differences noted in this crystal
structure are beyond the resolution of the SAXS technique. The
c The Authors Journal compilation c 2012 Biochemical Society
400
D. H. Welner and others
SAXS data for free NAC are reasonably well represented by the
known crystal structures of the free ANAC019 NAC domain
(the fit to the structure with PDB code 1UT7 is shown as example
in Figure 3B and Supplementary Table S2). The fit is further
improved by modelling loops and termini missing in the crystal
structure, as can also be seen in Figure 3(B) and Supplementary
Table S2 (see the Supplementary online data for details on how
this complete model was generated). These loops and termini
most likely adopt a variety of conformations in solution.
In order to isolate the complex, SEC (size-exclusion
chromatography) on a NAC/DNA mixture was carried out and
yielded an asymmetric peak (eluting at 10.2 ml, results not
shown), which we interpreted as containing one or more complex
species as well as free DNA. SAXS data could be collected from
three individual peak fractions, designated B1 to B3, where B1
represents the higher-elution-volume fraction. The experimental
curves were compared with different models: (i) oligo2; (ii) the
expected binary complex of one NAC dimer to one dsON created
by removing one dimer from the Native_complex structure;
and (iii) the ternary complex seen in the crystal structure. Free
protein species were excluded, since they elute at higher volume
(∼ 11.5 ml). CRYSOL analysis [40] showed that although the
curves from fractions B1 and B2 fit the free dsON model better
(χ-values of 1.3 and 1.7 respectively), fraction B3 yields an
improved fit using the binary complex as model (χ-value = 1.1)
(Figure 3D). Analysis of the scattering curves with OLIGOMER
[41] enhances the fit to the B2 data by adding 18 % binary complex
to 82 % free oligo2 (Figure 3D), but did not detect the presence of
ternary complex in any of the fractions. Ab initio modelling with
DAMMIN (Figure 3C) supported the CRYSOL/OLIGOMER
results, in that the model from fraction B1 fitted well with free
oligo2 and the model from fraction B3 fitted well with the binary
complex, but could not accommodate the ternary complex. The
relevant calculated and experimental Rg -values are provided in
Supplementary Table S2.
That NAC binds DNA as a dimer has been shown previously
[13]. The DNase I footprinting and uranyl photo-cleavage
data in the present study are fully consistent with symmetric
binding of one NAC dimer to one oligo2 in solution. Additional
higher-molecular-mass bands are observed in EMSA when high
amounts (100–500 ng) of NAC domain are used (e.g. in [13]),
which we interpret as non-specific binding. In the crystal the
high concentrations and dsON arrangement might also favour
non-specific binding leading to an artifactual ternary complex.
Recognition of non-optimal DNA sequences may have relevance
in vivo where many target sequences have single NACBSs,
which must entail non-specific recognition by one of the NAC
monomers.
ANAC019 residues interacting with DNA
As it was not known a priori which of the NAC dimers in the
crystal best represents the interactions with DNA in solution,
the interactions of all monomers with DNA were analysed. First,
an analysis of the surface and contact areas was performed.
The interface between DNA and protein is of comparable size
2
for chains A, B and D (695 +
− 80 Å ). The contact area for chain B
is shown in Figure 4(A), and as illustrated in Figure 4(B) overlaps
well, as expected, with a large positively charged surface patch.
The contact area of chain C, however, is only 129 Å2 , indicating
that chain C is only loosely associated with DNA.
The residues pinpointed by AreaImol were grouped into six
clusters on the basis of their secondary structure (Table 2 and
Figure 5A). The interacting groups are similar for monomers
c The Authors Journal compilation c 2012 Biochemical Society
Figure 4
Important areas of the ANAC019 NAC domain
For simplicity only one NAC monomer is shown (chain B). (A) Surface residues in the NAC–DNA
interface as defined by AreaImol are highlighted in blue. (B) The electrostatic surface generated
with PyMOL. (C) Conserved residues are highlighted in green and residues that are similar in
more than half of the ten aligned NAC domain sequences are coloured yellow. This panel is
based on the sequence alignment of Jensen et al. [16]. (D) Red spheres indicate the residues
found not to be essential for DNA binding in [13]. In the present study we further tested single
mutants of Arg85 (not modelled in crystal structures) and Arg88 (green) and found that Arg88 is
necessary for DNA binding. (E) Important residues of the ANAC019 NAC domain. Conserved
residues are in black boxes and residues that are similar in more than half of the sequences are
in grey boxes, according to the alignment of [16]. Above, blue bars indicate residues at the DNA
interface in any of the four NAC monomers, see Table 2. Stars indicate residues that have been
mutated and found by EMSA to be essential for DNA binding (green) or not (red) in accordance
with (D). An open circle indicates an arginine or a lysine residue close enough to DNA to be an
interaction partner (see Table 3).
A, B and D. On the basis of the fact that uranyl photo-probing
shows a symmetric interaction between NAC and the investigated
DNA sequence, dimer AB probably best represents the interaction
with oligo2 in solution, and thus our further analysis focuses
on this dimer. Group 2 protrudes into the DNA helix and
interacts with the sugar/base region of DNA. These residues
are probably responsible for specific recognition. Their sequence
(WKATGTD) is highly conserved, except for the second threonine
residue [16], and has been suggested to be reminiscent of the
WRKYGQK motif [42], which is responsible for DNA binding
in WRKY TFs [19,21] (see below). Groups 1, 4 and 6 are part
of loop regions and each contains one or more arginine or lysine
residues with adequate distances to DNA to allow interaction with
the backbone (see Table 3 for details). These residues are likely to
contribute to the overall affinity of DNA binding. The direct role of
groups 3 and 5 in DNA binding is arguable, since they are part of
the central β-sheet and do not contain potential phosphate-binding
residues. In the remaining groups, Arg88 , Lys96 , Lys123 , Lys129
and Lys162 can potentially interact with backbone phosphates and
are all conserved (Figures 4C and 4E), except for Lys123 , which
is either lysine or arginine, and in one instance proline [VOZ2
(vascular plant one-zinc finger protein 2)] [16]. Residues 79–85
and 144–151 as well as part of the N- and C-terminal could not
be modelled in the crystal structure and were not included in the
DNA binding by NAC plant transcription factors
Table 3
401
Arginine and lysine residues in the ANAC019 NAC domain
The shortest distance (in Å) to a DNA backbone atom in the Native_complex structure was
measured from the Cβ-atoms of all lysine and arginine residues. The maximum distance from
Cβ to Nζ of lysines is approximately 4.8 Å, and that from Cβ to NH2 of arginine residues is
approximately 6.0 Å. Giving a bonding length of approximately 3 Å, the maximum distance from
Cβ to a DNA backbone atom must be 7.8 Å for lysine and 9.0 Å for arginine residues in order for
them to be probable candidates for DNA-interacting residues. Those that meet this criterium are
underlined. This approach allows identification of potentially interacting residues independent
of the accuracy of side-chain-atom positions. Conserved residues are in bold, according to the
multiple sequence alignment of [16]. Residues not modelled in the crystal structure are not
taken into consideration.
Distances (Å)
Residue name
Chain A
Chain B
Arg19
Arg34
Lys35
Lys53
Lys62
Lys68
Arg76
Arg88
Lys96
Lys102
Arg110
Lys114
Lys115
Lys123
Lys126
Lys129
Arg138
Arg158
Lys161
Lys162
15.4
25.1
25.7
15.4
20.6
11.6
12.5
6.0
6.9
9.1
13.5
9.1
9.7
6.4
8.3
5.2
10.8
10.0
9.2
4.8
15.4
24.1
23.6
16.0
22.0
12.3
12.8
7.1
7.5
8.5
15.0
8.9
9.1
7.5
10.0
4.9
11.0
10.1
9.0
4.6
analysis, but some of these residues could potentially interact in
the complex, for example Arg85 .
The K d value of the interaction between the ANAC019 NAC
domain and oligo2 is estimated in the present study by EMSA to be
24.2 +
− 6.2 nM. We have previously reported that a K123A/K126A
variant of the NAC domain was not affected in its DNA binding
as measured by EMSA, whereas the R85A/R88A variant was
impaired [13]. To further investigate this and other potentially
interacting residues, the single mutants R85A, R88A, K129A
and K162A were made and analysed by EMSA. The ANAC019
residues that have been mutated to date are mapped on to the
structure of the complex in Figure 4(D).
The R88A mutant showed significant impairment in DNA
binding (Supplementary Figure S3 at http://www.BiochemJ.
org/bj/444/bj4440395add.htm). On the other hand, the other
single mutations had no detectable effect (see for example
R85A, also shown in Supplementary Figure S3), indicating K d
values similar to the native or changes below the detection level
of EMSA. Arg85 is absent from all of the currently available
structures of ANAC019 NAC domains, including the complexes
with DNA, but is included in the recently reported structure
of SNAC1 NAC domain [39]. Though the resolution of this
structure (2.6 Å) is lower than the highest resolution obtained
for the ANAC019 NAC domain (1.9 Å for PDB code 1UT7),
some additional residues could be traced in the missing loops,
including Arg85 . If the structure of SNAC1 (PDB code 3ULX) is
superposed on to the AB dimer in the Native_complex structure,
Arg88 is in a conformation ideally positioned for interaction with
one of the DNA phosphate groups, whereas Arg85 is pointing away,
offering a possible structural explanation for the mutagenesis
results. Although the implication of Lys129 and Lys162 in binding
Figure 5
DNA interactions and dimer flexibility
(A) The residues contacting the DNA according to the contact surface analysis are shown,
together with a cartoon representation of the DNA and a transparent depiction of the NAC
surface. Groups 3 and 5 are shown in green, to illustrate the fact that these residues are part of
the core β-sheet. Groups 1 and 6 (blue) as well as 4 (red) contact the DNA backbone and are
shown in sticks. The interesting group 2, which protrudes into the major groove, is depicted
in yellow sticks. (B) Dimer arrangement in different crystal structures of NAC, representing the
open, semi-closed and closed dimer arrangements. The B monomers of a previously solved free
ANAC019 NAC structure (open arrangement, PDB code 1UT7, in green, N-terminal fragment
connected with a broken line), the AB dimer (semi-closed arrangement, from Native_complex, in
yellow) and CD dimer (closed arrangement, from Native_complex, in red) were superimposed,
showing clearly that A monomers are not aligned with each other. (C) Superimposition of the
DNA-binding strands of GCM (MRNTNNHN, PDB code 1ODH) (cyan) and NAC (WKATGTDK)
(green). The DNA from the GCM structure is shown in grey together with the DNA from the NAC
complex structure as a black ribbon. (D) Superimposition of the DNA-binding strands of WRKY
(WRKYGQK, PDB code 2LEX) (cyan) and NAC (WKATGTDK) (green). The DNA from the
WRKY structure is shown in grey together with the DNA from the NAC complex structure
as a black ribbon.
is strongly suggested by the structures in complex with DNA
reported in the present study, it could be that removal of a single
one of these side chains does not change the affinity sufficiently
for EMSA detection. In conclusion our mutagenesis studies show
that, out of the residues mutated so far, Arg88 is the most important
for DNA affinity.
NAC dimer flexibility
A certain amount of flexibility in the dimer arrangement is
evidenced by comparison (Figure 5B) of the structures of
complexes shown in the present study, the previously solved
native structures (PDB codes 1UT7 and 1UT4), and the newly
determined native structure to 3.0 Å resolution (form IV, the
Experimental section and the Supplementary online data for
details). The degree of closure in the dimer was quantified as
an angle between the domains (see the Experimental section for
details) and values for the different crystal structures of ANAC019
and for comparison also of the newly determined SNAC1 structure
[39] are tabulated in Table 4. The values fall roughly into three
groups, representing what we refer to as open, semi-closed and
closed dimer arrangements, with angles of approximately 120 ◦ ,
110 ◦ and 100 ◦ between monomers.
In order to see how well the various dimer arrangements are
represented in solution, three models were made and compared
with the SAXS data for the free NAC domain. In all cases
c The Authors Journal compilation c 2012 Biochemical Society
402
D. H. Welner and others
Table 4 Angles between dimers in crystal structures of NAC domains,
calculated as described in the Experimental section
Crystal structure
Angle
Dimer arrangement
Free ANAC019 (PDB code 1UT4)
Free ANAC019 (PDB code 1UT7)
Free ANAC019 (Form IV)
Native_complex AB dimer
Au_complex AB dimer
Free SNAC1 [39] (PDB code 3ULX, crystallographic dimer)
Native_complex CD dimer
Au_complex CD dimer
120 ◦
119 ◦
108 ◦
107 ◦
107 ◦
97 ◦
102 ◦
100 ◦
Open
Open
Semi-closed
Semi-closed
Semi-closed
Closed
Closed
Closed
the models were based on NAC monomers from the highest
resolution free ANAC019 crystal structure (PDB code 1UT7),
completed to account for missing loops and termini as described
in the Supplementary online data, and for which the CRYSOL
fit is shown in Figure 3(A). The CRYSOL fit of the three
dimer models were compared: an open arrangement (119 ◦ angle),
representing the original free ANAC019 crystal structures; a semiclosed arrangement (107 ◦ angle), representing the form IV free
ANAC019 crystal structure and the AB dimers in the crystal
complexes with DNA; and a closed arrangement (100 ◦ angle),
representing the free SNAC1 structure and the CD dimers in the
complexes. As one approaches more closed dimer arrangements,
both the fit to the experimental Rg - and the χ-values became worse,
as shown in Supplementary Table S2. Our conclusion is that in
solution the free NAC dimer is mostly in an open conformation.
To a minor extent, more closed conformations may also exist in
solution, and lead to form IV and the SNAC1 crystals. From the
evidence so far, it would appear that on the DNA, NAC proteins
take more closed conformations, but it cannot be excluded that
NAC can also bind DNA in the open dimer arrangement. A
possible residue involved in mediating the transition from open
to closed conformation is Tyr21 . Tyr21 in form IV of ANAC019
and the corresponding His21 in SNAC1 are interestingly more
similar than in the original free ANAC019 crystal structures.
However, especially since the resolution of both form IV and
SNAC1 structures is limited, this suggestion should be confirmed
by further experimental investigations.
The recognition of single NACBSs in vivo by the homodimeric
NAC domains [38] would entail the binding of one of the
monomers to a suboptimal nucleotide sequence. The ability
of the dimer to open and close slightly would provide a
molecular mechanism by which the NAC dimer binds to a highaffinity site with one monomer, whereas the other monomer can
then sample the nearby bases for the second highest-affinity
binding site available in non-palindromic sequences. Furthermore
this flexibility explains how NAC can recognize palindromic
sequences in vitro which are separated by a variable number
of bps: 5 bp as described in [15]; 6 bp in oligo2; and 7 and
8 bp in variants of oligo2, although a drop in affinity is observed
(S. Lindemose and K. Skriver, unpublished work).
DNA conformation
An ideal B-DNA model of oligo2 was used as a search model
in MR, and this form is largely conserved after refinement. DNA
bending is not uncommon with TFs [43] and the related GCM
TF (see section below) bends DNA 30 ◦ [44]. However, the very
recent report of DNA complex of a WRKY domain, which is
also structurally related to NAC domains (see below), shows
only slight distortion from B-DNA conformation [19]. We see
no evidence of DNA bending in the complex crystal structures,
c The Authors Journal compilation c 2012 Biochemical Society
but one could imagine a scenario where one NAC dimer bends
DNA in one direction and the other NAC dimer reverses this,
so that the DNA still can pack in straight fibres in the crystal.
Slight distortions like this would be undetected at this resolution
or could be lost in the crystal if the lattice favours an unbent DNA
conformation.
In the uranyl photo-probing experiments some phosphates
exhibit hypersensitivity towards uranyl cleavage (positive values
in Figure 2B). This is most pronounced for the 5 -GCGTGTTG-3
sequence (and 5 -GCGTGTTC-3 on the lower strand). This may
indicate a local distortion in the DNA structure upon ANAC019
binding which will elicit a change in the electrostatic potential of
the (minor) groove, thus creating phosphates more accessible to
uranyl photo-cleavage [31]. However, the exact nature and extent
of the structural changes cannot be deduced.
Similarity to the WRKY and GCM TFs
An analogy between NAC, WRKY and GCM transcription factors
has been suggested previously [13,21,42]. The present structure
confirms this relationship and shows that the DNA-interaction
mode is indeed analogous to that of GCM [44] and WRKY, for
which very recently an NMR structure in complex with DNA
has been published [19]. Although there are clear differences,
for example GCM TFs are monomeric, the three proteins have
a central β-sheet with very similar topology. When the sheets
are superimposed, the DNA-binding strand of NAC aligns with
the DNA-binding strands of GCM and WRKY, with a resulting
similar orientation of the bound dsONs (Figures 5C and 5D).
A glycine residue in NAC (Gly99 ) and WRKY has been noted
to induce an unusual curvature of the DNA-binding strand [21].
From the superimpositions (Figures 5C and 5D) it is clear that this
is a point of divergence in the Cα traces of the three strands, which
otherwise align well. The importance of this glycine is underlined
by the dysfunction of the sog1-1 (suppressor of gamma response)
NAC mutant protein caused by a change of the glycine residue
into an arginine [45].
The present study highlights the similarity in the DNA-binding
mode of NAC, WRKY and GCM. The similarities, previously
only suggested, but now firmly established, together with similar
topologies of the central β-sheet have both previously led to the
proposal of a common origin of the three different TF families
[19,42]. However, since similarities are not apparent overall at the
primary structure level, other explanations, such as mechanistic
convergence of DNA-binding by the β-strand, are also possible.
Concluding remarks
Despite great interest in plant stress and associated NAC TFs,
there is very little molecular description of the interaction with
DNA. We therefore undertook the study of the interaction of
ANAC019, a NAC TF whose structure we have previously solved
[12], with DNA. We show in the present study the first crystal
structure of a NAC–DNA complex and complement it with
solution studies. The structure shows that ANAC019 interacts
with DNA via the curved outer β-strand of its core β-sheet which
encompasses the conserved WKAT sequence, and establishes
firmly the relationship with WRKY and GCM transcription
factors. The structural similarity among these TFs, coupled with
differences in DNA sequence preference, could inspire protein
engineering aimed at modulating target gene specificity. Further,
we show indications of NAC dimer flexibility as a possible
mechanism for adapting to different spacings between binding
sites, adding NAC to the list of TFs for which dimer flexibility may
play a role in biological function, a classic example being λ phage
DNA binding by NAC plant transcription factors
Cro [46]. The structure provides the first framework to understand
the interactions of NAC TFs with DNA at the molecular level.
This can guide the study of the molecular mechanisms of NAC
TF cellular function with resulting increased ability to exploit the
NAC proteins in technological applications.
AUTHOR CONTRIBUTION
The planning of experiments, overall analysis and writing of the paper were performed by
Ditte Welner, Leila Lo Leggio and Karen Skriver; Søren Lindemose and Charlotte Helgstrand
additionally contributed to writing the paper. All authors contributed to experiments and/or
analysis of the experimental data: Søren Lindemose and Niels Erik Møllegaard mostly
for the DNase I footprinting and uranyl photo-cleavage; Ditte Welner and Leila Lo Leggio
mostly for the crystallography and SAXS with additional contributions from Charlotte
Helgstrand (crystallography) and J. Günter Grossmann (SAXS); and Addie Olsen and
Karen Skriver mostly for the mutagenesis.
ACKNOWLEDGEMENTS
We thank Heidi Ernst for helpful discussions, Charlotte O’Shea, Jens-Christian Poulsen
and Dorthe Boelskifte for technical help and the synchrotrons (MAXLAB and ESRF) for
beamtime.
FUNDING
This work was supported by the Danish Agency for Science Technology and Innovation
[grant numbers 10-084503 (to L.L.L.), 274-07-0173 (to K.S.) and 10-093596 (to S.L.).
Travel to synchrotrons was supported by the DANSCATT programme, funded by the Danish
Agency for Science Technology and Innovation [grant number 09-074479] and the ELISA
programme, funded by the European Community’s Seventh Framework Programme [grant
number 226716].
REFERENCES
1 Ooka, H., Satoh, K., Doi, K., Nagata, T., Otomo, Y., Murakami, K., Matsubara, K., Osato,
N., Kawai, J., Carninci, P. et al. (2003) Comprehensive analysis of NAC family genes in
Oryza sativa and Arabidopsis thaliana . DNA Res. 10, 239–247
2 Rushton, P. J., Bokowiec, M. T., Han, S., Zhang, H., Brannock, J. F., Chen, X., Laudeman,
T. W. and Timko, M. P. (2008) Tobacco transcription factors: novel insights into
transcriptional regulation in the Solanaceae . Plant Physiol. 147, 280–295
3 Nuruzzaman, N., Manimekalai, R., Sharoni, A. M., Satoh, K., Kondoh, H., Ooka, H. and
Kikuchi, S. (2010) Genome-wide analysis of NAC transcription factor family in rice. Gene
465, 30–44
4 Aida, M., Vernoux, T., Furutani, M., Traas, J. and Tasaka, M. (2002) Roles of
PIN-FORMED1 and MONOPTEROS in pattern formation of the apical region of the
Arabidopsis embryo. Development 129, 477–486
5 Olsen, A. N., Ernst, H. A., Lo Leggio, L. and Skriver, K. (2005) NAC transcription factors:
structurally distinct, functionally diverse. Trends Plant Sci. 10, 79–87
6 Carvallo, M. A., Pino, M. T., Jeknic, Z., Zou, C., Doherty, C. J., Shiu, S. H., Chen, T. H. and
Thomashow, M. F. (2011) A comparison of the low temperature transcriptomes and CBF
regulons of three plant species that differ in freezing tolerance: Solanum commersonii ,
Solanum tuberosum , and Arabidopsis thaliana . J. Exp. Bot. 62, 3807–3819
7 Hu, R. B., Qi, G., Kong, Y., Kong, D., Gao, Q. and Zhou, G. (2010) Comprehensive
analysis of NAC domain transcription factor gene family in Populus trichocarpa . BMC
Plant Biol. 10, 145
8 Shen, H., Yin, Y. B., Chen, F., Xu, Y. and Dixon, R. A. (2009) A bioinformatic analysis of
NAC genes for plant cell wall development in relation to lignocellulosic bioenergy
production. Bioenerg. Res. 2, 217–232
9 Jeong, J. S., Kim, Y. S., Baek, K. H., Jung, H., Ha, S.-H., Choi, Y. D., Kim, M., Reuzeau, C.
and Kim, J.-K. (2010) Root–specific expression of OsNAC10 improves drought tolerance
and grain yield in rice under field drought conditions. Plant Physiol. 153, 185–197
10 Uauy, C., Distelfeld, A., Fahima, T., Blechl, A. and Dubcovsky, J. (2006) A NAC gene
regulating senescence improves grain protein, ainc, and iron content in wheat. Science
314, 1298–1301
11 Greve, K., La Cour, T., Jensen, M. K., Poulsen, F. M. and Skriver, K. (2003) Interactions
between plant RING-H2 and plant-specific NAC (NAM/ATAF1/2/CUC2) proteins:
RING-H2 molecular specificity and cellular localization. Biochem. J. 371, 97–108
12 Ernst, H. A., Olsen, A. N., Skriver, K., Larsen, S. and Lo Leggio, L. (2004) Structure of the
conserved domain of ANAC, a member of the NAC family of transcription factors. EMBO
Rep. 5, 297–303
403
13 Olsen, A. N., Ernst, H. A., Lo Leggio, L. and Skriver, K. (2005) DNA-binding specificity
and molecular functions of NAC transcription factors. Plant Sci. 169, 785–797
14 Mitsuda, N., Hisabori, T., Takeyasu, K. and Sato, M. H. (2004) VOZ: isolation and
chracterization of novel vascular plant transcription factors with a one-zinc finger from
Arabidopsis thaliana . Plant Cell Physiol. 45, 845–854
15 Xue, G.-P. (2005) A CELD-fusion method for rapid determination of the DNA-binding
sequence specificity of novel plant DNA-binding proteins. Plant J. 41, 638–649
16 Jensen, M. K., Kjaersgaard, T., Nielsen, M. M., Galberg, P., Petersen, K., O’Shea, C. and
Skriver, K. (2010) The Arabidopsis thaliana NAC transcription factor family:
structure–function relationships and determinants of ANAC019 stress signaling.
Biochem. J. 426, 183–196
17 Kunieda, T., Mitsuda, N., Ohme-Takagi, M., Takeda, S., Aida, M., Tasaka, M., Kondo, M.,
Nishimura, M. and Hara-Nishimura, I. (2008) NAC family proteins NARS1/NAC2 and
NARS2/NAM in the outer integument regulate embryogenesis in Arabidopsis . Plant Cell
20, 2631–2642
18 Rushton, P. J., Somssich, I. E., Ringler, P. and Shen, Q. J. (2010) WRKY transcription
factors. Trends Plant Sci. 15, 247–258
19 Yamasaki, K., Kigawa, T., Watanabe, S., Inoue, M., Yamasaki, T., Seki, M., Shinozaki, K.
and Yokohama, S. (2012) Structural basis for sequence-specific DNA recognition by an
Arabidopsis WRKY transcription factor. J. Biol. Chem. 287, 7683–7691
20 Wegner, M. and Riethmacher, D. (2001) Chronicles of switch hunt: gcm genes in
development. Trends Genet. 17, 286–290
21 Yamasaki, K., Kigawa, T., Inoue, M., Watanabe, S., Tateno, M., Seki, M., Shinozaki, K. and
Yokoyama, S. (2005) Solution structure of an Arabidopsis WRKY DNA binding domain.
Plant Cell 17, 944–956
22 Tan, S., Hunziker, Y., Pellegrini, L. and Richmond, T. J. (2000) Crystallization of the yeast
MATa2/MCM1/DNA ternary complex: general methods and principles for protein/DNA
cocrystallization. J. Mol. Biol. 297, 947–959
23 Olsen, A. N., Ernst, H. A., Lo Leggio, L., Johansson, E., Larsen, S. and Skriver, K. (2004)
Preliminary crystallographic analysis of the NAC domain of ANAC, a member of the
plant-specific NAC transcription factor family. Acta Crystallogr. Sect. D Biol. Crystallogr.
D60, 112–115
24 Otwinowski, Z. and Minor, W. (1997) Processing of X-ray diffraction data collected in
oscillation mode. Methods Enzymol. 276, 307–326
25 Kabsch, W. (1993) Automatic processing of rotation diffraction data from crystals of
initially unknown symmetry and cell constants. J. Appl. Crystallogr. 26, 795–800
26 McCoy, A. J., Grosse-Kunstleve, R. W., Adams, P. D., Winn, M. D., Storoni, L. C. and
Read, R. J. (2007) Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674
27 Adams, P. D., Grosse-Kunstleve, R. W., Hung, L.-W., Ioerger, T. R., McCoy, A. J., Moriarty,
N. W., Read, R. J., Sacchettini, J. C., Sauter, N. K. and Terwilliger, T. C. (2002) PHENIX:
building new software for automated crystallographic structuredetermination. Acta
Crystallogr. Sect. D Biol. Crystallogr. D58, 1948–1954
28 Murshudov, G. N., Skubák, P., Lebedev, A. A., Pannu, N. S., Steiner, R. A., Nicholls, R. A.,
Winn, M. D., Long, F. and Vagin, A. A. (2011) Acta Crystallogr. Sect. D Biol. Crystallogr.
D67, 355–367
29 Emsley, P., Lohkamp, B., Scott, W. and Cowtan, K. (2010) Features and development of
Coot. Acta Crystallogr. Sect. D Biol. Crystallogr. D66, 486–501
30 Collaborative Computational Project Number 4 (1994) The CCP4 suite: programs for
protein crystallography. Acta Crystallogr. Sect. D Biol. Crystallogr. D50, 760–763
31 Potterton, E., Briggs, P., Turkenburg, M. and Dodson, E. (2003) A graphical user interface
to the CCP4 program suite. Acta Crystallogr. Sect. D Biol. Crystallogr. D59, 1131–1137
32 Lindemose, S., Nielsen, P. E. and Møllegaard, N. E. (2005) Polyamines preferential interact
with bent adenine tracts in double-stranded DNA. Nucleic Acids Res. 33, 1790–1803
33 Lindemose, S., Nielsen, P. E. and Møllegaard, N. E. (2008) Dissecting direct and indirect
readout of cAMP receptor protein DNA binding using an inosine and 2,6-diaminopurine
in vitro selection system. Nucleic Acids Res. 36, 4797–4807
34 Lindemose, S., Nielsen, P. E., Hansen, M. and Møllegaard, N. E. (2011) A DNA minor
groove electronegative potential genome map based on photo-chemical probing. Nucleic
Acids Res. 39, 6269–6276
35 Das, R., Laederach, A., Pearlman, S. M., Herschlag, D. and Altman, R. B. (2005) SAFA:
semi-automated footprinting analysis software for high-throughput quantification of
nucleic acid footprinting experiments. RNA 11, 344–354
36 Nielsen, S. S., Toft, K. N., Snakenborg, D., Jeppesen, M. G., Jakobsen, J. K., Vestergaard,
B., Kutter, J. P. and Arleth, L. (2009) BioXTAS RAW, a software program for
high-throughput automated small-angle X-ray scattering data reduction and preliminary
analysis. J. App. Crystallogr. 42, 959
37 Konarev, P. V., Petoukhov, M. V., Volkov, V. V. and Svergun, D. I. (2006) ATSAS 2.1, a
program package for small-angle scattering data analysis. J. Appl. Crystallogr. 39,
277–286
38 Tran, L. S., Nakashima, K., Sakuma, Y., Simpson, S. D., Fujita, Y., Maruyama, K., Fujita,
M., Seki, M., Shinozaki, K. and Yamaguchi-Shinozaki, K. (2004) Isolation and functional
analysis of Arabidopsis stress-inducible NAC transcription factors that bind to a
drought-responsive cis -element in the early responsive to dehydration stress 1 promoter.
Plant Cell 16, 2481–2498
c The Authors Journal compilation c 2012 Biochemical Society
404
D. H. Welner and others
39 Chen, Q., Wang, Q., Xiong, L. and Lou, Z. (2011) A structural view of the conserved
domain of rice stress-responsive NAC1. Protein Cell 2, 55–63
40 Svergun, D., Barberato, C. and Koch, M. H. J. (1995) CRYSOL: a program to evaluate
X-ray solution scattering of biological macromolecules from Aatomic coordinates. J.
Appl. Crystallogr. 28, 768–773
41 Konarev, P. V., Volkov, V. V., Sokolova, A. V., Koch, H. J. and Svergun, D. I. (2003)
PRIMUS: a Windows PC-based system for small-angle scattering data analysis. J. Appl.
Crystallogr. 36, 1277–1282
42 Yamasaki, K., Kigawa, T., Inoue, M., Watanabe, S., Tateno, M., Seki, M., Shinozaki, K. and
Yokoyama, S. (2008) Structures and evolutionary origins of plant-specific transcription
factor DNA-binding domains. Plant Physiol. Biochem. 46, 394–401
43 Locasale, J. W., Napoli, A. A., Chen, S., Berman, H. M. and Lawson, C. L. (2009)
Signatures of protein–DNA recognition in free DNA binding sites. J. Mol. Biol. 286,
1054–1065
44 Cohen, S. X., Moulin, M., Hashemolhosseini, S., Kilian, K., Wegner, M. and Müller, C. W.
(2003) Structure of the GCM domain–DNA complex: a DNA-binding domain with a novel
fold and mode of target site recognition. EMBO J. 22, 1835–1845
Received 14 October 2011/27 March 2012; accepted 29 March 2012
Published as BJ Immediate Publication 29 March 2012, doi:10.1042/BJ20111742
c The Authors Journal compilation c 2012 Biochemical Society
45 Yoshiyama, K., Conklin, P. A., Huefner, N. D. and Britt, A. B. (2009) Suppressor
of gamma response 1 (SOG1) encodes a putative transcription factor governing
multiple responses to DNA damage. Proc. Natl. Acad. Sci. U.S.A. 106,
12843–12848
46 Ahlstrom, L. S. and Miyashita, O. (2011) Molecular simulation uncovers the
conformational space of the λ Cro dimer in solution. Biophys. J. 101,
2516–2524
47 Diederichs, K. and Karplus, P. A. (1997) Improved R-factors for diffraction data analysis in
macromolecular crystallography. Nat. Struct. Biol. 4, 269–275
48 Lovell, S. C., Davis, I. W., Arendall, III, W. B., de Bakker, P. I. W., Word, J. M.,
Prisant, M. G., Richardson, J. S. and Richardson, D. C. (2003) Structure validation
by Cα geometry: phi, psi and Cβ deviation. Proteins: Struct. Funct. Genet. 50,
437–450
49 Suhre, K. and Sanejouand, Y. H. (2004) ElNemo: a normal mode web-server for protein
movement analysis and the generation of templates for molecular replacement. Nucleic
Acids Res. 32, W610–W614
Biochem. J. (2012) 444, 395–404 (Printed in Great Britain)
doi:10.1042/BJ20111742
SUPPLEMENTARY ONLINE DATA
DNA binding by the plant-specific NAC transcription factors in crystal
and solution: a firm link to WRKY and GCM transcription factors
Ditte H. WELNER*, Søren LINDEMOSE†, J. Günter GROSSMANN‡, Niels Erik MØLLEGAARD§, Addie N. OLSEN†,
Charlotte HELGSTRAND*, Karen SKRIVER† and Leila LO LEGGIO*1
*Biophysical Chemistry Group, Department of Chemistry, University of Copenhagen, Universitetsparken 5, 2100 Copenhagen Ø, Denmark, †Section for Biomolecular Sciences,
Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen N, Denmark, ‡Molecular Biophysics Group, Institute of Integrative Biology, Faculty of Health
and Life Sciences, The University of Liverpool, Crown Street, Liverpool L69 7ZB, U.K., and §Department of Cellular and Molecular Medicine, Panum Institute, Faculty of Health
Sciences, University of Copenhagen, Blegdamsvej 3, 2200 Copenhagen N, Denmark
METHODS
Table S1
Structure determination of a new crystal form of the ANAC019 NAC
domain
The reported statistics are from Phaser [3]. A final Z-score above 8 indicates that Phaser has
definitely solved the structure according to the PhaserWiki (http://www.phaser.cimr.cam.ac.uk).
Furthermore, LLG should increase for each added component, which is also the case here. N/A,
not applicable.
The structure was determined by MR, which gave solutions with
two monomers in the asymmetric unit with CNS [1], MOLREP
[2] or Phaser [3] within the PHENIX suite [4] [Z and LLG (log
likelihood gain) scores after placing of the first monomer were
13.7 and 331 respectively, Z and LLG scores after placing the
second monomer were 19.6 and 622 respectively]. Despite high
initial R-factors (around 50 %), poor density for the B molecule
especially and difficulties in refinement, the MR solution was
deemed correct, as it showed a similar region forming the dimer
interface to the one previously observed in the high resolution
crystal structure, although the respective orientation of the two
monomers is not identical.
After testing several refinement protocols in various programs,
the structure was initially refined with Phenix.refine [4] alternated
with manual rebuilding in Coot [5]. The final refinement strategy
included positional refinement of only few rebuilt residues at each
round, and TLS (Translation/Libration/Screw) group refinement
for chains A and B separately. However, using this strategy the
structure could only be refined to an R-factor of 32.3 % and Rfree of 37.0 % with RMSD (root mean square density) in bond
lengths of 0.021 Å and RMSD in bond angles of 3.215◦ and
poor Ramachandran geometry. The model was at this point rather
incomplete, with only 210 built residues out of 336 amino acid
residues in total. The B molecule was particularly incomplete with
only 83 out of 168 amino acids built because of the corresponding
density being of very poor quality. The poor quality density
could be partly accounted for by fewer crystal contacts for the
B molecule than the A molecule.
Subsequently, the so-called jelly-body refinement in Refmac5
[6] was tried as an alternative refinement strategy at low resolution.
This proved more successful. Jelly-body restraints were combined
with automatic detection of local NCS (non-crystallographic
symmetry) and group TLS refinement with one group per protein
chain. The refinement proved much more robust than previously
observed and also led to improvement of the electron density,
so that 40 additional residues could be built. The final model
is about 75 % complete and contains 250 out of 336 total
residues, including 111 out 168 amino acids for monomer B. For
comparison, 297 residues were built for the deposited free NAC
structure at 1.9 Å resolution (PDB code 1UT7), 147 residues for
the A and 150 for the B chain. The R-factor and R-free were
Molecular replacement statistics
Search model
Step
Z-score
Log-likelihood gain
Dimer 1
Rotation
Translation
Rigid Body refinement
Rotation
Translation
Rigid Body refinement
Rotation
Translation
Rigid Body refinement
4.8
6.1
N/A
4.4
3.3
N/A
5.8
8.8
N/A
46.7
42.4
64.3
105.2
137.7
265.4
339.6
271.4
333.2
Dimer 2
DNA
improved to 25.7 % and 34.2 % respectively. The model also has
improved geometry, with RMSD in bond length of 0.012 Å and
RMSD in bond angles of 1.820◦ and 90.4 % residues in favoured
regions of the Ramachandran according to Rampage [7]. The
model statistics are shown in Table 1 of the main text.
Crystal structure determination of NAC–DNA complexes
The structure of Native_complex was solved with MR. A trimmed
version (N-terminal removed) of the ANAC019 structure (PDB
code 1UT4) and a B-DNA model of oligo2 (3DNA software [8])
were used as consecutive search models. The MR solution is a
ternary complex consisting of two ANAC019 dimers (AB and
CD) bound to one DNA double helix constituted by chains EF.
The MR statistics can be found in Table S1.
Attempts were made to supplement the MR data with
experimental phase information from the heavy-atom derivatives.
This approach was hindered by poor diffracting quality and lack of
isomorphism of derivatives (e.g. see the cell dimensions in Table 1
of the main text), but finally a gold-derivative data set of similar
quality as the native was collected with the intention of using it in
MAD (multi-wavelength anomalous diffraction) or SAD (singlewavelength anomalous diffraction). The resulting experimental
maps were of very low quality not suited for building. An MR
solution could be obtained using a similar procedure and giving
similar results as for the native. This solution was combined
with the weak SAD information using MRSAD (molecular
Co-ordinates and structure factors have been deposited into PDB with accession codes 3SWP (Native_complex), 3SWM (Au_complex) and 4DUL
(Form IV).
1
To whom correspondence should be addressed (email [email protected]).
c The Authors Journal compilation c 2012 Biochemical Society
D. H. Welner and others
Figure S1
Validation of MR solution
(A) Crystal packing and DNA fibre formation. The overall packing of the crystals looks good, with
the DNA polymerizing as B-DNA fibers throughout the crystal. (B) Omit map at 3σ generated
by removing 5 nucleotides (grey) from the model. This does not allow for an identification of
the individual bases, but does confirm the position of the DNA. (C) Gold sites in structures
of ANAC019 and ANAC019 in complex with DNA. Chain A of NAC alone (PDB code 1UT7, blue)
and of the MR solution (grey) is superimposed. Due to the higher resolution of the NAC structure
(1.9 Å), the site can be modelled in more detail than in the complex structure. Consequently,
it is modelled as two sites with low occupancy (yellow spheres) and the co-ordinating Cys33
(sticks) is modelled in double conformation. The gold site found in the complex structure by
MRSAD is shown as an orange sphere.
replacement SAD) procedure number 1 in phenix.autosol v1.6289 [4], which uses the MR solution as a partial model and
writes out the best combined phases. It identified nine gold
sites, but phase combination did not improve the appearance of
electron density maps. The derivative was however fundamental
in confirming the MR solution and was refined in parallel, using
the native MR solution combined with the nine gold sites as a
starting model.
The MR model showed sensible packing, with the DNA
forming continuous fibres throughout the crystal (Figure S1A) and
interacting with the expected face of the NAC dimer, i.e. opposite
the dimer interface. Omit maps constructed from models lacking
a number of phosphates in the DNA backbone and/or a number of
nucleotides also confirmed the position of the DNA duplex (Figure
S1B). Gold binding in Au_complex is observed at the same sites
as in the gold derivative used in structure determination of the
protein alone [9] (Figure S1C) and difference density shows up at
these sites when the gold-derivative dataset is phased with a MR
model not containing gold atoms. The oligonucleotide sequence,
which could not be assigned based on the crystal structure alone,
was assigned to agree with the phosphate protection information
from uranyl photo-probing as described in the main text. Despite
the possibility of a register error at this resolution, inclusion of the
bases is important for the crystallographic refinement and SAXS
simulations.
Several refinement strategies were tested. Use of experimental
phase restraints and simulated annealing strategies did not
improve refinement. The final refinement strategy chosen was
as follows: (i) MR was followed by rigid body refinement with
each NAC monomer as a rigid body and the dsON (both strands)
as an additional rigid body. For the Native_complex the R- and Rfree values went from 52.0 % and 52.4 % to 46.1 % and 47.9 %
respectively, whereas for Au_complex they went from 51.4 %
and 51.7 % to 44.4 % and 43.0 % respectively; (ii) positional
c The Authors Journal compilation c 2012 Biochemical Society
Figure S2
DNase I footprinting of the ANAC019–DNA complex
Typical autoradiograph showing the DNase I digestion of an oligonucleotide containing the
palindromic NACBS [13] in the absence (control) and presence of 10, 20, 50, 100 and
200 ng of ANAC019 (lanes 1–6). The orientation of the DNA is indicated with 5 and 3 . S
indicates the Maxam–Gilbert A/G markers. The palindromic NACBS contains the 20 bp sequence
5 -ttgcgtgttggaacacgcaa-3 . However, due to the 3 -labelling (see Experimental section of main
text), the footprint presented displays the interaction on the lower strand in the 3 to 5 direction
(i.e. the complement strand). Brackets indicate the boundary of the palindromic NACBS and
the DNase I protected area respectively. Due to the physical size of the DNase I protein itself
(32 kDa), the actual ANAC019 footprint is overestimated by 5–10 bp in both the 5 - and
3 -end.
refinement was carried out with NCS restraints between AC
and BD NAC monomers, including TLS refinement with each
protein chain as a group and the dsON as an additional group.
In Au_complex, individual isotropic B-factors were refined for
the heavy atoms. In order to maintain reasonable Ramachandran
statistics, the highest resolution ANAC019 structure available
(PDB code 1UT7) was used as an additional restraint. For the
Native_complex the R- and R-free values went down to 25.9 %
and 34.9 % respectively, whereas for Au_complex they went
down to 25.3 % and 31.9 % respectively; and (iii) finally, the Nterminal residues 8–11 of chain A, which were missing in the MR
model, could be built. For the Native_complex the R- and R-free
values went down to 26.0 % and 34.9 % respectively, whereas for
Au_complex they went down to 25.0 % and 30.9 % respectively.
SAXS analysis on the free NAC domain
Solution scattering profiles for a concentration series from 1
to 8 mg/ml of the domain were measured in 20 mM Tris/HCl
(pH 7.5), at two camera lengths of 1 m and 4.25 m at
beamline 2.1, SRS (Daresbury, England). Concentration- and
time-dependent X-ray-induced aggregation affects the low-angle
DNA binding by NAC plant transcription factors
Figure S3
DNA-binding ability of mutant NAC domains
Binding of wild-type ANAC019 NAC domain to a 30 bp olugonucleotide containing 2 NACBS
[13] (as detailed in Experimental section of the main text) was compared with that of the R85A
and R88A mutants in an EMSA. The concentrations of protein used are specified above each
lane.
scattering region. The Rg could nevertheless be extrapolated
to 27.3 Å at infinite dilution. The data were processed with
XOTOKO (available from the SRS) and further analysed in the
ATSAS program package [10].
Ab initio modelling was carried out with GASBOR version 2.2i
(reciprocal space) with unknown expected particle anisometry,
with either no symmetry imposed or 2-fold symmetry imposed.
The models were then averaged with DAMAVER and filtered
using DAMFILT. Ab initio modelling showed a very similar shape
compared with the known X-ray structure (Figure 3 of the main
text). Imposing 2-fold symmetry in the ab initio modelling did
not result in a significant increase in the χ-value (average χvalue = 1.06 for nine models in P2, average χ-value = 1.03 for
seven models in P1). We conclude that within the resolution limits
of SAXS, the NAC dimer in solution is symmetrical.
The NAC dimer scattering profile is consistent with the X-ray
crystallographic structures previously determined for free NAC,
as shown from the CRYSOL fit in Figure 3(A) of the main text
and Table S2. However, a discrepancy is found at very low angles,
which is also reflected in the difference between experimental and
calculated Rg -value. A value much closer to the experimental Rg value could be obtained if one of the crystallographic models
for free NAC was completed by adding the loops missing in the
crystallographic model (from a database of loop structures found
in O [11] and by adding the missing parts of the termini in an
extended conformation; see Table S2); however, this resulted in
a worse χ-value. A further improvement could be obtained by
perturbing the termini with the elNémo server [12], where some
of the resulting models fit better both with respect to experimental
Rg - and χ-values (see example of fit in Figure 3A of the main text
and Table S2). Subsequently the SAXS data for the free NAC
domain were used to test complete models with different degrees
of dimer opening, as described in the main text.
Table S2 Experimental and theoretical R g -values derived from SAXS data and selected structural models derived from crystallographically determined
structures
For comparison, the theoretical R g -value calculated from the ternary complex in the crystal structure was 32.3 Å. For the free NAC sample, the experimental R g was calculated from extrapolation to
infinite dilution from a concentration series. χ-Values for the fit of theoretical scattering profiles calculated in CRYSOL [14] for a particular model and experimental data are also given.
Experimental sample
Experimental R g -value
from Guinier analysis (Å)
Fraction B1
Fraction B2
24.6 +
− 3.9
27.9 +
− 1.2
Fraction B3
Free NAC
29.5 +
− 1.8
27.3
Structural model
Theoretical R g -value
calculated from model (Å)
χ-Value
DNA (from Native_complex)
DNA (from Native_complex)
OLIGOMER model (82 % DNA and 18 % binary complex from Native_complex)
Binary complex (DNA and AB dimer from Native_complex)
Free NAC (PDB code 1UT7)
Full model including loops and termini missing in the crystal structure
Example of best perturbed full model, dimer angle 118◦ (open dimer)
As above, but dimer angle of 107◦ (semi-closed dimer)
As above, but dimer angle of 100◦ (closed dimer)
26.6
26.6
–
31.2
26.2
26.9
27.4
26.9
26.5
1.3
1.7
1.2
1.1
1.9
2.2
1.9
2.5
3.1
c The Authors Journal compilation c 2012 Biochemical Society
D. H. Welner and others
REFERENCES
1 Brünger, A. T., Adams, P. D., Clore, GM., DeLano, W. L., Gros, P., Grosse-Kunstleve,
R. W., Jiang, J.-S., Kuszewski, J., Nilges, N., Pannu, N. S. et al. (1998) Crystallography
and NMR system. Acta Crystallogr. Sect. D Biol. Crystallogr. D54, 905–921
2 Vagin, A. and Teplyakov, A. (1997) MOLREP: an automated program for molecular
replacement. J. Appl. Crystallogr. 30, 1022–1025
3 McCoy, A. J., Grosse-Kunstleve, R. W., Adams, P. D., Winn, M. D., Storoni, L. C. and
Read, R. J. (2007) Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674
4 Adams, P. D., Grosse-Kunstleve, R. W., Hung, L.-W., Ioerger, T. R., McCoy, A. J., Moriarty,
N. W., Read, R. J., Sacchettini, J. C., Sauter, N. K. and Terwilliger, T. C. (2002) PHENIX:
building new software for automated crystallographic structure determination. Acta
Crystallogr. Sect. D Biol. Crystallogr. D58, 1948–1954
5 Emsley, P., Lohkamp, B., Scott, W. and Cowtan, K. (2010) Features and development of
Coot. Acta Crystallogr. Sect. D Biol. Crystallogr. D66, 486–501
6 Murshudov, G. N., Skubák, P., Lebedev, A. A., Pannu, N. S., Steiner, R. A., Nicholls, R. A.,
Winn, M. D., Long, F. and Vagin, A. A. (2011) REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr. Sect. D Biol. Crystallogr. D67, 355–367
7 Lovell, S. C., Davis, I. W., Arendall, III, W. B., de Bakker, P. I. W., Word, J. M., Prisant,
M. G., Richardson, J. S. and Richardson, D. C. (2003) Structure validation by Cα
geometry: phi, psi and Cβ deviation. Proteins: Struct. Funct. Genet. 50, 437–450
Received 14 October 2011/27 March 2012; accepted 29 March 2012
Published as BJ Immediate Publication 29 March 2012, doi:10.1042/BJ20111742
c The Authors Journal compilation c 2012 Biochemical Society
8 Lu, X.-L. and Olson, W. K. (2003) 3DNA: a software package for the analysis, rebuilding
and visualization of three-dimensional nucleic acid structures. Nucleic Acids Res. 31,
5108–5121
9 Ernst, H. A., Olsen, A. N., Skriver, K., Larsen, S. and Lo Leggio, L. (2004) Structure of the
conserved domain of ANAC, a member of the NAC family of transcription factors. EMBO
Rep. 5, 297–303
10 Konarev, P. V., Petoukhov, M. V., Volkov, V. V. and Svergun, D. I. (2006) ATSAS 2.1, a
program package for small-angle scattering data analysis. J. Appl. Crystallogr. 39,
277–286
11 Jones, T. A., Zou, J. Y., Cowan, S. W. and Kjeldgaard, M. (1991) Improved methods for
building protein models in electron density maps and the location of errors in these
models. Acta Crystallogr. Sect. A Found. Crystallogr. A47, 110–119
12 Suhre, K. and Sanejouand, Y. H. (2004) ElNemo: a normal mode web-server for protein
movement analysis and the generation of templates for molecular replacement. Nucleic
Acids Res. 32, W610–W614
13 Olsen, A. N., Ernst, H. A., Lo Leggio, L. and Skriver, K. (2005) DNA-binding specificity
and molecular functions of NAC transcription factors. Plant Sci. 169,
785–797
14 Svergun, D., Barberato, C. and Koch, M. H. J. (1995) CRYSOL: a program to evaluate
X-ray solution scattering of biological macromolecules from atomic coordinates. J. Appl.
Crystallogr. 28, 768–773