A highly stable protein chimera built from fragments of different folds

Protein Engineering, Design & Selection vol. 25 no. 11 pp. 699–703, 2012
Published online October 18, 2012 doi:10.1093/protein/gzs074
A highly stable protein chimera built from fragments
of different folds
Sooruban Shanmugaratnam†, Simone Eisenbeis† and
Birte Höcker1
Max Planck Institute for Developmental Biology, Spemannstr. 35, 72076
Tübingen, Germany
1
To whom correspondence should be addressed.
E-mail: [email protected]
Received June 29, 2012; revised August 24, 2012;
accepted September 10, 2012
Edited by Vladimir Tishkov
Proteins increased in complexity during the course of
evolution. Domains as well as subdomain-sized fragments
were recruited and adapted to form new proteins and
novel folds. This concept can be used in engineering to
construct new proteins. We previously reported the combination of fragments from two ancient protein folds, a
flavodoxin-like and a (ba)8-barrel protein. Here we
report two further attempts at engineering a chimeric
protein from fragments of these folds. While one of the
constructs showed a high tendency to aggregate, the other
turned out to be a highly stable, well-structured protein.
In terms of stability against heat and chemical denaturation this chimera, named NarLHisF, is superior to the
earlier presented CheYHisF. This is the second instance
of a chimera build from two different protein folds,
which demonstrates how easily recombination can lead to
the development and diversification of new proteins – a
mechanism that most likely occurred frequently in the
course of evolution. Based on the results of the failed and
the successful chimera, we discuss important considerations for a general design strategy for fold chimeras.
Keywords: (ba)8-barrel/chimera/flavodoxin-like/hybrid
proteins/protein design
Introduction
Protein chimeras are hybrids formed by substitution of fragments between two parent proteins, ranging from short peptides to entire domains. Hybrid proteins occur in nature due
to dramatic gene reorganization events such as horizontal
gene transfer, gene duplication and fusion (Huynen and
Bork, 1998, Lynch and Conery, 2000). These recombination
events have led to major advances in the natural evolution of
organisms as well as protein architecture and their function
(Grishin, 2001, Söding and Lupas, 2003). Many methods in
protein engineering successfully imitate evolutionary
mechanisms to create proteins with new and useful properties
(Eisenbeis and Höcker, 2010). Similarly, possible evolutionary scenarios that might have led to a stable protein fold can
be explored.
†
Thease authors contributed equally to this work.
We have used the concept of illegitimate recombination
between non-homologous or very divergent sequences to
sample possible diversification of the (ba)8-barrel fold. The
(ba)8-barrel is a ubiquitous fold, commonly found among
enzymes (Sterner and Höcker, 2005). It consists of a closed
eight-stranded parallel b-sheet that is surrounded by eight
a-helices. Based on the remarkable 2-fold symmetry in two
enzymes of the histidine biosynthesis pathway, it was suggested that the (ba)8-barrel fold evolved through duplication
and fusion of an ancestral half-barrel (Lang et al., 2000).
This scenario has been tested in a number of engineering
experiments with the imidazole glycerol phosphate synthase
(HisF) from Thermotoga maritima (Höcker et al., 2001,
2004, 2009; Seitz et al., 2007; Fortenberry et al., 2011).
Diversification of (ba)8-barrel enzymes was further elucidated by the recombination of half-barrel-sized fragments
from closely related (ba)8-barrel proteins (Höcker et al.,
2004; Claren et al., 2009).
Prompted by the question where half-barrels could have
originated from, we searched structural databases and found
a striking similarity to a number of proteins with the
flavodoxin-like fold, a doubly wound (ba)5 protein with a
central five-stranded parallel b-sheet. Among the best hits
the following proteins were found: the nitrite response regulator NarL from Escherichia coli, the chemotaxis response
regulator CheY from T.maritima, and the flavodoxin-like
domain of the methylmalonyl CoA mutase (MMCoA) from
Propionibacterium shermanii (Höcker et al., 2002). To test
whether these structurally similar parts could substitute for
each other, we created a fold hybrid using parts of CheY,
which is a well-characterized single-domain protein, to substitute for structurally similar parts in the (ba)8-barrel
protein HisF from T.maritima (Bharat et al., 2008). The
resulting protein, which is named CheYHisF, was a stable
monomer that unfolds cooperatively. When analyzed by circular dichroism (CD) and fluorescence spectroscopy, it
appeared similar to the wild-type HisF protein. Solution of
the crystal structure, however, revealed unexpected details
for the interactions at the new interface. While the fragments
used for the construction of CheYHisF for the most part
retained their structure, we were surprised to find an additional ninth b-strand in the core of the barrel. This strand
invades between b1 and b2 of the CheY part and consists of
residues from the C-terminus of HisF and from the histidinetag included for purification purposes (Bharat et al., 2008).
The importance of the ninth strand was tested through
removal of the strand-forming residues, leading to the construct CheYHisF-sfr. This construct had a high tendency to
aggregate, thus illustrating the importance of this unexpected
structural element for the integrity of the fold. It was possible
to stabilize the CheYHisF-sfr chimera again through the
introduction of five targeted mutations at the fragments’
interface identified by computational design (Eisenbeis et al.,
2012). The design, named CheYHisF-sfr_RM, was targeted
to form a proper eight-stranded ba-barrel, which was
# The Author 2012. Published by Oxford University Press. All rights reserved.
For Permissions, please e-mail: [email protected]
699
S.Shanmugaratnam et al.
confirmed experimentally by nuclear magnetic resonance
spectroscopy. CheYHisF-sfr_RM was not only a very stable,
well-folded protein, but also inherited the ability to bind
phosphorylated compounds from its parent HisF.
This suite of experiments demonstrated the potential of
fragment recombination for the rational design of new and
functional proteins. After this proof of concept we wanted to
test the generalizability of our chimeragenesis approach and
constructed two more hybrids by the combination of parts
from the flavodoxin-like proteins NarL and MMCoA with
the (ba)8-barrel HisF. The fragments of CheY, NarL and
MMCoA that were introduced into the HisF scaffold are
quite different, they share only sequence identities between
16 and 28%. While one of the hybrids did not fold properly,
the other one even surpasses our earlier chimera in terms of
stability. This second successfully created fold chimera illustrates how quickly proteins can diversify via non-homologous
recombination. The successful as well as the failed attempt
also give us hints for a general design strategy of fold
chimeras.
Results
The parts of the narL gene from E.coli and mmCoA gene
from P.shermanii were combined with a part of the hisF
gene from T.maritima. The combination was done analogous
to the earlier reported construction of the cheYhisF gene
(Bharat et al., 2008). Sequences encoding b-strand 1 and the
range from a-helix 2 to b-strand 5 in narL (residues 1 – 15
and 39 – 108) and mmCoA (residues 598– 609 and 633 – 706)
were cloned upstream of the sequence of the hisF-gene encoding a-helix 4 to a-helix 8 (residues 103 – 253), yielding
the chimeric genes narLhisF and mmCoAhisF (Fig. 1). The
fragments of NarL and MMCoA were selected to substitute
for the N-terminal part of HisF due to their remarkable structural similarity (Höcker et al., 2002). This similarity had
been identified in a search of the Protein Data Bank for
structures resembling the N- and C-terminal halves of HisF
(HisF-N and HisF-C) using the program DALI (Holm and
Sander, 1993). NarL had been the first non-(ba)8-barrel hit
for HisF-N immediately followed by MMCoA, while a
Materials and methods
Cloning of NarLHisF and MMCoAHisF
The narL gene was amplified from E.coli genomic DNA and
the flavodoxin-domain from MMCoA from P.shermanii was
constructed by polymerase chain reaction (PCR) using oligonucleotides. The resulting genes narLhisF and mmCoAhisF
were cloned into pET21a yielding the constructs pET21anarLhisF and pET21a-mmCoAhisF. The constructs were
sequenced to exclude any inadvertent PCR mutations.
Heterologous expression and purification of proteins
The NarLHisF and MMCoAHisF proteins, which both carry
a His6-tag at their C-termini, were produced in E.coli
BL21(gold). NarLHisF was mainly found in the soluble fraction of the cell extract and purified from there, whereas
MMCoAHisF was found in the insoluble fraction from
which it was solubilized and refolded. The expression, purification and refolding were performed as described for
CheYHisF (Bharat et al., 2008).
Analytical methods
Analytical gel filtration was performed using a calibrated
Superdex 75 column (Amersham Pharmacia). Highly concentrated protein was eluted at a flow rate of 0.5 ml min21 in
50 mM Tris and 300 mM KCl, pH 7.5. CD spectra were
recorded with a JASCO model J-810 spectropolarimeter.
Fluorescence measurements were carried out with a JASCO
FP-6500 spectrofluorometer. Temperature-induced unfolding
was analyzed by following the far-ultraviolet (UV) CD signal
at 222 nm at slowly increasing temperatures (1 K min21).
Protein unfolding induced by guanidinium chloride was followed by the decrease in fluorescence signal at 320 nm after
excitation at 280 nm. The proteins were incubated with different concentrations of denaturant, and the signals were
recorded after different time intervals until no further change
was observed.
700
Fig. 1. Construction of the fold chimeras. (a) Comparison of the topologies
of HisF (in blue), and the flavodoxin-like domains of MMCoA and NarL (in
green), as well as the chimeric constructs. (b) MMCoAHisF and NarLHisF
are constructed from fragments originating from MMCoA or NarL (residue
598–609 or 1 –15 including b1 and 633– 706 or 39– 108 containing a2-b5,
respectively) and from HisF (residue 103– 253 including a4-a8). (c) Model
of the NarLHisF structure generated by homology modeling with the
program Modeller (Sali and Blundell, 1993) using the crystal structure of
CheYHisF (PDB-id 3cwo) as a template. The parts originating from NarL
and HisF are shown in green and blue, respectively, and the residues after
Val230 that form the ninth strand are shown in orange.
Protein chimera design
search based on HisF-C returned MMCoA as the first hit
with a flavodoxin-like domain. The high similarity of the
structures covers more than three of the ba-modules while
the positioning of the other secondary structural elements
deviates drastically. We chose to swap the largest fragment
possible and thus set the fragment boundaries to where the
structures start to deviate from one another. After heterologous expression of the His6-tagged chimeras in E.coli, most
protein of NarLHisF was found in the soluble fraction of the
cell extract while all MMCoAHisF was found in inclusion
bodies. Thus, NarLHisF was purified from the soluble fraction
via metal affinity and subsequent gel filtration chromatography, while MMCoAHisF was refolded after solubilization
of the inclusion bodies in guanidinium chloride.
To determine the oligomerization state of the chimeras in
solution, analytical gel filtration experiments were conducted.
NarLHisF eluted as a sharp peak with an apparent molecular
mass of 36.7 kDa, which is slightly higher than the calculated molecular mass for the monomeric protein (26.7 kDa)
(Fig. 2a). We observed a similar discrepancy between the apparent and calculated molecular mass of the nine-stranded
barrel CheYHisF. This migration behavior most likely originates from the lesser compactness of the protein and the
wider radius of the barrel. The protein was applied to the
column highly concentrated. Next to the main peak a small
shoulder is visible, which is either a small contamination or
a degradation product. MMCoAHisF on the other hand
eluted in the void volume indicating that the protein forms
soluble aggregates. Consequently, MMCoAHisF was abandoned and only NarLHisF analyzed further.
The secondary structure content of NarLHisF was measured by far-UV CD. The spectrum is indicative of significant a-helical content and compares well to the spectra of
HisF and CheYHisF (Fig. 2b). Tertiary structure was investigated by using fluorescence and near-UV CD spectroscopy.
NarLHisF, CheYHisF and HisF contain the same single tryptophan residue in a-helix 5 providing a convenient way to
compare their tertiary structures. The fluorescence spectra are
Fig. 2. Characterization of the fold chimera NarLHisF. (a) Association state measured by analytical gel filtration. The applied protein was 10 mg ml21. (b)
Secondary structure evaluation by far-UV CD spectroscopy. Protein concentrations were 0.2 mg ml21 (d ¼ 0.1 cm). (c) Tertiary structure evaluation by
fluorescence spectroscopy. Protein concentrations were 0.2 mg ml21. (d) Tertiary structure evaluation by near-UV CD spectroscopy. Protein concentrations
were 0.65 mg ml21 for NarLHisF, 3.4 mg ml21 for HisF and 3.6 mg ml21 CheYHisF (d ¼ 1 cm). (e) Stability measured by thermal denaturation. Proteins
(0.2 mg ml21) were incubated at increasing temperatures while monitored at 222 nm in a 0.1 cm cuvette. All measurements were recorded in 10 mM Tris ( pH
7.5) at 258C.
701
S.Shanmugaratnam et al.
very similar, the emission maxima of NarLHisF, CheYHisF
and HisF occur at 321, 324 and 323 nm, respectively
(Fig. 2c). The near-UV CD spectra of all the proteins are
very distinct and extremely similar (Fig. 2d). These results
indicate that the indole chromophore of the tryptophan is in
a comparable asymmetric environment, and is similarly
shielded in all three proteins.
Stability of NarLHisF was investigated by reversible
unfolding with heat as well as guanidinium chloride. A cooperative unfolding behavior was observed during thermal
denaturation followed by CD at 222 nm (Fig. 2e). NarLHisF
exhibits a remarkable resistance to increasing temperatures
with a thermal melting point of 748C. Denaturation of
NarLHisF with guanidinium chloride also showed highly cooperative unfolding (Fig. 3). The data was analyzed on the
basis of the two-state model in comparison to the unfolding
of HisF (Carstensen et al., 2012) and the CheYHisF chimeras
(Bharat et al., 2008; Eisenbeis et al., 2012). NarLHisF
unfolds with a high m-value (m ¼ 17.0 kJ mol21 M21)
leading to a remarkable DGH2 O of 40.0 kJ mol21 (Table I).
Taken together the biophysical data show that the
NarLHisF chimera has adopted a native fold with a remarkable stability. The spectroscopic features of NarLHisF strongly resemble the ones of CheYHisF, the chimera that we
Fig. 3. Stability of the fold chimera NarLHisF measured by GdmCl-induced
denaturation. The measurement is shown in comparison to CheYHisF
(Bharat et al., 2008) and CheYHisF-sfr_RM (Eisenbeis et al., 2012).
Proteins at concentration of 0.1 mg ml21 were incubated with the given
concentrations of GdmCl in 10 mM Tris ( pH 7.5) at room temperature. The
loss of tertiary structure was followed by recording the decrease of the
fluorescence emission at 320 nm after excitation at 280 nm. The lines were
drawn as a visual aid.
Table I. Thermodynamic parameters for GdmCl-induced unfolding of
NarLHisF compared to HisF and computationally optimized chimera
CheYHisF-sfr_RM. The data were analyzed on the basis of the two-state
model
protein
DG (kJ mol21)
m (kJ mol21 M21)
[D]1/2 (M)
NarLHisF
HisFa
CheYHisF-sfr_RMb
40.0 + 1.9
55.6 + 2.3
18.0 + 2.5
17.0 + 0.9
19.9 + 0.9
7.5 + 0.8
2.4
2.8
2.4
The parameters were calculated from the data shown in Fig. 3.
a
Carstensen et al. (2012).
b
Eisenbeis et al. (2012).
702
reported earlier. Solution of its crystal structure revealed that
this chimera had assembled the original fragments quite
well, but had formed an extra b-strand with the C-terminal
residues including the tag used for purification. This led to
the formation of a nine-stranded barrel, which had not been
observed before. The same residues that form the unexpected
ninth b-strand are also present in NarLHisF. Crystallization
trials of NarLHisF unfortunately did not yield any diffracting
crystals. To analyze the contribution of the C-terminal residues to the integrity of the fold further, we prepared a shortened construct named NarLHisF-sfr, which ended after
valine 230, thereby eliminating possibly strand forming residues. This variant was found in the insoluble fraction of the
cell extract, but could be purified via refolding. Analytical
gel filtration showed that NarLHisF-sfr has a strong tendency
to aggregate indicating that the removed residues are
involved in stabilizing the fold and might form similar interactions in NarLHisF as they form in CheYHisF.
Discussion
We created two new protein chimeras, MMCoAHisF and
NarLHisF, in which parts of the (ba)8-barrel protein HisF
are replaced by fragments from the flavodoxin-like domains
of MMCoA and NarL, respectively (Fig. 1). MMCoA is an
enzyme that induces the formation of an adenosyl radical
from its cofacter, the coenzyme B12, which initiates a free
radical rearrangement of the substrate succinyl-CoA to
methylmalonyl-CoA (Mancia et al., 1996). NarL is a transcription factor that regulates genes involved in nitrite and
nitrate metabolism (Stock et al., 2000). Both proteins consist
of two domains. MMCoA comprises an N-terminal (ba)8barrel and a C-terminal flavodoxin-like domain, that share a
large interface. The two domains are connected by a long
linker of around 30 residues, which wraps around the
flavodoxin-like domain. The transplantation of a fragment
from this embedded domain probably led to the exposure of
hydrophobic areas that in the wild type enzyme are not solvent
accessible. Most likely this problem contributed significantly
to the misfolding and aggregation of the MMCoAHisF
chimera.
The response regulator NarL is composed of an
N-terminal flavodoxin-like domain (receiver) and a
C-terminal DNA-binding domain (effector). The two
domains are connected by a short linker and share a relatively small interface. The receiver domain belongs to the same
superfamily as the single-domain protein CheY, which was
used in our previously designed chimera CheYHisF (Bharat
et al., 2008). Biophysical characterization of the newly constructed chimera NarLHisF shows that this protein has
adopted a native fold strongly resembling CheYHisF. Even
the C-terminal residues that form the ninth strand in
CheYHisF are equally important for the integrity of the fold
suggesting that NarLHisF might also be a nine-stranded
ba-barrel (Fig. 1c). Surprisingly this protein even surpasses
CheYHisF in terms of stability. NarLHisF has a remarkable
resistance to increasing temperature even though only the HisF
part originates from a thermophile while the NarL part comes
from the mesophilic E.coli. Its thermal melting point of 748C
is 98C higher than the one of CheYHisF. Furthermore, assessment of the stability by chemical denaturation shows that
NarLHisF unfolds with higher cooperativity and possesses a
Protein chimera design
higher stability than CheYHisF and even than the computationally optimized chimera CheYHisF-sfr_RM (Eisenbeis
et al., 2012). Its thermodynamic parameters are in a range with
other natural proteins (Table I), and even compare well with
the extremely thermostable (ba)8-barrel HisF (Carstensen
et al., 2012).
Altogether, the results demonstrate how quickly new proteins can develop and diversify through recombination. With
this second fold chimera build from fragments of a (ba)8barrel and a flavodoxin-like protein we show that our approach is generalizable. By recombining structurally similar
fragments we can build new hybrid proteins with welldefined tertiary structure that are as stable as natural proteins
even from thermophilic organisms. These chimeras provide
suitable scaffolds for the establishment of catalytic activity.
Funding
This work was supported by Deutsche Forschungsgemeinschaft
(Grant HO 3022/1-2).
References
Bharat,T.A., Eisenbeis,S., Zeth,K. and Höcker,B. (2008) Proc. Natl Acad.
Sci. USA, 105, 9942– 9947.
Carstensen,L., Zoldak,G., Schmid,F.X. and Sterner,R. (2012) Biochemistry
Claren,J., Malisi,C., Höcker,B. and Sterner,R. (2009) Proc. Natl Acad. Sci.
USA, 106, 3704–3709.
Eisenbeis,S. and Höcker,B. (2010) J. Pept. Sci., 16, 538– 544.
Eisenbeis,S., Proffitt,W., Coles,M., Truffault,V., Shanmugaratnam,S.,
Meiler,J. and Höcker,B. (2012) J. Am. Chem. Soc., 134, 4019– 4022.
Fortenberry,C., Bowman,E.A., Proffitt,W., Dorr,B., Combs,S., Harp,J.,
Mizoue,L. and Meiler,J. (2011) J. Am. Chem. Soc., 133, 18026– 18029.
Grishin,N.V. (2001) J. Struct. Biol., 134, 167–185.
Höcker,B., Beismann-Driemeyer,S., Hettwer,S., Lustig,A. and Sterner,R.
(2001) Nat. Struct. Biol., 8, 32–36.
Höcker,B., Schmidt,S. and Sterner,R. (2002) FEBS Lett., 510, 133–135.
Höcker,B., Claren,J. and Sterner,R. (2004) Proc. Natl Acad. Sci. USA, 101,
16448– 16453.
Höcker,B., Lochner,A., Seitz,T., Claren,J. and Sterner,R. (2009)
Biochemistry, 48, 1145– 1147.
Holm,L. and Sander,C. (1993) J. Mol. Biol., 233, 123–138.
Huynen,M.A. and Bork,P. (1998) Proc. Natl Acad. Sci. USA, 95,
5849–5856.
Lang,D., Thoma,R., Henn-Sax,M., Sterner,R. and Wilmanns,M. (2000)
Science, 289, 1546– 1550.
Lynch,M. and Conery,J.S. (2000) Science, 290, 1151–1155.
Mancia,F., Keep,N.H., Nakagawa,A., Leadlay,P.F., McSweeney,S.,
Rasmussen,B., Bosecke,P., Diat,O. and Evans,P.R. (1996) Structure, 4,
339–350.
Sali,A. and Blundell,T.L. (1993) J. Mol. Biol., 234, 779 –815.
Seitz,T., Bocola,M., Claren,J. and Sterner,R. (2007) J. Mol. Biol., 372,
114–129.
Söding,J. and Lupas,A.N. (2003) Bioessays, 25, 837–846.
Sterner,R. and Höcker,B. (2005) Chem. Rev., 105, 4038–4055.
Stock,A.M., Robinson,V.L. and Goudreau,P.N. (2000) Annu. Rev. Biochem.,
69, 183–215.
703