doi:10.1016/j.jmb.2005.05.070 J. Mol. Biol. (2005) 351, 672–682 The Folding Mechanism of a Dimeric b-Barrel Domain Gonzalo de Prat-Gay1*, Alejandro D. Nadra1 Fernando J. Corrales-Izquierdo2, Leonardo G. Alonso1 Diego U. Ferreiro1 and Yu-Keung Mok2 1 Instituto Leloir, CONICET and Facultad de Ciencias Exactas y Naturales Universidad de Buenos Aires Patricias Argentinas 435 (1405) Buenos Aires, Argentina 2 Department of Chemistry Cambridge University, Lensfield Road, Cambridge CB2 1EW UK The dimeric b-barrel domain is an unusual topology, shared only by two viral origin binding proteins, where secondary, tertiary and quaternary structure are coupled, and where the dimerization interface is composed of two four-stranded half-b-barrels. The folding of the DNA binding domain of the E2 transcriptional regulator from human papillomavirus, strain-16, takes place through a stable and compact monomeric intermediate, with 31% the stability of the folded dimeric domain. Double jump multiple wavelength experiments allowed the reconstruction of the fluorescence spectrum of the monomeric intermediate at 100 milliseconds, indicating that tryptophan residues, otherwise buried in the folded state, are accessible to the solvent. Burial of surface area as well as differential behavior to ionic strength and pH with respect to the native ground state, plus the impossibility of having over 2500 Å2 of surface area of the halfbarrel exposed to the solvent, indicates that the formation of a non-native compact tertiary structure precedes the assembly of native quaternary structure. The monomeric intermediate can dimerize, albeit with a weaker affinity (w1 mM), to yield a non-native dimeric intermediate, which rearranges to the native dimer through a parallel folding channel, with a unimolecular rate-limiting step. Folding pathways from either acid or urea unfolded states are identical, making the folding model robust. Unfolding takes place through a major phase accounting for apparently all the secondary structure change, with identical rate constant to that of the fluorescence unfolding experiment. In contrast to the folding direction, no unfolding intermediate was found. q 2005 Elsevier Ltd. All rights reserved. *Corresponding author Keywords: folding; E2; DNA-binding; b-barrel; papillomavirus Introduction Protein folding is an intricate biological and physicochemical problem because of all the theorPresent addresses: Y.- K. Mok, Department of Biological Sciences, National University of Singapore, Singapore 117543; F. J. Corrales-Izquierdo, Laboratorio de Proteomica, Genomica y Bioinformatica, and Division de Hepatologia y Terapia Genica, Facultad de Medicina, Universidad de Navarra, 31008 Pamplona, Spain; D. U. Ferreiro, Centre for Theoretical Biological Physics and Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Dr, 92093-0378 La Jolla, CA, USA. Abbreviation used: TSE, transition state ensemble. E-mail address of the corresponding author: [email protected] etically possible conformations that a protein molecule can attain, the multiple ways of doing it, and the chemical composition of the milieu where the reaction under study takes place, whether in vivo or in vitro. Studies on protein folding concentrate to a larger extent on monomeric proteins undergoing unimolecular reactions aiming at simplified models and the ability to carry out molecular simulations of the folding process. A number of studies on oligomeric proteins have also been described, but are mostly on large or multidomain proteins that complicate the dissection of the reaction at the molecular level (see Jaenicke & Lilie1 ). Often, the folding of the monomer is followed by association into oligomers, with some structural adjustments but following 0022-2836/$ - see front matter q 2005 Elsevier Ltd. All rights reserved. Folding of a Dimeric b-Barrel essentially native or hierarchical pathways, or can take place through multi-channel processes.2 At the interface between small monomers and large oligomers, we find dimeric proteins that are small enough to address a molecular dissection of the process and eventually computational studies, such as the domain swapping cell-cycle regulatory protein p13suc1.3 Some of these dimers have the particularity that their structure is highly intertwined, i.e. the folding of each monomer depends on the formation of the interface of the dimer, which gives a characteristic highly concerted folding and association reaction and coupling between secondary, tertiary and quaternary structure.4,5 Good models of this are Trp and arc repressors6,7 or the leucine zipper peptide of the GCN4 activator.8 Since they are mostly DNA binding proteins or domains and bind to palindromic sequences, a link between their folding mechanism and their function, while unclear, is expected. The C-terminal domain of the papillomavirus E2 protein was shown to constitute a new family of dimeric folding topology, as well as a newly 673 described DNA binding motif from the crystal structure of the bovine E2–DNA complex.9 This domain is comprised of four antiparallel b-strands from each monomer forming a dimeric b-barrel at the interface of the dimer. Two a-helices pack onto the outside of the barrel, and the major helix participates in DNA binding (Figure 1(a) and (b)). The solution structure of the homologous domain from human papillomavirus strains showed an identical folding topology,10 and so far this topology is only shared by the Epstein–Barr EBNA1 origin binding protein.11 We have recently determined the solution structure of the HPV-16 E2C12 (Figure 1). At the equilibrium, the HPV-16 E2 domain unfolds and dissociates in a highly concerted manner by urea with no stable intermediates accumulated.13 On the other hand, its kinetic folding pathway shows a monomeric intermediate formed in less than 100 ms, with large fluorescence and ellipticity changes, increased ANS binding properties and burial of surface area.14 Because of the particular folding topology, we proposed that the intermediate could not be fully native. Further- Figure 1. Folding topology of the HPV16 DNA binding domain of the E2 transcriptional regulator. (a) Backbone superimposition of the 20 lowest-energy structures calculated for HPV-16 E2C dimer. (b) Strand view of average HPV-16 E2C with Trp residues highlighted. Figures were produced with SwissPdbViewer.31 674 more, the urea denatured state of E2C was shown to contain clusters of residual non-native structures.15 Here, we present a characterization of the folding and unfolding mechanism of this domain with a unique topology, and we fully characterize the monomeric intermediate, including its unfolding cooperativity. We reconstructed its fluorescence spectrum at 100 milliseconds and, using doublejump experiments, we measured its conformational stability. We show that the folding pathways from urea and acid unfolded states are equivalent. We characterize the unfolding reaction and found that it does not involve a detectable intermediate. Independent of whether it is started from acid or urea unfolded E2C, the folding pathway proceeds through a monomeric intermediate with non-native compact tertiary structure preceding the formation of native quaternary structure. Results Effect of ionic strength, pH, and temperature on the folding rates of E2C Refolding of acid unfolded E2C was shown to proceed through two major phases: (i) the formation of a monomeric intermediate with a unimolecular rate constant of 32 sK1 (k1), and (ii) a subsequent change with a second-order rate constant of 1.6!105 sK1 MK1 (k 2) involving both association and folding events, and a most likely first-order phase of w0.5 sK1 (k3), taking place in parallel with the association/folding event.14 To further characterize the nature of the interactions involved in the refolding phases of the E2C dimeric domain, we analyzed the effect of ionic strength. Figure 2(a) shows the effect of sodium chloride on the rate of formation of the monomeric intermediate, where a large and linear increment is Folding of a Dimeric b-Barrel observed, increasing up to sixfold from 0 M to 1.0 M salt. The slow phases were also affected but in a rather different, biphasic, manner; from 0 to 0.1 M salt both the rates were doubled (Figure 2(a)), and the major phase (k2) increased linearly while the minor slower phase (k3) remained invariant from 0.2 M to 1.0 M salt. A rate increase could be explained by screening of unfavorable ionic interactions in the transition state of this intermediate or, alternatively, the increase in ionic strength in the milieu might strengthen hydrophobic interactions, or both effects added together.16 In addition, preferential hydration effects could account at least in part for this effect. The rate of the folding intermediate (k1) is affected by pH, showing an apparent transition midpoint at around pH 6.5 (Figure 2(b)), where the rate is doubled along the pH transition. The rates of the slower phases (k2 and k3), however, increase by 120 and 180-fold, respectively (Figure 2(b)). As observed for the stability of the E2C dimer in equilibrium unfolding experiments,13 the pH transition appears to involve one or more histidine groups. The pH midpoint of the slow phases is lower than 5.8, a significant difference with the intermediate (Figure 2(b), inset). The monomeric intermediate was shown to form through a transition state ensemble (TSE) with significant change in exposure of residues.14 We carried out a pH jump refolding experiment at different temperatures to investigate the thermodynamic properties of the TSE for the slow phases. The observed rates increase with temperature, and we can analyze the data as previously described.14,17 Both slow phases show a significant curvature, indicative of a measurable DCp, explained by change in solvent exposure (Figure 2(c), legend). Since it is difficult to estimate the value of the pre-exponential term of the modified Eyring equation, the absolute values of Figure 2. Effect of solvent variables on the folding kinetics of E2C. (a) Refolding of acid denatured E2C at different salt concentrations. Final concentration of E2C was 2 mM, in 50 mM Mes buffer (pH 6.1), and refolding by dilution 1:1, under the conditions indicated in Materials and Methods. (b) Effect of pH on refolding of acid denatured E2C. The buffers used were Mes and Mops at the indicated pH. (c) Temperature dependence of the slow folding phases. The experimental conditions are those described for (a), and data were fitted as described.10 The absolute values of DH and DS are not reliable, since the pre-exponential of the non-linear Eyring equation term is not known. Folding of a Dimeric b-Barrel activation entropies and enthalpies are uncertain, but both phases display a similar free energy of activation. The temperature does not affect their relative amplitudes (not shown). Values for DCp were K567(G199) cal molK1 degK1 for k2 and for k3. The K520(G89) cal molK1 degK1 value previously obtained for k 1 was K385(G31) cal molK1 degK1, corresponding to the burial of surface in the monomeric intermediate.10 The DCp values for the parallel folding channels are identical, since they involve a similar amount of rearrangement from their respective intermediates to the final folding state. Similar folding pathways from urea and acid denatured E2C We investigated the folding pathway starting from the urea denatured domain, by monitoring tryptophan fluorescence upon dilution of E2C unfolded in 4.0 M urea into folding buffer. The refolding reaction yielded a monomeric intermediate and slow phases with similar behavior to the acid unfolded reaction. The rate of formation of the monomeric intermediate decreased with the increase in urea concentration on the refolding buffer, as Figure 3(a) shows. Extrapolation of the kobs to zero denaturant yields a value of 31 sK1, in excellent agreement with pH jump refolding.14 The slope of this plot, the m ‡ value, is 0.59 kcal molK1 MK1. The amplitude of the intermediate increases linearly over the protein concentration range and extrapolates to zero, indicating that a parallel reaction for the formation of the intermediate, if present, is minimal (Figure 3(a), inset). The slow refolding association phases show a fluorescence decrease similar to the pH jump refolding experiments, with a second-order rate constant of 1.5!105 sK1 MK1 (k2) in excellent 675 agreement with the folding pathway from acid unfolded E2C14 (Figure 3(b)). The observed rate of this folding/association phase is increased with E2C concentration displaying a slope of 2.4!105 sK1 MK1 corresponding to (k2), thus validating the secondorder approximation (Figure 3(c) and (b)). The slowest phase yields a first-order rate of w0.2 sK1 (k3) in excellent agreement with the equivalent process from acid unfolded E2C.14 The relative amplitudes undergo an inversion as the protein concentration increases, stabilizing at 50% each approximately (not shown), which agrees with the fact that k2 is bimolecular and k3 unimolecular. As previously noted, there is a deviation of the observed rate from linearity at high concentration of E2C, where a first-order rate becomes limiting, providing a rough estimate of w1–2 mM KD for the monomer– monomer interaction.13,14 The effect of urea on the rates of the slow association/folding phases was investigated; both showed a decrease as the denaturant was increased, with observed rates of 0.41 sK1 and 0.05 sK1, for k2 and k3, respectively at zero denaturant (Figure 3(c)). The sensitivity to urea is very similar in both phases (identical m‡ values of 1.0 kcal molK1 MK1), slightly more pronounced than in the intermediate. The monomeric intermediate unfolds cooperatively and has a non-native fluorescence spectrum The monomeric intermediate showed an increase in fluorescence, with a subsequent decrease as the reaction proceeds to the folded state as the slow association/folding takes place, in both acid or urea unfolded E2C (Figure 4(a)14). We carried out kinetic refolding experiments at different wavelengths from 300 nm to 400 nm, starting the reaction from the acid denatured E2C domain in order to minimize possible effects of residual denaturant. Figure 3. Folding from urea versus acid denatured E2C. (a) Effect of urea on the folding rate (k1) of the monomeric intermediate from E2C unfolded in 4 M urea. Concentration of E2C was 10 mM using the same conditions as in Figure 2, except that the dilution was 1:10 in refolding buffer. The inset shows a wider concentration range plotted logarithmically. (b) Effect of protein concentration on the slow folding rates, k2 and k3, of urea unfolded E2C. The values are indicated in the text. The amplitudes equilibrate at 50% each, but since it is fluorescence intensity, they do not necessarily measure absolute populations. (c) Effect of urea on the folding rates of the slow phases, k2 and k3. Conditions are similar to those in (a). 676 Folding of a Dimeric b-Barrel By taking fluorescence values at each time for 20 different wavelengths, we reconstructed the spectrum of the intermediate at its maximum amplitude (80 ms). An increase in the fluorescence intensity at 80 ms is observed, but with otherwise no change in its wavelength maximum (Figure 4(a) and (b)). This indicates that the tryptophan residue (or residues) responsible for the signal remains fairly accessible to the solvent in the intermediate. The large increase in the fluorescence quantum yield could be related to another group in the proximity of the fluorophore, causing quenching effects in the acid unfolded state. A further association/folding step leads to the observed blue-shift in the fluorescence spectrum (Figure 4(c)), concomitant with a decrease in the intensity, to finally recover the spectral properties of the folded dimeric domain. This is consistent with the burial of the tryptophan residues at the center of the barrel. The transient nature of the monomeric species was previously demonstrated in a double jump refolding-unfolding experiment, were the unfolding amplitude of the phase reached a maximum at 60–80 ms and then disappeared.14 Since the unfolding amplitude is proportional to the amount of intermediate present, varying the urea concentration in the unfolding (second) mixture would in theory cause the gradual disappearance of a folded population of the intermediate, allowing for a quantitative estimate of its stability. Figure 5 shows the effect of urea on the unfolding amplitudes of the monomeric intermediate in a double mixing refolding/unfolding experiment. The curve displays a cooperative transition reaching a plateau at 2.5 M urea, and a further linear change to 4.0 M denaturant. Since both the folding and unfolding of this intermediate are first-order reactions, as the lack of protein concentration dependence previously showed, we can analyze its stability by assuming a two-state, concentrationindependent model. By this procedure, we obtain a DGunf of 3.45(G0.40) kcal molK1, with an m value of 2.55(G0.32) kcal molK1 MK1 (urea). In similar conditions of buffer, pH, temperature, and protein concentration the DGunf of the native folded dimeric domain was measured to be 11.0 kcal molK1, with an m value of 3.1 kcal molK1 MK1 (urea).13 Unfolding kinetics of E2C Figure 4. Evolution of the fluorescence spectrum of the monomeric intermediate to the folded E2C. (a) The complete folding time trace of acid unfolded E2C, followed at two wavelengths as an example (dark dots, 340 nm; light dots, 350 nm). (b) A time trace was followed at 20 different wavelengths at 5 nm intervals, and the spectra at selected times reconstructed and shown. The unfolding kinetics of the dimeric domain was studied by mixing the folded E2 C-terminal domain to a final concentration of 4.0 M urea.13,15 The fluorescence data can be best described by a double-exponential model (Figure 6(a)). Analysis of the relative amplitudes indicates a major phase accounting for 85% of the fluorescence change and a minor phase of 15%, and the unfolding rate of both phases remained unchanged up to 10 mM E2 (not shown). The observed fluorescence change corresponds to the full change upon unfolding; no changes were observed at very short times or in the dead time (not shown). Folding of a Dimeric b-Barrel Figure 5. Stability of the monomeric intermediate from a double jump folding-unfolding experiment at different urea concentrations. Acid unfolded E2C at 10 mM was refolded by 1:1 dilution into Mes (pH 6.1) buffer and, after a 100 ms delay, unfolded by 1:1 dilution at the indicated urea concentrations. The amplitude change is negative (disappearance of the intermediate) and their absolute value plotted, against urea. The data were fitted to a standard two-state unimolecular transition, and the values obtained indicated in the text. The observed rate of the major unfolding phase decreased with urea concentration, and extrapolation of the data to 0 M denaturant gives the unfolding rate constant in water, ku1, which is 0.009 sK1, corresponding to a half life of 32 seconds and an m value of 0.26(G0.01) kcal molK1 MK1 (urea) (Figure 6(b)). The rate of the minor phase (ku2Z0.5 sK1) appears unaffected by urea but the 677 signal is too small for an accurate determination of changes in rate and amplitude with the denaturant. The effect of ionic strength in the unfolding kinetics was investigated by increasing the concentration of NaCl in the unfolding mixture. The major phase (ku) shows a decrease in the unfolding rate as the salt concentration is increased (Figure 6(c)). We had observed a marked effect of NaCl on the stability of E2C,18 and this will have an effect on the rate of unfolding. In fact, the amplitude of unfolding actually decreases due to the stabilizing effect of the salt on the ground state18 (not shown). Thus, as the salt concentration is increased, a smaller fraction of the molecules become unfolded. However, at 0.15 M salt, the effect on the rate is 90%, and the amplitude is unchanged. This suggests that the transition state of unfolding is rather sensitive to ionic strength, the rate is halved at low NaCl concentrations at which the rate of folding is doubled (Figure 6(c)). Unfolding experiments at different wavelengths on the larger unfolding phase show gradual change at all the wavelengths, with no fluorescence increase beyond native or unfolded species, which suggests that the major unfolding reaction is accompanied by a single spectral shift, different from the folding direction (not shown), suggesting the absence of a populated unfolding intermediate. Unfolding of E2C was also followed by circular dichroism and the ellipticity change of the major phase goes in parallel with the fluorescence change and fits to a single-exponential decay with a firstorder rate of 0.018 sK1 in excellent agreement with the rate of the same phase obtained from fluorescence (0.02 sK1, Figure 7). Since ellipticity traces tend to be much noisier than fluorescence the presence of a minor unfolding phase cannot be determined accurately, assuming it does involve an ellipticity change. In addition, the dead time of the CD unfolding is at least five seconds, which Figure 6. Unfolding kinetics of E2C by urea. (a) Unfolding trace of 10 mM E2C in 4 M urea in Mes (pH 6.1) (see Materials and Methods), shown as an example. The data were fitted to two exponentials (inset, residuals). The two observed rates and their amplitudes are independent of the protein concentration (not shown). (b) Urea concentration dependence of the major unfolding phase (see the text for values). (c) Effect of salt concentration on the major urea unfolding phase of E2C. Conditions are the same as in (a), except for the added salt. The inset shows the effect of pH, using the buffer system explained in Figure 2(b). 678 Figure 7. Concomitant loss of secondary and tertiary structure upon unfolding of E2C. E2C was unfolded by dilution into 4 M urea, at pH 6.1 and 25 8C (see Materials and Methods), and the ellipticity change (line) monitored in a time-course manner, and adjusted to a singleexponential equation. The dead time of the experiment was five seconds. An experiment in identical conditions was carried out and the fluorescence (dots) was monitored, but fitted to two exponentials. The rate constants are coincident (see the text). precludes an accurate determination, since the halflife of the minor fast unfolding phase is less than two seconds. At this stage it is not possible to completely rule out an intermediate from this experiment, but a major phase clearly displays a concomitant disappearance of secondary and tertiary structures, with no detectable fast changes. The two slow phases represent parallel folding channels The transient nature of the monomeric intermediate was determined from its amplitude of unfolding in double jump experiments. Two phases (k2 and k3) are present in the refolding of the E2C from both acid and urea unfolded states (Mok et al.14 and Figure 3(b)). Above 5 mM E2C, these amplitudes correspond roughly to 50% of the fluorescence change (Figure 3(b)). To further investigate these phases, we carried out a double jump experiment, in which the pH unfolded domain was allowed to refold for times greater than 100 ms, after the intermediate disappears, and then unfolded in 4.0 M urea. The unfolding process is described by two exponential decays, as in unfolding of fully folded E2C (Figure 6(a), inset). At short times of refolding delay (100 ms), there are already two phases, suggesting a fast equilibrium. The slower unfolding phase (ku1) corresponds to 60% of the amplitude and at longer delay times, 80% (Figure 8). We can use the change in the amplitude of unfolding as a measure of the folded species at different delay times (Figure 8). This can be fitted to an exponential phase and the rate, 1.7 sK1, Folding of a Dimeric b-Barrel Figure 8. Two folding channels demonstrated by time delay double jump folding-unfolding. Acid unfolded E2C (20 mM) was refolded in 100 mM Mes (pH 6.1), and at the indicated delay times, after the disappearance of the monomeric species, the mixture was unfolded to a final concentration of 4.0 M urea. The fluorescence change was monitored and the data fitted to a double exponential equation. The resulting amplitudes, plotted against time, represent the two parallel folding channels, one (ku2) from a monomeric intermediate and the other (ku3) from a fast formed dimeric intermediate. corresponds to that of the folding reaction through this channel. Multiplying k2 (2!105 MK1 sK1) times the protein concentration (2 mM), we obtain a kobs of 0.4 sK1, in very good agreement, considering the large differences in accuracy between a stoppedflow fluorescence trace and the amplitudes of a double jump experiment. Therefore, the two phases occur early in the refolding process, after the formation of the monomeric intermediate, and presumably arise from an isomerization of the latter. From there, the folding proceeds to the final state by two parallel channels that are partitioned depending on protein concentration. As described before, the dissociation constant for the dimeric intermediate is estimated to be around 2 mM (Figure 3(b) and Mok et al.15). Discussion Here, we completed the folding pathway of the DNA binding domain of a dimeric b-barrel transcriptional regulator, HPV E2C, by further characterization of the monomeric intermediate, and its unfolding mechanism. E2C is an excellent model to address problems concerning the folding of intertwined dimeric proteins in general and a dissection of their kinetic folding pathways in particular. The refolding process from the urea unfolded E2C occurs through a pathway highly equivalent to Folding of a Dimeric b-Barrel that of the acid unfolded state; this means that the unfolded state ensembles in urea and acid are very similar, or at least energetically equivalent, lacking persistent long-range interactions.14,15 The monomeric intermediate is present in both reactions, indicating that it is not related to pH effects on the chromophores and is a reproducible species. The large fluorescence increase in the intermediate appears to be due to the release of an interaction of the chromophore with a quencher in the unfolded state. By reconstructing a fluorescence spectrum for this species from kinetic experiments, we were able to determine that there is not a change in the maximum, which remains with properties of an exposed Trp, supporting the idea that the Trp at the interface are largely exposed in the monomeric intermediate. The slow association/folding step includes a shift of the spectra to that of a buried Trp residue. The analysis of the relative amplitudes of the slow refolding/association phases from both high urea and low pH suggest that there is a dimerization of the intermediate at higher protein concentrations. This would generate a parallel folding pathway, to attain the folded dimeric species through a first-order reaction. The change in molecularity indicates that above a certain concentration, the rate-limiting step is unimolecular. From the plot of rate versus E2C concentration, and the second to first-order transition,15 we can estimate a dissociation constant between 1 mM and 5 mM for a dimerization of an early species, approximately 8 kcal molK1 in terms of free energy, compared to 11 kcal molK1 under similar conditions at equilibrium.13 The transition state ensemble for the formation of the intermediate is very sensitive to the ionic strength, at 1 M salt the rate increased linearly by sixfold. On the other hand, the slow folding/ association rates are only slightly increased. The large effect of ionic strength on the formation of the intermediate could be explained by the strengthening of hydrophobic interactions on its transition state by an increase in the ionic strength of the milieu. Alternatively, as experiments on the arc dimer suggest, the high salt concentration could be screening unfavorable ionic interactions in the transition state, contributing to large increases in rate. The salt dependence of the arc dimer is slightly less pronounced than the E2C intermediate but more pronounced compared to the slow association/folding phase of the E2 domain.19 However, in the arc repressor the transition state leads to the folded state, since it is a single step process, and appears to be a single ensemble of structures. The fact that rates of neither the fast nor the slow phases are decreased by ionic strength strongly suggests that the important ionic interactions for the native dimer, if any, are formed after the transition state of the slower folding/association step. Since this transition state is dimeric, it is tempting to suggest the absence of ionic interactions at the dimer interface; this is supported by the largely hydro- 679 phobic nature of the interface of the folded state with some hydrogen bonding added.12 A monomeric protein such as lysozyme shows a large decrease in the refolding rate of the slowest phase as the ionic strength is increased, suggesting that in this case the ionic interactions in the transition state are favorable.20 Lysozyme also has a fast phase where the fluorescence change goes beyond that of the native protein, but different from the E2C intermediate, the rate is insensitive to ionic strength. Previous work showed that the monomeric intermediate underwent substantial structural change from an ellipticity change corresponding to 50% of the total change, increased binding of ANS, and burial of residues.14 Here, we were able to reconstruct the fluorescence spectrum of the intermediate, indicative of solvent accessible tryptophan residues. Moreover, we characterized the stability of this intermediate towards urea denaturation, and found that it proceeds through a cooperative twostate process. The intermediate has w31% of the stability of the folded dimeric domain, indicating that it is clearly a distinctive thermodynamic entity with a cooperative behavior. The m value, or cooperativity parameter is 82% that of the folded E2C dimer at equilibrium.13 The m value was shown to correlate very well with the change in solvent-accessible surface area upon unfolding (DASA), and this, in turn, with the number of residues and with changes in DCp.21 This would mean that there is more burial of surface area in the formation of the native dimer from unfolded dimer than in the formation of the intermediate from the unfolded. In the Trp repressor burst intermediate, its m value is one-third of the fully folded dimeric repressor and the stability of this intermediate is 15% that of the folded state, half of the comparative stability of the monomeric intermediate.22 Unfolding studies showed a major phase, as judged by the major CD change observed with the same rate. We determined a ku value of 0.01 sK1 in the absence of denaturant, of similar magnitude to the 33-residue leucine zipper peptide derived from the yeast transcriptional activator GCN4,8 and tenfold slower than the unfolding of the arc repressor.23 The Trp repressor seems to unfold even slower than E2C, and its larger stability to urea denaturation at equilibrium may contribute to this slow unfolding rate.6 In contrast to the folding direction, the unfolding of E2C occurs through a gradual change in fluorescence spectral properties with no species with enhanced fluorescence detected, strongly suggesting an apparent concurrent two-state unfolding and dissociation with no intermediate being populated. The bimolecular folding rate constant of E2C is 105 M K1 s K1, substantially lower than the theoretical rate expected from the collision of two spheres,24 and much slower than that of Trp or arc repressors, occurring closer to the diffusion rate limit.19,25 If the monomeric intermediate was native-like, the association rate would be much 680 faster, i.e. approaching diffusion control. Nevertheless, it is clear that the intermediate has compact non-native tertiary structure and represents a distinct thermodynamic species. This species is stable and cooperative and must partially unfold or decompact when proceeding to the dimeric folded species. However, such rearrangement is not expected for the formation of the weak dimeric intermediate I2. However, at this stage, the decompaction of the monomeric intermediate cannot be studied by direct methods in isolation from all other reactions. The Trp repressor showed a burst phase monomeric intermediate and the arc repressor showed no detectable monomeric intermediate, but its extremely fast folding rate suggests the fast formation of a partly folded monomer or an unfolded monomer with a structure compatible with native folding.25,26 The GCN4-p1 leucine zipper domain lies between the Trp repressor and the E2C domain, in that an intermediate is not populated but the folding/association rate constant is two orders of magnitude lower than the near diffusion rates of Trp and arc repressors.8 However, it is not clear why, given the ultrafast helix-coil equilibrium, the pre-formed native helical structure does not yield a near diffusion limit second-order folding/association rate. It is well known that the folding in regions that are helical in the folded state can occur in the dead time of most of the standard kinetic techniques.27 It is difficult, however, to determine whether the structures formed are full helices or represent a pre-helix conformation that may act as pre-formed native-like structures that would accelerate a concomitant folding and association reaction. It is, nevertheless, clear that b-sheet formation requires precise tertiary interactions that will require a larger conformational search or rearrangement of collapsed structures. The unusual dimeric interface present in the half-barrel of a dimeric b-barrel domain such as E2C appears to require the formation of a compact tertiary non-native structure most likely devoid of interstrand b-sheets, that must rearrange to form such a complex and intertwined b-strand based interface. A similar picture emerged Figure 9. Folding mechanism of HPV-16 E2C (see Discussion). Folding of a Dimeric b-Barrel from the comparison of folding from fragments in two paradigmatic small proteins, barnase and CI2.28 Figure 9 shows the simplest model compatible with our results at this stage. The unfolded state ensemble in urea has local residual structure15 but we cannot rule out discrete sub-populations of isomers that may direct the formation of a monomeric or dimeric intermediate. Focusing on the main pathway at low protein concentration, we determined the kinetic stability from ku1/k1k2, to be 12 kcal molK1, compared to 11 kcal molK1 from equilibrium.13 If the minor unfolding phase was sequential, i.e. an intermediate, then the overall process should be ku2ku1/k1k3, and this yields an absurd value. Our results so far suggest that the early dimerization takes place from the monomeric compact intermediate. Thus, depending on the protein concentration, the folding pathway will take place predominantly through a first-order or secondorder reaction. These parallel channels yield possibly two folded conformers, where the actual structural change could be very small, i.e. the position of a side-chain. We propose that there is a slow pH-dependent equilibrium around a pKa corresponding to histidine groups.13 There is a major phase of unfolding, indicative of no intermediate in the reverse reaction. It remains to be established if the minor fast unfolding phase takes place in parallel or if it corresponds to an unfolding intermediate; so far, everything suggests that it is a parallel unfolding pathway. Ongoing mutagenesis, DNA binding, protein truncation and NMR studies of the folded and unfolded states will complete the folding pathway of this model protein with a unique folding topology. Materials and Methods The expression in Escherichia coli as well as the purification of the E2 DNA binding domain from HPV16 were as described.13 An extinction coefficient of 41,900 MK1 was used for determining the concentration of the dimer.29,30 Refolding and unfolding experiments were performed using a DX-18MV stopped-flow equipment from Applied Photophysics, and an Aminco Bowman Series 2 spectrofluorimeter. Fluorescence experiments were carried out at fixed excitation (280 nm) and measuring the emission at 335 nm, except where the wavelength is the variable. The unfolded protein was obtained either by lowering the pH to 1.7 with HCl or at 4.0 M urea, depending on the type of experiment. Refolding buffer was 100 mM Mes– NaOH buffer (pH 6.1), and 1 mM DTT, unless otherwise stated. All experiments were carried out at a fixed temperature of 25(G0.1) 8C, unless indicated otherwise. Double jump refolding/unfolding experiments were started from 8 mM acid unfolded E2C, mixed with refolding buffer for 100 ms or the indicated delay time as the variable and then unfolded with different concentrations of urea in 100 mM Mes buffer (pH 6.1), and the fluorescence change monitored. For unfoldingrefolding double jump experiments, 8 mM E2 was mixed 681 Folding of a Dimeric b-Barrel 1:1 with 8 M urea, both in Mes buffer, in the first mixture to give 4 mM and 4.0 M concentrations of protein and denaturant, respectively. After variable times of unfolding, E2 was diluted to 0.4 M urea, and the fluorescence monitored. We previously showed that the refolding of acid unfolded E2 domain is decomposed into a fast concentration-independent phase (k1), and a slow association/folding phase with more than one component.14 For the data analysis, we used a first-order approximation (two exponentials), which showed a strong concentration dependence in one phase and a less pronounced dependence on the other (k2 and k3, respectively). Since the rates are only fourfold different, it is extremely difficult to assess whether the slower rate (k3) changes with concentration or it is affected by the faster secondorder rate (k2). Since in the present work, we analyze the effect of several factors on the folding, we simplify the analysis by treating the rates as single-exponential decays, as opposed to using a sum of second-order equations, not amenable for an accurate mathematical analysis. k2 and k3 are thus apparent rates, but are nevertheless useful to describe the effects studied here. The association phase at a single concentration was analyzed using a second-order equation with an exponential component as previously described,15 and gives a value identical to that from the slope of the observed rates, validating the approach. Circular dichroism spectra were recorded on a Jasco J810 instrument in 0.1 cm path cuvettes, at 25(G0.1)8 C at 10 mM protein concentration, and averaging ten scans. For urea unfolding denaturation, folded E2C was diluted into a 4.0 M urea solution in 25 mM BisTris–HCl (pH 7.0) and the ellipticity recorded immediately after. 6. 7. 8. 9. 10. 11. 12. 13. 14. Acknowledgements We thank Lisa Gloss for helpful criticism of the manuscript. A.D.N., L.G.A., and D.U.F. hold PhD fellowships from CONICET. This work was supported by the Wellcome Trust, CRIG OIA U41 RG27994. The authors thank the support of Agencia Nacional de Promocion Cientı́fica y Tecnológica (ANPCyT, Argentina). 15. 16. References 17. 1. Jaenicke, R. & Lilie, H. (2000). Folding and association of oligomeric and multimeric proteins. Advan. Protein Chem. 53, 329–401. 2. Wallace, L. A. & Matthews, C. R. (2002). Sequential versus parallel protein-folding mechanisms: experimental tests for complex folding reactions. Biophys. Chem. 101-102, 113–131. 3. Schymkowitz, J. W., Rousseau, F., Irvine, L. R. & Itzhaki, L. S. (2000). The folding pathway of the cellcycle regulatory protein p13suc1: clues for the mechanism of domain swapping. Struct. Fold. Des. 8, 89–100. 4. Wallace, L. A. & Dirr, H. W. (1999). Folding and assembly of dimeric human glutathione transferase A1-1. Biochemistry, 38, 16686–16694. 5. Kim, D. H., Jang, D. S., Nam, G. H. & Choi, K. Y. 18. 19. 20. (2001). Folding mechanism of ketosteroid isomerase from Comamonas testosteroni. Biochemistry, 40, 5011–5017. Gittelman, M. S. & Matthews, C. R. (1990). Folding and stability of trp aporepressor from Escherichia coli. Biochemistry, 29, 7011–7020. Milla, M. E. & Sauer, R. T. (1994). P22 Arc repressor: folding kinetics of a single-domain, dimeric protein. Biochemistry, 33, 1125–1133. Zitzewitz, J. A., Bilsel, O., Luo, J., Jones, B. E. & Matthews, C. R. (1995). Probing the folding mechanism of a leucine zipper peptide by stopped-flow circular dichroism spectroscopy. Biochemistry, 34, 12812–12819. Hegde, R. S., Grossman, S. R., Laimins, L. A. & Sigler, P. B. (1992). Crystal structure at 1.7 Å of the bovine papillomavirus-1 E2 DNA-binding domain bound to its DNA target. Nature, 359, 505–512. Hegde, R. S. (2002). The papillomavirus E2 proteins: structure, function, and biology. Annu. Rev. Biophys. Biomol. Struct. 31, 343–360. Bochkarev, A., Barwell, J. A., Pfuetzner, R. A., Furey, W., Jr, Edwards, A. M. & Frappier, L. (1995). Crystal structure of the DNA-binding domain of the Epstein– Barr virus origin-binding protein EBNA 1. Cell, 83, 39–46. Nadra, A. D., Eliseo, T., Mok, Y. K., Almeida, C. L., Bycroft, M., Paci, M. et al. (2004). Solution structure of the HPV-16 E2 DNA binding domain, a transcriptional regulator with a dimeric beta-barrel fold. J. Biomol. NMR, 30, 211–214. Mok, Y. K., de Prat Gay, G., Butler, P. J. & Bycroft, M. (1996). Equilibrium dissociation and unfolding of the dimeric human papillomavirus strain-16 E2 DNAbinding domain. Protein Sci. 5, 310–319. Mok, Y. K., Bycroft, M. & de Prat-Gay, G. (1996). The dimeric DNA binding domain of the human papillomavirus E2 protein folds through a monomeric intermediate which cannot be native-like. Nature Struct. Biol. 3, 711–717. Mok, Y. K., Alonso, L. G., Lima, L. M., Bycroft, M. & de Prat-Gay, G. (2000). Folding of a dimeric betabarrel: residual structure in the urea denatured state of the human papillomavirus E2 DNA binding domain. Protein Sci. 9, 799–811. Record, M. T., Jr, Zhang, W. & Anderson, C. F. (1998). Analysis of effects of salts and uncharged solutes on protein and nucleic acid equilibria and processes: a practical guide to recognizing and interpreting polyelectrolyte effects, Hofmeister effects, and osmotic effects of salts. Advan. Protein Chem. 51, 281–353. Chen, B. L., Baase, W. A. & Schellman, J. A. (1989). Low-temperature unfolding of a mutant of phage T4 lysozyme. 2. Kinetic investigations. Biochemistry, 28, 691–699. Lima, L. M. & de Prat-Gay, G. (1997). Conformational changes and stabilization induced by ligand binding in the DNA-binding domain of the E2 protein from human papillomavirus. J. Biol. Chem. 272, 19295–19303. Waldburger, C. D., Jonsson, T. & Sauer, R. T. (1996). Barriers to protein folding: formation of buried polar interactions is a slow step in acquisition of structure. Proc. Natl Acad. Sci. USA, 93, 2629–2634. Itzhaki, L. S., Evans, P. A., Dobson, C. M. & Radford, S. E. (1994). Tertiary interactions in the folding pathway of hen lysozyme: kinetic studies using fluorescent probes. Biochemistry, 33, 5212–5220. 682 Folding of a Dimeric b-Barrel 21. Myers, J. K., Pace, C. N. & Scholtz, J. M. (1995). Denaturant m values and heat capacity changes: relation to changes in accessible surface areas of protein unfolding. Protein Sci. 4, 2138–2148. 22. Mann, C. J. & Matthews, C. R. (1993). Structure and stability of an early folding intermediate of Escherichia coli trp aporepressor measured by far-UV stoppedflow circular dichroism and 8-anilino-1-naphthalene sulfonate binding. Biochemistry, 32, 5282–5290. 23. Milla, M. E., Brown, B. M., Waldburger, C. D. & Sauer, R. T. (1995). P22 Arc repressor: transition state properties inferred from mutational effects on the rates of protein unfolding and refolding. Biochemistry, 34, 13914–13919. 24. Fersht, A. R. (1985). Enzyme Structure and Mechanism, W. H. Freeman, New York. 25. Gloss, L. M. & Matthews, C. R. (1998). Mechanism of folding of the dimeric core domain of Escherichia coli trp repressor: a nearly diffusion-limited reaction leads to the formation of an on-pathway dimeric intermediate. Biochemistry, 37, 15990–15999. 26. Gloss, L. M. & Matthews, C. R. (1998). The barriers in the bimolecular and unimolecular folding reactions of 27. 28. 29. 30. 31. the dimeric core domain of Escherichia coli Trp repressor are dominated by enthalpic contributions. Biochemistry, 37, 16000–16010. Thompson, P. A., Eaton, W. A. & Hofrichter, J. (1997). Laser temperature jump study of the helix4coil kinetics of an alanine peptide interpreted with a “kinetic zipper” model. Biochemistry, 36, 9200–9210. de Prat-Gay, G. (1996). Association of complementary fragments and the elucidation of protein folding pathways. Protein Eng. 9, 843–847. Gill, S. C. & von Hippel, P. H. (1989). Calculation of protein extinction coefficients from amino acid sequence data. Anal. Biochem. 182, 319–326. Pace, C. N., Vajdos, F., Fee, L., Grimsley, G. & Gray, T. (1995). How to measure and predict the molar absorption coefficient of a protein. Protein Sci. 4, 2411–2423. Guex, N. & Peitsch, M. C. (1997). SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling. Electrophoresis, 18, 2714–2723. Edited by F. Schmid (Received 7 April 2005; received in revised form 10 May 2005; accepted 31 May 2005) Available online 1 July 2005
© Copyright 2026 Paperzz