Protein Engineering vol.10 no.12 pp.1363–1372, 1997 A fast estimate of electrostatic group contributions to the free energy of protein-inhibitor binding Ingo Muegge,‡ Holly Tao and Arieh Warshel1 Department of Chemistry, University of Southern California, Los Angeles, CA 90089-1062, USA 1To to have a rapid way of anticipating the effect of mutation on both drug binding and catalysis. Keywords: drug resistance/electrostatic fingerprint/endothiapepsin/group contributions/inhibitor binding whom correspondence should be addressed ‡Present address: Pharmaceutical Products Division, Abbott Laboratories, Abbott Park, IL 60064-3500, USA Dissecting ligand–protein binding free energies in individual contributions of protein residues (which are referred to here as ‘group contributions’) is of significant importance. For example, such contributions could help in estimating the corresponding mutational effects and in studies of drug resistance problems. However, the meaning of group contributions is not always uniquely defined and the approximations for rapid estimates of such contributions are not well developed. In this paper, the nature of group contributions to binding free energy is examined, focusing particularly on electrostatic contributions which are expected to be well behaved. This analysis examines different definitions of group contributions; the ‘relaxed’ group contributions that represent the change in binding energy upon mutation of the given residue to glycine, and the ‘non-relaxed’ group contributions that represent the scaled Coulomb interaction between the given residue and the ligand. Both contributions are defined and evaluated by the linear response approximation (LRA) of the PDLD/ S method. The present analysis considers the binding of pepstatin to endothiapepsin and 23 of its mutants as a test case for a neutral ligand. The ‘non-relaxed’ group contributions of 15 endothiapepsin residues show significant peaks in the ‘electrostatic fingerprint’. The residues that contribute to the electrostatic fingerprint are located in the binding site of endothiapepsin. They include the aspartic dyad (Asp32, Asp215) with adjacent residues and the flap region. Twelve of these 15 residues have a heavy atom distance of <3.75 Å to pepstatin. The contributions of 8 (10) of these 12 residues can be reconciled with the calculated ‘relaxed’ group contributions where one allows the protein and solvent (solvent only) to relax upon mutation of the given residue to glycine. On the other hand, it was found that residues at the second ‘solvation shell’ can have relaxed contributions that are not captured by the non-relaxed approach. Hence, whereas residues with significant nonrelaxed electrostatic contributions are likely to contribute to binding, residues with small non-relaxed contributions may still affect the binding energy. At any rate, it is established here that even in the case of uncharged inhibitors it is possible to use the non-relaxed electrostatic fingerprint to detect ‘hot’ residues that are responsible for binding. This is significant since some versions of the nonrelaxed approximation are faster by several orders of magnitude than more rigorous approaches. The general applicability of this approach is outlined, emphasizing its potential in studies of drug resistance where it is crucial © Oxford University Press Introduction The free energy of biological processes such as catalysis and binding provides a correlation between structure and function of proteins (e.g. Warshel, 1981b; Warshel and Åqvist, 1991; Kollman, 1993). This is particularly true for electrostatic energies (Warshel, 1981b) and decomposing the components of such energies to the contributions of individual residues can provide useful guidance for sequence–function relationships (e.g. Muegge et al., 1996). In principle, one can define a thermodynamic cycle that decomposes the calculated property into its group contributions in such a way that these contributions add up to the total value. However, these group contributions depend on the thermodynamic cycle used (e.g. van Gunsteren and Mark, 1992). If one deals with n residues one can create n! different thermodynamic cycles that add up to the same total but each of them may yield different group contributions. Nevertheless, the use of isolated energy contributions to the total free energy and in particular electrostatic contributions is extremely useful, as demonstrated in early studies (Warshel et al., 1986) and in more recent work (e.g. Lee et al., 1992; Boresch and Karplus, 1995). Most importantly, as argued by several workers (Åqvist et al., 1991; Muegge et al., 1996), it is clearly possible and useful to define a contribution by operational definition in terms of a real or conceptual experiment. For example, it is very useful to relate group contributions to mutation experiments. These contributions can be linked to conformational changes and/or changes of interaction energies between the ligand and its environment (protein solvated in water). One of the most reasonable and useful definitions of group contributions is the change of a particular property upon mutation of the given residue to glycine. Such group contributions and the proper thermodynamic cycles can be used to generate all possible mutations. Unfortunately, the calculation of the free energies of mutations is very time consuming and can usually be done for only a limited number of residues. Hence it is important to search for effective approximations for the rapid evaluation of group contributions. This is particularly needed in studying drug resistance problems, where it is crucial to have a fast way of estimating the effect of mutations on the binding of proposed drugs. In previous studies we developed several approximations for obtaining group contributions. In particular, we examined a ‘non-relaxed’ approach that considered the contribution of the protein relaxation in an implicit way, using different ‘protein dielectric constants’ for charged and uncharged residues (Muegge et al., 1996), and compared this approach in some cases with the results obtained by the corresponding 1363 I.Muegge, H.Tao and A.Warshel ‘relaxed’ approaches. The motivation for the non-relaxed approach is obvious: this approach can provide a fast screening of the effect of mutations and locate ‘hot’ residues of significant biological importance (Muegge et al., 1996). However, the range of validity of using the non-relaxed group contributions was not examined in a systematic way. This work presents such an examination, considering as a test case the binding of the inhibitor pepstatin to the aspartic proteinase endothiapepsin, a system for which many structural and binding studies are available (Workman and Burkitt, 1979; James et al., 1982; Rich, 1985; Foundling et al., 1987; Cooper et al., 1989; Sali et al., 1989; Rao et al., 1993a,b; Åqvist et al., 1994; Warshel et al., 1994; Gomez and Freire, 1995). This system is also of great interest in view of the fact that the HIV proteinase is also an aspartic proteinase and studies of the mutations of these enzymes can be useful in providing fundamental understanding of drug resistance. The next section describes the PDLD/S-LRA method and its use in evaluating group contributions. The third section compares the performances of the relaxed and non-relaxed approaches for the case of the uncharged inhibitor pepstatin, where approximate group contributions are not expected to be highly accurate. It is illustrated that the non-relaxed approach provides a reaonable but far from perfect approximation for results of the relaxed approach. In the final section we discuss the significance of our findings, pointing out that even a rough approximation can be useful in studies of drug resistance problems, where one needs very fast screening approaches. Methods PDLD/S-LRA calculations of binding free energies The absolute energy of ligand binding to proteins can be estimated in different ways. These include formally rigorous approaches such as the free energy perturbation (FEP) method (Warshel et al., 1988; Kollman, 1993) and the all-atom linear response approximation (LRA) introduced in studies of binding energies by Lee et al. (1992) and used effectively and extensively by Åqvist and co-workers (Åqvist et al., 1994; Åqvist and Hansson, 1996). More approximated and significantly faster approaches which frequently focus on electrostatic energies are also effective (Bohm and Klebe, 1996). These include the scaled protein dipole Langevin dipole (PDLD/S) method (e.g. Lee et al., 1992) and other approaches (e.g. Madura et al., 1996; Froloff et al., 1997). The present work is formally based on a recent version of the PDLD/S method that takes the protein reorganization into account within the LRA framework. The implementation of the PDLD/ S-LRA method in calculations of binding free energies was considered in several recent publications (e.g. Muegge et al., 1996, 1997b) and here we will emphasize and clarify the main points of this approach. The absolute binding free energy is considered here using the thermodynamic cycle of Figure 1. The inner cycle of the figure (a, d, e, h) is the cycle considered by Lee et al. (1992) in evaluating antibody antigen binding energies. It describes elec ) to the absolute binding the electrostatic contribution (∆G bind,l free energy of a ligand (1) to a protein (p), where the configurations of a protein–ligand complex are kept at a single configuration (s) in the bound state and single configuration (s9) at the dissociated state but the solvent is allowed to relax at each step of the cycle. In this case, we can express the total binding free energy as 1364 elec ) (∆G bind)s9→s 5 (∆G bind s→s9 1 (∆G hyd 1 ∆G vdw 2 T∆S9)s→s9 (1) w ) 1 (G bind ) 5 (∆G pelec,l)s 2 (∆G elec,l elec,l9 s9→s s9 1 (∆G hyd 1 ∆G vdw 2 T∆S9)s9→s where ( )s indicates that the corresponding form is evaluated at a single configuration of the protein–ligand complex. The term ∆G w elec,l is the solvation energy of the ligand in water (without the van der Waals and hydrophobic contributions), ∆G pelec,l is the change of the electrostatic contribution to the solvation energy of the protein–ligand complex upon charging the ligand (from an artificial uncharged state where all atomic partial charges are set to zero to the state where all charges including partial charges are switched on) and ∆G elec,l9 is the electrostatic contribution to the binding free energy of the uncharged ligand, l9. ∆G hyd and ∆G vdw are the hydrophobic and van der Waals contributions to binding and 2T∆S9 represents the non-electrostatic entropic contribution which is associated with the binding of the uncharged ligand. The free energy of Equation 1 can be evaluated conveniently by the semi-macroscopic PDLD/S model (e.g. Lee et al., 1992), replacing the microscopic ∆G elec terms by their semi-macroscopic counterparts. This is done by considering the extra cycles (h, e, f, g) and (a, b, c, d) where the ‘dielectric constant’ of the solvent around the protein is changed from that of water, εw, to a value that corresponds to the assumed protein ‘dielectric constant’, εin (this parameter represents the implicit contribution of the protein which will be considered below). The PDLD/S contributions of the different steps in the cycles are considered in detail elsewhere (Lee et al., 1992, 1993) and they are also given in Figure 1. Since we deal with a single protein–ligand configuration, we consider the sum of the PDLD/S terms as an effective potential (a potential that should be averaged to obtain the proper free energy) and write (∆G pelec,l)s 5 U pelec,l 5 [ l1p – ∆G l91p ) (∆G sol sol ( ε1 in ( l 1∆G sol 1– 1 εin )1 Vε l qµ 1 l V intra in εin [ ( ) ( ) ] w ) 5 Uw (∆G elec,l elec,l 5 s9 l 1∆G sol 1– – 1 εin l (∆G sol 1 l V intra εin ] 1 εw ) S (2) 1 1 – εin εw S9 where ∆G sol denotes the electrostatic contribution to the solvation free energy of the indicated group in water. To be more precise, ∆G sol should be scaled by 1/(1–1/εw), but this small correction is neglected here. V qµ is the electrostatic interaction between the indicated groups in vacuum (this is a l9 5 0. standard PDLD notation) and in the present case Vqµ V lintra is the intramolecular electrostatic interaction of the ligand. Since the PDLD/S results obtained with a single protein–ligand configuration cannot capture properly the effect of the protein reorganization [see discussion in Sham et al. (1997)] a more consistent treatment should involve the use of Electrostatic group contributions to the free energy of protein–inhibitor binding Fig. 1. Thermodynamic cycles for studies of the binding of a ligand (1) to a protein (p). The cycles consider only the electrostatic contributions to binding free energies. This is done by changing the charges of the ligand from their actual value (black squares) to zero (cross-hatched squares). The figure only represents a single configuration (s) for the complex state and a single configuration (s9) for the unbound state. The proper average over the configuration of the system, on the charged and uncharged state of the ligand, is performed by the LRA approach as described in the text. The inner cycle (a, b, d, h) describes the binding process on a macroscopic level. The outer cycles (a, d, c, b) and (h, e, f, g) are used to provide the semi-macroscopic estimates of the relevant electrostatic energies. This is done by using the PDLD/S three steps cycle where the solvent ‘dieletric constant’ is changed from εw to εin, the ligand charges are changed and finally the dielectric constant is changed back form εin to εw. The relevant PDLD/S energy contributions are given; more details about these evaluations are given elsewhere (e.g. Lee et al., 1992). the LRA or related approaches (e.g. Lee et al., 1992; Sham et al., 1997). This approach provides a reasonable approximation for the actual free energy by using [see Lee et al., (1992)]. (3) ∆G pelec,l 5 12 〈U pelec,l 〉l91p 1 〈U pelec,l〉l1p ( ( 1 w w ∆G w elec,l 5 2 〈U elec,l 〉l9 1 〈U elec,l〉l ) ) where , .l and , .l9 designate an MD average over a force field that corresponds to the ligand in its charged and uncharged form and , .l1p and , .l91p designate an MD average over a force field of the ligand plus protein system where the bound ligand is in its charged and uncharged forms, respectively. It is important to realize that the average of Equation 3 is always done where both contributions to the relevant Uelec are evaluated at the same configurations. That is, the PDLD/S energies of the charged and uncharged states are always evaluated at each averaging step at the same structure, but these structures are generated by MD simulations using the potential surface of the charged and uncharged states. The free energy that corresponds to the first two terms of Equation 1 is now obtained from Equations 2 and 3 and is given by p p w ∆G w→ elec,l 5 ∆G elec,l – ∆G elec,l [〈 l91p→l1p ) ™12 (∆G elec,l 〈 l91p→l1p ) 1 (∆G elec,l (4) 1 ( – 1 εin εw 1 (ε – in 1 εw ) l9 l 1 ∆G sol l qµ in 1 [(〈 〉 〈 〉 ) ( l –12 ∆G sol ) 1 Vε 〉 l l V qµ εin 1 εin – 〉 l91p ] l1p 1 εw ) (〈 〉 1 〈∆G 〉 – 〈∆G 〉 – 〈∆G 〉 ) l – ∆G sol l1p l sol ( 1 – ε1 ) ] 1 ∆G l91p l sol l l sol l9 l intra in 1365 I.Muegge, H.Tao and A.Warshel where ∆G lintra is the contribution to the cycle due to structural l . Here we use the fact that ∆G l9 5 0, since relaxation of Vintra sol the hydrophobic term is evaluated separately. Also note that ∆G psol cancels out in the cycle. The last two terms of Equation 4 can be written as [(〈 〉 〈 〉 ) ( l l –12 ∆G sol 1 ∆G sol l 1– l9 1 εw ) (5) (〈 〉 1 〈∆G 〉 ) (1 – ε )] 1 ∆G l – ∆G sol l sol l1p 1 l91p l intra. in The present treatment keeps the ligand structure unchanged in the cycles used to evaluate Equation 4 and evaluates the energetics of its structural relaxation in water by a separate microscopic step. Thus the last two terms in Equation 4 are reduced to –〈∆G lsol 〉l1p ( ε1 – ε1 ) in w l91p p –∆G bind elec,l9 ™ (–〈∆G sol 〉l91p 1 〈∆G sol 〉l91p) 1 ( ε – ε ) 1 ∆∆G prelax 1 ∆G l9relax in w (6) where we used the fact that the electrostatic contribution to ∆G l9sol is zero and we also used the lower cycle to obtain the first two terms of Equation 6. The first three terms represent the electrostatic contribution to the change in solvation energy during the dissociation process when the ligand is held to its configuration in the (l9 1 p) state, while ∆∆G prelax represents the electrostatic effect of protein relaxation upon release of 19 and ∆G l9relax represents the corresponding effect of relaxation of l9. Note also that the average of Equation 6 is not based on a rigorous LRA procedure. Now we can express the PDLD/S-LRA binding free energy as (〈 〉〈 ∆Gbind™ ∆G l1p – ∆G psol sol 〉 – 〈∆G 〉 )( ε1 – ε1 ) l sol l1p in (7) w 〈 〉 ε1 – [(〈∆G 〉 – 〈∆G 〉 ) l 1 V qµ l91p sol in (〈 〉〈 l91p – ∆G p – ∆G sol sol p sol l91p l91p 〉)] ( ε1 – ε1 ) in w 1∆∆G prelax 1 ∆G l9relax 1 ∆Ghyd 1 ∆Gvdw – T∆S9 where , . without subscript designates 1/2 (, .l1p 1 , .l91p) and where, as stated above, we evaluated the effect of Vintra and ∆G l9relax in a separate cycle by constraining the ligand to stay in its original configuration during the cycle of Figure 1. Now the 2T∆S9 term does not include the entropic contributions l but only the translational and rotational of ∆∆G prelax and ∆G relax 1366 [(〈 〉 〈 〉 ) (〈 ∆G l91p sol l91p9 p – ∆G sol l91p 〉〈 – ∆Gl91p – ∆Gpsol sol 〉)] ( ε1 – ε1 ). in w This approximation involves an overestimate of the change in the electrostatic solvation of the protein upon displacement of l9 (the final expression given a larger weight to ∆Gpsol than to ∆Gl1p sol ). The validity of this approximation will be examined elsewhere, but it has little consequence except in the present case where we focus on mutations of neutral residues. Now we end with our final expression: 1 1 p l ∆Gbind ™ 〈∆G l1p – (8) sol 〉 – 〈∆G sol 〉 – 〈∆G sol 〉l1p εin εw ( l 〉 1〈V qµ Considering for simplicity the dissociation (d→e) rather than the binding process (e→d) we may approximate the negative bind term of Equation 1 by of the 2∆G elec,l9 1 terms. We further simplify our treatment by assuming that most p of the effect of ∆∆G relax is captured in the LRA treatment of the uncharging process (a → d in Figure 1). This approximation is further discussed in Muegge et al. (1998). We also make a convenient approximation and neglect the term )( 1 εin ) l 1 ∆G relax 1 ∆Ghyd 1 ∆Gvdw – T∆S9 ∆G hyd represents the hydrophobic term that estimates the relevant surface area from the number of Langevin dipoles in the first ‘solvation shell’ and corrects it considering the local field of each dipole (Lee et al., 1993). ∆G vdw is evaluated by assigning van der Waals interaction terms to the Langevin dipoles (Lee et al., 1993). With regard to the term εin, it is important to realize that this scaling constant is not related to the true local dielectric constant of the protein but merely represents the contributions that are not treated explicitly in the model (King et al., 1991). When all contributions are treated explicitly in the case of the regular PDLD treatment, then εin 5 1. When the protein-induced dipoles are treated implicitly, as in the PDLD/S model, then εin 5 2. Since the PDLD/S represents explicitly all contributions except the induced dipoles, ideally εin 5 2 should be implemented. However, experimentation with many test cases indicates that more accurate results are obtained with εin ™ 4 (larger values are needed if the protein reorganization is not taken into account (Muegge et al., 1996; Sham et al., 1997). This is probably due to the fact that most microscopic treatments do not fully converge within the available stimulation time. In particular, it is possible that the penetration of water molecules upon change of substrate charges is not fully accounted for. It is also possible that the simulations do not sample sufficiently large configurational space, although the model involves averaging over configurations generated by MD simulations with very reliable boundary conditions and with a proper treatment of long-range effects by the LRF approach (Lee and Warshel, 1992). The performance of the PDLD/S approach in evaluation of electrostatic free energies in general (e.g. pKa values, redox potentials, ion pair energies) and binding free energies in particular has been well established (Lee et al., 1993; Warshel et al., 1994; Muegge et al., 1996; Sham et al., 1997). It was also shown that the PDLD/S approach provided reasonable estimates of the effect of mutation on the binding energy of highly charged ligands (Muegge et al., 1996). The PDLD/S-LRA procedure is implemented in a fully automated way in the POLARIS program package (Lee et al., 1993). Electrostatic group contributions to the free energy of protein–inhibitor binding Group contributions The PDLD/S approach for calculations of absolute binding free energies can also be useful in estimating the contributions of individual residues to the binding process. A very useful definition of these group contributions is the effect of ‘mutating’ the given residue to glycine, or alternatively of annihilating all the residual charges of the given residue. Although these definitions are not unique, they provide a ‘road map’ for the location of ‘hot’ residues whose mutations are likely to change the functional properties of the protein (Muegge et al., 1996). This definition is also very useful in any attempt to generate all possible mutations since the mutation A → B can always be obtained by the thermodynamic cycle A → Gly, Gly→ B. The difference between the calculated binding affinities of the wildtype and the glycine mutants are referred to here as the ‘relaxed group contributions’ of the mutated residue. These group contributions involve the full relaxation of the protein matrix as well as the surrounding water solvent upon mutation. In particular, the steric reorganization caused by the truncation of large side chains is explicitly included. Since the evaluation of the relaxed group contributions is time consuming, it is important to examine alternative treatments. In particular, we note that the electrostatic contribution of the ith residue to Equation 8 can be written as (∆Gbind)i 5 V (i) qµ εin 1 〈∆G l1p – ∆G psol 〉(i) sol ( 1 εin – 1 εw ) (9) where we designate (V lqµ)(i) by V (i)qµ. The ith contribution of the solvation term represents the effect of the solvent in response to turning off the residual charges of the ith residue. For nonionized residues the effect of the solvent around the protein is expected to be small. Furthermore, for non-ionized residues the effect of protein reorganization upon turning off the charges of the ith residue is expected to be small and can be reasonably represented by small εin. Thus we may try to use [ ] ∆∆G (i) bind unionized ™ 〈 V (i) qµ εin 〉 (10) This non-relaxed approximation, with εin 5 4, has been found to be reasonable in the case of nucleotide binding in p21ras (Muegge et al., 1996) and the ‘solvation’ of the heme in cytochrome c (Muegge et al., 1997a). Obviously, Equation 9 and small εin provide a very poor approximation in the case of ionized residues where the contribution from the solvation term is enormous, leading to significant compensation of the Vqµ term. Furthermore, as illustrated and argued repeatedly by Warshel and co-workers (e.g., Warshel and Russell, 1984), the combined effect of the solvent and protein relaxation leads to almost complete compensation of the Vqµ term. This physics can be best captured by simply using Coulomb’s law with a large effective dielectric constant. That is, for the group contribution of charged residues, it is very effective to use the seemingly simple expression [∆∆G ] (i) bind ionized ™ 〈 Vε 〉 (i) qµ (11) eff This approximation, with εeff 5 40, has been repeatedly validated in studies of interactions between ionized residues and charged ligands (e.g. Warshel and Åqvist, 1991; Muegge et al., 1996; Sham et al., 1997, and references therein). In the case of interactions between ionized residues and non-ionized ligands the rules are less clear and here we use εeff 5 20 to scale the charge–dipole interactions between the charged protein residue and the neutral ligand. Equations 3 and 4 can be further simplified by replacing the average over Vqµ by Vqµ at the average structure of a single equilibration run or just at the crystal structure. This non-relaxed approach is several orders of magnitude faster than the PDLD/ S-LRA approach, which is in turn a few orders of magnitude faster than the LRA and FEP approaches (Lee et al., 1992). Of course, the non-relaxed approach is less reliable than the more rigorous approaches and this work is mainly focused on determining how much is traded in reliability for the price of a faster method and on examining the optimum εeff for ionized and non-ionized residues. Equations 9 and 10 do not include the contributions from ∆G hyd, ∆G vdw and 2T∆S9. The first two contributions are less sensitive to the detailed structure of the mutated residues and can be estimated by simplified approaches. In particular, a single evaluation of ∆G hyd and ∆G vdw by the PDLD/S approach is sufficient except in the case of mutation to a large bulky residue, which is likely to lead to large structural changes. The 2T∆S9 term is anyhow hard to estimate by any approach and it is outside the scope of the present work. Hence the main uncertainty in developing a practical non-relaxed model is how closely we can come with Equations 8 and 9 to the more complete PDLD/ S treatment of ∆G elec bind. Results and discussion The crystal structure of the aspartic proteinase endothiapepsin complexed with the inhibitor pepstatin (Cooper et al., 1989) was allowed to undergo a short molecular dynamics (MD) relaxation of 5 ps at 300 K by using the program ENZYMIX (Lee et al., 1993). The spherical inner part of the system with radius 18.0 Å was constrained by a weak harmonic potential of → → the form V9 5 ΣiA(r i – ri0)2, with A 5 1.0 kcal/mol. Å2. The r.m.s. deviation of the relaxed structure from the crystal structure was 0.7 Å on average. The binding energy of pepstatin to endothiapepsin depends on the protonation states of the protein (Gomez and Freire, 1995). Therefore, the proper choice of the ionization states of the aspartic acids in the binding pocket is important for reproducing a reasonable binding energy. In order to determine the ionization state of the five most adjacent aspartic acids of the inhibitor, we used the approach described by Lee et al. (1993) and Sham et al. (1997). This was done using the ‘titra_pH’ routine of the POLARIS program package (Lee et al., 1992, 1993). If the probability of a group to be charged at pH 7 was .50%, it was considered to be charged. Our calculations indicated that three aspartic acids, Asp30, Asp32 and Asp77, are ionized. This is in agreement with the probable ionization state of the aspartic dyad (Asp32, Asp215). It is usually assumed that Asp32 is charged and Asp215 is uncharged at pH 3–7 (Bailey and Cooper, 1994). The calculations involved the SCAAS polarization boundary conditions and the local reaction field (LRF) method (Lee and Warshel, 1992) that provides a very efficient way of solving the electrostatic longrange problem. The protein was embedded in a water sphere of radius 20 Å surrounded by a shell with an outer radius of 24 Å containing a Langevin grid and a continuum representation beyond the 24 Å shell. A detailed description of how the 1367 I.Muegge, H.Tao and A.Warshel Table I. Contributions to the overall binding free energy of pepstatin to endothiapepsin Energy contribution (kcal/mol) Free energy terma (l 1 p) 〈∆Vqµ / εin〉 〈 ∆G 〈 ∆G 〈 ∆G 〈 ∆G p solv,w ( ( ( bind elec 〉 l1p solv,w l solv,w 〈∆G hyd〉 ∆G calc bind ∆G exp bind 1 1 2 εw εin 1 1 2 εw εin 1 1 2 εw εin )〉 )〉 )〉 (l9 1 p) 223.42 217.18 2107.21 2104.48 216.85 217.09 2122.79 2119.26 9.01 14.69 225.87 214.0 214.0b aSee Equation 8 and the text for the meaning and evaluation of the different terms. The calculations used an optimum dielectric scaling, εin 5 2.9, that was selected in order to reproduce the experimental binding energy. bThe ‘perfect’ agreement between the calculated and observed ∆G bind is meaningless since it was obtained by adjusting εin (see the text). Using εin 5 4 gave ∆G bind 5 –17.6 kcal/mol. ENZYMIX program treats different regions of the protein is given elsewhere (Lee et al., 1993). The simulations, needed to generate the PDLD/S-LRA results, were performed for the wild-type structure and the 23 ‘mutants’, starting from the state after the initial relaxation and decreased the harmonic potential by using A 5 0.1 kcal/mol.Å2. Otherwise, the MD simulations were performed using the same conditions as described for the initial relaxation. The averaged r.m.s. deviations varied between 0.4 and 0.5 Å. The first step of the calculations involved PDLD/S-LRA evaluation of the absolute free energy of binding of pepstatin to endothiapepsin. This was done by generating four proteincomplex configurations, for both (l9 1 p) and (l 1 p), and averaging the results according to Equation 8. The calculations, which are summarized in Table I, neglected the contributions of ∆G vdw 2T∆S9. This neglect is not so relevant in the present case since we do not focus on the actual value of the absolute binding energy. In fact, resolving the real effect of 2T∆S9 is a major challenge which is being addressed in our laboratory using all-atom simulations, but it is outside the scope of the present work. The contribution of ∆G vdw is positive when it includes the interaction of the solvent with the ligand-free proteins and it usually compensates for some of the negative contribution of ∆G hyd. However, the exact compensation is again outside the scope of the present work and it is simply very easy to change the absolute value of ∆G bind by adjusting εin. Hence all that is done here is to select an εin (εin 5 2.9) that forces the calculated ∆G bind to reproduce the corresponding observed value of 214.0 kcal/mol (Workman and Burkitt, 1979). This arbitrary selection was not meant to represent any sophisticated scientific procedure of obtaining improved results but merely a way of forcing the overall effect of all the protein residues to reproduce the observed binding free energy (this does not mean, of course, that the sum of the group contributions will reproduce the observed binding energy of the native enzyme). Using the standard value of εin 5 4 gave ∆G bind 5 –17.6 kcal/mol. In the next step, we turn to the evaluation of the group 1368 contributions to the total inhibitor binding energy. Figure 2 shows the non-relaxed group contributions of every single residue of endothiapepsin to pepstatin binding free energy. All protein residues that are located in proximity to the inhibitor (i.e. at least one non-hydrogen atom closer than 3 Å to a nonhydrogen atom of the inhibitor) are found to have significant electrostatic contributions to the inhibitor binding. There are three patches of groups that appear to be important in determining the electrostatic fingerprint. The first group comprises residues Asp12, Asp30, Asp32 and Gly34 and stretches along the convex inhibitor side from Iva227 to Sta330. The second group of residues, Ser74, Tyr75, Gly76, Asp77 and Gly78, fits nicely into the groove-like concave side of the inhibitor between Sta330 and Sta332. The third group, comprising residues Asp215, Gly217, Thr218 and Thr219, stretches almost parallel to the first group along the same residues of the inhibitor. The three regions found to be important using the non-relaxed approach coincide with the functionally important motifs in the binding site. The first and third regions provide a symmetrical hydrogen bond network in the active site. They involve similar sequences (Asp32, Thr33, Gly34, Ser35 and Asp215, Thr216, Gly217, Thr218) and contain the aspartic dyad Asp32 and Asp215. The second region (residues 74–83) forms the so-called flap region— a highly flexible loop that controls binding and release of the substrate. This flap region binds to the center part of the inhibitor and covers the catalytic site. Figure 3 illustrates the location of the inhibitor relative to the above-mentioned residues. Fifteen residues are likely to be particularly important for pepstatin binding as they show significant electrostatic peaks in the electrostatic fingerprint of Figure 2. Twelve of these residues that are located close to the inhibitor (heavy atom distance ,3.75 Å) were chosen as our benchmark and were mutated to glycine. That is, the side chain of the given residue was replaced by a hydrogen atom and the main chain conformation remained unchanged. However, in the case of glycine residues in the wildtype sequence we created an artificial ‘mutant’ by annihilating all residual charges of the glycine main chain, thereby forming an uncharged glycine counterpart. The special procedure of mutating Gly to its uncharged form allows one to evaluate the effect of main-chain dipoles. Although such effects cannot be verified by current experimental approaches, they clearly play a major role in binding and catalysis (e.g. Muegge et al., 1996). The mutations of the other residues to their uncharged form is also of interest (e.g. Muegge et al., 1996), but in this work we focused on actual mutations to Gly since this allows one to consider actual steric effects in the relaxed model. The generated mutant structures underwent an MD-relaxation process responding to the change in structure and charge upon mutation. This allows us to examine the explicit effect of the relaxation of the protein matrix and the water solvent on the contribution of the mutated residue to the binding of the inhibitor. The corresponding PDLD/S results, obtained with the relaxed (s 1 p relaxed) and non-relaxed approximations for the residues whose closest distance to the inhibitor, is ,3.75 Å, are summarized in Table II. These residues represent the first ‘solvation shell’ of the ligand and are expected to have the largest contributions to the binding free energy. As can be seen from Table II, for 8 out of 12 residues the s 1 p-relaxed group contributions can be correlated to the non-relaxed group contributions. When we allowed only the solvent to relax and hold the protein atoms fixed upon mutation (s-relaxed), we could reconcile the relaxed (s-relaxed) and non-relaxed group contributions of 10 out of 12 residues. The most negative contribution is due to Electrostatic group contributions to the free energy of protein–inhibitor binding Fig. 2. Non-relaxed group contributions to pepstatin binding energy for every residue of endothiapepsin. The non-relaxed group contributions generate the electrostatic ‘fingerprint’ of the protein-inhibitor interaction. εin 5 4 and εin 5 εeff 5 20 were chosen for uncharged and charged residues, respectively. Fig. 3. Crystal structure of pepstatin bound to endothiapepsin. The pepstatin is shown together with the residues that exhibit the largest group contributions in the electrostatic fingerprint of Figure 2. Gly76 (see Figure 2) and the same trend is obtained using the relaxed approach. The same holds for the largest positive contribution, which is due to Gly217. The effect calculated with the non-relaxed approach could not reproduce the corresponding relaxed results for four residues, Asp30, Gly34, Asp77 and Thr219. The nearest contact between Asp30 and the inhibitor is a methyl group of Sta330 at 3.5 Å. This residue neither forms a hydrogen bond to pepstatin nor is it in other close electrostatically important contact. Even though this residue is charged, its non-relaxed group contribution is not very high. The non-relaxed group contribution of Asp77 is relatively small considering that it is a charged residue. Asp77 forms only a weak hydrogen bond with its backbone to Val229. In the case of Gly34 the hydrogen bond between the Gly34 oxygen and the backbone nitrogen atom of Ala331 was weakened owing to the uncharging of Gly34, and this forced Asp33 to move closer to Ala331. This structural change led to an increase in electrostatic interaction between the charged side chain of Asp33 and Ala331 and gave rise to an even slightly increased inhibitor binding energy in the mutant relative to the wild-type. This effect is opposite to that found in the non-relaxed calculations that produced a negative contribution of Gly34. Although, in the case of the Thr219→Gly mutation, Vqµ decreased by almost 2 kcal/mol, this effect was more than compensated for by an increased solvation since this residue is exposed to the water solvent. Furthermore, the entire side chain of Thr219 has contact to Iva227 and Val228 and its truncation has an overall effect on the conformation of the binding pocket and even the distant stretch of residues 74–78 now binds more strongly than in the wild-type. An interesting point that emerges from Table II is the correlation between the performance of the non-relaxed approach and the location of the given residue. It appears that the nonrelaxed approach correlates well with the solvent relaxed (srelaxed) results (obtained using Equation 3 without allowing for protein reorganization) as long as the given residue is not fully exposed to the solvent. Interestingly, for solvent-exposed residues we found that the non-relaxed approach correlates better with the results obtained by considering the relaxation of both the 1369 I.Muegge, H.Tao and A.Warshel Table II. Group contributions to the binding free energy of pepstatin to endothiapepsin from residues witin the first ‘solvation shell’ of the inhibitora Groupa Original structure Average over 4 MD runs Non-relaxed 20.47 20.89 21.95 21.43 22.95 0.17 22.28 20.39 21.32 0.18 20.54 21.60 Asp12 Asp30 Asp32 Gly34 Ser74 Tyr75 Gly76 Asp77 Asp215 Gly217 Thr218 Thr219 20.26 20.42 21.62 21.20 20.24 21.01 22.05 20.15 20.36 0.44 20.71 21.05 Average over 4 MD runs s1p relaxedb s relaxedc SASAd Locatione 20.26 (1) 0.79 (2) 21.55 (1) 0.14 (2) 21.16 (1) 21.33 (1) 23.95 (1) 0.42 (2) 23.19 (1) 2.29 (1) 21.66 (1) 1.19 (2) 20.06 2.32 21.92 21.08 20.04 20.38 20.54 20.12 20.12 1.05 20.03 20.67 20.25 20.15 20.10 20.30 20.60 20.20 20.40 20.35 20.20 20.70 20.45 20.65 s b/s b b/s s s s s b b/s b/s b/s (1) (2) (1) (1) (2) (1) (1) (1) (1) (1) (1) (1) includes all endothiapepsin residues that have at least one heavy atom with a distance of ,3.75 Å to a heavy atom of pepstatin. the results of the relaxed approach according to Equations 8 and 9. s1p indicates that solvent and protein are subject to structural relaxation upon mutation (the solvent relaxation is automatically obtained by using the ∆G sol,w in Equation 9). Energies are given in kcal/mol. The table considers residues that were found to have large electrostatic contributions in the non-relaxed approach of Figure 2. Asp30, Asp32 and Asp77 were charged. For the relaxed approach all non-glycine residues were mutated to glycine replacing the side chain group by a hydrogen atom. In contrast, glycine residues were mutated to their uncharged counterpart by annihilating the residual charges of the main chain atoms; however, both internal interactions and van der Waals interactions remained activated during the MD-relaxation process of the mutant formed. The corresponding contributions were evaluated from Equation 2 using εin 5 2.9. The non-relaxed contributions were evaluated using εin 5 4 and εin 5 εeff 5 20 for uncharged and charged residues, respectively. (1) and (–) designate agreement and disagreement, respectively, between the given relaxed and non-relaxed results. cOnly the solvent is allowed to relax. These results are obtained by using the relaxed approach and Equations 8 and 9 without taking the protein relaxation upon mutation into account. (The protein atoms are held fixed upon mutation.) dGroup contributions were calculated using the change in the solvent-accessible surface area upon ligand binding of each residue considered. Data taken from Figure 7 of Gomez and Freire (1995). eGroup located at the surface of the protein (s), buried in the protein matrix (b) or partly buried and partly exposed to the solvent (b/s). aThis table bThese are Table III. Group contributions to the binding energy of pepstatin to endothiapepsin from residues of the second ‘solvation shell’ of the inhibitora Group Original structure Average over 4 MD runs Non-relaxed Ala13 Ser35 Gly78 Leu128 Gln187 Ile213 Leu220 Tyr222 Phe275 Phe284 Ile297 0.00 20.43 20.22 0.08 0.37 0.01 20.12 20.10 20.08 0.00 0.10 0.00 0.18 20.26 20.09 0.32 0.02 20.07 20.03 20.04 0.00 0.01 Average over 4 MD runs s1p relaxeda SASAb 22.23 21.70 20.72 23.64 23.51 1.37 20.65 0.74 20.40 22.83 21.73 20.15 0.03 2 20.25 2 20.10 20.10 20.25 20.10 20.10 20.20 aResidues whose closest heavy atom distance to pepstatin is between 3.75 and 6 Å. Energies are given in kcal/mol. The non-relaxed contributions were evaluated using εin 5 4. The relaxed group contributions were calculated using εin 5 2.9. Asp30, Asp32 and Asp77 were charged. bGroup contributions were calculated using the change in the solventaccessible surface area upon inhibitor binding of each residue considered. Data taken from Figure 7 of Gomez and Freire (1995). protein and the solvent (s 1 p) than with results obtained by considering only solvent relaxation (s). In addition to examining the effect of residues with large electrostatic contribution, it is important to explore the effect of other residues that are in close contact with the substrate. Such an examination is summarized in Table III, which considers the effect of mutated residues whose closest distance to the inhibitor is between 3.75 and 6 Å. Table III indicates that residues which lie in the second ‘solvation shell’ of the inhibitor may have 1370 significant relaxed contributions even if the corresponding nonrelaxed contributions are very small. All of these relaxed contributions reflect ‘second-order’ conformational effects, where the mutation of the given residue changes the positions and orientations of residues that interact directly with the ligand. Table III further indicates that Gly78, Gln187 and Ser35 have a significant non-relaxed contribution to inhibitor binding. The closest atom of Gln187 is located .5 Å away from the inhibitor, yet this residue gives a positive contribution to the binding energy (decrease of the binding affinity). Apparently, the Oε1 atom of this residue points directly towards the backbone oxygen of Ala331. This electrostatic repulsion is responsible for the positive contribution to binding found in Figure 2. For a neutral ligand, only the interaction of the nearest protein residues is electrostatically determined. That is, only for these residues a qualitative agreement between the non-relaxed group contributions and the relaxed group contributions could be achieved (Table II). However, for charged ligands this agreement can be reached even for residues that are located relatively far from the ligand, as shown for the binding of highly charged nucleotides in p21ras (Muegge et al., 1996). The non-relaxed and relaxed group contributions to the binding energy of pepstatin to endothiapepsin can be compared with calculations of these contributions using an entirely different approach. Gomez and Freire (1995) calculated the change of solvent-accessible surface area (SASA) of endothiapepsin and pepstatin upon complex formation and the enthalpic change is then expressed by the sum over differently weighted SASA for polar and non-polar atoms and a term for the effective enthalpy of ionization. The results of Gomez and Freire [Figure 7 in Gomez and Freire (1995)] are given in Tables I and II and can be compared with our results. The SASA is an empirical approach that involves several conceptual problems that can Electrostatic group contributions to the free energy of protein–inhibitor binding sometimes be absorbed into the parameters used. The main problem is associated with the assumption that the enthalpic contribution can be obtained by scaling the contributions of the interacting atoms by the corresponding SASA. Unfortunately, electrostatic interactions in protein interiors are not determined by surface area but by local polarity (Warshel et al., 1984). For example, if we consider a small polar ligand or in an extreme case an ionizable group whose energy can be expressed in terms of the energy of moving such a group from water to the protein site (Warshel, 1981a; Yang et al., 1993), we find that this energy is determined by the local polarity and cannot be scaled by surface area considerations; otherwise it would have been possible to calculate pKas of internal groups using surface area considerations [see discussion in Warshel et al. (1984)]. The contributions are classified only according to polar or non-polar atom types. This means that very different hydrogen bonds such as SH and OH will give the same contribution and the same will be true for changes in the ligand that involves a replacement of one polar group by another, in contrast to experimentally observed differences in binding between different ligands [e.g. see the series in Warshel et al. (1994)]. Finally, the SASA approach scales the individual contributions so that they will reproduce the total binding energy but the sum of the effect of all mutations cannot (and should not) sum up to the observed binding energy. Nevertheless, despite these and other conceptual problems one would expect approaches that use effective potentials to be useful. This will be particularly true with models that use effective group contributions such as the approach used in folding studies (e.g. Levitt and Warshel, 1975). Such an approach might sometimes be more accurate than fully microscopic approaches as long as the microscopic approaches involve major convergence problems. At any rate, the present study does not try to assess the reliability of different approaches since this will require comparison with mutation experiments (which are outside the scope of this work) but rather an examination of the ability of the non-relaxed approach to reproduce the relaxed results. Examination of Table II indicates that the larger contributions obtained by the relaxed approach are also found by the nonrelaxed and SASA approaches. Residues in Table III contribute significantly less in the non-relaxed and the SASA approaches. However, there are some major differences in the results of both methods. Although the SASA method is in principle capable of identifying positive contributions to the binding energy, only negative contributions were found. Especially Gly217, which provides a positive contribution to the inhibitor binding in both the relaxed and non-relaxed approaches, was found to provide the most negative contribution in the SASA approach. While it is encouraging that the SASA and the non-relaxed method provide results of such a significant correlation for most of residues considered, it is interesting to find out what caused the misfit for Gly217. This question will be left, however, to subsequent studies that consider actual mutation experiments. Conclusion This work examined the performance of the electrostatic non-relaxed approach in detecting residues that contribute significantly to the binding free energy of a neutral ligand. The binding of the neutral inhibitor pepstatin to endothiapepsin was taken as a generic benchmark. A time-consuming mutational analysis was performed for 23 residues within the first and second ‘solvation shell’ of the ligand. A correlation between the non-relaxed and solvent relaxed contributions was found for 10 out of 12 residues of the first ‘solvation shell’ (8 out of 12 correlated for the non-relaxed and the protein 1 solvent relaxed approach). Hence it appears that looking for the peaks in diagrams of the type presented in Figure 2 should help in identifying the most important residues for inhibitor binding. However, the non-relaxed contributions of residues that show no significant peak and are not in contact with the inhibitor might not be so informative, since mutations of such residues may still lead to global conformational changes rather than to a direct electrostatic effect. That is, the non-relaxed approach cannot reproduce the results of the relaxed approach for neutral residues which are in the second ‘solvation shell’ around the inhibitor. The significance of this finding is not completely clear, as it is almost certain that these calculated relaxed contributions overestimate the actual effects (this point will be verified in the future using observed mutation effects). Nevertheless, it is clear that neutral residues in the second ‘solvation shell’ can contribute by a second-order conformational effect (by pushing the first ‘solvation shell’ residues) and that this indirect effect cannot be reproduced by the non-relaxed approach. Our previous studies (Muegge et al., 1996) already demonstrated that the non-relaxed electrostatic group contributions are useful for charged ligands. The present work demonstrated that even for a neutral ligand this approach is useful although second ‘solvation shell’ residues with no electrostatic contribution that contribute to binding cannot be screened effectively by the current version of our approach. However, even here one can design non-relaxed ways of estimating the coupling between the second and first ‘solvation shells’ and thus for estimating the effect of residues of the second ‘solvation shell’. Hence we conclude that the use of the non-relaxed electrostatic fingerprint can be useful for both charged and uncharged ligands. Obviously, we must keep in mind that finding a small electrostatic contribution cannot be used to exclude the possibility that a mutation of the corresponding residue will change the binding energy in a significant way. As should be obvious to the reader, we have not addressed in this paper the relationship between the calculations and experimental mutation studies. Not only are we not aware of such results for the complex studied but, perhaps more importantly, the paper addresses the theoretical yet practical question of how justified it is to use part of the relaxed model in reproducing the results of the ‘complete’ model. Since we find that the nonrelaxed approximation is valid for more than 70% of the residues in the first ‘solvation shell’ (Table II), we believe that it is justified to take advantage of this approach in fast screening studies (see below). One of the main reasons of our interest in the non-relaxed approach is the need for very rapid computer-aided screening approaches for fighting the drug-resistance problem. Here one has to anticipate the effect of mutations that would reduce the binding energy of a given drug while retaining catalytic activity against the native substrate (Gulnik et al., 1995). The first step in the development of an efficient method is our observation that it is sufficient to have the information of mutating all residues to Gly without the need to know the effect of each residue to any other possible residue. That is, the effect of the A→B mutation can be determined by the thermodynamic cycle A→Gly, Gly→B. This idea (which is one of the reasons for our definition of group contributions) reduces the order of the computations needed that involves a single mutation from N3L to N (where N is the number of residues in a given protein and L is the number of naturally occurring amino acids). The second 1371 I.Muegge, H.Tao and A.Warshel problem is, of course, the time needed to perform each computeraided mutation calculation. Here the non-relaxed approach takes about 2 s of CPU time (on an SGI R8000 processor) for 100 amino acids whereas the fully relaxed approach takes about 50 days and a properly converging LRA study would probably take a year for 100 amino acids. Of course, in principle the relaxed approach is much more reliable but in many cases one can obtain the correct trend by the non-relaxed approach. Furthermore, some of the effects obtained here for the relaxed effect of non-polar residues may reflect instability in the MD relaxation procedure. Acknowledgements This work was supported in part by ONR grant N00014-93-1-0967 and TobaccoRelated Disease Research Program grant 4RT-0002. I.M. gratefully acknowledges support by a fellowship of the Deutsche Forschungsgemeinschaft (DFG). References Åqvist,J. and Hansson,T. (1996) J. Phys. Chem., 100, 9512–9521. Åqvist,J. Luecke,H., Quiocho,F.A. and Warshel,A. (1991) Proc. Natl Acad. Sci. USA, 88, 2026–2030. Åqvist,J., Medina,C. and Samuelsson,J.E. (1994) Protein Engng, 7, 385–391. Bailey,D. and Cooper,J. (1994) Protein Sci., 3, 2129–2143. Bohm,H.-J. and Klebe,G. (1996) Angew. Chem., Int. Ed. Engl., 35, 2589–2614. Boresch,S. and Karplus,M. (1995) J. Mol. Biol., 254, 801–807. Cooper,J.B., Foundling,S.I., Blundell,T.L., Boger,J., Jupp,R.A and Kay,J. (1989) Biochemistry, 28, 8596–8603. Foundling,S.I. et al. (1987) Nature, 327, 349–352. Froloff,N., Windemuth,A. and Honig,B. (1997) Protein Sci., 6, 1293–1301. Gomez,J. and Freire,E. (1995) J. Mol. Biol., 252, 337–350. Gulnik,S.V., Suvorov,L.I., Liu,B.S., Yu,B., Anderson,B., Mitsuya,H. and Erickson,J.W. (1995) Biochemistry, 34, 9282–9287. James,M.N., Sielecki,A., Salituro,F., Rich,D.H. and Hofmann,T. (1982) Proc. Natl Acad. Sci. USA, 79, 6137–6141. King,G., Lee,F.S. and Warshel,A. (1991) J. Chem. Phys., 95, 4366–4377. Kollman,P. (1993) Chem. Rev., 7, 2395–2417. Lee,F.S. and Warshel,A. (1992) J. Chem. Phys., 97, 3100–3107. Lee,F.S., Chu,Z.-T., Bolger,M.B. and Warshel,A. (1992) Protein Engng, 5, 215–228. Lee,F.S., Chu,Z.-T. and Warshel,A. (1993) J. Comput. Chem., 14, 161–185. Levitt,M. and Warshel,A. (1975) Nature, 253, 694. Madura,J.D., Nakajima,Y., Hamilton,R.M., Wierzbicki,A. and Warshel,A. (1996) Struct. Chem., 7, 131–181. Muegge,I., Schweins,T., Langen,R. and Warshel,A. (1996) Structure, 4, 475–789. Muegge,I., Qi,X.P., Wand,A.J., Chu,Z.T. and Warshel,A. (1997a) J. Phys. Chem. B, 101, 825–836. Muegge,I., Schweins,T. and Warshel,A. (1998) Proteins: Struct. Funct., Genet., in press. Rao,C.M., Izaguirre,C.E. and Dunn,B.M. (1993a) Protein Engng, 6, 48. Rao,C.M., Scarborough,P.E., Kay,J., Batley,B., Rapundalo,S., Klutchko,S., Taylor,M.D., Lunney,E.A. and Humblet,C.C. (1993b) J. Med. Chem., 36, 2614–2620. Rich,D.H. (1985) J. Med. Chem., 28, 263–273. Sali,A., Veerapandian,B., Cooper,J.B., Foundling,S.I., Hoover,D. and Blundell, T.L. (1989) EMBO J., 8, 2179–2188. Sham,Y.Y., Chu,C.T. and Warshel,A. (1997) J. Phys. Chem. B, 101, 4458. van Gunsteren,W.F. and Mark,A.E. (1992) Eur. J. Biochem., 204, 947–961. Warshel,A. (1981a) Biochemistry, 20, 3167. Warshel,A. (1981b) Acc. Chem. Res., 14, 284. Warshel,A. and Åqvist,J. (1991) Annu. Rev. Biophys. Biophys. Chem., 20, 267–298. Warshel,A. and Russell,S.T. (1984) Q. Rev. Biophys., 17, 283–422. Warshel,A., Russell,S.T. and Churg,A.K. (1984) Proc. Natl Acad. Sci. USA, 81, 4785–4789. Warshel,A., Sussman,F. and King,G. (1986) Biochemistry, 25, 8368–8372. Warshel,A., Sussman,F. and Hwang,J.-K. (1988) J. Mol. Biol., 201, 139–159. Warshel,A., Tao,H., Fothergill,M. and Chu,Z.-T. (1994) Isr. J. Chem., 34, 253–256. Workman,R.J. and Burkitt,D.W. (1979) Arch. Biochem. Biophys., 194, 157–164. Yang,A.S., Gunner,M.R., Sampogna,R. and Sharp,K. (1993) Proteins: Struct. Funct. Genet., 15, 252–265. Received June 26, 1997; revised August 25, 1997; accepted September 18, 1997 1372
© Copyright 2026 Paperzz