Biol Philos DOI 10.1007/s10539-011-9252-8 Structure, function, and protein taxonomy William Goodwin Received: 21 May 2010 / Accepted: 1 February 2011 Springer Science+Business Media B.V. 2011 Abstract This paper considers two recent arguments that structure should not be regarded as the fundamental individuating property of proteins. By clarifying both what it might mean for certain properties to play a fundamental role in a classification scheme and the extent to which structure plays such a role in protein classification, I argue that both arguments are unsound. Because of its robustness, its importance in laboratory practice, and its explanatory centrality, primary structure should be regarded as the fundamental distinguishing characteristic of protein taxonomy. Keywords Proteins Structure Function Classification Natural kinds Introduction Philosophers have recently begun to investigate our biochemical classification scheme(s) with the goal of revealing how, and on what basis, biochemicals are individuated, or sorted into kinds. The hope is that some of the distinctive features of biochemical practice will shed light on a couple of related, traditional philosophical issues. The first is Monism vs. Pluralism, which, in the biochemical case, comes down to whether there is, or should be, one privileged way-rather than a variety of equally legitimate ways-to sort the biochemical domain into kinds. The second issue is essentialism, which hopes to discern the extent to which our classificatory practices reflect ‘natural’, as opposed to merely pragmatic, distinctions. Perched as it is between biology and chemistry, biochemical classification inherits elements of the both the structural classification prevalent in chemistry and the teleological classification prevalent in parts of biology. Furthermore, whereas W. Goodwin (&) Department of Philosophy and Religion, Rowan University, Glassboro, NJ, USA e-mail: [email protected] 123 W. Goodwin the structural classifications of chemistry have often been regarded as paradigmatic cases of essentialism and monism (not without controversy), biological taxonomy has been suggested as domain which is, or should be, pluralistic and anti-essentialist (also not without controversy). Because biological macromolecules are thought about in both structural and functional terms, biochemical classification provides a rich context in which to consider both whether, and how, features of a scientific practice might be brought to bear on these traditional issues in particular cases. In their work on biochemical classification, both Slater (2009) and Tobin (2010) focus on proteins. This is a good choice because not only are proteins the workhorses of the cell, but their basic structure is well-understood. Furthermore, one of the enduring goals of biochemistry1 has been to explain the function of proteins in terms of their structure. As a result, biochemists have not only developed schemes for classifying proteins in both structural and functional terms, but they have also extensively investigated how these characteristics of proteins are related to one another. In fact, both Tobin and Slater appeal to recent developments in the relationship between protein structure and function—more specifically moonlighting proteins (single polypeptides with multiple functions), intrinsically unstructured proteins, and enzymes with multiple structural variants—to undermine the fundamental individuating role of structure in protein classification. That is, both of them hold that these recent biochemical developments help establish that there is no one privileged way to sort proteins into kinds or types on the basis of their structure. I think they are wrong. Though there might be more general metaphysical or semantic reasons to resist monism and/or essentialism, these developments within the practice do not undermine the fundamental role of structure in protein classification, but instead suggest a more nuanced understanding of this role. In the remainder of this paper, I will consider the arguments against the fundamental role of structure put forward by Slater and Tobin, explain why I think these arguments are not compelling, and then try to establish that (and in what senses) there is a privileged way of sorting proteins into types or kinds and that is according to their primary structure. Though Slater and Tobin reach the same conclusion about the plausibility of a fundamental role for structure in protein classification, they get to this conclusion by different routes. Slater, I contend, inappropriately extrapolates arguments for pluralism about the species concept into the biochemical domain. By considering disanalogies between these domains, I hope to bring out a way of thinking about the relationship between the structural and functional classifications of proteins that Slater neglects. Tobin, on the other hand, mischaracterizes the sense in which structure is fundamental to chemical classification. By clarifying the role of structure in the classification of organic compounds, I hope to provide content to the idea that certain features play a ‘fundamental’ role in a classification scheme. I then argue, using this newly articulated role, that primary structure plays a fundamental role in protein classification. Lastly, I consider the phenomena of moonlighting proteins and 1 Throughout this paper, for simplicity of expression, I will used the terms ‘biochemistry’ and ‘biochemist’ to refer to all of those disciplines, and their corresponding scientists, that investigate the structure and function of proteins. 123 Structure, function, and protein taxonomy intrinsically unstructured proteins and argue that rather that undermining the fundamental role of structure, they in fact support it. Slater’s pluralism In his ‘‘Macromolecular Pluralism’’ Matthew Slater concludes that ‘‘we should be pluralists about macromolecular classification’’ (Slater 2009, p. 851). To reach this conclusion, he considers and then rules out the most obvious candidate versions of monism.2 Specifically, he considers the prospects for both a functional monism— where macromolecules are individuated on the basis of their functional roles—as well as structural monism, where macromolecules are individuated on the basis of their structure. The prospects for a functional monism are not good, according to Slater, because such a monism would seem to presuppose the capacity to individuate functions. Structural monism, on the other hand, is implausible because none of the potential candidate notions of structure result in an individuation of proteins that ‘‘accommodates biological practice’’ (Slater 2009, p. 852). The idea here seems to be that structure results in an individuation of proteins that is either too coarse (in the case of primary structure and secondary structure) or too fine (in the case of tertiary structure) to match the ways that biochemists sort proteins. To support his claim that there is a discrepancy between the level of resolution supported by the various structural monisms and the biological practice, Slater describes some recently uncovered biological phenomena—such as moonlighting proteins and some enzymes that come in multiple structural variants—which show that there is not always a one-to-one correlation between the structure and function of proteins. I will examine the case of moonlighting proteins in more detail later, but for now I will just grant that sometimes biologists classify macromolecules by function in ways that are coarser or finer than standard structural classifications. Certainly, sometimes biologists talk in ways that reflect these courser or finer distinctions. But, why should this lead to the conclusion that there is no one fundamental way of individuating proteins on the basis of their structure? An obvious alternative would be to hold that in such situations, biologists should be understood to be refining, or introducing genera into, an existing structural classificatory scheme. Though their surface discourse may be in terms of these additional classifications, all of this discourse might be smoothly reconcilable with more fundamental distinctions in the background.3 Scientists do things like this all 2 Though most of Slater’s paper is explicitly about whether we should be monists or pluralists about protein classification, it is also clear that issues of essentialism are central to his thinking. He dismisses the significance of biologists’ talk about protein structure because this talk does not occur ‘‘in the context of describing a protein’s essence’’ (Slater 2009, p. 854); further he challenges the monist to explain why a certain description characterizes ‘‘what it is to be a certain kind of protein’’ (Slater 2009, p. 855). 3 As suggested by a referee, biologists may well characterize two structurally distinct polypeptides as both being instances of ‘‘hemoglobin,’’ in which case these distinct polypeptides are treated, in some sense, as the same protein. Surface discourse of this type could be regarded as smoothly reconcilable with more fundamental distinctions in the background by simply thinking of ‘‘hemoglobin’’ as the name for a class of proteins, the membership conditions for which include both structural and functional elements (which might vary according to investigative context). This may not be how biologists actually talk, but 123 W. Goodwin the time; surface discourse and such modifications of our classificatory schemes need not lead to pluralism—there can still be good reasons for understanding a particular, prior distinction as fundamental. Evidently, according to Slater, this way of understanding biochemists’ surface discourse and classificatory practices is not appropriate because the individuation of proteins on the basis of their structure, is ‘‘costly and ad hoc’’ and does not best accommodate scientific practice (Slater 2009, p. 852). In order to understand what this means, and why it leads to pluralism, it is important to recognize that Slater is extrapolating the sorts of arguments used to motivate pluralism about ‘‘species’’ to the macromolecular case. Pluralism about ‘‘species’’, of the sort that Slater cites, is based on the observation that there are multiple competing ways of dividing organisms up into species. To say that these distinct uses of the concept of ‘‘species’’ are competing does not mean simply that different biologists use this concept to express different ideas in different contexts. Instead, these diverse species concepts are associated with a ‘‘multiplicity of legitimate explanatory demands in biology’’ (Stanford 1995, p. 86). In service of these diverse demands, not only might these different notions of species group the same collection of organisms differently, they can also differ in the domain of organisms to which they apply (for instance, criteria based on the possibility of interbreeding don’t work well for asexual organisms). As a result, the diverse species concepts are in tension and it would, ‘‘hobble significant investigations in biology’’ (Stanford 1995, p. 77) to insist upon privileging one of these notions of species above the others. In such circumstances, pluralism seems like a sensible position because it avoids impeding the scientific practice on philosophical grounds. Correspondingly, I think that Slater should be understood to hold that the situation is similar in the case of macromolecules like proteins. That is, he holds that there are various ways of individuating proteins in biochemistry that are in tension with one another in the sense that to privilege one or the other of these ways of individuating proteins would hamper progress in biochemistry (that is why structural monism would be ‘‘costly’’). The reason, therefore, that the mismatch been structural and functional individuations of proteins cannot be dealt with by simple modifications of our existent classificatory schemes (the alternative suggested above) is that this would hinder the explanatory and/or investigative work of biochemists. This is what gives force to the recommendation of pluralism; for if there were no cost to the scientific practice in insisting on monism, then this alternative might legitimately be preferred, on the grounds—perhaps-of conceptual hygiene or explanatory unity. I hope to have made it plausible that Slater’s argument for macromolecular pluralism depends on there being tensions between the various ways that biochemists sort proteins according to structure and function. It is not enough to merely report that in some circumstances the surface discourse of biochemists classifies biomolecules in such a way that there is no one-to-one correspondence Footnote 3 continued they might, without any obvious ‘‘cost’’ to the practice. Indeed, if it ever became important to explain why not all hemoglobin behaves the same (why there are several distinct spots on your gel, for instance) it would be natural to refine the use of ‘‘hemoglobin’’ in this way. 123 Structure, function, and protein taxonomy between structural and functional classification.4 If this is right, then one way to undermine Slater’s pluralism would be to argue that protein classification is significantly different from biological taxonomy (as it is characterized in the argument for ‘‘species’’ pluralism, anyway) not insofar as there are multiple different ways of sorting the objects of the domain, but in that there is no tension between these various classifications—they can be smoothly integrated into a monistic framework without cost to the practice. Indeed, as I will suggest in more detail later in the paper, I think that this is the right way to understand the relationship between structure and function in protein classification. While there is a fundamental, structural way of individuating proteins, there are also supplemental classifications introduced to address various biological interests. The additional classifications are not in tension with the structural classification; instead, they depend upon it in important ways. Tobin and structuralism in chemistry In her ‘‘Microstucturalism and Macromolecules’’ (Tobin 2010), Emma Tobin argues that though microstructuralism (which is the thesis, expressible in a variety of specific forms, that structure is the fundamental individuating characteristic of a chemical kind) is plausible in the case of chemical compounds and some macromolecules, it is not plausible to hold that proteins are fundamentally individuated by their microstructures (which is roughly equivalent to their molecular structure). She reaches this conclusion by considering the phenomenon of moonlighting by intrinsically unstructured proteins (which I will describe in more detail later). Before reaching this conclusion, though, Tobin argues that in cases when the classificatory kinds used by a scientific practice are ‘based on’ molecular structure in some appropriate way, it is still plausible to maintain microstructuralism. However this recently revealed biological phenomena establishes that the functions of at least some proteins are not appropriately ‘based on’ their structures; thus ‘‘what makes a protein the kind of chemical kind that it fundamentally is’’ (Tobin 2010, p. 53) cannot simply be its molecular, or primary, structure. Tobin recognizes that scientists often classify the objects of their domains in multiple and overlapping ways, but that this does not, in and of itself, suggest pluralism—one of these ways may well be fundamental. She argues that molecular structure plays such a fundamental role in our classification of chemical compounds and some macromolecules. In the process, the fundamental role of molecular structure is defended against two sorts of challenges. These challenges are based on the observation that our structural classification scheme is, for certain scientific purposes, not refined enough, while for other purposes it is too specific. Chemists 4 If this were enough, then by parity of reasoning the surface discourse of chemists (which Slater seems to accept as a scientific practice which can be interpreted monistically) would also be sufficient to recommend pluralism about chemical classification. As I will discuss in more detail later on, chemists frequently individuate chemicals in ways that are both finer and courser than their chemical structure; however, there can be no doubt (at least from the point of view of the practice) that chemical structure is the fundamental way of individuating chemical species. 123 W. Goodwin deal with these problems by introducing either refinements or genera into their classificatory schemes, without, Tobin plausibly argues, undermining the sense in which chemical structure is the fundamental way of individuating chemical kinds. More specifically, many macroscopic or higher-level properties on the basis of which we sort chemicals are not inherent in the chemical structure itself, but instead only manifest themselves under certain conditions, or in particular contexts. Still, this context specific classification is compatible with microstructuralism, so long as it is in principle possible to determine ‘‘the macrostructure of a sample based on the specification of the microstructure’’ (Tobin 2010, p. 50). Likewise, some useful chemical classifications do not distinguish between many of the individual chemical structures that fall under them. Nonetheless, so long as the ‘‘higher level properties’’ are ‘‘a direct consequence of the microstructure’’ (Tobin 2010, p. 49), it is still plausible that ‘‘membership of that [higher order] kind is conferred by microstructural properties’’ (Hendry 2006, p. 865) and so that molecular structure maintains its fundamental role. While I think that Tobin is right to insist upon the fundamental role of molecular structure in these cases, and so to resist pluralism, I think that she incorrectly characterizes the senses in which structure is fundamental in chemical classification. As a consequence, I will argue, she misrepresents the significance of moonlighting by intrinsically unstructured proteins. To clarify the sense in which molecular structure is fundamental to chemical classification, I will now briefly describe our chemical classification scheme as well as some of the ways that it has been modified in order to accommodate chemical practice. One of the great triumphs of early organic chemistry was the realization that atomic composition, chemical connectivity through bonds, and stereochemistry5 (collectively referred to as molecular structure, for short) all had to be kept track of in order to individuate chemical kinds in such a way that any two samples of the same kind behaved the same way (for the most part) in laboratory situations. Furthermore, it was the explicit goal of the structural chemists not only to individuate chemical kinds in terms of molecular structure, but also to explain the chemical and physical behavior of the resultant kinds in structural terms.6 Molecular structure is not, however, the only way that organic chemists classify the objects of their study. Indeed, as the field has progressed, organic chemists have found it useful to superimpose other sorts of classifications on this fundamental, structural scheme. For instance, organic molecules are also classified by functional group because this is really useful in understanding and projecting the chemical reactions in which they might participate. Many organic molecules contain more than one functional group, and so they might be classified into different groups for different purposes or in different situations. Additionally, in order to explain the chemical and physical behavior of some organic molecules, it is important to distinguish the 5 Stereochemistry refers to the relative three-dimensional arrangement of atoms in space. For example, though the left and right hands have the same ‘‘connectivity’’, they differ in their relative threedimensional arrangement. Similarly, though any two carbon atoms bonded to the same four distinct groups will have the same ‘chemical connectivity’, there are two distinct ways of arranging these groups around the carbon atom; these molecules differ in their stereochemistry. 6 See Goodwin (2010) for a characterization of the goals and development of structural organic chemistry. 123 Structure, function, and protein taxonomy conformations—the particular, rather than just the relative, three-dimensional arrangements of atoms in space-that are possible for them. Each molecular structure, thus, corresponds to a continuum of distinct conformations. So, whereas functional groups are overlapping genera, conformations are potentially infinite refinements of the structural classification scheme. In the face of both sorts of modification to our chemical classification schemes, though, molecular structure maintains its fundamental role. Organic molecules are classified into functional groups on the basis of their molecular structure. For example, in order for a particular compound to be an alcohol, it must contain an –OH group. However, the fact that a compound contains such a group does not indicate that it should always be thought of as an alcohol. Containing a particular structural subunit is a necessary, but insufficient, for the functional group classification of a compound. In addition, such a classification requires knowing something about both the molecular context of the structural subunit, as well as the chemical context in which the functional classification is intended to serve. Considerations about how chemical structure determines or influences chemical behavior would then decide whether a particular functional group classification is appropriate. So, though functional group classification is really important in organic chemistry, it is not in tension with the fundamental role of molecular structure in organic chemistry. Instead, it is molecular structure that explains when and why a particular functional group classification is appropriate. In the middle of the 20th century, chemists realized that some three-dimensional arrangements of atoms in a compound are more energetically stable than others. Furthermore, for some molecules, it was only by taking into account the energetics of these conformations that their chemical and physical behavior could be explained.7 This was a challenge to the explanatory goals of structural chemists because it showed that molecular structure by itself (composition, connectivity, and stereochemistry) was not enough to explain all the chemical and physical behavior of organic compounds. Still, the birth of conformational analysis did not undermine the fundamental role of molecular structure in the classificatory schemes of organic chemists. There are several reasons for this. First, except in rare circumstances, the individual conformations of organic molecules are not stable with respect to the laboratory practices of organic chemists. In other words, you can’t isolate distinct conformations, and any macroscopic sample of a chemical compound is likely to contain many different conformations at any one time. Second, much of the time, the conformation of an organic molecule is not important to understanding its behavior, and even in contexts where conformation is important, it is only the conformation of part of the molecule that plays a role. Lastly, the idea of a molecular conformation is conceptually dependent on molecular structure. Both historically and practically, one gets a grasp on the range of a molecule’s conformations by considering a molecular structure with fixed bond lengths and angles and then imagining, or mapping out, the space of possible arrangements of the atomic centers accessible by rotations around the single bonds in the molecule. 7 For a more detailed discussion, in a philosophical context, of this subfield of organic chemistry (called ‘‘conformational analysis’’) see Goodwin (2009). 123 W. Goodwin Differences in the energies of these conformations are then explained in terms of the possible interactions between closely situated parts of the molecular structure. So, though the consideration of conformations has become an important part of the practice of organic chemists, this was not accommodated by developing an alternative classificatory scheme that is somehow in tension with classification by molecular structure. Instead, certain refinements have been introduced to the classificatory scheme of organic chemists to allow them to distinguish conformations when this is necessary. Molecular structure is still fundamental because of its close connection to the laboratory life of organic chemists, because of the breadth of its applicability, and because of its fundamental explanatory role. When Tobin argues for the fundamental role of molecular structure in underwriting the context-dependent or higher-level classifications used by molecular scientists, it is the explanatory role of structure that she invokes. So long as molecular structure was found to ‘‘determine’’ or have as ‘‘a direct consequence’’ these additional properties or classifications, they presented no threat to the fundamental role of structure.8 I think that Tobin is right to invoke the explanatory centrality of molecular structure as a reason to regard these other classifications as modifications of the structural classification scheme, rather than as alternatives to it. However, I think she both overlooks some of the other reasons that molecular structure is fundamental (its breadth and close connection to laboratory life) and overstates the sense in which classifications (chemical or physical, macroscopic or otherwise) are held to depend on the molecular structures that explain them. Molecular structure need not ‘‘determine’’ or have as a ‘‘direct consequence’’ all of the properties or classifications that explanatorily depend on it. Rather, in most cases it is enough that these properties or classifications are understood to be present as capacities or potentialities of a molecule (or sample) in virtue of its molecular structure. It is only in certain restricted circumstances that most complex molecules act in those ways that are characteristic of being one functional group or another, nonetheless, the capacity to function like an alcohol, or a carboxylic acid, depends on molecular structure. Actual behavior depends on a lot of other things as well. Similarly, the reactive behavior of many organic molecules depends on their conformations, and what conformations are prevalent or accessible depends on the chemical environment. Still, the space of conformations, and thus the capacity to react in various ways, is determined by molecular structure. This sort of modal dependence on molecular structure reflects the commitment that structure will be an important part of the explanations that chemists provide for higher level behavior and its corresponding classifications, but it also recognizes that lots of other considerations are important in various circumstances. This is, I think, a more realistic understanding of the sense in which molecular structure is explanatorily 8 Slater, too, seems to endorse some such conception of the explanatory role of fundamental classificatory properties. He explains, in the process of rejecting structural monisms, that ‘‘[t]his stance presumes, of course, that the individuation of proteins will to a certain extent represent important biochemical facts about them’’ (Slater 2009, p. 853). Further, he argues against a monism based on primary structure because, ‘‘[a]n amino acid sequence alone has no direct link with a protein’s ultimate biological role’’ (Slater 2009, p. 852). The plausibility of these positions depends on what is meant by a ‘direct link’ and ‘representing important biological facts.’ 123 Structure, function, and protein taxonomy fundamental.9 With this more realistic understanding in hand, I will now go on to argue that not only is the primary structure of a protein broadly applicable and closely connected to laboratory practice, but it is also explanatorily fundamental for protein scientists in much the same way that molecular structure is explanatorily fundamental for chemists. As a result, then, the primary structure of proteins can be regarded as fundamental to the classification of these macromolecules in much the same way, and for many of the same reasons, that molecular structure is fundamental to the classification of chemical kinds. The primacy of primary structure The primary structure of a protein, which is the linear sequence of its component amino acids, is roughly translatable into its molecular structure.10 So to hold that the fundamental way of individuating proteins is by primary structure is to suggest that the scheme for individuating and classifying proteins is an extrapolation of the scheme used for classifying organic molecules generally. This is borne out in the fact that many of the same theoretical concepts and laboratory techniques (Molecular weight, conformation, chromatography, NMR, etc.) are used in both fields. Of course, in the case of these biomolecules, additional classifications, laboratory techniques, and theoretical concepts are added in order to support the interests of biochemists. Most obvious among these additional classifications would be functional classifications, like those used in enzyme nomenclature. These functional classifications are neither in tension with structural classifications of proteins, nor do they represent alternative and equally fundamental alternative ways of classifying biomolecules.11 Rather, because of its robustness, its close connection to laboratory practice, and its explanatory centrality, primary structure is the fundamental way of individuating proteins. Primary structure plays a central role in both our best current understanding of the biological lifecycle of proteins and the laboratory practices used to study them. Because of the vastness of the relevant fields, I can only gesture at these roles here. So, according to the Central Dogma of Molecular Biology, information about the primary structure of proteins is coded in the DNA and then translated by the ribosomes into actual proteins, which are then folded (either actively or passively), combined, and transported so that they can assume their various biological roles. Eventually, proteins are sequentially degraded back into amino acids or smaller 9 To attempt to put this in the more formal terms that Tobin favors, one might say that a classification, K, is compatible with microstructuralism so long as the capacities and/or dispositions in virtue of which something is a K are held to be consequences of its microstructure. So classification into functional groups is compatible with microstructuralism because chemicals are held to have their various reactive capacities and dispositions in virtue of their molecular structure. Similarly, the distinction of various conformations is compatible with microstructuralism because the space of conformations, and thus the potential of being in any particular conformation, depends on molecular structure. 10 Only roughly, though. Some functional proteins include bonds to metal ions, others include disulfide bridges, etc. 11 After all, it is structure that is supposed to explain function, not the other way around. 123 W. Goodwin polypeptide chains. Throughout this cycle, it is the primary structure of the protein that remains constant, while the higher-order structure and functionality of the protein may vary, or fail to exist altogether. One of the principle ways of producing proteins in a lab is to introduce a gene that codes for the primary structure of the protein into bacteria, which can then be stimulated to produce proteins of the desired structure. Additionally, the principal ways that biologists actually separate, count, and identify the proteins that are involved in some biological process (in proteomics, say) make use of properties that are ‘directly’ linked to primary structure. Two-dimensional gel electrophoresis takes a collection of individual proteins molecules and separates it first according to the isoelectric points of the various proteins. Once the proteins are spread out according to isoelectric point, they are then separated in an orthogonal direction by molecular weight. This results in a two-dimensional array of spots of protein, which can then be excised, digested with special enzymes, and run through a mass spectrometer in order to identify the protein by the characteristic distribution of the weights of the resultant polypeptides. All three steps of this process exploit properties linked to the amino acid sequence of the protein. The robustness of primary structure, relative to higher-order structure or functionality, ensures not only its centrality in our understanding of proteins, but also its importance in the laboratory manipulation and control of proteins. Even though the primary structure of proteins does not explain or encode, by itself, all of the characteristics that make proteins interesting and important, it does have a fundamental role in biochemists’ explanations of these characteristics. I will try to bring out this feature by considering some of the biological phenomena mentioned by Slater and Tobin to argue against structural monism for proteins. Moonlighting is, by definition, ‘‘the ability of a protein to fulfill more than one … function’’ (Tompa et al. 2005, p. 484). Several different mechanisms for moonlighting have been proposed. Some of these mechanisms are compatible with the idea that the function of a protein is the result of its detailed three-dimensional structure (which is what biochemists would call its tertiary, or in some cases, its quaternary structure). In such mechanisms, it is ‘‘change[s] in the molecular environment’’ (Tompa et al. 2005, p. 484) that are responsible for the fact that the protein performs different functions in different situations. Though such proteins can be regarded as having a (relatively) fixed tertiary structure, they are capable of performing more than one function because different parts of their ‘‘folded, globular’’ surfaces can interact with different partners in different environments. Moonlighting of this sort shows that there is no one-to-one correlation between the tertiary structure of a protein and its biological function; the molecular environment of the protein can have a central role in determining what function(s) the protein performs. However, this does not undermine the fundamental role of primary structure in classifying and understanding the behavior of these proteins. Indeed, the capacity to perform any of their multiple possible functions is, for proteins of this sort, a consequence of their tertiary structure. The capacity to assume this tertiary structure (in the right biological circumstances) is, in turn, a consequence of the primary structure of the protein. So, though there are a lot of environmental and contextual factors to consider as well, the diverse functions of these sorts of moonlighting proteins are ultimately understood to issue, as potentials or capacities, 123 Structure, function, and protein taxonomy from their primary structure. In this sense, then, moonlighting proteins of this sort aren’t very different from organic molecules that can be classified as several distinct functional types—the appropriate classification depends upon the molecular and reactive environment and the appropriateness of the classification is ultimately understood to issue from molecular structure. In the last decade or so, biochemists have begun to pay increasing attention to the large fraction of gene sequences which code for proteins that are not in a globular, folded state under biological conditions (Wright and Dyson 1999). The proteins generated from such sequences have been found to play a wide variety of different functional roles in spite of the fact that they do not have a well-defined tertiary structure (see Dunker et al. 2008 for a recent review). Proteins of this sort, which are referred to as intrinsically unstructured proteins, or IUPs, present a challenge to ‘‘the central dogma of structural biology,’’ which is the idea that, ‘‘a folded protein structure is necessary for biological function’’ (Wright and Dyson 1999, p. 322). In addition to challenging the orthodoxies of structural biology, IUPs also open up the possibility of additional mechanisms of moonlighting. So instead of having multiple distinct binding sites on the surface of a folded globular protein, some cases of moonlighting may depend on the fact that the protein has a lot of conformational flexibility and can therefore interact with, or bind, multiple partners, thereby resulting in multiple functions. The possibility of ‘‘binding promiscuity’’ resulting from the lack of a stable tertiary structure is alleged, by Tobin, to show that ‘‘all forms of microstucturalism must be rejected’’ (Tobin 2010, p. 54). Because, ‘‘the structural disorder of the primary structure is … the determining factor, which allows functional promiscuity…there can be no straightforward identity between macromolecular kinds and their primary structures’’ (Tobin 2010, p. 52). It seems, then, that Tobin holds that because of their intrinsic (tertiary) structural disorder it is impossible to explain (or derive, or understand?) the function of such proteins on the basis of their primary structures, and as a result microstructuralism is implausible. Whatever her reasons,12 once it is realized that primary structure need not entail or completely explain all interesting properties of protein in order to be explanatorily fundamental, the phenomena of moonlighting by intrinsically unstructured proteins does not undermine structural monism about protein classification. Instead, it reinforces the explanatory centrality of primary structure. Though much more complicated because of the size of typical proteins, attempts to understand the tertiary structure and function of proteins in terms of their primary structure are quite similar to the conformational analysis of organic chemists. In both cases, it is the space of possible conformations and the relative energetics of these conformations that are explanatorily fundamental. The tertiary structure of a protein, when there is one, represents the most energetically stable conformation available to the protein in the relevant biological circumstances. With intrinsically 12 Tobin also claims: ‘‘Clearly, because the primary structures of some moonlighting proteins are intrinsically disordered, then the structural arrangement of the microstructural parts is insufficient to classify any protein as a member of a macromolecular kind’’ (Tobin 2010, pp. 52–53). The italicized portion (the italics are mine) makes it seem as if Tobin thinks that IUPs have no fixed primary structure, and this is why they undermine microstructuralism. If this is the argument, then it is based on a misunderstanding of IUPs. 123 W. Goodwin unstructured proteins, there is no one distinguished conformation that is significantly more energetically stable than available alternatives. Instead broad swaths of the conformation space are available to the protein under biological conditions. Still, the potential to bind with various biological partners and thus the capacity to perform distinct biological functions is understood to issue from the protein’s primary structure, by way of its conformation space. In lieu of one particular conformation acting as a ‘lock’ into which the binding partner might fit, the binding partner might engage in ‘‘conformational selection’’ in which it selects ‘‘the structure that fits from the other conformers among the ensemble’’ (Dunker et al. 2008, p. 2). The presence of intrinsically unstructured regions in proteins, therefore, opens up the potential for different mechanistic routes to biological functionality. As such cases show, the explanation of protein function on the basis of protein structure often proceeds by way of higher-level structure, but not always. So the important lesson of moonlighting intrinsically unstructured proteins is not that primary structure is incapable of completely explaining protein functionality— primary structure was never able to do that. Instead, it is that biological functionality exploits a broader range of the potentials and capacities that proteins have in virtue of their primary structure than was originally thought. Robin Hendry has claimed that, ‘‘[t]he classificatory practices of a scientific discipline reflect its particular theoretical and explanatory interests’’ (Hendry 2006, p. 874). They also reflect the features of the classified phenomena that are relatively constant, and subject to manipulation and control in the laboratory. As we saw in the case of organic chemistry, it can make sense to regard certain characteristics as fundamental to a classificatory scheme, even when the discipline’s surface discourse includes many alternative ways of ‘carving up’ its domain. So long as these alternatives are smoothly reconcilable with the fundamental characteristics, which play certain special roles in the explanatory and laboratory life of the discipline, such classificatory practices do not demand pluralism. In the case of those scientific disciplines that study proteins, this adds up to the not so shocking result that the primary structure of proteins is, well, primary. Higher-order structure and function are important, but they are not as robust as primary structure, either in the laboratory or conceptually. By taking primary structure as explanatorily fundamental, biochemists are committed not to the thought that all interesting properties of proteins can be explained in terms of primary structure alone, but instead to the more modest belief that proteins have the capacity to behave in the ways that they do in virtue of their primary structure. Our classification scheme for proteins should be understood to reflect this commitment, and this is why proteins are individuated, most fundamentally, by primary structure. References Dunker A, Oldfield C, Meng J, Romero P, Yang J, Chen J, Vacic V, Obradovic Z, Uversky V (2008) The unfoldomics decade: an update on intrinsically disordered proteins. BMC Genomics 9(Suppl 2):S1– S2 123 Structure, function, and protein taxonomy Goodwin W (2009) Visual representations in science. Philos Science 76(3):372–390 Goodwin W (2010) How do structural formulas embody the theory of organic chemistry? Br J Philos Sci 61(3):621–633 Hendry R (2006) Elements, compounds, and other chemical kinds. Philos Sci 73:864–875 Slater M (2009) Macromolecular pluralism. Philos Sci 76(5):851–863 Stanford P (1995) For pluralism and against realism about species. Philos Sci 62:70–91 Tobin E (2010) Microstructuralism and macromolecules: the case of moonlighting proteins. Found Chem 12(1):41–54 Tompa P, Szasz C, Buday L (2005) Structural disorder throws new light on moonlighting. Trends Biochem Sci 30:484–489 Wright P, Dyson H (1999) Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. J Mol Biol 293:321–331 123
© Copyright 2026 Paperzz