<oologualJournal o f f h e Linnean SocieQ (1982), 74: 305328. With 2 figures The use of parsimony in testing phylogenetic hypotheses A. L. PANCHEN Department of <oology, The UniversiQ, Newcastle upon Tyne Accepted f o r publication 3une 1981 With the advance of cladistic theory differencesin principle between it and other systematic techniques are few but of fundamental importance. In the mechanics ofclassification they are confined to ranking and the rejection of paraphyletic taxa. In cladistic analysis, leading to cladograms, trees and phylogeny reconstruction, inconsistencies in apparent synapomorphies are said to be resolved using Popper’s hypothetico-deductive method together with the principle ofparsimony. However, not only do cladists not use Popper’s methodology, which is inconsistent with parsimony, but their use of parsimony is invalid as a test of phylogeny. The only accepted extrinsic test of a classification is that enunciated by John Stuart Mill. It has been claimed that cladistic classificationsyield the best results when judged by Mill’s criteria, but this is only possibly the case with analytic classifications produced by numerical techniques. No satisfactory test exists in normal (synthetic) cladism for distinguishing synapomorphy from homoplasy. The effects of this are particularly dire in cladograms and classifications involving fossils in which a Shfenreihe arrangement is adopted. KEY WORDS:--cladistics - homoplasy - hypothetico-deductive - parsimony - stufenreihe. CONTENTS Introduction . . . . . Cladograms and classification . Cladistic analysis . . . . The hypothetico-deductive method The use of parsimony . . . A natural classification? . . Conclusions-Does it matter? . Acknowledgements. . . . References. . . . . . Addendum . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 305 307 308 309 314 317 319 324 324 326 INTRODUCTION J The most remarkable feature in the history of systematic biology in the last two decades has been the rise of ‘Phylogenetic Systematics’ associated with the name of W. Hennig. Hennig’s (1950) original account of his systematic methodology attracted little attention outside his native Germany but the translation of a revised manuscript by Davis and Zangerl (Hennig, 1966) resulted in an extraordinary surge not only of interest in, but also advocacy of, Hennig’s system by some taxonomists and, in reaction, its equally emphatic rejection by others. The spread + 0024-4074/82/030305 24 $02.00/0 305 0 1982 The Linnean Society of London 306 A. L PANCHEN of cladism, as it is now almost universally termed, has been marked by much irrationality and considerable acrimony on both sides, which has frequently obscured the scientific issues involved. The controversy is a depressing instance of the inability of two groups having different paradigms to communicate with each other (Kuhn, 1970). Many of the differences between cladists and non-cladists have now been resolved b) the advance ( o r retreat) of cladist theory and others concern nonessential con\.entions rather than points of principle. Common criticisms are summarized and discussed by Mayr (1974). Notably irritant was Hennig’s supposed attitude to speciation. Firstly, it was asserted that he believed that speciation was invariably dichotomous. In fact Hennig (1966) was quite clear that a dichotomous differentiation of the phylogenetic tree . . . “is primarily no more than a methodological principle” (p. 2 10, see also immediately following discussion\. The idea that dichotomy represented the normal, or even only, mode of speciation seems to ha\re been introduced by Brundin (e.g. 1968) and supported by a number ofcladists, e.g. more recently, Eldredge & Tattersall (1975), although it is rejected by Eldredge 11979). Fortunately, it is unlikely that it is supported as a dogma by any cladist today. Another side issue did originate with Hennig: this is his curious convention that e\en if an ecologically isolated population gains genetic isolation from its parent stock, both should be regarded as new species (Hennig, 1966: 56-65) which seems at odds with his own ‘dexriation rule’ (1966: 59, 207; Brundin, 1968) that this is the normal mode of speciation. However, the consensus now seems to be that “Hennig’s views on limiting species at branch points are irrelevant to cladistic practice” (Platnick, 1980). Also contentious has been, and still is, Hennig’s definition of monophyly (Hennig, 1966: 72-73; Nelson, 1971 ; t\shlock, 1971, 1972, 1980; Farris, 1974; Platnick, 1977). Cladists claim, correctly, that they have a strict and unambiguous definition ofa monophyletic group as one containing all the known descendants of a single ancestral species and including that species. Others, notably Ashlock and also Mayr (1974) accuse the cladists, again correctly, of using an established term in a novel sense. For most workers interested in phylogeny the concept of a unique common ancestral species is associated with monophyly (Mayr, 1969) in contrast to Simpson’s (1961) far too accommodating definition, usually characterized as ‘minimal monophyly’. However, the non-cladist concept includes what the cladists exclude as paraphyletic groups, those taken to be derived from a unique ancestor but not containing all its descendants: Reptilia has been the favourite example from Hennig onwards. Ashlock provides a sensible compromise. Hennigian monophyly is distinguished as holophyly, which together with paraphyly comprises monophyly in its original sense. Holophyly is then unambiguous, as is paraphyly, and both may be contrasted with polyphyly. A polyphyletic group is one in which the membership is not uniquely derived. I n the sections which follow I propose to identify the irreducible minimum of significant differences between cladism and traditional classification. These include ranking and the attitude to paraphyletic groups. I then go on to examine the claims of cladists that their methodology includes the use of the hypotheticodeductive method, which leads to what I hope to demonstrate is a fatal methodological flaw in their system-their eccentric use of the criterion of parsimony. USE OF PARSIMONY 307 CLADOGRAMS AND CLASSIFICATION One of the most important distinctions that has emerged in recent cladistic discussion is that between cladograms and trees. Eldredge (1979) defines a cladogram as “a branching diagram depicting the pattern of shared similarities thought to be evolutionary novelties (‘synapomorphies’) among a series of taxa” and a phylogenetic tree (phylogram) as “a diagram (not necessarily branching!) depicting the actual pattern of ancestry and descent among a series of taxa”. The distinction between the two was made explicit by Tattersall & Eldredge (1977) following an unpublished manuscript by Dr G . Nelson. Before this many cladists claimed that their cladograms represented phylogeny directly (or at least the cladogenetic component of it) and also that a classification should be a redundant image of the cladogram which is logically prior to it. Now the emphasis has changed and cladograms and the classifications derived from them represent “not . . . the order of branching of sister-groups, but the order of emergence of unique derived characters, whether or not the development of these characters happens to coincide with speciation events” (Hull, 1980). Thus the emphasis of cladistics has moved away from the reconstruction of phylogeny (not all or perhaps even a majority of cladists are interested in trees) to a hierarchical clustering of synapomorphies for its own sake, with the assumption that the regularities exposed are inherent in nature (Platnick, 1980). With the development of this attitude to the significance of cladograms, cladists seem to be losing or to have lost any interest in the mechanism or mode of evolution (Hull, 1980). Platnick (1980) notes that cladistic analysis can be used to demonstrate relationships between any entities which change by modification through descent. Platnick & Cameron (1977) have shown the applicability of the technique to linguistics and to the texts of ancient manuscripts. The ultimate, or perhaps penultimate, position seems to have been reached by Nelson (1979: 8) to whom “a cladogram is an atemporal concept .. . a synapomorphy scheme”. However, Nelson still appears to retain an interest in phylogeny, which has been lost by some other cladists. An inherent technical difficulty in cladistic classification concerns the matter of ranking. The problem with respect to the incorporation or fossil taxa of low rank as sister-groups of taxa (with extant members) of high rank has been ably discussed by Patterson & Rosen (1977). They propose the adoption of two conventions; (1) sequencing, as proposed by Nelson (1972, 1974), such that a series of fossil taxa of the same rank within a taxon of higher rank is arranged so that each taxon is the sister group of all those succeeding it; (2) the use of the term ‘plesion’, in addition to the normal category name from species up, to avoid the necessity of (e.g.) a fossil species being the sole representative of a monotypic genus, family, order (and intermediate ranks) etc. The alternative is exemplified by McKenna’s (1975) classification of mammals, without (as Patterson & Rosen point out) the mammallike reptiles which together with the mammals form the clade Theropsida. An enormous number of ranks and thus categories was required. McKenna formulates this number as lying between log,n and n-1 for n species of organisms: he optimistically suggests that the number (r) is nearer the former than the latter. I had independently reached the same formulation ( 1, if the terminal taxa, species in this case, are counted). r = 1 +log,n derives from a ‘perfect’ cladogram in which there are exactly 2“ species (where x is any whole number) and every branch has a + 308 A. L. PANCHEN dichotomy at every rank (Fig. 1A). The addition of one species to such a perfectly symmetrical cladogram requires an additional rank ! The worst case (r = n) applies to th; ‘Hennigian comb’ (Fig. lB), in which each terminal taxon is the sister group to all those to its (conventional) right. This type, or something close to it (e.g. Patterson & Rosen, 1977; Gaffney, 1979), has become almost standard for cladistic palaeontologists and, unless the autapomorphies of the terminal taxa are emphasized, comes remarkably close to the scorned ancestor-descendent sequences of unreconstructed palaeontologists. It is, if considered equivalent to a ‘tree’, also identical to the concept of ‘Stufenreihe’ of Abel (1929; Simpson, 1953). For something over a million known species of living animals (e.g. Shorrocks, 1978), the minimum conceivable number of ranks required for a complete cladist classification of the animal kingdom is something over 20 ( r = 1 logp) (as noted by McKenna), while the maximum is of course something over a million! I feel much less sanguine than McKenna that the true number would be near the smaller figure, but the pressure to approach the less extravagant formulation, even for much more modest taxonomic enterprises, probably affects the judgement of neontological cladists (see Conclusions below). A final point about classification concerns a substantial and important difference between cladists and their opponents. The aim of cladistic analysis is the production of holophyletic taxa ; paraphyletic taxa are rejected. Traditional taxonomy accepts paraphyletic taxa if these result from the effective removal from a holophyletic taxon of a clade or clades which have reached a significantly higher grade. The standard example, as noted above, is the paraphyletic class Reptilia from which the holophyletic taxon (traditionally class) Aves has been in effect removed (possibly together with the class Mammalia, depending on one’s opinion of the status of the ‘mammal-like reptiles’ and origin of the Theropsida (Panchen, 1972; Kemp, 1980)). In other words, while only cladistic criteria are used by followers of Hennig, yielding (they hope) holophyletic groups, traditional taxa are monophyletic in Ashlock’s sense but may be paraphyletic if the separation of a descendant clade is dictated by phenetic criteria (Mayr, 1974). It is worth noting that this is equivalent to the use of significant autopomorphies to separate and rank taxa (leaving behind paraphyletic groups) by traditional taxonomists, while cladists reject the use of autapomorphy in this way. + CLADISTIC ANALYSIS Cladistic analysis consists of the recognition, amongst taxonomic characters, of Figure 1 . The two extreme forms of cladogram, shown for eight hypothetical terminal taxa. A, Symmetrical cladogram, giving the minimum number of ranks per number of terminal taxa: r = I +log,n; B, ‘Hennigian comb’ (Shrfmrcihc) cladogram, as often p d u c e d by palaeontologists, giving the maximum number of ranks: r=n. For further explanation see text. USE OF PARSIMONY 309 autapomorphies, synapomorphies and symplesiomorphies, preliminary to the construction of a cladogram. Autapomorphies are diagnostic characters unique to a taxon of a given rank; synapomorphies are unique to a pair of taxa (‘sistergroups’) of that rank, and symplesiomorphies extend beyond the sister group within an inclusive taxon of higher rank. As has been repeatedly emphasized by cladists a symplesiomorphy at one rank may be a synapomorphy at the next higher rank, while a synapomorphy at one rank is always a symplesiomorphy at the rank below: thus the cladogram resulting from cladistic analysis is a nested set of synapomorphies uniting sister groups. It seems to be agreed by most, if not all, cladists that the cladograms that they produce represent hypotheses (e.g. Cracraft, 1979; Gaffney, 1979; Eldredge, 1979). Those cladists who still hold that they are entitled to call themselves ‘phylogenetic systematists’ in the literal sense assert that a cladogram represents phylogeny, via one of the possible trees that can be constructed from it. Synapomorphies are then taken to be evidence of genuine relationship, with the underlying acceptance of the axiom that evolution has occurred. Nonphylogenetic cladists, who do not insist on that axiom, nevertheless presumably all accept that the hierarchy of synapomorphies, and the homologies they represent, correspond to something in nature and not just a solipsist ordering of sense-data. The most obvious ‘something’, is the pattern of ontogenies of organisms which can be arranged in a hierarchical system according to the principle of divergence in ontogeny enunciated by von Baer (1828; Gould, 1977). A classification based on homology due to the pattern of ontogenies, to which we have direct access, need not presuppose evolution (Patterson, 1982). Given that cladograms represent hypotheses, most cladists would like to gather their methodology within the fold of empirical science by asserting that cladograms can be tested. They further assert that this can be done using the hypotheticodeductive method. THE HYPOTHETICO-DEDUCTIVE METHOD This is invariably associated, and overtly by cladists, with the methodology enunciated by Popper (1934, 1959, 1963). Platnick & Gaffney (1977, 1978a’b) have provided useful reviews of Popper’s work, specifically aimed at their taxonomist colleagues. Popper established a criterion of demarcation of empirical (scientific) theories, separating them from metaphysical ones by potential falsifiability. Thus, if scientific theories are in the form of strictly universal statements, as Popper thinks they should be, then each is potentially falsifiable by one or more singular statements, whereas a universal statement is not logically verifiable. However, ‘numerically universal’ and ‘existential’ statements (“all human beings are less than 9ft tall”, “there are black ravens”, respectively, are examples of Popper’s), are not empirical theories and are potentially subject to verification. Popper’s system therefore consists of the proposal of a theory, cast as a universal proposition, in explanation of the phenomena under study. After testing for internal consistency, non-tautology, and against rival theories for explanatory power an attempt is made to falsify it. Thus, new empirical predictions are made from the theory and an attempt made to falsify these by observation and experiment. The observation or experiment may yield a singular statement which contradicts a prediction of the theory and thus falsifies the theory. Every time 710 A. L PANCHEN falsification fails the theory is corroborated, but it can never be proved. Unfortunately it was soon spotted that there is a logical flaw in this scenario (e.g. Achinstein, 1968). Using the logician’s modus tollens (of which more below) a universal proposition can only be refuted by a contradictory particular proposition or ifide Popper) basic statement. However, these basic statements, derived from, but not identical to, the results of observation or experiment, cannot be verified. They have empirical content, or are “facts interpreted in the light of theories; they are soaked in theory, as it were” and contain universal terms denoting “physical bodies which exhibit a certain law-like behauiour” (Popper, 1963: 387). Thus basic statements, being unverifiable, cannot irrevocably refute universal propositions. Popper was aware of this difficulty, but his discussion of it is perfunctory (e.g. Popper. 1963: 41 i. As a result both critics and followers of Popper, including a number of distinguished scientists (e.g. Bondi & Kilmister, 1959; Medawar, 1967 : 144J concluded that he held that irrevocable falsification was possible and also that it was characteristic of the best science. Meanwhile, however, a number of scientists and philosophers pointed out that, whatever the logical status of falsification and despite its excellence as a moral precept for the working scientist, it was not the way in which scientists (even physicists) usually operated (ilchinstein, 1968; Kuhn, 1962; Lakatos, 1970; Harris, 1972; Brush, 1974). Lakatos ( 19701 in fact tried to save Popper from himself by asserting that he \Popper) never was a ‘dogmatic falsificationist’ (except perhaps in a prepublication phase). Lakatos further distinguished two other stances which he claimed characterize Popper’s writings, those of ‘naive’ and ‘sophisticated methodological falsificationism’. The former is distinguishable from dogmatic falsificationism only in that one admits the logical fallacy of irrevocable falsification and then proceeds as if that fallacy did not exist. The latter is a very different kettle of fish: it is not clear that Popper accepted it before Lakatos (1970), but he certainly appears to have done so afterwards (Popper, 1972), despite his strictures on Lakatos’ interpretation of his work (Popper, 1974). ’Sophisticated methodological falsificationism’ is characterized by Lakatos ( 1970: 1 1 6 132) “. . . a scientific theory 7 is falsified if and only if another theory 7‘ has been proposed with the following characteristics: (1) T’ has excess empirical content over T : that is it predicts novel facts, that is, facts improbable in the light of, or even forbidden by T ; (2) T‘ explains the previous success of T, that is, all the unrefuted content of 7’ is included (within the limits of observational error) in the content of T and (3) some ofthe excess content of T is corroborated.” Thus, the emphasis shifts from the confrontation of theories by data to the necessary competition between rival theories. The sophisticated methodological falsificationist would never accept the falsification of any theory by observation or experiment unless a rival theory having Lakatos’ three characteristics is on hand to take over : “no experiment, experimental report, observation statement or well-corroborated lozdevel f a l s i f i n g hypothesis alone can lead to falsification” (Lakatos, 1970: 119). If, as seems to be the case, Popper (1972) assents to all this it seems to me to constitute a U-turn of governmental completeness. There is, however, a verbal differences between Popper and Lakatos. Popper (1974: 1009) would assert that a theory may be falszjied in the absence of a more successful rival, but not necessarih rejected. One other feature of Lakatos’ formulation must be noted. To him scientific progress proceeded by the progress from one theory to another and so on: USE OF PARSIMONY 311 . . . If this progress resulted in increasing empirical content at each stage the ‘problem-shift’ was progressive, if not, degenerating. The idea of using Popper’s methodology to test taxonomic hypothesis seems to have originated with Bock (1974), an emphatic non-cladist, and Miles (1973), then an incipient cladist. Despite his stance, Bock concludes (also Bock, 1977) that the most severe tests in classification are cladistic tests involving falsification. Cladists rapidly accepted this thesis (Wiley, 1975; Gaffney, 1975) and in the last few years an enormous and repetitive literature on the use of the hypotheticodeductive method in testing cladograms and/or classifications has sprung up (the pages of Systematic <0010gy give an all-too-generous sample of this). Two themes are prominent, firstly, whether the method is applicable to classification at all, and, secondly, instructions for its use. Works of actual classification purporting to use the hypothetico-deductive method are relatively (but perhaps not surprisingly) rare. Seminal in exploring the former theme was a paper by Kitts (1977). Firstly he pointed out that the diagnoses of taxa are not empirical hypotheses potentially subject to falsification, but nominal definitions (subject to acceptance, rejection or modification in the light of increasing knowledge) simply because we cannot determine if an organism belongs to a given taxon by reference to anything other than the sort of character states embodied in diagnoses. Secondly, he concluded that the hypotheses of taxonomists, such as the denotation of a given taxon, are arrived at inductively and thus correspond to Popper’s ‘numerically universal statements’. As such they cannot be subjected to testing by the hypotheticodeductive method, which deals only with ‘strictly universal statements’, although in theory they are both verifiable and falsifiable by enumeration. This was further developed by Fox (1978) in an unpublished paper in which he pointed out the temporal and spatial limitations of any hypothesis of relationships, and hence descent, based on a cladogram. In response to Kitts, Cracraft (1978) played down the importance of Popper’s distinction: Patterson (1978) on the other hand suggested that, philosophically, holophyletic taxa are individuals to be recognized rather than tested, acknowledging the priority of Griffiths and Hull. H e further rejected the idea that the principle ofparsimony constitutes a test in Popper’s sense, with which I strongly agree (see below). The idea that taxa are individuals parallels the philosophical theory, independently reached by Putnam ( 1975) and Kripke (1980)’ that terms denoting ‘natural kinds’ are proper names. Nelson (1978) asserted that a hierarchical pattern resulting from cladistic analysis (and thus expressible as a cladogram) is testable. He cites the hierarchy (the letters on each line representing the included terminal taxa in a taxon of higher rank) : T,-T,-T, ABC A BC B C falsified by additional information giving : ABC AB A B C A. L. PANCHEN 312 . . . “under the assumption that the AB component is a shared derived character [synapomorphy) and not a mistake” (my italics). But the identification of such ‘mistakes’ depends ultimately on the principle of parsimony. A work frequently cited with approval by cladists (e.g. Bonde, 1977) as an example of the use of the hypothetico-deductive method is Miles’ (1975) discussion of the relationship of the Dipnoi (lungfish). Miles commences his study by presentation of a cladogram of the major groups of gnathostome vertebrates, but omitting the Dipnoi, as a framework for discussing the position of the latter on the cladogram. Various theories of the relationship of the dipnoans to different groups in the cladogram are then discussed seriatim. In each case, common characters (putative synapomorphies) cited by previous authors are listed as ‘tests’ of alternative proposed relationships and those characters discussed and identified as symplesiomorphies, synapomorphies, autapomorphies, convergent and parallel characters and non-diagnostic ones (‘chance’), before being used to test those relations hips. Miles’ paper is written with exemplary clarity and has an original format which enhances that clarity. However, his testing technique is not that of the hypotheticodeductive method. This is principally because he makes no potentially falsifiable predictions, bold or otherwise, from the various hypotheses of relationship that he presents. Instead, he presents in effect a sort of matrix of incompatible theories of dipnoan relationship each with the set of characters cited by their partisan authors. It is in many respects an admirable study but save for some characteristic jargon has little to do with Popperian method or philosophy. It seems to me that Miles and other cladists may well be confusing Popperian falsification (the hypothetico-deductive method) with the logician’s modus tollem, which is merely the logical form in which it is cast. Modus tollens ( I f p then q, not q, :. not p) may be characterized by the following valid argument whose propositions (in agreement with Gardiner, 1980) I believe to be true: (1’1 r f the porolepiforms are the sister-group of some or all tetrapod groups, then they will be among those fish known to be choanate (if p then q ) . ( 2 : Porolepiforms are not among those fish known to be choanate (not q ) . (3) Therefore porolepiforms are not the sister-group of any tetrapod group (.’. not P). Unfortunately, in this case the consequent ( q ) does not constitute a new prediction from the antecedent (p) (the choanate or non-choanate status of the porolepiforms is taken to be already known). This is an exact analogy of the state of affairs in Miles’ (1975) discussion. It could of course be the case that the consequent represents a prediction from the antecedent: “let us see whether these fish (none of whose snouts are yet described) are choanate, as a test of their status as close relatives of at least some tetrapods.” However, again (and again unfortunately) this would still not be a case of the hypothetico-deductive method according to Popper, despite the logical validity of modus tollens and the truth of the propositions. This is because each of those propositions is either contingent,i.e. not a strictly universal statement but one that described a spatio-temporally limited state of affairs, or (as might be the case with q ) a matter of definition. This is exactly the point made by Kitts (1977). As he also points out, both types of proposition are arrived at inductively, whatever the validity of that particular procedure. Gaffney (1979) presents a useful didactic account of cladistic method particularly with reference to the cladist’s attempted use of Popper’s methodology USE OF PARSIMONY 313 and of parsimony. He further presents a tentative ‘theory of relationships’ of ‘rhipidistian’ fishes and tetrapods in the form of eight sets of nested synapomorphies and the ‘Hennigian comb’ (Stufenreihe) type of cladogram. It is perhaps significant that the autapomorphies of his nine terminal taxa are not listed (see above, p. 308). More importantly, however, Gaffney sets out very clearly the cladist approach to falsification. H e considers the cladists’ paradigm case, the ‘three-taxon statement’. Three taxa, thought to constitute a monophyletic group, are tested by apparent synapomorphies to yield the conclusion that of the three (say X, Y, Z), Y and Z are more closely related to one another than either is to X. Consideration of only one apparent synapomorphy, shared by two of the taxa but not the third, will yield this apparently unambiguous result ; but more than one may give equivocal results ‘falsifying’ the original one. Assuming that the original result is ‘correct’ this can happen for two reasons, parallel evolution, or misidentifications of a character in two taxa as homologous when it is not (convergent evolution-Nelson’s ‘mistakes’). However, Gaffney (1979: 96-97) continues “When multiple series of characters are used, it is often the case that all three possible hypotheses relating three taxa will appear to be falsified by one or more character distributions. In this case, the least-rejected hypothesis (the onefalsijied thefewest number of times) is the one incorporated into the internesting system of hypotheses .. . The principle of parsimony is widely invoked for this decision” (my italics). This seems to me to be a bizarre use of the idea of falsification, being neither ‘dogmatic’, nor either version of ‘methodological falsification’ and certainly not what either Popper or Lakatos meant by the term. Thus, while an accurate representation of what cladists mean by falsification, it is not an application of the hypothetico-deductive method at all. All this, of course, need not matter: the fact that cladists have misunderstood Popper does not necessarily invalidate their methodology. Nor does the fact that they have apparently overlooked Popper’s strictures on the use of parsimony. These are implicit in Popper (1959) and explicit in Popper (1972). In The Logic of Scientijic Discovery, Popper (1959 : 78-84, 134-145) explores the relationship of ‘Simplicity’ to ‘Conventionalism’. The stance of conventionalism, which Popper rejects, is the adoption of the convention, in the absence of a corresponding theory of truth, that all ‘laws of nature’ shall be simple and that ‘simple’ laws shall be protected from falsification by ad hoc auxiliary hypotheses. Conventionalism seems to me to be in exactly the same spirit as the cladist’s acceptance of the phylogeny derived from the cladogram which implies the least parallel and convergent evolution and the fewest reversals, “ . . . not because nature is parsimonious, but because only parsimonious hypotheses can be defended by the investigator without resorting to authoritarianism or apriorism” (Wiley, 1975 : 23-uoted with approval by Cracraft 1979: 34; my italics). Admittedly, Popper (1959: 145) uses the word ‘parsimony’ with approval, but here he is advocating not the conventionalist appeal to simplicity, but the expression of hypotheses in a form in which an attempt at falsification can be most easily made; not a choice amongst hypotheses (cladograms), all of which have already been ‘falsified’ at least once, but a choice among new hypotheses none of which has yet been tested. Popper’s overt stigmatization of conventionalist parsimony ( 1972 : 294-295) follows a discussion of reductions (e.g. of chemistry to physics, or biology or psychology to physics): “I call bad reduction or ad hoc reduction the method of 314 A. L. PANCHEN reduction by merely linguistic devices ; for example, the method of physicalism which suggests that we postulate ad hoc the existence of physiological states to explain behaviour which we previously explained by postulating (though not by postulating ad hocj mental states . . . This second kind of reduction or the use of Ockham’s razor is bad, because it prevents us from seeing the problem. I n the picturesque as well as hard-hitting terminology of Imre Lakatos, it is a disastrous case of a ‘degenerating problem shift’ [see above, p. 31 1 ; and it may prevent either a good reduction, or the study of emergence, or both.” In the last clause I would suggest the applicability of substituting “the study of parallel and convergent evolution” for “the study of emergence’’ to make my point! I have laboured my rejection of the cladists claim to be using the hypotheticodeductive method at some length, because implicit in that claim is one to a type of scientific respectability which they deny their rivals. I n fact cladist methodology, at least in the construction of cladograms, rests on only two principles, the search for synapomorphies and the principle of parsimony. The former is undoubtedly Hennig’s greatest contribution to systematics. It is not of course original, but most non-cladist systematists who have commented on Hennig acknowledge the importance of his insistence on it, (e.g. Mayr, 1974; Bock, 1977; Simpson, 1975), and the word ‘synapomorphy’ has entered, albeit against some resistance, the vocabulary of many non-cladists (including mine). However, in the construction of dichotomous cladograms, Hennigians would be helpless without the principle of parsimony. Cladism stands or falls (except for the use of one atrocious immunizing strategy to be discussed below) on that principle. THE USE OF PARSIMONY The use of parsimony in classification and in phylogeny reconstruction was not invented by cladists. Indeed, while used informally before, it was introduced as a distinct technique into phylogeny reconstruction by phenetic (‘numerical’) taxonomists including Camin & Sokal ( 1965), who incidentally referred to their dendrograms of phylogeny based on phenetic principles as ‘cladograms’, a term also suggested (in parallel!) by Mayr (1965). Sneath & Sokal (1973 : 321) cite three types of ‘minimum evolution’ reconstructions of trees based on parsimony. These are ( 1 ) a minimum number of evolutionary steps (Camin & Sokal, 1965) ; ( 2 ; a minimum number of mutational steps (Fitch & Margoliash, 1967), or ( 3 ) a minimum length tree (Edwards L?L Cavalli-Sforza, 1964; Kluge & Farris, 1969). The justification of all three types of reconstruction is the principle of parsimony and that alone. Camin & Sokal’s minimum evolutionary step method bears superficial resemblance to the cladists’ method using parsimony in that they ( C & S) are using a heterogeneous collection of characters in at least some of which it is possible to identify a primitive and one or a series of derived character states. The phenetic cladogram is then constructed (with hypothetical ancestors, ‘HTS’s, for the terminal taxa, ‘OTU’sj such that the minimum number of evolutionary steps, i.e. total of changes (within character and summing characters) from less to more derived states, is involved. The method differs from that of cladists in that (1) there is no identification of symplesiomorphies and synapomorphies at each rank; (2) all character state changes (positive and apparent negative) are summed to give the USE OF PARSIMONY 315 mean minimum number of steps, so that there is no concept of ‘falsification’ and ( 3 ) the maximum number of characters that can be specified is used. The method is in fact an extension of phenetic classification in which ‘parsimony’ in the sense of the ratio of like to unlike character states is of the essence. The best classification is regarded as the one using the largest number of characters (originally unweighted) and in cluster analysis ‘relationship’ (phenetic resemblance) is merely represented by that ratio, without any phylogenetic or essentialist connotation. The ‘minimum mutation’ use of parsimony is used by those reconstructing phylogeny using differences between different organisms in the molecules of homologous proteins (e.g. cytochrome c, myoglobin) . The method is explained by Fitch & Margoliash (1967).A mutation distance between two homologous proteins is defined “as the number of nucleotides [in the coding DNA] that would need to be altered for the gene for one [protein] to code for the other”. This is derived from the number of different homologous amino-acid radicals between the two molecules and the number of nucleotide changes required to change from one homologous amino-acid to the other in each case. A tree is then constructed by associating the pair of organisms, represented by their homologous proteins, with the minimum mutation distance. The pair is then averaged and the procedure repeated until all the organisms are associated in a dichotomous tree in which the length of the branches between nodes represents the mutation distance. Various correction techniques are employed to allow for the bias in the order of clustering. This technique is perhaps the most easily justified form of parsimony. Within the system, using one series of homologous proteins and thus the codon they represent, it is unknowable how many back-mutations or roundabout mutational routes there have been (except perhaps by adding more organisms) so parsimony is the only refuge and at least like is being compared with like. However, when a tree so produced is compared with an agreed phylogeny from normal anatomical characters there is not a one-to-one correspondance. Both the pattern of dichotomies and the internode distances fail to match, if the latter are compared to probable time distances estimated from the fossil record (RomeroHerrera et al., 1973, 1978). Molecular evolution is not parsimonious. The minimum tree-length method derives from another technique of phenetic classification in which characters are represented by axes in multi-dimensional space so that each organism is represented in that hyperspace by a point plotting its position on the axes of all its characters (Edwards & Cavalli-Sforza, 1964). A phylogeny is then reconstructed as a network of minimum distances between the points. The parsimonious assumption here is that evolution has covered the minimum ‘phenetic distance’ to arrive at the terminal taxa. Philosophically this assumption is similar to that of the ‘minimum step’ method. The cladists use of parsimony contrasts strongly with all three of these phenetic techniques. It should be particularly noted that cladists use parsimony in the construction of cladograms not trees. When cladograms were taken to be direct reconstructions of phylogeny (or of cladogenesis, which is what was meant by ‘phylogeny’), then the purpose of parsimony was the same as that of phenetic or molecular taxonomists engaged on phylogeny reconstruction. This is still the case with those cladists who assert that cladograms (via trees) represent phylogeny. For those who are unwilling to make this assertion, or regard it as unnecessary (Patterson, 1982), then parsimony, apart from being a methodological principle, is presumably a test of homology. 316 A. L. PANCHEN Two cladist uses of parsimony may be distinguished. The first is in the ‘threetaxon statement’ described by Gaffney (1979; and numerous earlier papers by other cladists). This method is used to resolve inconsistent apparent synapomorphies. The second is at first glance characterized by the example of ‘falsification’ cited by Nelson (1978) and noted above. This involves the confrontation between conflicting cladograms. The first use of parsimony is related to the cladist concept of ‘falsification’ noted in the last section. The procedure is firstly to recognize symplesiomorphies present in each of the three taxa (XYZ) being considered. The normal technique here is out-group comparison (e.g. Gaffney, 1979 : 92-94). Secondly, autapomorphies of each of the three taxa are used to diagnose those respective taxa but not to ‘test’ relationship. The remaining significant characters recognized by the cladist should by synapomorphies of the presumed monophyletic group comprising all three taxa and apparent synapomorphies uniting two of the taxa to the exclusion of the third. If a number of the latter type of characters is considered then inconsistencies will almost certainly arise due to parallel and convergent evolution, as noted above. The cladist use of parsimony at its crudest then dictates that a simple majority vote is taken: the two taxa united by the largest number of apparent synapomorphies are declared ‘sister’ qroiips, while the ‘synapomorphies’ apparently producing the two other possible combinations are pronounced false and thus due to parallel or convergent evolution. 4 s I have pointed out before (Panchen, 1979), such a conclusion could only be valid if ( a ) the organisms comprising the three taxa could potentially be atomized into a series of discrete diagnostic characters of equal taxonomic weight, such that the state of each character was in no way correlated with the state of any other (or alternatively that only characters having these features were used); and also (bj if a competent cladistic taxonomist could be reasonably sure that in each taxon, the relatively tiny number of apparent synapomorphies that he uses in his test has a similar ratio of true to ‘false’ synapomorphies as does the class of all its characters. The first proviso introduces the vexed problem of weighting and character correlation which has an extensive literature. It is worth noting that cladists appear to he embarking on the same agonizing consideration of weighing and correlation (Gaffney, 1979: 99-101; Eldredge, 1979: 172-1733 as did the pheneticists perhaps a decade previously (compare Sokal & Sneath, 1963 : 118-120: Sneath & Sokal, 1973: 109-113; also Burtt, 1964). Both Gaffney and Eldredge quote Hecht (1976) and Hecht & Edwards (1976) who rank kinds of morphological features on a scale of significance of one to five or inversely (five to one) of probability of parallelism. Their reactions, however, are different. Gaffney agrees ‘wholeheartedly’ with Hecht & Edwards’ (1977) dictum that “From the viewpoint of lineage detection, i t is more important to use a few well-analysed morphoclines than many poorly or improperly analysed ones” ; while Eldredge concludes that “However risky raw parsimony may be, we are still better off if that is our primary criterion”. The difficulties inherent in this first proviso are of course inherent in the pheneticists’ methodology as a whole, in which a ‘head-count’ of characters is taken in estimating phenetic distance, and thus a fortiori in their uses of parsimony (minimum evolutionary steps and minimum length tree: see above). ‘The stringent, in fact unrealizable, conditions of the second proviso are however unique to the cladist use of the parsimony criterion. USE OF PARSIMONY 317 Thus, even fz we allow that the course of evolution is parsimonious, which we know it not to be (Romero-Herrera et al., 1973, 1978; Friday, this volume), and I f we further allow that evolution is normally divergent, so that true synapomorphies outnumber false synapomorphies, which in many cases appears to be untrue (e.g. Butler: 1982) ; the first cladist use of parsimony (in the ‘three-taxon statement’) is still illegitimate. This is more emphatically the case because, unlike pheneticists, cladists do not aim to use the maximum number of separable characters in assessing ‘relationship’. Their use of parsimony is thus analogous to predicting the results of a competitive election with a large electorate by asking the voting intentions of the first dozen people one happens to meet. The second possible cladist use of parsimony, involving the confrontation between several incompatible cladograms, is even worse than the first. The hypothetical case cited by Nelson (1978) and noted above (p. 3 11) might appear to fall within this second category. He gives a classificatory hierarchy (in effect a cladogram) falsified by another resulting from ‘additional information’. However, in Nelson’s example the established division of the major taxon (ABC) is given as (A) and (BC) while the new data are supposed to give (AB) and (C).The example therefore resolves itself into a simple three-taxon statement. However, a possible system would be to produce a series of cladograms, each based on a, different suite of characters and then to accept the pattern to which the majority of cladograms conform. This is perhaps an improbable exercise for one individual starting from scratch, but a probable one for a cladist reviewing (e.g.) the number of incompatible cladograms depicting the relationships of the major groups of fishes. In this case, unless this is specified, even the number of ‘synapomorphies’ uniting groups at the same rank may not be the same, so the use of parsimony is even less advisable. A NATURAL CLASSIFICATION? The ideal of ‘natural classification’ has probably existed as long as naturalists have tried to group the organisms in which they are interested into a hierarchical system. In fact the ‘species problem’ that Darwin wrestled with and that the young Alfred Russel Wallace and Henry Walter Bates optimistically set out for Amazonia to solve (e.g. McKinney, 1972), was based on the concept of ‘natural classification’. Evolution was the hypothesis explaining the seeming true relationships displayed by classification. The species problem was the problem of how the evolution suggested by classification occurred. Thus, while most cladists and traditional taxonomists agree that a natural classification should reflect phylogeny and thus evolution (and even pheneticists agree that it should be possible to reconstruct phylogeny from classification), the concept of natural classification is logically as well as historically prior to that of phylogeny. Phylogeny then cannot be used to test classifications and some extrinsic criterion must be used to evaluate them if, as I have tried to show, all intrinsic tests are invalid. The only extrinsic criterion for judging the naturalness of a classification which has gained any currency is that formulated by John Stuart Mill (1872: 1974 edn: 714): “The ends of scientific classification are best answered, when the objects are 318 A. L. P.4NCHEN formed into groups respecting which a greater number of general propositions can be made, and those propositions more important, than could be made respecting any other groups into which the same things could be distributed . . . A classification thus formed is properly scientific or philosophical, and is commonly called a Natural, in contradistinction to a Technical or Artificial, classification or arrangement.” Mill’s criterion of naturalness was adopted by Gilmour (1961 ; Gilmour & LYalters, 1963) and has been commended by pheneticists (Sneath, 1961 ; Sneath & Sokal, 1973 : 24-25), traditional taxonomists (Mayr, 1974 : 96) and by Farris ( 1977, 1979, 1980), a “mathematical cladist”. Two related concepts appear to derive from the development of Mill’s criterion. T h e first concerns the distribution ofcharacters within and between taxa. Sneath (1961) points out that Mammalia is a natural taxon because it corresponds to the distribution of a large number of correlated features, but that a taxon confined to horses, mice and rabbits is not a natural taxon because most of the features which would characterize such a group are not exclusive to it but are common to other mammals (or at least placentals). Sokal (1977, quoted but not cited by Farris (1980), further writes “Natural polythetic classifications permit two types of predictions concerning character states. These states should be homogeneous within taxa and heterogeneous between them.” This introduces the second concept, the idea of the predictive power of classifications, characterized by Fitch (1979) : “The essence of predictivity in the sense used here is the degree to which a specific classification agrees with characters not used in the formulation of that classification.” A different formulation is to suggest that the most natural classification is the one that yields the most predictions about a newly discovered organism assigned to a taxon within it. This in the case of the discovery of an incomplete fossil specimen representing a new species, might actually be tested (but inductively!) by discovery of a second more complete specimen. Farris (1977) set out to show that ‘phylogenetic systematics’ (in fact his version of cladism could give more natural classifications, in the Mill-Gilmour sense, than phenetic classification, defined as clustering by overall similarity. Using a hypothetical data matrix of 14 characters in eight taxa, he showed that the criterion of naturalness was correlated with the co-phenetic correlation coefficient used by pheneticists, with a maximum of 1. The states for each of his 14 characters were coded on a binary system as 0 ( =symplesiomorphy) and 1 (=synapomorphy when shared b) two or more taxa within the table, autapomorphy if unique). By an ingenious trick, Farris then demonstrated that under certain circumstances clustering by o\ era11 similarity, as recommended by pheneticists, gave a coefficient of 0, while clustering by special similarity (said to be equivalent to clustering by synapomorphy) still gave a coefficient of 1. I n his original matrix both techniques gave the same (‘natural’) result, but by replicating the distribution ofstates of some of the characters a specified number of times his special similarity clustering emerged unscathed. Farris’ intention was to represent the replication as several characters sharing the same 0 or 1 distribution in all the taxa, but Janowitz (1979) pointed out that it could also be regarded as the weighting of a series of single characters, which would amount to the demonstration of something very different. Furthermore, Dr Garth Underwood pointed out to me at the present symposium (Underwood, this USE OF PARSIMONY 319 volume) that Farris’ 0 and 1 character states need not necessarily represent plesiomorphy and apomorphy respectively, but that the 1 states could be merely minority states of a character. Farris’ demonstration, therefore, is of the superiority of clustering by special similarity but not necessarily by Synapomorphy. However, Farris (1977) also claims to have applied the same techniques to “50 sets of data on real organisms” and Mickevich (1978) demonstrates the superiority of classification using clustering by synapomorphy, using stability as a criterion (contrast Gaffney 1979 : 103-104: “stability is ignorance”). What Mickevich means by stability is the congruence of classifications of the same organisms based on different sets of characters, and she shows effectively that using the Wagner tree method developed by Farris and his colleagues such congruence is significantly better than for a number of other phenetic techniques. The method automatically involves the use of parsimony by minimizing hypotheses of ‘homoplasy’ (parallel and convergent evolution). The mathematical techniques employed by Farris and his colleagues do appear to result in more ‘Gilmour-natural’ classification than phenetic techniques using overall similarity. I n the case of grouping by ‘special similarity’ as demonstrated by Farris (1977) this is obviously only the case if the plesiomorph or apomorph status of each character stated has already been correctly arrived at in advance, but there is a more important point to be made. The computations of ‘mathematical cladists’ are based on pre-existing data sets that can be expressed as a matrix of taxa plotted against character states and parsimony is handled much as pheneticists do, being inherent in the matrix. I n a sense one could refer to the extraction of a classification, or cladogram (and thus phylogeny), from such a matrix as analytic (or divisive: Sneath & Sokal, 1973 : 202-203). Non-mathematical cladists on the other hand tend to use a classificatory method which in the same sense is synthetic (agglomerative: Sneath & Sokal) (see next section). CONCLUSIONS-DOES IT MATTER? In summarizing my conclusions from this rather lengthy review one point must be made very clearly. In criticizing cladism I am not thereby advocating any other system of classification or phylogeny reconstruction. Traditional taxonomy (‘evolutionary’ or ‘eclectic’ or ‘syncretistic’ classification) has always been somewhat ad hoc in its procedure, yielding good results with competent taxonomists and bad with incompetent ones. The standard warks on procedure (Simpson, 1961 ; Mayr, 1969) are to some extent rationalizations of a tradition that is too largely intuitive. Further, I am willing to accept that the mathematical cladism of Farris and his colleagues yields more ‘Gilmour-natural’ classifications, greater congruence between classifications of the same organisms based on different data and possibly (although this is unknowable) truer phylogenies, but only when pre-existing (and unbiased and accurate) data are used and only when the data refer to living organisms, or fossils which are sufficiently complete for comparison of homologous characters to be spread right across the series. It is also the case that if phylogeny were entirely divergent (or, to include the views of non-phylogenetic cladists, homoplasy did not occur) cladistic analysis would by the nature of the definitions of plesiomorphy and apomorphy yield the best classification, even if done by rule of thumb (and by people who collected their own data) rather than by computation. 320 A. L. PANCHEN However, in the real world parallel evolution is common and in some organ systems in some taxa is known to be the dominant mode of evolution (Butler, 1982). Moreover, convergent evolution is frequently detected and is inherent in all controversies about the taxonomic position of major groups. All these things being so it is probable that the greatest danger in cladistic classification is the unassailable conviction of most cladists that their system is not just the best, but the only valid one, that the true “cladogram ‘falls out’ of the analysis in much the same way as a phenogram is the direct output of some computer program in numerical taxonomy” (Eldredge, 1979 : 170). Eldredge, while still a cladist, obviously sees the dangers more clearly than many of his colleagues when he adds a footnote on the same page: “If; as is nearly always the case when more than one character is examined, more than one possible pattern of synapomorphy is evident, it is clear that only one can be “correct”. Other apparent patterns of synapomorphy are therefore false, and the similarities upon which they are based are, by definition, parallelisms. In the case where the investigator is making conscious evaluations as to which character states are primitive and which derived, conflicting cladograms imply actual analytic error, as well as the fact of parallelism. As a result of the relatively recent production of a plethora of cladograms, parallelism turns out to be a far more common evolutionary phenomenon than even most of its more ardent aficionados had thought.” T o be more specific I believe that overconfidence, combined with the inherent difficulties in the use of parsimony when homoplasy is common and, a fortiori, with the cladists’ invalid use ofparsimony, are together sufficient to cast grave doubt on, or even invalidate the method. it‘hile some of the dangers are present in all nonmathematical cladistic analysis, there are others characteristic of neontological and palaeontological cladism respectively. The first obvious fault of cladism in general is perhaps a temperamental and, one may hope, a temporary one. This is the temptation to produce new and revolutionary groupings of organisms, or to resurrect old and discredited ones, just to demonstrate that cladism is getting results. I suspect that several regroupings of major vertebrate taxa now in preparation or in press fall into this category but it would be invidious to cite them without closer study of the results. Unfortunately this kind of exercise, often but perhaps wrongly interpreted by its critics as professional frivolity, is all too easy with the cladists misuse of parsimony. Starting with Hennig’s (1966: 1391 most important operational dictum-“Once a monophyletic group has been recognized the next task of phylogenetic systematics is always to search for its sister group,” the cladist, being human, will be anxious to reinforce the pairing he has ‘discovered’. He will therefore, and rightly, search for as many synapomorphies between his sister-groups as he can; but there are no methodological constraints on this process. The cladist can always win his parsimony vote by searching for more real or apparent synapomorphies to the neglect of apparent synapomorphies uniting either of his sister groups to a third, at the same time reassuring himself that he is subjecting his hypothesis to falsification and to Ockham’s razor. This attempt at inductive confirmation exactly parallels the case cited by Bonde (1975: 308-310), a committed cladist, concerning the argument about the diphyletic origin of tetrapods. Jarvik, in a long series ofpapers (summary in Jarvik, 1968) claimed that, while the Urodela were derived from one USE OF PARSIMONY 32 1 group of crossopterygian fishes-the Porolepiformes, the Anura, and possibly all other tetrapods, were derived from another-the Osteolepiformes. His conclusions were based initially on very detailed anatomical studies of the snout of a restricted number of species in the four groups principally involved and then extended, in an attempt at inductive confirmation, to other regions of the head. Attempts at refutation were rather unconvincing, as Bonde notes, until Jurgens (1971) demonstrated that Jarvik’s urodele snout characters also occurred in primitive Anura. This has been generally accepted as a refutation of Jarvik’s improbable theory, but it would still be open to Jarvik to defend himself using the cladist type of parsimony argument, although I am not aware that he has done so. It may be argued (correctly) that biased defence of one’s own theories is characteristic not only of cladistic groupings, but also of those traditional taxonomy, and in fact of all science. However, because cladists group by synapomorphy rather than by overall phenetic resemblance, there seems to me to be a greater danger that they will dismiss obvious common features which conflict with groupings, as due to symplesiomorphy or convergence. The extreme case of this is what I referred to above (p. 314) as a n “atrocious immunizing strategy” (against supposed ‘falsification’). ‘l’his is the onesynapomorphy grouping. Mayr, ( 1974: 1 19- 120) comments unfavourably on an example from the Diptera cited by Hennig. A notorious example is Kuhne’s (1973) attempt to demonstrate, against the consensus, that monotreme and marsupial mammals are sister-groups. I quote his summary in its entirety: “By reinterpretation of the tooth formul of Ornithorhynchus as dp/PM,-, a synapomorphy with Marsupials has been discovered. Monotremes and Marsupials are sister groups, together derived from Theria of the Middle-Cretaceous.” I also quote from the body of the paper: “if only one synapomorphy is found and recognized as such, and generally acknowledged, the problem is solved, in spite of a number of other characters, which are not synapomorphic but plesiomorphic, dubious or-under the present state of knowledge-cannot rationally be discussed” (Kuhne, 1973: 61). “An essential point: a proven synapomorphy cannot be counterbalanced by other arguments; if it cannot be refuted it has to be accepted as evidence.” Emphasis or comment on my part would be superfluous. Cladistic classification of living organisms, particularly if the terminal taxa are species, is probably less deserving of criticism than that of major taxa which include fossils. However, I believe there to be one factor which may bias the result, of which the operator may be only dimly conscious. This is the temptation to produce the most parsimonious cladogram. ‘Parsimonious’ this time, not in the sense of minimizing homoplasy, but in minimizing the number of ranks. If all the terminal taxa can be neatly paired 0% to the exclusion of monotypic taxa at any rank, then the ‘perfect cladogram’ referred to above (p. 307) can be approached. Thus, despite the absence of any other particular prejudice on the part of the operator, the classification produced will deviate from true phylogeny. This is more strongly the case again because cladists do not take overall phenetic resemblance or difference into account. The tradition of cladogram type amongst palaeontological cladists is quite other. Here the ‘Hennigian comb’ (Stufeenrezhe) type of cladogram prevails. The reasons why this is so are interesting. They are explained by Patterson & Rosen (1977 : 128, 158-159) following a convention suggested by Hennig (1969). Firstly, it was necessary to agree that fossil species or higher taxa, if they could be 322 .4. L. PANCHEN sufficiently well characterized, could appear as terminal taxa together with living species or groups. This was not then a matter of general agreement amongst cladists, but a paper by Schaeffer, Hecht & Eldredge (1972) gave powerful support. Secondly, the position of the fossils is established by reference to a previously agreed pair of living sister groups, in Patterson & Rosen’s case represented by the primithve living (‘holostean’) fish Amia and the extant Teleostei. It always seems to transpire that one sister group, in this case Amia, is more ‘primitive’ (,has more plesiomorph features) than the other, following Hennig’s (1966) deviation rule. A search is then conducted amongst appropriate fossils for groups, species, or occasionally single specimens which have one or more character states corresponding to the autapomorphies of the more advanced living group (extant teleosts). The fossil taxa are then ranged on a scala naturae of cumulative synapomorph characters until t h e full total is reached at the extant (Teleostei ) . Thus, the former autapomorphies of living teleosts become an internested series of synapomorphies of Teleostei more widely defined. This wider Teleostei is then the sister group of the group represented by Amia and any fossil relatives. However, the emphasis is always on the more apomorph extant group and the Stufenreihe leading to it: hence the ‘Hennigian comb’. In many cases the plesiomorph living taxon is not included in the cladogram, or even specified at all (e.g. Gaffney, 1979 for the major taxa of tetrapods: Fig. 2). Gaffney presents an anal>,sisof his cladogram as a series of internested three-taxon statements, but it is clear from his list of cumulative synapomorphies that it was conceived as a Stgfenwihe. It is also worth noting that some of his proposed taxa are polythetic isneath Br Sokal, 1973): e.g. “bipartite braincase ossifications” is cited as a synapomorphy of all Choanata (crossopterygian fishes plus tetrapods) while “at least partial braincase fLision” is taken to be characteristic of the contained taxon ‘Iet ra pod a . However, the worst special feature of classification by Hennigian comb, in addition to those inherent in all forms ofcladism, is the even stronger temptation to ignore contrary e\ridence, because even evidence that might suggest a symmetrical cladogram is ignored or rejected. Again, perhaps invidiously as it is only presented as a methodological example, we may cite Gaffney’s cladogram. H e gives as two of his three autapomorphies of amniotes, the presence of an astragalus and a ‘closed’ \i.e.absent) otic notch. The second feature characterises all Microsauria and the first much more distinctive one occurs in at least two families of that group (Carroll & Gaskill, 1978 ) . However, Gaffney places the microsabrs (within the polyphyletic group ’Lepospondyli’” as two steps more primitive (or two ranks up). Thus, cladistic practice when dealing with taxa including fossils is only in practice distinguishable from those old-fashioned ancestor-descendent sequences of palaeontologists which terminated in a living taxon (e.g. Watson, 1940) by the more careful specification of the h c t that the members of the sequence have their own autapomorphies iif these are cited) and by the repudiation of stratigraphic e\idence. However, most pre-Hennigian palaeontologists were well aware, at least as far as their more speculative phylogenies are concerned, that their sequences did not literally represent ancestral and descendent species. The rejection of stratigraphy is a more important point. Schaeffer et al. (1972) quite rightly remind their colleagues of the dangers of a too literal interpretation of stratigraphy and also suggest that classification of fossil or fossil plus recent organisms should be based entirely on morphology (and presumably other innate features) but that USE OF PARSIMONY 323 CHO AN ATA A r I I I I I I I I I I I I I I I I I I I I I I 7 I -I I I TETRAPODA I I I * L f I 1 I N EOT ETR APODA I I I I I I I I I I I I I I * f ! -- -4 I AMNIOTA I I I I I I I I I 8 I I I I I I I I SAUROPSIDA ’ DlAPSlDA r----4 2 3 8 P P I Figure 2. Stufenreiha-type of cladogram of the relationship of tetrapods and ‘rhipidistian’ fishes. Asterisks refer to unnamed groups. Numbers refer to Gaffney’s list of synapomorphies. (After Gaffney, 1979.) stratigraphic evidence should be used as a guide in reconstructing trees. I would strongly suggest that stratigraphic evidence can also be used as a test (but not a falsifier) of cladograms. A Hennigian comb whose dichotomies run in the reverse direction to an established stratigraphic sequence (e.g. Watson’s, 1919, 1951, ‘trends’ in the evolution of temnospondyl Amphibia) needs more than the invalid use of ‘parsimony’ for its defence. In conclusion, it is fair to note the enormous distance that cladism has come in its short career. Like the early years of ‘numerical taxonomy’ before it, it passed, or is passing, through a phase of arrogant and repellent dogmatism which likewise unleashed a storm of often ill-informed criticism. Its theoretical structure has also changed enormously, to the point where certainly for some of its adherents ‘phylogenetic systematics’ is a total misnomer. Before its methodology can become trustworthy, however, it will have to abandon its pretence to the hypotheticodeductive method and to parsimony as anything other than ad hoc method of comparing alternative groupings. It will also have to come to terms with traditional taxonomy and the problems of weighting, character correlation (Riedl’s 1979 concept of ‘Burden’ is a particularly valuable contribution here) and the functional significance of characters. I t may also be the case that a cladistic I5 A. L. PANCHEN 324 approach may be more suitable in some cases and a more orthodox one in another. I cite Fortey (3r Jefferies i1982) in theirjoint enterprise which has all the boldness of the peace mo\.ement in Yorthern Ireland. Finally, and perhaps the bitterest pill of all, the cladists may ha\re to admit the persistent intuitive element in classification, as in all the best science. Only then will cladism, if still distinct and recognizable, be (in Hennig’s oft-repeated phrase) “the general reference system of biology”. ,ACKNOM’LEDGEMESTS I t is a pleasure to record the stimulus of discussions at the meeting of which this \.olume is a record. I would particularly like to acknowledge helpful debate with Drs K . A. Joysey, II\. E. Friday, Garth Underwood, Colin Patterson and Professor P. h4. Butler. Dr Jane Heal, of the Department of Philosophy, University of Sewcastle very kindly read and commented on the manuscript under great pressure of time and Mrs E. B. Kinghorn typed the manuscript under the same constraint. REFERESCES .\BEI.. 0..1929. P a h o b d o g i e und S/ammesgesrhir-hte. Jena : Fischer. ACHINSTEIN. P., 1968. (Review of) Karl R. Popper: Conjectures and Refutations. British Journal JOT the Philosopiy .fScience, 19: 159--180. ASHLOCK, P. D., 1971. Monophyly and associated terms. SyshzoliC ~ o o l o g y ,20: 6 3 4 9 . ASHLOCK, P. D.. 1972. Monophyly again. Sysfmafic ~ o o l o g y 21: , 43CF438. .ASHLOCK, P. D..1980. An evolutionar). systematist’s view of classification. $ystonalk <oology, 28: 441-450. B.\ER. K.E. von. 1828. Eiitwi-tlrn~s~~rsrhirhte der T h i u e : Beobachtuns und Repexion. Konigsberg : Borntrager. BOCK. W. J . , 1974. Philosophical foundation of classical evolutionary classifications, Systonatu <oology, 22: ‘575 392. BOCK. W.J.% 1977. Foundation and methods of evolutionary classification. I n M. K. Hecht, P. C. Goody & B. X4. Hecht :F.ds‘. Major Pa/terns in lbrfebratr I<ioluiion: 851 895. S e w York: Plenum. BOSDE, S . . 1975. Origin of “higher groups”: viewpoints of phylogenetic systematics. Colloqun internatzonaux du Centre natinnal dr la Rcrherche Scimtifiqiie. 218: 29:1- r324. RONDE, N.. 1977. Cladistic classification as applied t o vertebrates. I n M. K.Hecht. P. C. Goody Br B. M. Hecht iEds Major Pattermi in Ibrtebrate Eoolu/ion: 741 - 4 M . New York: Plenum. BONDI, H. & KILMISTER. C. \V.. 1959. The impact of I-ogik der Forschung. British Journal fit the Philosophy of Srirrur. 10: 55 ~ 3 7 . BRCSDIN, L.. 1968. .\pplication of phylogenetic principles in systematics and evolutionarv theory. .Vobe/ .Smposium. 4 : 173 495. BRUSH, S. G., 1974. Should the history of science he rated X ? Science, .V. Y.,183: 1164- 1172. BCRTT, B. L.. 1964. :\ngiosperm taxonom) in practice. In V. H. Heywood & J. McNeill (Edsj, Phenelic and Phvlogtnetic Clnssification: 5 -16. London : Systematics Association. BCTLER. P. bl.. 1982. Directions of e\aoltition in the mammalian dentition. In K.A. Joysey & A. E. Friday (Edsi, Problems of Phvlogenttu Reconstruction: 235244. London : Academic Press. C.\AMIS.J . H . Sr SOK.AL, R . R. 1965. .A method fnr deducing branching sequences in phylogeny. khlution, 19: 31 1 :326. CARROLL, R. L. & CASKILL, P., 1978. The order Microsauria. Memoirs. .4meruan Philosophical Sock&, 126: Ik211. CR.\C:RAFI’, .I.. 1978. Science. philosophy and systematics. $rJ.tematu .Soology. 27: 21 3 216. CR:\C:R.IFT. J.. 1979. Phyloqenetic anal)sis, evolutionary models, and paleontology. I n J. Cracraft Pr N. Eldredge (Eds,,Phvlogenetu .4nnbxir and Paleonkolou: 7 -39. New York: Columbia University Press. EDW.ARDS, A. W. F. & CAVALLI-SFORZA, L. O., 1964. Reconstruction of evolutionary trees. I n V. H. Heywood and J . McSeill (Eds;, Phenetic and Phylogmtir ClasstJication: 67- 76. London: Systematics Association. ELDREDGF., N.. 1979. Cladism and comnion sense. In J. Cracraft & N. Eldredge (Eds), Phjlogenrtic :lnalvsis and Palennfology: 165-198. New York: Colunibia Cniversit! Press. ELDREDGE. 3.& TATTERS.ALL. I., 1975. Evolutionar) models, phylogenetic reconstruction, and another look at hurninid phylogeny. Cnntributionr in Primatolou. 5: 218-242. FARRIS. J. S.. 1974. Formal definitions of paraphyly and plyphyly. 5jsfematir ~ a o l o p 23: . 548 554. F.ARRIS. J. S . . 1977. O n the phenetic approach to vertebrate classification. In M. K. Hecht, P. C. Goodv p ’ h l . Hecht IEdsi, .Major Putterm in I-brtebrate Emlution: 823-850. New York: Plenum. 8. USE O F PARSIMONY 325 FARRIS, J. S., 1979. On the naturalness of phylogenetic classification. Syskmatic zoologv, 28: 200-214. FARRIS, J. S., 1980. The information content of the phylogenetic system. Systematic <mlogv, 28: 483-519. FITCH, W . M., 1979. Numerical taxonomy: a special project (11). syJfcmatic (oology, 28: 254-255. FITCH, W. M. & MARGOLIASH, E., 1967. Construction of phylogenetic trees. S c k e , N.Y., 155: 27!+284. FORTEY, R. A. & JEFFERIES, R. P. S., 1982. Fossils and phylogeny-a compromise approach. In K. A. Joysey & A. E. Friday (Eds), Problems of Phylogcnetic Reconstruction: 197-234. London: Academic Press. FOX, R. C., 1978. Cladograms as strictly universal statements. (Privately circulated) 7 pp. GAFFNEY, E. S., 1975. A phylogeny and classification of higher categories of turtles. Bulletin of fhe American Museum of .Natural History, 155: 387-436. GAFFNEY, E. S., 1979. An introduction to the logic of phylogeny reconstruction. In J. Cracraft & N. Eldredge (Eds), Phylogenetu Analysis and Palaeontolou: 79-1 11. New York: Columbia University Press. GARDINER, B. G., 1980. Tetrapod ancestry: a reappraisal. In A. L. Panchen (Ed.), The Terrestrial Enuiromnent and the Origin of Land Vertebrates: 177-185. London: Academic Press. GILMOUR, J. S. L., 1961. Taxonomy. In A. M. MacLeod & L. S. Cobley (Eds),Confcmporary Botanical Thought: 2745. Edinburgh: Oliver & Boyd. GILMOUR, J. S. L. & WALTERS, S. M., 1963. Philosophy and classification. In W. B. Tunill (Ed.), Virtas in Botary, 4 : 1-22. London: Pergamon. GOULD, S. J., 1977. Ontogeny and Phylogeny. Cambridge, Massachusetts: Harvard University Press. HARRIS, E. E., 1972. Epicyclic Popperism. British Journal f o r the Philosophy of Science, 23: 55-67. HECHT, M. K., 1976. Phylogenetic inference and methodology as applied to the vertebrate record. Evolutionarv Biology, 9: 335-363. HECHT, M. K. & EDWARDS, J. L., 1976. The determination of parallel or monophyletic relationships: the proteid salamanders-a test case. American Naturalist, 110: 653-677. HECHT, M. K. & EDWARDS,J. L., 1977. The methodology of phylogenetic inference above the species level. In M. K. Hecht, P. C. Goody & B. M. Hecht (Eds), Major Patterns in Vertebrate Evolution: 3-51. New York: Plenum. HENNIG, W., 1950. Grundzuge einer Theorie der phylogcnetzschen Systemotik. Berlin : Deutscher Zentralverlag. HENNIG, W., 1966. Phylogenctk SysUmatics. Urbana: University of Illinois Press. (Reprinted 1979.) HENNIG, W., 1969. Die Sfummesgeschuhte dcr Insckten (Senckenberg Buch 49). FrankfurtlMain : Kramer. HULL, D. L., 1980. The limits of cladism. Systematic <mlogy, 28: 416-440. JANOWITZ, M. F., 1979. A note on phenetic and phylogenetic classifications. Systematu zoo lop^, 28: 197-199. JARVIK, E., 1968. Aspects of vertebrate phylogeny. Nobel Symposium, 4 : 497-527. JURGENS, J. D., 1971. The morphology ofthe nasal region ofAmphibia and its bearing on the phylogeny of the group. Annule uan die Universiteit uon Stellenbosch, 46 (A): 1-146. KEMP, T . S., 1980. Origin of the mammal-like reptiles. Nature, London, 283: 368-372. KITTS, D. B., 1977. Karl Popper, verifiability, and systematic zoology. Systematic <oolog, 26: 185-194. KLUGE, A . G. & FARRIS, J. S., 1969. Quantitative phyletics and the evolution of anurans. Svsfematic <oolopv, 18: 1-32. KRIPKE, S. A,, 1980. Naming and Necessip (Revised ed.). Oxford: Blackwell. KUHN, T. S., 1962. The Stnrcturc of Scimtrfi Revolutions. Chicago: Chicago University Press. KUHN T. S., 1970. The Structure of Scientzz Revolutions (2nd ed.) Chicago: Chicago University Press. KOHNE, W. G., 1973. The systematic position of monotremes reconsidered (Marnmalia). Zeifschriji f i r Morphologie der Tiere, 75: 59-64. LAKATOS, I., 1970. Falsification and the methodology of scientific research programmes. In I. Lakatos & A. Musgrave (Eds), Cn'tz'cUm and the Growth af Knowledge: 91-196. Cambridge: Cambridge University Press. McKENNA, M. C., 1975. Towards a phylogenetic classification of the Mammalia. Contribufionsin Primatology, 5: 21-46. McKINNEY, H. L., 1972. Wallace and Natural Selection. New Haven: Yale University Press. MAYR, E., 1965. Numerical phenetics and taxonomic theory. Qstematic < o o l o ~ , 14: 73-97. MAYR, E., 1969. Principles of Systematic < o o l o ~ . New York: McGraw-Hill. MAYR, E., 1974. Cladistic analysis or cladistic classification. <eitschriff f i r <oologische Svstematik und Evolutionsforschung: 12: 94-128. MEDAWAR, P. B., 1967. The Art ofthe Soluble. London: Methuen. MICKEVICH, M. F., 1978. Taxonomic congruence. Svstematic < o o l o ~ ~27: . 143-158. MILES, R. S., 1973. Relationships of acanthodians. In P. H. Greenwood, R. S. Miles & C. Patterson (Eds), Interrelationships of Fishes: 63-103. London: Academic Press. MILES, R. S., 1975. The relationships of the Dipnoi. Colloques internafionaux du Centre .National de la Recherche Scientijque, 218: 133-148. MILL, J. S., 1872. Book IV: ofoperations subsidiary to induction. A &sum of Lug; Ratiocinative and Inductive (8th ed.). London: Parker. [in J. M. Robson (Ed.) 1974. John Stuarf Mill: Collected Worhs8.Toronto: University Press]. NELSON, G., 1970. Outline of a theory of comparative biology. Systematic <oology, 19: 373-384. NELSON, G., 1971. Paraphyly and plyphyly : redefinitions. Systematic <oology, 20: 471-472. NELSON, G., 1972. Phylogenetic relationship and classification. Sysfematk <oology, 21: 227-231. NELSON, G . , 1974. Classification as an expression of phylogenetic relationships. Systematic <oologv, 22: 344-359. 326 A. L. PANCHEN NELSON, G.. 1978. Classification and prediction: a reply to Kitts. Systematic <oology, 27: 216-218. NELSON, G.. 1979. Cladistic analysis and synthesis: principles and definitions, with a historical note on Adanson’s Fandles dts Plantes i1763 1764). Svstemafic <oologv, 28: 1-2 1. PANCHEN, .4. L., 1972. The interrelationships of the earliest tetrapods. In K. ,4. Joysey & T. S. Kemp (Eds), Sfudies in l’eerttbratt Ez,olution: 65 87. Edinburgh: Oliver & Boyd. PANCHEN, .A. L., 1979. The cladistic debate continued. .Vafure, London, 280: 541. PATTERSOS, C.. 1978. Verifiability in systematics. Systematic <oology, 27: 218-222. PA’ITERSON, C., 1982. Morphological characters and homology. In K. A. Joysey & A. E. Friday (Eds), Problems of Phylogenetu Reconstruction: 21-74. London: Academic Press. PrZTTERSON, C. & ROSEN, D. E.. 1977. Review of ichthyodectiform and other Mesozoic teleost fishes and the theory and practice of classifying fossils. Bulletin of the American Museum of .Vatural History,158: 81--172. PLATNICK, 3.I.. 1977. Paraphyletic and polyphyletic groups. $Wematic <ooLogv, 26: 195-200. PI,.AT?IICK, 5 . I., 1980. Philosophy and the transformation of cladistics. Systematic <oologv, 28: 537 ~ 5 4 6 . PLATNICK, X. I . & CAMERON, H. D., 1977. Cladistic methods in textual, linguistic, and phylogenetic analysis. Systematic Joologp. 26: 380 385. PLATSICK, N. I . Br GAFFNEY. E. S.. 1977. Systematics: a Popperian perspective. Systematic <oologv, 26: 360 365. PIATNICK. S . I. & G.AFFNEY, E. S., 1978a. Evolutionary biolog-): a Popperian perspective. Systematic <oologv. 27: 137 141. PLATNICK. N. I. Br GAFFNEY. E. S., 1978b. Systematics and the Popperian paradigm. Systematic <oo/ogy, 27: 38 1- 388. POPPER, K.R., 1934. LogzA der Forsrhuq. l‘ienna: Julius Springer. POPPER, K.R., 1959. The f.ogzr of Srientzfi Dirrozry.. London: Hutchinson. POPPER, K. R., 1963. Conjectures and Rtfrutations-tht Growth o f & k t z ~Knowledge. London: Routledge & Kegan PdUl. POPPER, K. R., 1972. Objectioe Knaudedge--on Evolutionary ,4pproach. Oxford: University Press. POPPER, K.R.. 1974. Replies to mv critics. In P. A. Schilpp ( t d . ) The Philosophy of Karl Popper: 961-1 197. The Library of Living Philosophers. 14. La Salle, Ill. : Open Court. PC‘TSAM, H., 1975. .Wind, f.anguage and Realitr: Philosophiral Papers, Iblume 2. Cambridge: University Press. RIEDL, R., 1979. Order in Liring Organism. Chichester: J. LViley. ROMERO-HERRER.4, A. E., LEHMANS. H., JOYSEY, K. A. & FRIDAY. A. E., 1973. Molecular evolution of myoglobin and the fossil record: a phylogenetic synthesis. .Vature, London, 246: 389-395. ROMERO-HERRERA. .A. E., LEHMASN, H., JOYSEY. K.A. & FRIDAY, A. E.. 1978. On the evolution of tnyoglobin. Philosophiral Transactions of the Riyal Socieg of London, ( B ) 283: 61- 163. SHAEFFER. B.. HECHT, M. K.& ELDREDGE, 5..1972. Phylogeny and paleontology. Evolutionary Biology. 6: ~ 3 1 46. SHORROCKS, B . . 1978. The Genesh ofDizwsi@. London: Hodder & Stoughton. SIMPSON. G. G., 1953. Tht Major Features of Evolution. New York: Columbia University Press. SIMPSOS, G . G.. 1961. Prinr$les of Animal TaxonoEv. New York: Columbia University Press. SIMPSON, G. G.. 1975. Recent advances in methods of phylogenetic inference. Conh’bulions in Primalob& 5: :i-19. SNEATH. P. €I. .%.,1961. Recent developments in theoretical and quantitative taxonomy. Systematic.<oologv, 10: 118 139. SNErlTH, P. H. A. & SOK.AL, R. R., 1973. ,Vumeriral Taxonomy. San Francisco: Freeman. S O U L , R. R. & SNEATH, P. H . A., 1963. PTinciplcs of Numerical T Q X O San ~ JFrancisco: . Freeman. TATTERS.4LL. I. & ELDREDGE, N.. 1977. Fact, theory and fantasy in human paleontology. American Scientist, 65: 204- 21 1. WATSON, D. M. S., 1919. The structure, evolution and origin of the Amphibia-The “Orders” Rachitomi and Stereospondyli. Philosophical Transactions of the Royal SociCg of London, ( E ) 209: 1-73. WATSON, D. M. S., 1940. The origin of frogs, Transactions ofthe Royal Sock9 ofEdinburgh, 60: 195-231. WATSON, D. M. S., 1951. Paleonfalagy and Modern Biology. New Haven: Yale University Press. WILEY, E. 0.. 1975. Karl R. Popper. systematics, and classification: a reply to Walter Bock and other evolutionary taxonomists. Srstematic <wlogv, 24: 233- 242. ADDENDUM .4 recent paper by Rohlf & Sokal (1980) severely criticizes the attempt by Mickevich ( 1978) to demonstrate the superiority of mathematical cladistics (‘phylogenetic systematics’) to phenetic techniques (see p. 3 19 above). Mickevich appears to identify ‘stability’ (as a criterion for judging between taxonomic methods) with congruence between classifications of the same organisms derived from different character sets. Rohlf & Sokal refer to four different definitions of USE OF PARSIMONY 327 stability (the first two from Sneath & Sokal, 1973) none of which coincides with Mickevich’s. Of these four the first approximates to her definition of congruence, but whereas she compares classifications based on discrete character sets (e.g. larval u. adult, male u. female), Sneath & Sokal suggest the criterion that a classification should change little, if at all, when additional characters (n2) are considered together with the original set (n,). Rohlf & Sokal thus contend that Mickevich’s results are not a test of stability but of the largely abandoned nonspecificity hypothesis (Sneath & Sokal, 1973 : 97ff.). They further charge that she biassed her results by recording the data sets uniformly for all the taxonomic methods rather than using the coding technique most suitable to each method, and also that she imposed the inherent restrictions of her favoured technique (unrooted Wagner trees) on all the others. However, probably the strongest evidence they cite of apparent bias is in the presentation of her results. These are summarized (Mickevich, 1978: table 1) as mean rank, which gives a spurious impression of the degree of superiority of the Wagner method, particularly over the best phenetic technique (UPGMA). The mean rank ratio between the two is 2.24, whereas the ratio of their respective mean consensus information indices is only 1.14. Both these means are tabulated by Rohlf & Sokal, but only the former by Mickevich. Lastly, as noted by Rohlf & Sokal, whereas the Wagner method (m.c.i. index 0.593) appears somewhat better than the best phenetic technique (0.518) two other cladistic techniques (CaminSokal/WISS :0.472 and Le Quesne : 0.459) were decidedly worse, with the latter the worst of all the eight methods tested. In a rejoinder Mickevich (1980) attempts to answer all Rohlf & Sokal’s criticisms. With some justification she rejects their (and Sokal & Sneath’s, 1973) first definition of stability, suggesting that testing a classification by adding a new set of characters to the pre-existing set gives a spurious impression of stability by the conflation of two sets of data. It should be noted, however, that this is precisely how a non-mathematical cladist confirms his prejudices by the use of parsimony, whether he introduces a minority of incongruent ‘homoplastic’ characters or bolsters his parsimony ‘vote’ by seeking additional apparent synapomorphies (see p. 320). Her answer to the charge of biassing the results by the coding technique etc. is a simple (or rather complex) denial. However, she does not satisfactorily answer their criticism of her boosting the apparent significance of her results by the misleading presentation of mean ranks without mean consensus information indices or some other valid summary measure. In the light of Rohlf & Sokal’s criticism and Mickevich’s reply I now regard her attempted demonstration as much less significant and convincing than I did when the main manuscript of this paper was completed. Ghiselin ( 1980) reviews Cracraft & Eldredge (1979), a symposium from which several of our references are quoted (Cracraft, 1979; Gaffney, 1979; Eldredge, 1979). In his review he comments on the cladist’s proprietorial claim to be using Popper’s methodology, and notes (as I do) the ad hoc use of the hypothesis of parallelism as a by-product of their use of parsimony-similarly for convergence : “If we merely cry ‘Convergence!’ whenever we encounter a contradiction, no datum under the sun will lead us to correct our errors”. ADDITIONAL REFERENCES ROHLF, F. J. & SOKAL, R. R., 1980. Comments on taxonomic congruence. Systemtic &d~gv, 29: 97-101. A. L. PANCHEN 328 MICKEVICH, M . F., 1980. Taxonomic congruence: Rohlfand Sokal’s misunderstanding. S J J ~ O<oo.bgy, ~ L 29: 162-176. GHISELIN, M. T., 1980. Phylogenetic mythogenais and paleontology, [review of Phylogmtic Analysis and PuleontologV . . . (JoelCracraft & Niles Eldrege, Eds). New York: Columbia U.P.]. Evolution, 34: 822-824. CRACRAFT,J. & ELDREDGE, N. (Eds),Phylogmfic Analysis and Pukontology. New York: Columbia University Press.
© Copyright 2026 Paperzz