Seediscussions,stats,andauthorprofilesforthispublicationat: https://www.researchgate.net/publication/226635673 Somepedagogicalaspectsofproof ArticleinInterchange·February1990 DOI:10.1007/BF01809605 CITATIONS READS 146 321 1author: GilaHanna UniversityofToronto 57PUBLICATIONS1,042CITATIONS SEEPROFILE Availablefrom:GilaHanna Retrievedon:28May2016 Interchange, Vol. 21, No. 1 (Spring, 1990), 6-13 Some Pedagogical Aspects of Proof Gila Hanna The Ontario Institute for Studies in Education In discussing the role of proof in mathematics education it seems helpful to distinguish among different perceptions of proof. It is particularly useful, I believe, to consider the following three aspects, around whichi will organize my comments. 1. Formal proof: proof as a theoretical concept in formal logic (or metalogic), which may be thought of as the ideal which actual mathematical practice only approximates. 2. Acceptable proof: proof as a normative concept that defines what is acceptable to qualified mathematicians. 3. The teaching of proof: proof as an activity in mathematics education which serves to elucidate ideas worth conveying to the student. Formal Proof A formal proof of a given sentence is a finite sequence of sentences such that the first sentence is an axiom, each of the following sentences is either an axiom or has been derived from preceding sentences by applying rules of inference, and the last sentence is the one to be proved. Such a proof is, in principle, mechanizable, and eliminates the psychological aspects of proof. The formal approach to proof was developed, in fact, to eliminate the need for recourse to intuitive evidence and human judgment, both seen as potential sources of serious error. Formalism has a long history. The Greeks made a distinction between the content of an argument and a formal structure based upon the rules of thought and independent of the content. Aristotle and Plato may have held different views on the genesis of the formal rules of thought, as they did on the nature of universal properties and laws in general, but they both recognized the usefulness and power of such rules. The promise offered by a separation of content and form was pursued by other philosophers, notably by Leibniz (1646-1716) who believed that it was possible to elaborate a characteristica universalis, a symbolic script with the power to express all human thought without ambiguity. Knowledge could then be transmitted through an algebra which would allow one to distinguish clearly between fact and fiction, truth and falsity. Should any arguments arise, they could be easily settled: "Gentlemen, let us calculate!" Such a language, in his view, would make truths "stable" "visible;' "irresistible," and "mechanical." The formalization of mathematical knowledge in particular was further elaborated by Frege (1848-1925) who devised a formal conceptual language (Begriffsschrift) embodying explicit criteria for the validity of chains of inference. Frege also attempted to demonstrate that all arithmetical notions could be reduced to purely logical concepts -- an approach which Hilbert (1862-t943) took several steps further. In his Grundlagen der Geometrie (1899), Hilbert showed that to understand the properties of concepts such as point, line, and plane one need not rely upon knowledge of objects existing outside of mathematics. These concepts could usefully be assumed within a formal axiomatic system that defined what signs and combinations of signs were valid expressions and what constituted a valid derivation. New truths about these geometrical concepts could then be found by mere derivation from the axioms and definitions of the system. 6 Interchange Vol. 21/1 e'I'he Ontario Institute for Studies in Education SOME PEDAGOGICAL ASPECTS OF PROOF 7 Within such a formalism, there clearly can be no appeal to a metaphysical definition of truth: the truth of a statement hinges entirely upon the axioms and the internal consistency of the system itself. Thus, to demonstrate the full power of the formal approach to mathematics, it is necessary to be able to establish definitively, for any given formal system, that it is or is not consistent. But this requirement proved to be unattainable when G6del showed, in 1931, that the formalization of a theory does not ensure that its consistency can be definitively established. Furthermore G6del demonstrated that in a consistent formal system there will be theorems (true statements) which are not provable within the system: such a system will always be incomplete. The implication is that a formalization of a branch of mathematics, for example, can never encompass all of" its theorems. There are other limitations which a strict formalism might impose on mathematics. Tymoczko (1986) underlined the potential sterility of a purely formal approach by describing how it might change the work of the mathematician: The " . . . ideal mathematician chooses a certain axiom system to investigate.., turns on valid formal deductions in the appropriate formal s y s t e m . . , is often characterized as a game player, playing with formal systems" (p. 45). Tymoczko pointed out that if all mathematical activity were to fit this description, then mathematics would become a very limited science, oblivious in fact to mathematical content. Alternatively, mathematics would become a very broad science indeed, redefined as the investigation of formal theories in general - - in which case, as Quine put it, mathematics might as well investigate formal theories in Greek mythology. While G6del's findings placed severe limits upon the power of the formal approach, and others such as Lakatos and Tymoczko have challenged it on philosophical grounds, these concerns have not diminished the usefulness of formal methods in the treatment of segments of mathematics or in individual proofs. And deductive rigour, one of the essential elements of formalism, has acquired the status of an important tool in mathematical work and an ultimate criterion which mathematicians can fall back on when required. Yet a concern for precision in definition and rigour in deduction was never unique to the strictly formalist approach. This concern was shared by the togicist, formalist, and intuitionist schools whose important work on the foundations of mathematics gave enormous impetus to the growth of mathematics after the turn of the century and has had a lasting effect upon mathematical practice. The new stress on precision and rigour was also reflected in the syllabus of the universities, where the axiomatic approach became a common denominator of most mathematics courses. This trend was furthered in particular by the considerable influence enjoyed by the group of talented French mathematicians who wrote under the name of Bourbaki: The axiomatic method teaches o n e . . , to find the common ideas buried under the external apparatus of detail appropriate to each of the theories considered, to single out these ideas, and to exhibit them. (Bourbaki, 1971, p. 26) It is not surprising that these developments also influenced the educators who prepared the various "new math" curricula for the secondary schools. What is perhaps most interesting is that these educators not only assumed that rigorous proof was reflective of mathematical practice, but also seem to have held the underlying belief that formal derivations are, in themselves, an aid to understanding and thus constitute a useful didactic device (Hanna, 1983). Acceptable Proof In the last couple of decades both mathematicians and mathematics educators have begun to reassess the role of axiomatic structures and formal proof. It has come to be more wide- 8 GILA HANNA ly recognized, in particular, that practising mathematicians have long agreed that proofs may have different degrees of formal validity and still gain the same degree of acceptance. This has led to a reconsideration of what the ideal proof should be and of what one ought to teach in the schools. Mathematicians freely admit that a proof may lack conviction when it is shown to be valid by virtue of its form alone, without regard to its content. And when a new mathematical idea is in fact accepted, it is often the significance of what is proved, rather than the correctness of the proof, which carries the most weight. The acceptance of a theorem by practising mathematicians is a social process which is more a function of understanding and significance than of rigorous proof. It was the fallibility of social processes and of the human judgment embodied in them, ironically enough, which provided the impetus for the development of systems of formal validation. Yet mathematicians undeniably insist on using their judgment when deciding whether or not to accept a new proof-- and continue to accept such proofs for their semantic rather than their syntactic qualities. The criteria which mathematicians appear to apply, consciously or unconsciously, are that the proof must proceed from specific and accepted premises, must present an argument that is not flawed, and must lead to a result which, even if unexpected, seems upon reflection to make sense in the context of other mathematical knowledge. The implicit assumption may well be that a proof which passes these tests of acceptability would also pass the test of formal validation, but in most cases an actual formal validation is not regarded as necessary. It can certainly b e argued, as does Fetzer (1988), that "what makes (what we call) a proof a proof is its validity rather than its acceptance (by us) as v a l i d . . , social processing, therefore, is neither necessary nor sufficient for a proof to be valid" (p. 1050). Yet most mathematicians choose to proceed with their work without concern for the formality of their proofs. It is true that a proof is considered a prerequisite for the publication of a theorem, but the proof need be neither rigorous nor complete. In practice, mathematicians seem to adhere to Bateson's view that "the advantages in scientific thought come from a combination of loose and strict thinking, and this combination is the most precious tool of science" A factor in the general acceptance of a new result is, of course, the value which mathematicians see in it. The same is true of proofs. Mathematicians seem to value most a proof which has interesting implications for the branch of mathematics in which it is embedded and yields insights into related fields. A proof is valued for bringing out essential mathematical relationships rather than for merely demonstrating the correctness of a result. The Teaching of Proof In recent years many mathematics educators have actively re-examined the role of mathematical proof in the curriculum, and as a result there has been a trend away from what has often been seen as an over-reliance on rigorous proofs. In a desire to take into account the role of proof as a means of communication, and in recognition of the social processes that play such a crucial part in the acceptance by mathematicians of a new result, educators have come to place greater emphasis on the concept of proof as "convincing argument." This trend away from formal proof in the curriculum, and the resulting search for alternative ways of demonstrating the validity of mathematical results in the classroom, has motivated a number of studies dealing with the problem of teaching proof. Leron (1983), concerned that most of the formal proofs found in textbooks do a poor job of communicating mathematical ideas, suggested that such mathematical presentations would be much more SOME PEDAGOGICAL ASPECTS OF PROOF 9 comprehensible if the proof were structured into short autonomous modules, each emphasizing one particular idea. Deploring the teaching of geometry as narrow and overly concerned with deductive proof, Volmink (1988) argued that mathematics education would be better served if the curriculum were to place greater emphasis on the social criteria for the acceptance of a mathematical truth, at the expense of the purely formal ones. Movshovitz-Hadar (1988) elaborates upon six different ways of presenting theorems and six ways of presenting proofs, in an effort to enhance mathematical understanding through what she calls the "stimulating responsive method?' Alibert (1988), on the other hand, prefers to rely on the method of scientific debate, which provides students an opportunity to discuss the arguments made in the course of a proof. In an extensive study of the processes involved in teaching a mathematical proof, Balacheff (1988) also points to the importance of creating classroom situations in which the student becomes aware of the complexity of the problem and of the necessity to produce valid arguments. These authors, and others not cited here, have made a substantial contribution to our understanding of the didactics of proof and have offered specific and interesting new ways of teaching proofs. In their expositions, however, a proof is viewed primarily as a valid argument, as opposed to an argument that must be both valid and explanatory. I believe it would be useful to introduce to the discussion an explicit distinction between proofs that prove and proofs that explain. In what follows I will first elaborate upon the concept of proof as explanation, and then consider its implications for the handling of proof in the curriculum, suggesting that whenever possible we should present to students proofs that explain rather than ones that only prove. A proof that explains and a proof that proves are both legitimate proofs. By this I mean that both types of proof meet the requirements for a mathematical proof, and thus serve in equal measure to establish the validity of a mathematical assertion. Both consist of statements that are either axioms themselves or follow from previous statements (and thus eventually from axioms) as a result of the correct application of rules of inference. They are not necessarily different in their degree of rigour, and both types would be recognized as valid by the mathematical community. There is nevertheless a very important difference between these two kinds of proof. A proof that proves shows only that a theorem is true; it provides evidential reasons alone. It is concerned only with substantiation, with what are known as Rationes cognoscendi, that is, why-we-hold-it-to-be-so reasons. A proof that explains, on the other hand, also shows why a theorem is true; it provides a set of reasons that derive from the phenomenon itself: Rationes essendi, or why-it-is-so reasons. A proof that proves may rely on mathematical induction or even on syntactic considerations alone. But a proof that explains must provide a rationale based upon the mathematical ideas involved, the mathematical properties that cause the asserted theorem to be true. The sense in which I use the term explanation in this context is perhaps best clarified in contradistinction to that of Balacheff. In his analysis of the cognitive and social aspects of proof, Balacheff (1988) proposed the following distinctions: • We call an explanation the discourse of an individual who aims to establish for somebody else the validity of a statement. The validity of an explanation is initially related to the speaker who articulates it. • We call proof an explanation which is accepted by a community at a given time. • We call mathematicalproofa proof accepted by mathematicians. As a discourse, mathematical proofs have now-a-days a specific structure and follow well defined rules that have been formalized by logicians. (p. 2) For Balacheff, then, a proof would seem to be an explanation by virtue of being a proof. 10 GILA HANNA Yet surely not all proofs have explanatory power. One can even establish the validity of many mathematical assertions by purely syntactic means; with such a syntactic proof one essentially demonstrates that a statement is true without ever showing what mathematical property makes it true. Thus I prefer to use the term explain only when the proof reveals and makes use of the mathematical ideas which motivate it, Following Steiner (1978), I wilt say that a proof explains when it shows what "characteristic property" entails the theorem it purports to prove. As Steiner put it: an explanatory proof makes reference to a characterizing property of an entity or structure mentioned in the theorem, such that from the proof it is evident that the results depend on the property. It must be evident, that is, that if we substitute in the proof a different object of the same domain, the theorem collapses; more, we should be able to see as we vary the object how the theorem changes in response. (p. 143) • . . The following example illustrates the difference between a proof that proves and a proof that explains: Prove that the sum of the first n positive integers, S(n), is equal to n(n+l)/2. A proof that proves Proof by mathematical induction: For n=l the theorem is true• Assume it is true for an arbitrary k. Then consider: S(k+l) = S(k) + (k+l) = n(n+l) + (n+l) = (n+l)(n+2) 2 2 Therefore the statement is true for k+l if it is true for k. By the induction theorem, the statement is true for all n, Now, this is certainly an acceptable proof: it demonstrates that a mathematical statement is true. What it does not do, however, is show why the sum of the first n integers is n(n+l)/2 or what characteristic property of the sum of the first n integers might be responsible for the value n(n+l)/2. (Proofs by mathematical induction are non-explanatory in general.) Gauss's proof of the same statement, however, is explanatory because it uses the property of symmetry (of two different representations of the sum) to show why the statement is true. It makes explicit reference to the symmetry, and it is evident from the proof that its result depends on this property: A proof that explains Gauss's proof is as follows: S S . . = 1+ 2 = n + (n-l) . . . . . . . . . . . . . . . . . . . + ... + ... . . . . . . . . . . . . . . 2S = (n+l) + (n+l) + . . . S _ n(n+l). 2 + n + 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . + (n+l) = n (n+l) Another explanatory proof of this same statement is, of course, the geometric representation of the first n integers by an isosceles right triangle of dots; here the characteristic property is the geometrical pattern that compels the truth of the statement. We can represent the sum of the first n integers as triangular numbers (see Figure 1). SOME PEDAGOGICAL ASPECTS OF PROOF . ,:. 1 1+2 11 ,fi 1+2+3+4 1+2+3 Figure 1 The dots form isosceles right triangles containing S(n) = 1 + 2 + 3 + . . . + n d o t s Two such sums S(n) + S(n) give a square containing n 2 dots and n additional dots because the diagonal of n dots is counted twice. Therefore: 2S(n) = n 2 + n S(n)- n2 + n 2 i_ n(n+l) 2 Another explanatory proof would be the representation of the first n integers by a staircaseshaped area as follows: a rectangle with sides n and n + l is divided by a zigzag line (see Figure 2). n+l The whole area is n(n+l), and the staircase-shaped area, 1 + 2 + 3 + . . . + n only half, hence n(n+l) 2 4 4 1 Figure 2 Both Gauss's proof and the geometric representation show that one can adopt an explanatory approach to proof in the classroom without abandoning the criteria of legitimate mathematical proof and reverting to reliance on intuition alone. What one must do, rather, is to replace one proof, of the non-explanatory kind, by another equally legitimate proof which has explanatory power, the power to bring out the mathematical message in the theorem. In their paper "Wann ist ein Beweis ein Beweis?" (When is a proof a proof?.), Wittmann and Mueller (1988) characterize explanatory proofs - - of which they furnish an interesting example - - by a term which makes reference to content (inhaltich-anschaulich). They state: "In German we use the term "inhaltich-anschaulicher Beweis" in order to indicate that this method of demonstration calls upon the meaning of the term employed, as distinct from abstract methods, which escape from the interpretation of the terms and employ only the abstract relations between them." Their terminology reminds us again that proofs that 12 GILA HANNA rely on mathematical logic alone can never provide explanatory reasons. Such proofs look only at the form of the argument: their whole raison d~tre is in being content-free. The distinction I have made between proofs that prove and proofs that explain is similar to that which Bolzano makes between "making certain" (Gewissmachung) and "building a foundation" (Begruendung). He argues that making certain requires no more than a formal demonstration, while building a foundation demands an approach which also provides insight into the connections among ideas. Much has been made of the importance of convincing in mathematics education (Hirst, 1989), but a proof that convinces need not be a proof that explains: it is certainly possible to be convinced that a statement is true without knowing why it is true. Thus it is not its ability to convince which distinguishes the explanatory proof, convincing though we would expect it to be. Nor is an explanatory proof distinguished by its degree of validity: one would expect such a proof to be an acceptable proof, as that term has been defined above. But the focus of an explanatory proof is clearly upon understanding, rather than upon the deductive mechanism. Abandoning non-explanatory proofs in favour of (equally valid) explanatory ones would not make the curriculum any less reflective of accepted mathematical practice. In fact, as I have argued elsewhere (Hanna, 1983), mathematicians, including those who have recourse to purely syntactic methods, are really more interested in the message behind the proof than in its syntax, and see the mechanics of proof as a necessary but ultimately less significant aspect of mathematics. Thus there is no infidelity to the practice of mathematics if in mathematics education we focus as much as possible on good mathematical explanations, highlighting for the students in our proof of a theorem the important mathematical ideas that lead to its truth. Summary and Implications As mathematics educators it is our mission to make students understand mathematics. It is my contention that in support of this mission we should give a more prominent place in the mathematics curriculum to proofs that explain. If we stress explanatory proofs our primary concern is for insight, of course. But there is another important reason for such a stress. Students of mathematics, unlike mathematicians, have yet to learn the relative importance of different mathematical topics, and may easily be misled on this point by a pervasive classroom emphasis on the deductive mechanism. (Deductive mechanisms are an important facet of mathematics in themselves, however, and require their own classroom treatment.) A first step in shifting to the use of explanatory proofs is for educators of teachers to stress that mathematical understanding is the goal, and furthermore that understanding is much more than confirming that all the links in a chain of deduction are correct, that in fact the completeness of detail in a formal deduction may obscure rather than enlighten, and that understanding requires some appeal to previous mathematical experience. Another important step, and a challenge for curriculum developers, is to identify suitable explanatory proofs as alternatives to the many non-explanatory ones now in use. Note Some of these ideas were presented at the 13th Annual Conference of the International Group for the Psychology of Mathematics Education. SOME PEDAGOGICAL ASPECTS OF PROOF 13 References Alibert, D. (1988). Towards new customs in the classroom. For the learning of mathematics, 8(2), 31-35. Balacheff. N. (t988). A study of pupils'proving processes at the junior high school level. Unpublished paper presented at the Joint International Conference 66th NCTM and UCSMP Project, Chicago. B0urbaki, N. (1971). The architecture of mathematics. In E Le Lionnais (Ed.), Great currents of mathematical thought (Vol. 1). New York: Dover. Fetzer, J. H. (1988). Program verification: The very idea. Communications of the ACM, 31(9), 1048-1063. Hanna, G. (1983). Rigorous proof in mathematics education. Toronto: OISE Press. Hirst, K., & Hirst, A. (Eds.). (1989). Proceedings of the Sixth International Congress on Mathematical Education. Budapest: Janos Bolyai Mathematical Society. Leron, U. (1983). Structuring mathematical proofs. American Mathematical Monthly, 90(3), 174-185. Movshovits-Hadar, N. (1988). Stimulating presentation of theorems followed by responsive proofs. For the learning of mathematics, 8(2), 12-19. Steiner, M. (1978). Mathematical explanation. Philosophical Studies, 34, 135-151. Tymoczko, T. (1986). Making room for mathematicians in the philosophy of mathematics. The Mathematical InteIIigencer, 8(3), 44-50. Volmink, J. (1988). The role of proof in students' understanding of geometry. Unpublished paper presented at the AERA Annual Meeting, New Orleans. Wittmann, E. C., & Mueller, G. (1988). Warmist ein Beweis ein Beweis? Unpublished paper presented at the Annual Meeting of Mathematics Teachers, Germany.
© Copyright 2026 Paperzz