A FORGOTTEN PAPER ON THE FUNDAMENTAL THEOREM OF

5. Smithies
13/9/00
11:25 AM
Page 333
Notes Rec. R. Soc. Lond. 54 (3), 333–341 (2000)
© 2000 The Royal Society
A FORGOTTEN PAPER ON THE FUNDAMENTAL THEOREM OF ALGEBRA
by
FRANK SMITHIES
167 Huntingdon Road, Cambridge CB3 0DH, UK
SUMMARY
In 1798, there appeared in the Philosophical Transactions of the Royal Society a paper
by James Wood, purporting to prove the fundamental theorem of algebra, to the
effect that every non-constant polynomial with real coefficients has at least one real
or complex zero. Since the first generally accepted proof of this result was given by
Gauss in 1799, Wood’s paper deserves careful examination. After giving a brief
outline of Wood’s career, I describe the argument of his paper. His proof turns out to
be incomplete as it stands, but it contains an original idea, which was to be used later,
in the same context, by von Staudt, Gordan and others, without knowledge of Wood’s
work. After putting Wood’s work in context, I conclude by showing how his idea can
be used to prove the complex form of the fundamental theorem of algebra, stating that
every non-constant polynomial with complex coefficients has at least one zero in the
complex field.
INTRODUCTION
Professor Peter Goddard, Master of St John’s College, Cambridge, has drawn attention
to a paper by James Wood, one of his predecessors, entitled ‘On the roots of equations’,
which appeared in Philosophical Transactions volume 88. It was communicated by
Nevil Maskelyne, F.R.S., the Astronomer Royal, and was read on 17 May 1798.1
The paper purports to prove what is usually called the fundamental theorem of
algebra, stating it in the form ‘Every equation has as many roots of the form a±√(±b2)
as it has dimensions’; in more modern language, the author is saying that a nonconstant polynomial of degree n≥1, with real coefficients, has n real or complex zeros
(some of which may be repeated).
The paper appeared the year before Gauss, in his Helmstedt dissertation, gave the
first generally accepted proof of the theorem.2 It therefore seems worthwhile to
examine Wood’s paper and its context with some care.
333
5. Smithies
13/9/00
334
11:25 AM
Page 334
Frank Smithies
WOOD’S CAREER
James Wood had a remarkable career. Born on 14 December 1760, he was the son of
a Lancashire weaver; he attended Bury School, and a sizarship attached to the school
enabled him to enter St John’s College, Cambridge, in January 1778. As an
undergraduate, he inhabited a tiny room, nicknamed the Tub (unused since his time),
at the top of a staircase in Second Court; he is said to have wrapped his feet in straw
to keep warm, and to have read by the light of the staircase candle. He emerged as
Senior Wrangler and first Smith’s Prizeman in 1782, becoming a Fellow of the
college; he became President of the college in 1802 and was elected Master in 1815.
In 1816–17 he was Vice-Chancellor of the University, displaying on several occasions
his attachment to older ways of doing things; in particular, he closed down the
recently formed Union Society, apparently because he disapproved of any discussion
of political questions by the undergraduates.
He was appointed Dean of Ely in 1820, and Rector of Freshwater, Isle of Wight
(a well-endowed living) in 1823. He did a great deal for St John’s College, reforming
the rules for the election of Fellows and contributing substantially to the building of
the college’s extension across the river (New Court). He died on 23 April 1839,
leaving a considerable sum of money to the college, most of which was eventually
spent in the building of a new chapel.
Wood was active as a textbook writer; his Elements of Algebra 3 first appeared in
1795, and went through numerous editions, continuing after his death under the
editorship of Thomas Lund until 1861; it had become the standard algebra textbook
in the University. It was ultimately superseded by Isaac Todhunter’s Algebra for the
Use of Colleges and Schools,4 of which the first edition appeared in 1855. Wood also
wrote textbooks on mechanics (1796) and optics (1798).
The 1798 paper in which we are interested seems to have been Wood’s only
published scientific paper, except for a note on haloes in 1790 in the Transactions of
the Manchester Literary and Philosophical Society.
In the Baker–Mayor history of St John’s College,5 Wood is described as a Fellow
of The Royal Society, but it appears that the Society has no record of his being one.
EARLIER WORK ON THE FUNDAMENTAL THEOREM
Before we describe Wood’s paper in any detail, something should be said about the
background to his work. By the time he was writing, complex numbers, generally
called imaginary or impossible numbers, were widely but not universally accepted as
a valid part of mathematics, especially in algebra. In a letter written by Lagrange to
Lorgna in 1777,6 he remarks on the fact that the use of imaginary numbers has become
generally accepted. That every algebraic equation with real coefficients has a real or
imaginary root was generally believed to be true, and various attempts had been
made to prove it. Those by d’Alembert, Euler, de Foncenex and Lagrange were to be
severely criticized by Gauss in his 1799 dissertation.7 The principal British worker in
5. Smithies
13/9/00
11:25 AM
Page 335
A forgotten paper on the fundamental theorem of algebra
335
this field in the 18th century was Edward Waring (1734–1798), who became Lucasian
Professor of Mathematics in Cambridge in 1760. He published his Meditationes
Algebraicae in 1770, and expanded it in later editions; the third edition of 17828 is
quoted in Wood’s paper. In this book and in separate papers, Waring discusses various
aspects of the theory of equations; in particular, he investigates several transformations
to which equations can be subjected, apparently in the hope of facilitating their
solution. At one point in the preface to the third edition (p. xli) he states explicitly that
every equation has a real or imaginary root α + β √ (–1); this remark seems to be
based on a method he has devised for approximating imaginary roots. His argument
is far from being a proof of the theorem. Waring clearly had good knowledge of
continental work in algebra; he refers to results by Cramer, Euler, Bézout,
Vandermonde, Lagrange and others. It is clear from references in Wood’s paper and
in his algebra textbook that he was an admirer of Waring’s work. It is a curious fact
that Wood’s paper came out only a few months before Waring’s death.
WOOD’S ARGUMENT
We shall now describe the argument used by Wood in his paper. He begins (his Prop.
I) with a detailed description of the now standard procedure for calculating the highest
common factor of two polynomials, the analogue for polynomials of the Euclidean
algorithm for finding the highest common factor of two natural numbers.
In his Prop. II, Wood starts with an equation of even degree 2m; as he remarks later,
an equation of odd degree with real coefficients necessarily has a real root. He writes
the equation as:
x2m +px2m–1 +qx2m–2 + rx2m–3 +… =0
(1)
and seeks two roots, which he denotes by v+z and v– z. Substituting these in (1), he
obtains two equations, one of the form:
v2m +b(z)v2m–2 +c(z)v2m–4 +… =0
(2)
and the other of the form (after a factor v has been dropped):
A(z)v2m–2 +B(z)v2m–4 + C(z)v2m–6 +… =0.
(3)
If we write y = v2, these become:
ym +b(z)ym–1 + c(z)ym–2 +… =0
(4)
A(z)y m–1 +B(z)y m–2 + C(z)y m–3 +… =0,
(5)
and
the coefficients being polynomials in z.
5. Smithies
13/9/00
336
11:25 AM
Page 336
Frank Smithies
Applying the highest common factor (HCF) process to the left-hand sides of (4)
and (5), regarded as polynomials in y, he reaches a remainder independent of y; this
will be a polynomial in z of degree m(2m – 1), which was later to be called the
resultant of the polynomials in (4) and (5). If now, he says, we can find a value z0 of
z for which this polynomial vanishes, then the last divisor in the HCF process will be,
when z0 is substituted for z, a common factor of the two polynomials. He then alleges
that the last divisor in the HCF process will be of the form y– Z(z), where Z(z)is a
polynomial in z, whence it would follow that v2 – Z(z0) is a common factor of the
polynomials in (2) and (3), and that (1) has the two roots:
x=±√(Z(z0)) +z0 = z0 ±v0
where v0 =√(Z(z0)). He then states as his conclusion (his Prop. II) that we can find a
root of an equation of degree 2m, provided that we can find a root of an associated
equation of degree m(2m –1). He also gives a corollary to the effect that
x2 – 2z0x+z 20 – v02
will be a quadratic factor of the original equation (1).
This is the first (and chief ) weak point in Wood’s argument; he gives no evidence
for his statement that the last divisor in the HCF process will be of degree 1 in y; in
practice, it could easily be of higher degree. We may remark here that at that period,
and for a good many years afterwards, such exceptional cases were frequently ignored,
attention being focused on the general case.
He now states as his final conclusion (his Prop. III) that every equation (with real
coefficients, it is understood) has as many roots of the form a ± √(b2) as it has
dimensions (i.e. its degree). His argument for this proceeds by successive enumeration
of cases.
• Case 1. Every equation of odd degree has at least one real root, say x0; by division
by (x– x0), the equation can be reduced to one of lower but even degree.
• Case 2. If the equation is of degree 2m, where m is odd, then m(2m – 1) is odd, and
therefore, by Prop. II, we have to solve an equation of odd degree, so we can find
real values for z0 and v0. The polynomial in (1) thus has a quadratic factor:
x2 – 2z0 x+z 20 – v 20
and therefore has two real or complex zeros.
• Case 3. If the equation is of degree 2m, where m is ‘evenly odd’, i.e. m/2 is odd,
then the equation for z0 has either two real roots or two of the form a ±b√(–1) and
v20 will be of the form c±d√(–1). He concludes that the given equation then has a
biquadratic factor with real coefficients; this can be resolved into two quadratics
with real coefficients, which will have four roots, as required.
He goes on to say that Prop. III can be proved in the same way when m/4, m/8, m/16,
5. Smithies
13/9/00
11:25 AM
Page 337
A forgotten paper on the fundamental theorem of algebra
337
…, is an odd number, and therefore holds for all equations. This last remark is not
altogether convincing, since the situation becomes more and more complicated as
the progression continues, and he gives no reason to suppose that new and
unexpected complications will not arise at some stage. The argument is crying out
to be recast in a form where mathematical induction can be used; as we shall see
later, it is possible to do so.
LATER WORK IN THE FIELD
Although Wood’s proof is incomplete, it contains all the ingredients required for a
proof of the fundamental theorem. It belongs to a class of proofs or attempted proofs
that we may describe as quasi-algebraic, in that the only tool used that is not strictly
algebraic is the fact that a real algebraic equation of odd degree always has a root. Later
proofs belonging to this class include Gauss’s second proof of 1815,9 one given by von
Staudt in 1845,10 one by Gordan in 187611 and expanded in his 1885 book12 and one
by Malet in 1878.13
Wood’s paper appears to have excited no interest whatever in the mathematical
world. The only mention of it that I can find, apart from a casual reference in some
editions of his own algebra textbook, appears to be in Moritz Cantor’s Vorlesungen
über die Geschichte der Mathematik in the final volume of 1908 in a section
contributed by Florian Cajori.14 After describing Wood’s argument in some detail, he
criticizes it on the same grounds that Gauss used for earlier attempts by Euler,
Lagrange and Laplace: to wit, that Wood assumes that the roots exist in some sense,
and that what is to be proved is that these roots must be real or complex numbers. This
remark appears unjust, since Wood makes no such assumption.
In further discussion, it will be useful to introduce a little terminology: if the
degree of a polynomial is expressed as n =2qp, where p is odd, we shall say that the
polynomial has evenness index q. The idea used by Wood of reducing an equation to
one with a smaller evenness index goes back to de Foncenex’s attempted proof,15 said
by Delambre to have been inspired by Lagrange, and published in 1759; it reappears
in Lagrange’s paper of 1772.16 There is no evidence indicating whether Wood knew
of these papers, but he seems to have been the first to use as an auxiliary equation the
resultant of two polynomials derived from the original equation. This idea is also
implicit in Von Staudt’s paper of 184517 and is explicitly expressed by Gordan in
1876;18 in both these papers the method is based on the same ideas as Wood’s.
THE COMPLEX FORM OF THE THEOREM
All the papers that we have mentioned so far focus on the case where the original
equation has real coefficients; the fundamental theorem is thought of as being the
statement that every real polynomial can be factorized into real polynomials of degree
1 or 2. The theorem that every polynomial with complex coefficients has at least one
5. Smithies
13/9/00
338
11:25 AM
Page 338
Frank Smithies
zero, and therefore can be factorized into complex polynomials of degree 1, goes back
to 1815, when Argand outlined a proof of it;19 a more rigorous proof was given by
Cauchy in 1821.20 Gauss mentioned it for the first time in his fourth proof of the
theorem.21 The possibility of a quasi-algebraic proof of the complex result is hinted
at by Malet22 in 1878.
A PROOF OF THE COMPLEX THEOREM BASED ON WOOD’S IDEAS
We conclude by outlining a quasi-algebraic proof of the fundamental theorem,
essentially following Wood’s method, for polynomials with general real or complex
coefficients. We shall take for granted some elementary properties of the resultant of
two polynomials. We begin with a lemma.
Lemma. If f (x) is a polynomial of odd degree k with real or complex coefficients, then
it has at least one zero.
We take as our inductive hypothesis that every polynomial of odd degree less than
k has a zero. If all the coefficients of f (x) are real, then consideration of the behaviour
of f (x) as x varies on the real line shows at once that f (x) has at least one (real) zero.
Supposing now that f (x), normalized to have leading coefficient 1, has some nonreal coefficients, we write:
g(x)= f (x) f (x),
where f (x) denotes the polynomial whose coefficients are the complex conjugates of
those of f(x); then g(x) has real coefficients and is of degree 2k. Write:
G(x,u)= [g(x+u)+ g(x– u)]/2,
H(x,u)= u–1[g(x+ u)– g(x– u)]/2.
Let S(u) be the resultant of G(x,u) and H(x,u), regarded as polynomials in x. Since
G(x,u) and H(x,u) are even functions of u, so is S(u); since G is of degree 2k and H
of degree 2k–1, S(u) is of degree 2k(2k– 1). If we write:
v=u2,
we shall have S(u)=T(v), say, where T has real coefficients and is of degree k(2k– 1),
which is odd. Consequently, T has a real zero v0 and, if we write
u0 =√v0,
we have S(u0)=0.
By known properties of the resultant, it follows that G(x,u0) and H(x,u0) have a
common divisor. Since G and H are even functions of u, and u0 is either real or
5. Smithies
13/9/00
11:25 AM
Page 339
A forgotten paper on the fundamental theorem of algebra
339
purely imaginary, G and H will have real coefficients and therefore a real HCF, so that
their common divisor h(x) may be supposed to have real coefficients and leading
coefficient 1.
Since:
g(x+ u0)= G(x,u0)+ u0 H(x,u0),
h(x) must divide g(x+u0), so h(x–u0) divides:
g(x)= f(x)f (x).
We now distinguish two cases. Suppose first that h(x–u0) is a constant multiple of f(x)
or f (x), say f(x); since h(x– u0) and f(x) both have leading coefficient 1, we have
equality: h(x– u0)=f(x). In this case, h(x) and f(x) have the same degree, so h(x) has
odd degree and real coefficients, and therefore has a real zero x0, say. Then f(x0 + u0)
= h(x0)=0, so f(x) has a zero x0 +u0, as required. The case h(x– u0)= f (x) can be treated
similarly. Second, if h(x–u0) is not a constant multiple of f(x) or f (x), then some proper
divisor of h(x–u0) must divide f(x) or f (x); since the conjugate of a divisor of f (x) must
divide f(x), we have a factorization f(x)=m(x) k(x), where m and k have lower degree
than f. Since deg f=deg m+deg k, either deg m or deg k must be odd; by the inductive
hypothesis, either m or k has a zero, which will be a zero of f, as required.
We now come to the main result.
Theorem. If f(x) is an arbitrary polynomial with real or complex coefficients, then
f(x) has at least one real or complex zero.
Since we have already dealt with polynomials of odd degree, we may suppose that f(x)
has degree n =2qp, where p is odd and q≥1. We follow a path similar to the proof of
the lemma. This time our inductive hypothesis is that: (i) every polynomial with
evenness index less than q has a zero, and (ii) every polynomial with evenness index
q and degree less than n has a zero. Write
F(x, u)=[ f(x+ u)+ f(x–u)] /2,
E(x, u)= u–1[f(x+u)– f(x–u)]/2.
Let R(u) be the resultant of F and E, regarded as polynomials in x; then R(u) is of
degree n(n–1) and is an even function of u. Writing v= u2, we have R(u)= Q(v), say,
where Q(v) is a polynomial of degree n(n – 1)/2; since Q has evenness index less by
1 than that of f, it follows from part (i) of the inductive assumption that Q(v) has a zero,
say v0, whence R(u0) = 0, where u0 = √v0. Consequently F(x,u0) and E(x,u0) have a
common divisor, say k(x); since:
f(x+u0)= F(x,u0)+ u0E(x,u0),
k(x) divides f(x+u0), so k(x–u0) divides f(x). Because deg k(x) ≤deg E(x, u0)=n–1, we
5. Smithies
13/9/00
340
11:25 AM
Page 340
Frank Smithies
have deg k<deg f, so that we can write f(x)= k(x– u0)r(x) for some non-trivial r(x).
Since f has evenness index q, either k(x–u0) or r(x) will have evenness index not greater
than q, and its degree will be less than n. By part (ii) of our inductive hypothesis, either
k(x–u0) or r(x) must have a zero, which will be a zero of f(x), as required.
CONCLUSION
We see from our discussion that, although the proof of the fundamental theorem of algebra
in James Wood’s paper23 is incomplete as it stands, it can be successfully completed along
his lines, and indeed completed so as to prove more than he was aiming at. In addition,
he deserves the credit for being the first to see that we can construct an auxiliary
polynomial of smaller evenness index than the initial one by using the resultant of two
polynomials derived from the original equation, thus providing the tools for a simple
quasi-algebraic proof of the fundamental theorem of algebra, even in its complex form.
ACKNOWLEDGEMENTS
My thanks are due to Peter Goddard for bringing Wood’s paper to my attention, to
Graeme Segal and H.S. Tropp for useful discussions of Wood’s argument, and to the
referee for some very helpful suggestions.
NOTES
1
2
3
4
5
6
7
8
9
10
11
12
13
J. Wood, ‘On the roots of equations’, Phil. Trans. R. Soc. Lond. 88, 368–377 (1798).
C.G. Gauss, ‘Demonstratio nova theorematis functionem algebraicam rationalem integram
unius variabilis in factores reales primi vel secundi gradus resolvi posse’, Dissertation,
Helmstedt (1799); Werke 3, 1–30 (1866).
J. Wood, The elements of algebra, 9th edn (Cambridge, 1830).
J. Todhunter, Algebra for the use of colleges and schools (Macmillan, Cambridge, 1855).
T. Baker, History of the College of St John the Evangelist, Cambridge (ed. J.E.B. Mayor)
(Cambridge University Press, 1869).
J.L. Lagrange, ‘Letter to Lorgna’, Oeuvres 14, 261.
Op. cit., note 2.
E. Waring, Meditationes Algebraicae, 3rd edn (Cambridge, 1782).
C.G. Gauss, ‘Demonstratio nova altera theorematis omnem functionem algebraicam
rationalem integram unius variabilis in factores reales primi vel secundi gradus resolvi
posse’, Comm. Soc. Reg. Sci. Göttingen 3, 107–142 (1816); Werke 3, 33–56 (1866).
C. von Staudt, ‘Beweis des Satzes, dass jede algebraische rationale ganze Function von einer
Veränderlichen in Factoren vom ersten Grade aufgelöst werden kann’, J. Reine Angew.
Math. 29, 97–103 (1845).
P. Gordan, ‘Ueber den Fundamentalsatz der Algebra’, Math. Ann. 10, 572–575 (1876).
P. Gordan, Vorlesungen über Invariantentheorie, vol. 1 (1885).
J.C. Malet, ‘Proof that every algebraic equation has a root’, Trans. R. Irish Acad. 26, 453–455
(1878).
5. Smithies
13/9/00
11:25 AM
Page 341
A forgotten paper on the fundamental theorem of algebra
14
15
16
17
18
19
20
21
22
23
341
M. Cantor (ed.), Vorlessungen über die Geschichte der Mathematik, vol. 4 (Teubner, Leipzig,
1908).
D. de Foncenex, ‘Réflexions sur les Quantités imaginaires’, Miscellanea Taurinensia 1,
113–146 (1759).
J.L. Lagrange, ‘Sur la forme des racines imaginaires des équations’, Nouv. Mém. Acad. Berlin
1772, 222–258 (1774); Oeuvres 3, 479–516 (1869).
Op.cit., note 10.
Op.cit., note 11.
J.R. Argand, ‘Réflexions sur la nouvelle théorie des imaginaires, suivies d’une application
à la démonstration d’un théorème d’analise’, Annales de Math. (Gergonne) 5, 197–209
(1815).
A.L. Cauchy, Cours d’analyse de l’École Royale Polytechnique, Ie Partie. Analyse algébrique
(1821); Oeuvres 2, 3 (1897).
C.G. Gauss, ‘Beiträge zur Theorie der algebraischen Gleichungen’, Abh. Ges. Wiss. Göttingen
4 (1848), 3–34 (1850); Werke 3, 73–102 (1866).
Op.cit., note 13.
Op.cit., note 1.