Transcendental Functions and Initial Value Problems: A Different Approach to Calculus II Byungchul Cha Byungchul Cha ([email protected]) teaches at Muhlenberg College in Allentown, Pennsylvania, after working for three years at Hendrix College. He received his B.S. from the Korea Advanced Institute of Science and Technology, and his Ph.D. from Johns Hopkins University. His area of specialization is number theory. When he is not teaching calculus, he enjoys engaging his young son in doing other mathematics, such as counting from 1 to 10 and drawing octagons. Imagine that all of our knowledge of elementary transcendental functions has been somehow lost. Using the differential equations that these functions satisfy, how can we rebuild them? This material is taken from a Calculus II curriculum developed and used at Hendrix College for a number of years. This article consists of several snapshots of our approach, and gives a coherent exposition of the materials on transcendental functions. We present two theorems on initial value problems (IVP) and coupled initial value problems (CIVP) that simply assert the existence and uniqueness of their solutions. After this, we study various initial value problems that are satisfied by the functions exp t, ln t, sin t, cos t, and arcsin t. Our theorems guarantee that the solutions to these initial value problems exist and are unique. Therefore, we can define these functions as the solutions. By using our fundamental theorems as tools, we derive important properties of these functions. We conclude with a brief description of how we integrate these materials with other traditional calculus topics to make a coherent Calculus II course. The idea of using differential equations to define transcendental functions appears in some calculus textbooks. For example, Callahan et al. [4] influenced the development of the Hendrix curriculum, and, in particular, our exposition on the exponential function overlaps with Chapter 4 in [4] substantially. However, we include it here for completeness. Initial value problems Recall that an initial value problem (IVP) consists of a differential equation and an initial condition. In this article, we assume that all of our differential equations are of the type dy = F(t, y), dt (1) where both the partial derivatives of F(t, y) exist and are continuous. This helps us avoid some technical difficulties. We now state, without proof, our fundamental theorem of initial value problems (see, for example, [3]), which will serve as the cornerstone for our exposition. 288 c THE MATHEMATICAL ASSOCIATION OF AMERICA Theorem 1 (Fundamental theorem of IVPs). the form dy = F(t, y) and dt Every initial value problem of y(t0 ) = y0 has a unique solution y = f (t) on an open interval containing t0 . Intuitively, this theorem means this: At each point in the t y-plane, equation (1) specifies the slope of the line tangent to a solution curve passing through the point, and therefore defines a direction field (see Figure 1). When a particle is released at the point (t0 , y0 ), the Fundamental Theorem states that the particle must travel along the solution curve determined by this direction field, at least for a short period of time. One consequence of the theorem is that there cannot be two solution curves on the direction field that meet one another. If two such curves did exist, the intersection point could be taken as an initial condition, and this would define an IVP possessing two different solutions, contradicting the theorem. y y = f (t) (t 0 ; y 0 ) t Figure 1. Fundamental theorem of IVP. The exponential function. Consider the initial value problem dy =y dt with y(0) = 1. (2) Theorem 1 asserts there is a unique solution, which we define to be y = exp t. (Later, we define other transcendental functions this way, and each defining IVP will appear in a box as above.) Starting from this definition, we prove the exponential laws: for all real numbers x and y, • • • • exp(x + y) = exp x exp y exp(−x) = 1/ exp x exp(x − y) = exp x/ exp y exp(x y) = (exp x) y . To prove these, we use the following strategy. We construct an appropriate IVP and show that each side of an equality is a solution to this IVP. Then, it follows from the uniqueness in Theorem 1 that the equality holds. We will call this an IVP-style proof. To prove exp(x + y) = exp x exp y, we take f 1 (t) := exp(t + a) and f 2 (t) := exp t exp a (treating a as a constant). Because we have defined exp t as the solution to VOL. 38, NO. 4, SEPTEMBER 2007 THE COLLEGE MATHEMATICS JOURNAL 289 (2), we can differentiate f 1 (t) directly to get d d ( f 1 (t)) = (exp(t + a)) = exp(t + a) = f 1 (t), dt dt using the chain rule. In addition, we see that f 1 (0) = exp a. This says that f 1 (t) is a solution to the IVP dy =y dt with y(0) = exp a. (3) Also, we verify by direct computation that d ( f 2 (t)) = exp t exp a = f 2 (t) and dt f 2 (0) = exp 0 exp a = exp a, using (2) again. So, f 2 (t) is also a solution to (3). By uniqueness, we conclude that f 1 (t) = f 2 (t). So, exp(x + y) = exp x exp y for all x and y. The other exponential laws can be shown to hold by similar arguments. For example, we let f 1 (t) := exp(at) and f 2 (t) := (exp t)a , and show that both f 1 (t) and f 2 (t) are solutions to the IVP dy = ay dt with y(0) = 1. Again, by uniqueness, f 1 (t) = f 2 (t). Note that, in differentiating this f 2 (t), we have used the power rule (x a ) = ax a−1 for any real number a. For a nonnegative integer a, the power rule can be easily established using the binomial theorem, and implicit differentiation extends it for rational a. Usually, it is not until an advanced calculus course that a rigorous definition of x a for an irrational exponent a is given, so that the expression x a , for a fixed x > 0, gives a continuous function of a. If we accept this fact, then our proof for a rational exponent a is sufficient to imply exp(at) = (exp t)a for all real a because both sides represent, for a fixed t, continuous functions of a that agree on all rational numbers. (See an interesting exposition by Anselone and Lee [1] regarding diffentiability of exponential functions.) Alternatively, one can take the equation (exp t)a := exp(at) as a definition of x a for any real a for a fixed x = exp t. This is possible because the range of exp t can be shown to be all positive reals (see below). Our earlier proof for rational a then says that this definition is consistent with the conventional definition of x a for rational a. As a corollary of the exponential laws, we argue that the range of y = exp t is all positive reals. Observe that the line y = 0 is a solution to the differential equation dy/dt = y. Therefore, the graph of y = exp t must always stay above the t-axis since otherwise it would intersect y = 0. Hence (exp t) = exp t > 0 for all t. Thus y = exp t is a strictly increasing function. Take > 0. Then exp > 1 because exp 0 = 1. Now, we use the fourth exponential law to obtain exp(n · ) = (exp )n , which becomes arbitrarily large as n increases. From the intermediate value theorem, we conclude that the range of exp t contains (1, ∞). This, in turn, implies that the range contains the interval (0, 1) as well, thanks to the second exponential law exp(−x) = 1/ exp x. This completes the proof that the range of exp t is the set of all positive real numbers. There is an issue that we have overlooked in our discussion of y = exp t. Theorem 1 guarantees the existence of a solution to (2), but only locally. In particular, exp t has not been shown to be well-defined on the entire t-line yet. For example, a solution of the 290 c THE MATHEMATICAL ASSOCIATION OF AMERICA IVP dy/dt = y 2 and y(0) = 1 is y = 1/(1 − t), which is undefined at t = 1. In fact, all solutions of dy/dt = y 2 blow up at some value of t; that is, they possess vertical asymptotes (see Figure 2). After taking a calculus course based on the material in this article, one student, Jared Williams, asked when in general solutions of differential equations blow up, and eventually, this question resulted in an undergraduate research paper [5]. In particular, his theorem [5] implies that y = exp t does not blow up, and is well-defined for all real t. y t Figure 2. Direction field of y = y 2 and its solutions. The logarithmic function. Before we define the logarithmic function, we study how two IVPs are related when their solutions are inverses of each other. Suppose that y1 (t) and y2 (t) satisfy ⎧ ⎧ ⎨ dy2 = F (t, y ), ⎨ dy1 = F (t, y ), 1 1 2 2 dt dt and (4) ⎩ ⎩ y1 (a) = b y2 (b) = a. Further, assume that they are inverses of each other. A standard result from implicit differentiation (see, for example, [6, p. 235]) is that, if f is a one-to-one differentiable function whose inverse is also differentiable, then 1 ( f −1 ) (x) = f ( f −1 (x)) . Therefore, the expressions F1 and F2 are related by F2 (t, y) = 1 . F1 (y, t) (5) In words, we can obtain F2 from F1 by interchanging t and y, and flipping. Now we define the logarithmic function. Since we expect ln t to be the inverse of exp t, equation (5) suggests that we define y = ln t as a solution to the IVP dy 1 = dt t with y(1) = 0. (6) du u (7) Its integral version t ln t = 1 VOL. 38, NO. 4, SEPTEMBER 2007 THE COLLEGE MATHEMATICS JOURNAL 291 is nothing but a standard definition of ln t used in many calculus textbooks (see, for example, [6]). It is immediate from (7) that the domain of ln t is (0, ∞) because the integrand 1/u is continuous on this interval. In particular, we see that the domain of ln t is the same as the range of exp t. Here is an IVP-style proof of the fact that exp t and ln t are indeed inverse functions of each other. Define f 1 (t) := (exp ◦ ln)(t) and f 2 (t) := t, for all t > 0. Note that f 1 (t) is defined for t > 0, because ln t is defined for t > 0. We differentiate f 1 (t) directly using (2), (6), and the chain rule, and obtain exp(ln t) f 1 (t) d f 1 (t) = = . dt t t Further, f 1 (1) = 1 from (6) and (2). This shows that f 1 (t) is a solution to the IVP dy/dt = y/t with y(1) = 1. On the other hand, it is easy to verify that f 2 (t) is a solution to the same IVP. Therefore, f 1 (t) = f 2 (t). That is, (exp ◦ ln)(t) = t for t > 0. Similarly, define f 3 (t) := (ln ◦ exp)(t) and f 4 (t) := t for all t. Since the range of exp t is the positive reals, f 3 (t) is defined for all t. Further, we can show by direct computation that both f 3 (t) and f 4 (t) are solutions to the IVP dy/dt = 1 and y(0) = 0. For example, by (2), (6), and the chain rule, we get 1 d f 3 (t) = · exp t = 1. dt exp t The remaining computations are straightforward, with the result that exp t and ln t are inverses. Coupled initial value problems A coupled initial value problem (CIVP) consists of a pair of initial value problems ⎧ dx ⎪ ⎪ = F1 (t, x, y), ⎨ x(t0 ) = x0 , dt (8) and ⎪ dy y(t0 ) = y0 . ⎪ ⎩ = F2 (t, x, y) dt As we did with (1), we assume that all of the partial derivatives of F1 (t, x, y) and F2 (t, x, y) exist and are continuous in order to avoid some technical difficulties. We state a theorem similar to Theorem 1 that asserts the existence and uniqueness of a pair of solutions under similar conditions [2, p. 205]. Theorem 2 (Fundamental theorem of CIVPs). Every coupled initial value problem of the form (8) has a unique pair of solutions x = x(t) and y = y(t) on an open interval containing t0 . Sine and cosine functions. We define x(t) := cos t and y(t) := sin t to be the unique solutions to the CIVP ⎧ dx ⎪ ⎪ = −y, ⎨ dt ⎪ dy ⎪ ⎩ =x dt 292 and x(0) = 1, y(0) = 0. (9) c THE MATHEMATICAL ASSOCIATION OF AMERICA With this definition, there is an IVP-style proof for the fundamental identity sin2 t + cos2 t = 1. (10) Let f (t) := cos2 t + sin2 t. Then it is not difficult to check that f (t) = 0 and f (0) = 1 by (9) and the chain rule. Hence, Y = f (t) is a solution to the IVP dY =0 dt and Y (0) = 1. On the other hand, the constant function Y = 1 is also a solution to this IVP. This proves (10) by the uniqueness statement of Theorem 1. Next, we sketch a proof of the addition formulas sin(t + a) = sin t cos a + cos t sin a (11) cos(t + a) = cos t cos a − sin t sin a using what we will call a CIVP-style proof. Define the pair of functions f 1 (t) := sin(t + a) and f 2 (t) := cos(t + a), and another pair f 3 (t) := sin t cos a + cos t sin a and f 4 (t) := cos t cos a − sin t sin a. Then, a direct computation using (9) and the chain rule shows that both pairs are solutions to the CIVP ⎧ dy1 ⎪ ⎪ = y2 , ⎨ y1 (0) = sin a, dt and ⎪ y2 (0) = cos a. dy ⎪ ⎩ 2 = −y1 dt Therefore, the two pairs are the same by the uniqueness statement of Theorem 2. The addition formulas (11) are thus proved. Next, we consider a direction field which is determined by the differential equations (9) in t x y-space (see Figure 3(a)). The solution curve starts at the point (t0 , x0 , y0 ) = (0, 1, 0). If we project this direction field onto the x y-plane, we obtain a new direction field (Figure 3(b)) on the x y-plane associated with dy/d x. We leave it to readers to y y 1 1 (cos t, sin t) θ= t 1 t (a) x x 1 (b) Figure 3. Direction field of sine and cosine in the t x y-space and its projection onto the x yplane. VOL. 38, NO. 4, SEPTEMBER 2007 THE COLLEGE MATHEMATICS JOURNAL 293 check, using the chain rule and (9), that x dy =− . dx y Also, the projection of the solution curve lies on the unit circle, as evidenced by the identity (10). Recall that the angle θ in polar coordinates of the point (x, y) is given by θ = tan−1 (y/x). A little calculation shows that dθ/dt = 1. Further, θ(0) = tan −1 y(0) x(0) = 0. So, we get θ(t) = t. In other words, a particle released at the point (1, 0) in this direction field will move along the circle at a constant angular speed 1 radian per unit time. This links our CIVP approach with the usual definitions of the sine and cosine. Arcsine function The last function that we consider here is the arcsine function y = arcsin t. It is defined to be a solution to the IVP 1 dy =√ dt 1 − t2 with y(0) = 0. (12) This IVP is motivated by the fact that y = sin t satisfies the initial condition y(0) = 0 and the differential equation dy = 1 − y2, dt (13) (swap t and y, and flip!) on a small interval containing t = 0, over which y = sin t is one-to-one. Since y (0) > 0, such an interval does exist, and (13) follows from (9) and (10). This IVP definition is, by part one of the fundamental theorem of calculus [6, p. 396], equivalent to the integral version t arcsin t = 0 du . √ 1 − u2 (14) √ Since the integrand 1/ 1 − u 2 is a continuous function of u on (−1, 1), the domain of arcsin t contains the open interval (−1, 1). However, we can show that both lim t→1− 0 t du √ 1 − u2 and t lim t→−1+ 0 du √ 1 − u2 exist, so arcsin t can be continuously extended to the closed interval [−1, 1]. We prove the convergence of the improper integrals as follows. It is not difficult to see that d 1 , −u 1 − u 2 = −2 1 − u 2 + √ du 1 − u2 294 c THE MATHEMATICAL ASSOCIATION OF AMERICA which, in turn, implies that du = −u 1 − √ 1 − u2 u2 +2 1 − u 2 du. From this, we deduce that arcsin(1) = lim t→1− 0 t du =2 √ 1 − u2 1 1 − u 2 du = 0 π , 2 (15) 1√ since 0 1 − u 2 du is the area of a quarter circle of radius 1. The computation of arcsin(−1) = −π/2 is similar. Equation (15) has many important consequences. We briefly describe some of them here as exercises for the readers. (a) Show that the function arcsin t gives a one-to-one correspondence from [−1, 1] to [−π/2, π/2]. Hint: (The equation (14) says arcsin t has a positive derivative, from which the injectivity of arcsin t follows. For surjectivity, use the intermediate value theorem.) (b) Prove that sin(arcsin t) = t for all t ∈ [−1, 1]. Together with (a), this implies that sin t and arcsin t are inverses of each other. (Hint: instead of proving just this, show even more: for all t ∈ [−1, 1], sin(arcsin t) = t, and cos(arcsin t) = 1 − t 2. This can be done by a CIVP-style proof with this CIVP: ⎧ dy y2 1 ⎪ ⎪ ⎨ dt = √1 − t 2 , ⎪ dy y ⎪ ⎩ 2 = −√ 1 dt 1 − t2 and y1 (0) = 0, y2 (0) = 1.) (c) Define arccos t := π/2 − arcsin t. Then deduce results similar to those in (b) for cos t. (d) From (15) and (b), we get sin(π/2) = 1. Combine this with (10) and (11) to recover many other properties of the sine and cosine functions, such as periodicity, odd/evenness, and so on. IVPs as teaching tools for calculus II The materials we outlined in this article have been successfully used in the second calculus course at Hendrix College for many years. We mix these with other standard materials, including the following: • • • Qualitative, numerical, and analytic tools—graphical analysis, Euler’s method, and separation of variables ([6, §9.1–§9.5])—for studying differential equations. Integration techniques, such as partial fractions and trigonometric substitutions. Sequences and series. VOL. 38, NO. 4, SEPTEMBER 2007 THE COLLEGE MATHEMATICS JOURNAL 295 The emphasis on differential equations provides meaningful contexts for various integration techniques. Also, the IVP- and CIVP-style proofs require students to use many familiar ideas from Calculus I—the chain rule, the fundamental theorem of calculus, and so on—in an unfamiliar setting. Although not mentioned in this article, our approach to IVPs lends itself nicely to an overview on how to obtain power series expansions of transcendental functions via Picard iteration. Acknowledgments. Many of my colleagues at Hendrix College contributed to this approach for Calculus II, including Bob Eslinger, Dwayne Collins, David Sutherland, and Duff Campbell. This article has been greatly improved by suggestions made by Carl Burch, an anonymous referee, and the editor. References 1. P. Anselone and J. Lee, Differentiability of Exponential Functions, College Mathematics Journal 36 (2005) 388–393. 2. P. Blanchard, R. Devaney, and G. Hall, Differential Equations, 3rd ed., Brooks/Cole, 2005. 3. W. Boyce and R. DiPrima, Elementary Differential Equations, 8th ed., Wiley, 2004. 4. J. Callahan, D. Cox, K. Hoffman, D. O’Shea, H. Pollatsek, and L. Senechal, Calculus in Context, Freeman, 1995. 5. D. Campbell and J. Williams, Exploring Finite-Time Blow-Up, PME Journal 11 (2003) 423–428. 6. J. Stewart, Calculus, Early Transcendentals, 5th ed., Brooks/Cole, 2003. Proof Without Words: A Graph Theoretic Summation of the First n Integers n i= i=1 n+1 2 For example, n = 5: 1 6 2 5 3 4 1 1 1 1 1 6 2 6 2 6 2 6 2 6 2 5 3 5 3 5 3 5 3 5 3 4 4 4 4 4 For a related result, see Mathematics Magazine 80 (2007) 182. Joe DeMaio Joey Tyson Kennesaw State University 296 c THE MATHEMATICAL ASSOCIATION OF AMERICA
© Copyright 2026 Paperzz