Jim Lambers MAT 280 Spring Semester 2009-10 Lecture 6 Notes These notes correspond to Section 11.5 in Stewart and Sections 2.5 and 3.5 in Marsden and Tromba. The Chain Rule Recall from single-variable calculus that if a function π(π₯) is diο¬erentiable at π₯0 , and π (π₯) is diο¬erentiable at π(π₯0 ), then the derivative of the composition (π β π)(π₯) = π (π(π₯)) is given by the Chain Rule (π β π)β² (π₯0 ) = π β² (π(π₯0 ))π β² (π₯0 ). We now generalize the Chain Rule to functions of several variables. Let f : π· β βπ β βπ , and let g : π β βπ β π·. That is, the range of g is the domain of f . Assume that g is diο¬erentiable at a point p0 β π , and that f is diο¬erentiable at the point q0 = g(p0 ). Then, f has a Jacobian matrix π½f (q0 ), and g has a Jacobian matrix π½g (p0 ). These matrices contain the ο¬rst partial derivatives of f and g evaluated at q0 and p0 , respectively. Then, the Chain Rule states that the derivative of the composition (f β g) : π β βπ , deο¬ned by (f β g)(x) = f (g(x)), at p0 , is given by the Jacobian matrix π½f βg (p0 ) = π½f (g(p0 ))π½g (p0 ). That is, the derivative of f β g at p0 is the product, in the sense of matrix multiplication, of the derivative of f at g(p0 ) and the derivative of g at p0 . This is entirely analogous to the Chain Rule from single-variable calculus, in which the derivative of π β π at π₯0 is the product of the derivative of π at π(π₯0 ) and the derivative of π at π₯0 . It follows from the rules of matrix multiplication that the partial derivative of the πth component function of f β g with respect to the variable π₯π , an independent variable of g, is given by the dot product of the gradient of the πth component function of f with the vector that contains the partial derivatives of the component functions of g with respect to π₯π . We now illustrate the application of this general Chain Rule with some examples. Example Let π : β3 β β be deο¬ned by π (π₯, π¦, π§) = ππ§ cos 2π₯ sin 3π¦, and let g : β β β3 be a vector-valued function of one variable deο¬ned by g(π‘) = β¨π₯(π‘), π¦(π‘), π§(π‘)β© = β¨2π‘, π‘2 , π‘3 β©. 1 Then, π β g is a scalar-valued function of π‘, 3 (π β g)(π‘) = ππ§(π‘) cos 2π₯(π‘) sin 3π¦(π‘) = ππ‘ cos 4π‘ sin 3π‘2 . To compute its derivative with respect to π‘, we ο¬rst compute ] [ ] [ βπ = ππ₯ ππ¦ ππ§ = β2ππ§ sin 2π₯ sin 3π¦ 3ππ§ cos 2π₯ cos 3π¦ ππ§ cos 2π₯ sin 3π¦ , and gβ² (π‘) = β¨π₯β² (π‘), π¦ β² (π‘), π§ β² (π‘)β© = β¨2, 2π‘, 3π‘2 β©, and then apply the Chain Rule to obtain ππ ππ‘ = βπ (π₯(π‘), π¦(π‘), π§(π‘)) β gβ² (π‘) β‘ = [ ππ₯ (π₯(π‘), π¦(π‘), π§(π‘)) ππ¦ (π₯(π‘), π¦(π‘), π§(π‘)) ππ§ (π₯(π‘), π¦(π‘), π§(π‘)) ] β£ ππ₯ ππ‘ ππ¦ ππ‘ ππ§ ππ‘ β€ β¦ ππ¦ ππ§ ππ₯ + ππ¦ (π₯(π‘), π¦(π‘), π§(π‘)) + ππ§ (π₯(π‘), π¦(π‘), π§(π‘)) ππ‘ ππ‘ ππ‘ = (β2ππ§(π‘) sin 2π₯(π‘) sin 3π¦(π‘))(2) + (3ππ§(π‘) cos 2π₯(π‘) cos 3π¦(π‘))(2π‘) + (ππ§(π‘) cos 2π₯(π‘) sin 3π¦(π‘))(3π‘2 ) = ππ₯ (π₯(π‘), π¦(π‘), π§(π‘)) 3 3 3 = β4ππ‘ sin 4π‘ sin 3π‘2 + 6π‘ππ‘ cos 4π‘ cos 3π‘2 + 3π‘2 ππ‘ cos 4π‘ sin 3π‘2 . β‘ Example Let π : β2 β β be deο¬ned by π (π₯, π¦) = π₯2 π¦ + π₯π¦ 2 , and let g : β2 β β2 be deο¬ned by [ g(π , π‘) = π₯(π , π‘) π¦(π , π‘) ] [ = 2π + π‘ π β 2π‘ ] . Then, π β g is a scalar-valued function of π and π‘, (π β g)(π , π‘) = π₯(π , π‘)2 π¦(π , π‘) + π₯(π , π‘)π¦(π , π‘)2 = (2π + π‘)2 (π β 2π‘) + (2π + π‘)(π β 2π‘)2 . To compute its gradient, which includes its partial derivatives with respect to π and π‘, we ο¬rst compute [ ] [ ] βπ = ππ₯ ππ¦ = 2π₯π¦ + π¦ 2 π₯2 + 2π₯π¦ , and [ π½g (π , π‘) = π₯π π₯π‘ π¦π π¦π‘ 2 ] [ = 2 1 1 β2 ] , and then apply the Chain Rule to obtain β(π β g)(π , π‘) = βπ (π₯(π , π‘), π¦(π , π‘))π½g (π , π‘) [ ] [ ] π₯π π₯π‘ ππ₯ (π₯(π , π‘), π¦(π , π‘)) ππ¦ (π₯(π , π‘), π¦(π , π‘)) = π¦π π¦π‘ [ ππ₯ (π₯(π , π‘), π¦(π , π‘))π₯π + ππ¦ (π₯(π , π‘), π¦(π , π‘))π¦π = ] ππ₯ (π₯(π , π‘), π¦(π , π‘))π₯π‘ + ππ¦ (π₯(π , π‘), π¦(π , π‘))π¦π‘ [ [2π₯(π‘)π¦(π‘) + π¦(π‘)2 ](2) + [π₯(π‘)2 + 2π₯(π‘)π¦(π‘)](1) = ] [2π₯(π‘)π¦(π‘) + π¦(π‘)2 ](1) + [π₯(π‘)2 + 2π₯(π‘)π¦(π‘)](β2) [ 4(2π + π‘)(π β 2π‘) + 2(π β 2π‘)2 + (2π + π‘)2 + 2(2π + π‘)(π β 2π‘) = 2(2π + π‘)(π β 2π‘) + (π β 2π‘)2 β 2(2π + π‘)2 β 4(2π + π‘)(π β 2π‘) ] . β‘ Example Let π : β β β be deο¬ned by π (π₯) = π₯3 + 2π₯2 , and let π : β2 β β be deο¬ned by π(π’, π£) = sin π’ cos π£. Then π β π is a scalar-valued function of π’ and π£, (π β π)(π’, π£) = (sin π’ cos π£)3 + 2(sin π’ cos π£)2 . To compute its gradient, which includes partial derivatives with respect to π’ and π£, we ο¬rst compute π β² (π₯) = 3π₯2 + 4π₯, and βπ = [ ππ’ ππ£ ] = [ cos π’ cos π£ β sin π’ sin π£ ] , and then use the Chain Rule to obtain β(π β g)(π’, π£) = π β² (π(π’, π£))βπ(π’, π£) = [3(π(π’, π£))2 + 4π(π’, π£)] [ ] cos π’ cos π£ β sin π’ sin π£ [ ] = [3 sin2 π’ cos2 π£ + 4 sin π’ cos π£] cos π’ cos π£ β sin π’ sin π£ [ ] (3 sin2 π’ cos2 π£ + 4 sin π’ cos π£) cos π’ cos π£ β(3 sin2 π’ cos2 π£ + 4 sin π’ cos π£) sin π’ sin π£ . = β‘ 3 Example Let f : β2 β β2 be deο¬ned by [ f (π₯, π¦) = π1 (π₯, π¦) π2 (π₯, π¦) ] [ = π₯2 π¦ π₯π¦ 2 ] , and let g : β β β2 be deο¬ned by g(π‘) = β¨π₯(π‘), π¦(π‘)β© = β¨cos π‘, sin π‘β©. Then f β g is a vector-valued function of π‘, f (π‘) = β¨cos2 π‘ sin π‘, cos π‘ sin2 π‘β©. To compute its derivative with respect to π‘, we ο¬rst compute [ ] ] [ (π1 )π₯ (π1 )π¦ 2π₯π¦ π₯2 π½f (π₯, π¦) = , = (π2 )π₯ (π2 )π¦ π¦ 2 2π₯π¦ and gβ² (π‘) = β¨β sin π‘, cos π‘β©, and then use the Chain Rule to obtain ] ][ β² [ π₯ (π‘) (π1 )π₯ (π₯(π‘), π¦(π‘)) (π1 )π¦ (π₯(π‘), π¦(π‘)) β² β² (f β g) (π‘) = π½f (π₯(π‘), π¦(π‘))g (π‘) = π¦ β² (π‘) (π2 )π₯ (π₯(π‘), π¦(π‘)) (π2 )π¦ (π₯(π‘), π¦(π‘)) ][ ] [ β sin π‘ 2π₯(π‘)π¦(π‘) π₯(π‘)2 = π¦(π‘)2 2π₯(π‘)π¦(π‘) cos π‘ = β¨2 cos π‘ sin π‘(β sin π‘) + cos2 π‘(cos π‘), sin2 π‘(β sin π‘) + 2 cos π‘ sin π‘(cos π‘)β© = β¨β2 cos π‘ sin2 π‘ + cos3 π‘, β sin3 π‘ + 2 cos2 π‘ sin π‘β©. β‘ The Implicit Function Theorem The Chain Rule can also be used to compute partial derivatives of implicitly deο¬ned functions in a more convenient way than is provided by implicit diο¬erentiation. Let the equation πΉ (π₯, π¦) = 0 implicitly deο¬ne π¦ as a diο¬erentiable function of π₯. That is, π¦ = π (π₯) where πΉ (π₯, π (π₯)) = 0 for π₯ in the domain of π . If πΉ is diο¬erentiable, then, by the Chain Rule, we can diο¬erentiate the equation πΉ (π₯, π¦(π₯)) = 0 with respect to π₯ and obtain πΉπ₯ + πΉπ¦ ππ¦ = 0, ππ₯ which yields ππ¦ πΉπ₯ =β . ππ₯ πΉπ¦ 4 By the Implicit Function Theorem, the equation πΉ (π₯, π¦) = 0 deο¬nes π¦ implicitly as a function of π₯ near (π₯0 , π¦0 ), where πΉ (π₯0 , π¦0 ) = 0, provided that πΉπ¦ (π₯0 , π¦0 ) β= 0 and πΉπ₯ and πΉπ¦ are continuous near (π₯0 , π¦0 ). Under these conditions, we see that ππ¦/ππ₯ is deο¬ned at (π₯0 , π¦0 ) as well. Example Let πΉ : β2 β β2 be deο¬ned by πΉ (π₯, π¦) = π₯2 + π¦ 2 β 4. The equation πΉ (π₯, π¦) = 0 deο¬nes π¦ implicitly as a function of π₯, provided that πΉ satisο¬es the conditions of the Implicit Function Theorem. We have πΉπ₯ = 2π₯, πΉπ¦ = 2π¦. Since both of these partial derivatives are polynomials, and therefore are continuous on all of β2 , it follows that if πΉπ¦ β= 0, then π¦ can be implicitly deο¬ned as a function of π₯ at a point where πΉ (π₯, π¦, π§) = 0, and ππ¦ πΉπ₯ π₯ =β =β . ππ₯ πΉπ¦ π¦ For example, at the point (π₯, π¦) = (0, 2), πΉ (π₯, π¦) = 0, and πΉπ¦ = 4. Therefore, π¦ can be implicitly deο¬ned as a function of π₯ near this point, and at π₯ = 0, we have ππ¦/ππ₯ = 0. β‘ (0) (0) (0) More generally, let πΉ : π· β βπ+1 β β, and let p0 = (π₯1 , π₯2 , . . . , π₯π , π¦ (0) ) β π· be such that (0) (0) (0) πΉ (π₯1 , π₯2 , . . . , π₯π , π¦ (0) ) = 0. In this case, the Implicit Function Theorem states that if πΉπ¦ β= 0 near p0 , and all ο¬rst partial derivatives of πΉ are continuous near p0 , then this equation deο¬nes π¦ as a function of π₯1 , π₯2 , . . . , π₯π , and βπ¦ πΉπ₯ = β π, βπ₯π πΉπ¦ (0) π = 1, 2, . . . , π. (0) (0) To see this, we diο¬erentiate the equation πΉ (π₯1 , π₯2 , . . . , π₯π , π¦ (0) ) = 0 with respect to π₯π to obtain the equation βπ¦ πΉπ₯π + πΉπ¦ = 0, βπ₯π where all partial derivatives are evaluated at p0 , and solve for βπ¦/βπ₯π at p0 . Example Let πΉ : β3 β β be deο¬ned by πΉ (π₯, π¦, π§) = π₯2 π§ + π§ 2 π¦ β 2π₯π¦π§ + 1. The equation πΉ (π₯, π¦, π§) = 0 deο¬nes π§ implicitly as a function of π₯ and π¦, provided that πΉ satisο¬es the conditions of the Implicit Function Theorem. We have πΉπ₯ = 2π₯π§ β 2π¦π§, πΉπ¦ = 2π¦π§ β 2π₯π§, πΉπ§ = π₯2 + 2π¦π§ β 2π₯π¦. 5 Since all of these partial derivatives are polynomials, and therefore are continuous on all of β3 , it follows that if πΉπ§ β= 0, then π§ can be implicitly deο¬ned as a function of π₯ and π¦ at a point where πΉ (π₯, π¦, π§) = 0, and π§π₯ = β πΉπ₯ 2π¦π§ β 2π₯π§ = 2 , πΉπ§ π₯ + 2π¦π§ β 2π₯π¦ π§π¦ = β πΉπ¦ 2π₯π§ β 2π¦π§ = 2 . πΉπ§ π₯ + 2π¦π§ β 2π₯π¦ For example, at the point (π₯, π¦, π§) = (1, 0, β1), πΉ (π₯, π¦, π§) = 0, and πΉπ§ = 1. Therefore, π§ can be implicitly deο¬ned as a function of π₯ and π¦ near this point, and at (π₯, π¦) = (1, 0), we have π§π₯ = 2 and π§π¦ = β2. β‘ We now consider the most general case: let F : π· β βπ+π β βπ , and let (0) (0) (0) (0) (0) p0 = (π₯1 , π₯2 , . . . , π₯(0) π , π¦1 , π¦2 , . . . , π¦π ) β π· be such that (0) (0) (0) (0) (0) F(π₯1 , π₯2 , . . . , π₯(0) π , π¦1 , π¦2 , . . . , π¦π ) = 0. If we diο¬erentiate this system of equations with respect to π₯π , we obtain the systems of linear equations βπ¦π βπ¦1 βπ¦2 F π₯π + F π¦1 + F π¦2 + β β β + F π¦π = 0, π = 1, 2, . . . , π, βπ₯π βπ₯π βπ₯π where all partial derivatives are evaluated at p0 . (0) (0) (0) To examine the solvability of these systems of equations, we ο¬rst deο¬ne x0 = (π₯1 , π₯2 , . . . , π₯π ), and denote the component functions of the vector-valued function F by F = β¨πΉ1 , πΉ2 , . . . , πΉπ β©. We then deο¬ne the Jacobian matrices β€ β‘ βπΉ1 βπΉ1 βπΉ1 (p ) (p ) (p ) 0 0 0 βπ₯1 βπ₯2 βπ₯π β₯ β’ βπΉ βπΉ2 2 (p ) β’ βπ₯12 (p0 ) βπΉ 0 βπ₯ βπ₯π (p0 ) β₯ 2 π½x,F (p0 ) = β’ β₯, β¦ β£ βπΉπ βπΉπ βπΉπ βπ₯1 (p0 ) βπ₯2 (p0 ) βπ₯π (p0 ) β‘ β’ β’ π½y,F (p0 ) = β’ β£ and β‘ β’ β’ π½y (x0 ) = β’ β£ βπ¦1 βπ¦1 (p0 ) βπ¦2 βπ¦1 (p0 ) βπΉ1 βπ¦2 (p0 ) βπΉ2 βπ¦2 (p0 ) βπΉ1 βπ¦π (p0 ) βπΉ2 βπ¦π (p0 ) βπ¦π βπ¦1 (p0 ) βπΉπ βπ¦2 (p0 ) βπΉπ βπ¦π (p0 ) βπ¦1 βπ₯1 (x0 ) βπ¦2 βπ₯1 (x0 ) βπ¦1 βπ₯2 (x0 ) βπ¦2 βπ₯2 (x0 ) βπ¦1 βπ₯π (x0 ) βπ¦1 βπ₯π (x0 ) βπ¦π βπ₯1 (x0 ) βπ¦π βπ₯2 (x0 ) βπ¦1 βπ₯π (x0 ) 6 β€ β₯ β₯ β₯, β¦ β€ β₯ β₯ β₯. β¦ Then, from our previous diο¬erentiation with respect to π₯π , for each π = 1, 2, . . . , π, we can concisely express our systems of equations as a single system π½x,F (p0 ) + π½y,F (p0 )π½y (x0 ) = 0. If the matrix π½y,F (p0 ) is invertible (also nonsingular), which is the case if and only if its determinant is nonzero, and if all ο¬rst partial derivatives of F are continuous near p0 , then the equation F(p) = 0 implicitly deο¬nes π¦1 , π¦2 , . . . , π¦π as a function of π₯1 , π₯2 , . . . , π₯π , and π½y (x0 ) = β[π½y,F (p0 )]β1 π½x,F (p0 ), where [π½y,F (p0 )]β1 is the inverse of the matrix π½y,F (p0 ). Example Let F : β4 β β2 by deο¬ned by [ ] [ ] πΉ1 (π₯, π¦, π’, π£) π₯π’ + π¦ 2 π£ F(π₯, π¦, π , π‘) = = . πΉ2 (π₯, π¦, π’, π£) π₯2 π£ + π¦π’ + 1 Then the vector equation F(π₯, π¦, π’, π£) = 0 implicitly deο¬nes (π’, π£) as a function of (π₯, π¦), provided that F satisifes the conditions of the Implicit Function Theorem. We will compute the partial derivatives of π’ and π£ with respect to π₯ and π¦, at a point that satisο¬es this equation. We have [ ] [ ] βπΉ1 βπΉ1 π’ 2π¦π£ βπ₯ βπ¦ π½(π₯,π¦),F (π₯, π¦, π’, π£) = βπΉ2 βπΉ2 = , 2π₯π£ π’ βπ₯ βπ¦ [ βπΉ1 βπΉ1 ] [ ] π₯ π¦2 βπ’ βπ£ π½(π’,π£),F (π₯, π¦, π’, π£) = βπΉ2 βπΉ2 = . π¦ π₯2 βπ’ βπ£ From the formula for the inverse of a 2 × 2 matrix, ]β1 [ ] [ 1 π π π βπ = , π π ππ β ππ βπ π we obtain [ π½(π’,π£) (π₯, π¦) = π’π₯ π’π¦ π£π₯ π£π¦ ] = β[π½(π’,π£),F (π₯, π¦, π’, π£)]β1 π½(π₯,π¦),F (π₯, π¦, π’, π£) [ 2 ][ ] 1 π₯ βπ¦ 2 π’ 2π¦π£ = β 3 π₯ 2π₯π£ π’ π₯ β π¦ 3 βπ¦ [ 2 ] 1 π₯ π’ β 2π₯π¦ 2 π£ 2π₯2 π¦π£ β π¦ 2 π’ = . 2π₯2 π£ β π¦π’ π₯π’ β 2π¦ 2 π£ π¦ 3 β π₯3 These partial derivatives can then be evaluated at any point (π₯, π¦, π’, π£) such that F(π₯, π¦, π’, π£) = 0, such as (π₯, π¦, π’, π£) = (0, 1, 0, β1). Note that the matrix π½(π’,π£),F (π₯, π¦, π’, π£) is not invertible (that is, singular) if its determinant π₯3 β π¦ 3 = 0; that is, if π₯ = π¦. When this is the case, (π’, π£) can not be implicitly deο¬ned as a function of (π₯, π¦). β‘ 7 Practice Problems 1. For each of the following pairs of functions f and g, use the Chain Rule to compute all of the ο¬rst partial derivatives of f β g with respect to the independent variables of g. (a) π (π₯, π¦, π§) = β π₯2 + π¦ 2 + π§ 2 , g(π‘) = β¨ππ‘ cos π‘, ππ‘ sin π‘, π‘ππ‘ β© (b) π (π₯, π¦) = ln β π₯2 + π¦ 2 , g(π’, π£) = β¨π’ cos π£, π’ sin π£β© (c) f (π₯, π¦) = β¨π₯3 + 3π¦ 2 , π¦ 3 β 3π₯2 β©, g(π‘) = β¨cos π‘, sin π‘β© (d) f (π’, π£) = β©( 1+ ( π£ π’) π£ π’) π π’ cos cos π’, 1 + cos sin π’, sin 2 2 2 2 2 2 βͺ , g(π₯, π¦) = β¨π₯2 π¦, π₯π¦ 2 β© 2. Compute the indicated derivatives or partial derivatives of the functions that are implicitly deο¬ned by the following equations. Then obtain the equation of the tangent line (or tangent plane) to the curve (surface) at the indicated point. β (a) πΉ (π₯, π¦) = (π₯2 + π¦ 2 )2 β 2(π₯2 β π¦ 2 ) = 0, ππ¦/ππ₯, (π₯0 , π¦0 ) = ( 3/2, 1/2) β (b) πΉ (π₯, π¦, π§) = π₯2 + π¦ 2 + π§ 2 β 8 π₯2 + π¦ 2 + 15 = 0, βπ§/βπ₯, βπ§/βπ¦, (π₯0 , π¦0 , π§0 ) = (4, 0, 1) 3. (Just for fun) Describe the graphs of the curves or surfaces deο¬ned by the equations in Problem 2. Additional Practice Problems Additional practice problems from the recommended textbooks are: β Stewart: Section 11.5, Exercises 1-11 odd, 17, 19 odd β Marsden/Tromba: Section 2.5, Exercises 5, 9, 11; Section 3.5, Exercises 3, 5 8
© Copyright 2026 Paperzz