1 Calculus 1.1 Gradients and the Derivative 0 + () (+) Let ( ()) and (+

1
Calculus
1.1
Gradients and the Derivative
𝑄
𝑓 (π‘₯+β„Ž)
𝛿𝑦 ((((𝑇
((
(
(
(
((((
𝑃
((
𝑅 𝑓 (π‘₯)
𝛿π‘₯
0
π‘₯
π‘₯+β„Ž
Let 𝑃 (π‘₯, 𝑓 (π‘₯)) and 𝑄(π‘₯+β„Ž, 𝑓 (π‘₯+β„Ž)) denote two points on the curve of the function 𝑦 = 𝑓 (π‘₯)
and let 𝑅 denote the point of intersection of the horizontal through 𝑃 with the vertical through
𝑄. Let 𝛿π‘₯ = (π‘₯ + β„Ž) βˆ’ π‘₯ = β„Ž and 𝛿𝑦 = 𝑓 (π‘₯ + β„Ž) βˆ’ 𝑓 (π‘₯). Then the gradient of the chord 𝑃 𝑄 is
given by
𝛿𝑦
𝑓 (π‘₯ + β„Ž) βˆ’ 𝑓 (π‘₯)
𝑄𝑅
=
=
.
𝑃𝑅
𝛿π‘₯
β„Ž
Let 𝑃 𝑇 denote the tangent to the curve at 𝑃 . The gradient to the curve 𝑦 = 𝑓 (π‘₯) at 𝑃 may be
approximated by the chord 𝑃 𝑄 by taking the point 𝑄 to be sufficiently close to 𝑃 . Let 𝑄 take
a succession of values taking it closer and closer to 𝑃 then the straight line through 𝑃 and 𝑄
approaches closer and closer to the tangent 𝑃 𝑇 . As 𝑄 becomes arbitrarily close to, but never
coinciding with 𝑃 , the line through 𝑃 𝑄 becomes arbitrarily close to the tangent 𝑃 𝑇 . We shall
write 𝑄 β†’ 𝑃 , and say that ”𝑄 tends to 𝑃 ”. We shall also say that 𝑄 tends to the limit 𝑃 .
Finally, we shall say that in the limit as 𝑄 tends to 𝑃 the gradient of the chord 𝑃 𝑄 tends to the
gradient of the tangent 𝑃 𝑇 . The notion of limit may then be extended from points to numbers
by saying that in the limit as 𝑄 β†’ 𝑃 we have that 𝛿π‘₯ β†’ 0, β„Ž β†’ 0 and 𝑓 (π‘₯ + β„Ž) β†’ 𝑓 (π‘₯). It is
clear now why we stipulate that 𝑃 must not coincide with 𝑄, for in this case, 𝛿π‘₯ = 0, β„Ž = 0,
𝑓 (π‘₯ + β„Ž) = 𝑓 (π‘₯) and the ratio 𝑄𝑅
is indeterminate, being of the form ”0/0”, sometimes called
𝑃𝑅
an indeterminate form. We now abbreviate the phrase ”in the limit as β„Ž tends to zero” by
limβ„Žβ†’0 and similarly for 𝛿π‘₯ tending to zero, we write lim𝛿π‘₯β†’0 . We define the gradient of the
tangent of the curve at the point 𝑃 to be the limiting value of the gradient of the chord 𝑃 𝑄 as
𝑄 β†’ 𝑃 and we call this the derivative of the function 𝑦 = 𝑓 (π‘₯) at π‘₯. Formally,
Definition 1.1
The derivative of the function 𝑦 = 𝑓 (π‘₯) at π‘₯ is
(
)
𝑓 (π‘₯ + β„Ž) βˆ’ 𝑓 (π‘₯)
β€²
𝑓 (π‘₯) = lim
β„Žβ†’0
β„Ž
1
or equivalently,
𝑑𝑦
𝛿𝑦
= lim
𝑑π‘₯ 𝛿π‘₯β†’0 𝛿π‘₯
Notes
1.
if 𝑄 approaches 𝑃 from the right (left) then β„Ž > 0 (β„Ž < 0).
2.
the notation
1.2
𝑑𝑦
𝑑π‘₯
does not indicate a ratio, it represents the limit of a ratio.
Limits
In the above diagram, 𝑦 = 𝑓 (π‘₯), 𝑃 is a fixed point, 𝑄 is a variable point, so π‘₯ is a fixed number,
and β„Ž is a variable. Let 𝑄 β†’ 𝑃 from either the left (β„Ž < 0) or right (β„Ž > 0), and consider the
values of 𝑓 (π‘₯ + β„Ž) as β„Ž β†’ 0. If β„Ž β†’ 0 and β„Ž > 0 (β„Ž < 0) write β„Ž β†’ 0+ (β„Ž β†’ 0βˆ’). Consider the
limits π‘™π‘–π‘šβ„Žβ†’0βˆ’ 𝑓 (π‘₯ + β„Ž), π‘™π‘–π‘šβ„Žβ†’0+ 𝑓 (π‘₯ + β„Ž). If a limit gives a definite value, the limit is said to
exist. If it does not give a definite value, the limit does not exist. For the function whose graph
is shown in the diagram, the three numbers 𝑓 (π‘₯), π‘™π‘–π‘šβ„Žβ†’0+ 𝑓 (π‘₯ + β„Ž), π‘™π‘–π‘šβ„Žβ†’0βˆ’ 𝑓 (π‘₯ + β„Ž) all exist
and are equal. In this case 𝑦 = 𝑓 (π‘₯) is said to be continuous at π‘₯. If it happens that the three
numbers are not all equal or some of them do not exist, then 𝑦 = 𝑓 (π‘₯) is discontinuous at π‘₯.
Many of the functions that we are familiar with are continuous, for example all polynomials,
sinπ‘₯, cosπ‘₯, expπ‘₯. But, many, while being continuous for most values of π‘₯ also have some points
of discontinuity, for example rational functions (at points where the denominator is zero but
the numerator is not), tanπ‘₯, cosecπ‘₯, secπ‘₯, cotπ‘₯.
Let 𝑓 (π‘₯) = π‘₯2 and π‘₯ = 2 then π‘™π‘–π‘šβ„Žβ†’0βˆ’ (2 + β„Ž)2 = 22 = π‘™π‘–π‘šβ„Žβ†’0+ (2 + β„Ž)2 . In
this case, both limits and the value of the function exist and all three numbers are equal.
Example 1.2
1.
2.
Let 𝑓 (π‘₯) = π‘₯1 and π‘₯ = 0 then 𝑓 (0) is not defined. Also, if π‘₯ = β„Ž and β„Ž > 0 then as
β„Ž β†’ 0, β„Ž1 takes larger and larger values without bound. We write 𝑓 (π‘₯) β†’ ∞ as π‘₯ β†’ 0+.
Similarly, if β„Ž < 0, β„Ž1 takes negative values with larger and larger magnitude. We write
𝑓 (π‘₯) β†’ βˆ’βˆž as π‘₯ β†’ 0βˆ’. In this case, the value of the function is not defined and either
the limits do not exist, or if one allows the possibility of ±βˆž as a limit, then the limits
are not equal.
3.
The function
π‘₯2 + π‘₯
π‘₯
is not defined at π‘₯ = 0 since division by zero is not allowed. It is not the same function
𝑓 (π‘₯) =
as 𝑔(π‘₯) = π‘₯ + 1. But for any value of π‘₯ other than 0, we do have 𝑓 (π‘₯) = 𝑔(π‘₯) = π‘₯ + 1 and
hence, since g(x) is continuous,
lim 𝑓 (π‘₯) = 1
π‘₯β†’0
i.e. as π‘₯ becomes closer and closer to 0, 𝑓 (π‘₯) becomes closer and closer to 1. In this case
the value of the function does not exist, while the limits exist and are equal.
2
Now we change notation and allow π‘₯ to be a variable.
Theorem 1.3
Properties of Limits
Let 𝑐 be a fixed number, and 𝑓 (π‘₯) and 𝑔(π‘₯) functions of π‘₯. Then provided the limits exist
(
) (
)
lim (𝑓 (π‘₯) + 𝑔(π‘₯)) = lim 𝑓 (π‘₯) + lim 𝑔(π‘₯)
π‘₯→𝑐
π‘₯→𝑐
π‘₯→𝑐
i.e. the limit of a sum equals the sum of the limits.
(
)(
)
lim (𝑓 (π‘₯)𝑔(π‘₯)) = lim 𝑓 (π‘₯) lim 𝑔(π‘₯)
π‘₯→𝑐
π‘₯→𝑐
π‘₯→𝑐
i.e. the limit of a product equals the product of the limits.
The proof is beyond the scope of this course.
An important example of a limit is the following:
Theorem 1.4
For πœƒ measured in radians, as πœƒ β†’ 0, we have
sin πœƒ
β†’ 1.
πœƒ
Alternatively,
sin πœƒ
= 1.
πœƒβ†’0
πœƒ
lim
Proof: Once again, we cannot substitute πœƒ = 0 as this gives a division by zero. We use a
geometric argument. Let the circle be of radius π‘Ÿ.
Z
Z
Z
Z
Z
𝐷
Z
AZ
AZ
Z
Z
A
πœƒ
Z 𝐢
A
Z
A
Z
𝐴
𝐡
Z
Z
Z
Z
Since the line 𝐢𝐷 is a tangent to the circle, angle 𝐴𝐷𝐢 is a right angle. Now consider the
three areas: △𝐴𝐷𝐡 < sector 𝐴𝐷𝐡 < △𝐴𝐷𝐢. The area of △𝐴𝐷𝐡 is 21 π‘Ÿ2 sin πœƒ, the area of the
πœƒ
1
sector is
× πœ‹π‘Ÿ2 = π‘Ÿ2 πœƒ and the area of △𝐴𝐷𝐢 is 12 π‘Ÿ2 tan πœƒ. Hence,
2πœ‹
2
1 2
1
1
π‘Ÿ sin πœƒ < π‘Ÿ2 πœƒ < π‘Ÿ2 tan πœƒ
2
2
2
3
dividing by 12 π‘Ÿ2 gives
sin πœƒ < πœƒ < tan πœƒ
The first inequality gives
sin πœƒ
<1
πœƒ
while the second gives
cos πœƒ <
Now let πœƒ β†’ 0 then cos πœƒ β†’ 1 and so
Hence the result.
1.3
sin πœƒ
.
πœƒ
sin πœƒ
is trapped between 1 and a number tending to 1.
πœƒ
Differentiation
The process of finding the derivative of a function is called differentiation. Having defined the
derivative as the gradient of a curve, we now show how to use the definition to find derivatives
algebraically.
Example 1.5
Let 𝑦 = π‘₯2 . Then we have
(
)
)
(
𝑑𝑦
𝑓 (π‘₯ + β„Ž) βˆ’ 𝑓 (π‘₯)
(π‘₯ + β„Ž)2 βˆ’ π‘₯2
= lim
= lim
β„Žβ†’0
𝑑π‘₯ β„Žβ†’0
β„Ž
β„Ž
( 2
)
(
)
π‘₯ + 2π‘₯β„Ž + β„Ž2 βˆ’ π‘₯2
2π‘₯β„Ž + β„Ž2
= lim
= lim
= lim (2π‘₯ + β„Ž) = 2π‘₯
β„Žβ†’0
β„Žβ†’0
β„Žβ†’0
β„Ž
β„Ž
So, the derivative of π‘₯2 is 2π‘₯.
The next example generalises this result to higher powers of π‘₯.
Example 1.6
Let 𝑦 = π‘₯𝑛 , with 𝑛 a positive integer. Then
(
)
(
)
𝑑𝑦
𝑓 (π‘₯ + β„Ž) βˆ’ 𝑓 (π‘₯)
(π‘₯ + β„Ž)𝑛 βˆ’ π‘₯𝑛
= lim
= lim
β„Žβ†’0
𝑑π‘₯ β„Žβ†’0
β„Ž
β„Ž
Expand (π‘₯ + β„Ž)𝑛 using the binomial theorem β€” only the first few terms will be needed:
( 𝑛
)
π‘₯ + 𝑛π‘₯π‘›βˆ’1 β„Ž + 21 𝑛(𝑛 βˆ’ 1)π‘₯π‘›βˆ’2 β„Ž2 + 16 𝑛(𝑛 βˆ’ 1)(𝑛 βˆ’ 2)π‘₯π‘›βˆ’3 β„Ž3 + ... + β„Žπ‘› βˆ’ π‘₯𝑛
= lim
β„Žβ†’0
β„Ž
(
)
1
1
π‘›βˆ’1
π‘›βˆ’2
π‘›βˆ’3 2
π‘›βˆ’1
= lim 𝑛π‘₯
+ 𝑛(𝑛 βˆ’ 1)π‘₯ β„Ž + 𝑛(𝑛 βˆ’ 1)(𝑛 βˆ’ 2)π‘₯ β„Ž + ... + β„Ž
β„Žβ†’0
2
6
cancelling first the π‘₯𝑛 ’s and then an β„Ž from the numerator and denominator. Now, let β„Ž β†’ 0
then all but the first term becomes zero, so
𝑑𝑦
= 𝑛π‘₯π‘›βˆ’1
𝑑π‘₯
Note: Cancelling the β„Žβ€™s, eliminated the problem of division by 0 and effectively enabled us to
put β„Ž = 0 in the remaining formula.
4
Example 1.7
Let 𝑦 = sin π‘₯. Then
(
)
(
)
𝑑𝑦
𝑓 (π‘₯ + β„Ž) βˆ’ 𝑓 (π‘₯)
sin (π‘₯ + β„Ž) βˆ’ sin π‘₯
= lim
= lim
β„Žβ†’0
𝑑π‘₯ β„Žβ†’0
β„Ž
β„Ž
Now, in the identity sin 𝑋 βˆ’ sin π‘Œ = 2 cos ( 𝑋+π‘Œ
) sin ( π‘‹βˆ’π‘Œ
), let 𝑋 = π‘₯ + β„Ž, π‘Œ = π‘₯ then
2
2
(
)
β„Ž
2 cos ( 2π‘₯+β„Ž
)
sin
(
)
𝑑𝑦
2
2
= lim
𝑑π‘₯ β„Žβ†’0
β„Ž
(
)
β„Ž sin ( β„Ž2 )
= lim cos (π‘₯ + ) β„Ž
β„Žβ†’0
2
2
)
(
(
)
β„Ž
)
sin
(
β„Ž
2
= lim cos (π‘₯ + ) lim
β„Ž
β„Žβ†’0
β„Žβ†’0
2
2
and from theorem 1.4,
sin ( β„Ž2 )
β„Ž
2
β†’ 1 as β„Ž β†’ 0, which leaves us with
(
)
β„Ž
𝑑𝑦
= lim cos (π‘₯ + ) = cos π‘₯
𝑑π‘₯ β„Žβ†’0
2
Similarly, we may differentiate π‘₯βˆ’π‘› for positive integral 𝑛, and cos π‘₯ (see example sheets)
and also the functions 𝑒π‘₯ and ln π‘₯. We summarise a few important derivatives which should be
committed to memory.
1.3.1
Some Important Derivatives
Function
Derivative
π‘₯𝑛
𝑛π‘₯π‘›βˆ’1
sin π‘₯
cos π‘₯
cos π‘₯
βˆ’sin π‘₯
π‘₯
𝑒
𝑒π‘₯
ln π‘₯
1
π‘₯
Multiplication by a Scalar and a Sum of two functions
Let 𝑒(π‘₯) and 𝑣(π‘₯) be functions of π‘₯ whose derivatives are known and let π‘˜ be a constant number.
Theorem 1.8
1.
Then the derivative of the function π‘˜ 𝑒(π‘₯), is given by
𝑑
𝑑𝑒
(π‘˜ 𝑒) = π‘˜
𝑑π‘₯
𝑑π‘₯
2.
Then the derivative of their sum 𝑒(π‘₯) + 𝑣(π‘₯), is given by
𝑑
𝑑𝑒 𝑑𝑣
(𝑒 + 𝑣) =
+
𝑑π‘₯
𝑑π‘₯ 𝑑π‘₯
5
Proof: Using the definition of derivative, let 𝑓 (π‘₯) = 𝑒(π‘₯) + 𝑣(π‘₯)
(
)
(
)
𝑑𝑓
𝑓 (π‘₯ + β„Ž) βˆ’ 𝑓 (π‘₯)
𝑒(π‘₯ + β„Ž) + 𝑣(π‘₯ + β„Ž) βˆ’ 𝑒(π‘₯) βˆ’ 𝑣(π‘₯)
= lim
= lim
β„Žβ†’0
𝑑π‘₯ β„Žβ†’0
β„Ž
β„Ž
(
)
𝑒(π‘₯ + β„Ž) βˆ’ 𝑒(π‘₯) 𝑣(π‘₯ + β„Ž) βˆ’ 𝑣(π‘₯)
= lim
+
β„Žβ†’0
β„Ž
β„Ž
)
(
)
(
𝑣(π‘₯ + β„Ž) βˆ’ 𝑣(π‘₯)
𝑑𝑒 𝑑𝑣
𝑒(π‘₯ + β„Ž) βˆ’ 𝑒(π‘₯)
+ lim
=
+
= lim
β„Žβ†’0
β„Žβ†’0
β„Ž
β„Ž
𝑑π‘₯ 𝑑π‘₯
A function 𝑓 (π‘₯) may be the product of two simpler functions, e.g. 𝑓 (π‘₯) = (π‘₯2 + 1)𝑒π‘₯ is a
product of π‘₯2 + 1 and 𝑒π‘₯ . Then 𝑓 (π‘₯) may be differentiated using the product rule.
Theorem 1.9
Product Rule
Let 𝑓 (π‘₯) = 𝑒(π‘₯)𝑣(π‘₯), where 𝑒(π‘₯) and 𝑣(π‘₯) are functions of π‘₯. Then
𝑑
𝑑𝑒
𝑑𝑣
(𝑒(π‘₯)𝑣(π‘₯)) = 𝑣
+𝑒
𝑑π‘₯
𝑑π‘₯
𝑑π‘₯
Proof: First Method. Let 𝑦(π‘₯) = 𝑒(π‘₯)𝑣(π‘₯). As π‘₯ is incremented to π‘₯ + 𝛿π‘₯, let 𝑒, 𝑣, 𝑦 change
to 𝑒 + 𝛿𝑒, 𝑣 + 𝛿𝑣, 𝑦 + 𝛿𝑦 then
𝑦(π‘₯ + 𝛿π‘₯) = 𝑦 + 𝛿𝑦 = (𝑒 + 𝛿𝑒)(𝑣 + 𝛿𝑣)
and so
𝛿𝑦 = (𝑒 + 𝛿𝑒)(𝑣 + 𝛿𝑣) βˆ’ 𝑒𝑣
i.e.
𝛿𝑦 = 𝑒𝛿𝑣 + 𝑣𝛿𝑒 + 𝛿𝑒𝛿𝑣
Using the definition of derivative,
( )
𝑑𝑦
𝛿𝑦
= lim
𝑑π‘₯ 𝛿π‘₯β†’0 𝛿π‘₯
(
)
𝑒𝛿𝑣 + 𝑣𝛿𝑒 + 𝛿𝑒𝛿𝑣
= lim
𝛿π‘₯β†’0
𝛿π‘₯
i.e.
(
)
𝛿𝑣
𝛿𝑒
𝛿𝑣
= lim 𝑒 + 𝑣 + 𝛿𝑒
𝛿π‘₯β†’0
𝛿π‘₯
𝛿π‘₯
𝛿π‘₯
now using the theorems about limits,
(
)
(
)
(
)
𝛿𝑣
𝛿𝑒
𝛿𝑣
= lim 𝑒
+ lim 𝑣
+ lim 𝛿𝑒
𝛿π‘₯β†’0
𝛿π‘₯β†’0
𝛿π‘₯β†’0
𝛿π‘₯
𝛿π‘₯
𝛿π‘₯
(
)
𝛿𝑣
𝛿𝑒
𝛿𝑣
= 𝑒 lim
+ 𝑣 lim
+ lim 𝛿𝑒 lim
𝛿π‘₯β†’0 𝛿π‘₯
𝛿π‘₯β†’0 𝛿π‘₯
𝛿π‘₯β†’0
𝛿π‘₯β†’0 𝛿π‘₯
𝑑𝑣
𝑑𝑒
=𝑒 +𝑣
𝑑π‘₯
𝑑π‘₯
6
which is the required formula, and completes the proof.
Second method. Again, using the definition of derivative,
(
)
𝑑𝑓
𝑓 (π‘₯ + β„Ž) βˆ’ 𝑓 (π‘₯)
= lim
𝑑π‘₯ β„Žβ†’0
β„Ž
)
(
𝑒(π‘₯ + β„Ž)𝑣(π‘₯ + β„Ž) βˆ’ 𝑒(π‘₯)𝑣(π‘₯)
= lim
β„Žβ†’0
β„Ž
)
(
𝑒(π‘₯ + β„Ž)𝑣(π‘₯ + β„Ž) βˆ’ 𝑒(π‘₯ + β„Ž)𝑣(π‘₯) + 𝑒(π‘₯ + β„Ž)𝑣(π‘₯) βˆ’ 𝑒(π‘₯)𝑣(π‘₯)
= lim
β„Žβ†’0
β„Ž
)
(
))
(
(
𝑒(π‘₯ + β„Ž) βˆ’ 𝑒(π‘₯)
𝑣(π‘₯ + β„Ž) βˆ’ 𝑣(π‘₯)
+ 𝑣(π‘₯)
= lim 𝑒(π‘₯ + β„Ž)
β„Žβ†’0
β„Ž
β„Ž
but the limit of a sum is the sum of the limits
(
(
))
(
(
))
𝑣(π‘₯ + β„Ž) βˆ’ 𝑣(π‘₯)
𝑒(π‘₯ + β„Ž) βˆ’ 𝑒(π‘₯)
= lim 𝑒(π‘₯ + β„Ž)
+ lim 𝑣(π‘₯)
β„Žβ†’0
β„Žβ†’0
β„Ž
β„Ž
and letting β„Ž β†’ 0, 𝑒(π‘₯ + β„Ž) β†’ 𝑒(π‘₯), giving
(
)
(
)
𝑣(π‘₯ + β„Ž) βˆ’ 𝑣(π‘₯)
𝑒(π‘₯ + β„Ž) βˆ’ 𝑒(π‘₯)
= 𝑒(π‘₯) lim
+ 𝑣(π‘₯) lim
β„Žβ†’0
β„Žβ†’0
β„Ž
β„Ž
=𝑒
𝑑𝑒
𝑑𝑣
+𝑣
𝑑π‘₯
𝑑π‘₯
which is the required formula.
Example 1.10
Let 𝑦 = π‘₯2 sin π‘₯. Let 𝑒 = π‘₯2 and 𝑣 = sin π‘₯. Then
𝑑𝑦
𝑑
𝑑𝑒
𝑑𝑣
=
(𝑒𝑣) = 𝑣
+𝑒
= (sin π‘₯)2π‘₯ + π‘₯2 (cos π‘₯)
𝑑π‘₯
𝑑π‘₯
𝑑π‘₯
𝑑π‘₯
= 2π‘₯ sin π‘₯ + π‘₯2 cos π‘₯
Example 1.11
Let 𝑦 = π‘₯ln π‘₯. We can let 𝑒 = π‘₯ and 𝑣 = ln π‘₯. Then
( )
𝑑𝑦
𝑑
𝑑𝑒
𝑑𝑣
1
=
(𝑒𝑣) = 𝑣
+𝑒
= (ln π‘₯)1 + π‘₯
𝑑π‘₯
𝑑π‘₯
𝑑π‘₯
𝑑π‘₯
π‘₯
= 1 + ln π‘₯
Suppose we wish to differentiate the function 𝑦 = sin (π‘₯2 ). This is a function of a function in
the sense that the output from the function π‘₯2 is the input for the 𝑠𝑖𝑛𝑒 function. Let 𝑒 = π‘₯2 ,
then 𝑦 = sin𝑒, and we now have two simpler functions, each of which we can differentiate
differentiate. The following theorem enables us to differentiate the composite function 𝑦 =
sin (π‘₯2 ). A function like this is called a function of a function. In the example, 𝑦 is the function
sin of the function π‘₯2 .
7
Theorem 1.12
Chain Rule or Function of a Function Rule Let 𝑦 = 𝑓 (𝑒) and 𝑒 = 𝑔(π‘₯) then the
derivative of the function 𝑦 = 𝑓 (𝑔(π‘₯)) is given by
𝑑𝑦
𝑑𝑦 𝑑𝑒
=
𝑑π‘₯
𝑑𝑒 𝑑π‘₯
Proof: From the definition of derivative
𝑑𝑦
𝛿𝑦
= lim
𝑑π‘₯ 𝛿π‘₯β†’0 𝛿π‘₯
where 𝛿π‘₯ represents an increment in π‘₯ and 𝛿𝑦 representing the corresponding increment in 𝑦.
Let the change in 𝑒 corresponding to an increment 𝛿π‘₯ in π‘₯ be 𝛿𝑒 then
𝛿𝑦
𝛿𝑦 𝛿𝑒
=
𝛿π‘₯
𝛿𝑒 𝛿π‘₯
and so
𝑑𝑦
𝛿𝑦 𝛿𝑒
= lim
.
𝑑π‘₯ 𝛿π‘₯β†’0 𝛿𝑒 𝛿π‘₯
Using the theorems on limits,
𝑑𝑦
=
𝑑π‘₯
(
𝛿𝑦
lim
𝛿π‘₯β†’0 𝛿𝑒
) (
𝛿𝑒
lim
𝛿π‘₯β†’0 𝛿π‘₯
)
.
𝑑𝑦 𝑑𝑒
𝑑𝑒 𝑑π‘₯
which is the derivative of 𝑦 with respect to 𝑒 multiplied by the derivative of 𝑒 with respect to
π‘₯.
=
Example 1.13
Differentiate 𝑦 = sin (π‘₯2 ). Let 𝑒 = π‘₯2 then 𝑦 = sin𝑒 and
𝑑𝑦
= cos 𝑒,
𝑑𝑒
𝑑𝑒
= 2π‘₯.
𝑑π‘₯
Hence, using the chain rule
𝑑𝑦
𝑑𝑦 𝑑𝑒
=
= (cos 𝑒)2π‘₯ = 2π‘₯ cos π‘₯2
𝑑π‘₯
𝑑𝑒 𝑑π‘₯
With practice it is possible to use the chain rule without explicitly making the substitution.
Example 1.14
Let 𝑦 = 𝑒sin π‘₯ . Let 𝑒 = sin π‘₯, then 𝑦 = 𝑒𝑒 . Now, applying the chain rule:
𝑑𝑦
𝑑𝑦 𝑑𝑒
=
= 𝑒𝑒 cos π‘₯ = cos π‘₯ 𝑒sin π‘₯
𝑑π‘₯
𝑑𝑒 𝑑π‘₯
Example 1.15
Let 𝑦 = ln (π‘₯ + 3). Let 𝑒 = π‘₯ + 3, then 𝑦 = ln 𝑒. Now, applying the chain rule:
𝑑𝑦
𝑑𝑦 𝑑𝑒
1
1
=
= 1=
𝑑π‘₯
𝑑𝑒 𝑑π‘₯
𝑒
π‘₯+3
8
Example 1.16
Let 𝑦 =
1
𝑀
= π‘€βˆ’1 for any function 𝑀 of π‘₯. Now, applying the chain rule:
𝑑𝑦
𝑑𝑦 𝑑𝑀
𝑑𝑀
1 𝑑𝑀
=
= βˆ’π‘€βˆ’2
=βˆ’ 2
𝑑π‘₯
𝑑𝑀 𝑑π‘₯
𝑑π‘₯
𝑀 𝑑π‘₯
Theorem 1.17
Quotient Rule. Let
𝑒
= 𝑒𝑣 βˆ’1
𝑣
for any functions 𝑒 and 𝑣 of π‘₯. Let 𝑀 = 𝑣 βˆ’1 and apply the product rule:
(
)
𝑑𝑦
𝑑𝑒
𝑑𝑀
1 𝑑𝑣
βˆ’1 𝑑𝑒
=𝑀
+𝑒
=𝑣
+𝑒 βˆ’ 2
𝑑π‘₯
𝑑π‘₯
𝑑π‘₯
𝑑π‘₯
𝑣 𝑑π‘₯
𝑦=
=
𝑑𝑣
𝑣 𝑑𝑒
βˆ’ 𝑒 𝑑π‘₯
𝑑π‘₯
𝑣2
This is known as the quotient rule.
Example 1.18
Let 𝑦 =
𝑒π‘₯
. So, using the quotient rule:
π‘₯
𝑣 𝑑𝑒 βˆ’ 𝑒 𝑑𝑣
π‘₯𝑒π‘₯ βˆ’ 𝑒π‘₯
𝑑𝑦
= 𝑑π‘₯ 2 𝑑π‘₯ =
=
𝑑π‘₯
𝑣
π‘₯2
Example 1.19
Let 𝑦 = tan π‘₯ =
(
π‘₯βˆ’1
π‘₯2
)
𝑒π‘₯
sin π‘₯
. So, we using the quotient rule:
cos π‘₯
𝑣 𝑑𝑒 βˆ’ 𝑒 𝑑𝑣
cos π‘₯ cos π‘₯ βˆ’ sin π‘₯ (βˆ’cos π‘₯)
𝑑𝑦
= 𝑑π‘₯ 2 𝑑π‘₯ =
𝑑π‘₯
𝑣
cos2 π‘₯
1
=
= sec2 π‘₯
2
cos π‘₯
Using the basic functions, and the rules for differentiating products, quotients and function–
of–a–function, it is possible to differentiate a wide range of functions. Two last examples should
suffice:
2
𝑒π‘₯
2
Example 1.20 Let 𝑦 =
. This is a quotient, so we use that rule, but along the way 𝑒π‘₯
sin π‘₯
needs to be differentiated using the chain rule.
2
2
𝑣 𝑑𝑒 βˆ’ 𝑒 𝑑𝑣
𝑑𝑦
(sin π‘₯)(2π‘₯𝑒π‘₯ ) βˆ’ (𝑒π‘₯ )(cos π‘₯)
= 𝑑π‘₯ 2 𝑑π‘₯ =
𝑑π‘₯
𝑣
sin2 π‘₯
2
𝑒π‘₯
= (2π‘₯sin π‘₯ βˆ’ cos π‘₯) 2
sin π‘₯
Example 1.21 Let 𝑦 = (1 + cos π‘₯2 )6 . This is a function of a function of a function: let 𝑒 =
1 + cos π‘₯2 then 𝑦 = 𝑒6 may be differentiated with respect to 𝑒. To differentiate 𝑒 = 1 + cos π‘₯2
we may use the chain rule again to differentiate 𝑒 w.r.t π‘₯. So let 𝑣 = π‘₯2 then 𝑒 = 1 + cos 𝑣.
The formula for the derivative becomes
𝑑𝑦
𝑑𝑦 𝑑𝑒 𝑑𝑣
=
𝑑π‘₯
𝑑𝑒 𝑑𝑣 𝑑π‘₯
9
Note: this is the product of all the functions, differentiated with respect to their argument and
multiplied together (justifying the term chain rule). Hence,
𝑑𝑦
= 6𝑒5 (βˆ’sin 𝑣) 2π‘₯
𝑑π‘₯
= 6(1 + cos π‘₯2 )5 (βˆ’2π‘₯ sin π‘₯2 ) = βˆ’12π‘₯ sin π‘₯2 (1 + cos π‘₯2 )5
1.4
Implicit Differentiation
Let 𝑦 = 𝑓 (π‘₯) then 𝑦 is called an explicit function of π‘₯, e.g. 𝑦 = π‘₯2 . Substituting a value of π‘₯
gives a value of 𝑦. However, consider the example
π‘₯2 + 𝑦 2 = 1.
which represents the equation of a circle. In this case substituting a value of π‘₯ such that
βˆ’1 ≀ π‘₯ ≀ +1 merely gives a value for 𝑦 2 . To find 𝑦 further operations are required, including
taking the√square root. Here, 𝑦 is not a function of π‘₯, since rearranging the expression to
get 𝑦 = ± 1 βˆ’ π‘₯2 , we may have two values for 𝑦, or 1 value or even none ( a function must
give exactly one value). The general case is often written 𝑓 (π‘₯, 𝑦) = 0 and is called an implicit
relation between π‘₯ and 𝑦. In order to differentiate an implicit expression, we proceed as follows.
Example 1.22
In chapter 2 we obtained the trigonometric identity sin 2π‘₯ = 2sin π‘₯ cos π‘₯.
Suppose we differentiate both sides (the left is a function of a function, while the right is a
product):
2 cos 2π‘₯ = 2(sin π‘₯)(βˆ’sin π‘₯) + (cos π‘₯)(cos π‘₯) = 2(cos2 π‘₯ βˆ’ sin2 π‘₯)
and cos2 π‘₯βˆ’sin2 π‘₯ is the formula for cos2π‘₯, so, differentiating an identity also gives an identity.
We can also differentiate:
)
𝑑 ( 2
𝑑
(1)
π‘₯ + 𝑦2 =
𝑑π‘₯
𝑑π‘₯
𝑑 ( 2)
𝑑 ( 2)
π‘₯ +
𝑦 =0
𝑑π‘₯
𝑑π‘₯
𝑑
To obtain 𝑑π‘₯
(𝑦 2 ) we use the chain rule: let 𝑒 = 𝑦 2 , then
𝑑𝑒
𝑑𝑒 𝑑𝑦
𝑑𝑦
=
= 2𝑦 .
𝑑π‘₯
𝑑𝑦 𝑑π‘₯
𝑑π‘₯
Hence
2π‘₯ + 2𝑦
𝑑𝑦
=0
𝑑π‘₯
and rearranging this gives:
𝑑𝑦
π‘₯
=βˆ’
𝑑π‘₯
𝑦
Note that the right hand side contains 𝑦, and this may be obtained by solving the implicit
relation for 𝑦, if required. Implicit differentiation is very useful in finding the derivatives of
inverse functions.
10
1.5
Example
Let 𝑦 = tanβˆ’1 π‘₯, the inverse tan function (and not
1
).
tan π‘₯
We can rearrange the formula so as
to eliminate the inverse function in favour of π‘‘π‘Žπ‘› by taking the tangent of both sides, to obtain
π‘₯ = tan 𝑦 and then differentiate both sides:
𝑑
𝑑
(π‘₯)=
(tan 𝑦)
𝑑π‘₯
𝑑π‘₯
𝑑𝑦
𝑑π‘₯
Now, from the identity sec2 𝑦 = 1 + tan2 𝑦 = 1 + π‘₯2 , we get
1 = sec2 𝑦
𝑑𝑦
1
=
𝑑π‘₯
1 + π‘₯2
which is the formula for the derivative of tanβˆ’1 π‘₯.
We end this section on differentiation by a summary of results. Some of these have been
proved, or solved in examples. Others are on exercise sheets.
Function
Derivative
π‘₯𝑛
𝑛π‘₯π‘›βˆ’1
sin π‘₯
cos π‘₯
cos π‘₯
βˆ’sin π‘₯
tan π‘₯
sec2 π‘₯
cot π‘₯
βˆ’cosec2 π‘₯
sec π‘₯
sec π‘₯ tan π‘₯
cosec π‘₯
βˆ’cosec π‘₯ cot π‘₯
π‘₯
𝑒
𝑒π‘₯
ln π‘₯
1
π‘₯
sinβˆ’1 π‘₯
√
cosβˆ’1 π‘₯
tanβˆ’1 π‘₯
2
1
1 βˆ’ π‘₯2
βˆ’1
√
1 βˆ’ π‘₯2
1
1 + π‘₯2
Applications of the Derivative
𝑑𝑦
The derivative has been defined as the gradient of a curve. In other words, 𝑑π‘₯
represents the
rate of change of 𝑦 with respect to π‘₯. As an example, if 𝑦 represents distance, and π‘₯ time,
this becomes the rate of change of distance with respect to time, i.e. speed. Note also, if we
differentiate again, we measure the rate of change of speed, i.e. acceleration.
𝑑𝑓
Suppose we have a function 𝑓 (π‘₯). We may interpret 𝑑π‘₯
= 0 as follows.
11
𝑓 (π‘₯)
𝑑𝑓
= 0, the gradient of the graph is zero and the tangent to the curve is horizontal.
When 𝑑π‘₯
A point at which this happens is called a stationary point or critical point. There are three
possibilities. The diagram shows a local maximum and the gradient is changing from positive
to negative as we move from left to right through the stationary point. Similarly, we may draw
a local minimum and in this case the gradient changes from negative to positive as we move
from left to right through the stationary point. Such points are called turning points. The third
possibility is that the sign of the gradient remains the same (either positive or negative) either
side of the stationary point and this is called a point of inflection. An example is provided by
the curve of 𝑦 = π‘₯3 at the point π‘₯ = 0, the gradient is zero there but the gradient is positive
everywhere else.
2.1
Example
Find the maximum value of the function 𝑦 = 2 + 3π‘₯ βˆ’ π‘₯2
Solution: Differentiating,
𝑑𝑦
= 3 βˆ’ 2π‘₯ = 0
𝑑π‘₯
which gives π‘₯ = 1.5. Substituting this into the equation, we get 𝑦 = 4.25. To see whether it is
a maximum or a minimum, we may calculate the gradient either side of π‘₯ = 1.5,
𝑓 β€² (1.4) = 0.2 > 0
𝑓 β€² (1.6) = βˆ’0.2 < 0
so we deduce that 4.25 is the maximum.
𝑑𝑦
A function may have several values of π‘₯ where 𝑑π‘₯
= 0, for example, sin π‘₯ has many (in fact
infinitely many) maxima and minima.
There is an alternative way of classifying stationary points, which involves finding the second
derivative of the function and evaluating it at the stationary point.
Theorem 2.1
Second derivative test. Let 𝑦 = 𝑓 (π‘₯) and let
if
𝑑2 𝑦
>0
𝑑π‘₯2
12
𝑑𝑦
𝑑π‘₯
= 0 at the point π‘₯ = π‘₯0 . Then 1.
at π‘₯ = π‘₯0 then the stationary point is a minimum.
2. if
𝑑2 𝑦
<0
𝑑π‘₯2
at π‘₯ = π‘₯0 then the stationary point is a maximum.
3. if
𝑑2 𝑦
=0
𝑑π‘₯2
at π‘₯ = π‘₯0 then the test fails and further investigation is needed.
Example 2.2
Find and classify the turning points of the function 𝑦 = π‘₯3 βˆ’ 3π‘₯ + 1.
Solution: Differentiating,
𝑑𝑦
= 3π‘₯2 βˆ’ 3 = 0
𝑑π‘₯
so there are two turning points at π‘₯ = 1 and π‘₯ = βˆ’1. Using the second derivative test:
𝑑2 𝑦
= 6π‘₯
𝑑π‘₯2
2
𝑑 𝑦
If π‘₯ = +1 then 𝑑π‘₯
2 = +6 which is positive, so we have a minimum.
𝑑2 𝑦
If π‘₯ = βˆ’1 then 𝑑π‘₯2 = βˆ’6 which is negative, so we have a maximum.
Let π‘₯ = 0 then 𝑦 = 1. This gives sufficient information to sketch the curve.
13
3
1
1
βˆ’1
βˆ’1
More examples of maxima and minima can be found on the problem sheets.
14