Chain Rule

Chain Rule

Let f:IR and g:JR where I,J are intervals and f(I)J. Suppose f is differentiable at c and g is differentiable at f(c). Then (gf) is differentiable at c and is:

(gf)(c)=g(f(c))f(c)

It is tempting to use, in a similar way of deriving the Algebraic Differentiability Theorem, via the difference quotient:

g(f(x))g(f(c))xc=g(f(x))g(f(c))f(x)f(c)f(x)f(c)xc

Which we cannot use directly. We'll have to do some work to justify the f(x),f(c) bits in order to use this, which we do via μ below.

Proof

For yJ define the function:

μ(y)={g(y)g(f(c))yf(c)yf(c)g(f(c))y=f(c)

Now notice that μ is continuous at f(c) (J is on an interval, and since f(c) then:

limyf(c)μ(y)=limyf(c)g(y)g(f(c))yf(c)=g(f(c))=μ(f(c))

because g itself is continuous).

Now also notice that yJ, we claim that:

g(y)g(f(c))=μ(y)(yf(c))

which is true since when yf(c) it's definitely true (multiply it out) and for y=f(c) then the LHS becomes 0, which equals the RHS.

Now let xI such that xc. Apply the above with y=f(x). Then:

g(f(x))g(f(c))=μ(f(x))(f(x)f(c))g(f(x))g(f(c))xc=μ(f(x))f(x)f(c)xclimxcg(f(x))g(f(c))xc=limxc(μ(f(x))f(x)f(c)xc)=limxcμ(f(x))f(c)

(note we used the Algebraic Limit Theorem for Functional Limits on the last step). Now because f is continuous at c and μ is continuous, then fμ is continuous at c and thus it's limit is g(f(c)). As such then:

=μ(f(c))f(c)=g(f(c))f(c)

as desired.