The Derivative

Part 2, Chapter 2: Single-Variable Real Analysis

Learning objectives

  • Define the derivative as f(a)=limh0(f(a+h)f(a))/hf'(a)=\lim_{h\to 0}(f(a+h)-f(a))/h
  • Prove that differentiability implies continuity, but not conversely
  • State and apply the Mean Value Theorem and its sign-of-derivative corollaries
  • Use Taylor's theorem to approximate functions by polynomials

The derivative is the prototype of all linearisation. Real systems are rarely linear, but they are locally linear, that is precisely the geometric content of f(a)f'(a). Whenever you read about Newton's method, gradient descent, Kalman filters, control theory, or first-order asymptotic analysis, you are watching the derivative do its real job: providing the best linear approximation to a non-linear function at a point. The Mean Value Theorem (MVT) then turns local approximation into global conclusions, like "if the derivative is positive everywhere, the function is increasing".

The derivative as a limit

The derivative of ff at aa is f(a)=limhto0fracf(a+h)f(a)hf'(a)=\lim_{h\to 0}\frac{f(a+h)-f(a)}{h}hto0fracf(a+h)f(a)h, provided this limit exists. Geometrically it is the slope of the tangent line; physically it is an instantaneous rate of change. The fraction (f(a+h)f(a))/h(f(a+h)-f(a))/h is the difference quotient, the slope of the secant from (a,f(a))(a,f(a)) to (a+h,f(a+h))(a+h,f(a+h)). Differentiation is the operation of taking the limit of secant slopes.

Differentiability implies continuity (not conversely)

If f(a)f'(a) exists, then f(x)f(a)=(xa)cdotfracf(x)f(a)xaf(x)-f(a)=(x-a)\cdot\frac{f(x)-f(a)}{x-a}. As xtoax\to a, the second factor approaches f(a)f'(a) (finite) and the first factor approaches 0, so f(x)tof(a)f(x)\to f(a). The converse fails dramatically: f(x)=xf(x)=|x| is continuous everywhere but has f(0)=1f'(0^-)=-1 and f(0+)=+1f'(0^+)=+1, so f(0)f'(0) does not exist. There exist functions (Weierstrass's "monster") that are continuous everywhere but differentiable nowhere, analysis is full of such surprises.

The Mean Value Theorem

The Mean Value Theorem (MVT) says: if ff is continuous on [a,b] and differentiable on (a,b)(a,b), then there exists cin(a,b)c\in(a,b) with f(c)=fracf(b)f(a)baf'(c)=\frac{f(b)-f(a)}{b-a}. Geometrically: somewhere in the interval, the tangent line has the same slope as the chord connecting the endpoints. From MVT we instantly conclude: if fequiv0f'\equiv 0, then ff is constant; if f>0f'>0, then ff is strictly increasing; and the more subtle Cauchy MVT underpins L'Hôpital's rule.

Use the grapher to plot f(x)=x2f(x)=x^2 on [1,3]. The chord slope is (91)/(31)=4(9-1)/(3-1)=4, and the tangent has slope f(x)=2xf'(x)=2x. MVT predicts cc with 2c=42c=4, i.e. c=2c=2. Visually you can see the tangent at x=2x=2 is parallel to the chord. Try x3x^3 on the same interval to see a different cc, and try x|x| on [-1,1] to see what happens when differentiability fails at an interior point (MVT does not apply).

Taylor's theorem

Taylor's theorem extends MVT: if ff is n+1n+1 times differentiable, then f(a+h)=f(a)+f(a)h+fracf(a)2!h2+cdots+fracf(n)(a)n!hn+Rn(h)f(a+h)=f(a)+f'(a)h+\frac{f''(a)}{2!}h^2+\cdots+\frac{f^{(n)}(a)}{n!}h^n+R_n(h)n(h), where the remainder Rn(h)=fracf(n+1)(xi)(n+1)!hn+1R_n(h)=\frac{f^{(n+1)}(\xi)}{(n+1)!}h^{n+1}n(h)=fracf(n+1)(xi)(n+1)!hn+1 for some xi\xi between aa and a+ha+h. This is why polynomials of high degree can approximate smooth functions with arbitrary accuracy, the whole basis of numerical analysis.

Where this shows up
  • Newton's laws & physics: velocity is the derivative of position, acceleration the derivative of velocity, force is mass times acceleration. Maxwell's equations, Schrödinger's equation, and the Einstein field equations are all differential equations, equations involving derivatives.
  • Gradient descent & machine learning: every neural-network parameter update is thetaleftarrowthetaetanablaL(theta)\theta \leftarrow \theta - \eta \nabla L(\theta). Backpropagation is the chain rule executed at industrial scale. Without the Mean Value Theorem and Taylor expansions, you cannot prove that gradient descent converges to a local minimum.
  • Control theory & robotics: PID controllers, Kalman filters, and model-predictive control all linearise non-linear dynamics around an operating point using f(a)f'(a) (or its multivariable analogue, the Jacobian). The "first-order Taylor expansion" is the engineer's daily bread.

Pause and think: A function ff is differentiable at 00 with f(0)=2f'(0)=2. What is limxto0f(x)\lim_{x\to 0}f(x)xto0f(x)? Justify in one line using the implication "differentiable implies continuous".

Try it

  • Predict first: use the limit definition to compute f(x)f'(x) for f(x)=1/xf(x)=1/x at x=2x=2. Hint: expand 1/(2+h)1/21/(2+h)-1/2 over a common denominator.
  • Apply MVT to f(x)=sinxf(x)=\sin x on [0,\pi/2]. Which cc does the theorem guarantee? Solve cosc=2/pi\cos c = 2/\pi numerically.
  • Show that f(x)=xxf(x)=x|x| is differentiable at 00 and find f(0)f'(0). (Use the limit definition; the answer is 0, the function is C^1 but not C^2 at 0.)
  • True or false: if f(x)>0f'(x)>0 for all xx in (a,b)(a,b), then ff is strictly increasing on (a,b)(a,b). Prove using MVT.

A trap to watch for

The implication "differentiable implies continuous" is one-way only. Many beginners assume continuous implies differentiable, then are baffled when they meet x|x|, the Cantor staircase, or the Weierstrass nowhere-differentiable function. The right mental model: continuity controls vertical jumps; differentiability additionally controls corners and cusps.

What you now know

You can compute derivatives from the limit definition, prove the differentiable-implies-continuous implication, apply MVT and its corollaries, and understand Taylor expansions as the canonical local approximation. Section §2.4 (Integration) is the dual: instead of measuring rates of change, we measure accumulated change, and section §2.5 (FTC) shows the two operations are mutual inverses.

Mark section complete →

References

  • Garrity, T. (2002). All the Mathematics You Missed. Cambridge University Press, ch. 2.
  • Spivak, M. (2008). Calculus (4th ed.). Publish or Perish, ch. 10-11.
  • Rudin, W. (1976). Principles of Mathematical Analysis (3rd ed.). McGraw-Hill, ch. 5.
  • Abbott, S. (2015). Understanding Analysis (2nd ed.). Springer, ch. 5.
  • Apostol, T. M. (1974). Mathematical Analysis (2nd ed.). Addison-Wesley, ch. 5.

This page is prerendered for SEO and accessibility. The interactive widgets above hydrate on JavaScript load.