The Derivative
Learning objectives
- Define the derivative as
- Prove that differentiability implies continuity, but not conversely
- State and apply the Mean Value Theorem and its sign-of-derivative corollaries
- Use Taylor's theorem to approximate functions by polynomials
The derivative is the prototype of all linearisation. Real systems are rarely linear, but they are locally linear, that is precisely the geometric content of . Whenever you read about Newton's method, gradient descent, Kalman filters, control theory, or first-order asymptotic analysis, you are watching the derivative do its real job: providing the best linear approximation to a non-linear function at a point. The Mean Value Theorem (MVT) then turns local approximation into global conclusions, like "if the derivative is positive everywhere, the function is increasing".
The derivative as a limit
The derivative of at is , provided this limit exists. Geometrically it is the slope of the tangent line; physically it is an instantaneous rate of change. The fraction is the difference quotient, the slope of the secant from to . Differentiation is the operation of taking the limit of secant slopes.
Differentiability implies continuity (not conversely)
If exists, then . As , the second factor approaches (finite) and the first factor approaches 0, so . The converse fails dramatically: is continuous everywhere but has and , so does not exist. There exist functions (Weierstrass's "monster") that are continuous everywhere but differentiable nowhere, analysis is full of such surprises.
The Mean Value Theorem
The Mean Value Theorem (MVT) says: if is continuous on [a,b] and differentiable on , then there exists with . Geometrically: somewhere in the interval, the tangent line has the same slope as the chord connecting the endpoints. From MVT we instantly conclude: if , then is constant; if , then is strictly increasing; and the more subtle Cauchy MVT underpins L'Hôpital's rule.
Use the grapher to plot on [1,3]. The chord slope is , and the tangent has slope . MVT predicts with , i.e. . Visually you can see the tangent at is parallel to the chord. Try on the same interval to see a different , and try on [-1,1] to see what happens when differentiability fails at an interior point (MVT does not apply).
Taylor's theorem
Taylor's theorem extends MVT: if is times differentiable, then , where the remainder for some between and . This is why polynomials of high degree can approximate smooth functions with arbitrary accuracy, the whole basis of numerical analysis.
- Newton's laws & physics: velocity is the derivative of position, acceleration the derivative of velocity, force is mass times acceleration. Maxwell's equations, Schrödinger's equation, and the Einstein field equations are all differential equations, equations involving derivatives.
- Gradient descent & machine learning: every neural-network parameter update is . Backpropagation is the chain rule executed at industrial scale. Without the Mean Value Theorem and Taylor expansions, you cannot prove that gradient descent converges to a local minimum.
- Control theory & robotics: PID controllers, Kalman filters, and model-predictive control all linearise non-linear dynamics around an operating point using (or its multivariable analogue, the Jacobian). The "first-order Taylor expansion" is the engineer's daily bread.
Pause and think: A function is differentiable at with . What is ? Justify in one line using the implication "differentiable implies continuous".
Try it
- Predict first: use the limit definition to compute for at . Hint: expand over a common denominator.
- Apply MVT to on [0,\pi/2]. Which does the theorem guarantee? Solve numerically.
- Show that is differentiable at and find . (Use the limit definition; the answer is 0, the function is C^1 but not C^2 at 0.)
- True or false: if for all in , then is strictly increasing on . Prove using MVT.
A trap to watch for
The implication "differentiable implies continuous" is one-way only. Many beginners assume continuous implies differentiable, then are baffled when they meet , the Cantor staircase, or the Weierstrass nowhere-differentiable function. The right mental model: continuity controls vertical jumps; differentiability additionally controls corners and cusps.
What you now know
You can compute derivatives from the limit definition, prove the differentiable-implies-continuous implication, apply MVT and its corollaries, and understand Taylor expansions as the canonical local approximation. Section §2.4 (Integration) is the dual: instead of measuring rates of change, we measure accumulated change, and section §2.5 (FTC) shows the two operations are mutual inverses.
Mark section complete →
References
- Garrity, T. (2002). All the Mathematics You Missed. Cambridge University Press, ch. 2.
- Spivak, M. (2008). Calculus (4th ed.). Publish or Perish, ch. 10-11.
- Rudin, W. (1976). Principles of Mathematical Analysis (3rd ed.). McGraw-Hill, ch. 5.
- Abbott, S. (2015). Understanding Analysis (2nd ed.). Springer, ch. 5.
- Apostol, T. M. (1974). Mathematical Analysis (2nd ed.). Addison-Wesley, ch. 5.