Eigenvalues, Eigenvectors, and Diagonalization
Learning objectives
- Define eigenvalue and eigenvector via
- Compute eigenvalues by solving the characteristic equation
- Find eigenvectors by row-reducing
- Connect eigenvalues to trace, determinant, and the geometry of the transformation
Eigenvalues are the intrinsic stretching factors of a linear transformation, the directions in which the matrix acts as a pure scaling. They are the most powerful concept in finite-dimensional linear algebra. They control the long-run behaviour of dynamical systems, the stability of solutions to differential equations, the principal components of high-dimensional data, and the energy levels of quantum systems. If you take only one thing from this chapter, take the geometric picture: an eigenvector is a direction the matrix preserves; its eigenvalue is the scaling factor along that direction.
The definition
A non-zero vector is an eigenvector of a square matrix with eigenvalue if . Geometrically, stretches by the factor without rotating it (though it may flip when ). The zero vector is excluded by convention, otherwise every scalar would qualify as an eigenvalue.
Finding eigenvalues: the characteristic equation
Rearrange as . For a non-zero to exist, the matrix must be singular, equivalently, . This is the characteristic equation of . For a matrix, it reduces to .
Finding eigenvectors
Once you have an eigenvalue , solve by row reduction. The non-trivial solutions form the eigenspace . Its dimension is the geometric multiplicity of . The algebraic multiplicity is how many times appears as a root of the characteristic polynomial. The two coincide for diagonalisable matrices; the gap measures the failure of diagonalisation.
Two beautiful identities
For any matrix with eigenvalues (counted with algebraic multiplicity):
- , the trace is the sum of eigenvalues.
- , the determinant is their product.
The matrix-multiplier above lets you see eigenvectors directly: enter a matrix and look for the directions that get scaled without rotating. For A = \begin{pmatrix} 4 & 1 \\ 2 & 3 \end{pmatrix}, the eigenvalues are , with eigenvectors and respectively, you can verify by feeding these vectors through the widget.
- PageRank algorithm: Google's original algorithm computes the dominant eigenvector of a stochastic matrix encoding the web's link structure. Page importance is literally the entries of that eigenvector, the most-cited eigenvalue computation in industry.
- Principal Component Analysis (PCA): The principal components of a data set are the eigenvectors of the covariance matrix, ordered by their eigenvalues. Larger eigenvalues = directions with more variance = features that explain the data.
- Quantum mechanics, observables and measurement: Every observable (energy, position, spin) is a Hermitian operator. Its eigenvalues are the possible measurement outcomes; its eigenvectors are the corresponding states. Spectroscopy data are literal eigenvalue plots.
Pause and think: If , what is ? Before computing, predict by powering up the eigenvalue. This is exactly the reason eigenvector decomposition makes matrix powers tractable.
Try it
- Predict the eigenvalues of A = \begin{pmatrix} 4 & 1 \\ 2 & 3 \end{pmatrix} by computing trace and determinant first. Then solve .
- For each eigenvalue you found, predict the eigenvector by inspecting . Then verify by computing .
- A matrix has eigenvalues . Predict the trace and determinant. Justify with the two beautiful identities.
- If , predict before computing. The answer is .
- Predict: is ever an eigenvalue of an invertible matrix? Justify in one sentence using the determinant identity.
A trap to watch for
Eigenvectors are defined up to a non-zero scalar multiple, and are the same eigenvector in disguise (both lie on the same line through the origin). Students sometimes panic when their textbook lists but they computed ; both are correct. The eigenspace is a subspace, not a unique vector. Conversely, the zero vector is NEVER an eigenvector (by convention), if you got from row reduction, you probably picked a non-eigenvalue , because was invertible and forced .
What you now know
You can compute eigenvalues from the characteristic equation, find eigenvectors by row reduction, and connect both back to trace and determinant. With eigenvalues in hand, the diagonalisation machinery of §1.7 becomes concrete: count independent eigenvectors, build , and the diagonal form falls out. This closes Chapter 1: the linear-algebra toolkit that every subsequent chapter of Garrity rests on.
Mark section complete →
References
- Garrity, T. (2002). All the Mathematics You Missed. Cambridge University Press, ch. 1, §1.8.
- Strang, G. (2016). Introduction to Linear Algebra (5th ed.). Wellesley-Cambridge, ch. 6.
- Axler, S. (2015). Linear Algebra Done Right (3rd ed.). Springer, ch. 5 (eigenvalues, eigenvectors, invariant subspaces).
- Horn, R. A., Johnson, C. R. (2012). Matrix Analysis (2nd ed.). Cambridge University Press, ch. 1 (spectral theory).
- Trefethen, L. N., Bau, D. (1997). Numerical Linear Algebra. SIAM, lec. 24-25 (eigenvalue algorithms).