Similar Matrices and Change of Basis

Part 1, Chapter 1: Linear Algebra Toolkit

Learning objectives

Define matrix similarity $B = P^{-1} A P$ as the same linear map in a new basis
Identify similarity invariants: determinant, trace, rank, eigenvalues, characteristic polynomial
Recognise when two matrices CANNOT be similar by comparing invariants
Connect similarity to diagonalisation $A = P D P^{-1}$ as the simplest similarity class

Two matrices are similar exactly when they describe the same linear transformation in different coordinate systems. This is the right notion of equivalence for square matrices: the matrix changes when you change the basis, but the underlying transformation does not. Whatever the linear map "really does" geometrically (how much it rotates, stretches, or projects) survives the change of basis. The numerical features that survive are the similarity invariants, and they are exactly the features worth computing.

The definition

Two $n \times n$ matrices $A$ and $B$ are similar if there exists an invertible matrix $P$ with $B = P^{-1} A P$. The matrix $P$ is the change-of-basis matrix: its columns are the new basis vectors expressed in the old coordinates, so $P^{-1}$ converts old coordinates into new ones. If you apply $A$ to a vector and then re-express the result in the new basis, you get the same answer as if you had first re-expressed the input in the new basis and then applied $B$: in symbols, $P^{-1}(A \mathbf{x}) = B (P^{-1} \mathbf{x})$.
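
To make the definition concrete, here is a minimal Python sketch that builds $B = P^{-1} A P$ for a hand-picked $2 \times 2$ example and checks that "apply $A$, then change coordinates" agrees with "change coordinates, then apply $B$". The matrices `A` and `P`, the test vector `x`, and the helper names `matmul` and `inverse_2x2` are assumptions chosen for illustration. Exact `Fraction` arithmetic is used so that rounding does not blur the comparison.

```python
from fractions import Fraction


def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def inverse_2x2(M):
    """Inverse of a 2x2 matrix via the adjugate formula (exact arithmetic)."""
    (a, b), (c, d) = M
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("matrix is not invertible")
    return [[d / det, -b / det], [-c / det, a / det]]


# Illustrative choices, not taken from the text above.
A = [[2, 1], [1, 2]]
P = [[1, 1], [0, 1]]          # columns (1, 0) and (1, 1): new basis in old coordinates
P_inv = inverse_2x2(P)

B = matmul(matmul(P_inv, A), P)   # the same map, written in the new basis
print(B == [[1, 0], [1, 3]])      # True

x = [[3], [5]]                        # a column vector in old coordinates
lhs = matmul(P_inv, matmul(A, x))     # apply A, then convert to new coordinates
rhs = matmul(B, matmul(P_inv, x))     # convert to new coordinates, then apply B
print(lhs == rhs)                     # True
```

The entries of `B` differ from those of `A`, yet both routes land on the same coordinate vector, which is what "same map, different basis" means in practice.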

Similarity invariants

The following quantities are preserved by similarity:

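Determinant: $\det(B) = \det(P^{-1} A P) = \det(P)^{-1} \det(A) \det(P) = \det(A)$.
Trace: $\operatorname{tr}(B) = \operatorname{tr}(P^{-1} A P) = \operatorname{tr}(A P P^{-1}) = \operatorname{tr}(A)$, using the cyclic property of trace.
Rank: $\operatorname{rank}(B) = \operatorname{rank}(A)$, because multiplication by an invertible matrix preserves rank.
Eigenvalues and characteristic polynomial: $\det(B - \lambda I) = \det(A - \lambda I)$, because $B - \lambda I = P^{-1} (A - \lambda I) P$ and the determinant argument above applies. Equal characteristic polynomials mean equal eigenvalues with equal algebraic multiplicities.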
Minimal polynomial and Jordan form (covered in advanced courses).
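
The invariants listed above can be checked numerically. The following is a small sketch under stated assumptions: the matrices `A` and `P` are arbitrary illustrative choices, the helper names are hypothetical, and the characteristic polynomial uses the $2 \times 2$ shortcut $\det(M - \lambda I) = \lambda^2 - \operatorname{tr}(M)\,\lambda + \det(M)$, so it does not generalise as written to larger matrices.

```python
from fractions import Fraction


def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def inverse_2x2(M):
    """Inverse of a 2x2 matrix via the adjugate formula (exact arithmetic)."""
    (a, b), (c, d) = M
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("matrix is not invertible")
    return [[d / det, -b / det], [-c / det, a / det]]


def trace(M):
    return M[0][0] + M[1][1]


def det(M):
    return M[0][0] * M[1][1] - M[0][1] * M[1][0]


def char_poly(M):
    """Coefficients (1, -tr, det) of lambda^2 - tr*lambda + det; 2x2 only."""
    return (1, -trace(M), det(M))


# Illustrative choices, not taken from the text above.
A = [[4, 1], [2, 3]]
P = [[2, 1], [1, 1]]
B = matmul(matmul(inverse_2x2(P), A), P)

print(B == A)                          # False: the entries change
print(trace(A) == trace(B))            # True
print(det(A) == det(B))                # True
print(char_poly(A) == char_poly(B))    # True
```

Here both matrices have characteristic polynomial $\lambda^2 - 7\lambda + 10 = (\lambda - 2)(\lambda - 5)$, so they share the eigenvalues $2$ and $5$ even though their entries differ.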

Diagonalisation

A matrix $A$ is diagonalisable if it is similar to a diagonal matrix, that is, $A = P D P^{-1}$ for some diagonal $D$ and invertible $P$. In this case the columns of $P$ are eigenvectors of $A$ and the diagonal entries of $D$ are the corresponding eigenvalues, listed in the same order. Diagonalisation is the cleanest case of similarity: it makes powers easy ($A^k = P D^k P^{-1}$, because the inner factors $P^{-1} P$ cancel) and reveals the geometry of the transformation directly. Not every matrix is diagonalisable; the failure cases are captured by Jordan Normal Form.
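
As a hedged illustration of the power formula, the sketch below compares $A^k = P D^k P^{-1}$ against plain repeated multiplication. The matrix `A`, its eigenvector matrix `P`, and the diagonal matrix `D` are assumptions worked out by hand for this example (eigenvalue $5$ with eigenvector $(1, 1)$, eigenvalue $2$ with eigenvector $(1, -2)$); the helper names are hypothetical.

```python
from fractions import Fraction


def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


# Illustrative example with hand-computed eigen-data.
A = [[4, 1], [2, 3]]
P = [[1, 1], [1, -2]]                       # columns are eigenvectors of A
P_inv = [[Fraction(2, 3), Fraction(1, 3)],
         [Fraction(1, 3), Fraction(-1, 3)]]
D = [[5, 0], [0, 2]]                        # eigenvalues, in the same column order


def power_via_diagonalisation(k):
    """A^k = P D^k P^{-1}: only the two diagonal entries are raised to k."""
    D_k = [[D[0][0] ** k, 0], [0, D[1][1] ** k]]
    return matmul(matmul(P, D_k), P_inv)


def power_by_repeated_multiplication(M, k):
    """Reference implementation: multiply M into the identity k times."""
    result = [[1, 0], [0, 1]]
    for _ in range(k):
        result = matmul(result, M)
    return result


print(matmul(matmul(P, D), P_inv) == A)                        # True
print(power_via_diagonalisation(3) == [[86, 39], [78, 47]])    # True
print(power_via_diagonalisation(10)
      == power_by_repeated_multiplication(A, 10))              # True
```

The diagonal route needs two matrix products regardless of `k`, while the reference loop needs `k` of them; that gap is the practical payoff of diagonalising.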

The matrix widget above can illustrate similarity: set up a matrix $A$ with non-trivial geometry, then apply a coordinate change. The new matrix $B = P^{-1} A P$ has different entries but performs the same overall transformation. As a quick sanity check, confirm that the determinant and trace are unchanged.

Where this shows up

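Markov chain stationary distributions: A transition matrix $P$ (here $P$ denotes the transition matrix, not a change-of-basis matrix) is often diagonalised to compute $P^n$ for large $n$, revealing the long-run stationary distribution as the eigenvector with eigenvalue 1 (a left eigenvector if the rows of $P$ sum to 1, a right eigenvector if the columns do). Without diagonalisation, computing $P^{1000}$ by hand is hopeless.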
Differential equations: The system $\dot{\mathbf{x}} = A \mathbf{x}$ has solution $\mathbf{x}(t) = e^{A t} \mathbf{x}_0$. If $A = P D P^{-1}$, then $e^{A t} = P e^{D t} P^{-1}$ and $e^{D t}$ is just exponentials on the diagonal, a closed-form solution.
Quantum mechanics: Diagonalising the Hamiltonian operator, that is, working in the energy eigenbasis, reduces time evolution to multiplication by phase factors $e^{-i E_k t / \hbar}$. Undergraduate quantum mechanics courses lean heavily on this similarity trick.
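
The Markov chain application can be sketched in a few lines. Everything specific here is an assumption made for illustration: the two-state transition matrix is hypothetical, it is named `T` (with eigenvector matrix `V`) to avoid a clash with the change-of-basis symbol, it is written in the column-stochastic convention (columns sum to 1, so the stationary distribution is a right eigenvector), and its eigen-data were worked out by hand.

```python
from fractions import Fraction


def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


# Hypothetical column-stochastic transition matrix (each column sums to 1).
T = [[Fraction(9, 10), Fraction(1, 2)],
     [Fraction(1, 10), Fraction(1, 2)]]

# Hand-computed eigen-data for this T:
# eigenvalue 1 with eigenvector (5, 1), eigenvalue 2/5 with eigenvector (1, -1).
V = [[5, 1], [1, -1]]
V_inv = [[Fraction(1, 6), Fraction(1, 6)],
         [Fraction(1, 6), Fraction(-5, 6)]]
eigenvalues = [Fraction(1), Fraction(2, 5)]


def transition_power(n):
    """T^n computed as V D^n V^{-1}; only the diagonal entries are raised to n."""
    D_n = [[eigenvalues[0] ** n, 0], [0, eigenvalues[1] ** n]]
    return matmul(matmul(V, D_n), V_inv)


start = [[1], [0]]                                # start in state 0 with certainty
after_1000 = matmul(transition_power(1000), start)

# Eigenvector (5, 1) rescaled so its entries sum to 1.
stationary = [Fraction(5, 6), Fraction(1, 6)]
gap = max(abs(after_1000[i][0] - stationary[i]) for i in range(2))

print(transition_power(1) == T)      # True: the decomposition reproduces T
print(gap < Fraction(1, 10**12))     # True: T^1000 has essentially converged
```

Because the second eigenvalue $2/5$ has absolute value below 1, its contribution to $T^n$ decays geometrically, leaving only the eigenvalue-1 direction in the long run.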

Pause and think: Suppose $A$ and $B$ are $2 \times 2$ matrices with $\operatorname{tr}(A) = \operatorname{tr}(B)$ and $\det(A) = \det(B)$. Must they be similar? (Hint: they have the same characteristic polynomial. Does that suffice? Consider $\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$ vs $\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}$.)

Try it

Predict whether $A = \begin{pmatrix} 2 & 0 \\ 0 & 3 \end{pmatrix}$ and $B = \begin{pmatrix} 3 & 0 \\ 0 & 2 \end{pmatrix}$ are similar. Justify by comparing eigenvalues, trace, and determinant. Then construct the explicit $P$.
If $\operatorname{tr}(A) = 7$ and $B = P^{-1} A P$, predict $\operatorname{tr}(B)$. Justify with one line.
Predict whether $A = \begin{pmatrix} 2 & 1 \\ 0 & 2 \end{pmatrix}$ is diagonalisable. (Hint: count the independent eigenvectors for the repeated eigenvalue.)
If $A$ is diagonalisable with eigenvalues $2, 2, 5$, predict the diagonal form $D$, then express $A^3$ in terms of $P$ and $D$ using $A = P D P^{-1}$.
Predict whether $A = \begin{pmatrix} 1 & 0 \\ 0 & 2 \end{pmatrix}$ and $C = \begin{pmatrix} 1 & 0 \\ 0 & 3 \end{pmatrix}$ can be similar. (Compare eigenvalues.)

A trap to watch for

Sharing trace, determinant, and even the full characteristic polynomial does not guarantee that two matrices are similar. The matrices $I_2 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$ and $J = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}$ both have trace $2$, determinant $1$, and characteristic polynomial $(\lambda - 1)^2$. Yet they are NOT similar: the only matrix similar to $I_2$ is $I_2$ itself, because $P^{-1} I_2 P = I_2$ for every invertible $P$, whereas $J \neq I_2$ is a Jordan block and is not diagonalisable. The full classification of similarity classes requires Jordan Normal Form; the eigenvalues alone are not enough when an eigenvalue is repeated. Always check eigenspace dimensions before declaring similarity, keeping in mind that matching dimensions are necessary but, for larger matrices, still not sufficient.
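
The eigenspace check can be sketched in code. This is a minimal illustration restricted to $2 \times 2$ matrices, with hypothetical helper names `rank_2x2` and `eigenspace_dim`; it uses the fact that the eigenspace for $\lambda$ is the null space of $M - \lambda I$, whose dimension is $2 - \operatorname{rank}(M - \lambda I)$.

```python
def rank_2x2(M):
    """Rank of a 2x2 matrix: 2 if invertible, 1 if non-zero but singular, else 0."""
    (a, b), (c, d) = M
    if a * d - b * c != 0:
        return 2
    if any(entry != 0 for row in M for entry in row):
        return 1
    return 0


def eigenspace_dim(M, lam):
    """Dimension of the null space of M - lam*I, by rank-nullity (2x2 only)."""
    shifted = [[M[0][0] - lam, M[0][1]],
               [M[1][0], M[1][1] - lam]]
    return 2 - rank_2x2(shifted)


I2 = [[1, 0], [0, 1]]
J = [[1, 1], [0, 1]]

print(eigenspace_dim(I2, 1))   # 2: every vector is an eigenvector
print(eigenspace_dim(J, 1))    # 1: only one independent eigenvector
```

The two matrices agree on trace, determinant, and characteristic polynomial, but the differing eigenspace dimensions rule out similarity.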

What you now know

You can recognise when two matrices are similar (or definitively not similar), compute the invariants, and use diagonalisation to make hard matrix operations tractable. The next section (Section 1.8) goes deeper: how to actually find eigenvalues and eigenvectors, the building blocks of every diagonalisation.
