I've come across a paper that mentions the fact that matrices commute if and only if they share a common basis of eigenvectors. Where can I find a proof of this statement?
Matrices commute if and only if they share a common basis of eigenvectors?
- 
3You can find a good and detailed discussion of this fact in the book "Linear Algebra", by Kenneth Hoffman and Ray Kunze, chapter 6 (Elementary Canonical forms). – 2010-10-07
- 
6As a physics student, I would like to mention that this simple theorem, is tremendously useful and important in quantum mechanics, and gives rise to the concepts of compatible and incompatible variables, which is essential for the measurements and probabilistic interpretation of quantum mechanics. – 2013-02-17
- 
0I've change "set" to "basis" in the title (=question), since a set of common eigenvectors always exists (though it may be empty). – 2013-06-17
3 Answers
Suppose that $A$ and $B$ are $n\times n$ matrices, with complex entries say, that commute.
 Then we decompose $\mathbb C^n$ as a direct sum of eigenspaces of $A$, say $\mathbb C^n = E_{\lambda_1} \oplus \cdots \oplus E_{\lambda_m}$, where $\lambda_1,\ldots, \lambda_m$ are the eigenvalues of $A$, and $E_{\lambda_i}$ is the eigenspace for $\lambda_i$. (Here $m \leq n$, but some eigenspaces could be of dimension bigger than one, so we need not have $m = n$.)
Now one sees that since $B$ commutes with $A$, $B$ preserves each of the $E_{\lambda_i}$: If $A v = \lambda_i v, $ then $A (B v) = (AB)v = (BA)v = B(Av) = B(\lambda_i v) = \lambda_i Bv.$
Now we consider $B$ restricted to each $E_{\lambda_i}$ separately, and decompose each $E_{\lambda_i}$ into a sum of eigenspaces for $B$. Putting all these decompositions together, we get a decomposition of $\mathbb C^n$ into a direct sum of spaces, each of which is a simultaneous eigenspace for $A$ and $B$.
NB: I am cheating here, in that $A$ and $B$ may not be diagonalizable (and then the statement of your question is not literally true), but in this case, if you replace "eigenspace" by "generalized eigenspace", the above argument goes through just as well.
- 
1I'm a bit unclear on "putting decompositions together" part...does it give you an explicit form for the common set of eigenvectors? – 2010-10-07
- 
3@Yaroslav: Dear Yaroslav, Yes, you will be able to get an explicit form for the common set of eigenvectors, but it may be complicated to compute (at least if you are computing by hand). What you need to do is to find a basis for each $E_{\lambda_i}$. You can then write a matrix for $B$ (restricted to $E_{\lambda_i}$) in terms of this basis, and then by the usual diagonalization procedure (applied to this new matrix) find a new basis for $E_{\lambda_i}$ consisting of eigenvectors for $B$. (So these will be simultaneous eigenvectors for $B$ and $A$ --- with $A$-eigenvalue all equal to ... – 2010-10-07
- 
3... $\lambda_i$.) You do this for each $\lambda_i$, and putting all of the bases you found for each $E_{\lambda_i}$ together, you get a basis for $\mathbb C^n$ consisting of simultaneous eigenvectors. As Qiaochu points out, the situation is easiest when all the $E_{\lambda_i}$ are one-dimensional (i.e. when $A$ has distinct eigenvalues) since then you don't need to do the second step of computing eigenvectors for $B$; each $E_{\lambda_i}$ will automatically consists of simultaneous eigenvectors for $A$ and $B$ (again, this is just in the special case when $A$ has distinct eigenvalues). – 2010-10-07
- 
3Also, as your probably know, in the quantum mechanics literature, the situation when the $E_{\lambda_i}$ are bigger than one-dimensional is sometimes referred to by saying that the operator $A$ has *degeneracies*. – 2010-10-07
- 
0@MattE Hi Matt, can you explain a little bit in detail on what can be said when $A$ or $B$ is not diagonalizable? As in this case one is no longer guaranteed to have the relation $Av = \lambda_iv$, and I don't see how your argument can be extended. In particular, given two commuting matrices $A$ and $B$, if $B$ is symmetric and $A$ is not diagonalizable, is a subspace of $\mathbb{R}^n$ that is invariant under $A$ also invariant under $B$? – 2017-03-11
- 
0@rodrigo-de-azevedo, hello. When $\mathbf{B}$ preserves an eigenspace $E_{\lambda_i}$, does it mean that those vectors in $E_{\lambda_i}$ are eigenvectors of $\mathbf{B}$ with eigenvalue 1? – 2018-03-23
This is false in a sort of trivial way. The identity matrix $I$ commutes with every matrix and has eigenvector set all of the underlying vector space $V$, but no non-central matrix has this property.
What is true is that two matrices which commute and are also diagonalizable are simultaneously diagonalizable. The proof is particularly simple if at least one of the two matrices has distinct eigenvalues.
- 
3(And this can be improved to "simultaneously Jordanized", among other things) – 2010-10-07
- 
1Dear Qiaochu, I suspect that the question means "admit a common set of eigenvectors", not that every eigenvector for one is an eigenvector fo the other. – 2010-10-07
- 
0Yes, I meant Matt's version, the precise statement was "Thus, the Hamiltonian and all the translation operators of the crystal commute with each other. They possess, therefore, a common set of eigenstates." which I figured would have an equivalent statement in linear algebra – 2010-10-07
- 
8@MarianoSuárez-Alvarez: That's not true. Take the 3 x 3 matrix nilpotent $A$ with ones just above the diagonal and all other entries zero. Then $A$ and $A^2$ commute, but are not simultaneously Jordanized (because $A$ is a single Jordan block, so any $P$ Jordanizing it must give $PAP^{-1} = A$ and thus $PA^2P^{-1} = A^2$ is not in Jordan form). It is true that any two commuting matrices can be simultaneously put in upper triangular form, though, maybe that's what you were remembering? – 2012-11-26
- 
6@MarianoSuárez-Alvarez: Your comment is just **wrong**, as is shown in [this answer](http://math.stackexchange.com/a/49393/). – 2014-05-06
Let $S$ be a set of commuting matrices over an algebraically closed field $F$. Then there may not be a common basis of eigenvectors (since any of them may not be diagonalizable!) but there must be at least a common eigenvector:
Burnside's theorem on matrix algebras states that if $F$ is algebraically closed, $V$ is a finite-dimensional $F$-vector space and $S$ is a proper subalgebra of $\text{End}(V)$ then there exists a nontrivial $S$-invariant subspace, i.e, there exists $W\leq V$ with $0\neq W\neq V$ such that $s(W)\subseteq W$ for every $s\in S$.
Suppose $S\subseteq M_n(F)$ with $n>1$ is commuting. Observe that a subspace of $F^n$ is $S$-invariant if and only if it is invariant for $$, the subalgebra of $M_n(F)$ generated by $S$. Since $S$ is commuting, $$ is also commuting and therefore $\neq M_n(F)$. Burnside's theorem applies, and so there exists a proper and nontrivial subspace $V\leq F^n$ which is invariant for all $S$. If $V$ has dimension more than $1$ then $\neq\text{End}(V)$, since $$ is commuting, and we can apply Burnside's theorem again. By induction there exists an $S$-invariant subspace of dimension $1$, and so a common eigenvector for the matrices in $S$.
