Avoiding the Cayley–Hamilton theorem [duplicate]

Every $n\times n$ matrix satisfies a polynomial equation of degree at most $n^2$, simply because the space of $n\times n$ matrices has dimension $n^2$.

By the Cayley–Hamilton theorem, every matrix satisfies a polynomial equation of degree $n$.

Is there a simple proof that every matrix satisfies a polynomial equation of degree at most $n$ without using the Cayley–Hamilton theorem?


Solution 1:

Ths might not count as simple, but it provides another point of view:

$A$ makes $K^n$ into a $K[x]$-module and by the structure theorem of finitely generated modules over a PID, we get

$$K^n \cong K[x]/(f_1) \oplus \dotsb \oplus K[x]/(f_s).$$

Set $f=f_1 \dotsb f_s$. Clearly $f$ acts by zero on the right hand side, hence $f$ acts by zero on $K^n$, which means $f(A)=0$. By comparison of the dimensions of both sides, we get

$$n = \dim K^n = \sum \dim K[x]/(f_i) = \sum \deg f_i = \deg f.$$

Of course, the structure theorem is not really simpler than Cayley-Hamilton, but its standard proof does not need any Cayley-Hamilton related methods, I think.

Solution 2:

Proof that any matrix satisfies a polynomial of degree at most $n^2-1$:

Let $A$ be a matrix that doesn't satisfy any such polynomial. Then $I,A,A^2,\dots,A^{n^2-1}$ are linearly independent and form a basis for the space of matrices. It follows that every matrix commutes with $A$.

This implies that $A$ is a multiple of the identity, so that $A - \lambda I = 0$, so that $A$ satisfies $x - \lambda$. This contradicts our hypothesis that $A$ satisfies no such polynomial.

Solution 3:

This is a proof by induction:

The result is trivial for $n=1$ and of course we can assume the field $K$ to be algebraically closed.

Pick an eigenvector with $Av = \lambda v$, set $V = \langle v \rangle$. $V$ is an invariant one-dimensional subspace.

We get two new endomorphisms: $A_{|V} \in End(V)$ and $(A_{K^n/V}: [x] \mapsto [Ax]) \in End(K^n/V)$.

By induction we obtain $f$ of degree $1$ with $f(A_{|V})=0$ and $g$ of degree $n-1$ with $g(A_{K^n/V})=0$.

The desired $h(A)=0$ holds for the product $h=fg$:

$g(A_{K^n/V})=0$ translates into $g(A)(K^n) \subset V$, hence $h(A)(K^n)=f(A)(g(A)(K^n)) \subset f(A)(V)=0$.

Solution 4:

Yes, use Jordan form over an algebraic closure of the ground field.