More about Eigen Structure and Diagonalization

Remarks: Let $ A$ and $ B$ be $ n \times n$ matrices.

  1. We said that if $ A$ and $ B$ are similar, then they have the same eigenvalues. In particular, if $ A$ is diagonalizable, then $ A$ is similar to a diagonal matrix $ D$. Thus, $ A$ and $ D$ have the same eigenvalues, which means the diagonal elements of $ D$ are the eigenvalues of $ A$. Now if $ A$ is diagonalizable, then there exists a nonsingular matrix $ P$ such that $ P^{-1}AP = D$. In this case, we say $ P$ diagonalizes $ A$, or $ A$ is diagonalizable via $ P$. Now that we know the relationship between $ A$ and $ D$ (they have the same eigenvalues), two questions arise:

    1. What is the relationship between $ A$ and $ P$?

    2. Are $ P$ and $ D$ unique?

    The answer to the first is that the columns of $ P$ are eigenvectors of $ A$. The answer to the second is that $ P$ is not unique and $ D$ is not necessarily unique. In fact, if you multiply $ P$ by any nonzero scalar, the resulting matrix will still diagonalize $ A$. Also, if you permute the columns of $ P$, the resulting matrix will still diagonalize $ A$; if you do so, $ D$ may also change (it will have the same diagonal elements as the original, but their locations may change), which means $ D$ is not necessarily unique.

    Remark: To diagonalize an $ n \times n$ matrix $ A$ that has $ n$ linearly independent eigenvectors, find $ n$ linearly independent eigenvectors $ x_1$, $ x_2$, $ \cdots$, $ x_n$ of $ A$. Then form the matrix $ P = [x_1 ~ x_2 ~ \cdots ~ x_n]$. Now $ P^{-1}AP = D$, where $ D$ is a diagonal matrix whose diagonal elements are the eigenvalues of $ A$ corresponding to the eigenvectors $ x_1$, $ x_2$, $ \cdots$, $ x_n$ (see the first numerical sketch following this list).

  2. Now recall that the eigenvalues and the diagonal elements of a Hermitian matrix are real, while the eigenvalues and the diagonal elements of a skew-Hermitian matrix have zero real parts. Also, recall that if $ M$ is unitary, then $ \vert\det(M)\vert = 1$ (note $ \det(M)$ can be complex), $ M$ is normal, $ \Vert Mx\Vert _2 = \Vert x\Vert _2$ for all $ x \in \mathbb{C}^n$, and if $ \lambda$ is an eigenvalue of $ M$, then $ \vert\lambda\vert=1$. Moreover, any two distinct columns of $ M$ are orthogonal (i.e. $ M(*,i) \cdot M(*,j) = M(*,i)^H M(*,j)=0$ when $ i \neq j$), and each column is a unit vector. Here are more facts about these matrices:

    1. (Schur's Theorem) If $ A$ is an $ n \times n$ matrix, then there exists a unitary matrix $ M$ such that $ M^HAM = U$, where $ U$ is upper-triangular. Moreover, $ A$ and $ U$ have the same eigenvalues.

    2. From the above, note that we can write $ A=MUM^H$ (proof: exercise). This is called the Schur decomposition of $ A$ or Schur normal form.

    3. From Schur's theorem: If $ A$ is an $ n \times n$ Hermitian or skew-Hermitian matrix, then there exists a unitary matrix $ M$ such that $ M^HAM = D$, where $ D$ is diagonal. Thus, $ A$ is diagonalizable, and we say in this case that $ A$ is unitarily diagonalizable (see the second sketch following this list).

      Proof of the Hermitian Case: By Schur's Theorem, there exists a unitary matrix $ M$ and an upper-triangular matrix $ U$ such that $ M^HAM = U$. Now take the Hermitian transpose of both sides to get $ M^H A^H (M^H)^H = U^H$, i.e. $ M^H A^H M = U^H$. Since $ A$ is Hermitian, $ A^H = A$, so $ M^H A M = U^H$. But $ M^HAM = U$ also. Therefore, $ U^H = U$, which implies $ U$ is both upper- and lower-triangular, i.e. diagonal. The skew-Hermitian case is similar.

    4. $ U$ in Schur's decomposition is diagonal iff $ A$ is normal. Moreover, when $ U$ is diagonal, the columns of $ M$ are eigenvectors of $ A$ (since $ AM = MU$). Thus, $ A$ is unitarily diagonalizable iff $ A$ is normal. Moreover, $ A$ is normal iff $ A$ has a complete orthonormal set of eigenvectors.

    5. From the previous part, if you take $ A$ to be real symmetric, then there exists an orthogonal matrix $ M$ such that $ M^T AM = D$, where $ D$ is diagonal. Thus, $ A$ is diagonalizable, and we say in this case that $ A$ is orthogonally diagonalizable (proof: exercise). The eigenvalues of a real symmetric matrix are real, and its eigenvectors can be chosen to be real. Also, eigenvectors corresponding to different eigenvalues are orthogonal.

    6. If $ A$ is Hermitian, then eigenvectors that correspond to different eigenvalues are orthogonal.

      Proof: Let $ (\lambda_1,x)$ and $ (\lambda_2,y)$ be two eigenpairs of $ A$ where $ \lambda_1 \neq \lambda_2$. We have to prove that $ x^H y = 0$. Now consider

      $\displaystyle (Ax)^H y = x^H A^H y = x^H A y = \lambda_2 x^H y.$

      $\displaystyle (Ax)^H y = (y^H A x)^H = (\lambda_1 y^H x)^H = \overline{\lambda_1}\, x^H y = \lambda_1 x^H y,$

      where the last equality holds because $ \lambda_1$ is real ($ A$ is Hermitian).

      Therefore, $ \lambda_1 x^H y = \lambda_2 x^H y$, which implies $ \lambda_1 x^H y - \lambda_2 x^H y = (\lambda_1 - \lambda_2) x^H y = 0$. Since $ \lambda_1 \neq \lambda_2$, it follows that $ x^H y = 0$.

  3. $ A$ is orthogonally diagonalizable iff $ A$ has $ n$ orthonormal eigenvectors iff $ A$ is real symmetric.

  4. (Cayley-Hamilton Theorem) Every square matrix satisfies its characteristic equation; that is, if $ p(\lambda)$ is the characteristic polynomial of $ A$, then $ p(A) = 0$ (this is checked numerically in the first sketch below).
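
The following is a minimal numerical sketch, using NumPy, of the diagonalization remark in item 1 and of the Cayley-Hamilton theorem in item 4; the $ 2 \times 2$ matrix is an arbitrary illustrative choice.

    import numpy as np

    # An arbitrary diagonalizable matrix (illustrative choice).
    A = np.array([[4.0, 1.0],
                  [2.0, 3.0]])

    # The columns of P are eigenvectors of A; the corresponding eigenvalues end up
    # on the diagonal of D, in the same order as the columns of P.
    vals, P = np.linalg.eig(A)
    D = np.linalg.inv(P) @ A @ P
    print(np.round(D, 10))                    # diagonal, entries are the eigenvalues of A

    # Scaling P by a nonzero scalar, or permuting its columns, still diagonalizes A;
    # permuting the columns permutes the diagonal entries of D.
    P2 = 3.0 * P[:, ::-1]
    print(np.round(np.linalg.inv(P2) @ A @ P2, 10))

    # Cayley-Hamilton check: A satisfies its own characteristic equation.
    c = np.poly(A)                            # coefficients of det(lambda I - A)
    print(np.round(c[0] * A @ A + c[1] * A + c[2] * np.eye(2), 10))   # ~ zero matrix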
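
The next sketch illustrates Schur's theorem and the unitary diagonalization of a Hermitian matrix (items 2.1-2.4 above). It uses scipy.linalg.schur for the decomposition; the random matrices and the seed are arbitrary choices made only for illustration.

    import numpy as np
    from scipy.linalg import schur

    rng = np.random.default_rng(0)

    # Schur's theorem: for any square A there is a unitary M with M^H A M = U upper-triangular.
    A = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))
    U, M = schur(A, output='complex')              # A = M U M^H
    print(np.allclose(M.conj().T @ A @ M, U))      # True
    print(np.allclose(M.conj().T @ M, np.eye(4)))  # True: M is unitary

    # For a Hermitian matrix the triangular factor is forced to be diagonal (up to
    # rounding), so a Hermitian matrix is unitarily diagonalizable with real eigenvalues.
    H = A + A.conj().T                             # Hermitian by construction
    T, Q = schur(H, output='complex')
    print(np.allclose(T, np.diag(np.diag(T))))     # True: T is (numerically) diagonal
    print(np.allclose(np.diag(T).imag, 0))         # True: real eigenvalues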

Transforming Complex Hermitian Eigenvalue Problems to Real Ones

Let $ C = A + i B$ be a complex Hermitian matrix, where $ A$ and $ B$ are real $ n \times n$ matrices and let $ (\lambda,z=x+iy)$ be an eigenpair of $ C$, where $ x$ and $ y$ are in $ \mathbb{R}^n$ (recall that $ \lambda$ is real because $ C$ is Hermitian). Now, $ (A+iB)(x+iy)=\lambda (x+iy)$ if and only if

$\displaystyle \left[\begin{array}{cc} A & -B\\ B & A\end{array}\right] \left[\begin{array}{c} x\\ y\end{array}\right] = \lambda \left[\begin{array}{c} x\\ y\end{array}\right].$

Note that since $ C$ is Hermitian, $ A$ is symmetric and $ B$ is skew-symmetric. Hence, the matrix $ \left[\begin{array}{cc} A & -B\\ B & A\end{array}\right]$ is real symmetric. Thus, we have reduced a complex Hermitian eigenvalue problem of order $ n$ to a real symmetric eigenvalue problem of order $ 2n$.
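
Below is a small numerical check of this reduction, with an arbitrary Hermitian matrix generated at random. Each eigenvalue of $ C$ appears twice among the eigenvalues of the $ 2n \times 2n$ real matrix, since if $ z = x+iy$ is an eigenvector of $ C$, then so is $ iz$, and it contributes the second real eigenvector $ (-y,~x)$.

    import numpy as np

    rng = np.random.default_rng(1)
    n = 3

    # An arbitrary complex Hermitian matrix C = A + iB (A symmetric, B skew-symmetric).
    Z = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
    C = Z + Z.conj().T
    A, B = C.real, C.imag

    # The equivalent real symmetric matrix of order 2n.
    S = np.block([[A, -B],
                  [B,  A]])
    print(np.allclose(S, S.T))              # True: S is real symmetric

    # The eigenvalues of S are those of C, each repeated twice.
    print(np.sort(np.linalg.eigvalsh(C)))
    print(np.sort(np.linalg.eigvalsh(S)))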

The Companion Matrix

Let $ A$ be the $ n \times n$ matrix such that $ a_{k,k+1}=1$, $ k=1,2, \cdots, n-1$, $ a_{n,k}=-c_{k-1}$, $ k=1,2, \cdots, n$, and all other entries are zero. By expanding the determinant of $ A-\lambda I$ across the last row, you will find that the characteristic polynomial of $ A$ is

$\displaystyle p(\lambda)=(-1)^n \left(\lambda^n + \sum_{k=1}^{n} c_{k-1} \lambda^{k-1} \right).$

The matrix $ A$ is called the companion matrix of the polynomial $ p(\lambda)$. The companion matrix is sometimes defined to be the transpose of the matrix above, in which case it satisfies $ A e_k = e_{k+1}$, $ k=1,2, \cdots, n-1$, and $ Ae_n = [-c_0 ~ -c_1 ~ \cdots ~ -c_{n-1}]^T$.
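
A short numerical sketch of the companion matrix follows: it builds $ A$ from the coefficients $ c_0,~c_1,~\cdots,~c_{n-1}$ and checks that its eigenvalues are the roots of $ p(\lambda)$; the coefficients below are an arbitrary illustrative choice with roots $ 1,~2,~3$.

    import numpy as np

    # Coefficients c_0, c_1, c_2 of p(lambda) = lambda^3 + c_2 lambda^2 + c_1 lambda + c_0,
    # chosen so that p(lambda) = (lambda - 1)(lambda - 2)(lambda - 3).
    c = np.array([-6.0, 11.0, -6.0])

    n = len(c)
    A = np.zeros((n, n))
    A[np.arange(n - 1), np.arange(1, n)] = 1.0   # a_{k,k+1} = 1 on the superdiagonal
    A[-1, :] = -c                                # last row: -c_0, -c_1, ..., -c_{n-1}

    # The eigenvalues of the companion matrix are the roots of p(lambda).
    print(np.sort(np.linalg.eigvals(A)))         # ~ [1., 2., 3.]
    print(np.sort(np.roots([1.0, *c[::-1]])))    # same roots, directly from the coefficients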

Definition: The spectral radius of an $ n \times n$ matrix $ A$, denoted $ \rho(A)$, is the maximum eigenvalue of $ A$ in magnitude; i.e. if the eigenvalues of $ A$ are $ \lambda_1,$ $ \lambda_2$, $ \cdots$, $ \lambda_n$, then $ \rho(A) = \max_i \vert\lambda_i\vert$.

Definition: Let $ A$ be an $ m \times n$ matrix and let $ B$ be the matrix such that $ b_{ij} = \vert a_{ij}\vert$.

  1. The one-norm of $ A$, denoted $ \Vert A\Vert _1$, is the maximum column sum of $ B$.

  2. The $ \infty$-norm of $ A$, denoted $ \Vert A\Vert _{\infty}$, is the maximum row sum of $ B$.

  3. The two-norm (or spectral norm) of $ A$, denoted $ \Vert A\Vert _2$, is the non-negative square root of $ \rho(A^TA)$. Note that $ A^T A$ is square and symmetric.

  4. The Frobenius norm of $ A$, denoted $ \Vert A\Vert _F$, is $ \sqrt{\sum_{i=1}^m \sum_{j=1}^n \vert a_{ij}\vert^2}$.

Remark: There are further properties and equivalent definitions for matrix norms, but we do not have time to go over them here.
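
The sketch below computes each of these norms directly from the definitions above and compares them with numpy.linalg.norm; the matrix is an arbitrary example.

    import numpy as np

    A = np.array([[1.0, -2.0,  3.0],
                  [0.0,  4.0, -5.0]])
    B = np.abs(A)                                   # b_{ij} = |a_{ij}|

    one_norm = B.sum(axis=0).max()                  # maximum column sum of B
    inf_norm = B.sum(axis=1).max()                  # maximum row sum of B
    two_norm = np.sqrt(np.max(np.linalg.eigvalsh(A.T @ A)))   # sqrt of rho(A^T A)
    fro_norm = np.sqrt((B ** 2).sum())              # Frobenius norm

    # Compare with NumPy's built-in matrix norms.
    print(one_norm, np.linalg.norm(A, 1))
    print(inf_norm, np.linalg.norm(A, np.inf))
    print(two_norm, np.linalg.norm(A, 2))
    print(fro_norm, np.linalg.norm(A, 'fro'))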

Theorem (Gerschgorin): Let $ A$ be an $ n \times n$ matrix and define the disks

$\displaystyle D_k = \left\{z \in \mathbb{C}~\vert~ \vert z - a_{kk}\vert \leq \sum_{j \neq k} \vert a_{kj}\vert\right\},\quad k=1,~2,~\cdots,~n.$

If $ \lambda$ is an eigenvalue of $ A$, then $ \lambda$ is located in $ \bigcup_{k=1}^n D_k$.
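
A quick numerical illustration of Gerschgorin's theorem follows; the matrix is an arbitrary example, and the loop checks that every eigenvalue lies in at least one of the disks.

    import numpy as np

    A = np.array([[ 4.0,  1.0,  0.5],
                  [ 0.2, -3.0,  1.0],
                  [ 1.0,  0.3,  2.0]])

    centers = np.diag(A)
    radii = np.abs(A).sum(axis=1) - np.abs(centers)   # sum over j != k of |a_{kj}|

    # Every eigenvalue of A lies in the union of the disks D_k.
    for lam in np.linalg.eigvals(A):
        print(lam, np.any(np.abs(lam - centers) <= radii))   # True for each eigenvalue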
