Matrix similarity
In linear algebra, two n-by-n matrices A and B are called similar if

B = P⁻¹AP

for some invertible n-by-n matrix P. Similar matrices represent the same linear operator under two different bases, with P being the change of basis matrix.
The transformation A ↦ P⁻¹AP is called a similarity transformation or conjugation of the matrix A. In the general linear group, similarity is therefore the same as conjugacy, and similar matrices are also called conjugate; however, in a given subgroup H of the general linear group, the notion of conjugacy may be more restrictive than similarity, since it requires that P can be chosen to lie in H.
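As a concrete numerical illustration (not part of the original text), the following NumPy sketch builds B = P⁻¹AP for an arbitrarily chosen invertible P and checks that A and B share trace, determinant, and eigenvalues; the specific matrices are hypothetical.

```python
import numpy as np

# Hypothetical matrices chosen for illustration: A is arbitrary, P is invertible.
A = np.array([[2.0, 1.0],
              [0.0, 3.0]])
P = np.array([[1.0, 1.0],
              [1.0, 2.0]])   # det(P) = 1, so P is invertible

# B = P^{-1} A P is similar to A by construction.
B = np.linalg.inv(P) @ A @ P

# Similar matrices share trace, determinant and eigenvalues (up to rounding).
print(np.trace(A), np.trace(B))
print(np.linalg.det(A), np.linalg.det(B))
print(np.sort(np.linalg.eigvals(A)), np.sort(np.linalg.eigvals(B)))
```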
Change of coordinates
The concept of matrix similarity can also be seen in the change of basis of a linear transformation.
Consider the n×n matrix A that transforms vectors x into vectors X in the n-dimensional vector space V = Rⁿ, that is

X = Ax,

where both vectors x and X are measured relative to the standard coordinate basis e1, e2, ..., en.
Now let B be an n×n matrix that changes the coordinates so that the vectors x and X are measured relative to a different set of basis vectors, b1, b2, ..., bn. Let the matrix B be constructed with bi as its ith column. This matrix is non-singular because its columns form a basis and are therefore linearly independent, which means the inverse of B exists. The components x' and X' of x and X measured relative to the basis bi are given by

x = Bx',  X = BX'.
The transformation A can be defined in the new coordinates by the calculation

X' = B⁻¹X = B⁻¹Ax = B⁻¹ABx' = Kx',

where

K = B⁻¹AB.
Thus, the matrix K defines the transformation A in the new coordinate basis, and results from a similarity transformation of A.
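A minimal NumPy sketch of this change of coordinates, assuming an arbitrary 2×2 transformation and an arbitrary basis matrix (both hypothetical): the transformation applied in standard coordinates and the transformation K applied in the new coordinates describe the same vector.

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 3.0]])        # the transformation in standard coordinates
Bmat = np.array([[1.0, 1.0],
                 [0.0, 1.0]])     # columns are the new basis vectors b1, b2

K = np.linalg.inv(Bmat) @ A @ Bmat    # K = B^{-1} A B

x = np.array([1.0, 2.0])
X = A @ x                             # transform in the standard basis

x_new = np.linalg.inv(Bmat) @ x       # coordinates x' of x in the new basis
X_new = K @ x_new                     # transform carried out in the new basis

# Both routes describe the same vector: B @ X_new reproduces X.
print(np.allclose(Bmat @ X_new, X))   # True
```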
Eigenvector basis
In this section, it is shown that a matrix A with a linearly independent system of eigenvectors is similar to a diagonal matrix formed from its eigenvalues.
Let A be an n×n matrix with n linearly independent eigenvectors v1, v2, ..., vn, and consider the change of coordinates that expresses A relative to this eigenvector basis.
Recall that the eigenvectors vi of A satisfy the eigenvalue equation

Avi = λivi,  i = 1, 2, ..., n.

Assemble these eigenvectors into the matrix V, which is invertible because these vectors are assumed to be linearly independent. This means the coordinates x' and X' of x and X relative to the basis vi can be computed as

x = Vx',  X = VX'.

This yields the change of coordinates

X' = V⁻¹X = V⁻¹Ax = V⁻¹AVx' = Kx',  where K = V⁻¹AV.
To see the effect of this change of coordinates on A, introduce I = VV⁻¹ into the eigenvalue equation:

AVV⁻¹vi = λivi,

and multiply both sides by V⁻¹ to obtain

(V⁻¹AV)(V⁻¹vi) = λi(V⁻¹vi).

Notice that

V⁻¹vi = ei,

which is the natural basis vector (since vi is the ith column of V). Thus,

(V⁻¹AV)ei = Kei = λiei,

and the matrix K = V⁻¹AV is found to be a diagonal matrix with the eigenvalues λi as its diagonal elements.
This shows that a matrix A with a linearly independent system of eigenvectors is similar to a diagonal matrix formed from its eigenvalues.
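A brief NumPy sketch of this diagonalization; the matrix is an arbitrary example with distinct eigenvalues, so its eigenvectors are automatically linearly independent.

```python
import numpy as np

# An arbitrary example with distinct eigenvalues (2 and 5), so the
# eigenvectors are automatically linearly independent.
A = np.array([[4.0, 1.0],
              [2.0, 3.0]])

eigvals, V = np.linalg.eig(A)     # columns of V are eigenvectors of A

K = np.linalg.inv(V) @ A @ V      # K = V^{-1} A V
print(np.round(K, 10))            # numerically diagonal
print(np.allclose(K, np.diag(eigvals)))   # True
```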
Properties
Similarity is an equivalence relation on the space of square matrices.
Similar matrices share any properties that are really properties of the represented linear operator (several of these invariants are checked numerically in the sketch after this list):
- Rank
- Characteristic polynomial, and attributes that can be derived from it:
- Determinant
- Trace
- Eigenvalues, and their algebraic multiplicities
- Geometric multiplicities of eigenvalues (but not the eigenspaces, which are transformed according to the base change matrix P used).
- Minimal polynomial
- Elementary divisors, which form a complete set of invariants for similarity
- Rational canonical form
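As a numerical check of several of these shared invariants, here is a small NumPy sketch; the matrices A and P are arbitrary choices for illustration.

```python
import numpy as np

A = np.array([[1.0, 2.0, 0.0],
              [0.0, 2.0, 3.0],
              [0.0, 0.0, 4.0]])
P = np.array([[1.0, 0.0, 1.0],
              [0.0, 1.0, 0.0],
              [0.0, 0.0, 1.0]])          # invertible (det = 1)
B = np.linalg.inv(P) @ A @ P             # B is similar to A

checks = [("rank",              np.linalg.matrix_rank),
          ("trace",             np.trace),
          ("determinant",       np.linalg.det),
          ("char. poly coeffs", np.poly),
          ("eigenvalues",       lambda M: np.sort(np.linalg.eigvals(M)))]

for name, f in checks:
    print(name, f(A), f(B))   # each pair agrees up to floating-point rounding
```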
Because of this, for a given matrix A, one is interested in finding a simple "normal form" B which is similar to A—the study of A then reduces to the study of the simpler matrix B. For example, A is called diagonalizable if it is similar to a diagonal matrix. Not all matrices are diagonalizable, but at least over the complex numbers (or any algebraically closed field), every matrix is similar to a matrix in Jordan form. Neither of these forms is unique (diagonal entries or Jordan blocks may be permuted) so they are not really normal forms; moreover their determination depends on being able to factor the minimal or characteristic polynomial of A (equivalently to find its eigenvalues). The rational canonical form does not have these drawbacks: it exists over any field, is truly unique, and it can be computed using only arithmetic operations in the field; A and B are similar if and only if they have the same rational canonical form. The rational canonical form is determined by the elementary divisors of A; these can be immediately read off from a matrix in Jordan form, but they can also be determined directly for any matrix by computing the Smith normal form, over the ring of polynomials, of the matrix (with polynomial entries) XIn − A (the same one whose determinant defines the characteristic polynomial). Note that this Smith normal form is not a normal form of A itself; moreover it is not similar to XIn − A either, but obtained from the latter by left and right multiplications by different invertible matrices (with polynomial entries).
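To make the diagonalizable/Jordan-form distinction concrete, here is a small SymPy sketch (the matrices are hypothetical examples, not taken from the text); SymPy's Matrix.jordan_form returns P and J with A = PJP⁻¹.

```python
from sympy import Matrix

# A hypothetical non-diagonalizable matrix: eigenvalue 2 has algebraic
# multiplicity 2 but only one independent eigenvector.
A = Matrix([[2, 1],
            [0, 2]])

P, J = A.jordan_form()        # SymPy returns P, J with A = P*J*P**-1
print(J)                      # a single 2x2 Jordan block for eigenvalue 2
print(P * J * P.inv())        # reproduces A exactly

# A diagonalizable matrix, by contrast, has a diagonal Jordan form.
B = Matrix([[2, 1],
            [1, 2]])
Pb, Jb = B.jordan_form()
print(Jb)                     # diagonal with eigenvalues 1 and 3
```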
Notes
Similarity of matrices does not depend on the base field: if L is a field containing K as a subfield, and A and B are two matrices over K, then A and B are similar as matrices over K if and only if they are similar as matrices over L. This is so because the rational canonical form over K is also the rational canonical form over L. This means that one may use Jordan forms that only exist over a larger field to determine whether the given matrices are similar.
In the definition of similarity, if the matrix P can be chosen to be a permutation matrix then A and B are permutation-similar; if P can be chosen to be a unitary matrix then A and B are unitarily equivalent. The spectral theorem says that every normal matrix is unitarily equivalent to some diagonal matrix. Specht's theorem states that two matrices are unitarily equivalent if and only if they satisfy certain trace equalities.
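For the spectral-theorem remark, a short NumPy sketch using a real symmetric (hence normal) matrix: numpy.linalg.eigh returns an orthogonal Q, so QᵀAQ is a unitary (here orthogonal) similarity that diagonalizes A. The matrix is an arbitrary example.

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])           # real symmetric, hence normal

eigvals, Q = np.linalg.eigh(A)       # Q has orthonormal eigenvector columns

D = Q.T @ A @ Q                      # an orthogonal (real unitary) similarity

print(np.allclose(D, np.diag(eigvals)))   # True: A is unitarily equivalent to a diagonal matrix
print(np.allclose(Q.T @ Q, np.eye(2)))    # True: Q is orthogonal
```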