The Quaternions
The Quaternions
Fuzhen Zhang+
Department of Mathematical Science
Nova Southeastern University
Fort Lauderdale, Florida 33314
ABSTRACT
1. INTRODUCTION
The family of quatemions plays a role in quantum physics [l, 18, 191. It
often appears in mathematics as an algebraic system-a skew field or
noncommutative division algebra [7]. While matrices over commutative rings
have gained much attention [6], the literature on matrices with quatemion
entries, though dating back to 1936 [39], is fragmentary. Renewed interest
has been wnnessed recently [3, 4, 9, 11-13, 31, 36, 35, 47, 481. In hopes that
it will be useful to a wide audience, a concise survey on matrices of
quatemion entries with new proofs for some known results is presented here.
* Part of this paper was presented at the AMS-MAA joint meeting, San Antonio, January
1993, under the title “Everything about the matrices of quatemions.”
’ The main work was done while the author was a student of R. C. Thompson’s at the
University of California, Santa Barbara. E-mail: zhang@alpha .nova. edu.
QUESTION
2. Does every quatemion matrix A have an eigenvalue?
As usual, let C and [w denote the fields of the complex and real numbers
respectively. Let Q be a four-dimensional vector space over [w with an
ordered basis, denoted by e, i, j, and k.
A renl quaternion, simply called quaternion, is a vector
.z=j2=k’= -1,
1
If a and b are any (real) scalars, while u, v are any two of e, i, j, k, then
the product (au)(bv) is defined as (abXuv). These rules, along with the
distribution law, determine the product of any two quaternions.
Real numbers and complex numbers can be thought of as quatemions in
the natural way. Thus xae + x,i + x2j + x,k is simply written as xg +
x,i + x2j + x,k.
24 FUZHEN ZHANG
One can also prove the lemma by changing (1) to a linear homogeneous
system of real variables.
It is readily seen that 1 x] contains a single element if and only if x E R. If
r G R, then [xl contains infinitely many quatemions, among which there are
only two complex numbers that are a conjugate pair; moreover x - x* for
every quatemion x.
This lemma yields the following theorem.
l I
aI a2
A:q=a,+a,jEQ+q’= - - EQ’.
- a2 a,
Then _& is
. b’j1 ec ti ve and preserves the operations. Furthermore, 191’ = det 9’,
and the eigenvalues of 9’ are Re 9 f IIm q/i.
The fundamental theorem of algebra over the quatemion skew field has
gained little attention. It is of less importance than that over the complex
number field, though some problems may reduce to it.
Let f(r) = a,x” + a,,_,x”-’ + ... +a,x + a, be a polynomial in x
over Q and a, z 0. It is natural to ask whether f(z) = 0 always has a
solution and how many solutions f(x) = 0 has (as an example, x2 + 1 = 0
has infinitely many solutions in Q). More generally, let F(X) be a polynomial
in x with general terms a,, xa, x 1.1 xa,, , where the a,‘s are quatemions. Does
F(x) = 0 have a solution in C2??
This question was partially answered in [16].
The proof in [16] 1sa topological one and is accomplished by showing that
f(x) and g(x) = x” are homotopic mappings and the latter has degree n.
QUATERNIONS AND MATRICES OF QUATERNIONS 27
Note that f<~> has only one term with “degree” n. As a consequence, we
have the following result which was first shown in [32].
(4A)B = Q(AB),
(A41B = A(4B),
(P4)A = P(4A)-
All operators for complex matrices can be performed except the ones such
as
(qA)B = A(@)
1. (&T = (AT);
2. CAB)*
- --= B*A*;
3. AB # AB in general;
4 ( AB)T # B?‘A’ in general;
5: CAB)-’ = B-‘A-’ ifs and B are invertible;
6. (A*)-’ = (A-‘)* ifA is invertible;
7. (A)-’ # A- ’ in general;
8. (A’)-’ # (A-‘)r in general.
Proof. First note that the proposition is true for complex matrices.
Let A = A, + A, j, B = B, + B,j, where A,, A,, B,, and B, are
QUATERNIONS AND MATRICES OF QUATERNIONS 29
-
AB=Za (A,B,-A,B,)+(A,B,+A,B,)j=Z
-(n,,x,)(_; ;) =(Z>O)
-
*B,A,-B,A, =I, B,A,+B,A, =0
-
= (B,A, - B,A,) + (B,A, + B,x)j = I
* BA =I. n
The adjoint of a quatemion matrix has been exploited in the study of its
quatemionic numerical range [3, 4, 27, 361; this we will enlarge on later.
If
30 FUZHEN ZHANG
‘1 90 + 9,i 0 yz + y3i\
0 1 0 0
xe= 0 -9; + 9G 1 90 - 9,i *
\o 0 0 1 I
1. XI = zzn;
2. XiB = XA X8;
3. XA+B = XA t XBJ
4. xA* = ( &I*;
5. ~~-1 = (xA)-’ fA_’ exists;
6. xA is unitaq, Hermitian, or normal if and only if A is unitary,
Hermitian, or normal, respect&y;
7. if IA is a Jordan block with A on the diagonal, then
IA B>O
-13
I A’. (3)
QUATERNIONS AND MATRICES OF QUATERNIONS 31
In particular,
I I
I A
> 0,
-A z
we get
I I
A B
> 0.
-B A
1. A is invertible;
2. Ax = 0 has a unique solution 0.
3. ( xAl # 0, i.e., xA is invertible.
4. A has no zero eigenvalue. More precisely, if Ax = hx or Ax = xh for
some quaternion A and some quaternion vector x # 0, then A # 0.
5. A is the product of elementary quaternion matrices.
32 FUZHEN ZHANG
Thus Ax = 0 is equivalent to
A,x, -A,< = 0
and
A,x, + A,r, = 0.
A,x, +A,(-x,) =0
and
- _
(-A,)x, + A,(-x,) =O,
5. EIGENVALUES
Then the left eigenvalues of A are 1 and i, whereas the right eigenvalues of
A are 1 and all the quatemions in [il.
A=
Then A has two left eigenvalues + l/(fixi + j>, and infinitely many right
eigenvalues which are the quatemions satisfying A4 + 1 = 0. Note that the
c~,( A) is not discrete and that al(A) n u,.,(A) = 4.
34 FUZHEN ZHANG
A= 0 i
i -i 0' 1
Then k is a left eigenvalue but not a right one. Note that A is Hermitian.
The proof is basically the same as in the complex case. The situation of
general matrices is much more complicated. Generally speaking, there is no
very close relation between left and right eigenvalues. For real matrices,
however, we have the following theorem.
(qAr/-‘)qx = (qhq-‘)qx
and
Aqx = (qhq-l)qx,
For any given matrix A E M,(Q), does there always exist A E Q and
nonzero column vector x of quatemions such that AX = Ax? This is of
course a very basic question. It was raised in [I4, p. 2171 and was later proved
by Wood [411. Woods p roof, adopted below, is a purely topological one.
QUATERNIONS AND MATRICES OF QUATERNIONS 35
and
f,,(A) = -A,
fi( A) = AZ - A = g,(A),
and
g,(A) = AZ.
Thus g,, is homotopic to fo. On the other hand, f0 and g,, viewed as the
maps from the S-sphere S 3 into GL(n, Q), correspond to integers 0 and n,
respectively, in n,GL(n, Q), the third homotopy group (isomorphic to the
integers [53]) of GL(n, Q). This is a contradiction. n
(4)
or
(5)
Since
U*AU =
( 1
; B” ,
appears an even number of times, c~ncl the complex eigewalues ofthat matrix
appear in conjugate pairs.
Note that Proposition 4.2 follows from Corollary 5.1 immediately. The
structure of the Jordan canonical form of the block matrix in the above
corollary will be given in the next section.
The following illustrates why this idea does not come through for the (left
eigenvahie) system An- = As.
Let A = A, + h2j. Then
and
A, - A,1
0.
-A, + A,1
A, - A,1 A, -A,1
- - 0. (6)
-A, +\,I A, - A,1 =
QUATERNIONS AND MATRICES OF QUATERNIONS 39
Note that, by Proposition 4.2, the left-hand side of the above identity is
nonnegative. Can this function, regarded as a function on [w4, vanish? It is
unknown if this approach can go through without using Woods result.
I I
A
= O?
-E Pi
6. CANONICAL FORMS
This can be shown by the mathematical induction and Lemmas 5.1 and
5.2.
As noticed earlier, every diagonal entry of a triangular matrix is a left
eigenvalue. The following says that they are also right eigenvalues.
A=
Then
A(;)=(: ;,)(j)=(h’rA;l:ly)=(;;)=(;)h.
QUATERNIONS AND MATRICES OF QUATERNIONS
XSS’XAXS = x] =
42 FUZHEN ZHANG
This means that x, is the Jordan form of xA. Conversely, let j be the Jordan
form of xA, and let P be an invertible matrix such that P-‘/YAP = J. If it can
be shown that P and J take the forms
Thus
xx* K2xX= XD
and therefore
so
UAV =
where D, = diag(d,, . . . , cl,) and the d’s are the positive singular values of A.
It is readily seen that A, A*, AA*, and A*A are all of the same rank.
Two square matrices of quatemions A and B are said to be similar if
there exists an invertible quatemion matrix S of the same size such that
S-‘AS = B. It is immediate that similar quatemion matrices have the same
(right) eigenvalues. This is not true for left eigenvalues. As one sees below,
even the trace is not preserved under similarity. Recall that the trace of a
square matrix is the sum of the main diagonal entries of the matrix, which is
equal to the sum of the eigenvalues of the matrix in the complex case.
Then A and B are similar, but they have different traces and different left
eigenvalues.
U= , and B = U*AU.
1
- -*
\ v5J di I
Then the column vectors of A arc left linearly dependent and right linearly
independent. A is of rank 2, a11d is invertible.
It is easy to show that an II X )I qllaternion matrix A is diagonalizable if
and only if A has II right linearly independent eigenvectors belonging to
right eigenvahles. The following example says that the eigenvectors belonging
to different eigenavhles are not necessarily (right or left) linearly indepen-
dent.
Hamilton theorem, which have been obtained by several authors in the last
decade through extremely difficult approaches.
Let A be an n x n quatemion matrix, and let xA be the complex adjoint
matrix of A. We define the q-determinant of A to be 1~~1,simply called the
determinant of A, denoted by lAlq, i.e., l Al, = l xAl by definition.
It is immediate that l Al, = l All Al = (det Al” when A is a complex
matrix.
We have the following results on determinants of quaternion matrices.
1. A is invertible * I Al, # 0.
2. IABI, = IAI,IB(,, consequently IA-‘I, = lAl,I’, ifA_’ exists.
3. IPAQI, = l Al,, for any elementary matrices P and Q.
4. I Al, = n:= Jh,12 > 0, where the A,‘s are the standard eigenvalues of
A.
5. Cayley-Hamilton theorem: Let A be a square matrix of quaternions,
and FA(A) = lAZ,, - ,yAl, called the characteristic polynomial of A, where A
is a complex indeterminant. Then F,(A) = 0, and F,(A,) = 0 if and only if
A, is an eigenvalue.
6. Zf A and B are similar, then l Al, = (Bl, and F,(A) = F,(A).
7. Hadamard theorem: Let A = (a,,) be positive semi&finite Hermi-
tian. Then 141, Q 17;=,q,!t, with equality if and only if A is real diagonal.
Furthermore, if A is partitioned as
A=
with the convention that each n,, t = 1,2, . . . , r, is the largest number in its
cycle and that n = n, > n2 > ... > n, > 1. For example, if u is the
identity, then IT = (n)(n - 1) . ..(2)(l).
QUATERNIONS AND MATRICES OF QUATERNIONS 49
x . ..
an,k,ak,k3 “’ ak~_,k,ak,n,
= c E((T)cz,.
CTE S”
A=
It follows from the definition and the above lemma that if D is an upper
triangular matrix with diagonal entries A,, . . . , A,, then IDl,, = n:= II h,12.
Applying Theorem 6.3, we have
50 FUZHEN ZHANG
j=li=l
and ecpality holds if ancl only if the colunm vectors of A are mutually
orthogonal.
to that of its intersection with the complex plane, and further reduce the
general matrix case to 2-by-2 matrices. For normal quatemion matrices, we
will give explicit characterizations of Wg< A) n [wand W,(A) n @.
Let B(A) = WY<A) n @. We call B(A), as Kippenhahn did, the bild (or
representative) of A, and the subset of B(A) lying in the closed upper half
plane the upper bild of A, denoted by B+(A). Note that a complex number
c E B(A) if and only if C E B(A), by Theorem 2.1, part 5. Thus I?( A) =
B + ( A) U B _ ( A), where B_(A) is the reflection of I? + ( A) about the x-axis
in the complex plane. As we will see shortly, B(A) is not convex in general,
while W,( ,yA), symmetrically suited about the x-axis since xA and L are
similar, is convex, Note that if x E W7( A), then any quatemion similar to x
is contained in W,(A). In particular, x* E W,(A).
A comparison of B(A) and W,( xA) is given below.
A=
52 FUZHEN ZHANG
and
respectively.
The last part of the previous theorem ensures that the intersection of
B+(A) with any vertical line is an either empty set or a line segment. The
same is true for horizontal lines [4].
QUATERNIONS AND MATRICES OF QUATERNIONS 53
Co(u*Au) = (uy,u;)
(; q(::)
where A = A, + A, j and u = U, + u2 j, as before.
Note that if 9 = 9” + 9,i + q2j + q,k E W,(A), then
90 +dmi~W~(A)n@
and
and
54 FUZHEN ZHANC
under the convention that (h,7k, + h,k,)/(k,5 + k,) means a pair {h,, h,]
when k, = k, = 0. Consequently B+(A) is convex.
The main idea of the proof in [36] or [47] is to convert the upper-bild
investigation into an extremal problem of eigenvalues with side conditions,
then employ Langrange multipliers. Au-Yeung [3] later shortened the proof
by showing that the upper bild coincides with the union of certain consecu-
tive triangles.
lust as for the case of complex matrices [23, p. 181, the convexity of the
upper bild of a quaternion matrix is easily reduced to that of 2 X 2 cases.
k,i
G=
-a+pj
QUESTION 9.1. Is there a short and conceptual proof for Theorem 9.6?
QUATERNIONS AND MATRICES OF QUATERNIONS 55
QUESTION 9.2. How are B(A) and W,( xA) related? What is the area
ratio of these two sets?
QUESTION 9.3. Is there a simple proof (without using Theorem 9.6) for
the statement that the quatemionic numerical range of the diagonal matrix
formed by the diagonal entries (or eigenvalues) of A is contained in WC,<A)?
REFERENCES