Minors and the Laplace Expansion - Jörg Liesen Volker Mehrmann

holds for all A,B∈ R^n,n, although we have shown the result in Theorem7.15only for A,B ∈K^n,n.

The proof of Theorem7.15suggests that det(A)can be easily computed while transforming A∈ K^n,ninto its echelon form using elementary row operations.

Corollary 7.16 For A∈ K^n,nlet S₁, . . . ,St ∈ K^n,nbe elementary matrices, such that A = St. . .S₁A is in echelon form. Then either A has a zero row and hence det(A)=0, orA=Inand hencedet(A)=(det(S₁))⁻¹ · · ·(det(St))⁻¹.

As shown in Theorem5.4, every matrixA∈ K^n,ncan be factorized asA=P LU, and hence det(A) = det(P)det(L)det(U). The determinants of the matrices on the right hand side are easily computed, since these are permutation and triangular matrices. AnLU-decomposition of a matrix Atherefore yields an efficient way to compute det(A).

MATLAB-Minute.

Look at the matriceswilkinson(n)forn=2,3,. . .,10in MATLAB. Can you find a general formula for their entries? Forn=2,3,_{. . .},10compute

A=wilkinson(n)

[L,U,P]=lu(A)(LU-decomposition; cp. the MATLAB-Minute above Defi- nition5.6)

det(L), det(U), det(P), det(P)_∗det(L)_∗det(U), det(A)

Which permutation is associated with the computed matrixP? Why isdet(A) an integer for oddn?

92 7 Determinants of Matrices

Theorem 7.18 For A∈R^n,n, n≥2, we have

Aadj(A) = adj(A)A = det(A)In.

In particular A is invertible if and only if det(A) ∈ R is invertible. In this case (det(A))⁻¹ =det(A⁻¹)and A⁻¹=(det(A))⁻¹adj(A).

Proof LetB= [bi j]have the entriesbi j =(−1)ⁱ⁺^jdet(A(j,i)). ThenC= [ci j] = adj(A)Asatisfies

ci j = n k=1

bi kak j = n

k=1

(−1)ⁱ⁺^kdet(A(k,i))ak j.

Letaℓbe theℓth column ofAand let

A(k,i):= [a₁, . . . ,ai−1,ek,ai+1, . . . ,an] ∈ R^n,n,

whereek is thekth column of the identity matrix In. Then there exist permutation matricesPandQthat performk−1 row andi−1 column exchanges, respectively, such that

PA(k, i)Q=

1 ⋆ 0 A(k,i)

Using (1) in Lemma7.10we obtain

det(A(k,i))=det

1 ⋆

0 A(k,i) =det(PA(k,i)Q)

=det(P)det(A(k, i))det(Q)

=(−1)^(k⁻¹⁾⁺⁽ⁱ⁻¹⁾det(A(k,i))

=(−1)^k⁺ⁱdet(A(k,i)).

The linearity of the determinant with respect to the columns now gives

c_{i j} = n k=1

(−1)ⁱ⁺^k(−1)^k⁺ⁱa_{k j}det(A(k,i))

=det([a1, . . . ,ai−1,aj,ai+1, . . . ,an])

0, i = j det(A), i = j

=δi j det(A),

and thus adj(A)A=det(A)In. Analogously we can show thatAadj(A)=det(A)In.

If det(A)∈ Ris invertible, then

In =(det(A))⁻¹adj(A)A=A(det(A))⁻¹adj(A),

i.e.,Ais invertible with A⁻¹=(det(A))⁻¹adj(A). If, on the other hand,Ais invertible, then

1=det(In)=det(A A⁻¹)=det(A)det(A⁻¹)=det(A⁻¹)det(A), where we have used the multiplication theorem for determinants over R (cp. our comment following the proof of Theorem7.15). Thus, det(A) is invertible with (det(A))⁻¹ =det(A⁻¹), and again A⁻¹=(det(A))⁻¹adj(A). ⊓⊔ Example 7.19

(1) For

A= 4 1

2 1

∈Z^2,2

we have det(A) = 2 and thus A is not invertible. But A is invertible when considered as an element ofQ^2,2, since in this case det(A⁻¹)=(det(A))⁻¹= ¹₂. (2) For

t−1t−2 t t−1

∈(Z[t])^2,2

we have det(A)=1. The matrixAis invertible, since 1∈Z[t]is invertible.

Note that if A ∈ R^n,n is invertible, then Theorem7.18shows that A⁻¹ can be obtained by inverting only one ring element, det(A).

We now use Theorem7.18 and the multiplication theorem for matrices over a commutative ring with unit to prove a result already announced in Sect.4.2: In order to show that A ∈ R^n,n is the (unique) inverse of A ∈ R^n,n, only one of the two equations A A=In orAA=Inneeds to be checked.

Corollary 7.20 Let A∈R^n,n. If a matrixA∈R^n,nexists withA A=I_nor AA=I_n, then A is invertible andA=A⁻¹.

Proof IfA A =In, then the multiplication theorem for determinants yields 1=det(In)=det(A A)=det(A)det(A)=det(A)det(A),

i.e., det(A)∈ R is invertible with(det(A))⁻¹ =det(A). Thus also Ais invertible and has a unique inverseA⁻¹. Forn =1 this is obvious and forn≥2 it was shown in Theorem7.18. If we multiply the equationA A= Infrom the right with A⁻¹we get A= A⁻¹.

The proof starting fromAA=Inis analogous. ⊓⊔

94 7 Determinants of Matrices Let us summarize the invertibility criteria for a square matrix over a field that we have shown so far:

A∈G Ln(K) Theorem 5.2

⇐⇒ The echelon form of Ais the identity matrixIn Definition 5.10

⇐⇒ rank(A)=n

clear

⇐⇒ rank(A)=rank([A,b])=n for allb∈ K^n,1

Algorithm 6.6

⇐⇒ |L(A,b)| =1 for allb∈K^n,1

Theorem 7.18

⇐⇒ det(A)=0. (7.3)

Alternatively we obtain:

A∈/G L_n(K) Theorem 5.2

⇐⇒ The echelon form of Ahas at least one zero row

Definition 5.10

⇐⇒ rank(A) <n

clear

⇐⇒ rank([A,0]) <n

Algorithm 6.6

⇐⇒ L(A,0)= {0}

Theorem 7.18

⇐⇒ det(A)=0. (7.4)

In the fieldsQ,RandCwe have the (usual) absolute value| · |of numbers and can formulate the following useful invertibility criterion for matrices.

Theorem 7.21 If A∈K^n,nwith K ∈ {Q,R,C}isdiagonally dominant, i.e., if

|ai i|>

j=1 j=i

|ai j| for all i =1, . . . ,n,

thendet(A)=0.

Proof We prove the assertion by contraposition, i.e., by showing that det(A) =0 implies that Ais not diagonally dominant.

If det(A) = 0, then L(A,0) = {0}, i.e., the homogeneous linear system of equations Ax = 0 has at least one solution'x = ['x₁, . . . ,'x_n]^T =0. Let'x_m be an entry of'xwith maximal absolute value, i.e.,|'x_m| ≥ |'x_j|for all j = 1, . . . ,n. In particular, we then have|'x_m|>0. Themth row of A'x=0 is given by

a_m1'x₁+a_m2'x₂+. . .+a_mn'x_n=0 ⇔ a_mm'x_m= − n

j=1 j=m

a_{m j}'x_j.

We now take absolute values on both sides and use the triangle inequality, which yields

|a_mm| |'x_m| ≤ n

j=1 j=m

|a_{m j}| |'x_j| ≤ n

j=1 j=m

|a_{m j}||'x_m|, hence |a_mm| ≤ n

j=1 j=m

|a_{m j}|,

so that Anot diagonally dominant. ⊓⊔

The converse of this theorem does not hold: For example, the matrix

A= 1 2

1 0

∈Q^2,2, has det(A)= −2=0, butAis not diagonally dominant.

From Theorem7.18we obtain theLaplace expansion⁴of the determinant, which is particularly useful when Acontains many zero entries (cp. Example7.24below).

Corollary 7.22 For A∈ R^n,n, n≥2, the following assertions hold:

(1) For each i =1,2, . . . ,n we have det(A) =

n j=1

(−1)ⁱ⁺^jai jdet(A(i,j)).

(Laplace expansion ofdet(A)with respect to the i th row A.) (2) For each j =1,2, . . . ,n we have

det(A) = n

i=1

(−1)ⁱ⁺^jai jdet(A(i,j)).

(Laplace expansion ofdet(A)with respect to the j th column of A.)

Proof The two expansions for det(A)follow immediately by comparison of the diagonal entries in the matrix equations det(A)I_n = Aadj(A)and det(A)I_n =

adj(A)A. ⊓⊔

The Laplace expansions allows a recursive definition of the determinant: ForA∈ R^n,nwithn ≥2, let det(A)be defined as in (1) or (2) in Corollary7.22. We can choose an arbitrary row or column ofA. The formula for det(A)then contains only matrices of size(n−1)×(n−1). For each of these we can use the Laplace expansion again, now expressing each determinant in terms of determinants of(n−2)×(n−2)matrices.

We can do this recursively until only 1×1 matrices remain. For A= [a11] ∈ R^1,1 we define det(A):=a11.

Finally we stateCramer’s rule,⁵which gives an explicit formula for the solution of a linear system in form of determinants. This rule is only of theoretical value, because in order to compute then components of the solution it requires the evaluation of n+1 determinants ofn×nmatrices.

4Pierre-Simon Laplace (1749–1827) published this expansion in 1772.

5Gabriel Cramer (1704–1752).

96 7 Determinants of Matrices Corollary 7.23 Let K be a field, A ∈ G Ln(K)and b ∈ K^n,1. Then the unique solution of the linear system of equations Ax =b is given by

'x= ['x₁, . . . ,'xn]^T = A⁻¹b=(det(A))⁻¹adj(A)b, with

'xi = det[a₁, . . . ,ai−1,b,ai+1, . . . ,an]

det(A) , i =1, . . . ,n.

Example 7.24 Consider

⎡

⎢⎢

⎣ 1 3 0 0 1 2 0 0 1 2 1 0 1 2 3 1

⎤

⎥⎥

⎦∈Q^4,4, b=

⎡

⎢⎢

⎣ 1 2 1 0

⎤

⎥⎥

⎦∈Q^4,1.

The Laplace expansion with respect to the last column yields

det(A)=1·det

⎛

⎝

⎡

⎣ 1 3 0 1 2 0 1 2 1

⎤

⎦

⎞

⎠=1·1·det 1 3

1 2 =1·1·(−1)= −1.

Thus,Ais invertible andAx =bhas a unique solution'x=A⁻¹b∈Q^4,1, which by Cramer’s rule has the following entries:

'x₁=det

⎛

⎜⎜

⎝

⎡

⎢⎢

⎣ 1 3 0 0 2 2 0 0 1 2 1 0 0 2 3 1

⎤

⎥⎥

⎦

⎞

⎟⎟

⎠/det(A)= −4/(−1)=4,

'x₂=det

⎛

⎜⎜

⎝

⎡

⎢⎢

⎣ 1 1 0 0 1 2 0 0 1 1 1 0 1 0 3 1

⎤

⎥⎥

⎦

⎞

⎟⎟

⎠/det(A)=1/(−1)= −1,

'x₃=det

⎛

⎜⎜

⎝

⎡

⎢⎢

⎣ 1 3 1 0 1 2 2 0 1 2 1 0 1 2 0 1

⎤

⎥⎥

⎦

⎞

⎟⎟

⎠/det(A)=1/(−1)= −1,

'x4=det

⎛

⎜⎜

⎝

⎡

⎢⎢

⎣ 1 3 0 1 1 2 0 2 1 2 1 1 1 2 3 0

⎤

⎥⎥

⎦

⎞

⎟⎟

⎠/det(A)= −1/(−1)=1.

Exercises

7.1 A permutationσ∈S_nis called anr -cycleif there exists a subset{i₁, . . . ,i_r} ⊆ {1,2, . . . ,n}withr≥1 elements and

σ(i_k)=i_k₊₁fork=1,2, . . . ,r−1, σ(i_r)=i₁, σ(i)=i for i∈ {/ i₁, . . . ,i_r}. We write anr-cycle asσ=(i₁,i₂, . . . ,ir). In particular, a transpositionτ ∈Sn

is a 2-cycle.

(a) Letn=4 and the 2-cyclesτ_1,2=(1,2),τ_2,3=(2,3)andτ_3,4=(3,4)be given. Computeτ_1,2◦τ_2,3,τ_1,2◦τ_2,3◦τ_1,2⁻¹, andτ_1,2◦τ_2,3◦τ_3,4.

(b) Letn≥4 andσ=(1,2,3,4). Determineσ^jfor j=2,3,4,5.

(d) Show that two cycles with disjoint elements, i.e.(i₁, . . . ,ir)and(j₁, . . . ,js) with{i₁, . . . ,ir} ∩ {j₁, . . . ,js} =Ø, commute.

(e) Show that every permutationσ∈ Sn can be written as product of disjoint cycles that are, except for the order, uniquely determined byσ.

7.2 Prove Lemma7.10(1) using (7.1).

7.3 Show that the group homomorphism sgn : (Sn,◦)→({1,−1},·)satisfies the following assertions:

(a) The setAn = {σ∈ Sn|sgn(σ)=1}is a subgroup ofSn(cp. Exercise3.8).

(b) For allσ∈ Anandπ∈ Snwe haveπ◦σ◦π⁻¹ ∈ An. 7.4 Compute the determinants of the following matrices:

(a) A= [e_n,e_n₋₁, . . . ,e₁] ∈Z^n,n, wheree_i is theith column of the identity matrix.

(b) B= bi j

∈Z^n,nwith

bi j =

⎧⎪

⎨

⎪⎩

2 for |i−j| =0,

−1 for |i−j| =1, 0 for |i−j| ≥2.

(c)

C =

⎡

⎢⎢

⎣

1 0 1 0 0 0 0

e 0 e^π 4 5 1 √

π e² 1 ¹⁷₃₁ √

6 √ 7 √

8√ 10 e³ 0 −e π e 0 π^e e⁴ 0 10001 0 π⁻¹ 0 e²π e⁶ 0 √

2 0 0 0 −1

0 0 1 0 0 0 0

⎤

⎥⎥

⎦

∈R^7,7.

98 7 Determinants of Matrices (d) The 4×4 Wilkinson matrix⁶ (cp. the MATLAB-Minute at the end of

Sect.7.2).

7.5 Construct matrices A,B ∈ R^n,n for some n ≥ 2 and with det(A+B) = det(A)+det(B).

7.6 LetR be a commutative ring with unit,n ≥ 2 and A ∈ R^n,n. Show that the following assertions hold:

(a) adj(In)=I_n.

(b) adj(A B)=adj(B)adj(A), ifAandB∈ R^n,nare invertible.

(d) adj(A^T)=adj(A)^T.

(e) det(adj(A))=(det(A))ⁿ⁻¹, ifAis invertible.

(f) adj(adj(A))=det(A)ⁿ⁻²A.

(g) adj(A⁻¹)=adj(A)⁻¹, ifAis invertible.

Can one drop the requirement of invertibility in (b) or (e)?

7.7 Let n ≥ 2 and A = [ai j] ∈ R^n,n withai j = x_i+¹y_j for some x₁, . . . ,xn, y₁, . . . ,yn ∈R. Hence, in particular,xi+yj =0 for alli,j. (Such a matrixA is called aCauchy matrix.⁷)

(a) Show that

det(A)=

1≤i<j≤n(x_j−x_i)(y_j−y_i) n

i,j=1 (x_i+y_j) .

(b) Use (a) to derive a formula for the determinant of then×n Hilbert matrix (cp. the MATLAB-Minute above Definition5.6).

7.8 LetRbe a commutative ring with unit. Ifα1, . . . ,αn∈ R,n≥2, then

Vn :=! α_i^j⁻¹

⎡

⎢⎢

⎢⎣

1α1 · · ·αⁿ₁⁻¹ 1α₂ · · ·αⁿ₂⁻¹ ... ... ... 1αn · · ·αⁿ_n⁻¹

⎤

⎥⎥

⎥⎦∈ R^n,n

is called aVandermonde matrix.⁸ (a) Show that

det(Vn)=

1≤i<j≤n

(αj−αi).

6James Hardy Wilkinson (1919–1986).

7Augustin Louis Cauchy (1789–1857).

8Alexandre-Théophile Vandermonde (1735–1796).

(b) LetK be a field and letK[t]≤n−1be the set of polynomials in the variable tof degree at mostn−1. Show that two polynomialsp,q ∈K[t]≤n−1are equal if there exist pairwise distinctβ₁, . . . ,βn ∈K withp(βj)=q(βj).

7.9 Show the following assertions:

(a) LetK be a field with 1+1=0 and let A∈ K^n,n withA^T = −A. Ifnis odd, then det(A)=0.

(b) IfA∈G L_n(R)withA^T =A⁻¹, then det(A)∈ {1,−1}. 7.10 LetK be a field and

A₁₁ A₁₂ A₂₁ A₂₂

for some A₁₁ ∈ Kⁿ¹^,n¹,A₁₂ ∈ Kⁿ¹^,n²,A₂₁ ∈ Kⁿ²^,n¹, A₂₂ ∈ Kⁿ²^,n². Show the following assertions:

(a) IfA₁₁∈G Ln₁(K), then det(A)=det(A₁₁)det

A₂₂−A₂₁A⁻₁₁¹A₁₂ . (b) IfA₂₂∈G Ln₂(K), then det(A)=det(A₂₂)det

A₁₁−A₁₂A⁻₂₂¹A₂₁ . (c) IfA₂₁=0, then det(A)=det(A₁₁)det(A₂₂).

Can you show this also when the matrices are defined over a commutative ring with unit?

7.11 Construct matricesA₁₁,A₁₂,A₂₁,A₂₂∈R^n,n forn≥2 with det

A₁₁ A₁₂

A₂₁ A₂₂ =det(A₁₁)det(A₂₂)−det(A₁₂)det(A₂₁).

7.12 Let A = [ai j] ∈ G Ln(R)withai j ∈ Z fori,j = 1, . . . ,n. Show that the following assertions hold:

(a) A⁻¹∈Q^n,n.

(b) A⁻¹∈Z^n,nif and only if det(A)∈ {−1,1}.

7.13 Show thatG= {A∈Z^n,n| det(A)∈ {−1,1} }is a subgroup ofG Ln(Q).

Chapter 8 The Characteristic Polynomial and Eigenvalues of Matrices

We have already characterized matrices using their rank and their determinant. In this chapter we use the determinant map in order to assign to every square matrix a unique polynomial that is called the characteristic polynomial of the matrix. This polynomial contains important information about the matrix. For example, one can read off the determinant and thus see whether the matrix is invertible. Even more important are the roots of the characteristic polynomial, which are called the eigenvalues of the matrix.

8.1 The Characteristic Polynomial

Dalam dokumen Jörg Liesen Volker Mehrmann (Halaman 97-106)