Basic Definitions and Properties - Jörg Liesen Volker Mehrmann

Chapter 13 Adjoints of Linear Maps

In this chapter we introduce adjoints of linear maps. In some sense these represent generalizations of the (Hermitian) transposes of a matrices. A matrix is symmetric (or Hermitian) if it is equal to its (Hermitian) transpose. In an analogous way, an endomorphism is selfadjoint if it is equal to its adjoint endomorphism. The sets of symmetric (or Hermitian) matrices and of selfadjoint endomorphisms form certain vector spaces which will play a key role in our proof of the Fundamental Theorem of Algebra in Chap.15. Special properties of selfadjoint endomorphisms will be studied in Chap.18.

Lemma 13.1 The mapsβ⁽¹⁾andβ⁽²⁾defined in (13.1) and (13.2), respectively, are linear, i.e.,β⁽¹⁾ ∈L(V,W^∗)andβ⁽²⁾∈L(W,V^∗). Ifdim(V)=dim(W)∈Nand β is non-degenerate (cp. Definition11.9), thenβ⁽¹⁾andβ⁽²⁾are bijective and thus isomorphisms.

Proof We prove the assertion only for the mapβ⁽¹⁾; the proof forβ⁽²⁾is analogous.

We first show the linearity. Letv1, v2∈Vandλ1,λ2∈ K. For everyw∈Wwe then have

β⁽¹⁾(λ1v1+λ2v2)(w)=β(λ1v1+λ2v2, w)

=λ1β(v1, w)+λ2β(v2, w)

=λ1β⁽¹⁾(v1)(w)+λ2β⁽¹⁾(v2)(w)

λ1β⁽¹⁾(v1)+λ2β⁽¹⁾(v2) (w),

and henceβ⁽¹⁾(λ1v1+λ2v2)=λ1β⁽¹⁾(v1)+λ2β⁽¹⁾(v2). Therefore,β⁽¹⁾∈L(V,W^∗).

Let now dim(V)=dim(W)∈Nand letβbe non-degenerate. We show thatβ⁽¹⁾∈ L(V,W^∗)is injective. By (5) in Lemma10.7, this holds if and only if ker(β⁽¹⁾)= {0}.

Ifv∈ker(β⁽¹⁾), thenβ⁽¹⁾(v)=βv=0∈W^∗, and thus βv(w)=β(v, w)=0 for allw∈W.

Sinceβis non-degenerate, we havev=0. Finally, dim(V)=dim(W)and dim(W)

= dim(W^∗)imply that dim(V) = dim(W^∗)so that β⁽¹⁾ is bijective (cp. Corol-

lary10.11). ⊓⊔

We next discuss the existence of the adjoint map.

Theorem 13.2 IfV andWare K -vector spaces withdim(V)=dim(W)∈Nand βis a non-degenerate bilinear form onV×W, then the following assertions hold:

(1) For every f ∈L(V,V)there exists a uniquely determined g∈L(W,W)with β(f(v), w)=β(v,g(w)) for allv∈V andw∈W.

The map g is called theright adjointof f with respect toβ.

(2) For every h ∈L(W,W)there exists a uniquely determined k∈L(V,V)with β(v,h(w))=β(k(v), w) for allv∈Vandw∈W.

The map k is called theleft adjointof h with respect toβ.

Proof We only show (1); the proof of (2) is analogous.

LetV^∗ be the dual space ofV, let f^∗ ∈ L(V^∗,V^∗)be the dual map of f, and letβ⁽²⁾ ∈L(W,V^∗)be as in (13.2). Sinceβ is non-degenerate,β⁽²⁾is bijective by Lemma13.1. Define

g:=(β⁽²⁾)⁻¹◦ f^∗◦β⁽²⁾ ∈ L(W,W).

13.1 Basic Definitions and Properties 189

Then, for allv∈Vandw∈W,

β(v,g(w))=β(v, ((β⁽²⁾)⁻¹◦ f^∗◦β⁽²⁾)(w))

=β⁽²⁾

((β⁽²⁾)⁻¹◦ f^∗◦β⁽²⁾)(w) (v)

=β⁽²⁾

(β⁽²⁾)⁻¹(f^∗(β⁽²⁾(w))) (v)

β⁽²⁾◦(β⁽²⁾)⁻¹◦β⁽²⁾(w)◦ f (v)

=β⁽²⁾(w)(f(v))

=β(f(v), w).

(Recall that the dual map satisfies f^∗(β⁽²⁾(w))=β⁽²⁾(w)◦ f.)

It remains to show the uniqueness ofg. Letg ∈ L(W,W)withβ(v,g(w)) = β(f(v), w)for allv∈Vandw∈W. Thenβ(v,g(w))=β(v,g(w)), and hence

β(v, (g−g)(w))=0 for allv∈V andw∈W.

Sinceβ is non-degenerate in the second variable, we have(g−g)(w) =0 for all

w∈W, so thatg =g. ⊓⊔

Example 13.3 LetV = W = K^n,1 andβ(v, w) = w^TBv with a matrix B ∈ G Ln(K), so thatβ is non-degenerate (cp. (1) in Example11.10). We consider the linear map f : V → V,v → Fv, with a matrix F ∈ K^n,n, and the linear map h :W→W,w→Hw, with a matrixH∈ K^n,n. Then

βv :W→ K, w→w^T(Bv), β⁽¹⁾:V→W^∗, v→(Bv)^T, β⁽²⁾:W→V^∗, w→w^TB,

where we have identified the isomorphic vector spacesW^∗ andK^1,n, respectively V^∗andK^1,n, with each other. Ifg∈L(W,W)is the right adjoint of f with respect toβ, then

β(f(v), w)=w^TB f(v)=w^TB Fv=β(v,g(w))=g(w)^TBv

for allv ∈ V andw ∈ W. If we represent the linear map gvia the multiplication with a matrix G ∈ K^n,n, i.e., g(w) = Gw, then w^TB Fv = w^TG^TBv for all v, w ∈ K^n,1. Hence B F = G^TB. Since Bis invertible, the unique right adjoint is given byG=(B F B⁻¹)^T =B^−TF^TB^T.

Analogously, for the left adjointk∈L(V,V)ofhwith respect toβwe obtain the equation

β(v,h(w))=(h(w))^TBv=w^TH^TBv=β(k(v), w)=w^TBk(v) for allv ∈ V andw ∈ W. With k(v) = Lv for a matrix L ∈ K^n,n, we obtain

H^TB =B Land henceL =B⁻¹H^TB.

IfV is finite dimensional andβis a non-degenerate bilinear form onV, then by Theorem 13.2every f ∈ L(V,V)has a unique right adjointg and a unique left adjointk, such that

β(f(v), w)=β(v,g(w)) and β(v, f(w))=β(k(v), w) (13.3) for allv, w∈V. Ifβis symmetric, i.e., ifβ(v, w)=β(w, v)holds for allv, w∈V, then (13.3) yields

β(v,g(w))=β(f(v), w)=β(w, f(v))=β(k(w), v)=β(v,k(w)).

Therefore,β(v, (g −k)(w)) = 0 for allv, w ∈ V, and henceg = k, sinceβ is non-degenerate. Thus, we have proved the following result.

Corollary 13.4 Ifβ is a symmetric and non-degenerate bilinear form on a finite dimensional K -vector space V, then for every f ∈ L(V,V)there exists a unique g∈L(V,V)with

β(f(v), w)=β(v,g(w)) and β(v, f(w))=β(g(v), w) for allv, w∈V.

By definition, a scalar product on a Euclidean vector space is a symmetric and non- degenerate bilinear form (cp. Definition12.1). This leads to the following corollary.

Corollary 13.5 IfV is a finite dimensional Euclidean vector space with the scalar product ·,·, then for every f ∈L(V,V)there exists a unique f^ad ∈L(V,V)with

f(v), w = v,fâd(w) and v,f(w) = fâd(v), w (13.4) for allv, w∈V. The map fâdis called theadjointof f (with respect to ·,·).

In order to determine whether a given mapg∈L(V,V)is the unique adjoint of f ∈ L(V,V), only one of the two conditions in (13.4) have to be verified: If for f,g ∈L(V,V)the equation

f(v), w = v,g(w) holds for allv, w∈V, then also

v, f(w) = f(w), v = w,g(v) = g(v), w

for allv, w∈V, where we have used the symmetry of the scalar product. Similarly, if v, f(w) = g(v), wholds for allv, w∈V, then also f(v), w = v,g(w)for allv, w∈V.

13.1 Basic Definitions and Properties 191 Example 13.6 Consider the Euclidean vector spaceR^3,1with the scalar product

v, w =w^TDv, where D=

⎡

⎣1 0 0 0 2 0 0 0 1

⎤

⎦,

and the linear map

f :R^3,1→R^3,1, v→Fv, where F =

⎡

⎣1 2 2 1 0 1 2 0 0

⎤

⎦.

For allv, w∈R^3,1we then have

f(v), w =w^TD Fv=w^TD F D⁻¹Dv=(D^−TF^TD^Tw)^TDv= v,f^ad(w), and thus

f^ad :R^3,1→R^3,1, v→D⁻¹F^TDv=

⎡

⎣1 2 2 1 0 0 2 2 0

⎤

⎦v,

where we have used that Dis symmetric.

We now show that uniquely determined adjoint maps also exist in the unitary case.

However, we cannot conclude this directly from Corollary13.4, since a scalar product on aC-vector space is not a symmetric bilinear form, but a Hermitian sesquilinear form. In order to show the existence of the adjoint map in the unitary case we construct it explicitly. This construction works also in the Euclidean case.

LetVbe a unitary vector space with the scalar product ·,·and let{u₁, . . . ,u_n} be an orthonormal basis ofV. For a given f ∈L(V,V)we define the map

g : V→V, v →

i=1

v, f(u_i)u_i.

Ifv, w∈V andλ,µ∈C, then g(λv+µw)=

i=1

λv+µw, f(u_i)u_i =

i=1

λ v,f(u_i)u_i+µ v,f(u_i)u_i

=λg(v)+µg(w),

and henceg∈L(V,V). Let nowv=n

i=1λiui∈Vandw∈V, then v,g(w) = ⁿ

i=1

λiui,

j=1

w, f(uj)uj

i=1

λi w, f(ui) =

i=1

λi f(ui), w

= f(v), w.

Furthermore,

v, f(w) = f(w), v = w,g(v) = g(v), w

for allv, w ∈ V. Ifg ∈ L(V,V)satisfies f(v), w = v,g(w)for allv, w ∈ V, theng=g, since the scalar product is positive definite. We can therefore formulate the following result analogously to Corollary13.5.

Corollary 13.7 If V is a finite dimensional unitary vector space with the scalar product ·,·, then for every f ∈L(V,V)there exists a unique fâd∈L(V,V)with f(v), w = v,fâd(w) and v,f(w) = fâd(v), w (13.5) for allv, w∈V. The map fâdis called theadjointof f (with respect to ·,·).

As in the Euclidean case, again the validity of one of the two equations in (13.5) for allv, w∈Vimplies the validity of the other for allv, w∈V.

Example 13.8 Consider the unitary vector spaceC^3,1with the scalar product

v, w =w^HDv, where D=

⎡

⎣1 0 0 0 2 0 0 0 1

⎤

⎦,

and the linear map

f :C^3,1→C^3,1, v→Fv, where F=

⎡

⎣1 2i 2 i 0 −i 2 0 3i

⎤

⎦.

For allv, w∈C^3,1we then have

f(v), w =w^HD Fv=w^HD F D⁻¹Dv=(D^−HF^HD^Hw)^HDv

= v,f^ad(w), and thus

f^ad :C^3,1→C^3,1, v→D⁻¹F^HDv=

⎡

⎣ 1−2i 2

−i 0 0 2 2i−3i

⎤

⎦v,

13.1 Basic Definitions and Properties 193

where we have used that Dis real and symmetric.

We next investigate the properties of the adjoint map.

Lemma 13.9 LetVbe a finite dimensional Euclidean or unitary vector space.

(1) If f1,f2∈L(V,V)andλ1,λ2∈K (where K =Rin the Euclidean and K =C in the unitary case), then

(λ1f₁+λ2f₂)âd =λ1f₁âd+λ2f₂âd.

In the Euclidean case the map f → f^adis therefore linear, and in the unity case semilinear.

(2) We have(IdV)^ad =IdV.

(3) For every f ∈L(V,V)we have(fâd)âd = f . (4) If f1,f2∈L(V,V), then(f2◦ f1)âd= f₁âd◦ f₂âd. Proof

(1) Ifv, w∈Vandλ1,λ2∈ K, then

(λ1f1+λ2f2)(v), w =λ1 f1(v), w +λ2 f2(v), w

=λ1

v,f₁^ad(w) +λ2

v, f₂^ad(w)

v,λ1f₁^ad(w)+λ2f₂^ad(w)

= v,

λ1f₁^ad+λ2f₂^ad (w)

and thus(λ1f1+λ2f2)âd=λ1f₁âd+λ2f₂âd.

(2) For all v, w ∈ V we have IdV(v), w = v, w = v,IdV(w), and thus (IdV)^ad=IdV.

(3) For allv, w∈Vwe have fâd(v), w = v, f(w), and thus(fâd)âd = f. (4) For allv, w∈Vwe have

(f2◦ f1)(v), w = f2(f1(v)), w =

f1(v),f₂^ad(w)

= v,f₁^ad

f₂^ad(w)

= v,

f₁^ad◦ f₂^ad (w)

and thus(f2◦ f1)âd = f₁âd◦ f₂âd. ⊓⊔

The following result shows relations between the image and kernel of an endomorphism and of its adjoint.

Theorem 13.10 IfVis a finite dimensional Euclidean or unitary vector space and f ∈L(V,V), then the following assertions hold:

(1) ker(f^ad)=im(f)^⊥. (2) ker(f)=im(f^ad)^⊥. Proof

(1) Ifw∈ker(f^ad), then f^ad(w)=0 and

0= v, f^ad(w) = f(v), w

for allv∈V, hencew∈im(f)^⊥. If, on the other hand,w∈im(f)^⊥, then 0= f(v), w = v,f^ad(w)

for allv ∈ V. Since ·,·is non-degenerate, we have f^ad(w)=0 and, hence, w∈ker(f^ad).

(2) Using(fâd)âd = f and (1) we get ker(f)=ker((fâd)âd)=im(fâd)^⊥. ⊓⊔ Example 13.11 Consider the unitary vector spaceC^3,1with the standard scalar product and the linear map

f :C^3,1→C^3,1, v→Fv, with F =

⎡

⎣1 i i i 0 0 1 0 0

⎤

⎦.

Then

f^ad :C^3,1 →C^3,1, v→F^Hv, with F^H =

⎡

⎣ 1−i1

−i 0 0

⎤

⎦.

The matricesFandF^Hhave rank 2. Therefore, dim(ker(f))=dim(ker(f^ad))=1.

A simple calculation shows that

ker(f)=span

⎧⎨

⎩

⎡

⎣ 0 1

−1

⎤

⎦

⎫⎬

⎭ and ker(f^ad)=span

⎧⎨

⎩

⎡

⎣0 1 i

⎤

⎦

⎫⎬

⎭.

The dimension formula for linear maps implies that dim(im(f))=dim(im(f^ad))= 2.

From the matricesFandF^Hwe can see that

im(f)=span

⎧⎨

⎩

⎡

⎣1 i 1

⎤

⎦,

⎡

⎣1 0 0

⎤

⎦

⎫⎬

⎭ and im(f^ad)=span

⎧⎨

⎩

⎡

⎣ 1

−i

⎤

⎦,

⎡

⎣1 0 0

⎤

⎦

⎫⎬

⎭. The equations ker(f^ad)=im(f)^⊥and ker(f)=im(f^ad)^⊥can be verified by direct computation.

Dalam dokumen Jörg Liesen Volker Mehrmann (Halaman 190-198)