A NEW DECOMPOSITION FOR SQUARE MATRICES∗
JULIO BENÍTEZ†
Abstract. A new decomposition is derived for any complex square matrix. This decomposition is based on the canonical angles between the column space of this matrix and the column space of its conjugate transpose. Some applications of this factorization are given; in particular, some matrix partial orderings and the relationship between the canonical angles and various classes of matrices are studied.
Key words. Decomposition of matrices, EP matrices, Canonical angles, Matrix partial ordering.
AMS subject classifications. 15A23, 15A57.
1. Introduction. Let $\mathbb{C}_{m,n}$ be the set of $m\times n$ complex matrices, and let $A^*$, $\mathcal{R}(A)$, $\mathcal{N}(A)$, and $\operatorname{rank}(A)$ denote the conjugate transpose, column space, null space, and rank, respectively, of $A\in\mathbb{C}_{m,n}$. For a nonsingular $A\in\mathbb{C}_{n,n}$, we shall denote $A^{-*}=(A^{-1})^*=(A^*)^{-1}$. Furthermore, let $A^\dagger$ stand for the Moore-Penrose inverse of $A$, i.e., for the unique matrix satisfying the equations
\[
AA^\dagger A=A,\qquad A^\dagger AA^\dagger=A^\dagger,\qquad AA^\dagger=(AA^\dagger)^*,\qquad A^\dagger A=(A^\dagger A)^*.
\]
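The four Penrose equations above are easy to check numerically. The following sketch (the random matrix is an arbitrary example, and NumPy's `pinv` is assumed to compute $A^\dagger$) verifies all four:

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 3)) + 1j * rng.standard_normal((4, 3))
Ad = np.linalg.pinv(A)  # Moore-Penrose inverse of A

# the four defining equations of the Moore-Penrose inverse
assert np.allclose(A @ Ad @ A, A)
assert np.allclose(Ad @ A @ Ad, Ad)
assert np.allclose(A @ Ad, (A @ Ad).conj().T)  # AA^dagger is Hermitian
assert np.allclose(Ad @ A, (Ad @ A).conj().T)  # A^dagger A is Hermitian
```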
Given a matrix $A\in\mathbb{C}_{n,n}$, it can be proved that the set $\{X\in\mathbb{C}_{n,n} : AXA=A,\ XAX=X,\ AX=XA\}$ is either empty or a singleton. When it is a singleton, it is customary to denote its unique element by $A^\#$, called the group inverse of $A$. Furthermore, $I_n$ means the identity matrix of order $n$. We shall denote by $0_{n,m}$ the zero matrix in $\mathbb{C}_{n,m}$, and when there is no danger of confusion, we will simply write $0$. In addition, $\mathbf{1}_n$ and $\mathbf{0}_n$ will denote the $n\times 1$ column vectors all of whose components are $1$ and $0$, respectively.
Hartwig and Spindelböck arrived at the following result, given in [14] as Corollary 6.
Theorem 1.1. Let $A\in\mathbb{C}_{n,n}$ and $r=\operatorname{rank}(A)$. There exist a unitary $U\in\mathbb{C}_{n,n}$, $\Sigma=\sigma_1 I_{r_1}\oplus\cdots\oplus\sigma_t I_{r_t}$ with $r_1+\cdots+r_t=r$ and $\sigma_1>\cdots>\sigma_t>0$, $L\in\mathbb{C}_{r,n-r}$, and $K\in\mathbb{C}_{r,r}$ such that
\[
A=U\begin{bmatrix}\Sigma K & \Sigma L\\ 0 & 0\end{bmatrix}U^* \tag{1.1}
\]
and
\[
KK^*+LL^*=I_r. \tag{1.2}
\]
∗Received by the editors November 17, 2009. Accepted for publication April 2, 2010. Handling Editor: Roger A. Horn.
†Departamento de Matemática Aplicada, Instituto de Matemática Multidisciplinar, Universidad Politécnica de Valencia, Camino de Vera s/n, 46022, Valencia, Spain. ([email protected]).
Straightforward computations show that if $A$ is written as in (1.1), then
\[
A^\dagger=U\begin{bmatrix}K^*\Sigma^{-1} & 0\\ L^*\Sigma^{-1} & 0\end{bmatrix}U^*. \tag{1.3}
\]
The usefulness of the representation provided in Theorem 1.1 for exploring various classes of matrices was demonstrated in [2, 3, 24].
In the sequel, $\|K\|$ with $K\in\mathbb{C}_{m,n}$ will be the matrix norm induced by the Euclidean vector norm (known as the spectral norm); see [18, pp. 270, 281]. If $U$, $V$ are unitary, then $\|UAV^*\|=\|A\|$ for any matrix $A$ such that the product $UAV^*$ is meaningful ([18, p. 283]). We will also need that $\|A\|^2=\|A^*\|^2=\|A^*A\|=\|AA^*\|$ holds for any complex matrix.
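These norm facts can be illustrated with a short numerical sketch (the matrices below are arbitrary real examples; the spectral norm is obtained from NumPy as the largest singular value):

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((3, 4))
norm = lambda X: np.linalg.norm(X, 2)  # spectral norm = largest singular value

# unitary invariance: ||U A V*|| = ||A|| for unitary (here orthogonal) U and V
U, _ = np.linalg.qr(rng.standard_normal((3, 3)))
V, _ = np.linalg.qr(rng.standard_normal((4, 4)))
assert np.isclose(norm(U @ A @ V.T), norm(A))

# ||A||^2 = ||A*||^2 = ||A* A|| = ||A A*||
assert np.isclose(norm(A) ** 2, norm(A.T) ** 2)
assert np.isclose(norm(A) ** 2, norm(A.T @ A))
assert np.isclose(norm(A) ** 2, norm(A @ A.T))
```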
The canonical angles (also called principal angles) between two subspaces provide the best available characterization of the relative position of two given subspaces. This concept allows us to characterize or measure, in a natural way, how two subspaces differ, which is the main connection with perturbation theory. In [9, 20, 22] we can find how these angles were discovered and rediscovered several times. Computation of canonical angles between subspaces is important in many applications, including statistics [8, 15], information retrieval [16], and analysis of algorithms [23]. There are many equivalent definitions of the canonical angles (see [11]), but for our purposes the most convenient is the following:
Let $\mathcal{X}$ and $\mathcal{Y}$ be two nontrivial subspaces of $\mathbb{C}^n$ and $r=\min\{\dim\mathcal{X},\dim\mathcal{Y}\}$. We define the canonical angles $\theta_1,\dots,\theta_r\in[0,\pi/2]$ between $\mathcal{X}$ and $\mathcal{Y}$ by
\[
\cos\theta_i=\sigma_i(P_{\mathcal{X}}P_{\mathcal{Y}}),\qquad i=1,\dots,r, \tag{1.4}
\]
where the nonnegative real numbers $\sigma_1(P_{\mathcal{X}}P_{\mathcal{Y}}),\dots,\sigma_r(P_{\mathcal{X}}P_{\mathcal{Y}})$ are the singular values of $P_{\mathcal{X}}P_{\mathcal{Y}}$. Here, $P_{\mathcal{S}}$ stands for the orthogonal projector onto the subspace $\mathcal{S}\subset\mathbb{C}^n$. We will have in mind the possibility that a canonical angle is repeated.
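Definition (1.4) translates directly into a computation. The sketch below (arbitrary example subspaces, NumPy assumed) builds the orthogonal projectors via QR factorizations and reads the cosines of the canonical angles off the singular values of $P_{\mathcal{X}}P_{\mathcal{Y}}$:

```python
import numpy as np

def orth_projector(B):
    """Orthogonal projector onto the column space of B (full column rank)."""
    Q, _ = np.linalg.qr(B)
    return Q @ Q.conj().T

rng = np.random.default_rng(2)
X = rng.standard_normal((5, 2))   # spans a 2-dimensional subspace of R^5
Z = rng.standard_normal((5, 3))   # spans a 3-dimensional subspace of R^5
r = min(2, 3)

# (1.4): cos(theta_i) is the i-th singular value of P_X P_Z, i = 1, ..., r
cosines = np.linalg.svd(orth_projector(X) @ orth_projector(Z), compute_uv=False)[:r]
angles = np.arccos(np.clip(cosines, 0.0, 1.0))  # canonical angles, in [0, pi/2]
```

As a sanity check, the canonical angles between a subspace and itself are all zero.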
2. Main result. The following theorem is the main result of the paper.
Theorem 2.1. Let $A\in\mathbb{C}_{n,n}$, let $r=\operatorname{rank}(A)$, and let $\theta_1,\dots,\theta_p$ be the canonical angles between $\mathcal{R}(A)$ and $\mathcal{R}(A^*)$ belonging to $]0,\pi/2[$. Denote by $x$ and $y$ the multiplicities of the angles $0$ and $\pi/2$ as canonical angles between $\mathcal{R}(A)$ and $\mathcal{R}(A^*)$, respectively. There exists a unitary matrix $Y\in\mathbb{C}_{n,n}$ such that
\[
A=Y\begin{bmatrix}MC & MS\\ 0 & 0\end{bmatrix}Y^*, \tag{2.1}
\]
where $M\in\mathbb{C}_{r,r}$ is nonsingular,
\[
C=\operatorname{diag}(\mathbf{0}_y,\cos\theta_1,\dots,\cos\theta_p,\mathbf{1}_x), \tag{2.2}
\]
\[
S=\begin{bmatrix}\operatorname{diag}(\mathbf{1}_y,\sin\theta_1,\dots,\sin\theta_p) & 0_{y+p,\,n-(r+y+p)}\\ 0_{x,\,y+p} & 0_{x,\,n-(r+y+p)}\end{bmatrix},
\]
and $r=y+p+x$. Furthermore, $x$ and $y+n-r$ are the multiplicities of the singular values $1$ and $0$ in $P_{\mathcal{R}(A)}P_{\mathcal{R}(A^*)}$, respectively.
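A decomposition of the form (2.1) can be explored numerically. The sketch below (an arbitrary choice of sizes and of one interior angle; NumPy assumed, with a real orthogonal $Y$ playing the role of the unitary matrix) builds $A$ from prescribed $M$, $C$, $S$ and checks the last claim of the theorem: the singular values of $P_{\mathcal{R}(A)}P_{\mathcal{R}(A^*)}$ are $1$ with multiplicity $x$, the $\cos\theta_i$, and $0$ with multiplicity $y+n-r$:

```python
import numpy as np

rng = np.random.default_rng(3)
n, r = 6, 3
y, p, x = 1, 1, 1              # multiplicities of pi/2, interior angles, 0
theta = np.pi / 3              # the single interior canonical angle

C = np.diag([0.0, np.cos(theta), 1.0])             # eq. (2.2)
S = np.zeros((r, n - r))                           # S = [diag(1_y, sin) 0; 0 0]
S[0, 0], S[1, 1] = 1.0, np.sin(theta)

M = rng.standard_normal((r, r))                    # nonsingular (almost surely)
Y, _ = np.linalg.qr(rng.standard_normal((n, n)))   # real orthogonal, hence unitary
A = Y @ np.vstack([np.hstack([M @ C, M @ S]), np.zeros((n - r, n))]) @ Y.T

Ad = np.linalg.pinv(A)
P = (A @ Ad) @ (Ad @ A)        # P_{R(A)} P_{R(A*)}
sv = np.linalg.svd(P, compute_uv=False)
# expected singular values: 1 (x times), cos(theta) (p times), 0 (y + n - r times)
```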
Proof. Let us represent $A$ as in (1.1). From this representation and (1.3) we have
\[
A^\dagger A=U\begin{bmatrix}K^*K & K^*L\\ L^*K & L^*L\end{bmatrix}U^*.
\]
Now, let us observe that $\|K\|^2=\|K^*K\|\le\|A^\dagger A\|=1$, since the norm of a submatrix cannot be greater than the norm of the matrix [1, Lemma 2] and the spectral norm of any orthogonal projector is $1$. Let $K=VCW^*$ be the singular value decomposition of $K$, where $V,W\in\mathbb{C}_{r,r}$ are unitary and $C\in\mathbb{C}_{r,r}$ is diagonal with nonnegative real numbers on its diagonal. Since $\|C\|=\|K\|\le 1$, we can write
\[
C=\operatorname{diag}(\mathbf{0}_y,\cos\theta_1,\dots,\cos\theta_p,\mathbf{1}_x), \tag{2.3}
\]
where $y,p,x\in\mathbb{N}\cup\{0\}$ satisfy $y+p+x=r$, and $\theta_1,\dots,\theta_p\in\,]0,\pi/2[$.
Let us define
\[
\widehat{C}=\operatorname{diag}(\mathbf{0}_y,\cos\theta_1,\dots,\cos\theta_p)\quad\text{and}\quad \widehat{S}=\operatorname{diag}(\mathbf{1}_y,\sin\theta_1,\dots,\sin\theta_p). \tag{2.4}
\]
Obviously, $\widehat{S}$ is nonsingular and
\[
\widehat{C}^2+\widehat{S}^2=I_{y+p}. \tag{2.5}
\]
Now, let us partition $V$ and $W$ as follows:
\[
V=\begin{bmatrix}V_1 & V_2\end{bmatrix},\qquad W=\begin{bmatrix}W_1 & W_2\end{bmatrix},\qquad V_1,W_1\in\mathbb{C}_{r,y+p},\quad V_2,W_2\in\mathbb{C}_{r,x}. \tag{2.6}
\]
Since $V$ is unitary,
\[
I_{y+p}\oplus I_x=I_r=V^*V=\begin{bmatrix}V_1^*\\ V_2^*\end{bmatrix}\begin{bmatrix}V_1 & V_2\end{bmatrix}=\begin{bmatrix}V_1^*V_1 & V_1^*V_2\\ V_2^*V_1 & V_2^*V_2\end{bmatrix},
\]
hence
\[
I_{y+p}=V_1^*V_1,\qquad 0=V_1^*V_2,\qquad 0=V_2^*V_1,\qquad I_x=V_2^*V_2. \tag{2.7}
\]
Using again that $V$ is unitary we get
\[
I_r=VV^*=\begin{bmatrix}V_1 & V_2\end{bmatrix}\begin{bmatrix}V_1^*\\ V_2^*\end{bmatrix}=V_1V_1^*+V_2V_2^*. \tag{2.8}
\]
Similarly, we get
\[
I_{y+p}=W_1^*W_1,\qquad 0=W_1^*W_2,\qquad 0=W_2^*W_1,\qquad I_x=W_2^*W_2 \tag{2.9}
\]
and $I_r=W_1W_1^*+W_2W_2^*$.
From the singular value decomposition of $K$, (2.4), and (2.6) we get
\[
K=VCW^*=\begin{bmatrix}V_1 & V_2\end{bmatrix}\begin{bmatrix}\widehat{C} & 0\\ 0 & I_x\end{bmatrix}\begin{bmatrix}W_1^*\\ W_2^*\end{bmatrix}=V_1\widehat{C}W_1^*+V_2W_2^*. \tag{2.10}
\]
Now, let us prove
\[
LL^*=V_1\widehat{S}^2V_1^*. \tag{2.11}
\]
In fact, the combination of (1.2), (2.5), (2.8), (2.9), and (2.10) leads to
\begin{align*}
LL^* &= I_r-KK^*\\
&= I_r-(V_1\widehat{C}W_1^*+V_2W_2^*)(V_1\widehat{C}W_1^*+V_2W_2^*)^*\\
&= I_r-(V_1\widehat{C}W_1^*+V_2W_2^*)(W_1\widehat{C}V_1^*+W_2V_2^*)\\
&= I_r-V_1\widehat{C}^2V_1^*-V_2V_2^*\\
&= V_1V_1^*-V_1\widehat{C}^2V_1^*\\
&= V_1(I_{y+p}-\widehat{C}^2)V_1^*\\
&= V_1\widehat{S}^2V_1^*.
\end{align*}
Now, we have from (2.7) and (2.11)
\[
V_1^*LL^*V_1=V_1^*V_1\widehat{S}^2V_1^*V_1=\widehat{S}^2. \tag{2.12}
\]
Let us define
\[
X_1=L^*V_1\widehat{S}^{-1}\in\mathbb{C}_{n-r,\,y+p}. \tag{2.13}
\]
Apply (2.12) and the definition of $X_1$ to obtain $X_1^*X_1=\widehat{S}^{-1}V_1^*LL^*V_1\widehat{S}^{-1}=I_{y+p}$; therefore, the columns of $X_1$ are orthonormal. As $X_1$ has $y+p$ orthonormal columns belonging to $\mathbb{C}^{n-r}$, we have $y+p\le n-r$. Let us define $t=(n-r)-(y+p)\in\mathbb{N}\cup\{0\}$. We can find $X_2\in\mathbb{C}_{n-r,t}$ such that $X=[X_1\ X_2]$ is unitary. Finally, let us define
\[
S=\begin{bmatrix}\widehat{S} & 0_{y+p,t}\\ 0_{x,y+p} & 0_{x,t}\end{bmatrix}\in\mathbb{C}_{r,n-r}. \tag{2.14}
\]
From $\widehat{C}^2+\widehat{S}^2=I_{y+p}$ and $\widehat{C}\widehat{S}=\widehat{S}\widehat{C}$ we get
\[
C^2+SS^*=I_r,\qquad (\widehat{C}\oplus I_t)^2+S^*S=I_{n-r},\qquad CS=S(\widehat{C}\oplus I_t). \tag{2.15}
\]
Since $X$ is unitary we get
\[
X_1^*X_2=0. \tag{2.16}
\]
From (2.7) and (2.11) we get $(L^*V_2)^*(L^*V_2)=V_2^*LL^*V_2=V_2^*V_1\widehat{S}^2V_1^*V_2=0$, hence
\[
L^*V_2=0. \tag{2.17}
\]
Taking into account (2.13) and (2.17),
\[
L^*V=L^*\begin{bmatrix}V_1 & V_2\end{bmatrix}=\begin{bmatrix}L^*V_1 & L^*V_2\end{bmatrix}=\begin{bmatrix}X_1\widehat{S} & 0\end{bmatrix}.
\]
From this, and having in mind that $V$ is unitary and $\widehat{S}$ is Hermitian, we get
\[
L=V\begin{bmatrix}\widehat{S}X_1^*\\ 0\end{bmatrix}=\begin{bmatrix}V_1 & V_2\end{bmatrix}\begin{bmatrix}\widehat{S}X_1^*\\ 0\end{bmatrix}=V_1\widehat{S}X_1^*. \tag{2.18}
\]
By using (2.16) and postmultiplying (2.18) by $X_2$ we get
\[
LX_2=0. \tag{2.19}
\]
From (2.7), (2.11), (2.13), and (2.19) we obtain
\[
LX=L\begin{bmatrix}X_1 & X_2\end{bmatrix}=\begin{bmatrix}LX_1 & LX_2\end{bmatrix}=\begin{bmatrix}LL^*V_1\widehat{S}^{-1} & 0\end{bmatrix}=\begin{bmatrix}V_1\widehat{S} & 0\end{bmatrix}. \tag{2.20}
\]
On the other hand, we have
\[
VS=\begin{bmatrix}V_1 & V_2\end{bmatrix}\begin{bmatrix}\widehat{S} & 0\\ 0 & 0\end{bmatrix}=\begin{bmatrix}V_1\widehat{S} & 0\end{bmatrix}. \tag{2.21}
\]
Relations (2.20) and (2.21) prove $L=VSX^*$. Moreover, observe
\begin{align*}
A &= U\begin{bmatrix}\Sigma K & \Sigma L\\ 0 & 0\end{bmatrix}U^*
= U\begin{bmatrix}\Sigma VCW^* & \Sigma VSX^*\\ 0 & 0\end{bmatrix}U^*
= U\begin{bmatrix}\Sigma VC & \Sigma VS\\ 0 & 0\end{bmatrix}\begin{bmatrix}W^* & 0\\ 0 & X^*\end{bmatrix}U^*\\
&= U\begin{bmatrix}W & 0\\ 0 & X\end{bmatrix}\begin{bmatrix}W^*\Sigma VC & W^*\Sigma VS\\ 0 & 0\end{bmatrix}\begin{bmatrix}W^* & 0\\ 0 & X^*\end{bmatrix}U^*.
\end{align*}
Thus, if we denote $Y=U(W\oplus X)$ and $M=W^*\Sigma V$, we have
\[
A=Y\begin{bmatrix}MC & MS\\ 0 & 0\end{bmatrix}Y^*.
\]
Notice that $Y$ is unitary and $M$ is nonsingular.
It remains to prove that $\theta_1,\dots,\theta_p$ are the canonical angles between $\mathcal{R}(A)$ and $\mathcal{R}(A^*)$ belonging to $]0,\pi/2[$, and that $x$ and $y+n-r$ are the multiplicities of the singular values $1$ and $0$ in $P_{\mathcal{R}(A)}P_{\mathcal{R}(A^*)}$, respectively. To this end, we will use (1.4). It is straightforward, by checking the four conditions of the Moore-Penrose inverse, that if $A$ is written as in (2.1), then
\[
A^\dagger=Y\begin{bmatrix}CM^{-1} & 0\\ S^*M^{-1} & 0\end{bmatrix}Y^*. \tag{2.22}
\]
Now, from (2.15), it is simple to verify
\begin{align*}
P_{\mathcal{R}(A)}P_{\mathcal{R}(A^*)} &= AA^\dagger A^\dagger A\\
&= Y\begin{bmatrix}MC & MS\\ 0 & 0\end{bmatrix}\begin{bmatrix}CM^{-1} & 0\\ S^*M^{-1} & 0\end{bmatrix}\begin{bmatrix}CM^{-1} & 0\\ S^*M^{-1} & 0\end{bmatrix}\begin{bmatrix}MC & MS\\ 0 & 0\end{bmatrix}Y^*\\
&= Y\begin{bmatrix}I_r & 0\\ 0 & 0\end{bmatrix}\begin{bmatrix}C^2 & CS\\ S^*C & S^*S\end{bmatrix}Y^*\\
&= Y\begin{bmatrix}C^2 & CS\\ 0 & 0\end{bmatrix}Y^*. \tag{2.23}
\end{align*}
Next, we are going to find the singular value decomposition of $P_{\mathcal{R}(A)}P_{\mathcal{R}(A^*)}$. Let us remark that from (2.15) we get that the matrix
\[
T=\begin{bmatrix}C & S\\ -S^* & \widehat{C}\oplus I_t\end{bmatrix}
\]
is unitary. Hence, the singular value decomposition of $P_{\mathcal{R}(A)}P_{\mathcal{R}(A^*)}$ is
\[
P_{\mathcal{R}(A)}P_{\mathcal{R}(A^*)}=Y(C\oplus 0_{n-r,n-r})(TY^*)
=Y\operatorname{diag}(\underbrace{0,\dots,0}_{y},\underbrace{\cos\theta_1,\dots,\cos\theta_p}_{p},\underbrace{1,\dots,1}_{x},\underbrace{0,\dots,0}_{n-r})(TY^*),
\]
since $Y$ and $TY^*$ are unitary and $C\oplus 0_{n-r,n-r}$ is a diagonal matrix with real and nonnegative numbers on its diagonal. Therefore, these numbers are the singular values of $P_{\mathcal{R}(A)}P_{\mathcal{R}(A^*)}$.
3. Applications. In this section some applications of the decomposition given in Theorem 2.1 are discussed. Let us remark that from (2.15), the matrices $C$ and $S$ "almost" commute and behave as the ordinary trigonometric functions $x\mapsto\cos x$ and $x\mapsto\sin x$. Evidently, $C$ is Hermitian, because $C$ is diagonal with real numbers on its diagonal. The following lemma will be used several times in the sequel.
Lemma 3.1. Assume that the matrices $C$ and $S$ are defined as in Theorem 2.1 and let $q$ be an arbitrary positive integer. For $R_1,R_2\in\mathbb{C}_{r,q}$ we have
\[
CR_1=CR_2 \ \text{and}\ S^*R_1=S^*R_2 \iff R_1=R_2,
\]
and for $T_1,T_2\in\mathbb{C}_{q,r}$,
\[
T_1C=T_2C \ \text{and}\ T_1S=T_2S \iff T_1=T_2.
\]
Proof. Let us prove the first equivalence (the other one has a similar proof). The $\Leftarrow$ part is trivial. To prove the $\Rightarrow$ part, it is enough to premultiply $CR_1=CR_2$ by $C$ and premultiply $S^*R_1=S^*R_2$ by $S$, add the two resulting equalities, and use the first relation of (2.15).
3.1. The dimensions of $\mathcal{R}(A)\cap\mathcal{R}(A^*)$ and $\mathcal{R}(A)\cap\mathcal{R}(A^*)^\perp$. We apply Theorem 2.1 to find the dimensions of $\mathcal{R}(A)\cap\mathcal{R}(A^*)$ and $\mathcal{R}(A)\cap\mathcal{R}(A^*)^\perp$ in terms of the canonical angles between $\mathcal{R}(A)$ and $\mathcal{R}(A^*)$.
Theorem 3.2. For any square complex matrix $A$, one has
(i) the dimension of $\mathcal{R}(A)\cap\mathcal{R}(A^*)$ is the multiplicity of the angle $0$ as a canonical angle between $\mathcal{R}(A)$ and $\mathcal{R}(A^*)$;
(ii) the dimension of $\mathcal{R}(A)\cap\mathcal{R}(A^*)^\perp$ is the multiplicity of the angle $\pi/2$ as a canonical angle between $\mathcal{R}(A)$ and $\mathcal{R}(A^*)$.
Proof. (i) First of all, let us compute the dimension of $\mathcal{N}(A)\cap\mathcal{N}(A^*)$. Let $x\in\mathcal{N}(A)\cap\mathcal{N}(A^*)$ be represented as
\[
x^*=[u^*\ v^*]\,Y^*,\qquad u\in\mathbb{C}_{r,1},\quad v\in\mathbb{C}_{n-r,1}. \tag{3.1}
\]
Since $Ax=\mathbf{0}$ and $A^*x=\mathbf{0}$ we get, respectively,
\[
\begin{bmatrix}MC & MS\\ 0 & 0\end{bmatrix}\begin{bmatrix}u\\ v\end{bmatrix}=\mathbf{0},\qquad
\begin{bmatrix}CM^* & 0\\ S^*M^* & 0\end{bmatrix}\begin{bmatrix}u\\ v\end{bmatrix}=\mathbf{0},
\]
which, in view of the invertibility of $M$, reduces to
\[
Cu+Sv=\mathbf{0},\qquad CM^*u=\mathbf{0},\qquad S^*M^*u=\mathbf{0}.
\]
From Lemma 3.1, we obtain $M^*u=\mathbf{0}$, which implies $u=\mathbf{0}$. Hence $Sv=\mathbf{0}$. Thus, for $x$ represented as in (3.1), one has $x\in\mathcal{N}(A)\cap\mathcal{N}(A^*)$ if and only if $u=\mathbf{0}$ and $Sv=\mathbf{0}$. Having in mind the representation (2.14) and the nonsingularity of $\widehat{S}$ (obtained from (2.4)), we have
\[
\dim\,[\mathcal{N}(A)\cap\mathcal{N}(A^*)]=\dim\mathcal{N}(S)=(n-r)-\operatorname{rank}(S)=n-r-\operatorname{rank}(\widehat{S})=n-r-(y+p).
\]
Since
\[
[\mathcal{R}(A)\cap\mathcal{R}(A^*)]^\perp=\mathcal{R}(A)^\perp+\mathcal{R}(A^*)^\perp=\mathcal{N}(A^*)+\mathcal{N}(A)
\]
and
\[
\dim\,[\mathcal{N}(A)+\mathcal{N}(A^*)]=\dim\mathcal{N}(A)+\dim\mathcal{N}(A^*)-\dim\,[\mathcal{N}(A)\cap\mathcal{N}(A^*)],
\]
we obtain, recalling $\dim\mathcal{N}(A)=\dim\mathcal{N}(A^*)=n-r$, that
\begin{align*}
\dim\,[\mathcal{R}(A)\cap\mathcal{R}(A^*)] &= n-\dim\,[\mathcal{R}(A)\cap\mathcal{R}(A^*)]^\perp\\
&= n-\dim\,(\mathcal{N}(A)+\mathcal{N}(A^*))\\
&= n-[2(n-r)-(n-r-y-p)]\\
&= r-y-p=x.
\end{align*}
(ii) Let us recall $\mathcal{R}(A^*)^\perp=\mathcal{N}(A)$. We will prove
\[
\mathcal{R}(A)\cap\mathcal{N}(A)=\left\{Y\begin{bmatrix}u\\ \mathbf{0}\end{bmatrix} : u\in\mathbb{C}_{r,1},\ Cu=\mathbf{0}\right\}. \tag{3.2}
\]
Let $x\in\mathcal{R}(A)\cap\mathcal{N}(A)$ be written as in (3.1). Since $x\in\mathcal{R}(A)$, we have $AA^\dagger x=x$. Therefore, using the representations (2.1) and (2.22), we easily get $v=\mathbf{0}$. Since $x\in\mathcal{N}(A)$, using again representation (2.1) and the nonsingularity of $M$, we get $Cu=\mathbf{0}$. The reverse inclusion of (3.2) is trivial. Thus, from (2.3) we get
\[
\dim\,[\mathcal{R}(A)\cap\mathcal{N}(A)]=\dim\mathcal{N}(C)=r-\operatorname{rank}(C)=y.
\]
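Theorem 3.2 can be tested numerically using standard rank identities: $\dim[\mathcal{R}(A)\cap\mathcal{R}(A^*)]=\operatorname{rank}(A)+\operatorname{rank}(A^*)-\operatorname{rank}[A\ \ A^*]$ and $\dim[\mathcal{R}(A)\cap\mathcal{N}(A)]=\operatorname{rank}(A)-\operatorname{rank}(A^2)$. In the sketch below (arbitrary sizes and angle, NumPy assumed), $A$ is built as in (2.1) with $x=y=1$, so both dimensions should equal $1$:

```python
import numpy as np

rng = np.random.default_rng(3)
n, r = 6, 3
theta = np.pi / 3
C = np.diag([0.0, np.cos(theta), 1.0])             # y = 1, p = 1, x = 1
S = np.zeros((r, n - r))
S[0, 0], S[1, 1] = 1.0, np.sin(theta)
M = rng.standard_normal((r, r))
Y, _ = np.linalg.qr(rng.standard_normal((n, n)))
A = Y @ np.vstack([np.hstack([M @ C, M @ S]), np.zeros((n - r, n))]) @ Y.T

rank = np.linalg.matrix_rank
dim_meet = 2 * rank(A) - rank(np.hstack([A, A.T]))  # dim R(A) ∩ R(A*)      -> x
dim_perp = rank(A) - rank(A @ A)                    # dim R(A) ∩ R(A*)^perp -> y
```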
Using again Theorem 2.1, Theorem 3.2 can be restated as follows.
Theorem 3.3. For any square complex matrix $A\in\mathbb{C}_{n,n}$, one has
(i) the dimension of $\mathcal{R}(A)\cap\mathcal{R}(A^*)$ is the multiplicity of the singular value $1$ in the matrix $P_{\mathcal{R}(A)}P_{\mathcal{R}(A^*)}$;
(ii) if $k$ is the multiplicity of the singular value $0$ in the matrix $P_{\mathcal{R}(A)}P_{\mathcal{R}(A^*)}$, then the dimension of $\mathcal{R}(A)\cap\mathcal{R}(A^*)^\perp$ is $\operatorname{rank}(A)+k-n$.
3.2. How far is a matrix from being EP? The following consequence is a clean measure of the departure of a square matrix from being EP (recall that a square matrix $A$ is said to be EP when $AA^\dagger=A^\dagger A$). More precisely, we have the following result.
Theorem 3.4. Let $A\in\mathbb{C}_{n,n}$. Then
\[
\|AA^\dagger-A^\dagger A\|=\max\{\sin\theta : \theta \text{ is a canonical angle between } \mathcal{R}(A) \text{ and } \mathcal{R}(A^*)\}.
\]
Proof. Let us represent $A$ as in (2.1). From (2.15) and by following the computations made in (2.23), we have
\[
AA^\dagger-A^\dagger A=Y\left(\begin{bmatrix}I_r & 0\\ 0 & 0\end{bmatrix}-\begin{bmatrix}C^2 & CS\\ S^*C & S^*S\end{bmatrix}\right)Y^*
=Y\begin{bmatrix}SS^* & -CS\\ -S^*C & -S^*S\end{bmatrix}Y^*.
\]
If we denote $T=Y^*(AA^\dagger-A^\dagger A)Y$, then $\|AA^\dagger-A^\dagger A\|^2=\|T\|^2=\|TT^*\|$. Thus, to calculate $\|AA^\dagger-A^\dagger A\|$, we must first calculate $TT^*$. It is straightforward to see that
\[
TT^*=\begin{bmatrix}SS^*SS^*+CSS^*C & -SS^*CS+CSS^*S\\ -S^*CSS^*+S^*SS^*C & S^*C^2S+S^*SS^*S\end{bmatrix}.
\]
We shall use (2.15) in order to simplify each block of $TT^*$. The upper-left block simplifies to
\[
SS^*SS^*+CSS^*C=SS^*(I_r-C^2)+C(I_r-C^2)C=(SS^*+C^2)(I_r-C^2)=I_r-C^2=SS^*.
\]
The upper-right block reduces to
\[
-SS^*CS+CSS^*S=-(I_r-C^2)CS+C(I_r-C^2)S=0.
\]
Since $TT^*$ is Hermitian and its upper-right block is zero, the lower-left block of $TT^*$ is also zero. Now, the lower-right block of $TT^*$ simplifies to
\[
S^*C^2S+S^*SS^*S=S^*C^2S+S^*(I_r-C^2)S=S^*S.
\]
Hence, $TT^*=SS^*\oplus S^*S$, and by using that $\|A_1\oplus A_2\|=\max\{\|A_1\|,\|A_2\|\}$ holds for any pair of matrices $A_1$ and $A_2$ (see, for example, relation 5.2.12 of [18]), we get $\|TT^*\|=\max\{\|SS^*\|,\|S^*S\|\}=\|S\|^2$. By observing the form of $S$ in Theorem 2.1, we obtain
\[
\|S\|=\max\{\sin\theta : \theta \text{ is a canonical angle between } \mathcal{R}(A) \text{ and } \mathcal{R}(A^*)\}.
\]
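Theorem 3.4 lends itself to a numerical check. In the sketch below (NumPy assumed; the sizes and the two interior angles are arbitrary, and no angle equals $\pi/2$, so the maximum is attained at $\theta=\pi/3$), the spectral norm of $AA^\dagger-A^\dagger A$ should equal $\sin(\pi/3)$:

```python
import numpy as np

rng = np.random.default_rng(4)
n, r = 6, 3
thetas = [np.pi / 6, np.pi / 3]                    # canonical angles in ]0, pi/2[
C = np.diag([np.cos(t) for t in thetas] + [1.0])   # y = 0, p = 2, x = 1
S = np.zeros((r, n - r))
S[0, 0], S[1, 1] = np.sin(thetas[0]), np.sin(thetas[1])
M = rng.standard_normal((r, r))
Y, _ = np.linalg.qr(rng.standard_normal((n, n)))
A = Y @ np.vstack([np.hstack([M @ C, M @ S]), np.zeros((n - r, n))]) @ Y.T

Ad = np.linalg.pinv(A)
departure = np.linalg.norm(A @ Ad - Ad @ A, 2)     # spectral norm
# Theorem 3.4: departure equals max sin(theta), here sin(pi/3)
```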
The following application of Theorem 2.1 is another measure of the departure of a square matrix from being EP. To state this result, let $\mathbb{C}_n^{\mathrm{EP}}$ denote the subset of $\mathbb{C}_{n,n}$ composed of EP matrices. Furthermore, if $X\in\mathbb{C}_{n,n}$ and $\mathcal{S}\subset\mathbb{C}_{n,n}$, the expression $\operatorname{dist}(X,\mathcal{S})$ will denote the distance between $X$ and $\mathcal{S}$ (i.e., the infimum of $\|X-Y\|$ when $Y\in\mathcal{S}$). The following simple lemma will be useful.
Lemma 3.5. Let $A\in\mathbb{C}_{n,n}$ be represented as in Theorem 2.1. Then $\|A\|=\|M\|$.
Proof. Having in mind the first relation of (2.15), we have $\|A\|^2=\|AA^*\|=\|MM^*\oplus 0\|=\|MM^*\|=\|M\|^2$.
Theorem 3.6. Let $A\in\mathbb{C}_{n,n}$. Then
\[
\operatorname{dist}(A,\mathbb{C}_n^{\mathrm{EP}})\le 2\|A\|\sup\{\sin(\theta/2) : \theta \text{ is a canonical angle between } \mathcal{R}(A) \text{ and } \mathcal{R}(A^*)\}.
\]
Proof. Let us write $A$ as in (2.1) and let $B=Y(M\oplus 0)Y^*$. The matrix $B$ is obviously EP because $M$ is nonsingular. From the first relation of (2.15), a simple computation shows $(A-B)(A-B)^*=Y[(2M(I_r-C)M^*)\oplus 0]Y^*$. Therefore,
\[
\operatorname{dist}(A,\mathbb{C}_n^{\mathrm{EP}})^2\le\|A-B\|^2=\|2M(I_r-C)M^*\|\le 2\|M\|\,\|I_r-C\|\,\|M^*\|.
\]
From Lemma 3.5, one has $\operatorname{dist}(A,\mathbb{C}_n^{\mathrm{EP}})^2\le 2\|A\|^2\|I_r-C\|$. From (2.3) we get
\[
\|I_r-C\|=\sup\{1-\cos\theta : \theta \text{ is a canonical angle between } \mathcal{R}(A) \text{ and } \mathcal{R}(A^*)\},
\]
and since $1-\cos\theta=2\sin^2(\theta/2)$, the theorem follows.
3.3. Characterizations of various classes of matrices. As in [2] we provide characterizations of various known classes of matrices.
Theorem 3.7. Let $A$ be a square complex matrix represented as in Theorem 2.1. Then
(i) $A$ has a group inverse if and only if none of the canonical angles between $\mathcal{R}(A)$ and $\mathcal{R}(A^*)$ is $\pi/2$, or equivalently, $C$ is nonsingular. In this case, one has
\[
A^\#=Y\begin{bmatrix}C^{-1}M^{-1} & C^{-1}M^{-1}C^{-1}S\\ 0 & 0\end{bmatrix}Y^*. \tag{3.3}
\]
(ii) $A$ is a partial isometry (i.e., $A^*=A^\dagger$) if and only if $M$ is unitary.
(iii) $A$ is star-dagger (i.e., $A^*A^\dagger=A^\dagger A^*$) if and only if $MM^*C=CM^*M$.
(iv) $A$ is normal (i.e., $AA^*=A^*A$) if and only if all the canonical angles between $\mathcal{R}(A)$ and $\mathcal{R}(A^*)$ are zero and $M$ is normal.
(v) $A$ is an oblique projector (i.e., $A^2=A$) if and only if $CM=I_r$.
(vi) $A$ is an orthogonal projector (i.e., $A^2=A=A^*$) if and only if $M=I_r$ and all the canonical angles between $\mathcal{R}(A)$ and $\mathcal{R}(A^*)$ are $0$.
(vii) $A$ is EP (i.e., $AA^\dagger=A^\dagger A$, or equivalently, $P_{\mathcal{R}(A)}=P_{\mathcal{R}(A^*)}$) if and only if all the canonical angles between $\mathcal{R}(A)$ and $\mathcal{R}(A^*)$ are $0$.
(viii) $A$ is bi-EP (i.e., $AA^\dagger A^\dagger A=A^\dagger AAA^\dagger$, or $P_{\mathcal{R}(A)}P_{\mathcal{R}(A^*)}=P_{\mathcal{R}(A^*)}P_{\mathcal{R}(A)}$) if and only if all the canonical angles between $\mathcal{R}(A)$ and $\mathcal{R}(A^*)$ are $0$ or $\pi/2$.
(ix) $A$ is a contraction (i.e., $\|A\|\le 1$) if and only if $M$ is a contraction.
Proof. Let us write $A$ as in (2.1).
(i) One has that $A$ has a group inverse if and only if $\operatorname{rank}(A^2)=\operatorname{rank}(A)$ [4, Sec. 4.4]. Moreover, as is easy to see from (2.2), none of the canonical angles between $\mathcal{R}(A)$ and $\mathcal{R}(A^*)$ is $\pi/2$ if and only if $C$ is nonsingular. Furthermore, we have
\[
A^2=Y(MC\oplus I_{n-r})Y^*A. \tag{3.4}
\]
Assume that $C$ is nonsingular. From (3.4) we get $\operatorname{rank}(A^2)=\operatorname{rank}(A)$, because premultiplying by nonsingular matrices does not change the rank.
Assume that $C$ is singular. Since $M^*$ is nonsingular, there exists $u\in\mathbb{C}_{r,1}$ such that $u\neq\mathbf{0}$ and $CM^*u=\mathbf{0}$. Define
\[
v=Y\begin{bmatrix}u\\ \mathbf{0}\end{bmatrix}\in\mathbb{C}_{n,1}.
\]
Taking conjugate transposes in (3.4), we get
\[
(A^*)^2v=A^*Y\begin{bmatrix}CM^* & 0\\ 0 & I_{n-r}\end{bmatrix}Y^*\,Y\begin{bmatrix}u\\ \mathbf{0}\end{bmatrix}=\mathbf{0}
\]
and
\[
A^*v=Y\begin{bmatrix}CM^* & 0\\ S^*M^* & 0\end{bmatrix}Y^*\,Y\begin{bmatrix}u\\ \mathbf{0}\end{bmatrix}=Y\begin{bmatrix}\mathbf{0}\\ S^*M^*u\end{bmatrix}.
\]
If $A^*v=\mathbf{0}$, then $S^*M^*u=\mathbf{0}$. Recall that by the choice of $u$ we have $CM^*u=\mathbf{0}$; now, from Lemma 3.1 we get $M^*u=\mathbf{0}$, which contradicts $u\neq\mathbf{0}$ and the nonsingularity of $M^*$. Hence $v\in\mathcal{N}((A^*)^2)$ and $v\notin\mathcal{N}(A^*)$; therefore the null spaces of $A^*$ and $(A^*)^2$ are not equal, and thus, the ranks of $A$ and $A^2$ are not equal. To finish the proof of this item, let us observe that the expression (3.3) can be checked by direct verification of the three defining conditions of the group inverse.
(ii) Using the expressions (2.1) and (2.22), we have $A^*=A^\dagger$ if and only if $CM^*=CM^{-1}$ and $S^*M^*=S^*M^{-1}$. Lemma 3.1 leads to
\[
CM^*=CM^{-1} \ \text{and}\ S^*M^*=S^*M^{-1} \iff M^{-1}=M^*.
\]
(iii) The proof is similar to that of (ii).
(iv) We have $AA^*=A^*A$ if and only if the following four equalities are satisfied:
\[
MM^*=CM^*MC,\qquad 0=CM^*MS,\qquad 0=S^*M^*MC,\qquad 0=S^*M^*MS.
\]
In view of the nonsingularity of $M$ we have $0=S^*M^*MS \Leftrightarrow 0=(MS)^*(MS) \Leftrightarrow MS=0 \Leftrightarrow S=0$. From the first relation of (2.15) we have $S=0 \Leftrightarrow C^2=I_r$. Having in mind that $C$ is a diagonal matrix with nonnegative real numbers on its diagonal, we have $C^2=I_r \Leftrightarrow C=I_r$. Therefore, $AA^*=A^*A$ if and only if $C=I_r$ and $MM^*=M^*M$. From (2.3), the equality $C=I_r$ is equivalent to saying that all the canonical angles between $\mathcal{R}(A)$ and $\mathcal{R}(A^*)$ are zero.
(v) We have $A^2=A$ if and only if $MCMC=MC$ and $MCMS=MS$. Taking into account that $M$ is nonsingular, we get $A^2=A$ if and only if $CMC=C$ and $CMS=S$. Lemma 3.1 finishes the proof of this item.
(vi) Since $M$ is nonsingular, we have $A^*=A$ if and only if $MC=CM^*$ and $S=0$. In view of (2.15) and item (v), we get $A^2=A=A^*$ if and only if $C=M=I_r$.
(vii) It follows from Theorem 3.4.
(viii) It is easily seen that
\[
AA^\dagger A^\dagger A-A^\dagger AAA^\dagger=Y\begin{bmatrix}0 & CS\\ -S^*C & 0\end{bmatrix}Y^*.
\]
Thus, $A$ is bi-EP if and only if $CS=0$. If $CS=0$, from the first equality of (2.15) we get $C^3=C$, so the diagonal entries of $C$ belong to $\{0,1\}$; hence, from (2.3), (2.4), and (2.14), we have
\[
p=0,\qquad C=0_{y,y}\oplus I_x,\qquad S=I_y\oplus 0_{x,t}. \tag{3.5}
\]
Obviously, (3.5) implies $CS=0$. Evidently, from Theorem 2.1, the conditions given in (3.5) are equivalent to saying that all the canonical angles between $\mathcal{R}(A)$ and $\mathcal{R}(A^*)$ are $0$ or $\pi/2$.
(ix) It is trivial from Lemma 3.5.
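Formula (3.3) for the group inverse lends itself to a direct numerical check. In the sketch below (NumPy assumed; the sizes, the angle, and the random $M$, $Y$ are arbitrary, and $C$ is chosen nonsingular so that $A^\#$ exists), the candidate from (3.3) satisfies the three defining equations of the group inverse:

```python
import numpy as np

rng = np.random.default_rng(5)
n, r = 5, 2
theta = np.pi / 4
C = np.diag([np.cos(theta), 1.0])                  # nonsingular: no angle is pi/2
S = np.zeros((r, n - r))
S[0, 0] = np.sin(theta)
M = rng.standard_normal((r, r))
Y, _ = np.linalg.qr(rng.standard_normal((n, n)))

blk = lambda top: Y @ np.vstack([top, np.zeros((n - r, n))]) @ Y.T
A = blk(np.hstack([M @ C, M @ S]))

Ci, Mi = np.linalg.inv(C), np.linalg.inv(M)
Ash = blk(np.hstack([Ci @ Mi, Ci @ Mi @ Ci @ S]))  # candidate A# from (3.3)

# the three defining equations of the group inverse
assert np.allclose(A @ Ash @ A, A)
assert np.allclose(Ash @ A @ Ash, Ash)
assert np.allclose(A @ Ash, Ash @ A)
```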
3.4. Applications to some partial matrix orderings. In the following, three partial orderings in $\mathbb{C}_{n,n}$ will be studied. The first of them is the star ordering introduced in [10], which is defined by
\[
A\overset{*}{\le}B \iff A^\dagger A=A^\dagger B \ \text{and}\ AA^\dagger=BA^\dagger,
\]
or alternatively,
\[
A\overset{*}{\le}B \iff A^*A=A^*B \ \text{and}\ AA^*=BA^*. \tag{3.6}
\]
The second one is the sharp ordering defined in [19]:
\[
A\overset{\#}{\le}B \iff A^\#A=A^\#B \ \text{and}\ AA^\#=BA^\#,
\]
when $A$ and $B$ have group inverses. It is easy to verify that
\[
A\overset{\#}{\le}B \iff AB=A^2=BA. \tag{3.7}
\]
Furthermore, we will consider the minus ordering defined in [13]. An equivalent form of this ordering is the following [7, 17]:
\[
A\overset{-}{\le}B \iff AB^\dagger B=A,\quad BB^\dagger A=A,\quad AB^\dagger A=A. \tag{3.8}
\]
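For a concrete feel of these orderings, diagonal matrices already illustrate them. In the sketch below (an arbitrary diagonal example, NumPy assumed), $A$ agrees with $B$ on part of the diagonal and vanishes elsewhere, so both the star ordering (3.6) and the minus ordering (3.8) hold:

```python
import numpy as np

A = np.diag([1.0, 2.0, 0.0])
B = np.diag([1.0, 2.0, 3.0])

# star ordering via (3.6): A*A = A*B and AA* = BA*
assert np.allclose(A.T @ A, A.T @ B)
assert np.allclose(A @ A.T, B @ A.T)

# minus ordering via (3.8): A B†B = A, B B†A = A, A B†A = A
Bd = np.linalg.pinv(B)
assert np.allclose(A @ Bd @ B, A)
assert np.allclose(B @ Bd @ A, A)
assert np.allclose(A @ Bd @ A, A)
```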
In [3], the authors provide handy tools to verify whether given matrices $A$ and $B$ satisfy $A\overset{*}{\le}B$, $A\overset{\#}{\le}B$, or $A\overset{-}{\le}B$ when $A$ is written as in (1.1). However, the characterizations given in [3] of $B\overset{*}{\le}A$, $B\overset{\#}{\le}A$, and $B\overset{-}{\le}A$, when $A$ is written as in (1.1), lead to various sets of matrix equations that are very difficult to handle. Under particular situations, these sets of equations can be reduced, as the authors of [3] showed. In the forthcoming Theorem 3.9 we find a more general situation where these sets of equations can be reduced. Before doing this, the following result, given by Cao in [6], will be helpful.
Theorem 3.8. If $A\in\mathbb{C}_{r,r}$, $B\in\mathbb{C}_{r,s}$, $C\in\mathbb{C}_{s,s}$, and $M=\begin{bmatrix}A & B\\ 0 & C\end{bmatrix}$, then $M^\#$ exists if and only if $A^\#$ and $C^\#$ exist and, in addition, $(I_r-AA^\#)B(I_s-CC^\#)=0$ holds. Furthermore, when $M^\#$ exists, it is given by
\[
M^\#=\begin{bmatrix}A^\# & (A^\#)^2B(I_s-CC^\#)+(I_r-AA^\#)B(C^\#)^2-A^\#BC^\#\\ 0 & C^\#\end{bmatrix}.
\]
Theorem 3.9. Let $A,B\in\mathbb{C}_{n,n}$ and let $A$ be of the form (2.1). Assume that $A$ has a group inverse.
(i) $B\overset{*}{\le}A$ if and only if $B$ can be written as
\[
B=Y\begin{bmatrix}B_1 & B_2\\ 0 & 0\end{bmatrix}Y^*,\qquad B_1\in\mathbb{C}_{r,r},\quad B_2\in\mathbb{C}_{r,n-r}, \tag{3.9}
\]
where $B_1$ and $B_2$ satisfy
\[
B_1^*B_1=B_1^*MC,\qquad B_2=B_1C^{-1}S,\qquad B_1C^{-2}B_1^*=MC^{-1}B_1^*. \tag{3.10}
\]
(ii) If $B$ has a group inverse, then $B\overset{\#}{\le}A$ if and only if $B$ can be written as in (3.9) and $B_1$, $B_2$ satisfy
\[
B_1\overset{\#}{\le}MC \quad\text{and}\quad B_2=C^{-1}M^{-1}B_1MS. \tag{3.11}
\]
(iii) $B\overset{-}{\le}A$ if and only if $B$ can be written as in (3.9) and $B_1$, $B_2$ satisfy
\[
B_1C^{-1}M^{-1}B_1=B_1 \quad\text{and}\quad B_1C^{-1}M^{-1}B_2=B_2. \tag{3.12}
\]
Proof. Let us write $A$ as in (2.1). Since $A$ has a group inverse, by Theorem 3.7, $C$ is nonsingular. Also, let us write $B$ as
\[
B=Y\begin{bmatrix}B_1 & B_2\\ B_3 & B_4\end{bmatrix}Y^*,\qquad B_1\in\mathbb{C}_{r,r},\quad B_4\in\mathbb{C}_{n-r,n-r}. \tag{3.13}
\]
(i) We obtain from (2.1) and (3.13)
\[
B^*B=Y\begin{bmatrix}B_1^*B_1+B_3^*B_3 & B_1^*B_2+B_3^*B_4\\ B_2^*B_1+B_4^*B_3 & B_2^*B_2+B_4^*B_4\end{bmatrix}Y^*, \tag{3.14}
\]
\[
BB^*=Y\begin{bmatrix}B_1B_1^*+B_2B_2^* & B_1B_3^*+B_2B_4^*\\ B_3B_1^*+B_4B_2^* & B_3B_3^*+B_4B_4^*\end{bmatrix}Y^*, \tag{3.15}
\]
\[
B^*A=Y\begin{bmatrix}B_1^*MC & B_1^*MS\\ B_2^*MC & B_2^*MS\end{bmatrix}Y^*, \tag{3.16}
\]
and
\[
AB^*=Y\begin{bmatrix}MCB_1^*+MSB_2^* & MCB_3^*+MSB_4^*\\ 0 & 0\end{bmatrix}Y^*. \tag{3.17}
\]
Assume $B\overset{*}{\le}A$. In particular we have $BB^*=AB^*$, which in view of (3.14), (3.15), (3.16), and (3.17) leads to $B_3B_3^*+B_4B_4^*=0$, i.e., $B_3=0$ and $B_4=0$. Taking these relations into account, it is seen that $B\overset{*}{\le}A$ implies
\[
B_1^*B_1=B_1^*MC,\qquad B_1^*B_2=B_1^*MS,\qquad B_2^*B_1=B_2^*MC,\qquad B_2^*B_2=B_2^*MS, \tag{3.18}
\]
and
\[
B_1B_1^*+B_2B_2^*=MCB_1^*+MSB_2^*. \tag{3.19}
\]
The first equality of (3.18) implies that $B_1^*MC$ is Hermitian, thus
\[
M^{-*}C^{-1}B_1^*=B_1C^{-1}M^{-1}. \tag{3.20}
\]
From the second and third relations of (3.18) we obtain
\[
B_1^*MS=B_1^*B_2=(B_2^*B_1)^*=(B_2^*MC)^*=CM^*B_2.
\]
Solving for $B_2$ and using (3.20) we get
\[
B_2=M^{-*}C^{-1}B_1^*MS=B_1C^{-1}M^{-1}MS=B_1C^{-1}S.
\]
Now, we shall simplify each side of (3.19) by using (2.15):
\[
B_1B_1^*+B_2B_2^*=B_1B_1^*+B_1C^{-1}SS^*C^{-1}B_1^*=B_1B_1^*+B_1\bigl(C^{-1}(I_r-C^2)C^{-1}\bigr)B_1^*=B_1C^{-2}B_1^* \tag{3.21}
\]
and
\[
MCB_1^*+MSB_2^*=MCB_1^*+MSS^*C^{-1}B_1^*=M\bigl(C+(I_r-C^2)C^{-1}\bigr)B_1^*=MC^{-1}B_1^*. \tag{3.22}
\]
Now, we will prove that if $B$ is written as in (3.9) and the conditions (3.10) are satisfied, then $B\overset{*}{\le}A$. In other words, we will verify (3.18) and (3.19). By the computations made in (3.21) and (3.22), condition (3.19) holds. Thus, it only remains to prove the second, third, and fourth relations of (3.18):
\[
B_1^*B_2=B_1^*B_1C^{-1}S=B_1^*MCC^{-1}S=B_1^*MS,
\]
\[
B_2^*B_1=S^*C^{-1}B_1^*B_1=S^*C^{-1}B_1^*MC=B_2^*MC,
\]
and
\[
B_2^*B_2=S^*C^{-1}B_1^*B_1C^{-1}S=S^*C^{-1}B_1^*MCC^{-1}S=B_2^*MS.
\]
The sufficiency is proved.
(ii) From (2.1) and (3.13) it follows
\[
AB=Y\begin{bmatrix}MCB_1+MSB_3 & MCB_2+MSB_4\\ 0 & 0\end{bmatrix}Y^*,
\]
\[
BA=Y\begin{bmatrix}B_1MC & B_1MS\\ B_3MC & B_3MS\end{bmatrix}Y^*,
\]
and
\[
B^2=Y\begin{bmatrix}B_1^2+B_2B_3 & B_1B_2+B_2B_4\\ B_3B_1+B_4B_3 & B_3B_2+B_4^2\end{bmatrix}Y^*.
\]
Assume that $B\overset{\#}{\le}A$, i.e., $AB=BA=B^2$ holds. From $AB=BA$ we get
\[
MCB_1+MSB_3=B_1MC,\qquad MCB_2+MSB_4=B_1MS,\qquad B_3MC=0,\qquad B_3MS=0.
\]
The nonsingularity of $M$ and $C$ leads to $B_3=0$. Therefore, we obtain
\[
MCB_1=B_1MC,\qquad MCB_2+MSB_4=B_1MS,\qquad B_3=0. \tag{3.23}
\]
Taking into account that $B_3=0$, from $BA=B^2$ we get
\[
B_1MC=B_1^2,\qquad B_1MS=B_1B_2+B_2B_4,\qquad 0=B_4^2. \tag{3.24}
\]
Since $B=Y\begin{bmatrix}B_1 & B_2\\ 0 & B_4\end{bmatrix}Y^*$, we can apply Theorem 3.8, obtaining that $B_4$ has a group inverse. Since $0=B_4^2$, premultiplying by $B_4^\#$ we get $B_4=0$. Hence (3.23) and (3.24) reduce to
\[
MCB_1=B_1MC=B_1^2,\qquad MCB_2=B_1MS=B_1B_2. \tag{3.25}
\]
In particular, from (3.25) and (3.7), we get $B_1\overset{\#}{\le}MC$ and $B_2=C^{-1}M^{-1}B_1MS$.
Let us prove that if $B$ is written as in (3.9) and $B_1$, $B_2$ satisfy (3.11), then $B\overset{\#}{\le}A$; in other words, let us prove $AB=BA=B^2$. If we write $A$ as in (2.1) and $B$ as in (3.9), we obtain $AB=BA=B^2$ if and only if $B_1MC=MCB_1=B_1^2$ and $B_1MS=MCB_2=B_1B_2$. In view of the assumptions, it only remains to prove $B_1B_2=B_1MS$:
\[
B_1B_2=B_1C^{-1}M^{-1}B_1MS=C^{-1}M^{-1}B_1^2MS=C^{-1}M^{-1}MCB_1MS=B_1MS.
\]
(iii) Assume $B\overset{-}{\le}A$. The corresponding version of (3.8) leads to $AA^\dagger B=B$, which yields $B_3=0$ and $B_4=0$. From $BA^\dagger A=B$ we get $B_1C^2+B_2S^*C=B_1$ and $B_1CS+B_2S^*S=B_2$. Taking into account (2.15), these relations yield
\[
B_2S^*C=B_1SS^*,\qquad B_2(\widehat{C}\oplus I_t)^2=B_1S(\widehat{C}\oplus I_t). \tag{3.26}
\]
Since $C$ is nonsingular, so is $\widehat{C}\oplus I_t$, and the second relation of (3.26) yields $B_2(\widehat{C}\oplus I_t)=B_1S$. Parenthetically, let us remark that postmultiplying $B_2(\widehat{C}\oplus I_t)=B_1S$ by $S^*$ and using the third relation of (2.15) yields the first relation of (3.26). Therefore, the two relations of (3.26) are equivalent to the simpler relation
\[
B_2(\widehat{C}\oplus I_t)=B_1S. \tag{3.27}
\]
It further follows that $BA^\dagger B=B$, obtained from the third condition of (3.8), is equivalent to
\[
(B_1C+B_2S^*)M^{-1}B_1=B_1,\qquad (B_1C+B_2S^*)M^{-1}B_2=B_2. \tag{3.28}
\]
Now let us simplify $B_1C+B_2S^*$. To do this, we use (2.15) and (3.27). Observe that the third relation of (2.15) can be equivalently written as $S(\widehat{C}\oplus I_t)^{-1}=C^{-1}S$, and (3.27) is equivalent to $B_2=B_1S(\widehat{C}\oplus I_t)^{-1}$. Therefore,
\[
B_1C+B_2S^*=B_1C+B_1S(\widehat{C}\oplus I_t)^{-1}S^*=B_1(C+C^{-1}SS^*)=B_1C^{-1}(C^2+SS^*)=B_1C^{-1},
\]
and thus, (3.28) is equivalent to (3.12).
To conclude, let us observe that conditions (3.10), (3.11), and (3.12) are much easier to manage, and let us remark that these conditions were obtained under the assumption that $A$ has a group inverse, a condition on $A$ much more general than being idempotent or EP.
REFERENCES
[1] O.M. Baksalary and J. Benítez. On linear combinations of two commuting hypergeneralized projectors. Computers & Mathematics with Applications, 56:2481–2489, 2008.
[2] O.M. Baksalary and G. Trenkler. Characterizations of EP, normal, Hermitian matrices. Linear and Multilinear Algebra, 56:299–304, 2008.
[3] O.M. Baksalary, G. Trenkler, and G.P.H. Styan. On a matrix decomposition of Hartwig and Spindelböck. Linear Algebra and its Applications, 430:2798–2812, 2009.
[4] A. Ben-Israel and T.N.E. Greville. Generalized Inverses. Theory and Applications (2nd edition). Springer-Verlag, 2002.
[5] J. Benítez and V. Rakočević. Applications of CS decomposition in linear combinations of two orthogonal projectors. Applied Mathematics and Computation, 203:761–769, 2008.
[6] C.G. Cao. Some results of group inverses for partitioned matrices over skew fields. Journal of Natural Science of Heilongjiang University, 18(3):5–7, 2001.
[7] R.E. Cline and R.E. Funderlic. The rank of a difference of matrices and associated generalized inverses. Linear Algebra and its Applications, 24:185–215, 1979.
[8] J. Dauxois and G.M. Nkiet. Canonical analysis of two Euclidean subspaces and its applications. Linear Algebra and its Applications, 264:355–388, 1997.
[9] C. Davis and W.M. Kahan. The rotation of eigenvectors by a perturbation III. SIAM Journal on Numerical Analysis, 7:1–46, 1970.
[10] M.P. Drazin. Natural structures on semigroups with involution. Bulletin of the American Mathematical Society, 84:139–141, 1978.
[11] A. Galantai and Cs.J. Hegedus. Jordan's principal angles in complex vector spaces. Numerical Linear Algebra with Applications, 13:589–598, 2006.
[12] G.H. Golub and C.F. van Loan. Matrix Computations (3rd edition). Johns Hopkins Studies in the Mathematical Sciences, Johns Hopkins University Press, Baltimore, MD, 1996.
[13] R.E. Hartwig. How to partially order regular elements. Mathematica Japonica, 25:1–13, 1980.
[14] R.E. Hartwig and K. Spindelböck. Matrices for which $A^*$ and $A^\dagger$ commute. Linear and Multilinear Algebra, 14:241–256, 1984.
[15] H. Hotelling. Relation between two sets of variables. Biometrika, 28:322–377, 1936.
[16] E.R. Jessup, M.W. Berry, and Z. Drmac. Matrices, vector spaces, and information retrieval. SIAM Review, 41:335–362, 1999.
[17] G. Marsaglia and G.P.H. Styan. Equalities and inequalities for ranks of matrices. Linear and Multilinear Algebra, 2:269–292, 1974.
[18] C.D. Meyer. Matrix Analysis and Applied Linear Algebra. SIAM, Philadelphia, PA, 2000.
[19] S.K. Mitra. On group inverses and the sharp order. Linear Algebra and its Applications, 92:17–37, 1987.
[20] C.C. Paige and M. Wei. History and generality of the CS decomposition. Linear Algebra and its Applications, 208/209:303–326, 1994.
[21] V. Rakočević and H.K. Wimmer. A variational characterization of canonical angles between subspaces. Journal of Geometry, 78:122–124, 2003.
[22] I.B. Risteski and K.G. Trencevski. Principal values and principal subspaces of two subspaces of vector spaces with inner product. Beiträge zur Algebra und Geometrie, 42:289–300, 2001.
[24] G. Trenkler. On oblique and orthogonal projectors. In Contributions to Probability and Statistics: Applications and Challenges, Proceedings of the International Statistics Workshop. World Scientific, Singapore, 178–191, 2006.
[25] H.K. Wimmer. Canonical angles of unitary spaces and perturbations of direct complements.