BLOCK REPRESENTATIONS OF THE DRAZIN INVERSE OF A
BIPARTITE MATRIX∗
M. CATRAL†, D.D. OLESKY‡, AND P. VAN DEN DRIESSCHE†
Abstract. Block representations of the Drazin inverse of a bipartite matrix $A = \begin{bmatrix} 0 & B \\ C & 0 \end{bmatrix}$ in terms of the Drazin inverse of the smaller order block product $BC$ or $CB$ are presented. Relationships between the index of $A$ and the index of $BC$ are determined, and examples are given to illustrate all such possible relationships.
Key words. Drazin inverse, Bipartite matrix.
AMS subject classifications. 15A09.
1. Introduction. Let $A$ be an $n \times n$ real or complex matrix. The index of $A$ is the smallest nonnegative integer $q$ such that $\operatorname{rank} A^{q+1} = \operatorname{rank} A^q$. The Drazin inverse of $A$ is the unique matrix $A^D$ satisfying

(1.1) $AA^D = A^D A$,

(1.2) $A^D A A^D = A^D$,

(1.3) $A^{q+1} A^D = A^q$,

where $q = \operatorname{index} A$ (see, for example, [1, Chapter 4], [2, Chapter 7]). If index $A = 0$, then $A$ is nonsingular and $A^D = A^{-1}$. If index $A = 1$, then $A^D = A^\#$, the group inverse of $A$. See [1], [2], [8] and references therein for applications of the Drazin inverse.
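The defining equations (1.1)-(1.3) are easy to check numerically. The sketch below is our own illustration, not part of the paper: it computes the index directly from its rank definition, and computes $A^D$ via the standard pseudoinverse-based representation $A^D = A^k (A^{2k+1})^\dagger A^k$, which holds for any $k \geq \operatorname{index} A$ (so $k = n$ always suffices).

```python
import numpy as np

def matrix_index(A, tol=1e-9):
    """Smallest q with rank(A^(q+1)) == rank(A^q)."""
    q = 0
    P = np.eye(A.shape[0])          # P = A^q, starting from A^0 = I
    while np.linalg.matrix_rank(P @ A, tol=tol) != np.linalg.matrix_rank(P, tol=tol):
        P = P @ A
        q += 1
    return q

def drazin(A):
    # A^D = A^k (A^(2k+1))^+ A^k holds for any k >= index(A); k = n suffices.
    n = A.shape[0]
    Ak = np.linalg.matrix_power(A, n)
    return Ak @ np.linalg.pinv(np.linalg.matrix_power(A, 2 * n + 1)) @ Ak

A = np.array([[2.0, 1.0], [0.0, 0.0]])   # singular with index 1
q = matrix_index(A)                      # q == 1
AD = drazin(A)                           # AD == [[0.5, 0.25], [0, 0]]

# A^D satisfies (1.1)-(1.3):
assert np.allclose(A @ AD, AD @ A)
assert np.allclose(AD @ A @ AD, AD)
assert np.allclose(np.linalg.matrix_power(A, q + 1) @ AD,
                   np.linalg.matrix_power(A, q))
```

Since index $A = 1$ here, $A^D$ is in fact the group inverse $A^\#$.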
The problem of finding explicit representations for the Drazin inverse of a general 2×2 block matrix in terms of its blocks was posed by Campbell and Meyer in [2]. Since then, special cases of this problem have been studied. Some recent papers containing representations for the Drazin inverse of such 2×2 block matrices are [3], [4], [6], [7], [8], [9], [11] and [12]; however, the general problem remains open.
∗Received by the editors on October 20, 2008. Accepted for publication on January 25, 2009. Handling Editor: Ravindra B. Bapat.
†Department of Mathematics and Statistics, University of Victoria, Victoria, British Columbia, Canada V8W 3R4 (mcatral@uvic.ca, pvdd@math.uvic.ca).
‡Department of Computer Science, University of Victoria, Victoria, British Columbia, Canada V8W 3P6 (dolesky@cs.uvic.ca).
In this article, we consider $n \times n$ block matrices of the form

(1.4) $A = \begin{bmatrix} 0 & B \\ C & 0 \end{bmatrix}$,

where $B$ is $p \times (n-p)$, $C$ is $(n-p) \times p$ and the zero blocks are square. Since the digraph associated with a matrix of the form (1.4) is bipartite, we call such a matrix a bipartite matrix. In section 2, we give block representations for the Drazin inverse of a bipartite matrix. These block representations are given in terms of the Drazin inverse of either $BC$ or $CB$, both of which are matrices of smaller order than $A$. These formulas for $A^D$ when $A$ has the form (1.4) cannot, to our knowledge, be obtained from known formulas for the Drazin inverse of $2 \times 2$ block matrices. In section 3, we describe relations between the index of the matrix $BC$ and the index of $A$, and in section 4 we give examples to illustrate these results.
2. Block representations for $A^D$. The following result gives the Drazin inverse of a bipartite matrix in terms of the Drazin inverse of a product of its submatrices.
Theorem 2.1. Let $A$ be as in (1.4). Then

(2.1) $A^D = \begin{bmatrix} 0 & (BC)^D B \\ C(BC)^D & 0 \end{bmatrix}$.

Furthermore, if index $BC = s$, then index $A \leq 2s+1$.
Proof. Denote the matrix on the right hand side of (2.1) by $X$. Then
$$AX = \begin{bmatrix} 0 & B \\ C & 0 \end{bmatrix} \begin{bmatrix} 0 & (BC)^D B \\ C(BC)^D & 0 \end{bmatrix} = \begin{bmatrix} BC(BC)^D & 0 \\ 0 & C(BC)^D B \end{bmatrix},$$
$$XA = \begin{bmatrix} 0 & (BC)^D B \\ C(BC)^D & 0 \end{bmatrix} \begin{bmatrix} 0 & B \\ C & 0 \end{bmatrix} = \begin{bmatrix} (BC)^D BC & 0 \\ 0 & C(BC)^D B \end{bmatrix}$$
and
$$XAX = \begin{bmatrix} (BC)^D BC & 0 \\ 0 & C(BC)^D B \end{bmatrix} \begin{bmatrix} 0 & (BC)^D B \\ C(BC)^D & 0 \end{bmatrix} = \begin{bmatrix} 0 & (BC)^D BC(BC)^D B \\ C(BC)^D BC(BC)^D & 0 \end{bmatrix} = \begin{bmatrix} 0 & (BC)^D B \\ C(BC)^D & 0 \end{bmatrix}$$
by (1.2). Thus, $X$ satisfies $AX = XA$ by (1.1) and $XAX = X$. Let index $BC = s$. Then
$$A^{2s+2}X = \begin{bmatrix} (BC)^{s+1} & 0 \\ 0 & (CB)^{s+1} \end{bmatrix} \begin{bmatrix} 0 & (BC)^D B \\ C(BC)^D & 0 \end{bmatrix} = \begin{bmatrix} 0 & (BC)^{s+1}(BC)^D B \\ (CB)^{s+1}C(BC)^D & 0 \end{bmatrix}$$
$$= \begin{bmatrix} 0 & (BC)^s B \\ C(BC)^{s+1}(BC)^D & 0 \end{bmatrix} \quad \text{by (1.3) and associativity}$$
$$= \begin{bmatrix} 0 & (BC)^s B \\ C(BC)^s & 0 \end{bmatrix} \quad \text{by (1.3)}$$
$$= A^{2s+1}.$$
By [2, Theorem 7.2.3], index $A \leq 2s+1$ and $X = A^D$.
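Theorem 2.1 lends itself to a numerical sanity check. The sketch below is our own illustration with test matrices of our choosing; the helper `drazin` uses the standard identity $A^D = A^k (A^{2k+1})^\dagger A^k$, valid for any $k \geq \operatorname{index} A$, so that both sides of (2.1) can be computed independently and compared.

```python
import numpy as np

def drazin(A):
    # A^D = A^k (A^(2k+1))^+ A^k holds for any k >= index(A); k = n suffices.
    n = A.shape[0]
    Ak = np.linalg.matrix_power(A, n)
    return Ak @ np.linalg.pinv(np.linalg.matrix_power(A, 2 * n + 1)) @ Ak

# A small bipartite matrix as in (1.4), with p = 2 and n = 5:
B = np.array([[1.0, 0.0, 1.0], [0.0, 1.0, 0.0]])      # p x (n-p)
C = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])    # (n-p) x p
A = np.block([[np.zeros((2, 2)), B], [C, np.zeros((3, 3))]])

# The block formula (2.1), built only from the smaller product BC:
BCD = drazin(B @ C)
X = np.block([[np.zeros((2, 2)), BCD @ B], [C @ BCD, np.zeros((3, 3))]])

assert np.allclose(drazin(A), X)   # A^D agrees with the block formula
assert np.allclose(A @ X, X @ A)   # and X satisfies (1.1)
```

The point of the formula is visible here: the Drazin inverse of the $5 \times 5$ matrix $A$ is recovered from that of the $2 \times 2$ product $BC$.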
We now give three lemmas, the results of which are used to write $A^D$ in terms of $(CB)^D$, rather than $(BC)^D$ as in (2.1). Lemma 2.2 is an easy exercise using the definition of the Drazin inverse, and Lemma 2.3 is proved in [2, p. 149] for square matrices, but that proof holds for the more general case stated below.

Lemma 2.2. If $U$ is an $n \times n$ matrix, then $(U^2)^D = (U^D)^2$.

Lemma 2.3. If $V$ is $m \times n$ and $W$ is $n \times m$, then $(VW)^D = V[(WV)^2]^D W$.

Lemma 2.4. If $B$ is $p \times (n-p)$ and $C$ is $(n-p) \times p$, then $(BC)^D B = B(CB)^D$.

Proof. By Lemmas 2.2 and 2.3, $(BC)^D = B[(CB)^2]^D C = B[(CB)^D]^2 C$. Thus $(BC)^D B = B[(CB)^D]^2 CB = B(CB)^D$, where the last equality follows from (1.1) and (1.2) applied to $CB$.
Note that Lemma 2.4 implies that $(CB)^D C = C(BC)^D$, and thus Theorem 2.1 gives the following four representations for the Drazin inverse $A^D$ of a bipartite matrix.

Corollary 2.5. Let $A$ be as in (1.4). Then
$$A^D = \begin{bmatrix} 0 & (BC)^D B \\ C(BC)^D & 0 \end{bmatrix} = \begin{bmatrix} 0 & B(CB)^D \\ C(BC)^D & 0 \end{bmatrix} = \begin{bmatrix} 0 & (BC)^D B \\ (CB)^D C & 0 \end{bmatrix} = \begin{bmatrix} 0 & B(CB)^D \\ (CB)^D C & 0 \end{bmatrix}.$$
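The two identities behind Corollary 2.5, $(BC)^D B = B(CB)^D$ (Lemma 2.4) and $(CB)^D C = C(BC)^D$, can be checked numerically. This sketch is our own illustration, using the same pseudoinverse-based Drazin computation as above on test matrices of our choosing:

```python
import numpy as np

def drazin(A):
    # A^D = A^k (A^(2k+1))^+ A^k holds for any k >= index(A); k = n suffices.
    n = A.shape[0]
    Ak = np.linalg.matrix_power(A, n)
    return Ak @ np.linalg.pinv(np.linalg.matrix_power(A, 2 * n + 1)) @ Ak

B = np.array([[1.0, 0.0, 1.0], [0.0, 1.0, 0.0]])      # 2 x 3
C = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])    # 3 x 2

# Lemma 2.4 and its companion identity, which make all four block forms equal:
assert np.allclose(drazin(B @ C) @ B, B @ drazin(C @ B))
assert np.allclose(drazin(C @ B) @ C, C @ drazin(B @ C))
```

Note that $BC$ is $2 \times 2$ while $CB$ is $3 \times 3$; the identities let either product be used, whichever is smaller or more convenient.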
We end this section with some special cases for $A^D$ in Corollary 2.5. If $A$ is nonsingular, then $B$ and $C$ are necessarily square and nonsingular, and the formulas in Corollary 2.5 reduce to
$$A^{-1} = \begin{bmatrix} 0 & C^{-1} \\ B^{-1} & 0 \end{bmatrix}.$$
If $BC$ is nilpotent, then both $(BC)^D$ and $A^D$ are zero matrices. If $C = B^*$ is $(n-p) \times p$ with rank $B < p$, then $BB^*$ is singular and Hermitian; thus index $BB^* = 1$. As $A$ is also Hermitian, index $A = 1$. In this case, $A^D = A^\# = A^\dagger$, the Moore-Penrose inverse of $A$, with
$$A^D = \begin{bmatrix} 0 & (BB^*)^\dagger B \\ B^*(BB^*)^\dagger & 0 \end{bmatrix} = \begin{bmatrix} 0 & B^{*\dagger} \\ B^\dagger & 0 \end{bmatrix} = A^\dagger.$$
3. Index of $A$ from the index of $BC$. Results in this section that we state for index $A$ in terms of $BC$ and index $BC$ could alternatively be stated in terms of $CB$ and index $CB$.

Let $A$ be as in (1.4). Then for $j = 0, 1, \ldots$
$$A^{2j} = \begin{bmatrix} (BC)^j & 0 \\ 0 & (CB)^j \end{bmatrix}$$
and
$$A^{2j+1} = \begin{bmatrix} 0 & (BC)^j B \\ C(BC)^j & 0 \end{bmatrix} = \begin{bmatrix} 0 & B(CB)^j \\ (CB)^j C & 0 \end{bmatrix}.$$
Thus,

(3.1) $\operatorname{rank} A^{2j} = \operatorname{rank}(BC)^j + \operatorname{rank}(CB)^j$

and

(3.2) $\operatorname{rank} A^{2j+1} = \operatorname{rank}(BC)^j B + \operatorname{rank} C(BC)^j$.
Let $s = \operatorname{index} BC$ and suppose that $s = 0$. If $n = 2p$, then $B$ and $C$ are both $p \times p$ invertible matrices and index $A = 0$. In this case, $A^D = A^{-1}$. Otherwise, if $n \neq 2p$, then $\operatorname{rank} BC = \operatorname{rank} B = \operatorname{rank} C = \operatorname{rank} CB = p$, index $A = 1$ and $A^D = A^\#$ as given in [5, Theorem 2.2] in terms of $(BC)^{-1}$.
We use the following rank inequality of Frobenius (see [10, page 13]) in the proof of some of the results in this section.

Lemma 3.1. (Frobenius Inequality) If $U$ is $m \times k$, $V$ is $k \times n$ and $W$ is $n \times p$, then
$$\operatorname{rank} UV + \operatorname{rank} VW \leq \operatorname{rank} V + \operatorname{rank} UVW.$$
Theorem 3.2. Let $A$ be as in (1.4) and suppose that index $BC = s \geq 1$. Then index $A = 2s-1$, $2s$ or $2s+1$.

Proof. From Theorem 2.1, index $A \leq 2s+1$. By Lemma 3.1,
$$\operatorname{rank} B(CB)^{s-1} + \operatorname{rank}(CB)^{s-1}C \leq \operatorname{rank}(CB)^{s-1} + \operatorname{rank}(BC)^s < \operatorname{rank}(CB)^{s-1} + \operatorname{rank}(BC)^{s-1},$$
since index $BC = s$. Thus, using (3.1) and (3.2), $\operatorname{rank} A^{2s-1} < \operatorname{rank} A^{2s-2}$ and index $A > 2s-2$. Therefore, index $A = 2s-1$, $2s$ or $2s+1$.
In the following three theorems, we give necessary and sufficient conditions for each of the values of index $A$ which are identified in Theorem 3.2.

Theorem 3.3. Let $A$ be as in (1.4) and suppose that index $BC = s \geq 1$. Then index $A = 2s-1$ if and only if
(i) $\operatorname{rank}(BC)^s = \operatorname{rank}(BC)^{s-1}B$ and $\operatorname{rank}(CB)^s = \operatorname{rank}(CB)^{s-1}C$, or
(ii) $\operatorname{rank}(BC)^s = \operatorname{rank}(CB)^{s-1}C$ and $\operatorname{rank}(CB)^s = \operatorname{rank}(BC)^{s-1}B$.

Proof. From Theorem 3.2, index $A \geq 2s-1$. Now, using (3.1) and (3.2), $\operatorname{rank} A^{2s} = \operatorname{rank} A^{2s-1}$ if and only if $\operatorname{rank}(BC)^s + \operatorname{rank}(CB)^s = \operatorname{rank}(BC)^{s-1}B + \operatorname{rank}(CB)^{s-1}C$, or equivalently, (i) or (ii) holds. Thus, index $A = 2s-1$ if and only if (i) or (ii) holds.

Note that if index $A = 2s-1$, then in fact $\operatorname{rank}(BC)^s = \operatorname{rank}(BC)^{s-1}B = \operatorname{rank}(CB)^s = \operatorname{rank}(CB)^{s-1}C$. The conditions of Theorem 3.3 hold for any Hermitian bipartite matrix $A$ (as in (1.4) with $C = B^*$) since index $BB^* = 1 = \operatorname{index} A$.
Lemma 3.4. If index $BC = s$, then
$$\operatorname{rank}(BC)^{s+1} = \operatorname{rank}(BC)^s = \operatorname{rank}(BC)^s B = \operatorname{rank} C(BC)^s = \operatorname{rank}(CB)^{s+1}.$$

Proof. Let $t = \operatorname{rank}(BC)^s$. Since index $BC = s$, it follows that $t = \operatorname{rank}(BC)^s = \operatorname{rank}(BC)^{s+1} = \operatorname{rank}(BC)^s B = \operatorname{rank} C(BC)^s$, where the latter two equalities hold as $t = \operatorname{rank}(BC)^{s+1} \leq \operatorname{rank}(BC)^s B \leq \operatorname{rank}(BC)^s = t$ and $t = \operatorname{rank}(BC)^{s+1} \leq \operatorname{rank} C(BC)^s \leq \operatorname{rank}(BC)^s = t$. By Lemma 3.1,
$$\operatorname{rank} C(BC)^s + \operatorname{rank}(BC)^s B \leq \operatorname{rank}(BC)^s + \operatorname{rank}(CB)^{s+1},$$
so
$$2t \leq t + \operatorname{rank}(CB)^{s+1} = \operatorname{rank} A^{2s+2} \leq \operatorname{rank} A^{2s+1} = 2t.$$
Thus, $\operatorname{rank}(CB)^{s+1} = t$.
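The five equal ranks of Lemma 3.4 can be observed numerically. In the sketch below (our own illustration), $B$ and $C$ are chosen so that index $BC = 2 = s$, and all five ranks come out equal to $t = 1$:

```python
import numpy as np

rank = np.linalg.matrix_rank
mpow = np.linalg.matrix_power

B = np.array([[1.0, 0.0], [1.0, -2.0], [-1.0, 1.0]])
C = np.array([[1.0, 1.0, 1.0], [1.0, 1.0, 2.0]])
BC, CB = B @ C, C @ B

# index BC = 2 = s for this choice of B and C:
s = 2
assert rank(mpow(BC, s)) < rank(mpow(BC, s - 1))
assert rank(mpow(BC, s + 1)) == rank(mpow(BC, s))

BCs = mpow(BC, s)
t = rank(BCs)
# the five equal ranks of Lemma 3.4:
assert rank(mpow(BC, s + 1)) == t
assert rank(BCs @ B) == t
assert rank(C @ BCs) == t
assert rank(mpow(CB, s + 1)) == t
```
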
Theorem 3.5. Let $A$ be as in (1.4) and suppose that index $BC = s \geq 1$. Then index $A = 2s$ if and only if index $CB = s$ and (i) $\operatorname{rank}(BC)^s < \operatorname{rank}(BC)^{s-1}B$ or (ii) $\operatorname{rank}(CB)^s < \operatorname{rank}(CB)^{s-1}C$.

Proof. Suppose that index $A = 2s$. Let $t = \operatorname{rank}(BC)^s$. Then
$$\operatorname{rank} A^{2s} = \operatorname{rank} A^{2s+1} = 2t < \operatorname{rank} A^{2s-1},$$
where the second equality follows from Lemma 3.4. Since $\operatorname{rank} A^{2s} = \operatorname{rank}(BC)^s + \operatorname{rank}(CB)^s = 2t$, it follows that $\operatorname{rank}(CB)^s = t$. Using (3.2),
$$\begin{aligned}
\operatorname{rank} A^{2s-1} &= \operatorname{rank}(BC)^{s-1}B + \operatorname{rank} C(BC)^{s-1} \\
&= \operatorname{rank} B(CB)^{s-1} + \operatorname{rank}(CB)^{s-1}C \\
&\leq \operatorname{rank}(CB)^{s-1} + \operatorname{rank}(BC)^s \quad \text{by Lemma 3.1} \\
&= \operatorname{rank}(CB)^{s-1} + t.
\end{aligned}$$
Thus, $2t < \operatorname{rank} A^{2s-1} \leq \operatorname{rank}(CB)^{s-1} + t$, so $t < \operatorname{rank}(CB)^{s-1}$. Since $\operatorname{rank}(CB)^s = t = \operatorname{rank}(CB)^{s+1}$, the latter by Lemma 3.4, it follows that index $CB = s$. If $\operatorname{rank}(BC)^s = \operatorname{rank}(BC)^{s-1}B$ and $\operatorname{rank}(CB)^s = \operatorname{rank}(CB)^{s-1}C$, then by Theorem 3.3(i), index $A = 2s-1$, a contradiction. Thus, $\operatorname{rank}(BC)^s < \operatorname{rank}(BC)^{s-1}B$ or $\operatorname{rank}(CB)^s < \operatorname{rank}(CB)^{s-1}C$.

On the other hand, suppose that index $CB = s$ and (i) or (ii) holds. Since (i) or (ii) holds, index $A \neq 2s-1$ from Theorem 3.3. If index $A = 2s+1$, then $\operatorname{rank} A^{2s+1} < \operatorname{rank} A^{2s}$, or equivalently $\operatorname{rank}(BC)^s B + \operatorname{rank} C(BC)^s < \operatorname{rank}(BC)^s + \operatorname{rank}(CB)^s$. Thus, Lemma 3.4 implies that $t < \operatorname{rank}(CB)^s$, contradicting index $CB = s$ and showing by Theorem 3.2 that index $A = 2s$.
If $A$ is as in (1.4) with index $BC = \operatorname{index} CB = s \geq 1$, then index $A$ is $2s$ or $2s-1$, depending on whether or not one of the rank conditions (i) or (ii) in Theorem 3.5 holds. Note that if neither of these rank conditions holds, then the rank condition (i) of Theorem 3.3 holds. For example, if $A$ is as in (1.4) with
$$B = C = \begin{bmatrix} 1 & 0 \\ 0 & 0 \end{bmatrix},$$
then index $BC = \operatorname{index} CB = 1 = s$, and index $A = 1 = 2s-1$. Note that neither of the rank conditions in Theorem 3.5 holds.
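The rank computations behind this example can be sketched as follows (our own numerical check, not part of the paper):

```python
import numpy as np

rank = np.linalg.matrix_rank

B = np.array([[1.0, 0.0], [0.0, 0.0]])
C = B.copy()
A = np.block([[np.zeros((2, 2)), B], [C, np.zeros((2, 2))]])

# index BC = index CB = 1 = s: rank BC < 2 but rank(BC)^2 = rank BC.
assert rank(B @ C) == 1 and rank(np.linalg.matrix_power(B @ C, 2)) == 1

# index A = 1 = 2s - 1: rank A = rank A^2.
assert rank(A) == 2 and rank(A @ A) == 2

# Neither strict rank condition of Theorem 3.5 holds:
assert rank(B @ C) == rank(B)   # rank(BC)^s equals rank(BC)^(s-1) B for s = 1
assert rank(C @ B) == rank(C)
```
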
Theorem 3.6. Let $A$ be as in (1.4) and suppose that index $BC = s \geq 1$. Then index $A = 2s+1$ if and only if $\operatorname{rank}(CB)^s > \operatorname{rank}(CB)^s C$.

Proof. Let $t = \operatorname{rank}(BC)^s$ and suppose that index $A = 2s+1$. Then as in the proof of Theorem 3.5, it follows that $t < \operatorname{rank}(CB)^s$. But by Lemma 3.4, $\operatorname{rank}(CB)^{s+1} = \operatorname{rank}(CB)^s C = \operatorname{rank} C(BC)^s = t$. Thus, $\operatorname{rank}(CB)^s > \operatorname{rank}(CB)^s C$. For the converse, $\operatorname{rank}(CB)^s > \operatorname{rank}(CB)^s C$ implies that $\operatorname{rank}(BC)^s + \operatorname{rank}(CB)^s > \operatorname{rank}(BC)^s B + \operatorname{rank} C(BC)^s$. That is, $\operatorname{rank} A^{2s} > \operatorname{rank} A^{2s+1}$ and by Theorem 3.2, index $A = 2s+1$.
Corollary 3.7. Let $A$ be as in (1.4) and suppose that index $BC = s \geq 1$. Then index $A = 2s+1$ if and only if index $CB = s+1$.

Proof. Suppose that index $A = 2s+1$. Theorem 3.6 gives $\operatorname{rank}(CB)^s > \operatorname{rank}(CB)^s C \geq \operatorname{rank}(CB)^{s+1}$, so index $CB \geq s+1$. The index assumptions on $A$ and $BC$ give $\operatorname{rank} A^{2s+2} = \operatorname{rank} A^{2s+4}$ and $\operatorname{rank}(BC)^{s+1} = \operatorname{rank}(BC)^{s+2}$, and using (3.1), these imply that $\operatorname{rank}(CB)^{s+1} = \operatorname{rank}(CB)^{s+2}$. Thus, index $CB = s+1$. For the converse, if index $CB = s+1$, then $\operatorname{rank}(CB)^s > \operatorname{rank}(CB)^{s+1}$, so $\operatorname{rank} A^{2s} > \operatorname{rank} A^{2s+2}$ by (3.1). Using (3.1), (3.2) and Lemma 3.4, $\operatorname{rank} A^{2s+2} = \operatorname{rank} A^{2s+1}$ and it follows that index $A = 2s+1$.
4. Examples.

Example 4.1. Let
$$A = \begin{bmatrix} 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 & -2 \\ 0 & 0 & 0 & -1 & 1 \\ 1 & 1 & 1 & 0 & 0 \\ 1 & 1 & 2 & 0 & 0 \end{bmatrix} = \begin{bmatrix} 0 & B \\ C & 0 \end{bmatrix}.$$
Then
$$BC = \begin{bmatrix} 1 & 1 & 1 \\ -1 & -1 & -3 \\ 0 & 0 & 1 \end{bmatrix},$$
and it is easily shown that index $BC = 2 = s$. Since $\operatorname{rank} A = 4$, $\operatorname{rank} A^2 = 3$ and $\operatorname{rank} A^3 = \operatorname{rank} A^4 = 2$, it follows that index $A = 3 = 2s-1$. By the formula in [2, Theorem 7.7.1] for the Drazin inverse of a $2 \times 2$ block triangular matrix and noting that the leading block of $BC$ is a $2 \times 2$ nilpotent matrix,
$$(BC)^D = \begin{bmatrix} 0 & 0 & -1 \\ 0 & 0 & -1 \\ 0 & 0 & 1 \end{bmatrix},$$
giving
$$A^D = \begin{bmatrix} 0 & (BC)^D B \\ C(BC)^D & 0 \end{bmatrix} = \begin{bmatrix} 0 & 0 & 0 & 1 & -1 \\ 0 & 0 & 0 & 1 & -1 \\ 0 & 0 & 0 & -1 & 1 \\ 0 & 0 & -1 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \end{bmatrix}.$$
It is easily verified that conditions (i) and (ii) of Theorem 3.3 are satisfied.
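The ranks and the stated $A^D$ in Example 4.1 can be confirmed numerically; the sketch below is our own check, again using the identity $A^D = A^k (A^{2k+1})^\dagger A^k$ with $k = n$:

```python
import numpy as np

def drazin(A):
    # A^D = A^k (A^(2k+1))^+ A^k holds for any k >= index(A); k = n suffices.
    n = A.shape[0]
    Ak = np.linalg.matrix_power(A, n)
    return Ak @ np.linalg.pinv(np.linalg.matrix_power(A, 2 * n + 1)) @ Ak

A = np.array([[0, 0, 0, 1, 0],
              [0, 0, 0, 1, -2],
              [0, 0, 0, -1, 1],
              [1, 1, 1, 0, 0],
              [1, 1, 2, 0, 0]], dtype=float)

rank = np.linalg.matrix_rank
ranks = [rank(np.linalg.matrix_power(A, k)) for k in (1, 2, 3, 4)]
assert ranks == [4, 3, 2, 2]      # so index A = 3

AD = np.array([[0, 0, 0, 1, -1],
               [0, 0, 0, 1, -1],
               [0, 0, 0, -1, 1],
               [0, 0, -1, 0, 0],
               [0, 0, 0, 0, 0]], dtype=float)
assert np.allclose(drazin(A), AD)  # matches the A^D displayed above
```
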
Example 4.2. Let
$$A = \begin{bmatrix} 0 & B \\ I & 0 \end{bmatrix},$$
where $B$ is a $p \times p$ singular matrix with index $B = s \geq 1$, and $C = I$, the $p \times p$ identity matrix. Then index $CB = s$ and $\operatorname{rank}(CB)^s < \operatorname{rank}(CB)^{s-1}C$; thus by Theorem 3.5, index $A = 2s$. In this case, by Theorem 2.1,
$$A^D = \begin{bmatrix} 0 & B^D B \\ B^D & 0 \end{bmatrix}.$$
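Example 4.2 can be instantiated with a concrete $B$; the choice below (ours, for illustration) is a singular idempotent $B$ with index $B = 1 = s$, so Theorem 3.5 predicts index $A = 2s = 2$:

```python
import numpy as np

def drazin(A):
    # A^D = A^k (A^(2k+1))^+ A^k holds for any k >= index(A); k = n suffices.
    n = A.shape[0]
    Ak = np.linalg.matrix_power(A, n)
    return Ak @ np.linalg.pinv(np.linalg.matrix_power(A, 2 * n + 1)) @ Ak

# A concrete instance of Example 4.2: B singular with index B = 1 = s, C = I.
p = 2
B = np.array([[1.0, 0.0], [0.0, 0.0]])
A = np.block([[np.zeros((p, p)), B], [np.eye(p), np.zeros((p, p))]])

rank = np.linalg.matrix_rank
# index A = 2s = 2: rank A > rank A^2 = rank A^3.
assert rank(A) == 3
assert rank(A @ A) == 2 and rank(np.linalg.matrix_power(A, 3)) == 2

BD = drazin(B)
X = np.block([[np.zeros((p, p)), BD @ B], [BD, np.zeros((p, p))]])
assert np.allclose(drazin(A), X)   # A^D = [0, B^D B; B^D, 0]
```
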
Example 4.3. Let $A$ be the $7 \times 7$ matrix
$$A = \begin{bmatrix} 0 & 0 & 0 & -1 & 0 & 0 & 1 \\ 0 & 0 & 0 & 1 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1 & 1 & 0 \\ 1 & 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & -1 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & -1 & 0 & 0 & 0 & 0 \\ 1 & 0 & 0 & 0 & 0 & 0 & 0 \end{bmatrix} = \begin{bmatrix} 0 & B \\ C & 0 \end{bmatrix},$$
for which the directed graph $D(A)$ is a path graph (see, for example, [5]). Then
$$BC = \begin{bmatrix} 0 & -1 & 0 \\ 1 & 0 & 1 \\ 0 & -1 & 0 \end{bmatrix} \quad \text{and} \quad (BC)^2 = \begin{bmatrix} -1 & 0 & -1 \\ 0 & -2 & 0 \\ -1 & 0 & -1 \end{bmatrix},$$
so $\operatorname{rank} BC = \operatorname{rank}(BC)^2 = 2$ and index $BC = 1 = s$. Thus, $(BC)^D = (BC)^\#$ and as the directed graph $D(BC)$ is a path graph, $(BC)^\#$ is given by [5, Corollary 3.8]:
$$(BC)^\# = \frac{1}{2}\begin{bmatrix} 0 & 1 & 0 \\ -1 & 0 & -1 \\ 0 & 1 & 0 \end{bmatrix}.$$
Using Theorem 2.1,
$$A^D = \begin{bmatrix} 0 & (BC)^\# B \\ C(BC)^\# & 0 \end{bmatrix} = \frac{1}{2}\begin{bmatrix} 0 & 0 & 0 & 1 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 & -1 & -1 & -1 \\ 0 & 0 & 0 & 1 & 1 & 0 & 0 \\ -1 & 1 & -1 & 0 & 0 & 0 & 0 \\ 1 & 1 & 1 & 0 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 & 0 \end{bmatrix}.$$
Note that $\operatorname{rank} CB = 3 > \operatorname{rank} CBC = 2$ and index $CB = 2$; thus by Theorem 3.6 or Corollary 3.7, index $A = 2s+1 = 3$. Although $A^\#$ does not exist, $(BC)^\#$ does exist and thus results in [5] can be applied to determine $A^D$ in this example.
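Example 4.3 can likewise be verified numerically (our own check, with the same pseudoinverse-based Drazin computation as in the earlier sketches):

```python
import numpy as np

def drazin(A):
    # A^D = A^k (A^(2k+1))^+ A^k holds for any k >= index(A); k = n suffices.
    n = A.shape[0]
    Ak = np.linalg.matrix_power(A, n)
    return Ak @ np.linalg.pinv(np.linalg.matrix_power(A, 2 * n + 1)) @ Ak

B = np.array([[-1, 0, 0, 1], [1, 1, 0, 0], [0, 1, 1, 0]], dtype=float)
C = np.array([[1, 1, 0], [0, -1, 1], [0, 0, -1], [1, 0, 0]], dtype=float)
A = np.block([[np.zeros((3, 3)), B], [C, np.zeros((4, 4))]])

rank = np.linalg.matrix_rank
# The condition of Theorem 3.6 with s = 1: rank CB > rank (CB)C.
assert rank(C @ B) == 3 and rank(C @ B @ C) == 2
# index A = 3: rank A^2 > rank A^3 = rank A^4.
assert rank(A @ A) == 5
assert rank(np.linalg.matrix_power(A, 3)) == 4
assert rank(np.linalg.matrix_power(A, 4)) == 4

AD = 0.5 * np.array([[0, 0, 0, 1, 1, 0, 0],
                     [0, 0, 0, 1, -1, -1, -1],
                     [0, 0, 0, 1, 1, 0, 0],
                     [-1, 1, -1, 0, 0, 0, 0],
                     [1, 1, 1, 0, 0, 0, 0],
                     [0, -1, 0, 0, 0, 0, 0],
                     [0, 1, 0, 0, 0, 0, 0]])
assert np.allclose(drazin(A), AD)   # matches the A^D displayed above
```
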
Acknowledgement. This research is partially supported by NSERC Discovery Grants.
REFERENCES
[1] A. Ben-Israel and T.N.E. Greville. Generalized Inverses: Theory and Applications. 2nd Edition, Springer-Verlag, New York, 2003.
[2] S.L. Campbell and C.D. Meyer. Generalized Inverses of Linear Transformations. Pitman, London, 1979; Dover Publications, Inc., New York, 1991.
[3] N. Castro-González and E. Dopazo. Representations of the Drazin inverse for a class of block matrices. Linear Algebra and its Applications, 400:253–269, 2005.
[4] N. Castro-González, E. Dopazo, and J. Robles. Formulas for the Drazin inverse of special block matrices. Applied Mathematics and Computation, 174:252–270, 2006.
[5] M. Catral, D.D. Olesky, and P. van den Driessche. Group inverses of matrices with path graphs. Electronic Journal of Linear Algebra, 17:219–233, 2008.
[6] J. Chen, Z. Xu, and Y. Wei. Representations for the Drazin inverse of the sum P+Q+R+S and its applications. Linear Algebra and its Applications, 430:438–454, 2009.
[7] D.S. Cvetković-Ilić. A note on the representation for the Drazin inverse of 2×2 block matrices. Linear Algebra and its Applications, 429:242–248, 2008.
[8] D.S. Cvetković-Ilić, J. Chen, and Z. Xu. Explicit representations of the Drazin inverse of block matrix and modified matrix. Linear and Multilinear Algebra, 2008. Available online at http://www.informaworld.com/10.1080/03081080701772830.
[9] R. Hartwig, X. Li, and Y. Wei. Representations for the Drazin inverse of a 2×2 block matrix. SIAM Journal on Matrix Analysis and Applications, 27:757–771, 2005.
[10] R.A. Horn and C.R. Johnson. Matrix Analysis. Cambridge University Press, New York, 1985.
[11] X. Li and Y. Wei. A note on the representations for the Drazin inverse of 2×2 block matrices. Linear Algebra and its Applications, 423:332–338, 2007.
[12] Y. Wei. Expressions for the Drazin inverse of a 2×2 block matrix. Linear and Multilinear Algebra.