vol18_pp108-116. 140KB Jun 04 2011 12:06:02 AM

(1)

AN ANGLE METRIC THROUGH THE NOTION

OF GRASSMANN REPRESENTATIVE∗

GRIGORIS I. KALOGEROPOULOS†_{, ATHANASIOS D. KARAGEORGOS}†_, _AND

ATHANASIOS A. PANTELOUS†

Abstract. The present paper has two main goals. Firstly, to introduce different metric topolo-gies on the pencils (F, G) associated with autonomous singular (or regular) linear differential or difference systems. Secondly, to establish a new angle metric which is described by decomposable multi-vectors called Grassmann representatives (or Pl¨ucker coordinates) of the corresponding sub-spaces. A unified framework is provided by connecting the new results to known ones, thus aiding in the deeper understanding of various structural aspects of matrix pencils in system theory.

Key words. Angle metric, Grassmann manifold, Grassmann representative, Pl¨ucker coordi-nates, Exterior algebra.

AMS subject classifications.15A75, 15A72, 32F45, 14M15.

1. Introduction - Metric Topologies on the Relevant Pencil (F, G). The perturbation analysis of rectangular (or square) matrix pencils is quite challenging due to the fact that arbitrarily small perturbations of a pencil can result in eigenvalues and eigenvectors vanishing. However, there are applications in which properties of a pencil ensure the existence of certain eigenpairs. Motivated by this situation, in this paper we consider the generalized eigenvalue problem (both for rectangular and for square constant coefficient matrices) associated with autonomous singular (or regular) linear differential systems

F x′(t) =Gx(t),

or, in the discrete analogue, with autonomous singular (or regular) linear difference systems

F x_k+1=Gxk,

wherex∈Cn _{is the state control vector and} _{F, G}_∈_Cm×n _(or_{F, G}_∈_Cn×n_{). In the}

last few decades the perturbation analysis of such systems has gained importance, not only in the context of matrix theory, but also in numerical linear algebra and control theory; see [6], [8], [9], [11], [13].

∗_{Received by the editors November 5, 2008. Accepted for publication February 5, 2009. Handling}

Editor: Michael J. Tsatsomeros.

†_{Department of Mathematics, University of Athens, Panepistimiopolis, GR-15784 Athens, Greece}

([email protected], [email protected], [email protected]).

(2)

The present paper is divided into two main parts. The first part deals with the topological aspects, and the second part with the “relativistic” aspects of matrix pencils. The topological results have a preliminary nature. By introducing different

metric topologies on the space of pencils (F, G), a unified framework is provided

aiding in the deeper understanding of perturbation aspects. In addition, our work connects the new angle metric (which in the authors’ view is more natural) to some already known results. In particular, the angle metric is connected to the results in [8]-[11] and the notion of deflating subspaces of Stewart [6], [8], and it is shown to be equivalent to the notion of e.d. subspace.

Some useful definitions and notations are required.

Definition 1.1. For a given pair

L= (F, G)∈ Lm,n

∆

=L:L= (F, G) ;F, G ∈Cm×n_,

we define:

a) theflat matrix representation ofL, [L]_f = F −G ∈Cm×2n_{, and}

b) thesharp matrix representation ofL, [L]_s=

F −G

∈C2m×n_.

Definition 1.2. IfL= (F, G)∈ Lm,n, thenLis callednon-degeneratewhen

a) m≤n and rank [L]_f =m, or

b) m≥n and rank [L]_s=n.

Remark 1.3. In the case of the flat description, degeneracy implies zero row minimal indices, whereas, in the case of the sharp description, degeneracy implies zero column minimal indices.

Remark 1.4. Ifm=n, then non-degeneracy implies thatLis a regular pair.

By using these definitions, we can express the notion of strict equivalence as follows.

Definition 1.5. LetL, L′∈ Lm,n.

a) LandL′are calledstrict equivalent(LEHL′) if and only if there exist non-singular

matricesQ∈Fn×n _and_P _∈_Fm×m_{such that}

[L′]_f=P[L]_f

Q O

O _Q

,

or equivalently,

[L′]_s=

P O O _P

(3)

b) L and L′ _{are called} _{left strict equivalent} ₍_LEL

HL′) if and only if there exists a

non-singular matrixQ∈Fn×n _{such that}

[L′]_f =P[L]_f.

Similarly,LandL′are calledright strict equivalent(LER

HL′) if and only if there exists

a non-singular matrixQ∈Fn×n _{such that}

[L′]_s= [L]_sQ.

For the pairL= (F, G)∈ Lm,n, we can define four vector spaces as follows:

Xfr(L)

∆

=rowspanF[L]f, X c f(L)

∆

=colspanF[L]f,

Xr s(L)

∆

=rowspanF[L]s, Xsc(L)

∆

=colspanF[L]s.

It is clear that for everyL, L′ ∈ Lm,nwithLEHLL′, we haveXfr(L) =Xfr(L′) and

Xr

s(L) =Xsr(L′). Moreover, we observe thatXfr(L) andXsr(L) are invariant under

EL

H-equivalence, butXfc(L) andXsc(L) are not always invariant underEHL-equivalence.

Also, for every L, L′ _{∈ L}

m,n with LEHRL′, we see that Xfc(L) = Xfc(L′) and

Xc

s(L) =Xsc(L′). Moreover, we observe that Xfc(L) andXsc(L) are invariant under

ER

H-equivalence, butXfr(L) andXsr(L) are not always invariant underEHR-equivalence.

In the next lines, we provide a classification of the pairs considering their di-mensions. Moreover, the invariant properties of the four vector subspaces are also investigated. The following classification is a simple consequence of the definitions and observations, so far. The table below demonstrates the type of subspace and the invariant properties.

Definition 1.6. For a non-degenerate pair L = (F, G) ∈ Lm,n, the

normal-characteristic space, XN(L), is defined as follows:

(i) whenm≤n,XN(L)

∆

=Xr

f (L), which isEHL-invariant,

(ii) whenm > n,XN(L)=∆Xsc(L), which isEHR-invariant.

A straightforward consequence of (i) is that for a regular pencilL= (F, G)∈ Ln,n,

we have thatXN(L) =Xfr(L),

Xfr(L) =rowspanF[L]f =colspanF[L]tf

and

Xc

(4)

Furthermore, in the case where the pair is non-degenerate and m ≤ n, we have

rank[L]_f =m, XN(L) =Xfr(L) =rowspanF[L]f and dimXN(L) =m. Therefore,

we can identify XN(L) with a point of the Grassmann manifold Gm,F2n. Note

that ifL1EL

HL, thenXN(L1) represents the same point onGm,F2nas XN(L).

Dimensions Xr

f(L) Xsr(L) Xfc(L) Xsc(L)

Invariant spaces

m≤n/2 E

L H -invariant EL H -invariant Xc f(L) ≡ F2m_, _E_H

-invariant

Xc s(L) ≡ F2m_, _E_H

-invariant

Xr f(L),

Xr s(L).

n/2< m≤2n E

L H -invariant

Xr s(L) ≡ F2m_, _E

H -invariant

Xc f(L) ≡ F2m_, _E

H -invariant ER H -invariant Xr f(L),

Xc s(L)

n≤m/2<2m Xr

f(L) ≡ F2n_, _E

H -invariant

Xr s(L) ≡ Fn_, _E

H -invariant ER H -invariant ER H -invariant Xc f(L),

Xc s(L)

n/2< m≤2n E

L H -invariant

Xr s(L) ≡ Fn_, _E

H -invariant

Xc f(L) ≡ Fm_, _E

H -invariant ER H -invariant Xr f(L),

Xc s(L)

Regular,m=n E

L H -invariant

Xr s(L) ≡ Fn_, _E_H

-invariant

Xc f(L) ≡ Fm_, _E_H

-invariant

ER H -invariant

Xr f(L),

Xc s(L)

Now, let us consider the set of regular pairs. For a pair L = (F, G) ∈ Ln,n,

the equivalent class EL

H= L1∈ Ln,n: [L1]f =P[L]f, P ∈Fn×n,detP = 0

can be

considered as a representation of a point onGn,F2n_{. This point is fully identified}

byXN(L1), whereL1∈ EHL(L). Note that a point of G

n,F2n_{does not represent}

a pair, but only the left-equivalent class of pairs. Furthermore, we shall denote by

ΣL

1 the set of all distinct generalized eigenvalues (finite and infinite) of the pair L=

(F, G) ∈ Ln,n, and by ΣL2 the set of normalized generalized eigenvectors (Jordan

vectors included) defined for everyλ∈ΣL

1.

We defineQ=ΣL

1,ΣL2

:∀L∈ Ln,nand consider the function

f :Ln,n → Q:L→f(L)

∆

=ΣL

1,ΣL2

∈ Q.

It is obvious thatf−1_ΣL

1,ΣL2

:∀L∈ Ln,n=EHL(L), which means that f is EHL

-invariant. Thus, from the eigenvalue-eigenvector point of view, it is reasonable to

identify the whole classEL

H(L) as a point of the Grassmann manifold. The study of

(5)

be studied on the appropriate Grassmann manifold. In the next section, a new angle metric is introduced from the point of view of metric topology on the Grassmann manifold.

2. A New Angle Metric. In this session, we consider the Grassmann

man-ifolds which are associated with elements of Lm,n. In the case where m ≤ n, the

manifold Gm,F2n _{is constructed, and on the other hand, when}_m _≥_n _the

man-ifolds Gn,F2m _{is the subject of investigation. The relationship between the two}

Grassmann manifolds and the pairs ofLm,nis clarified by the following remark.

Remark 2.1. (i) For any V ∈Gm,F2n_{, there exists an} _EL

H-equivalence class

from Lm,n (m ≤ n), which is represented by a non-degenerate L= (F, G)∈ Lm,n,

such thatV =rowspan[L]_f.

(ii) For anyV ∈Gn,F2m_{, there exists an}_E_HR_{-equivalence class from}_L_m,n₍_m_≥_n_),

which is represented by a non-degenerateL= (F, G)∈ Lm,n, such that V=colspan

[L]_s.

Definition 2.2 ([8]). A real valued functiond(·,·) :Gm,R2n_×G_m,R2n_→ R+

o (m ≤ n ) is called an orthogonal invariant metric if for every V1,V2,V3 ∈

Gm,R2n _(where _V1 ₌ _X

N(L1), V2 =XN(L2), V3 = XN(L3), and L1, L2, L3 ∈

Lm,n are non-degenerate), it satisfies the following properties:

(i) d(XN(L1),XN(L2))≥0, andd(XN(L1),XN(L2)) = 0 if and only ifL1EHLL2,

(ii) d(XN(L1),XN(L2)) =d(XN(L2),XN(L1)),

(iii) d(XN(L1),XN(L2))≤d(XN(L1),XN(L3)) +d(XN(L3),XN(L2)),

(iv) dcolspan[L1]_ftU, colspan[L2]t_fU = dcolspan[L1]_ft, colspan[L2]t_f for

any orthogonal matrixU ∈Rm×m_.

Similarly, we can define the orthogonal invariant metric onGn,R2m_.

Further-more, we summarize some already known results for metrics onGm,R2n_,_m_≤_n_,

and afterwards, we introduce the new angle metric.

Theorem 2.3 ([12]). For any two points VL1,VL2∈G

m,R2n_{such that}

VL1 =X

r

f(L1) =colspanR[L1]tf and VL2 =X

r

f (L2) =colspanR[L2]tf,

if we defineX1=[L1]t_f[L1]_f− 1 2

[L1]t_f andX2=[L2]t_f[L2]_f−

1 2

[L2]t_f, then

J(VL1,VL2) = arccos

detX1Xt

2X2X1t

1

2 _(2.1)

and

dJ (VL1,VL2) = sinJ(VL1,VL2) (2.2)

(6)

Note that metrics (2.1) and (2.2) have been used extensively in [8]-[11], on the perturbation analysis of the generalized eigenvalue-eigenvector problem. These met-rics have also been related to other metmet-rics on the Grassmann manifold, for instance,

the “GAP” between subspaces, sinθ(X, Y), etc.; see [8]-[11], [13]-[14].

Now, we denoteLm,n/EHLas the quotientEHL-the collection of equivalence classes.

It is easily derived that there is an one-to-one map

ϕ:Lm,n/EHL →G

m,R2n

such that

EL

H(L)→ϕ

EL H(L)

∆

=X_N(L) =Xr

f(L1)

∆

=VL∈Gm,R2n. (2.3)

Considering map (2.3), we can always associate anm-dimensional subspaceVLofR2n

to a one-dimensional subspace of the vector space Λm_R2n _{(see [2]-[5]), or}

equiv-alently, a point on the Grassmann variety Ω (m,2n) of the projective spacePv₍R_),

v=

2n

m

−1. This one-dimensional subspace of Λm_R2n_{is described by}

decom-posable multi-vectors called Grassmann representatives (or Pl¨ucker coordinates) of

the corresponding subspaceVL, which are denoted by g(VL). It is also known that

if x, x′ ∈ Λm_R2n _{are two Grassmann representatives of} _V

L, then x′ = λx where

λ∈R_{\ {}₀_}_.

Definition 2.4. Let L = (F, G) ∈ Lm,n and VL ∈ G

m,R2n _{be a linear}

subspace ofR2n _{that corresponds to the equivalent class} _EL

H(L). If x∈Λm

_R₂_n

is

a Grassmann representative ofVL and x= [x0, x1, . . . , xv]t, v =

2n

m

−1, then

we define the normal Grassmann representative of as the decomposable multi-vector which is given by

x=

  

sign(xv)

x₂ x, if xv= 0

sign(xi)

x₂ x, if xv= 0 and xi is the first non-zero component of x

wheresign(xi) =

1, if xi >0

−1, if xi <0 .

Proposition 2.5. The normal Grassmann representativexof the subspaceVL∈

Gm,R2n_{is unique and}_x

2= 1 ( · 2 denotes the Euclidean norm). Proof. Letx′ ∈Λm_R2n_{be another Grassmann representative of}_V

(7)

thatx′₌_λx_{for some} _λ_∈_R_{\ {}₀_}_{. By Definition 2.4,}

x′ ₌

    

sign₍x′

v)

x′

2 x

′_, _if _x′

v=λxv = 0

sign(x′

i)

x′

2 x

′_, _if _x′

v =λxv= 0 and x′i =λxi

=

  

λsign(λ)sign(xv)

|λ|x₂ x, if xv= 0

λsign(λ)sign(xi)

|λ|x₂ x, if xv= 0 and xi is the first non-zero component ofx

=x ,

keeping in mind that λsign_|_λ_|(λ) = 1. Thus,xis a unique Grassmann representative of

the subspaceVL. The propertyx 2= 1 is obvious.

Definition 2.6 (The new angle metric). ForVL1,VL2 ∈G

m,R2n_{, we define}

the angle metric,∢(V_L

1,VL2), by

∢(V_L

1,VL2)

∆

= arccos|x1,x2|, (2.4)

wherex1,x2are the normal Grassmann representatives ofVL1,VL2, respectively, and

·,·denotes the inner product.

In what it follows, we denote the set of strictly increasing sequences oflintegers

(1≤l ≤m) chosen from 1,2, . . . , m; for example, Q2,m ={(1,2),(1,3),(2,3), . . .}.

Clearly, the number of the sequences ofQl,mis equal to

m l

.

Furthermore, we assume that A = [aij] ∈ Mm,n(R), whereMm,n(R) denotes

the set ofm×nmatrices overR_{. Let}_{µ, ν} _{be positive integers satisfying 1}_≤_µ_≤_m_,

1 ≤ ν ≤ n, α = (i1, . . . , iµ) ∈ Qµ,m and β = (j1, . . . jν) ∈ Qν,n. Then A[α|β] ∈

Mµ,ν(R) denotes the submatrix ofAformed by the rowsi1, i2, . . . , iµand the columns

j1, j2, . . . , jν.

Finally, for 1 ≤ l ≤min{m, n} the l-compound matrix (or l-adjugate) of A is

the

m l

×

n l

matrix whose entries are det{A[α|β]}, α ∈ Ql,m, β ∈ Ql,n,

arranged lexicographically inαandβ. This matrix is denoted byCl(A); see [2]-[5] for

more details about this interesting matrix. Moreover, it should be stressed out that a compound matrix is in fact a vector corresponding to the Grassmann representative

(or Pl¨ucker coordinates) of a space.

Proposition 2.7. The angle metric (2.4) is an orthogonal invariant metric on the Grassmann manifold Gm,R2n_.

(8)

which is a consequence of the ∆-inequality on the unit sphere. Thus, we need to

prove (iv). LetVLi =colspanR[Li]

t

f,i= 1,2, andU be am×mnon-singular matrix

with |U| =λ. Then Cm

[Li]t_fU

=Cm

[Li]t_f

|U| = λCm

[Li]t_f

, i = 1,2, and

consequently,

∢

[L1]t_fU,[L2]t_fU=∢

[L1]t_f,[L2]t_f.

Moreover, if U is an orthogonal matrix, then an orthogonal invariant metric is

ob-tained.

This angle metric onGm,R2n_{is related to the standard metrics (2.1) and (2.2).}

Theorem 2.8. For every VL1,VL2∈G

m,R2n_{, we have that} ∢(V_L

1,VL2) =J (VL1,VL2)

and

sin{∢(V_L

1,VL2)}=dJ(V1,V2) = sin{J(V1,V2)}.

Proof. By using the Binet-Cauchy theorem [3], we obtain that

det [X1Xt

2X2X1t] =Cm(X1)Cm(X2t)Cm(X2)Cm(X1t)

=Cm(X1)Cmt (X2)Cm(X2)Cmt (X1).

By Theorem 2.3, we also have

Cm(Xi) =

Cmt [Li]fCm(Li)f

−1 2

Cmt

[Li]f

= g(VLi)

g(VLi)

2

=εxti,

whereε=±1,i= 1,2.

Thus, since

detX1X2tX2X1t

=xt1x2x

t

2x1={x1,x2} 2

,

it follows that

detX1Xt

2X2X1t

1 2 ₌_|_x

1, x2|.

Then

arccosdetX1X2tX2X1t

1

2 _{= arccos}_|_x

1,x2| ⇔ J(VL1,VL2) =∢(VL1,VL2), and

sin{∢(V_L

(9)

Remark 2.9. When m =n, this angle metric is suitable for the perturbation analysis for the generalized eigenvalue-eigenvector problem.

Acknowledgments. The authors are grateful to the anonymous referees for their insightful comments and suggestions.

REFERENCES

[1] C. Giannakopoulos. Frequency Assignment Problems of Linear Multivariable Problems: An

Exterior Algebra and Algebraic Geometry Based Approach. PhD Thesis, City University,

London, 1984.

[2] W. Greub. Multilinear Algebra, 2nd edition. Universitext, Springer-Verlag, New York-Heidelberg, 1978.

[3] W.V.D. Hodge and D. Pedoe. Methods of Algebraic Geometry. Cambridge University Press, Boston, 1952.

[4] N. Karcanias and U. Ba¸ser. Exterior algebra and invariant spaces of implicit systems: the Grassmann representative approach.Kybernetika(Prague), 30:1–22, 1994.

[5] M. Marcus and H. Minc.A Survey of Matrix Theory and Matrix Inequalities. Allyn and Bacon, Boston, 1964.

[6] G.W. Stewart. On the perturbation of pseudo-inverse, projections and linear least squares problems.SIAM Review, 19:634–662, 1977.

[7] G.W. Stewart. Perturbation theory for rectangular matrix pencils. Linear Algebra and its

Applications, 208/209:297–301, 1994.

[8] G.W. Stewart and J.G. Sun. Matrix Perturbation Theory. Academic Press, Boston, 1990. [9] J.G. Sun. Invariant subspaces and generalized invariant subspaces. I. Existence and uniqueness

theorems.Mathematica Numerica Sinica(Chinese), 2(1):1–13, 1980.

[10] J.G. Sun. Invariant subspaces and generalized invariant subspaces. II. Perturbation theorems.

Mathematica Numerica Sinica(Chinese), 2(2):113–123, 1980.

[11] J.G. Sun. Perturbation analysis for the generalized eigenvalue and the generalized singular value problem.Lecture Notes in Mathematics: Matrix Pencils, 973:221–244, 1983. [12] J.G. Sun. A note on Stewart’s theorem for definite matrix pairs. Linear Algebra and its

Applications, 48:331–339, 1982.

[13] J.G. Sun. Perturbation of angles between linear subspaces. Journal of Computational

Mathe-matics, 5:58–61, 1987.

[14] Y. Zhang and L. Qiu. On the angular metrics between linear subspaces. Linear Algebra and