• Tidak ada hasil yang ditemukan

vol20_pp303-313. 117KB Jun 04 2011 12:06:08 AM

N/A
N/A
Protected

Academic year: 2017

Membagikan "vol20_pp303-313. 117KB Jun 04 2011 12:06:08 AM"

Copied!
11
0
0

Teks penuh

(1)

PERTURBATIONS OF FUNCTIONS OF

DIAGONALIZABLE MATRICES∗

MICHAEL I. GIL’†

Abstract. LetAand ˜Aben×ndiagonalizable matrices andf be a function defined on their spectra. In the present paper, bounds for the norm off(A)−f( ˜A) are established. Applications to differential equations are also discussed.

Key words. Matrix valued functions, Perturbations, Similarity of matrices, Diagonalizable matrices.

AMS subject classifications.15A54, 15A45, 15A60.

1. Introduction and statement of the main result. LetCn be a Euclidean

space with the scalar product (·,·), Euclidean normk·k=p

(·,·) and identity operator

I. Aand ˜Aaren×nmatrices with eigenvaluesλj and ˜λj (j= 1, . . . , n), respectively.

σ(A) denotes the spectrum of A, A∗ is the adjoint to A, and N2(A) is the

Hilbert-Schmidt (Frobenius) norm ofA: N2

2(A) = Trace(A∗A).

In the sequel, it is assumed that each of the matrices A and ˜A has n linearly

independent eigenvectors, and therefore, these matrices are diagonalizable. In other words, the eigenvalues of these matrices are semi-simple.

Letf be a scalar function defined on σ(A)∪σ( ˜A). The aim of this paper is to

establish inequalities for the norm off(A)−f( ˜A). The literature on perturbations

of matrix valued functions is very rich but mainly, perturbations of matrix functions of a complex argument and matrix functions of Hermitian matrices were considered, cf. [1, 11, 13, 14, 16, 18]. The matrix valued functions of a non-Hermitian argument have been investigated essentially less, although they are very important for various applications; see the book [10].

The following quantity plays an essential role hereafter:

g(A) :=

"

N2 2(A)−

n X

k=1

|λk|2

#1/2

.

Received by the editors December 8, 2009. Accepted for publication April 19, 2010. Handling Editor: Peter Lancaster.

Department of Mathematics, Ben Gurion University of the Negev, P.0. Box 653, Beer-Sheva 84105, Israel ([email protected]). Supported by the Kameah fund of the Israel.

(2)

g(A) enjoys the following properties:

g2(A)≤2N22(AI) (AI = (A−A∗)/2i) and g2(A)≤N22(A)− |Trace(A2)|, (1.1)

cf. [8, Section 2.1]. IfA is normal, theng(A) = 0. Denote byµj, j = 1, . . . , m≤n,

the distinct eigenvalues ofA, and bypj the algebraic multiplicity ofµj. In particular,

one can writeµ1 =λ1 =· · · =λp1, µ2 =λp1+1 =· · · =λp1+p2, etc. Similarly, ˜µj,

j = 1, . . . ,m˜ ≤ n, are the distinct eigenvalues of ˜A, and ˜pj denotes the algebraic

multiplicity of ˜µj.

Letδj be the half-distance fromµj to the other eigenvalues ofA, namely,

δj:= min

k=1,...,m;k6=j|µj−µk|/2>0.

Similarly,

˜

δj:= min

k=1,...,m˜;k6=j|µ˜j−µ˜k|/2>0.

Put

β(A) :=

m X

j=1 pj

n1

X

k=0 gk(A)

δk j

k! and β( ˜A) :=

˜

m X

j=1

˜

pj n1

X

k=0 gk( ˜A)

˜

δk j

k!.

Hereg0(A) =δ0

j = 1 and ˜g0(A) = ˜δ0j = 1. According to (1.1),

β(A)≤ m X

j=1 pj

n−1

X

k=0

(√2N2(AI))k

δk j

k! .

In what follows, we put

f(λk)−f(˜λj)

λk−λ˜j

= 0 if λk= ˜λj.

Now we are in a position to formulate our main result.

Theorem 1.1. LetAandA˜ben×ndiagonalizable matrices andf be a function

defined on σ(A)∪σ( ˜A). Then the inequalities

N2(f(A)−f( ˜A))≤β(A)β( ˜A) max

j,k

f(λk)−f(˜λj)

λk−˜λj

N2(A−A˜) (1.2)

and

N2(f(A)−f( ˜A))≤β(A)β( ˜A) max

j,k |f(λk)−f(˜λj)| (1.3)

are valid.

The proof of this theorem is divided into lemmas which are presented in the next two sections. The importance of Theorem 1.1 lies in the fact that the right-hand sides

of inequalities (1.2) and (1.3) only involve universal quantities calculated forAand ˜A,

and the values of the functionf on the spectraσ(A) andσ( ˜A), but e.g. no matrices

(3)

2. The basic lemma. LetT and ˜T be the invertible matrices performing

sim-ilarities ofAand ˜Ato diagonal ones, that is,

T−1AT =S, (2.1)

and

˜

T−1A˜T˜= ˜S, (2.2)

where S = diag (λj) and ˜S = diag (˜λj). Everywhere below, {dk}nk=1 is the standard

orthonormal basis and kBk = sup {kBhk/khk : h ∈ Cn} for an n×n matrix B.

Recall thatN2(AB)≤ kBkN2(A).

We begin with the following lemma.

Lemma 2.1. Let the hypotheses of Theorem 1.1 hold. Then

N22(f(A)−f( ˜A))≤ kT˜−1k2kTk2

n X

j,k=1

|(f(λk)−f(˜λj))(T−1T d˜ j, dk)|2. (2.3)

Proof. We have

N22(A−A˜) =N22(T ST−1−T˜S˜T˜−1) =N22(T ST−1T˜T˜−1−T T−1T˜S˜T˜−1)

=N22(T(ST−1T˜−T−1T˜S˜) ˜T−1)≤ kT˜−1k2kTk2N22(ST−1T˜−T−1T˜S˜).

But

N2

2(ST−1T˜−T−1T˜S˜) =

n X

j,k=1

|(ST−1T˜T−1T˜S˜)d

j, dk)|2

=

n X

j,k=1

|(T−1T d˜

j, S∗dk)−(T−1T˜Sd˜ j, dk)|2=J, (2.4)

where

J:=

n X

j,k=1

|(λk−λ˜j)(T−1T d˜ j, dk)|2.

So,

(4)

Similarly, taking into account that

T−1f(A)T =f(S) = diag (f(λ

j)) and T˜−1f( ˜A) ˜T =f( ˜S) = diag (f(˜λj)),

we get (2.3), as claimed.

¿From the previous lemma we obtain at once:

Corollary 2.2. Under the hypotheses of Theorem 1.1, we have

N2(f(A)−f( ˜A))≤κT˜κT[ n X

j,k=1

|f(λk)−f(˜λj)|2]1/2,

where

κT :=kTkkT−1k and κ˜T :=kT˜kkT˜−1k.

Recall that

f(λk)−f(˜λj)

λk−λ˜j

= 0 if λk= ˜λj.

Note that (2.3) implies

N2

2(f(A)−f( ˜A))≤ kT˜−1k2kTk2

n X

j,k=1

|f(λk)−f(˜λj)

λk−˜λj

(λk−λ˜j)(T−1T d˜ j, dk)|2

≤ kT˜−1k2kTk2max

j,k

f(λk)−f(˜λj)

λk−λ˜j

2

J. (2.5)

It follows from (2.4) that

J =N22(T−1AT˜−T−1A˜T˜)≤ kT−1k2kT˜k2N22(A−A˜).

Thus, (2.5) yields:

Corollary 2.3. Under the hypotheses of Theorem 1.1, the inequality

N2(f(A)−f( ˜A))≤κTκ˜TN2(A−A˜) max

j,k

f(λk)−f(˜λj)

λk−λ˜j

(2.6)

is true.

Since

f(λk)−f(˜λj)

λk−λ˜j

≤ sup s∈co(A,A˜)

(5)

whereco(A,A˜) is the closed convex hull of the eigenvalues of the both matricesAand ˜

A, cf. [17], we have:

Corollary 2.4. Let A and A˜ be n×n diagonalizable matrices and f be a

function defined and differentiable on co(A,A˜). Then

N2(f(A)−f( ˜A))≤κTκ˜T max s∈co(A,A˜)|f

(s)|N

2(A−A˜).

We need also the following corollary of Lemma 2.1.

Corollary 2.5. Under the hypotheses of Theorem 1.1, it holds that

N2(f(A)−f( ˜A))≤ kT˜−1kkTkmax

j,k |f(λk)−f(˜λj)|[ n X

j=1

kT d˜ jk2 n X

k=1

k(T−1)∗dkk2]1/2.

(2.7)

3. Proof of Theorem 1.1. LetQj be the Riesz projection forµj:

Qj=−

1

2πi

Z

Lj

Rλ(A)dλ, (3.1)

whereLj :={z∈C:|z−µj|=δj}.

Lemma 3.1. The inequality

kQjk ≤ n−1

X

k=0 gk(A)

δk j

k!

is true.

Proof. By the Schur theorem, there is an orthonormal basis{ek}, in whichA=

D+V, where D = diag (λj) is a normal matrix (the diagonal part of A) and V

is an upper triangular nilpotent matrix (the nilpotent part of A). Let Pj be the

eigenprojection ofD corresponding toµj. Then by (3.1),

Qj−Pj=− 1

2πi

Z

Lj

(Rλ(A)−Rλ(D))dλ= 1

2πi

Z

Lj

Rλ(A)V Rλ(D)dλ

sinceV =A−D. But

Rλ(A) = (D+V −Iλ)−1= (I+Rλ(D)V)−1Rλ(D).

Consequently,

Rλ(A) = n−1

X

k=0

(6)

So, we have

Qj−Pj = n1

X

k=1

Ck, (3.2)

where

Ck = (−1)k

1

2πi

Z

Lj

(Rλ(D)V)kRλ(D)dλ.

By Theorem 2.5.1 of [8], we get

k(Rλ(D)V)kk ≤ N k

2(Rλ(D)V) √

k! ≤

Nk

2(V) ρk(A, λ)k!,

whereρ(A, λ) =mink|λ−λk|. In addition, directly from the definition ofg(A), when

the Hilbert-Schmidt norm is calculated in the basis{ek}, we haveN2(V) =g(A). So,

k(Rλ(D)V)kk ≤ g k(A)

δk j

k! (λ∈Lj),

and therefore,

kCkk ≤ 1

Z

Lj

k(Rλ(D)V)kRλ(D)k|dλ| ≤ g k(A)

δjk+1√k! 1

Z

Lj

|dλ|= g

k(A)

δk j

k!.

Hence,

kQj−Pjk ≤ n−1

X

k=1

kCkk ≤ n−1

X

k=1 gk(A)

δk j

k!, (3.3)

and therefore,

kQjk ≤ kPjk+kQj−Pjk ≤ n1

X

k=0 gk(A)

δk j

k!,

as claimed.

Let{vk}nk=1 be a sequence ofthe eigenvectorsofA, and{uk}nk=1bethe

biorthog-onal sequence: (vj, uk) = 0 (j6=k), (vj, uj) = 1 (j, k= 1, . . . , n). So

A=

m X

k=1

µkQk= n X

k=1

λk(·, uk)vk. (3.4)

Rearranging these biorthogonal sequences, we can write

Qj = pj

X

k=1

(7)

where{ujk}pj

k=1 and{vjk}

pj

k=1 are biorthogonal: (vjk, ujm) = 0 (m6=k), (vjk, ujk) =

1 (m, k= 1, . . . , pj). Observe that we can always choose these systems so thatkujkk= kvjkk.

Lemma 3.2. Let kujkk=kvjkk. Then we have

kujkk2≤ kQ

jk (k= 1, . . . , pj).

Proof. Clearly,Qjujk= (Qjujk, ujk)vjk. So,

(Qjujk, vjk) = (ujk, ujk)(vjk, vjk) =kujkk4.

Hence,kujkk4≤ kQjkkujkk2, as claimed.

Lemmas 3.1 and 3.2 imply:

Corollary 3.3. The inequalities

kujsk2≤ n−1

X

k=0 gk(A)

δk j

k! (s= 1,2, . . . , pj)

are valid.

Again, let{dk}be the standard orthonormal basis.

Lemma 3.4. Let Abe an n×n diagonalizable matrix. Then the operator

T =

n X

k=1

(·, dk)vk (3.5)

has the inverse one defined by

T−1=

n X

k=1

(·, uk)dk, (3.6)

and (2.1) holds.

Proof. Indeed, we can write out

T−1T =

n X

j=1 dj

n X

k=1

(·, dk)(vk, uj) = n X

j=1

dj(·, dj) =I

and

T T−1=

n X

k=1

(·, uk) n X

j=1

(dk, dj)vj = n X

k=1

(8)

Moreover,

AT =

n X

j=1

vjλj(·, dj) and S=

n X

k=1

λk(·, dk)dk.

Hence,

T−1AT =

n X

k=1 dk

n X

j=1

(vj, uk)λj(·, dj) = n X

j=1

djλj(·, dj) =S.

So, (2.1) really holds.

Lemma 3.5. Let T be defined by (3.5). Then

kTk2≤ m X

j=1

pjkQjk and kT−1k2≤ m X

j=1

pjkQjk,

and therefore,

κT ≤ m X

j=1

pjkQjk.

Proof. Due to the previous lemma

kT−1xk2=

n X

k=1

|(x, uk)|2≤ kxk2 n X

k=1

kukk2 (x∈Cn). (3.7)

By the Schwarz inequality,

(T x, T x) =

n X

k=1

n X

s=1

(x, dk)(vk, vs)(x, ds)

≤ n X

k=1

|(x, dk)|2[ n X

s,k=1

|(vk, vs)|2]1/2≤ kxk2 n X

s=1

kvsk2. (3.8)

But by Lemma 3.2,

n X

s=1

kusk2= n X

s=1

kvsk2= m X

j=1

pj

X

k=1

kvjkk2≤ m X

j=1

pjkQjk.

Now (3.7) and (3.8) yield the required result.

(9)

Corollary 3.6. The inequalityκT β(A) is true.

Lemma 3.7. Under the hypotheses of Theorem 1.1, we have

N2(f(A)−f( ˜A))≤ kT˜−1kkTkmax

j,k |f(λk)−f(˜λj)|[

˜

m X

j=1

˜

pjkQ˜jk m X

k=1

pkkQkk]1/2,

whereQk,Q˜j are the eigenprojections of A andA˜, respectively.

Proof. By (3.6), we have

(T−1)∗ =

n X

k=1

(·, dk)uk.

Moreover, due to Lemma 3.2,

n X

j=1

k(T−1)d

jk2=

n X

k=1

kukk2≤ m X

k=1

pkkQkk.

Similarly,

n X

k=1

kT d˜ kk2≤ m X

k=1

pkkQ˜kk.

Now (2.7) implies the required result.

Thanks to Lemmas 3.5 and 3.7 we get the inequality

N2(f(A)−f( ˜A))≤max

j,k |f(λk)−f(˜λj)|

˜

m X

j=1

˜

pjkQ˜jk m X

k=1

pkkQkk. (3.9)

Proof of Theorem 1.1: Inequality (1.2) is due to (2.6) and Corollary 3.6. Inequal-ity (1.3) is due to (3.9) and Lemma 3.1.

4. Applications. Let us consider the two differential equations

du

dt =Au (t >0) and du˜

dt = ˜Au˜ (t >0),

whose solutions are u(t) and ˜u(t), respectively. Let us take the initial conditions

u(0) = ˜u(0)∈Cn. Then we have

u(t)−u˜(t) = (exp [At]−exp [ ˜At])u(0).

By (1.2),

(10)

≤te N2(A−A)β(A)β( ˜A)ku(0)k (t≥0),

where

α= max{max

k Re λk,maxj Re

˜

λj}.

Furthermore, let 06∈σ(A)∪σ(A). Then the function

A−1/2 sin (A1/2t) (t >0)

is the Green function to the Cauchy problem for the equation

d2w

dt2 +Aw= 0 (t >0).

Simultaneously, consider the equation

d2w˜

dt2 + ˜Aw˜= 0 (t >0),

and take the initial conditions

w(0) = ˜w(0)∈Cn and w(0) = ˜w(0) = 0.

Then we have

w(t)−w˜(t) = [A−1/2 sin (A1/2t)

−A˜−1/2 sin ( ˜A1/2t)]w(0).

By (1.2),

kw(t)−w˜(t)k ≤β(A)β( ˜A)N2(A−A˜) max

j,k

sin (t√λk)

λk −

sin (t√λ˜j)

˜

λj λk−˜λj

(t >0).

Here one can take an arbitrary branch of the roots.

Acknowledgment. I am very grateful to the referee of this paper for his really helpful remarks.

REFERENCES

[1] R. Bhatia.Matrix Analysis. Springer, Berlin, 1997.

(11)

[3] K. Du. Note on structured indefinite perturbations to Hermitian matrices. J. Comput. Appl. Math., 202(2):258–265, 2007.

[4] P. Forrester and E. Rains. Jacobians and rank 1 perturbations relating to unitary Hessenberg matrices. Int. Math. Res. Not. 2006, Article ID 48306, 2006.

[5] B. Fritzsche, B. Kirstein, and A. Lasarow. On Schwarz-Pick-Potapov block matrices of matrix-valued functions which belong to the extended Potapov class.Analysis (Munich), 22(3):243–263, 2002.

[6] B. Fritzsche, B. Kirstein, and A. Lasarow. Orthogonal rational matrix-valued functions on the unit circle: Recurrence relations and a Favard-type theorem.Math. Nachr., 279(5-6):513– 542, 2006.

[7] M.I. Gil’. Perturbations of simple eigenvectors of linear operators. Manuscripta Mathematica, 100:213–219, 1999.

[8] M.I. Gil’.Operator Functions and Localization of Spectra. Lecture Notes In Mathematics, vol. 1830, Springer-Verlag, Berlin, 2003.

[9] L. Han, M. Neumann, and M. Tsatsomeros. Spectral radii of fixed Frobenius norm perturba-tions of nonnegative matrices. SIAM J. Matrix Anal. Appl., 21(1):79–92, 1999.

[10] N.J. Higham. Functions of Matrices: Theory and Computations. Society for Industrial and Applied Mathematics, Philadelfia, 2008.

[11] R. Hryniv and P. Lancaster. On the perturbation of analytic matrix functions.Integral Equa-tions Operator Theory, 34(3):325–338, 1999.

[12] T. Jiang, X. Cheng, and L. Chen. An algebraic relation between consimilarity and similarity of complex matrices and its applications.J. Phys. A, Math. Gen., 39(29):9215–9222, 2006. [13] P. Lancaster, A.S. Markus, and F. Zhou. Perturbation theory for analytic matrix functions:

The semisimple case.SIAM J. Matrix Anal. Appl., 25(3):606–626, 2003.

[14] H. Langer and B. Najman. Remarks on the perturbation of analytic matrix functions. III.

Integral Equations Operator Theory, 15(5):796–806, 1992.

[15] A. Lasarow. Dual Szego pairs of sequences of rational matrix-valued functions. Int. J. Math. Math. Sci. 2006, Article ID 23723, 2006.

[16] A. Mikhailova, B. Pavlov, and L. Prokhorov. Intermediate Hamiltonian via Glazman’s splitting and analytic perturbation for meromorphic matrix-functions.Math. Nachr., 280(12):1376– 1416, 2007.

[17] A.M. Ostrowski. Solution of Equations in Euclidean and Banach Spaces. Academic Press, New York, 1973.

Referensi

Dokumen terkait

[10] Andrica, D., Todea (Aldea), M., Perfect Morse functions and some applications, Proceedings of the International Conference on Theory and Ap- plications of Mathematics

Matrix functions, Generalization of nonnegative matrices, Eventually nonnegative matrices, Eventually positive matrices, Exponentially nonnegative matrices, Eventually exponen-

A key tool to obtain these bounds is using some Hermitian matrices which are, in the sense of Loewner matrix ordering, below the Hermitian part of A or, more generally, below

Among the P -matrices are several important and well-studied matrix subclasses (see [5, 6]for more information on these classes of matrices): An (invertible) M -matrix is a

The study of the spectra of real symmetric (or Hermitian) matrices subordinated to a certain tree, and, in particular, of the number of distinct eigenvalues such acyclic matrices,

Further, we give some L¨ owner partial orders for compound matrices of Schur complements of positive semidefinite Hermitian matrices, and obtain some estimates for eigenvalues of

We show that the matrices of S have real eigenvalues, complete systems of eigenvectors and moreover they are Q -Hermitian, for Q a positive definite Hermitian operator..

In this paper we will study the spectral properties of Wigner real symmetric and Hermitian random matrices in the case when the matrix entries have heavy tails of distribution..