vol20_pp303-313. 117KB Jun 04 2011 12:06:08 AM

(1)

PERTURBATIONS OF FUNCTIONS OF

DIAGONALIZABLE MATRICES∗

MICHAEL I. GIL’†

Abstract. LetAand ˜Aben×_ndiagonalizable matrices and_f be a function defined on their spectra. In the present paper, bounds for the norm off(A)−f( ˜A) are established. Applications to differential equations are also discussed.

Key words. Matrix valued functions, Perturbations, Similarity of matrices, Diagonalizable matrices.

AMS subject classifications.15A54, 15A45, 15A60.

1. Introduction and statement of the main result. LetCn _{be a Euclidean}

space with the scalar product (·,·), Euclidean normk·k=p

(·,·) and identity operator

I. Aand ˜Aaren×nmatrices with eigenvaluesλj and ˜λj (j= 1, . . . , n), respectively.

σ(A) denotes the spectrum of A, A∗ _{is the adjoint to} _A_{, and} _N₂₍_A_{) is the}

Hilbert-Schmidt (Frobenius) norm ofA: N2

2(A) = Trace(A∗A).

In the sequel, it is assumed that each of the matrices A and ˜A has n linearly

independent eigenvectors, and therefore, these matrices are diagonalizable. In other words, the eigenvalues of these matrices are semi-simple.

Letf be a scalar function defined on σ(A)∪σ( ˜A). The aim of this paper is to

establish inequalities for the norm off(A)−f( ˜A). The literature on perturbations

of matrix valued functions is very rich but mainly, perturbations of matrix functions of a complex argument and matrix functions of Hermitian matrices were considered, cf. [1, 11, 13, 14, 16, 18]. The matrix valued functions of a non-Hermitian argument have been investigated essentially less, although they are very important for various applications; see the book [10].

The following quantity plays an essential role hereafter:

g(A) :=

"

N2 2(A)−

n X

k=1

|λk|2

#1/2

.

∗_{Received by the editors December 8, 2009. Accepted for publication April 19, 2010. Handling} Editor: Peter Lancaster.

†_{Department of Mathematics, Ben Gurion University of the Negev, P.0. Box 653, Beer-Sheva} 84105, Israel ([email protected]). Supported by the Kameah fund of the Israel.

(2)

g(A) enjoys the following properties:

g2(A)≤2N22(AI) (AI = (A−A∗)/2i) and g2(A)≤N22(A)− |Trace(A2)|, (1.1)

cf. [8, Section 2.1]. IfA is normal, theng(A) = 0. Denote byµj, j = 1, . . . , m≤n,

the distinct eigenvalues ofA, and bypj the algebraic multiplicity ofµj. In particular,

one can writeµ1 =λ1 =· · · =λp1, µ2 =λp1+1 =· · · =λp1+p2, etc. Similarly, ˜µj,

j = 1, . . . ,m˜ ≤ n, are the distinct eigenvalues of ˜A, and ˜pj denotes the algebraic

multiplicity of ˜µj.

Letδj be the half-distance fromµj to the other eigenvalues ofA, namely,

δj:= min

k=1,...,m;k6=j|µj−µk|/2>0.

Similarly,

˜

δj:= min

k=1,...,m˜;k6=j|µ˜j−µ˜k|/2>0.

Put

β(A) :=

m X

j=1 pj

n₋1

X

k=0 gk₍_A₎

δk j

√

k! and β( ˜A) :=

˜

m X

j=1

˜

pj n₋1

X

k=0 gk_{( ˜}_A₎

˜

δk j

√

k!.

Hereg0₍_A_{) =}_δ0

j = 1 and ˜g0(A) = ˜δ0j = 1. According to (1.1),

β(A)≤ m X

j=1 pj

n−1

X

k=0

(√2N2(AI))k

δk j

√

k! .

In what follows, we put

f(λk)−f(˜λj)

λk−λ˜j

= 0 if λk= ˜λj.

Now we are in a position to formulate our main result.

Theorem 1.1. _Let_A_and_A˜_be_n_×_n_{diagonalizable matrices and}_f _{be a function}

defined on σ(A)∪σ( ˜A). Then the inequalities

N2(f(A)−f( ˜A))≤β(A)β( ˜A) max

j,k

f(λk)−f(˜λj)

λk−˜λj

N2(A−A˜) (1.2)

and

N2(f(A)−f( ˜A))≤β(A)β( ˜A) max

j,k |f(λk)−f(˜λj)| (1.3)

are valid.

The proof of this theorem is divided into lemmas which are presented in the next two sections. The importance of Theorem 1.1 lies in the fact that the right-hand sides

of inequalities (1.2) and (1.3) only involve universal quantities calculated forAand ˜A,

and the values of the functionf on the spectraσ(A) andσ( ˜A), but e.g. no matrices

(3)

2. The basic lemma. LetT and ˜T be the invertible matrices performing

sim-ilarities ofAand ˜Ato diagonal ones, that is,

T−1_AT ₌_S, ₍₂_.₁₎

and

˜

T−1_A_˜_T_˜_{= ˜}_S, ₍₂_.₂₎

where S = diag (λj) and ˜S = diag (˜λj). Everywhere below, {dk}nk=1 is the standard

orthonormal basis and kBk = sup {kBhk/khk : h ∈ Cn_} _{for an} _n_×_n _matrix _B_.

Recall thatN2(AB)≤ kBkN2(A).

We begin with the following lemma.

Lemma 2.1. _{Let the hypotheses of Theorem 1.1 hold. Then}

N22(f(A)−f( ˜A))≤ kT˜−1k2kTk2

n X

j,k=1

|(f(λk)−f(˜λj))(T−1T d˜ j, dk)|2. (2.3)

Proof. We have

N22(A−A˜) =N22(T ST−1−T˜S˜T˜−1) =N22(T ST−1T˜T˜−1−T T−1T˜S˜T˜−1)

=N22(T(ST−1T˜−T−1T˜S˜) ˜T−1)≤ kT˜−1k2kTk2N22(ST−1T˜−T−1T˜S˜).

But

N2

2(ST−1T˜−T−1T˜S˜) =

n X

j,k=1

|(ST−1_T_˜₋_T−1_T_˜_S_˜₎_d

j, dk)|2

=

n X

j,k=1

|(T−1_{T d}_˜

j, S∗dk)−(T−1T˜Sd˜ j, dk)|2=J, (2.4)

where

J:=

n X

j,k=1

|(λk−λ˜j)(T−1T d˜ j, dk)|2.

So,

(4)

Similarly, taking into account that

T−1_f₍_A₎_T ₌_f₍_S_{) = diag (}_f₍_λ

j)) and T˜−1f( ˜A) ˜T =f( ˜S) = diag (f(˜λj)),

we get (2.3), as claimed.

¿From the previous lemma we obtain at once:

Corollary 2.2. _{Under the hypotheses of Theorem 1.1, we have}

N2(f(A)−f( ˜A))≤κT˜κT[ n X

j,k=1

|f(λk)−f(˜λj)|2]1/2,

where

κT :=kTkkT−1k and κ˜T :=kT˜kkT˜−1k.

Recall that

f(λk)−f(˜λj)

λk−λ˜j

= 0 if λk= ˜λj.

Note that (2.3) implies

N2

2(f(A)−f( ˜A))≤ kT˜−1k2kTk2

n X

j,k=1

|f(λk)−f(˜λj)

λk−˜λj

(λk−λ˜j)(T−1T d˜ j, dk)|2

≤ kT˜−1k2kTk2max

j,k

f(λk)−f(˜λj)

λk−λ˜j

2

J. (2.5)

It follows from (2.4) that

J =N22(T−1AT˜−T−1A˜T˜)≤ kT−1k2kT˜k2N22(A−A˜).

Thus, (2.5) yields:

Corollary 2.3. _{Under the hypotheses of Theorem 1.1, the inequality}

N2(f(A)−f( ˜A))≤κTκ˜TN2(A−A˜) max

j,k

f(λk)−f(˜λj)

λk−λ˜j

(2.6)

is true.

Since

f(λk)−f(˜λj)

λk−λ˜j

≤ sup s∈co(A,A˜)

(5)

whereco(A,A˜) is the closed convex hull of the eigenvalues of the both matricesAand ˜

A, cf. [17], we have:

Corollary 2.4. _Let _A _and _A˜ _be _n_×_n _{diagonalizable matrices and} _f _{be a}

function defined and differentiable on co(A,A˜). Then

N2(f(A)−f( ˜A))≤κTκ˜T max s∈co(A,A˜)|f

′₍_s₎_|_N

2(A−A˜).

We need also the following corollary of Lemma 2.1.

Corollary 2.5. _{Under the hypotheses of Theorem 1.1, it holds that}

N2(f(A)−f( ˜A))≤ kT˜−1kkTkmax

j,k |f(λk)−f(˜λj)|[ n X

j=1

kT d˜ jk2 n X

k=1

k(T−1)∗dkk2]1/2.

(2.7)

3. Proof of Theorem 1.1. LetQj be the Riesz projection forµj:

Qj=−

1

2πi

Z

Lj

Rλ(A)dλ, (3.1)

whereLj :={z∈C:|z−µj|=δj}.

Lemma 3.1. _{The inequality}

kQjk ≤ n−1

X

k=0 gk₍_A₎

δk j

√

k!

is true.

Proof. By the Schur theorem, there is an orthonormal basis{ek}, in whichA=

D+V, where D = diag (λj) is a normal matrix (the diagonal part of A) and V

is an upper triangular nilpotent matrix (the nilpotent part of A). Let Pj be the

eigenprojection ofD corresponding toµj. Then by (3.1),

Qj−Pj=− 1

2πi

Z

Lj

(Rλ(A)−Rλ(D))dλ= 1

2πi

Z

Lj

Rλ(A)V Rλ(D)dλ

sinceV =A−D. But

Rλ(A) = (D+V −Iλ)−1= (I+Rλ(D)V)−1Rλ(D).

Consequently,

Rλ(A) = n−1

X

k=0

(6)

So, we have

Qj−Pj = n₋1

X

k=1

Ck, (3.2)

where

Ck = (−1)k

1

2πi

Z

Lj

(Rλ(D)V)kRλ(D)dλ.

By Theorem 2.5.1 of [8], we get

k(Rλ(D)V)kk ≤ N k

2(Rλ(D)V) √

k! ≤

Nk

2(V) ρk₍_{A, λ}₎√_k_!,

whereρ(A, λ) =mink|λ−λk|. In addition, directly from the definition ofg(A), when

the Hilbert-Schmidt norm is calculated in the basis{ek}, we haveN2(V) =g(A). So,

k(Rλ(D)V)kk ≤ g k₍_A₎

δk j

√

k! (λ∈Lj),

and therefore,

kCkk ≤ 1

2π

Z

Lj

k(Rλ(D)V)kRλ(D)k|dλ| ≤ g k₍_A₎

δ_jk+1√k! 1

2π

Z

Lj

|dλ|= g

k₍_A₎

δk j

√

k!.

Hence,

kQj−Pjk ≤ n−1

X

k=1

kCkk ≤ n−1

X

k=1 gk₍_A₎

δk j

√

k!, (3.3)

and therefore,

kQjk ≤ kPjk+kQj−Pjk ≤ n₋1

X

k=0 gk₍_A₎

δk j

√

k!,

as claimed.

Let{vk}nk=1 be a sequence ofthe eigenvectorsofA, and{uk}nk=1bethe

biorthog-onal sequence: (vj, uk) = 0 (j6=k), (vj, uj) = 1 (j, k= 1, . . . , n). So

A=

m X

k=1

µkQk= n X

k=1

λk(·, uk)vk. (3.4)

Rearranging these biorthogonal sequences, we can write

Qj = pj

X

k=1

(7)

where{ujk}pj

k=1 and{vjk}

pj

k=1 are biorthogonal: (vjk, ujm) = 0 (m6=k), (vjk, ujk) =

1 (m, k= 1, . . . , pj). Observe that we can always choose these systems so thatkujkk= kvjkk.

Lemma 3.2. _Let _k_u_jkk₌_k_v_jkk_{. Then we have}

kujkk2_{≤ k}_Q

jk (k= 1, . . . , pj).

Proof. Clearly,Qjujk= (Qjujk, ujk)vjk. So,

(Qjujk, vjk) = (ujk, ujk)(vjk, vjk) =kujkk4.

Hence,kujkk4≤ kQjkkujkk2, as claimed.

Lemmas 3.1 and 3.2 imply:

Corollary 3.3. _{The inequalities}

kujsk2≤ n−1

X

k=0 gk₍_A₎

δk j

√

k! (s= 1,2, . . . , pj)

are valid.

Again, let{dk}be the standard orthonormal basis.

Lemma 3.4. _Let _A_{be an} _n_×_n _{diagonalizable matrix. Then the operator}

T =

n X

k=1

(·, dk)vk (3.5)

has the inverse one defined by

T−1=

n X

k=1

(·, uk)dk, (3.6)

and (2.1) holds.

Proof. Indeed, we can write out

T−1_T ₌

n X

j=1 dj

n X

k=1

(·, dk)(vk, uj) = n X

j=1

dj(·, dj) =I

and

T T−1=

n X

k=1

(·, uk) n X

j=1

(dk, dj)vj = n X

k=1

(8)

Moreover,

AT =

n X

j=1

vjλj(·, dj) and S=

n X

k=1

λk(·, dk)dk.

Hence,

T−1_AT ₌

n X

k=1 dk

n X

j=1

(vj, uk)λj(·, dj) = n X

j=1

djλj(·, dj) =S.

So, (2.1) really holds.

Lemma 3.5. _Let _T _{be defined by (3.5). Then}

kTk2≤ m X

j=1

pjkQjk and kT−1k2≤ m X

j=1

pjkQjk,

and therefore,

κT ≤ m X

j=1

pjkQjk.

Proof. Due to the previous lemma

kT−1xk2=

n X

k=1

|(x, uk)|2≤ kxk2 n X

k=1

kukk2 (x∈Cn). (3.7)

By the Schwarz inequality,

(T x, T x) =

n X

k=1

n X

s=1

(x, dk)(vk, vs)(x, ds)

≤ n X

k=1

|(x, dk)|2[ n X

s,k=1

|(vk, vs)|2]1/2≤ kxk2 n X

s=1

kvsk2. (3.8)

But by Lemma 3.2,

n X

s=1

kusk2= n X

s=1

kvsk2= m X

j=1

pj

X

k=1

kvjkk2≤ m X

j=1

pjkQjk.

Now (3.7) and (3.8) yield the required result.

(9)

Corollary 3.6. _{The inequality}_κ_T _≤_β₍_A₎ _{is true.}

Lemma 3.7. _{Under the hypotheses of Theorem 1.1, we have}

N2(f(A)−f( ˜A))≤ kT˜−1kkTkmax

j,k |f(λk)−f(˜λj)|[

˜

m X

j=1

˜

pjkQ˜jk m X

k=1

pkkQkk]1/2,

whereQk,Q˜j are the eigenprojections of A andA˜, respectively.

Proof. By (3.6), we have

(T−1)∗ =

n X

k=1

(·, dk)uk.

Moreover, due to Lemma 3.2,

n X

j=1

k(T−1₎_∗_d

jk2=

n X

k=1

kukk2≤ m X

k=1

pkkQkk.

Similarly,

n X

k=1

kT d˜ kk2≤ m X

k=1

pkkQ˜kk.

Now (2.7) implies the required result.

Thanks to Lemmas 3.5 and 3.7 we get the inequality

N2(f(A)−f( ˜A))≤max

j,k |f(λk)−f(˜λj)|

˜

m X

j=1

˜

pjkQ˜jk m X

k=1

pkkQkk. (3.9)

Proof of Theorem 1.1: Inequality (1.2) is due to (2.6) and Corollary 3.6. Inequal-ity (1.3) is due to (3.9) and Lemma 3.1.

4. Applications. Let us consider the two differential equations

du

dt =Au (t >0) and du˜

dt = ˜Au˜ (t >0),

whose solutions are u(t) and ˜u(t), respectively. Let us take the initial conditions

u(0) = ˜u(0)∈Cn_{. Then we have}

u(t)−u˜(t) = (exp [At]−exp [ ˜At])u(0).

By (1.2),

(10)

≤te N2(A−A)β(A)β( ˜A)ku(0)k (t≥0),

where

α= max{max

k Re λk,maxj Re

˜

λj}.

Furthermore, let 06∈σ(A)∪σ(A). Then the function

A−1/2 _{sin (}_A1/2_t_{) (}_{t >}₀₎

is the Green function to the Cauchy problem for the equation

d2_w

dt2 +Aw= 0 (t >0).

Simultaneously, consider the equation

d2_w_˜

dt2 + ˜Aw˜= 0 (t >0),

and take the initial conditions

w(0) = ˜w(0)∈Cn _and _w′_{(0) = ˜}_w′_{(0) = 0}_.

Then we have

w(t)−w˜(t) = [A−1/2 _{sin (}_A1/2_t₎

−A˜−1/2 _{sin ( ˜}_A1/2_t_)]_w₍₀₎_.

By (1.2),

kw(t)−w˜(t)k ≤β(A)β( ˜A)N2(A−A˜) max

j,k

sin (_√t√λk)

λk −

sin (_√t√λ˜j)

˜

λj λk−˜λj

(t >0).

Here one can take an arbitrary branch of the roots.

Acknowledgment. I am very grateful to the referee of this paper for his really helpful remarks.

REFERENCES

[1] R. Bhatia.Matrix Analysis. Springer, Berlin, 1997.

(11)

[3] K. Du. Note on structured indefinite perturbations to Hermitian matrices. J. Comput. Appl. Math., 202(2):258–265, 2007.

[4] P. Forrester and E. Rains. Jacobians and rank 1 perturbations relating to unitary Hessenberg matrices. Int. Math. Res. Not. 2006, Article ID 48306, 2006.

[5] B. Fritzsche, B. Kirstein, and A. Lasarow. On Schwarz-Pick-Potapov block matrices of matrix-valued functions which belong to the extended Potapov class.Analysis (Munich), 22(3):243–263, 2002.

[6] B. Fritzsche, B. Kirstein, and A. Lasarow. Orthogonal rational matrix-valued functions on the unit circle: Recurrence relations and a Favard-type theorem.Math. Nachr., 279(5-6):513– 542, 2006.

[7] M.I. Gil’. Perturbations of simple eigenvectors of linear operators. Manuscripta Mathematica, 100:213–219, 1999.

[8] M.I. Gil’.Operator Functions and Localization of Spectra. Lecture Notes In Mathematics, vol. 1830, Springer-Verlag, Berlin, 2003.

[9] L. Han, M. Neumann, and M. Tsatsomeros. Spectral radii of fixed Frobenius norm perturba-tions of nonnegative matrices. SIAM J. Matrix Anal. Appl., 21(1):79–92, 1999.

[10] N.J. Higham. Functions of Matrices: Theory and Computations. Society for Industrial and Applied Mathematics, Philadelfia, 2008.

[11] R. Hryniv and P. Lancaster. On the perturbation of analytic matrix functions.Integral Equa-tions Operator Theory, 34(3):325–338, 1999.

[12] T. Jiang, X. Cheng, and L. Chen. An algebraic relation between consimilarity and similarity of complex matrices and its applications.J. Phys. A, Math. Gen., 39(29):9215–9222, 2006. [13] P. Lancaster, A.S. Markus, and F. Zhou. Perturbation theory for analytic matrix functions:

The semisimple case.SIAM J. Matrix Anal. Appl., 25(3):606–626, 2003.

[14] H. Langer and B. Najman. Remarks on the perturbation of analytic matrix functions. III.

Integral Equations Operator Theory, 15(5):796–806, 1992.

[15] A. Lasarow. Dual Szego pairs of sequences of rational matrix-valued functions. Int. J. Math. Math. Sci. 2006, Article ID 23723, 2006.

[16] A. Mikhailova, B. Pavlov, and L. Prokhorov. Intermediate Hamiltonian via Glazman’s splitting and analytic perturbation for meromorphic matrix-functions.Math. Nachr., 280(12):1376– 1416, 2007.

[17] A.M. Ostrowski. Solution of Equations in Euclidean and Banach Spaces. Academic Press, New York, 1973.