CONSTRUCTIONS OF TRACE ZERO SYMMETRIC STOCHASTIC
MATRICES FOR THE INVERSE EIGENVALUE PROBLEM∗
ROBERT REAMS†
Abstract. In the special case where the spectrum σ = {λ1, λ2, λ3, 0, 0, . . . , 0} has at most three nonzero eigenvalues λ1, λ2, λ3 with λ1 ≥ 0 ≥ λ2 ≥ λ3, and λ1 + λ2 + λ3 = 0, the inverse eigenvalue problem for symmetric stochastic n×n matrices is solved. Constructions are provided for the appropriate matrices where they are readily available. It is shown that when n is odd it is not possible to realize the spectrum σ with an n×n symmetric stochastic matrix when λ3 ≠ 0 and 3/(2n − 3) > λ2/λ3 ≥ 0, and it is shown that this bound is best possible.
Key words. Inverse eigenvalue problem, Symmetric stochastic matrix, Symmetric nonnegative matrix, Distance matrix.
AMS subject classifications. 15A18, 15A48, 15A51, 15A57
1. Introduction. Let e1, . . . , en denote the standard basis in Rn, so ei denotes the vector with a 1 in the ith position and zeroes elsewhere. We will denote by e the vector of all ones, i.e. e = [1, 1, . . . , 1]T ∈ Rn. A matrix A ∈ Rn×n is said to be stochastic when all of its entries are nonnegative and all its row sums are equal to 1, i.e. A has right eigenvector e corresponding to the eigenvalue 1. We will be concerned with symmetric stochastic matrices, so that these matrices are in fact doubly stochastic. Also, the eigenvalues will necessarily be real. If A ∈ Rn×n is nonnegative and has eigenvalue λ1 > 0 corresponding to the right eigenvector e, then (1/λ1)A is stochastic, and for convenience we will state our results in the form, for example, of a matrix A having eigenvector e corresponding to 1 + ǫ, where the spectrum σ = {1 + ǫ, −1, −ǫ, 0, 0, . . . , 0}, with 0 ≤ ǫ ≤ 1. We will say that σ = {λ1, . . . , λn} ⊂ R is realized as the spectrum of a matrix A in the event that the n×n matrix A has eigenvalues λ1, . . . , λn.
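As a concrete illustration of this normalization (a minimal sketch in NumPy, not part of the paper), the 3×3 matrix eeT − I is symmetric, nonnegative, and has constant row sums, so dividing by its Perron eigenvalue yields a symmetric stochastic matrix:

    import numpy as np

    # A = ee^T - I: symmetric, nonnegative, zero diagonal, constant row sums,
    # so e is a right eigenvector for lambda1 = 2.
    A = np.ones((3, 3)) - np.eye(3)
    lam1 = A.sum(axis=1)[0]             # common row sum = 2
    S = A / lam1                        # (1/lambda1)*A is symmetric (doubly) stochastic
    print(S.sum(axis=1))                # [1. 1. 1.]
    print(np.linalg.eigvalsh(S))        # approximately [-0.5, -0.5, 1.0]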
The nonnegative inverse eigenvalue problem is to find necessary and sufficient conditions that the elements of the set σ = {λ1, . . . , λn} ⊂ C are the eigenvalues of a matrix with nonnegative entries. This problem is currently unsolved except in special cases [1], [7], [8], [9], [10]. The restriction of this problem to symmetric nonnegative matrices for which the eigenvalues λ1, . . . , λn satisfy λ1 ≥ 0 ≥ λ2 ≥ · · · ≥ λn is solved in [3], where it is shown that the only necessary and sufficient condition is that Σ_{i=1}^n λi ≥ 0. Distance matrices are necessarily symmetric, nonnegative, have trace zero, and must have λ1 ≥ 0 ≥ λ2 ≥ · · · ≥ λn, although these are not sufficient conditions to be a distance matrix. We conjectured in [6], after solving numerous special cases including n = 2, 3, 4, 5, 6, that the only necessary and sufficient conditions for the existence of a distance matrix with a given σ are that λ1 ≥ 0 ≥ λ2 ≥ · · · ≥ λn and Σ_{i=1}^n λi = 0. Distance matrices with eigenvector e were previously studied in [5], although not in the context of these matrices' eigenvalues. Bounds on the eigenvalues of symmetric stochastic matrices are given in [2] and [4], although they do not provide a restriction on the eigenvalues for the matrices under consideration here.

∗ Received by the editors on 13 October 2001. Final manuscript accepted for publication on 4 September 2002. Handling Editor: Tom Laffey.
† Department of Mathematics, National University of Ireland, Galway, Galway, Ireland.
In Section 2 we provide an explicit construction of an n×n symmetric stochastic matrix which realizes the spectrum {2, −1, −1, 0, 0, . . . , 0}, followed by showing that it is not possible to realize the spectrum {1, −1, 0, 0, . . . , 0} with a symmetric stochastic matrix when n is odd, although it is possible to realize this spectrum when n is even. In Section 3 we provide explicit constructions of symmetric stochastic matrices to realize {1 + ǫ, −1, −ǫ, 0, 0, . . . , 0}, for 1 ≥ ǫ ≥ 0, when n is even. We then show that it is not possible to realize this spectrum with a symmetric stochastic matrix when 3/(2n − 3) > ǫ ≥ 0 and n is odd, although we can realize this spectrum with a symmetric stochastic matrix when 1 ≥ ǫ ≥ 3/(2n − 3) and n is odd. In the latter case we do not provide an explicit construction, instead making use of the Intermediate Value Theorem in several variables.
2. Freedom and restrictions on spectra. Lemma 2.1 will be used to establish that the matrix under consideration is nonnegative.
Lemma 2.1. Let A = (aij) ∈ Rn×n be a symmetric matrix with eigenvalues λ1, . . . , λn which satisfy λ1 ≥ 0 ≥ λ2 ≥ · · · ≥ λn. Suppose that A has eigenvector e corresponding to λ1, and that A has all diagonal entries equal to zero. Then A is a nonnegative matrix.
Proof. Write A = λ1 eeT/n + λ2 u2u2T + · · · + λn ununT, where u2, . . . , un are unit eigenvectors corresponding to λ2, . . . , λn, respectively. Then −2aij = (ei − ej)TA(ei − ej) = λ2((ei − ej)Tu2)² + · · · + λn((ei − ej)Tun)² ≤ 0, for all i, j, 1 ≤ i, j ≤ n.
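As a quick numerical sanity check of the identity used in this proof (our sketch, not part of the paper): for a symmetric matrix A with zero diagonal, (ei − ej)TA(ei − ej) = aii + ajj − 2aij = −2aij.

    import numpy as np

    rng = np.random.default_rng(0)
    B = rng.standard_normal((5, 5))
    A = B + B.T                         # symmetric
    np.fill_diagonal(A, 0.0)            # zero diagonal, as in Lemma 2.1
    i, j = 1, 3
    d = np.zeros(5); d[i] = 1.0; d[j] = -1.0        # d = e_i - e_j
    print(np.isclose(d @ A @ d, -2.0 * A[i, j]))    # True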
Our next two theorems give some foretaste of what is and isn't possible with the inverse eigenvalue problem for symmetric nonnegative matrices. Note first that a 3×3 symmetric nonnegative matrix with eigenvector e and of trace zero must be a nonnegative scalar multiple of A = eeT − I, which has spectrum σ = {2, −1, −1}.
Theorem 2.2. Let σ = {2, −1, −1, 0, 0, . . . , 0}. Then σ can be realized as the spectrum of a symmetric nonnegative matrix A ∈ Rn×n with eigenvector e corresponding to 2, for n ≥ 3.
Proof. Let u = [u1, . . . , un]T, v = [v1, . . . , vn]T ∈ Rn be given by uj = √(2/n) cos θj, vj = √(2/n) sin θj, where θj = (2π/n)j, for 1 ≤ j ≤ n. Then A = 2eeT/n − uuT − vvT is symmetric and has zero diagonal entries by construction. Since the roots of x^n − 1 are e^{(2πi/n)j}, 1 ≤ j ≤ n, then (coefficient of x^{n−1}) = 0 = Σ_{j=1}^n −e^{(2πi/n)j} and (coefficient of x^{n−2}) = 0 = Σ_{j,k=1, j≠k}^n e^{(2πi/n)j} e^{(2πi/n)k} = (Σ_{j=1}^n e^{(2πi/n)j})² − Σ_{j=1}^n e^{(2πi/n)2j}. It follows that Σ_{j=1}^n cos θj = Σ_{j=1}^n sin θj = Σ_{j=1}^n sin 2θj = 0, which tells us that uTe = vTe = uTv = 0, respectively. It also follows that Σ_{j=1}^n cos 2θj = 0, so that Σ_{j=1}^n cos²θj − Σ_{j=1}^n sin²θj = 0. We also know that Σ_{j=1}^n cos²θj + Σ_{j=1}^n sin²θj = n, so we can conclude that Σ_{j=1}^n cos²θj = Σ_{j=1}^n sin²θj = n/2. This tells us that u and v are unit vectors. Thus A has spectrum σ and is nonnegative from Lemma 2.1.
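The construction can be evaluated numerically (our sketch; it simply fills in the formulas from the proof, here for n = 7):

    import numpy as np

    n = 7
    theta = 2 * np.pi * np.arange(1, n + 1) / n     # theta_j = 2*pi*j/n
    u = np.sqrt(2 / n) * np.cos(theta)
    v = np.sqrt(2 / n) * np.sin(theta)
    e = np.ones(n)
    A = 2 * np.outer(e, e) / n - np.outer(u, u) - np.outer(v, v)
    print(np.allclose(np.diag(A), 0), (A >= -1e-12).all())   # zero diagonal, nonnegative
    print(np.round(np.sort(np.linalg.eigvalsh(A)), 6))       # -1, -1, 0, 0, 0, 0, 2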
Theorem 2.3. Let λ1 > 0 and σ = {λ1, −λ1, 0, . . . , 0}. Then σ cannot be realized by a symmetric nonnegative matrix A ∈ Rn×n with eigenvector e corresponding to λ1, when n is odd.
Proof. Suppose A is a matrix of the given form that realizes σ. Write A = λ1 eeT/n − λ1 uuT, where u = [u1, . . . , un]T ∈ Rn, ||u|| = 1 and uTe = 0. A has trace zero and is nonnegative by hypothesis, so all diagonal entries are zero, and thus 0 = λ1/n − λ1 ui², for 1 ≤ i ≤ n. But then ui = ±1/√n and 0 = Σ_{i=1}^n ui = Σ_{i=1}^n ±1/√n, which is not possible when n is odd.
Remark 2.4. The same σ of Theorem 2.3 can be realized by an n×n symmetric nonnegative matrix with eigenvector e corresponding to λ1 when n is even, by taking in the proof the unit vector u = (1/√n)[1, 1, . . . , 1, −1, −1, . . . , −1]T ∈ Rn.
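In code, a minimal sketch of this choice of u (our illustration, not the paper's):

    import numpy as np

    n, lam1 = 6, 1.0                                # any even n and lambda1 > 0
    u = np.concatenate([np.ones(n // 2), -np.ones(n // 2)]) / np.sqrt(n)
    e = np.ones(n)
    A = lam1 * np.outer(e, e) / n - lam1 * np.outer(u, u)
    print(np.allclose(np.diag(A), 0), (A >= 0).all())    # zero diagonal, nonnegative
    print(np.round(np.sort(np.linalg.eigvalsh(A)), 6))   # -1, 0, 0, 0, 0, 1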
3. At most three nonzero eigenvalues. The idea in Remark 2.4 can be extended, when n is a multiple of 4, to the case of at most three nonzero eigenvalues.
Theorem 3.1. Let σ = {1 + ǫ, −1, −ǫ, 0, 0, . . . , 0}, where 1 ≥ ǫ ≥ 0. Then σ can be realized by a symmetric nonnegative matrix A ∈ Rn×n with eigenvector e corresponding to 1 + ǫ, when n = 4m for some m.
Proof. Let u, v ∈ Rn be given by
u = (1/√n)[1, . . . , 1, 1, . . . , 1, −1, . . . , −1, −1, . . . , −1]T,
v = (1/√n)[1, . . . , 1, −1, . . . , −1, 1, . . . , 1, −1, . . . , −1]T,
where each block has m entries. Then e/√n, u and v are orthonormal. Let A = (1 + ǫ)eeT/n − uuT − ǫvvT, and note that all the diagonal entries of A are zero. Thus A has spectrum σ and is nonnegative from Lemma 2.1.
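The following numerical sketch (ours) evaluates this construction for m = 2, i.e. n = 8:

    import numpy as np

    m, eps = 2, 0.3                                 # n = 4m, any eps in [0, 1]
    n = 4 * m
    u = np.concatenate([np.ones(2 * m), -np.ones(2 * m)]) / np.sqrt(n)
    v = np.concatenate([np.ones(m), -np.ones(m), np.ones(m), -np.ones(m)]) / np.sqrt(n)
    e = np.ones(n)
    A = (1 + eps) * np.outer(e, e) / n - np.outer(u, u) - eps * np.outer(v, v)
    print(np.allclose(np.diag(A), 0), (A >= -1e-12).all())   # zero diagonal, nonnegative
    print(np.round(np.sort(np.linalg.eigvalsh(A)), 6))       # -1, -0.3, 0, ..., 0, 1.3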
However, the way we deal with the remaining cases for n even is somewhat more complicated.
Theorem 3.2. Let σ = {1 + ǫ, −1, −ǫ, 0, 0, . . . , 0}, where 1 ≥ ǫ ≥ 0. Then σ can be realized by a symmetric nonnegative matrix A ∈ Rn×n with eigenvector e corresponding to 1 + ǫ, when n = 4m + 2 for some m.
Proof. Let u = [u1, . . . , un]T, v = [v1, . . . , vn]T ∈ Rn, and A = (1 + ǫ)eeT/n − uuT − ǫvvT. We will require that 0 = (1 + ǫ)/n − ui² − ǫvi², for 1 ≤ i ≤ n, so that A has all zero diagonal entries. Let ui = √((1 + ǫ)/n) cos θi and vi = √((1 + ǫ)/(nǫ)) sin θi, where the θi's are chosen below. Our θi's must satisfy Σ_{i=1}^n cos θi = Σ_{i=1}^n sin θi = Σ_{i=1}^n cos θi sin θi = 0. So that u and v are unit vectors we must have Σ_{i=1}^n cos²θi = n/(1 + ǫ) and Σ_{i=1}^n sin²θi = nǫ/(1 + ǫ), which will be achieved if Σ_{i=1}^n cos²θi − Σ_{i=1}^n sin²θi = n(1 − ǫ)/(1 + ǫ) and Σ_{i=1}^n cos²θi + Σ_{i=1}^n sin²θi = n. So for any given ǫ ∈ [0, 1] we also require for our choice of θi's that Σ_{i=1}^n cos 2θi = n(1 − ǫ)/(1 + ǫ).
Let 0 ≤ δ < 2π/n, and choose the angles θi, 1 ≤ i ≤ n, around the origin in the plane as follows.
Let θi for 1 ≤ i ≤ m be, respectively, the 1st quadrant angles
2π/n − δ, 2(2π/n − δ), . . . , (m − 1)(2π/n − δ), m(2π/n − δ).
Let θi for m + 1 ≤ i ≤ 2m be, respectively, the 2nd quadrant angles
(m + 1)(2π/n) + mδ, (m + 2)(2π/n) + (m − 1)δ, . . . , (2m − 1)(2π/n) + 2δ, 2m(2π/n) + δ.
Let θ2m+1 = π.
Let θi for 2m + 2 ≤ i ≤ 3m + 1 be, respectively, the 3rd quadrant angles
−2m(2π/n) − δ, −(2m − 1)(2π/n) − 2δ, . . . , −(m + 2)(2π/n) − (m − 1)δ, −(m + 1)(2π/n) − mδ.
Let θi for 3m + 2 ≤ i ≤ 4m + 1 be, respectively, the 4th quadrant angles
−m(2π/n − δ), −(m − 1)(2π/n − δ), . . . , −2(2π/n − δ), −(2π/n − δ).
Let θ4m+2 = 0.
For each θi there is a θi + π in the above list, by pairing off 1st quadrant angles with appropriate 3rd quadrant angles, and pairing 2nd quadrant angles with appropriate 4th quadrant angles. Therefore, since cos(θi + π) = −cos θi, we conclude that Σ_{i=1}^n cos θi = 0.
For each θi ≠ 0 and θi ≠ π in the above list we can pair off any θi with a corresponding −θi and conclude that Σ_{i=1}^n sin θi = 0.
Similarly, Σ_{i=1}^n sin 2θi = 0.
Pairing off each θi with a θi + π in the same way as before, and since cos 2θi = cos(2θi + 2π), we must have
Σ_{i=1}^n cos 2θi = 2 + Σ_{1st,2nd,3rd,4th quadrant} cos 2θi = 2 + 2 Σ_{1st,4th quadrant} cos 2θi.
Because we can pair off each θi in the 1st quadrant with each −θi in the 4th quadrant,
Σ_{i=1}^n cos 2θi = 2 + 4 Σ_{1st quadrant} cos 2θi.
Next, using the trigonometric formula
Σ_{j=0}^n cos jα = sin((n + 1)α/2) cos(nα/2) / sin(α/2)
(found in [11], for example), where α = 2(2π/n − δ), and some simplification, we obtain
Σ_{i=1}^n cos 2θi = 2 sin((2m + 1)δ) / sin(π/(2m + 1) − δ).
Let f(δ) = 2 sin((2m + 1)δ)/sin(π/(2m + 1) − δ); then f(0) = 0 and lim_{δ→2π/n} f(δ) = n, and notice that f is continuous on the interval [0, 2π/n). Also, since 0 ≤ ǫ ≤ 1 we have that 0 ≤ n(1 − ǫ)/(1 + ǫ) ≤ n, but then by the Intermediate Value Theorem for each ǫ ∈ (0, 1] there is a δ ∈ [0, 2π/n) such that f(δ) = n(1 − ǫ)/(1 + ǫ). The case ǫ = 0 and δ = 2π/n is covered by the remark after Theorem 2.3.
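The construction can be carried out numerically; the sketch below (ours, not from the paper) solves f(δ) = n(1 − ǫ)/(1 + ǫ) by bisection, builds the angles quadrant by quadrant as above, and checks the resulting matrix.

    import numpy as np

    def realize_4m_plus_2(m, eps):
        # Follows the proof of Theorem 3.2 for n = 4m+2 and 0 < eps <= 1.
        n = 4 * m + 2
        a = 2 * np.pi / n
        f = lambda d: 2 * np.sin((2 * m + 1) * d) / np.sin(np.pi / (2 * m + 1) - d)
        target = n * (1 - eps) / (1 + eps)
        lo, hi = 0.0, a - 1e-12                    # f(0) = 0 and f -> n as delta -> 2*pi/n
        for _ in range(200):                       # bisection for f(delta) = target
            mid = 0.5 * (lo + hi)
            lo, hi = (mid, hi) if f(mid) < target else (lo, mid)
        delta = 0.5 * (lo + hi)
        first = [j * (a - delta) for j in range(1, m + 1)]                     # 1st quadrant
        second = [(m + j) * a + (m + 1 - j) * delta for j in range(1, m + 1)]  # 2nd quadrant
        theta = np.array(first + second + [np.pi]
                         + [-t for t in reversed(second)]                      # 3rd quadrant
                         + [-t for t in reversed(first)] + [0.0])              # 4th quadrant
        u = np.sqrt((1 + eps) / n) * np.cos(theta)
        v = np.sqrt((1 + eps) / (n * eps)) * np.sin(theta)
        e = np.ones(n)
        return (1 + eps) * np.outer(e, e) / n - np.outer(u, u) - eps * np.outer(v, v)

    A = realize_4m_plus_2(m=2, eps=0.4)            # n = 10
    print(np.allclose(np.diag(A), 0), (A >= -1e-12).all())   # zero diagonal, nonnegative
    print(np.round(np.sort(np.linalg.eigvalsh(A)), 6))       # -1, -0.4, 0, ..., 0, 1.4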
In order to deal with the case where n is odd we will improve on Theorem 2.3.
Theorem 3.3. Let σ = {1 + ǫ, −1, −ǫ, 0, 0, . . . , 0}, where 3/(2n − 3) > ǫ ≥ 0, and n ≥ 3. Then σ cannot be realized by a symmetric nonnegative matrix A ∈ Rn×n with eigenvector e corresponding to 1 + ǫ, when n = 2m + 1 for some m.
Proof. σ is realizable when ǫ = 1 from Theorem 2.2. We wish to determine the minimum value of ǫ for which it is possible to construct a matrix of the desired form. Using the same notation as in the proof of Theorem 3.2, we wish to determine the minimum value of ǫ as a function of θ1, . . . , θn subject to the three constraints Σ_{i=1}^n cos θi = Σ_{i=1}^n sin θi = 0 and Σ_{i=1}^n cos θi sin θi = 0 (we know from Theorem 2.3 that ǫ > 0). Observe that finding the minimum ǫ is equivalent to finding the maximum value of the function n/(1 + ǫ) = F(θ1, . . . , θn) = Σ_{i=1}^n cos²θi subject to the three constraints. For the moment let us determine the maximum value of F subject to the one constraint Σ_{i=1}^n cos θi = 0. Let λ denote the Lagrange multiplier, so that for each i we have sin θi = 0 or else cos θi = −λ/2. We cannot have all θi's equal to 0 or π, since then we would not have Σ_{i=1}^n cos θi = 0, because n is odd. Suppose now that k of the cos θi's are equal, but not equal to ±1; then k cos θi = ±1 ± 1 ± · · ·. Then in order to maximize F we must have as many ±1's as possible, thus we must have k = 2 with cos θp = cos θq (say) and the remaining θi's all either 0 or π. Then 2 cos θi = 1 or −1, and there are two possibilities:
Case 1: cos θp = cos θq = 1/2 with m − 1 of the remaining cos θi's equal to 1 and the other m of the cos θi's equal to −1.
Case 2: cos θp = cos θq = −1/2 with m − 1 of the remaining cos θi's equal to −1 and the other m of the cos θi's equal to 1.
In either case the maximum value of F is 1/2 + 2m − 1 = n − 3/2, in which case ǫ = 3/(2n − 3).
Notice now that this maximum value for F can just as easily be achieved when θp = −θq, and that then Σ_{i=1}^n sin θi = 0 and Σ_{i=1}^n cos θi sin θi = 0. So ǫ has in effect been minimized subject to all three constraints.
Corollary 3.4. Let σ = {1 + ǫ, −1, −ǫ, 0, . . . , 0}, where 1 ≥ ǫ ≥ 3/(2n − 3), and n ≥ 3. Then σ can be realized as the spectrum of a symmetric nonnegative matrix A ∈ Rn×n with eigenvector e corresponding to 1 + ǫ, when n is odd.
Proof. Let F(θ1, . . . , θn) = Σ_{j=1}^n cos²θj. Then
F((2π/n)·1, (2π/n)·2, . . . , (2π/n)(n − 1), (2π/n)n) = n/2,
and
F(2π/3, −2π/3, 0, 0, . . . , 0, π, π, . . . , π) = n − 3/2.
Moreover, F is continuous as a function of cos θ1, . . . , cos θn, particularly on the following intervals for the cos θi's:
For n = 4k + 3 let the cos θi's satisfy cos(2π(k + 1)/n) ≤ cos θk+1 ≤ cos(2π/3) and cos(2π(3k + 3)/n) ≤ cos θ3k+3 ≤ cos(2π/3). Also, let cos(2πj/n) ≤ cos θj ≤ cos 0 for 1 ≤ j ≤ k and 3k + 4 ≤ j ≤ 4k + 3, and let cos π ≤ cos θj ≤ cos(2πj/n) for k + 2 ≤ j ≤ 3k + 2, except that cos θ2k+2 = cos π.
For n = 4k + 1 let the cos θi's satisfy cos(2π(k + 1)/n) ≤ cos θk+1 ≤ cos(2π/3) and cos(2π(3k + 2)/n) ≤ cos θ3k+2 ≤ cos(2π/3). Also, let cos(2πj/n) ≤ cos θj ≤ cos 0 for 1 ≤ j ≤ k and 3k + 3 ≤ j ≤ 4k + 1, and let cos π ≤ cos θj ≤ cos(2πj/n) for k + 2 ≤ j ≤ 3k + 1, except that cos θ1 = cos 0.
For each ǫ such that 1 ≥ ǫ ≥ 3/(2n − 3) we have n/2 ≤ n/(1 + ǫ) ≤ n − 3/2; then from the Intermediate Value Theorem for real-valued functions of several variables it follows that for each ǫ ∈ [3/(2n − 3), 1] there is a (θ1, . . . , θn) such that F(θ1, . . . , θn) = n/(1 + ǫ).
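As an illustration of the boundary case (our sketch, combining the extremal angles from the proof of Theorem 3.3 with the recipe for u and v from Theorem 3.2): for odd n = 2m + 1 and ǫ = 3/(2n − 3), the angles 2π/3, −2π/3, m zeros and m − 1 copies of π give a matrix with zero diagonal realizing σ.

    import numpy as np

    m = 3
    n = 2 * m + 1                                   # n = 7, odd
    eps = 3 / (2 * n - 3)                           # the boundary value 3/(2n-3)
    theta = np.concatenate([[2 * np.pi / 3, -2 * np.pi / 3],
                            np.zeros(m), np.pi * np.ones(m - 1)])
    u = np.sqrt((1 + eps) / n) * np.cos(theta)
    v = np.sqrt((1 + eps) / (n * eps)) * np.sin(theta)
    e = np.ones(n)
    A = (1 + eps) * np.outer(e, e) / n - np.outer(u, u) - eps * np.outer(v, v)
    print(np.allclose(np.diag(A), 0), (A >= -1e-12).all())   # zero diagonal, nonnegative
    print(np.round(np.sort(np.linalg.eigvalsh(A)), 6))       # -1, -eps, 0, ..., 0, 1+eps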
The author does not see a natural extension of the above methods to deal with the case of at most four nonzero eigenvalues.
REFERENCES
[1] M. Boyle and D. Handelman. The spectra of nonnegative matrices via symbolic dynamics. Annals of Mathematics, 133:249–316, 1991.
[2] M. Fiedler. Bounds for eigenvalues of doubly stochastic matrices. Linear Algebra and Its Applications, 5:299–310, 1972.
[3] M. Fiedler. Eigenvalues of nonnegative symmetric matrices. Linear Algebra and Its Applications, 9:119–142, 1974.
[4] M. Fiedler. Additive compound matrices and an inequality for eigenvalues of symmetric stochastic matrices. Czechoslovak Mathematical Journal, 24(99):392–402, 1974.
[5] T.L. Hayden and P. Tarazaga. Distance matrices and regular figures. Linear Algebra and Its Applications, 195:9–16, 1993.
[6] T.L. Hayden, R. Reams, and J. Wells. Methods for constructing distance matrices and the inverse eigenvalue problem. Linear Algebra and Its Applications, 295:97–112, 1999.
[7] T.J. Laffey and E. Meehan. A characterization of trace zero nonnegative 5×5 matrices. Linear Algebra and Its Applications, 302/303:295–302, 1999.
[8] R. Loewy and D. London. A note on an inverse problem for nonnegative matrices. Linear and Multilinear Algebra, 6:83–90, 1978.
[9] H. Minc. Nonnegative Matrices. John Wiley and Sons, New York, 1988.
[10] R. Reams. An inequality for nonnegative matrices and the inverse eigenvalue problem. Linear and Multilinear Algebra, 41:367–375, 1996.