getdoc9786. 239KB Jun 04 2011 12:04:43 AM

(1)

El e c t ro n ic

Jo ur n

a l o

f P

r o b

a b i l i t y Vol. 12 (2007), Paper no. 41, pages 1131–1150.

Journal URL

http://www.math.washington.edu/~ejpecp/

Large Deviations for the largest eigenvalue of rank

one deformations of Gaussian ensembles

Mylène Ma¨ıda Université Paris-Sud Laboratoire de mathématiques

Bˆatiment 425 Facult´e des Sciences 91405 Orsay Cedex, France. [email protected]

Abstract

We establish a large deviation principle for the largest eigenvalue of a rank one deformation of a matrix from the GUE or GOE. As a corollary, we get another proof of the phenomenon, well-known in learning theory and finance, that the largest eigenvalue separates from the bulk when the perturbation is large enough.

A large part of the paper is devoted to an auxiliary result on the continuity of spherical integrals in the case when one of the matrix is of rank one, as studied in (12).

Key words: Large deviations, spiked models, spectral radius, Itzykson-Zuber integrals. AMS 2000 Subject Classification: Primary 60F10, 15A52.

(2)

1 Introduction

We consider in this paper rank one deformations of matrices from Gaussian ensembles, that is matrices which can be writtenWN+AN,withWN from the Gaussian Orthogonal (or Unitary) Ensemble andAN rank one deterministic, real symmetric ifWN is from the GOE and Hermitian

ifWN is from the GUE.

Since the fifties, the classical Gaussian ensembles (see Mehta (19)) have been extensively stud-ied. Various results for the global regime were established (Wigner semicircle law (22), large deviations for the spectral measure (5)...); the statistics of the spacings between eigenvalues were investigated for example in (8; 7), as well as the behaviour of extremal eigenvalues (Tracy-Widom distribution (21)). In the meantime, people got interested in the universality of some of these results. In this context, it is natural to look at various deformations of these ensembles, for example the rank one deformations we are interested in.

This so-called “deformed Wigner ensemble” was studied in (16) and (6), where the authors focused mainly on the problem of the local spacings and in (20) and (11), where they studied the behaviour of the largest eigenvalue. In this framework, our goal in this paper will be to establish a large deviation principle for the largest eigenvalue of XN = WN +AN, that we denote in the sequel by x∗_N. Note that our result can also be seen as a generalization of the result established in (4) for the largest eigenvalue of a matrix distributed according to the GOE. If we denote byθthe unique non zero eigenvalue ofAN,the joint law of the eigenvaluesx1, . . . , xN of XN =WN +AN is given by

Qθ_N(dx1, . . . , dxN) =

1

Z_Nβ,θ

Y

i<j

|xi−xj|βINβ(θ, XN)e− N

2

PN

i=1x2idx₁. . . dx_N, (1)

whereI_Nβ is the spherical integral defined by

I_Nβ(θ, XN) :=

Z

eNtr(U XNU∗AN)_dmβ N(U) =

Z

eN θ(U XNU∗)11_dmβ

N(U),

withmβ_N the Haar probability measure on_ON the orthogonal group of sizeN ifβ = 1, on the

unitary group_UN if β = 2 and Z_Nβ,θ is a normalizing constant. The fact that the joint law of

the eigenvalues of XN and that I_Nβ(θ, XN) depend on AN only through its non zero eigenvalue

θcomes from the unitary invariance respectively of the law ofWN and of the Haar measuremβ_N .

Our main result is the following

Theorem 1.1. For β = 1 or 2, if θ > 0, then under Qθ_N, the largest eigenvalue x∗_N = max_{x1, . . . , xN} satisfies a large deviation principle in the scale N, with good rate function

K_θβ defined as follows:

• If θ6

q

β

2,

K_θβ(x) =



  

  

+_∞, if x <√2β

Z x

√

2β

p

z2₋₂_{β dz,} _if √₂_β ₆_x₆_θ₊ β

2θ,

(3)

with M_θβ(x) = 1 2

Z x

√

2β

p

z2₋₂_{β dz}₋_θx₊1 4x

2₊ β 4 −

β

4 log

β

2 + 1 2θ

2_.

• If θ>

q

β

2,

K_θβ(x) =

+_∞, if x <√2β Lβ_θ(x), if x>√2β,

withLβ_θ(x) = 1 2

Z x

θ+β

2θ

p

z2₋₂_βdz₋_θ

x₋

θ+ β 2θ

+ 1 4

x₋

θ+ β 2θ

2

.

One can see in particular thatK_θβ differs from the rate function for the deviations of the largest eigenvalue of the non-deformed model that was obtained in (4).

Note that in the case when θ <0,similar results would hold for the smallest eigenvalue of the deformed ensemble. We let the precise statement to the reader and assume in the sequel that

θ >0.

Remark 1.2. Let us mention that, although we did not investigate this point in full details, very similar results can be obtained with our techniques in the case of sample covariance matrices for the so-called “single spike model” that is matrices of the form XX∗_, _where _X _{is a} _p_× n matrix, whose column vectors are iid Gaussian (real or complex) with a covariance matrix diag(a,1,1, . . . ,1),with a single spike a >1 (see below for references).

We have to mention an important corollary of Theorem 1.1 :

Corollary 1.3. Forβ = 1or2, underQθ_N,x∗

N converges almost surely to the edge of the support of the semicircle law σβ as long as θ6θc :=

q

β

2 and separates from the support when θ > θc.

In this case, it converges toθ+₂β_θ.

This allows us to give a new proof, via large deviations, to this known phenomena which is crucial for applications to finance and learning theory (cf. for example (15; 18)).

On the mathematical level, this kind of phase transition has been pointed out and proved by several authors in the case of non-white sample covariance matrices (cf. for example (3) for the complete analysis in the complex Gaussian case, (10), (9) for more general models, (17) for statistical applications to PCA).

(4)

2 Continuity of spherical integrals

The question we want to address in this section is the continuity, in a topology to be prescribed, ofI_Nβ(θ, BN),in its second argumentBN.The matrix BN is supposed to be symmetric if β = 1 and Hermitian ifβ = 2.Due to invariance property of the Haar measure, we can always assume thatBN is real diagonal.

We denote by λ1(BN), . . . , λN(BN) the eigenvalues of BN in decreasing order and we let

The following continuity property holds

Proposition 2.1. Forβ= 1or2,for anyθ >0and anyκ >0,there exists a functiongκ :R+→

Remark 2.2. According to Theorem 6 of (12), we know that, for some values ofθ, the limit of

1

NlogI β

N(θ, BN) as N goes to infinity depends not only on the limiting spectral measure of BN but also on the limit of λ1(BN). Therefore _N1 logINβ(θ, BN) cannot be continuous in the spectral measure ofBN but we have also to localize λ1(BN).That is precisely the content of Proposition

2.1 above. We also refer the reader to the remarks made in (12) on point (3) of Lemma 14 therein.

A key step to show Proposition 2.1 is to get an equivalent as explicit as possible of 1

NlogI β

N(θ, BN).This is given by

Lemma 2.3. IfBN has spectral radius uniformly bounded inN, then for anyδ >0,for N large enough,

where vN is the unique solution in

of the equation

(5)

This lemma can be regarded as a generalization to any value ofθof the second point of Lemma 14 in (12).

The remaining of this section is devoted to its proof. For the sake of simplicity, we prove in full details the caseβ = 1 and leave to the reader the changes to the other cases.

2.1 Some preliminary inequalities

Notation and remarks.

• We denote byλ1>. . .>λN the eigenvalues ofBN in decreasing order.

• Forξ _∈]0,1/2[, we define

KN(ξ) :=_{j _{∈ {}1, . . . , N_}/λj _∈(λ1−N−ξ, λ1]},

and we denote byj0 the cardinality of KN(ξ).

• The eigenvaluesλ1 >. . .>λN being fixed, we can see that the functionx7→ 1

N N

X

i=1 1

x₋λi

is strictly decreasing taking negative values on (_−∞, λN) and strictly decreasing taking positive values on (λ1,∞).Therefore, vN as introduced in Lemma 2.3, is well defined and

we can define similarly ˜vN as the unique solution in

λN(BN)₋ 1

2θ, λ1(BN)−

1 2θ

c

of the

equation

1

N

1 2θ

N

X

i=j0+1

1 ˜

vN + ₂1_θ ₋λi = 1.

Moreover, ifθ >0,one can easily see that vN +₂1_θ and ˜vN+₂1_θ both lie in (λ1,∞).

• EandVdenotes respectively the expectation and the variance under the standard Gaussian measure onRN.

• As 1 + 2θvN ₋2θλ1 >0, we can define the probability measure on RN given by

PN(dg1, . . . , dgN) = (2π)−

N

2

N

Y

i=1

hp

1 + 2θvN₋2θλi e−12(1+2θvN−2θλi)g2idgi

i

.

We denote byEPN and VPN respectively the expectation and the variance underPN.

• Similarly, we define the probability measure onRN−j0 _{given by}

˜

PN(dgj0+1, . . . , dgN) = (2π)−

N−j₀

2

N

Y

i=j0+1

hp

1 + 2θvN˜ ₋2θλi e−12(1+2θv˜N−2θλi)gi2dgi

i

.

We denote byE_P_˜

(6)

Before going to the proof of Lemma 2.3, we enumerate hereafter some inequalities on the quan-tities we have just introduced, that will be useful further.

Fact 2.4. Let θ >0. We have the following inequalities :

1. For i>j0+ 1, vN + 1

2θ −λi>N−

ξ _and _˜_vN₊ 1

2θ−λi >N− ξ_.

2. vN+ 1

2θ−λ1 > j0

2θN −N− ξ _and

∀i, vN + 1

2θ −λi >

1 2θN.

3. ˜vN 6vN 6λ1 6vN˜ + 1

2θ 6vN+

1 2θ.

4. For any δ > 0, for any ξ _∈ (0,1/2), for any ε > 0, there exists N0 such that for any

N >N0 such that j0 6δN1−

ξ

2,|vN−˜vN|6ε.

Proof :

1. As mentionned in the remarks above,vN+₂1_θ >λ1.Fori>j0+ 1, λi6λ1−N−ξ so that

vN+ ₂1_θ >λ1 >λi+N−ξ.The same holds for ˜vN.

2. We have the following inequality

j0

1

vN +₂1_θ ₋(λ1−N−ξ) 6

j0

X

i=1

1

vN +₂1_θ ₋λi 62θN,

where the right inequality comes from the fact that PN

i=1 _v_N₊11

2θ−λi = 2θN and for all i, vN+₂1_θ₋λi>0 and the left one is inherited from the definition ofj0.Putting the leftmost and rightmost terms together, we get the first inequality announced in point (2) above. The second one is even simpler : we have that PN

i=1vN+₂11_θ−λi = 2θN wand each term is

positive so that any of them is smaller than 2θN.

3. Suppose thatvN <vN˜ ,then for all i>j0+ 1, vN +₂1_θ −λi <vN˜ +₂1_θ−λi

2θN =

N

X

i=j0+1

1 ˜

vN +₂1_θ ₋λi < N

X

i=j0+1

1

vN +₂1_θ ₋λi,

but the rightmost term is smaller or equal to 2θN.Therefore, ˜vN 6vN.

Theλi’s being in decreasing order, we get that 2θ= 1

N N

X

i=1

1

vN+ ₂1_θ ₋λi 6

1

vN +₂1_θ ₋λ1

,

this gives vN+ 1

2θ 6λ1+

1

2θ,i.e. vN 6λ1.

4. Letε >0, δ >0 and ξ_∈(0,1/2) be fixed. From (3), we have that

(7)

IfvN +₂1_θ ₋λ1 6ε,then |vN −vN˜ |6ε. Otherwise, we can write, thanks to (3), that

N

From Cauchy-Schwarz inequality, we get

j0

where the last inequality holds forN large enough. ✷

2.2 Proof of Lemma 2.3

We first prove the upper bound: the starting point will be the same as in (12). It is a well known fact that the first column vector of a random orthogonal matrix distributed according to the Haar measure on _ON has the same law as a standard Gaussian vector in RN divided by its

Euclidian norm. Therefore, we can write

IN(θ, BN) =E exp

From concentration for the norm of a Gaussian vector (cf. (12) for details), we get that, for any

κ such that 0< κ <1/2,

From there, we have

(8)

where M is the uniform bound on the spectral radius of BN and we use that PN(_AN(κ))61.

Therefore, for anyδ >0,we get that for N large enough,

1

N logIN(θ, BN)6θvN −

1 2

N

X

i=1

log (1 + 2θvN ₋2θλi) +δ.

For the proof of the lower bound, we have to treat two distinct cases. In both cases, the starting point, inherited from (2), is the following:

IN(θ, BN)>δ(κ, N)eN θvN−N1−κθ(M+vN)_E

"

1_A_N₍κ)exp

(

θ N

X

i=1

λig2_i ₋θvN N

X

i=1

g_i2

)#

,

but our startegy will be different according to whether there is a lot of eigenvalues at the vicinity of the largest eigenvalueλ1 or not, that is according to the size of j0 defined above.

Forκ_∈]0,1/2[ fixed, we choose in the sequel ξ such that 0< ξ 6 1₂₋κ.

• First case : N is such that j0 >δN1−

ξ

2.

In this case, the situation is very similar to what happens with a small θ, we therefore follow the proof of (12). Indeed, we write

E "

1_A_N₍_κ₎exp

(

θ N

X

i=1

λig2i −θvN N

X

i=1

g2_i

)#

=

N

Y

i=1

hp

1 + 2θvN −2θλi

i−1

PN(AN(κ)),

and show that, forN large enough,PN(_AN(κ))>1/2.

An easy computation gives that

V_P_N

1

Nkgk

2

= 2

N2

N

X

i=1

1

(1 + 2θvN ₋2θλi)2

and our goal is to show that this variance decreases fast enough. From (2) in Fact 2.4, we have

vN + 1

2θ −λi >vN +

1

2θ −λ1 > j0

2θN −N− ξ_> δ

2θN− ξ

2 −N−ξ_> δ

2θN− ξ_.

This gives that

VPN

1

Nkgk

2

6 2

N2_δ2N

2ξ_N ₆ 2 δ2N

2ξ−1

Therefore, by Chebichev inequality,

PN(_AN(κ)c)6

2

δ2N

2κ+2ξ−1₆ 1 2,

(9)

• Second case : N is such that j06δN1−

ξ

2.

The strategy will be a bit different : separating the eigenvalues of BN that are in KN(ξ) and the others, we get

IN(θ, BN) >eN θv˜N−N1−κθ(M+˜vN) _E ₁

The first term will be treated similarly to what we made in the first case. We can easily check

thatE_P˜_N

which goes to zero with our choice of ξ. This gives that for N large enough,

E

(10)

Putting together (3) and (4), we get that forN large enough

The last step is now to prove that, for N large enough,

On one side, we have that

forN large enough,

This concludes the proof of Lemma 2.3. ✷

2.3 Proof of Proposition 2.1

Let κ > 0 be fixed and (BN)N_∈N and (B_N′ )N_∈N two sequences of matrices. We denote by

We introduce also the following notations: HBN(z) =

(11)

• First case : N and θare such that 2θ_∈HBN((λ1+ 2δ,+∞))∩HB′_N((λ′1+ 2δ,+∞)).

In this frame work, continuity has been established in Lemma 14 of (12). It comes from Lemma 2.3 and the fact that, for anyλ_{∈ ∪}N0>0∩N>N0 (supp ˆνBN ∩supp ˆνB_N′ ), z7→ (z−λ)−

Thanks to Lemma 2.3, we know that it is enough to study

∆N :=

−κ_{, we proceed as in the proof of Lemma 5.1 in (14) and define a permutation} σN that allows to put in pairs all but (N1−κ_∧_{N δ}_{) of the}_λi_{’s with a corresponding}_λ′

σN(i) which

lies at a distance less than δ from λi.

As in (14), we denote by_J0 the set of indicesisuch that we have such a pairing. Then we have

∆N 6

where we used once again that 1

so that we get the required continuity in this second case.

•Third case : N andθare such that 2θ_∈HBN((λ1+ 2δ,+∞)) and 2θ /∈HB′_N((λ′1+ 2δ,+∞)).

In this case, we proceed exactly as in the second case. The only point is that establishing that

(12)

On one side we have from Fact 2.4 that

λ′₁ 6v_N′ + 1

2θ 6λ′1+ 2δ. (6)

On the other side, as _|λ1 −λ′1| 6 δ, λ′1 + 2δ is greater than λ1 and the map BN 7→ HBN is

continuous outside the support of all the spectral measures so that

HB′_N(λ

′

1+ 2δ)−HBN(λ′1+ 2δ)

6C(δ),

with the functionC going to zero at zero.

Furthermore,H_B′

N is decreasing on (λ ′

1,+∞) so thatHB′ N(λ

′

1+ 2δ)<2θ, yielding

HBN(λ′1+ 2δ)62θ+C(δ),

and HBN being decreasing

HBN(λ1+ 3δ)62θ+C(δ) =HBN

vN +

1 2θ

+C(δ),

what implies

λ1 6vN+

1

2θ 6λ1+ 3δ+K(δ),

with the functionK going to zero at zero. and, together with (6) this gives that

|vN ₋v′N|65δ+K(δ).

Now the same estimates as in the second case above lead to the same conclusion. This gives

Proposition 2.1. ✷

3 Large deviations for

x

∗

N

The goal of this section is to prove the large deviation principle forx∗_N, the largest eigenvalue of a matrix from the deformed Gaussian ensemble, announced in the introduction in Theorem 1.1. A first step will be to prove the following

Proposition 3.1. For β= 1 or 2 and θ >0, if we define

Pθ_N(dx1, . . . , dxN) =

1

Z_Nβ

Y

i<j

|xi−xj|βINβ(θ, XN)e− N

2

PN

i=1x2idx₁. . . dx_N, (7)

withZ_Nβ the normalizing constant in the case θ= 0,and we let

F_θβ(x) :=

+_∞, if x <√2β

−β2 +

β

2 log

β

2 −Φβ(x, σβ)−I

β

(13)

where σβ denotes the semicircle law whose density on R is given by 1

βπ1[−√2β,√2β]

p

2β₋t2_dt,

for µ_{∈ P}(R) and x_∈R,

Φβ(x, µ) =β

Z

log_|x₋y_|dµ(y)₋1 2x

2_,

and I_µβ(x, θ) = lim

N_→∞

1

N logI β

N(θ, BN), where BN has limiting spectral measure µ and limiting largest eigenvaluex, then we have the following large deviations bounds :

1. there exists a function fθ:R+_→R+ going to infinity at infinity such that for allN

Pθ_N

max

i=1...N|xi|>M

6e−N fθ(M)_.

2. For any x, for any M such that _|x_|< M,

lim

δ_↓0lim supN_→∞

1

N logP θ

N(x6x∗N 6x+δ,max

1...N|xi|6M)6−F β θ(x)

3. For any x,

lim

δ_↓0lim infN_→∞

1

N logP θ

N(x6x∗N 6x+δ)>−F β θ(x)

Remark 3.2. The function I_µβ(x, θ) = lim

N_→∞

1

N logI β

N(θ, BN) is well defined by virtue of Theo-rem 6 in (12) (an explicit expression for it will be given in Section 3.2 below).

3.1 Proof of Proposition 3.1

• We first prove the “exponential tightness” property (1).

It is more convenient to rewrite (7) as

Pθ_N(dx1, . . . , dxN) =

eN2θ2

Z_Nβ

Y

i<j

|xi₋xj_|βe−N2tr(XN−AN)2dx₁. . . dxN.

Now, a well known inequality (see for example Lemma 2.3 in (2)) gives that

tr(XN ₋AN)2 >min

π N

X

i=1

|xk₋aπ(k)|2,

where the minimum is taken over all permutationsπof_{1, . . . , N_}. But allak’s are zero, except

(14)

we can assume thatπ−_∗1(1) = 1, whereπ_∗ is the permutation for which the minimum is reached. Therefore

tr(XN ₋AN)2 >(x1−θ)2+

N

X

i=2

x2j.

We can now use the very same estimates as in Lemma 6.3 in (4) to get (1). More precisely, we can write

|(x₋θ) +θ₋xj_|βe− x2_j

2 ₆e (x−θ)2

4 ,

forx large enough, so that, forM large enough,

Pθ_N

max

i=1...N|xi|>M

6NPθ_N(_|x1|>M)6

Z_Nβ₋₁ Z_Nβ e

−14N(M−θ)2+

N

2θ2.

From Selberg formula (cf for example proof of Proposition 3.1 in (5)), we can show that 1

N log Z_Nβ₋₁

Z_Nβ −−−−→N_→∞ C. This concludes the proof of (1).

• For allx <√2β,

lim

N_→∞

1

N logP θ

N(x∗N 6x) =−∞. (8)

Indeed, we know from Theorem 1.1 in (5) that the spectral measure of WN satisfies a large

deviation principle in the scale N2 with a good rate function whose unique minimizer is the semicircle lawσβ.We can check that adding a deterministic matrix of bounded rank (uniformly

in N) does not affect the spectral measure in this scale so that the spectral measure of XN

satisfies the same large deviation principle.

Therefore, if we let x <√2β,f _{∈ C}b(R) such that f(y) = 0 ify 6x but

R

f dσβ >0 and if we consider the closed setF :=_{µ/R

f dµ= 0_}, we have that

Qθ_N(x∗_N 6x)6Qθ_N 1

N N

X

i=1

f(xi) = 0

!

6Qθ_N(ˆµN ∈F),

where ˆµN := _N1 PN

i=1δxi is the spectral measure ofXN. As σβ ∈/ F,

lim sup

N→∞

1

N2logQ

θ

N(x∗N 6x)<0.

Furthermore, as we saw above, 1

N log Z_Nβ₋₁

Z_Nβ −−−−→N_→∞ C, the same holds for P θ

N and we

immedi-ately deduce what gives immediimmedi-ately (8).

• Let nowx>√2β and δ >0.

LetM >_|x_|andδsmall enough so thatM >_|x+δ_|.One important remark is that, by invariance by permutation, we have,

Pθ_N(x6x∗N 6x+δ, max

i=1...N|xi|6M)6NP θ

N(x6x16x+δ, x1 > max

i=2...Nxi,i=1max...N|xi|6M).

(15)

• πˆN :=

1

N ₋1

N

X

i=2

δxi,

• PN_N−1 is the measure on RN−1 such that, for each Borel set E, we have

PN_N−1(λ_∈E) =P0_N₋₁ r

1₋ 1

Nλ∈E

!

.

With these notations, we have

B :=Pθ_N(x6x∗_N 6x+δ, max

i=1...N|xi|6M)

6

Z x+δ

x

dx1

Z

[−M,M]N−1

e(N−1)Φβ(x1,πˆN)_.Cβ N.I

β

N(θ, XN).dP0N₋1(x2, . . . , xN),

whereC_Nβ :=NZ β N−1

Z_Nβ .

1₋ 1

N

βN(N₄−1)

.

Let 0< κ < 1₄, we have

B 6C_Nβ.

Z x+δ

x

dx1

Z

ˆ

πN∈B(σβ,N−κ),

maxi=2...Nxi6x1

e(N−1)Φβ(x1,πˆN)_Iβ

N(θ, XN).dPNN−1(x2, . . . , xN)

+ (2M)NeN M θC_NβPN_N−1(ˆπN _∈/ B(σβ, N−κ)), (9)

where B(σβ, N−κ_{) is the ball of size} _N−κ _{centered at} _σβ_{, for the Dudley distance (defined at}

the very beginning of Section 2).

We first show that the second term is exponentially negligible. We have

P_NN−1(ˆπN ∈/ B(σβ, N−κ))6PNN−1(kFN−1−Fβk∞>N−κ),

whereFN₋1 and Fβ are respectively the (cumulative) distribution function of ˆπN andσβ. We know from the result of Bai in (1) that

kEN_N−1FN₋1−Fβk=O(N−

1 4),

whereEN_N−1 is the expectation underPN_N−1, so that

PN_N−1(_kFN−1−Fβk∞>N−κ)6PNN−1

kFN−1−ENN−1FN−1k∞> N−κ

2

.

But, by a result of concentration of (13) (see Theorem 1.1), we have that there exists a constant

C >0 such that for allN _∈N,

P_NN−1(_kFN₋1−ENN−1FN−1k∞>N−

κ₎₆_e−CN2−2κ ,

so that

lim sup

N_→∞

1

N logP N−1

N (ˆπN ∈/B(σβ, N− κ_{)) =}

(16)

We can now come back to the first term in (9). The same computation as in the proof of Proposition 3.1 in (5), based on Selberg formula, gives that

1

N logC β

N −−−−→_N →∞ −

β

2 log

β

2 +

β

2.

Applying Proposition 2.1 together with Theorem 6 of (12), we can conclude that

lim sup

N_→∞

1

N logP θ

N(x6x∗N 6x+δ,max

1...N|xi|6M)6− β

2 log

β

2+

β

2+ supz_∈[x,x+δ]

[Φβ(z, σβ)+Iσββ(z, θ)].

One can easily see thatz_7→Φβ(z, σβ) is continuous on (√2β,+∞) and the continuity ofIσββ(., θ)

will be shown in the proof of Lemma 3.4 below. We therefore get the upper bound :

lim sup

δ↓0

lim sup

N→∞

1

N logP θ

N(x6x∗N 6x+δ,max

1...N|xi|6M)6− β

2 log

β

2+

β

2+ Φβ(x, σβ) +I

β σβ(x, θ).

• We now conclude the proof of Proposition 3.1 by showing the lower bound (3). We proceed as in (4). Lety > x > r >√2β. Then,

Pθ_N(y >x∗_N >x) > Pθ_N(x1 ∈[x, y], max

i=2...N|xi|6r)

> C_Nβ exp





(N −1) _zinf

∈[x,y]

µ_∈Br(σβ,N−κ)

(Φβ(z, µ) +Iµ(z, θ)−gκ(x−y))



 

.PN_N−1(ˆπN _∈Br(σβ, N−κ)),

where Br(σβ, N−κ) = B(σβ, N−κ)∩ P([−r, r]), with P([−r, r]) the set of probability measure

whose support is included in [₋r, r] and gκ going to zero at zero by virtue of Proposition 2.1. We proceed as for the upper bound to show that PN_N−1(ˆπN _∈ Br(σβ, N−κ_{)) is going to 1.}

Knowing the asymptotics ofC_Nβ, we get

lim inf

N→∞ P θ

N(y>x∗N >x)>− β

2log

β

2 +

β

2 + infz∈[x,y](Φβ(z, σβ) +I

β

σβ(z, θ)−gκ(x−y)).

We let nowy decrease to x. Φβ(., σβ) and Iσββ(., θ) are continuous on ( √

2β,+_∞) (see Lemma 3.4 below) so that we have the required lower bound

lim inf

y_→x lim infN→∞ P θ

N(y >x∗N >x)>− β

2 log

β

2 +

β

2 + Φβ(x, σβ) +I

β σβ(x, θ).

This concludes the proof of Proposition 3.1. ✷

3.2 Proof of Theorem 1.1

(17)

Definition 3.3. For µa compactly supported measure, we define Hµ its Hilbert transform by

Hµ:R\co(supp µ) → R

z _7→

Z ₁

z₋λdµ(λ).

withco(supp µ) the convex enveloppe of the support of µ.

It is easy to check that Hµ is injective, therefore we can define its inverse Gµ defined on the image of Hµ such that Gµ(Hµ(z)) = z. The R-transform Rµ is then given, for z 6= 0, by Rµ(z) =Gµ(z)₋1_z. Moreover, one can check thatl:= limz_→0Rµ(z)exists and we letRµ(0) =l,

so that Rµ is continuous at0.

Lemma 3.4. F_θβ is lower semicontinuous with compact level sets.

Proof. We know from Theorem 6 in (12) that

I_µβ(x, θ) =θv(x, θ)₋ β 2

Z

log

1 +2θ

β v(x, θ)−

2θ β λ

dµ(λ),

with v(x, θ) :=

(

Rµ

2θ β

, ifHµ(x)> 2_βθ,

x₋₂β_θ, otherwise, .

Therefore, we have to check that Iσββ(x, θ) is continuous at x∗ satisfying Hσβ(x∗) =

2θ

β for a θ

such that x∗ _>√₂_β. _{From Definition 3.3, we see that} _v₍_{., θ}_{) is continuous at} _x∗_. _{Moreover as} x∗ > √2β, it is outside the support of σβ and x _7→ R

log (x₋λ)dσβ(λ) is continuous at x∗.

ThereforeF_θβ is continuous on (√2β,+_∞) and lower semi-continuous on R.

Moreover, we can check that, for x large enough, Iσββ(x, θ) 6 θx, so that F β

θ(x) ∼+∞

1 2x2, its level sets are therefore compact.

We now go to the proof of Theorem 1.1. If we defineLβ_θ(x) :=F_θβ(x)₋infx∈RF_θβ(x), then Lβ_θ is a good rate function and a direct consequence of Proposition 3.1 is that, for

Z_Nβ,θ =

Z

. . .

Z Y

i<j

|xi₋xj_|βI_Nβ(θ, XN)e−N2

PN i=1x2idx

1. . . dxN,

we have that lim

N→∞

1

N log Z_Nβ,θ

Z_Nβ = infx_∈RF

β

θ(x) so that, under QθN, x∗ satisfies a large deviation

principle with good rate functionLβ_θ.

To conclude the proof, we have to study the function F_θβ and show that Lβ_θ coincide with the functionK_θβ as defined in Theorem 1.1.

We recall that, for x>√2β,we have

F_θβ(x) =₋β 2 +

β

2log

β

2 −β

Z

log_|x₋y_|dσβ(y) +

1 2x

2₋_Iβ σβ(x, θ),

(18)

Relying for example on the proof of Lemma 2.7 in (5), we have that

From Lemma 2.7 in (5), we also get that

Hσ_β(x) = 1

β(x−

p

x2₋₂_β_{) for} _{x >}p

2β, (12)

from which we deduce that lim

x_→√2βHσβ(x) = then increasing so that its infimum is reached at θ+ ₂β_θ. This gives immediately in this case thatK_θβ(x) =F_θβ(x)₋infxF_θβ(x) =Lβ_θ(x), as defined in Theorem 1.1..

2_,_{which is increasing.}

(19)

4 Proof of Corollary 1.3

In the proof of Theorem 1.1. above, we saw that K_θβ is increasing on [√2β,+_∞) if θ6

q

β

2,so that in this case its infimum is reached at √2β.

We also saw that, differentiatingF_θβ on (√2β,+_∞), we got that when θ >

q

β

2, it reaches its

minimum atθ+₂β_θ.This is enough to conclude. _✷

Acknowledgments : I would like to thank Sandrine P´ech´e for many fruitful discussions during the preparation of the paper.

References

[1] Z. D. Bai, Convergence rate of expected spectral distributions of large random matrices. I. Wigner matrices, Ann. Probab.21(1993), no. 2, 625–648. MR1217559 (95a:60039)

[2] Z. D. Bai,Methodologies in spectral analysis of large-dimensional random matrices, a review, Statist. Sinica 9 (1999), no. 3, 611–677, With comments by G. J. Rodgers and Jack W. Silverstein; and a rejoinder by the author. MR1711663 (2000e:60044)

[3] J. Baik, G. Ben Arous, and S. P´ech´e, Phase transition of the largest eigenvalue for non-null complex sample covariance matrices, Ann. Probab. 33 (2005), no. 5, 1643–1697. MR2165575 (2006g:15046)

[4] G. Ben Arous, A. Dembo, and A. Guionnet,Aging of spherical spin glasses, Probab. Theory Related Fields 120(2001), no. 1, 1–67. MR1856194 (2003a:82039)

[5] G. Ben Arous and A. Guionnet, Large deviations for Wigner’s law and Voiculescu’s non-commutative entropy, Probab. Theory Related Fields 108 (1997), no. 4, 517–542. MR14656140 (98i:15026)

[6] E. Br´ezin and S. Hikami, Correlations of nearby levels induced by a random potential, Nuclear Phys. B 479(1996), no. 3, 697–706. MR1418841 (97j:82080)

[7] P. Deift, T. Kriecherbauer, K. T.-R. McLaughlin, S. Venakides, and X. Zhou, Uniform asymptotics for polynomials orthogonal with respect to varying exponential weights and ap-plications to universality questions in random matrix theory, Comm. Pure Appl. Math.52

(1999), no. 11, 1335–1425. MR1702716 (2001g:42050)

[8] P. A. Deift, Orthogonal polynomials and random matrices: a Riemann-Hilbert approach, Courant Lecture Notes in Mathematics, vol. 3, New York University Courant Institute of Mathematical Sciences, New York, 1999. MR1677884 (2000g:47048)

(20)

[10] N. El Karoui,Tracy-widom limit for the largest eigenvalue of a large class of complex sample covariance matrices., Annals of Probability35 (2007), no. 2. MR2308592

[11] D. Féral and S. Péché, The largest eigenvalue of rank one deformation of large wigner matrices, Comm.Math.Phys 272(2007), no. 1, 185–228. MR2291807

[12] A. Guionnet and M. Ma¨ıda, A Fourier view on the R-transform and related asymptotics of spherical integrals, J. Funct. Anal. 222 (2005), no. 2, 435–490. MRMR2132396 (2006b:60043)

[13] A. Guionnet and O. Zeitouni, Concentration of the spectral measure for large matrices, Electronic Communicartions in Probability 5 (2000). MR1781846 (2001k:15035)

[14] A. Guionnet and O. Zeitouni,Large deviations asymptotics for spherical integrals, J. Funct. Anal. 188(2002), no. 2, 461–515. MR1883414 (2003e:60055)

[15] D. Hoyle and M. Rattray,Limiting form of the sample covariance eigenspectrum in pca and kernel pca, Proceedings of Neural Information Processing Systems, 2003.

[16] K. Johansson, Universality of the local spacing distribution in certain ensembles of Hermitian Wigner matrices, Comm. Math. Phys. 215 (2001), no. 3, 683–705. MR1810949 (2002j:15024)

[17] I. Johnstone,On the distribution of the largest eigenvalue in principal components analysis., Annals of Statistics29(2001), 295–327. MR1863961 (2002i:62115)

[18] L. Laloux, P. Cizeau, M. Potters, and J. Bouchaud, Random matrix theory and financial correlations, Intern. J. Theor. Appl. Finance3 (2000), no. 3, 391–397.

[19] M. L. Mehta, Random matrices, second ed., Academic Press Inc., Boston, MA, 1991. MR1083764 (92f:82002)

[20] S. P´ech´e,The largest eigenvalue of small rank perturbations of hermitian random matrices, Probab. Theory Related Fields134 (2006), no. 1, 127–173. MR2221787 (2007d:15041)

[21] C. A. Tracy and H. Widom, Level-spacing distributions and the Airy kernel, Comm. Math. Phys.159 (1994), no. 1, 151–174. MR1257246 (95e:82003)