• Tidak ada hasil yang ditemukan

getdoce4ce. 125KB Jun 04 2011 12:05:06 AM

N/A
N/A
Protected

Academic year: 2017

Membagikan "getdoce4ce. 125KB Jun 04 2011 12:05:06 AM"

Copied!
12
0
0

Teks penuh

(1)

ELECTRONIC COMMUNICATIONS in PROBABILITY

BERRY-ESSEEN BOUNDS FOR PROJECTIONS OF COORDINATE

SYMMETRIC RANDOM VECTORS

LARRY GOLDSTEIN

Department of Mathematics, University of Southern California, Los Angeles, CA 90089-2532, USA email: [email protected]

QI-MAN SHAO1

Department of Mathematics, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China

email: [email protected]

SubmittedMay 16,2009, accepted in final formOctober 23,2009 AMS 2000 Subject classification: 60F05,60D05

Keywords: Normal approximation, convex bodies

Abstract

For a coordinate symmetric random vector(Y1, . . . ,Yn) =Y∈Rn, that is, one satisfying(Y1, . . . ,Yn) =d

(e1Y1, . . . ,enYn)for all(e1, . . . ,en)∈ {−1, 1}n, for whichP(Yi=0) =0 for alli=1, 2, . . . ,n, the

fol-lowing Berry Esseen bound to the cumulative standard normalΦfor the standardized projection Wθ=Yθ/vθofYholds:

sup

x∈R|

P(Wθx)−Φ(x)| ≤2

n

X

i=1

|θi|3E|Xi|3+8.4E(Vθ2−1)

2,

where =θ·Yis the projection ofYin directionθ ∈Rnwith||θ||=1, = p

Var(Yθ),Xi =

|Yi|/vθ and = Pn

i=1θ 2 iX

2

i. As such coordinate symmetry arises in the study of projections of

vectors chosen uniformly from the surface of convex bodies which have symmetries with respect to the coordinate planes, the main result is applied to a class of coordinate symmetric vectors which includes cone measureCn

p on theℓnpsphere as a special case, resulting in a bound of order

Pn i=1|θi|3.

1

Introduction and main result

Properties of the distributions of vectors uniformly distributed over the surface, or interior, of compact, convex bodies, such as the unit sphere inRn, have been well studied. When the convex

body has symmetry with respect to allncoordinate planes, a vectorYchosen uniformly from its surface satisfies

(Y1, . . . ,Yn) =d(e1Y1, . . . ,enYn) for all(e1, . . . ,en)∈ {−1, 1}n

1PARTIALLY SUPPORTED BY HONG KONG RGC CERG 602206 AND 602608

(2)

and is said to be coordinate symmetric. Projections

Yθ=θ·Y=

n

X

i=1

θiYi (1.1)

ofYalongθ Rnwith||θ||=1 have generated special interest, and in many cases normal ap-proximations, and error bounds, can be derived for , the projection standardized to have mean zero and variance 1. In this note we show that when a random vector is coordinate sym-metric, even though its components may be dependent, results for independent random variables may be applied to derive error bounds to the normal for its standardized projection. Bounds in the Kolmogorov and total variation metric for projections of vectors with symmetries are given also in[8], but the bounds are not optimal; the bounds provided here, in particular those in The-orem 2.1 for the normalized projections of the generalizationCn

p,F of cone measure, are of order

Pn

i=1|θi|3. In related work, many authors study the measure of the set of directions on the unit

sphere along which projections are approximately normally distributed, but in most cases bounds are not provided; see in particular[12],[1]and[2]. One exception is[6]where the surprising order Pni=1|θi|4 is obtained under the additional assumption that a joint density function of Y

exists, and is log-concave.

When the componentsY1, . . . ,Ynof a coordinate symmetric vectorYhave finite variancesv12, . . . ,vn2,

respectively, it follows easily fromYi=dYiand(Yi,Yj) =d(Yi,Yj)fori6=j∈ {1, . . . ,n}that EYi=0, and EYiYj=vi2δi j,

and hence, that

EYθ=0 and Var(Yθ) =2 where v

2

θ=

n

X

i=1 θi2vi2. Standardizing to variance 1, write

=Yθ/vθ and Xi=|Yi|/vθ. (1.2) Whenv2

i =v2is constant ini then 2= v2, the common variance of the components, for allθ with||θ||=1.

One conclusion of Theorem 1.1 gives a Kolmogorov distance bound between the standardized projection and the normal in terms of expectations of functions of =

Pn i=1θ

2 iX

2 i and

Pn

i=1|θi|3|Xi|3. We apply Theorem 1.1 to standardized projections of a family of coordinate

sym-metric random vectors, generalizing cone measure Cn

p on the sphereS(ℓ n

p), defined as follows.

Withp>0, let

S(ℓnp) ={xRn:

n

X

i=1

|xi|p=1} and B(ℓnp) ={xRn:

n

X

i=1

|xi|p1}.

WithµnLebesgue measure onRn, the cone measure ofAS(n

p)is given by

Cpn(A) =

µn([0, 1]A) µn(B(n

p))

where [0, 1]A={t a:aA,t[0, 1]}. (1.3)

(3)

In [5] an L1 bound between the standardized variableW

θ in (1.2) and the normal is obtained when Yhas the cone measure distribution. Here an application of Theorem 1.1 yields Theorem 2.1, which gives Kolmogorov distance bounds of the orderPn

i=1|θi|3 for a class of distributions

Cn

p,F which include cone measure as a special case.

We note that ifθRnsatisfies||θ||=1, so Hölder’s inequality with 1/s+1/t=1 yields sum of the coordinates ofY.

We have the following simple yet crucial result, shown in Section 3.

LEMMA 1.1. LetYbe a coordinate symmetric random variable inRn such that P(Yi =0) = 0for all i =1, 2 . . . ,n, and let ǫi =sign(Yi), the sign of Yi. Then the signsǫ1, . . . ,ǫn of the coordinates Y1, . . . ,Ynare i.i.d. variables taking values uniformly in{−1, 1}, and

(ǫ1, . . . ,ǫn) and (|Y1|, . . . ,|Yn|) are independent.

The independence property provided by Lemma 1.1 is the key ingredient in the following theorem. THEOREM1.1. LetY= (Y1, . . . ,Yn)be a coordinate symmetric random vector inRnwhose components

We remark that in related work, Theorem 4 in[3]gives an exponential non-uniform Berry-Esseen bound for the Studentized sums

n

A simplification of the bounds in Theorem 1.1 result whenYhas the ‘square negative correlation property,’ see[9], that is, when

(4)

as then

E(V2

θ −1)2≤

n

X

i=1 θ4

iVar(X 2 i),

and hence the first term on the right hand side of(1.7)can be replaced by 8.4Pn

i=1θi4Var(Xi2).

Proposition 3 of[9]shows that cone measureCn

p satisfies a correlation condition much stronger

than(1.8); see also[1]regarding negative correlation in the interior ofB(ℓnp).

2

Application

One application of Theorem 1.1 concerns the following generalization of cone measureCn p. Let

n2 andG1, . . . ,Gnbe i.i.d. nontrivial, positive random variables with distribution function F,

and set

G1,n=

n

X

i=1

Gi.

In addition, let ǫ1, . . . ,ǫn be i.i.d. random variables, independent of G1, . . . ,Gn, taking values

uniformly in{−1, 1}. LetCn

p,F be the distribution of the vector

Y= ‚

ǫ1(

G1

G1,n

)1/p, . . . ,ǫn( Gn G1,n

)1/p Œ

. (2.1)

By results in[10], for instance, cone measureCn

p as given in (1.3) is the special case whenF is

the Gamma distributionΓ(1/p, 1).

THEOREM2.1. LetYhave distributionCp,Fn given by (2.1) with p>0and F for which EG21+4/p<

when G1is distributed according to F . Then there exists a constant cp,F depending on p and F such

that for allθRnfor which||θ||=1we have

sup

x∈R|

P(Wθx)Φ(x)| ≤cp,F n

X

i=1

|θi|3, (2.2)

where

=Yθ/vθ with =θ·Y and 2=E G1

G1,n

2/p .

As the Gamma distributionΓ(1/p, 1)has moments of all orders, the conclusion of Theorem 2.1 holds, in particular, for cone measureCn

p.

3

Proofs

(5)

Then, using the coordinate symmetry property to obtain the fourth equality, we have P(ǫ1=e1, . . . ,ǫn=en,|Y1| ∈A1, . . . ,|Yn| ∈An)

= P(ǫ1=e1, . . . ,ǫn=en,ǫ1Y1A1, . . . ,ǫnYnAn) = P(e1Y1∈A1, . . . ,enYnAn)

= P(Y1e1A1, . . . ,YnenAn)

= 1

2n

X

(γ1,...,γn)∈{−1,1}n

P(Y1γ1A1, . . . ,YnγnAn)

=

n

Y

i=1

P(ǫi=ei)

!

P(|Y1| ∈A1, . . . ,|Yn| ∈An). ƒ

Before proving Theorem 1.1, we invoke the following well-known Berry-Esseen bound for inde-pendent random variables (see [11]): if ξ1,· · ·,ξn are independent random variables satisfying i=0, E|ξi|3<for1in andPn

i=12i =1,

sup

x∈R|

P(

n

X

i=1

ξix)−Φ(x)| ≤min(1, 0.7056 n

X

i=1

E|ξi|3).

In particular, ifǫ1, . . . ,ǫnare independent random variables taking the values−1,+1 with equal

probability, and b1, . . . ,bnare any nonzero constants, thenW=

Pn

i=1biǫisatisfies

sup

x∈R|

P(Wx)Φ(x/V)| ≤min(1, 0.7056

n

X

i=1

|bi|3/V3), (3.1)

whereV2=Pn i=1b2i.

Proof of Theorem 1.1. By Lemma 1.1, recallingXi=|Yi|/vθ, we may write =

n

X

i=1 ǫiθiXi

where{ǫi, 1in}is a collection of i.i.d. random variables withP(ǫi=1) =P(ǫi=1) =1/2, independent ofX1, . . . ,Xn. Note that, by construction,Pn

i=1θi2EXi2=1.

Now,

P(Wθx)P(Zx)

= EP(Wθx|{Xi}1≤i≤n)−Φ(x/Vθ)

+EnΦ(x/Vθ)−Φ(x) o

:= R1+R2. (3.2)

By(3.1),

|R1| ≤ Enmin(1,0.7056 Pn

i=1|θi|3|Xi|3

Vθ3 )

o

P(V2

θ <1/2) +0.7056(23 /2)

n

X

i=1

|θi|3E|X i|3

≤ 2E|Vθ2−1|I{|Vθ2−1|>1/2}+2

n

X

i=1

|θi|3E|X

(6)

As toR2, lettingZN(0, 1)be independent ofVθwe have |R2| = |P(Zx/Vθ)−P(Zx)|

≤ |P(Zx/Vθ,|2−1| ≤1/2)−P(Zx,|2−1| ≤1/2)| +|P(Zx/Vθ,|2−1|>1/2)−P(Zx,|V

2

θ −1|>1/2)| ≤ |P(Zx/Vθ,|Vθ21| ≤1/2)P(Zx,|Vθ21| ≤1/2)|

+P(|Vθ2−1|>1/2)

≤ |P(Zx/Vθ,|2−1| ≤1/2)−P(Zx,|V

2

θ −1| ≤1/2)| +2E|Vθ2−1|I{|Vθ2−1|>1/2}

:= R3+2E|2−1|I{|V

2

θ −1|>1/2}, where

R3=|E

(Φ(x/Vθ)−Φ(x))I{|2−1| ≤1/2}

|. By monotonicity, it is easy to see that

|(1+x)−1/21+x/2|

x2 ≤c0:=4

p

25 (3.4)

for|x| ≤1/2. Hence, assuming|V2

θ −1| ≤1/2 1/Vθ= (1+2−1)−

1/2=1

−(1/2)(Vθ21) +γ1(Vθ21)2 with|γ1| ≤c0. A Taylor expansion ofΦyields

Φ(x/Vθ)−Φ(x)

= (x)(1/Vθ−1) + (1/2)x2(1/Vθ−1)2φ′(xγ2)

= (x)n(1/2)(V2

θ −1) +γ1(Vθ2−1)2 o

+(1/2)x2φ′(xγ2) (V

2

θ −1)2 (Vθ(Vθ+1))2 where(2/3)1/2γ2≤

p

2 whenever|Vθ21| ≤1/2. Let

c1=sup x∈R|

(x)|= p1 2πe

−1/20.24198

and

sup

x

sup (2/3)1/2γ

2≤21/2

|x2φ(xγ 2)|

= sup

x

sup (2/3)1/2

γ2≤21/2

|x3γ2φ(xγ2)|

= sup

x

sup (2/3)1/2γ

2≤21/2

γ22|2|3φ(xγ2)

≤ 3 2supx |

x|3φ(x) = 3(3/e)

3/2

(7)

SinceE(V2 Collecting the bounds above yields

|P(Wx)P(Zx)|

Lastly, (1.7) follows from (1.6) and the fact that

4.2E|Vθ21|I{|Vθ21|>1/2}+0.4E(Vθ21)2I{|Vθ21| ≤1/2}

Proof of Theorem 2.1. LetYbe distributed asCn

(8)

For 0< r < 1/2, we apply the following exponential inequality for non-negative independent random variables (see, for example, Theorem 2.19 in[7]): Forξi, 1i n, independent non-negative random variables with a:=Pn

i=1Eξiand b2:=

Pn

i=12i <, and any0<x<a,

P(

n

X

i=1

ξix)exp(a−x)

2

2b2

. (3.7)

Letc=E(G1)/(2(EG1+2pVar(G1))). Observe that

E ‚

G1 G1,n

Œ2r

E ‚

G1

G1,nI{G1/G1,n≥c/n} Œ2r

≥ (c/n)2rP(G1/G1,n≥c/n)

and

P(G1/G1,nc/n)

P((n1)G1≥c(G1,n−G1))

P(G1E(G1)/2,G1,nG1(n1)(EG1+2pVar(G1))) = 1−P(G1<E(G1)/2)1−P(G1,nG1>(n−1)(EG1+2

p

Var(G1)))

≥ 1−exp((EG1)2/(8EG2 1))

(1 1

4(n1)),

obtaining the final inequality by applying (3.7) with n = 1 to the first factor and Chebyshev’s inequality to the second. This proves(3.5).

AsCn

p,F is coordinate symmetric with exchangeable coordinates, we apply Theorem 1.1 with= vnas in (3.6), and claim that it suffices to show

E(G1/G1,n)3r=O(n−3r) (3.8)

and

E(Vθ21)2=O(

n

X

i=1

θi4). (3.9)

In particular, regarding the second term in (1.7), we have by (3.5) and (3.8)

n

X

i=1

|θi|3E|X

i|3=vn−3 n

X

i=1

|θi|3E( Gi

G1,n

)3r=O( n

X

i=1

|θi|3),

which dominates (3.9), the order of the first term in (1.7), thus yielding the theorem.

Lettingµ=EG1, the main idea is to use (i) thatG1,n/nis close toµwith probability one by the

law of large numbers; and (ii) the Taylor expansions

(1+x)−2r=12r x+γ1x2 forx>1/2 (3.10) and

(1+x)−2r=1+γ2x forx>−1/2 (3.11)

where|γ1| ≤r(2r+1)22r+2and|γ

(9)

We first show that

(nµ)2rE(G1/G1,n)2r=EG12r+O(n−1). (3.12) Let∆n= (G1,n−)/(nµ)and write

G1,n=(1+ ∆n).

Then

(nµ)2rE(G1/G1,n)2r = (nµ)2rE(G1/G1,n)2rI{G1,n≤nµ/2}

+(nµ)2rE(G1/G1,n)2rI{G1,n>nµ/2} = (nµ)2rE(G

1/G1,n)2rI{G1,n≤nµ/2}

+E(G12r(1+ ∆n)−2rI{n>1/2}

:= R4+R5. (3.13)

By(3.7), we have

P(G1,nnµ/2)exp

−(nµ/2)

2

2nEG2 1

=exp

2

8EG2 1

(3.14)

and hence

R4(nµ)2rP(G1,nnµ/2) =O(n−2). (3.15) By(3.10),

R5 = E(G12r(1−2r∆n+γ1∆2n)I{∆n>−1/2}

= E(G12r(12r∆n)E(G12r(12r∆n)I{n≤ −1/2} +EG2r1 γ1∆2nI{n>1/2}

= EG12r2r EG12r(G1,n)/(nµ)E(G12r(12r∆n)I{n≤ −1/2} +EG2r1 γ1∆2nI{n>1/2}

= EG12r2r EG12r(G1µ)/(nµ) +R5,1, (3.16) where

R5,1=−E(G2r1 (1−2r∆n)I{∆n≤ −1/2}+EG2r1 γ1∆2nI{∆n>−1/2}.

Applying Hölder’s inequality to the first term inR5,1, and that∆n≥ −1, yields |R5,1| ≤ E(G2r1 (1+2r)I{∆n≤ −1/2}+O(1)EG12r∆

2

n (3.17)

≤ (1+2r)EG12r(2r+2)/(2r+1)(2r+1)/(2+2r)P1/(2+2r)(∆n≤ −1/2) +O(1)EG2r

1 ∆

2 n

= O(1)P1/(2+2r)(∆n≤ −1/2)

+O(1)(nµ)−2EG12r(G1µ)2+EG12rE(

n

X

i=2

(Giµ)2

= O(n−1), (3.18)

(10)

As to(3.8), again applying(3.14), we have

n3rE(G1/G1,n)3r = n3rE(G1/G1,n)3rI{G1,n≤nµ/2}

+n3rE(G1/G1,n)3rI{G1,n>nµ/2}

n3rP(G1,nnµ/2) +EG13r/(µ/2)3r

= O(1). Now, to prove(3.9), write

Vθ21= (Vθ21)I{G1,nnµ/2}+ (Vθ21)I{G1,n>nµ/2}. (3.19) Note thatVθ2=O(n2r)by(3.5). Similarly to(3.15), by(3.14)again

E(Vθ21)2I{G1,n≤nµ/2}=O(n4r)P(G1,n≤nµ/2) =O(n−1). (3.20)

For the next term in (3.19) observe that

(Vθ21)I{G1,n>nµ/2} (3.21)

From(3.12)it follows that

(nµ)−2r

(11)

Hence

1 <∞, using the Cauchy-Schwarz inequality for

the second step, we have

ER27 = O(1)E∆2

[1] Anttila, M., Ball, K., and Perissinaki, I. (2003) The central limit problem for convex bodies. Trans. Amer. Math. Soc., 355 (12): 4723-4735 (electronic). MR1997580

[2] Bobkov, S. (2003) On concentration of distributions of random weighted sums.Ann. Probab. 31, 195-215. MR1959791

[3] Chistyakov, G.P. and Götze, F. (2003). Moderate deviations for Student’s statistic.Theory Probability Appl.47, 415-428. MR1975426

(12)

[5] Goldstein, L. (2007)L1bounds in normal approximation.Ann. Probab.35, pp. 1888-1930.

MR2349578

[6] Klartag, B. (2009). A Berry-Esseen type inequality for convex bodies with an unconditional basis.Probab. Theory Related Fields45, 1-33. MR2520120

[7] Lai, T.L., de la Pena, V., and Shao, Q-M. (2009) Self-normalized processes: Theory and Statistical Applications. Springer, New York. MR2488094

[8] Meckes, M. and Meckes, E. (2007). The central limit problem for random vectors with sym-metries.J. Theoret. Probab.20, 697-720. MR2359052

[9] Naor, A. and Romik, D. (2003) Projecting the surface measure of the sphere ofℓn

p.Ann. Inst.

H. Poincaré Probab. Statist.39, 241-261.

[10] Schechtman, G., Zinn, J. (1990) On the volume of the intersection of two ℓn

p balls.Proc.

Amer. Math. Soc.,110, 217-224. MR1015684

[11] Shevtsova, I. G. (2006) Sharpening the upper bound for the absolute constant in the Berry-Esseen inequality. (Russian)Teor. Veroyatn. Primen. 51622-626; translation in The-ory Probab. Appl.51, (2007), 549-553. MR2325552

Referensi

Dokumen terkait

This mostly expository note surveys and recovers a lower bound for the number of distinct eigenvalues of real symmetric matrices associated with a graph.. The relation is

For normal matrices in WPFn and those in WPFn for which the spectral radius is simple, we present approximating matrices that satisfy the strong P-F property and are polynomials

The main result of this paper is the following local limit theorem for the joint distribution of the vector ( |C1| ,. , |C k | ) in the Erd˝ os-Rényi random graph:.. Theorem 1.1

At the end of this note we show that (2) is not true in three dimensions, i.e., with probability one, the cut points of two independent three dimensional simple random walks

We present a short probabilistic proof of a weak convergence result for the number of cuts needed to isolate the root of a random recursive tree.. The proof is based on a

Key words: Large dimensional random matrix, eigenvalues, circulant matrix, symmetric circu- lant matrix, reverse circulant matrix, k circulant matrix, empirical spectral

With this choice of ζ, the genealogical tree is the CRT (see [1]), and the Brownian snake can be seen as an embedding of the CRT in R d. We can now give the definition of the ISE

In this paper, we consider Riemannian manifolds admitting a semi-symmetric non-metric connection such that U is a unit parallel vector field with respect to the Levi-Civita