ELECTRONIC COMMUNICATIONS in PROBABILITY
BERRY-ESSEEN BOUNDS FOR PROJECTIONS OF COORDINATE
SYMMETRIC RANDOM VECTORS
LARRY GOLDSTEIN
Department of Mathematics, University of Southern California, Los Angeles, CA 90089-2532, USA
email: [email protected]
QI-MAN SHAO1
Department of Mathematics, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
email: [email protected]
¹Partially supported by Hong Kong RGC CERG 602206 and 602608.
Submitted May 16, 2009, accepted in final form October 23, 2009.
AMS 2000 Subject classification: 60F05, 60D05
Keywords: Normal approximation, convex bodies
Abstract
For a coordinate symmetric random vector (Y_1, . . . , Y_n) = Y ∈ R^n, that is, one satisfying (Y_1, . . . , Y_n) =_d (e_1Y_1, . . . , e_nY_n) for all (e_1, . . . , e_n) ∈ {−1, 1}^n, for which P(Y_i = 0) = 0 for all i = 1, 2, . . . , n, the following Berry-Esseen bound to the cumulative standard normal Φ for the standardized projection W_θ = Y_θ/v_θ of Y holds:

    sup_{x∈R} |P(W_θ ≤ x) − Φ(x)| ≤ 2 Σ_{i=1}^n |θ_i|^3 E|X_i|^3 + 8.4 E(V_θ^2 − 1)^2,

where Y_θ = θ·Y is the projection of Y in direction θ ∈ R^n with ||θ|| = 1, v_θ = √Var(Y_θ), X_i = |Y_i|/v_θ and V_θ^2 = Σ_{i=1}^n θ_i^2 X_i^2. As such coordinate symmetry arises in the study of projections of vectors chosen uniformly from the surface of convex bodies which have symmetries with respect to the coordinate planes, the main result is applied to a class of coordinate symmetric vectors which includes cone measure C_p^n on the ℓ_p^n sphere as a special case, resulting in a bound of order Σ_{i=1}^n |θ_i|^3.
1  Introduction and main result
Properties of the distributions of vectors uniformly distributed over the surface, or interior, of compact, convex bodies, such as the unit sphere in R^n, have been well studied. When the convex body has symmetry with respect to all n coordinate planes, a vector Y chosen uniformly from its surface satisfies

    (Y_1, . . . , Y_n) =_d (e_1Y_1, . . . , e_nY_n)   for all (e_1, . . . , e_n) ∈ {−1, 1}^n
and is said to be coordinate symmetric. Projections

    Y_θ = θ·Y = Σ_{i=1}^n θ_iY_i        (1.1)

of Y along θ ∈ R^n with ||θ|| = 1 have generated special interest, and in many cases normal approximations, and error bounds, can be derived for W_θ, the projection Y_θ standardized to have mean zero and variance 1. In this note we show that when a random vector is coordinate symmetric, even though its components may be dependent, results for independent random variables may be applied to derive error bounds to the normal for its standardized projection. Bounds in the Kolmogorov and total variation metric for projections of vectors with symmetries are given also in [8], but the bounds are not optimal; the bounds provided here, in particular those in Theorem 2.1 for the normalized projections of the generalization C_{p,F}^n of cone measure, are of order Σ_{i=1}^n |θ_i|^3. In related work, many authors study the measure of the set of directions on the unit sphere along which projections are approximately normally distributed, but in most cases bounds are not provided; see in particular [12], [1] and [2]. One exception is [6], where the surprising order Σ_{i=1}^n |θ_i|^4 is obtained under the additional assumption that a joint density function of Y exists, and is log-concave.
When the components Y_1, . . . , Y_n of a coordinate symmetric vector Y have finite variances v_1^2, . . . , v_n^2, respectively, it follows easily from Y_i =_d −Y_i and (Y_i, Y_j) =_d (−Y_i, Y_j) for i ≠ j ∈ {1, . . . , n} that

    EY_i = 0   and   EY_iY_j = v_i^2 δ_ij,

and hence, that

    EY_θ = 0   and   Var(Y_θ) = v_θ^2   where   v_θ^2 = Σ_{i=1}^n θ_i^2 v_i^2.

Standardizing to variance 1, write

    W_θ = Y_θ/v_θ   and   X_i = |Y_i|/v_θ.        (1.2)

When v_i^2 = v^2 is constant in i, then v_θ^2 = v^2, the common variance of the components, for all θ with ||θ|| = 1.
One conclusion of Theorem 1.1 gives a Kolmogorov distance bound between the standardized projection W_θ and the normal in terms of expectations of functions of V_θ^2 = Σ_{i=1}^n θ_i^2 X_i^2 and Σ_{i=1}^n |θ_i|^3 |X_i|^3. We apply Theorem 1.1 to standardized projections of a family of coordinate symmetric random vectors, generalizing cone measure C_p^n on the sphere S(ℓ_p^n), defined as follows. With p > 0, let
    S(ℓ_p^n) = {x ∈ R^n : Σ_{i=1}^n |x_i|^p = 1}   and   B(ℓ_p^n) = {x ∈ R^n : Σ_{i=1}^n |x_i|^p ≤ 1}.

With µ_n Lebesgue measure on R^n, the cone measure of A ⊂ S(ℓ_p^n) is given by

    C_p^n(A) = µ_n([0, 1]A) / µ_n(B(ℓ_p^n))   where   [0, 1]A = {ta : a ∈ A, t ∈ [0, 1]}.        (1.3)
In [5] an L_1 bound between the standardized variable W_θ in (1.2) and the normal is obtained when Y has the cone measure distribution. Here an application of Theorem 1.1 yields Theorem 2.1, which gives Kolmogorov distance bounds of the order Σ_{i=1}^n |θ_i|^3 for a class of distributions C_{p,F}^n which includes cone measure as a special case.
We note that if θ ∈ R^n satisfies ||θ|| = 1, then Hölder's inequality with 1/s + 1/t = 1 yields Σ_{i=1}^n |θ_i|^3 ≥ n^{−1/2}, so that bounds of this order are smallest when θ_i = n^{−1/2} for all i, that is, when Y_θ is a multiple of the sum of the coordinates of Y.
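In detail, taking s = 3/2 and t = 3 (one admissible choice of exponents) gives

    1 = Σ_{i=1}^n θ_i^2 · 1 ≤ (Σ_{i=1}^n |θ_i|^3)^{2/3} (Σ_{i=1}^n 1)^{1/3} = (Σ_{i=1}^n |θ_i|^3)^{2/3} n^{1/3},

so that Σ_{i=1}^n |θ_i|^3 ≥ n^{−1/2}, with equality exactly when |θ_i| = n^{−1/2} for all i.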
We have the following simple yet crucial result, shown in Section 3.

LEMMA 1.1. Let Y be a coordinate symmetric random vector in R^n such that P(Y_i = 0) = 0 for all i = 1, 2, . . . , n, and let ǫ_i = sign(Y_i), the sign of Y_i. Then the signs ǫ_1, . . . , ǫ_n of the coordinates Y_1, . . . , Y_n are i.i.d. variables taking values uniformly in {−1, 1}, and

    (ǫ_1, . . . , ǫ_n)   and   (|Y_1|, . . . , |Y_n|)   are independent.
The independence property provided by Lemma 1.1 is the key ingredient in the following theorem.

THEOREM 1.1. Let Y = (Y_1, . . . , Y_n) be a coordinate symmetric random vector in R^n whose components have finite variances and satisfy P(Y_i = 0) = 0 for all i = 1, 2, . . . , n. Then, for any θ ∈ R^n with ||θ|| = 1, with W_θ and X_i as in (1.2) and V_θ^2 = Σ_{i=1}^n θ_i^2 X_i^2,

    sup_{x∈R} |P(W_θ ≤ x) − Φ(x)| ≤ 4.2 E|V_θ^2 − 1| I{|V_θ^2 − 1| > 1/2} + 0.4 E(V_θ^2 − 1)^2 I{|V_θ^2 − 1| ≤ 1/2} + 2 Σ_{i=1}^n |θ_i|^3 E|X_i|^3,        (1.6)

and consequently

    sup_{x∈R} |P(W_θ ≤ x) − Φ(x)| ≤ 8.4 E(V_θ^2 − 1)^2 + 2 Σ_{i=1}^n |θ_i|^3 E|X_i|^3.        (1.7)
We remark that in related work, Theorem 4 in [3] gives an exponential non-uniform Berry-Esseen bound for Studentized, that is, self-normalized, sums.
A simplification of the bounds in Theorem 1.1 results when Y has the 'square negative correlation property,' see [9], that is, when

    E(X_i^2 X_j^2) ≤ EX_i^2 EX_j^2   for all 1 ≤ i ≠ j ≤ n,        (1.8)

as then

    E(V_θ^2 − 1)^2 ≤ Σ_{i=1}^n θ_i^4 Var(X_i^2),

and hence the first term on the right hand side of (1.7) can be replaced by 8.4 Σ_{i=1}^n θ_i^4 Var(X_i^2).
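In more detail, since E V_θ^2 = Σ_{i=1}^n θ_i^2 EX_i^2 = 1, the displayed inequality is a direct variance computation:

    E(V_θ^2 − 1)^2 = Var(V_θ^2) = Σ_{i=1}^n θ_i^4 Var(X_i^2) + Σ_{i≠j} θ_i^2 θ_j^2 Cov(X_i^2, X_j^2) ≤ Σ_{i=1}^n θ_i^4 Var(X_i^2),

the covariance terms being nonpositive under (1.8).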
Proposition 3 of [9] shows that cone measure C_p^n satisfies a correlation condition much stronger than (1.8); see also [1] regarding negative correlation in the interior of B(ℓ_p^n).
2  Application
One application of Theorem 1.1 concerns the following generalization of cone measure C_p^n. Let n ≥ 2 and G_1, . . . , G_n be i.i.d. nontrivial, positive random variables with distribution function F, and set

    G_{1,n} = Σ_{i=1}^n G_i.
In addition, let ǫ_1, . . . , ǫ_n be i.i.d. random variables, independent of G_1, . . . , G_n, taking values uniformly in {−1, 1}. Let C_{p,F}^n be the distribution of the vector

    Y = (ǫ_1(G_1/G_{1,n})^{1/p}, . . . , ǫ_n(G_n/G_{1,n})^{1/p}).        (2.1)
By results in [10], for instance, cone measure C_p^n as given in (1.3) is the special case when F is the Gamma distribution Γ(1/p, 1).
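As an illustration only, and not part of the original argument, the representation (2.1) can be used directly to simulate from C_{p,F}^n; the short Python sketch below (the helper name sample_W_theta is ours, and NumPy is assumed to be available) draws from cone measure C_p^n by taking F to be the Gamma(1/p, 1) distribution, and returns standardized projections W_θ as in (1.2), with v_θ^2 = E(G_1/G_{1,n})^{2/p} estimated by Monte Carlo.

import numpy as np

def sample_W_theta(theta, p, n_samples, seed=None):
    """Draw standardized projections W_theta for cone measure C^n_p via (2.1).

    Uses Y_i = eps_i * (G_i / G_{1,n})^(1/p) with G_1, ..., G_n i.i.d.
    Gamma(1/p, 1) and eps_1, ..., eps_n i.i.d. uniform on {-1, +1}.
    """
    rng = np.random.default_rng(seed)
    theta = np.asarray(theta, dtype=float)
    n = theta.size
    G = rng.gamma(shape=1.0 / p, scale=1.0, size=(n_samples, n))   # G_1, ..., G_n
    eps = rng.choice([-1.0, 1.0], size=(n_samples, n))             # coordinate signs
    ratios = G / G.sum(axis=1, keepdims=True)                      # G_i / G_{1,n}
    Y = eps * ratios ** (1.0 / p)                                  # Y has law C^n_p
    Y_theta = Y @ theta                                            # projection theta . Y
    v2 = np.mean(ratios ** (2.0 / p))                              # estimate of v_theta^2 = E (G_1/G_{1,n})^{2/p}
    return Y_theta / np.sqrt(v2)                                   # W_theta = Y_theta / v_theta

For instance, with θ = (n^{−1/2}, . . . , n^{−1/2}) and moderate n, a histogram of these samples is already close to the standard normal density, in line with Theorem 2.1 below.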
THEOREM 2.1. Let Y have distribution C_{p,F}^n given by (2.1) with p > 0 and F for which EG_1^{2+4/p} < ∞ when G_1 is distributed according to F. Then there exists a constant c_{p,F} depending on p and F such that for all θ ∈ R^n for which ||θ|| = 1 we have

    sup_{x∈R} |P(W_θ ≤ x) − Φ(x)| ≤ c_{p,F} Σ_{i=1}^n |θ_i|^3,        (2.2)

where

    W_θ = Y_θ/v_θ   with   Y_θ = θ·Y   and   v_θ^2 = E(G_1/G_{1,n})^{2/p}.

As the Gamma distribution Γ(1/p, 1) has moments of all orders, the conclusion of Theorem 2.1 holds, in particular, for cone measure C_p^n.
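Again purely as an illustration, one may compare a Monte Carlo estimate of the left hand side of (2.2) with Σ_{i=1}^n |θ_i|^3, using the sampler sketched after (2.1); the constant c_{p,F} is not explicit, so only the order of the bound can be examined. The sketch below assumes NumPy and SciPy are available.

import numpy as np
from scipy.stats import norm

def kolmogorov_distance_to_normal(w):
    """Empirical sup_x |P(W <= x) - Phi(x)| computed over the sample points."""
    w = np.sort(np.asarray(w))
    n = w.size
    cdf = norm.cdf(w)
    upper = np.arange(1, n + 1) / n          # empirical CDF just after each sample point
    lower = np.arange(0, n) / n              # empirical CDF just before each sample point
    return max(np.max(np.abs(upper - cdf)), np.max(np.abs(lower - cdf)))

theta = np.ones(20) / np.sqrt(20.0)          # a unit direction, here theta_i = n^{-1/2}
w = sample_W_theta(theta, p=1.0, n_samples=200000, seed=0)
print(kolmogorov_distance_to_normal(w), np.sum(np.abs(theta) ** 3))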
3  Proofs
Proof of Lemma 1.1. Fix e_1, . . . , e_n ∈ {−1, 1} and measurable sets A_1, . . . , A_n ⊂ [0, ∞). Then, using the coordinate symmetry property to obtain the fourth equality, we have

    P(ǫ_1 = e_1, . . . , ǫ_n = e_n, |Y_1| ∈ A_1, . . . , |Y_n| ∈ A_n)
      = P(ǫ_1 = e_1, . . . , ǫ_n = e_n, ǫ_1Y_1 ∈ A_1, . . . , ǫ_nY_n ∈ A_n)
      = P(e_1Y_1 ∈ A_1, . . . , e_nY_n ∈ A_n)
      = P(Y_1 ∈ e_1A_1, . . . , Y_n ∈ e_nA_n)
      = (1/2^n) Σ_{(γ_1,...,γ_n)∈{−1,1}^n} P(Y_1 ∈ γ_1A_1, . . . , Y_n ∈ γ_nA_n)
      = (Π_{i=1}^n P(ǫ_i = e_i)) P(|Y_1| ∈ A_1, . . . , |Y_n| ∈ A_n).
Before proving Theorem 1.1, we invoke the following well-known Berry-Esseen bound for independent random variables (see [11]): if ξ_1, . . . , ξ_n are independent random variables satisfying Eξ_i = 0, E|ξ_i|^3 < ∞ for 1 ≤ i ≤ n and Σ_{i=1}^n Eξ_i^2 = 1, then

    sup_{x∈R} |P(Σ_{i=1}^n ξ_i ≤ x) − Φ(x)| ≤ min(1, 0.7056 Σ_{i=1}^n E|ξ_i|^3).
In particular, if ǫ_1, . . . , ǫ_n are independent random variables taking the values −1, +1 with equal probability, and b_1, . . . , b_n are any nonzero constants, then W = Σ_{i=1}^n b_iǫ_i satisfies

    sup_{x∈R} |P(W ≤ x) − Φ(x/V)| ≤ min(1, 0.7056 Σ_{i=1}^n |b_i|^3 / V^3),        (3.1)

where V^2 = Σ_{i=1}^n b_i^2.
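To obtain (3.1) from the bound just quoted, take ξ_i = b_iǫ_i/V; then Eξ_i = 0, Σ_{i=1}^n Eξ_i^2 = Σ_{i=1}^n b_i^2/V^2 = 1 and E|ξ_i|^3 = |b_i|^3/V^3, while P(W ≤ x) = P(Σ_{i=1}^n ξ_i ≤ x/V), so evaluating the quoted bound at x/V in place of x yields (3.1).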
Proof of Theorem 1.1. By Lemma 1.1, recalling X_i = |Y_i|/v_θ, we may write

    W_θ = Σ_{i=1}^n ǫ_iθ_iX_i,

where {ǫ_i, 1 ≤ i ≤ n} is a collection of i.i.d. random variables with P(ǫ_i = 1) = P(ǫ_i = −1) = 1/2, independent of X_1, . . . , X_n. Note that, by construction, Σ_{i=1}^n θ_i^2 EX_i^2 = 1.
Now, with Z a standard normal random variable,

    P(W_θ ≤ x) − P(Z ≤ x) = E{P(W_θ ≤ x | {X_i}_{1≤i≤n}) − Φ(x/V_θ)} + E{Φ(x/V_θ) − Φ(x)} := R_1 + R_2.        (3.2)
By (3.1),

    |R_1| ≤ E{min(1, 0.7056 Σ_{i=1}^n |θ_i|^3|X_i|^3 / V_θ^3)}
          ≤ P(V_θ^2 < 1/2) + 0.7056 · 2^{3/2} Σ_{i=1}^n |θ_i|^3 E|X_i|^3
          ≤ 2 E|V_θ^2 − 1| I{|V_θ^2 − 1| > 1/2} + 2 Σ_{i=1}^n |θ_i|^3 E|X_i|^3.
As to R_2, letting Z ∼ N(0, 1) be independent of V_θ, we have

    |R_2| = |P(Z ≤ x/V_θ) − P(Z ≤ x)|
          ≤ |P(Z ≤ x/V_θ, |V_θ^2 − 1| ≤ 1/2) − P(Z ≤ x, |V_θ^2 − 1| ≤ 1/2)| + |P(Z ≤ x/V_θ, |V_θ^2 − 1| > 1/2) − P(Z ≤ x, |V_θ^2 − 1| > 1/2)|
          ≤ |P(Z ≤ x/V_θ, |V_θ^2 − 1| ≤ 1/2) − P(Z ≤ x, |V_θ^2 − 1| ≤ 1/2)| + P(|V_θ^2 − 1| > 1/2)
          ≤ |P(Z ≤ x/V_θ, |V_θ^2 − 1| ≤ 1/2) − P(Z ≤ x, |V_θ^2 − 1| ≤ 1/2)| + 2 E|V_θ^2 − 1| I{|V_θ^2 − 1| > 1/2}
          := R_3 + 2 E|V_θ^2 − 1| I{|V_θ^2 − 1| > 1/2},

where

    R_3 = |E{(Φ(x/V_θ) − Φ(x)) I{|V_θ^2 − 1| ≤ 1/2}}|.

By monotonicity, it is easy to see that, for |x| ≤ 1/2,
    |(1 + x)^{−1/2} − 1 + x/2| / x^2 ≤ c_0 := 4√2 − 5.        (3.4)
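The value of c_0 can be checked directly: by the monotonicity just cited, the left hand side of (3.4) is largest over [−1/2, 1/2] at the endpoint x = −1/2, where it equals

    ((1/2)^{−1/2} − 1 − 1/4)/(1/4) = 4(√2 − 5/4) = 4√2 − 5 ≈ 0.657.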
Hence, assuming |V_θ^2 − 1| ≤ 1/2,

    1/V_θ = (1 + (V_θ^2 − 1))^{−1/2} = 1 − (1/2)(V_θ^2 − 1) + γ_1(V_θ^2 − 1)^2   with |γ_1| ≤ c_0.

A Taylor expansion of Φ yields

    Φ(x/V_θ) − Φ(x) = xφ(x)(1/V_θ − 1) + (1/2)x^2(1/V_θ − 1)^2 φ′(xγ_2)
                    = xφ(x){−(1/2)(V_θ^2 − 1) + γ_1(V_θ^2 − 1)^2} + (1/2)x^2 φ′(xγ_2) (V_θ^2 − 1)^2 / (V_θ(V_θ + 1))^2,

where (2/3)^{1/2} ≤ γ_2 ≤ √2 whenever |V_θ^2 − 1| ≤ 1/2.
Let

    c_1 = sup_{x∈R} |xφ(x)| = (2π)^{−1/2} e^{−1/2} ≤ 0.24198

and

    sup_x sup_{(2/3)^{1/2} ≤ γ_2 ≤ 2^{1/2}} |x^2 φ′(xγ_2)|
      = sup_x sup_{(2/3)^{1/2} ≤ γ_2 ≤ 2^{1/2}} |x^3 γ_2 φ(xγ_2)|
      = sup_x sup_{(2/3)^{1/2} ≤ γ_2 ≤ 2^{1/2}} γ_2^{−2} |xγ_2|^3 φ(xγ_2)
      ≤ (3/2) sup_x |x|^3 φ(x) = 3(3/e)^{3/2}/(2√(2π)).
Since E(V_θ^2 − 1) = 0, we also have |E(V_θ^2 − 1) I{|V_θ^2 − 1| ≤ 1/2}| = |E(V_θ^2 − 1) I{|V_θ^2 − 1| > 1/2}| ≤ E|V_θ^2 − 1| I{|V_θ^2 − 1| > 1/2}. Collecting the bounds above yields

    |P(W_θ ≤ x) − P(Z ≤ x)| ≤ 4.2 E|V_θ^2 − 1| I{|V_θ^2 − 1| > 1/2} + 0.4 E(V_θ^2 − 1)^2 I{|V_θ^2 − 1| ≤ 1/2} + 2 Σ_{i=1}^n |θ_i|^3 E|X_i|^3,

which is (1.6).
Lastly, (1.7) follows from (1.6) and the fact that

    4.2 E|V_θ^2 − 1| I{|V_θ^2 − 1| > 1/2} + 0.4 E(V_θ^2 − 1)^2 I{|V_θ^2 − 1| ≤ 1/2} ≤ 8.4 E(V_θ^2 − 1)^2,

which holds since |V_θ^2 − 1| ≤ 2(V_θ^2 − 1)^2 on the event {|V_θ^2 − 1| > 1/2}.
Proof of Theorem 2.1. Let Y be distributed as C_{p,F}^n given by (2.1), and set r = 1/p. We first show that

    E(G_1/G_{1,n})^{2r} ≥ C n^{−2r}   for some constant C > 0 depending only on p and F.        (3.5)

Since the coordinates of Y are exchangeable, each has the same variance

    v_n^2 = E(G_1/G_{1,n})^{2r}.        (3.6)

When r ≥ 1/2, (3.5) follows directly from Jensen's inequality and E(G_1/G_{1,n}) = 1/n.
For 0 < r < 1/2, we apply the following exponential inequality for non-negative independent random variables (see, for example, Theorem 2.19 in [7]): for ξ_i, 1 ≤ i ≤ n, independent non-negative random variables with a := Σ_{i=1}^n Eξ_i and b^2 := Σ_{i=1}^n Eξ_i^2 < ∞, and any 0 < x < a,

    P(Σ_{i=1}^n ξ_i ≤ x) ≤ exp(−(a − x)^2 / (2b^2)).        (3.7)
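In particular, taking n = 1 and ξ_1 = G_1 in (3.7) (admissible since EG_1^2 < ∞ under the hypotheses of Theorem 2.1), with x = EG_1/2, gives

    P(G_1 ≤ EG_1/2) ≤ exp(−(EG_1)^2/(8EG_1^2)),

the bound invoked below in the proof of (3.5).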
Let c = E(G_1)/(2(EG_1 + 2√Var(G_1))). Observe that

    E(G_1/G_{1,n})^{2r} ≥ E[(G_1/G_{1,n})^{2r} I{G_1/G_{1,n} ≥ c/n}] ≥ (c/n)^{2r} P(G_1/G_{1,n} ≥ c/n)

and

    P(G_1/G_{1,n} ≥ c/n) ≥ P((n − 1)G_1 ≥ c(G_{1,n} − G_1))
      ≥ P(G_1 ≥ E(G_1)/2, G_{1,n} − G_1 ≤ (n − 1)(EG_1 + 2√Var(G_1)))
      = (1 − P(G_1 < E(G_1)/2))(1 − P(G_{1,n} − G_1 > (n − 1)(EG_1 + 2√Var(G_1))))
      ≥ (1 − exp(−(EG_1)^2/(8EG_1^2)))(1 − 1/(4(n − 1))),

obtaining the final inequality by applying (3.7) with n = 1 to the first factor and Chebyshev's inequality to the second. This proves (3.5).
As C_{p,F}^n is coordinate symmetric with exchangeable coordinates, we apply Theorem 1.1 with v_θ = v_n as in (3.6), and claim that it suffices to show

    E(G_1/G_{1,n})^{3r} = O(n^{−3r})        (3.8)

and

    E(V_θ^2 − 1)^2 = O(Σ_{i=1}^n θ_i^4).        (3.9)
In particular, regarding the second term in (1.7), we have by (3.5) and (3.8)

    Σ_{i=1}^n |θ_i|^3 E|X_i|^3 = v_n^{−3} Σ_{i=1}^n |θ_i|^3 E(G_i/G_{1,n})^{3r} = O(Σ_{i=1}^n |θ_i|^3),

which dominates (3.9), the order of the first term in (1.7), thus yielding the theorem.
Letting µ = EG_1, the main idea is to use (i) that G_{1,n}/n is close to µ with probability one by the law of large numbers; and (ii) the Taylor expansions

    (1 + x)^{−2r} = 1 − 2rx + γ_1x^2   for x > −1/2        (3.10)

and

    (1 + x)^{−2r} = 1 + γ_2x   for x > −1/2,        (3.11)

where |γ_1| ≤ r(2r + 1)2^{2r+2} and |γ_2| ≤ r2^{2r+2}.
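These remainder bounds follow from Taylor's theorem with the Lagrange form of the remainder: for (3.10), γ_1 = r(2r + 1)(1 + ξ)^{−2r−2} for some ξ between 0 and x, and (1 + ξ)^{−2r−2} ≤ 2^{2r+2} since ξ > −1/2; the bound on γ_2 in (3.11) is obtained in the same way from the first order expansion.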
We first show that

    (nµ)^{2r} E(G_1/G_{1,n})^{2r} = EG_1^{2r} + O(n^{−1}).        (3.12)

Let ∆_n = (G_{1,n} − nµ)/(nµ) and write

    G_{1,n} = nµ(1 + ∆_n).

Then

    (nµ)^{2r} E(G_1/G_{1,n})^{2r} = (nµ)^{2r} E(G_1/G_{1,n})^{2r} I{G_{1,n} ≤ nµ/2} + (nµ)^{2r} E(G_1/G_{1,n})^{2r} I{G_{1,n} > nµ/2}
      = (nµ)^{2r} E(G_1/G_{1,n})^{2r} I{G_{1,n} ≤ nµ/2} + E(G_1^{2r}(1 + ∆_n)^{−2r} I{∆_n > −1/2})
      := R_4 + R_5.        (3.13)
By (3.7), we have

    P(G_{1,n} ≤ nµ/2) ≤ exp(−(nµ/2)^2/(2nEG_1^2)) = exp(−nµ^2/(8EG_1^2))        (3.14)
and hence

    R_4 ≤ (nµ)^{2r} P(G_{1,n} ≤ nµ/2) = O(n^{−2}).        (3.15)

By (3.10),

    R_5 = E(G_1^{2r}(1 − 2r∆_n + γ_1∆_n^2) I{∆_n > −1/2})
        = E(G_1^{2r}(1 − 2r∆_n)) − E(G_1^{2r}(1 − 2r∆_n) I{∆_n ≤ −1/2}) + E(G_1^{2r} γ_1 ∆_n^2 I{∆_n > −1/2})
        = EG_1^{2r} − 2r E(G_1^{2r}(G_{1,n} − nµ))/(nµ) − E(G_1^{2r}(1 − 2r∆_n) I{∆_n ≤ −1/2}) + E(G_1^{2r} γ_1 ∆_n^2 I{∆_n > −1/2})
        = EG_1^{2r} − 2r E(G_1^{2r}(G_1 − µ))/(nµ) + R_{5,1},        (3.16)

where

    R_{5,1} = −E(G_1^{2r}(1 − 2r∆_n) I{∆_n ≤ −1/2}) + E(G_1^{2r} γ_1 ∆_n^2 I{∆_n > −1/2}).
Applying Hölder's inequality to the first term in R_{5,1}, and that ∆_n ≥ −1, yields

    |R_{5,1}| ≤ E(G_1^{2r}(1 + 2r) I{∆_n ≤ −1/2}) + O(1) E(G_1^{2r} ∆_n^2)        (3.17)
      ≤ (1 + 2r)(EG_1^{2r(2r+2)/(2r+1)})^{(2r+1)/(2+2r)} P^{1/(2+2r)}(∆_n ≤ −1/2) + O(1) E(G_1^{2r} ∆_n^2)
      = O(1) P^{1/(2+2r)}(∆_n ≤ −1/2) + O(1)(nµ)^{−2}(E(G_1^{2r}(G_1 − µ)^2) + EG_1^{2r} E(Σ_{i=2}^n (G_i − µ))^2)
      = O(n^{−1}).        (3.18)

Combining (3.13), (3.15), (3.16) and (3.18) yields (3.12).
As to (3.8), again applying (3.14), we have

    n^{3r} E(G_1/G_{1,n})^{3r} = n^{3r} E(G_1/G_{1,n})^{3r} I{G_{1,n} ≤ nµ/2} + n^{3r} E(G_1/G_{1,n})^{3r} I{G_{1,n} > nµ/2}
      ≤ n^{3r} P(G_{1,n} ≤ nµ/2) + EG_1^{3r}/(µ/2)^{3r}
      = O(1).

Now, to prove (3.9), write

    V_θ^2 − 1 = (V_θ^2 − 1) I{G_{1,n} ≤ nµ/2} + (V_θ^2 − 1) I{G_{1,n} > nµ/2}.        (3.19)

Note that V_θ^2 = O(n^{2r}) by (3.5). Similarly to (3.15), by (3.14) again,

    E(V_θ^2 − 1)^2 I{G_{1,n} ≤ nµ/2} = O(n^{4r}) P(G_{1,n} ≤ nµ/2) = O(n^{−1}).        (3.20)
For the next term in (3.19), observe that

    (V_θ^2 − 1) I{G_{1,n} > nµ/2}        (3.21)

From (3.12) it follows that

    (nµ)^{−2r}

Hence

    < ∞, using the Cauchy-Schwarz inequality for the second step, we have

    E R_7^2 = O(1) E ∆_n^2
References

[1] Anttila, M., Ball, K., and Perissinaki, I. (2003). The central limit problem for convex bodies. Trans. Amer. Math. Soc. 355 (12), 4723-4735 (electronic). MR1997580

[2] Bobkov, S. (2003). On concentration of distributions of random weighted sums. Ann. Probab. 31, 195-215. MR1959791

[3] Chistyakov, G.P. and Götze, F. (2003). Moderate deviations for Student's statistic. Theory Probab. Appl. 47, 415-428. MR1975426

[5] Goldstein, L. (2007). L1 bounds in normal approximation. Ann. Probab. 35, 1888-1930. MR2349578

[6] Klartag, B. (2009). A Berry-Esseen type inequality for convex bodies with an unconditional basis. Probab. Theory Related Fields 145, 1-33. MR2520120

[7] Lai, T.L., de la Peña, V., and Shao, Q.-M. (2009). Self-normalized Processes: Limit Theory and Statistical Applications. Springer, New York. MR2488094

[8] Meckes, M. and Meckes, E. (2007). The central limit problem for random vectors with symmetries. J. Theoret. Probab. 20, 697-720. MR2359052

[9] Naor, A. and Romik, D. (2003). Projecting the surface measure of the sphere of ℓ_p^n. Ann. Inst. H. Poincaré Probab. Statist. 39, 241-261.

[10] Schechtman, G. and Zinn, J. (1990). On the volume of the intersection of two ℓ_p^n balls. Proc. Amer. Math. Soc. 110, 217-224. MR1015684

[11] Shevtsova, I.G. (2006). Sharpening the upper bound for the absolute constant in the Berry-Esseen inequality. (Russian) Teor. Veroyatn. Primen. 51, 622-626; translation in Theory Probab. Appl. 51 (2007), 549-553. MR2325552