

in PROBABILITY

ASYMPTOTIC CONSTANTS FOR MINIMAL DISTANCE IN THE CENTRAL LIMIT THEOREM

EMMANUEL RIO

UMR 8100 CNRS, Université de Versailles Saint Quentin en Y, Bât. Fermat, 45 Avenue des Etats Unis, 78035 Versailles Cedex France

email: [email protected]

Submitted September 22, 2010, accepted in final form December 20, 2011

AMS 2000 Subject classification: 60F05

Keywords: Minimal metric, Wasserstein distance, Cornish-Fisher expansion of first order, Esseen’s mean central limit theorem, Global central limit theorem

Abstract

In this paper, we generalize the asymptotic result of Esseen (1958) concerning the Wasserstein distance of order one in the mean central limit theorem to the Wasserstein distances of order $r$ for $r$ in $]1, 2]$.

1 Introduction and main results.

In this paper we continue the research started in Rio (2009), concerning the Wasserstein distances in the central limit theorem. Let $\Omega$ be a probability space, rich enough to generate the set of probability laws on $\mathbb{R} \times \mathbb{R}$. Let $d$ be a pseudodistance on the set of real-valued random variables, such that $d(X, Y)$ depends only on the law of $(X, Y)$. Then, according to Paul Lévy (see Note B in Fréchet (1950) for this fact), the minimal distance $\hat d$ associated to $d$ is defined by

$$\hat d(\mu, \nu) = \inf \{ d(X, Y) : X \sim \mu, \ Y \sim \nu \},$$

where the infimum is taken over all random vectors $(X, Y)$ with respective marginal laws $\mu$ and $\nu$. When $E = \mathbb{R}$, $r \geq 1$ and $d(X, Y) = \|X - Y\|_r$, we denote by $W_r$ the minimal distance so defined on the space $\mathcal{M}_r$ of probability laws with a finite absolute moment of order $r$. This distance is usually called the Wasserstein distance of order $r$. The distances $W_r$ are homogeneous of degree 1.

Throughout the paper, $X_1, X_2, \ldots$ is a sequence of independent and identically distributed real-valued random variables with mean zero and finite positive variance. We set $S_n = X_1 + X_2 + \cdots + X_n$ and $v_n = \operatorname{Var} S_n$. We denote by $\mu_n$ the law of $v_n^{-1/2} S_n$ and by $\gamma_v$ the normal law with mean 0 and variance $v$. In a recent paper, Rio (2009) proved that, for $r$ in $[1, 2]$,

$$\limsup_{n \to \infty} \sqrt{n}\, W_r(\mu_n, \gamma_1) < \infty \tag{1.1}$$

as soon as $\mathbb{E}(|X_1|^{r+2}) < \infty$. An interesting problem is then to find the limit in (1.1). Let $F_n$ denote the distribution function of $\mu_n$ and $\Phi$ denote the distribution function of the standard normal.


Esseen (1958) proved that

$$\lim_{n \to \infty} \sqrt{n}\, \|F_n - \Phi\|_p = A_p(\mu_1), \tag{1.2}$$

where $A_p(\mu_1)$ is some nonnegative explicit constant. In the specific case $p = 1$,

$$\|F_n - \Phi\|_1 = W_1(\mu_n, \gamma_1).$$

Consequently Esseen's result gives the asymptotic constant in (1.1) for $r = 1$. Zolotarev (1964) provided the following representation. Let $Y$ and $U$ be independent random variables, with respective distributions the standard normal law and the uniform law over $[-1/2, 1/2]$. Then

$$A_1(\mu_1) = \|(\alpha_3/6)(1 - Y^2) + (h/\sigma) U\|_1,$$

where $\sigma$ is the standard deviation of $X_1$, $\alpha_3 = \sigma^{-3}\, \mathbb{E}(X_1^3)$ and $h$ is the span of the distribution of $X_1$ for lattice distributions and 0 otherwise. For $r > 1$, it has long been known (cf. Dall'Aglio (1956) and Fréchet (1957) for more about this) that

$$W_r(\mu_n, \gamma_1) = \|F_n^{-1} - \Phi^{-1}\|_r, \tag{1.3}$$

and consequently $W_r(\mu_n, \gamma_1) \neq \|F_n - \Phi\|_r$ in general. In the next two theorems we describe the asymptotic behaviour of $W_r(\mu_n, \gamma_1)$ as $n$ tends to $\infty$, for $r$ in $[1, 2]$. As for $r = 1$, this behaviour depends on whether or not $X_1$ has a lattice distribution.
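The quantile representation (1.3) lends itself to a direct numerical approximation. The following is a minimal Python sketch, not part of the original argument: the function name `wasserstein_quantile` and the midpoint rule with `m` nodes are illustrative assumptions, and the accuracy of the rule near the endpoints of $]0, 1[$ is not controlled here.

```python
from statistics import NormalDist


def wasserstein_quantile(inv_f, inv_g, r, m=20000):
    """Approximate ||F^{-1} - G^{-1}||_r over ]0, 1[ by a midpoint rule.

    inv_f and inv_g are quantile functions (hypothetical inputs);
    m is the number of midpoint nodes, an arbitrary illustrative choice.
    """
    total = 0.0
    for i in range(m):
        u = (i + 0.5) / m  # midpoint of the i-th cell of ]0, 1[
        total += abs(inv_f(u) - inv_g(u)) ** r
    return (total / m) ** (1.0 / r)


if __name__ == "__main__":
    # Sanity check: a shift by 1 moves every quantile by 1,
    # so the distance between N(1, 1) and N(0, 1) should be close to 1.
    shifted = NormalDist(mu=1.0, sigma=1.0)
    standard = NormalDist()
    print(wasserstein_quantile(shifted.inv_cdf, standard.inv_cdf, r=1.5))
```

The shift example is chosen because the quantile difference is constant, so the midpoint rule introduces no discretization error there.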

Theorem 1.1. Let $Y$ be a random variable with standard normal law. Set $\alpha_3 = (\mathbb{E}(X_1^2))^{-3/2}\, \mathbb{E}(X_1^3)$. Let $r$ be any real in $]1, 2]$. If the distribution of $X_1$ is not a lattice distribution and if $\mathbb{E}(|X_1|^{r+2}) < \infty$, then

$$\lim_{n \to \infty} n^{1/2}\, W_r(\mu_n, \gamma_1) = (|\alpha_3|/6)\, \|1 - Y^2\|_r.$$
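The limiting constant in Theorem 1.1 can be evaluated numerically. Below is a minimal Python sketch under stated assumptions: the helper names are hypothetical, the norm $\|1 - Y^2\|_r$ is approximated by a trapezoidal rule truncated to $[-10, 10]$, and the value $\alpha_3 = 2$ in the usage lines is only an illustration (it is the skewness of a centered exponential law, which is non lattice).

```python
from math import sqrt
from statistics import NormalDist


def norm_one_minus_y2(r, half_width=10.0, steps=20000):
    """Approximate ||1 - Y^2||_r for Y standard normal.

    Trapezoidal rule for E|1 - Y^2|^r on [-half_width, half_width];
    the truncation and the step count are illustrative assumptions.
    """
    pdf = NormalDist().pdf
    step = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        z = -half_width + i * step
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * abs(1.0 - z * z) ** r * pdf(z)
    return (total * step) ** (1.0 / r)


def theorem_1_1_constant(alpha3, r):
    """Constant (|alpha3| / 6) * ||1 - Y^2||_r of Theorem 1.1."""
    return abs(alpha3) / 6.0 * norm_one_minus_y2(r)


if __name__ == "__main__":
    # For r = 2, E(1 - Y^2)^2 = 1 - 2 + 3 = 2, so the norm is sqrt(2).
    print(norm_one_minus_y2(2.0), sqrt(2.0))
    print(theorem_1_1_constant(alpha3=2.0, r=1.5))
```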

We now state the results for lattice distributions. We will consider either smoothed sums or unsmoothed sums. For lattice distributions taking values in the arithmetic progression $\{a + kh : k \in \mathbb{Z}\}$ ($h$ being maximal), the smoothed sums $\bar S_n$ are defined by

$$\bar S_n = S_n + hU,$$

where $U$ is a random variable with uniform distribution over $[-1/2, 1/2]$, independent of $S_n$. We denote by $\bar\mu_n$ the law of $v_n^{-1/2} \bar S_n$. Theorem 1.2 gives the asymptotic constants for smoothed or unsmoothed sums.

Theorem 1.2. Let $X_1, X_2, \ldots$ be centered, independent and identically distributed lattice random variables with variance $\sigma^2$, taking values in the arithmetic progression $\{a + kh : k \in \mathbb{Z}\}$ ($h$ being maximal). Let $Y$ and $U$ be two independent random variables, with respective laws the standard normal law and the uniform law over $[-1/2, 1/2]$. Let $r$ be any real in $]1, 2]$. Assume that $\mathbb{E}(|X_1|^{r+2}) < \infty$. Then

$$\lim_{n \to \infty} n^{1/2}\, W_r(\bar\mu_n, \gamma_1) = (|\alpha_3|/6)\, \|1 - Y^2\|_r \tag{a}$$

and

$$\lim_{n \to \infty} n^{1/2}\, W_r(\mu_n, \gamma_1) = \|(\alpha_3/6)(1 - Y^2) + (h/\sigma) U\|_r. \tag{b}$$

Remark 1.1. The constants appearing here are represented as in Zolotarev (1964), and consequently (b) still holds for $r = 1$. Proceeding as in Esseen (1958) one can prove that (a) is true in the case $r = 1$.

In the specific case $r = 2$, the asymptotic constant can be computed more explicitly. Let us state the corresponding result.

Corollary 1.3. Let $X_1, X_2, \ldots$ be centered, independent and identically distributed lattice random variables with variance $\sigma^2$ and finite moment of order 4, taking values in the arithmetic progression $\{a + kh : k \in \mathbb{Z}\}$ ($h$ being maximal). Then, with the same notations as in Theorems 1.1 and 1.2,

$$\lim_{n \to \infty} n^{1/2}\, W_2(\mu_n, \gamma_1) = \frac{1}{6} \sqrt{3 (h/\sigma)^2 + 2 \alpha_3^2}.$$

Remark 1.2. In the non lattice case, as shown by Theorem 1.1, the above result holds with $h = 0$.

Example 1: symmetric sign. Assume that the random variables are symmetric signs, that is $\mathbb{P}(X_1 = 1) = \mathbb{P}(X_1 = -1) = 1/2$. In that case $h = 2$, $\sigma = 1$ and $\alpha_3 = 0$. Then the asymptotic constant in Corollary 1.3 is $1/\sqrt{3}$.

Example 2: Poisson distribution. Let $\lambda > 0$ and assume that $X_1 + \lambda$ has the Poisson distribution $\mathcal{P}(\lambda)$. Then $h = 1$, $\sigma^2 = \lambda$ and $\alpha_3 = \lambda^{-1/2}$. In that case Corollary 1.3 gives the asymptotic constant $\frac{1}{6} (5/\lambda)^{1/2}$.
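The two examples can be checked by evaluating the closed form of Corollary 1.3 directly. The short Python sketch below does only that; the function name `corollary_1_3_constant` and the choice $\lambda = 4$ are illustrative assumptions.

```python
from math import isclose, sqrt


def corollary_1_3_constant(h, sigma, alpha3):
    """Closed form (1/6) * sqrt(3 (h/sigma)^2 + 2 alpha3^2) of Corollary 1.3."""
    return sqrt(3.0 * (h / sigma) ** 2 + 2.0 * alpha3 ** 2) / 6.0


if __name__ == "__main__":
    # Example 1: symmetric signs, h = 2, sigma = 1, alpha3 = 0.
    signs = corollary_1_3_constant(h=2.0, sigma=1.0, alpha3=0.0)
    print(isclose(signs, 1.0 / sqrt(3.0)))

    # Example 2: centered Poisson with lambda = 4 (illustrative value),
    # h = 1, sigma = sqrt(lambda), alpha3 = lambda^(-1/2).
    lam = 4.0
    poisson = corollary_1_3_constant(h=1.0, sigma=sqrt(lam), alpha3=lam ** -0.5)
    print(isclose(poisson, sqrt(5.0 / lam) / 6.0))
```

Setting `h=0.0` gives the non lattice value $(|\alpha_3|/6)\sqrt{2}$, in agreement with Theorem 1.1 for $r = 2$ since $\|1 - Y^2\|_2 = \sqrt{2}$.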

2 Non lattice distributions or lattice distributions and smoothed sums.

In this section, we prove Theorem 1.1 for non lattice distributions and Theorem 1.2(a) for lattice distributions and smoothed sums. Dividing the random variables by the standard deviation of $X_1$, we may assume that the random variables $X_1, X_2, \ldots$ satisfy $\mathbb{E}(X_1^2) = 1$. Let $\bar F_n$ denote the distribution function of $\bar\mu_n$. The first step is the pointwise convergence result below, which is known as the Cornish-Fisher expansion. As pointed out in Hall (1992, Theorem 2.4 and final comments to Chapter 2), the Cornish-Fisher expansion of first order does not need the Cramér condition. For completeness, we give a proof of this result.

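Lemma 2.1. If $\mu_1$ is not a lattice distribution, then, for any $u$ in $]0, 1[$,

$$\sqrt{n}\, \bigl( F_n^{-1}(u) - \Phi^{-1}(u) \bigr) = \frac{\alpha_3}{6} \bigl( (\Phi^{-1}(u))^2 - 1 \bigr) + o(1), \tag{a}$$

as $n$ tends to $\infty$. If $\mu_1$ is a lattice distribution, then

$$\sqrt{n}\, \bigl( \bar F_n^{-1}(u) - \Phi^{-1}(u) \bigr) = \frac{\alpha_3}{6} \bigl( (\Phi^{-1}(u))^2 - 1 \bigr) + o(1). \tag{b}$$

Furthermore the convergences in (a) and (b) are uniform on $[\delta, 1 - \delta]$, for any positive $\delta$.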

Proof of Lemma 2.1. Let $\varphi = \Phi'$ denote the density of the standard normal. We start from Esseen's (1945) estimates

$$F_n(x) = \Phi(x) + \frac{\alpha_3}{6} (1 - x^2) \varphi(x) n^{-1/2} + o(n^{-1/2}), \tag{2.1}$$

which holds true for non lattice distributions, and

$$\bar F_n(x) = \Phi(x) + \frac{\alpha_3}{6} (1 - x^2) \varphi(x) n^{-1/2} + o(n^{-1/2}), \tag{2.2}$$

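which holds true for lattice distributions. Moreover these estimates hold uniformly in $x$ as $n$ tends to $\infty$. Let

$$\Psi(x) = \Phi(x) + \frac{\alpha_3}{6} (1 - x^2) \varphi(x) n^{-1/2} \quad \text{and} \quad Q(u) = \Phi^{-1}(u) + \frac{\alpha_3}{6} \bigl( (\Phi^{-1}(u))^2 - 1 \bigr) n^{-1/2}.$$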

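We start by proving that, for any positive $A$, uniformly in $x$ over $[-A, A]$,

$$Q(\Psi(x)) = x + O(1/n). \tag{2.3}$$

First

$$\sup_{x \in \mathbb{R}} |1 - x^2|\, \varphi(x) = (2\pi)^{-1/2}.$$

For $n$ large enough, $\Psi(x)$ lies in $[\Phi(-2A), \Phi(2A)]$ for any $x$ in $[-A, A]$. Then, by the Taylor formula at order 2 applied at $\Phi(x)$ with the increment $\frac{\alpha_3}{6} (1 - x^2) \varphi(x) n^{-1/2}$,

$$\Bigl| \Phi^{-1}(\Psi(x)) - x - \frac{\alpha_3}{6} (1 - x^2) n^{-1/2} \Bigr| \leq \frac{\alpha_3^2}{72 \pi n} \sup_{u \in [\Phi(-2A), \Phi(2A)]} |(\Phi^{-1})''(u)|.$$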

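Now

$$(\Phi^{-1})''(u) = \Phi^{-1}(u) \bigl( \varphi(\Phi^{-1}(u)) \bigr)^{-2}.$$

Hence

$$\sup_{u \in [\Phi(-2A), \Phi(2A)]} |(\Phi^{-1})''(u)| = \sup_{x \in [0, 2A]} \bigl( x / \varphi^2(x) \bigr) = 2A / \varphi^2(2A) < \infty,$$

which ensures that, uniformly in $x$ over $[-A, A]$,

$$\Phi^{-1}(\Psi(x)) = x + \frac{\alpha_3}{6} (1 - x^2) n^{-1/2} + O(1/n). \tag{2.4}$$

Now, uniformly in $x$ over $[-A, A]$,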

$$Q(\Psi(x)) = \Phi^{-1}(\Psi(x)) + \frac{\alpha_3}{6} \bigl( (\Phi^{-1}(\Psi(x)))^2 - 1 \bigr) n^{-1/2} = x + \frac{\alpha_3}{6} (1 - x^2) n^{-1/2} + O(1/n) + \frac{\alpha_3}{6} (x^2 - 1) n^{-1/2} + O(1/n)$$

by (2.4) applied twice. The estimate (2.3) follows.
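The cancellation behind (2.3) can be observed numerically. The following Python sketch is only an illustration under stated assumptions: the function names, the value $\alpha_3 = 2$, the interval $[-2, 2]$ and the grid size are all arbitrary choices. It evaluates $n\, |Q(\Psi(x)) - x|$ on a grid; if (2.3) holds, this quantity should stay bounded as $n$ grows.

```python
from statistics import NormalDist

STD = NormalDist()


def psi(x, n, alpha3):
    """Psi(x) = Phi(x) + (alpha3/6) (1 - x^2) phi(x) n^{-1/2}."""
    return STD.cdf(x) + (alpha3 / 6.0) * (1.0 - x * x) * STD.pdf(x) / n ** 0.5


def q(u, n, alpha3):
    """Q(u) = Phi^{-1}(u) + (alpha3/6) ((Phi^{-1}(u))^2 - 1) n^{-1/2}."""
    z = STD.inv_cdf(u)
    return z + (alpha3 / 6.0) * (z * z - 1.0) / n ** 0.5


def max_scaled_error(n, alpha3, big_a=2.0, grid=401):
    """Maximum of n * |Q(Psi(x)) - x| over an even grid of [-big_a, big_a]."""
    xs = [-big_a + 2.0 * big_a * i / (grid - 1) for i in range(grid)]
    return max(n * abs(q(psi(x, n, alpha3), n, alpha3) - x) for x in xs)


if __name__ == "__main__":
    for n in (10**4, 10**5, 10**6):
        print(n, max_scaled_error(n, alpha3=2.0))
```

The scaling by $n$ rather than $n^{1/2}$ is the point of the check: the two first-order terms $\pm \frac{\alpha_3}{6}(1 - x^2) n^{-1/2}$ cancel, leaving a remainder of order $1/n$.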

Starting from (2.1), we now prove Lemma 2.1(a). The proof of Lemma 2.1(b) (omitted) can be done in exactly the same way, starting from (2.2). From (2.1), for $n$ large enough, $F_n(x)$ lies in $[\Phi(-2A), \Phi(2A)]$ for any $x$ in $[-A, A]$. Then, by (2.1) again and the Rolle theorem,

$$Q(F_n(x)) - Q(\Psi(x)) = o(n^{-1/2})$$

uniformly in $x$ over $[-A, A]$. Consequently, by (2.3), there exists some sequence $(\epsilon_n)_n$ of positive reals converging to 0 such that

$$|x - Q(F_n(x))| \leq n^{-1/2} \epsilon_n$$

for any $x$ in $[-A, A]$. It follows that

$$|F_n^{-1}(u) - Q(F_n(F_n^{-1}(u)))| \leq n^{-1/2} \epsilon_n, \tag{2.5}$$

provided that $F_n^{-1}(u)$ lies in $[-A, A]$. By (2.1) again this condition holds true for $n$ large enough if $u$ belongs to $[\Phi(-A/2), \Phi(A/2)]$. The estimate (2.1) also implies that the jumps of $F_n$ are uniformly bounded by $o(n^{-1/2})$. Hence

$$\sup_{u \in ]0, 1[} |F_n(F_n^{-1}(u)) - u| = o(n^{-1/2}),$$

which ensures that

$$\sup_{u \in [\Phi(-A/2), \Phi(A/2)]} |Q(F_n(F_n^{-1}(u))) - Q(u)| = o(n^{-1/2}). \tag{2.6}$$

Finally, by (2.5) and (2.6), Lemma 2.1(a) holds true with $\delta = \Phi(-A/2)$.

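Proof of Theorem 1.1. Recall that, for laws $\mu$ and $\nu$ with respective distribution functions $F$ and $G$, $W_r(\mu, \nu) = \|F^{-1} - G^{-1}\|_r$. Now, by Lemma 2.1, for any positive $\delta$,

$$\lim_{n \to \infty} n^{r/2} \int_\delta^{1-\delta} |F_n^{-1}(u) - \Phi^{-1}(u)|^r \, du = (|\alpha_3|/6)^r \int_\delta^{1-\delta} |(\Phi^{-1}(u))^2 - 1|^r \, du$$

and

$$\lim_{n \to \infty} n^{r/2} \int_\delta^{1-\delta} |\bar F_n^{-1}(u) - \Phi^{-1}(u)|^r \, du = (|\alpha_3|/6)^r \int_\delta^{1-\delta} |(\Phi^{-1}(u))^2 - 1|^r \, du.$$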

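Since $\Phi^{-1}(u)$ has the standard normal distribution under the Lebesgue measure over $[0, 1]$, Theorem 1.1 will follow from the above limits if we prove that, for $N$ large enough,

$$\lim_{\delta \searrow 0} \sup_{n \geq N} n^{r/2} \int_0^1 \mathbf{1}_{\inf(u, 1-u) < \delta}\, |F_n^{-1}(u) - \Phi^{-1}(u)|^r \, du = 0 \tag{2.7a}$$

and

$$\lim_{\delta \searrow 0} \sup_{n \geq N} n^{r/2} \int_0^1 \mathbf{1}_{\inf(u, 1-u) < \delta}\, |\bar F_n^{-1}(u) - \Phi^{-1}(u)|^r \, du = 0. \tag{2.7b}$$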

Now $|F_n^{-1}(u) - \bar F_n^{-1}(u)| \leq n^{-1/2} h$, which ensures that

$$\lim_{\delta \searrow 0} \sup_{n > 0} n^{r/2} \int_0^1 \mathbf{1}_{\inf(u, 1-u) < \delta}\, |F_n^{-1}(u) - \bar F_n^{-1}(u)|^r \, du = 0.$$

Hence (2.7b) follows from (2.7a) via the triangle inequality. The proof of (2.7a) will be done via Theorem 6.1 in Rio (2009), which is based on estimates of Borisov, Panchenko and Skilyagina (1998) for smooth functions of $S_n$. For $F$ and $G$ distribution functions on the real line, let

$$\kappa_{r,2}(F, G) = \int_0^1 \cdots$$

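Theorem 6.1 in Rio (2009) states that, if $n^{r/2} \geq \mathbb{E}(|X_1|^{r+2})$ (recall that $X_1$ has unit variance), then

$$\kappa_{r,2}(F_n, \Phi) \leq C n^{-r/2}\, \mathbb{E}(|X_1|^{r+2})$$

for some positive constant $C$ depending only on $r$. Now

$$\int_0^1 \mathbf{1}_{\inf(u, 1-u) < \delta}\, |F_n^{-1}(u) - \Phi^{-1}(u)|^r \, du \leq \frac{4}{1 + (\Phi^{-1}(\delta))^2}\, \kappa_{r,2}(F_n, \Phi).$$

Hence, for $N = (\mathbb{E}(|X_1|^{r+2}))^{2/r}$,

$$\sup_{n \geq N} n^{r/2} \int_0^1 \mathbf{1}_{\inf(u, 1-u) < \delta}\, |F_n^{-1}(u) - \Phi^{-1}(u)|^r \, du \leq \frac{4 C\, \mathbb{E}(|X_1|^{r+2})}{1 + (\Phi^{-1}(\delta))^2},$$

which implies (2.7a). Hence Theorem 1.1 holds true.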

3 Lattice distributions.

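In this section we prove Theorem 1.2(b). Again we may assume that $\sigma^2 = 1$. Let

$$\Delta_n(t) = n^{1/2} \bigl( F_n^{-1}(t) - \Phi^{-1}(t) \bigr).$$

From (2.7a), it is enough to prove that, for any $\delta$ in $]0, 1/2[$,

$$\lim_{n \to \infty} \int_\delta^{1-\delta} |\Delta_n(t)|^r \, dt = \int_0^1 \int_\delta^{1-\delta} \bigl| (\alpha_3/6)(1 - |\Phi^{-1}(t)|^2) + hu - h/2 \bigr|^r \, dt \, du. \tag{3.1}$$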

In order to prove (3.1) we will use Lemma 2.1(b). For any distribution function $F$ and any real $x$, let $F(x - 0) = \lim_{t \uparrow x} F(t)$. Let $x_0$ be the smallest number in the lattice $\{n^{-1/2}(a + kh) : k \in \mathbb{Z}\}$ such that $F_n(x_0 - 0) \geq \delta$. For any integer $l$, set $x_l = x_0 + l h n^{-1/2}$ and $a_l = F_n(x_l)$. Then, for any $l$ in $\mathbb{Z}$ such that $0 < a_{l-1} < a_l < 1$, we have

$$F_n^{-1}(t) = x_l \quad \text{for any } t \in ]a_{l-1}, a_l[, \tag{3.2}$$

since $a_{l-1} = F_n(x_l - 0)$. Furthermore, it can easily be proven that

$$\bar F_n^{-1}\bigl( (1 - u) a_{l-1} + u a_l \bigr) = x_l + n^{-1/2} h (u - 1/2) \quad \text{for any } u \in [0, 1]. \tag{3.3}$$
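The step structure (3.2) makes $W_2(\mu_n, \gamma_1)$ computable in closed form cell by cell. The Python sketch below is an illustration under stated assumptions, not the method of the proof: it treats only the symmetric signs of Example 1 ($h = 2$, $\sigma = 1$), indexes the atoms directly by the number $k$ of plus signs rather than by the index $l$ above, and uses hypothetical function names. On each cell $]a_{k-1}, a_k[$ the substitution $t = \Phi(z)$ turns $\int (x_k - \Phi^{-1}(t))^2 \, dt$ into an integral of $(x_k - z)^2 \varphi(z)$, which has an explicit antiderivative.

```python
from math import comb, inf, isinf, sqrt
from statistics import NormalDist

STD = NormalDist()


def _phi(z):
    """Standard normal density, with phi(+-inf) = 0."""
    return 0.0 if isinf(z) else STD.pdf(z)


def _z_phi(z):
    """z * phi(z), with the limit value 0 at +-inf."""
    return 0.0 if isinf(z) else z * STD.pdf(z)


def _quantile(p):
    """Phi^{-1}(p), extended by -inf at p = 0 and +inf at p = 1."""
    if p <= 0.0:
        return -inf
    if p >= 1.0:
        return inf
    return STD.inv_cdf(p)


def scaled_w2_symmetric_signs(n):
    """sqrt(n) * W_2(mu_n, gamma_1) for sums of n symmetric signs."""
    denom = 2 ** n
    total = 0.0
    count = 0  # number of sign patterns with at most k plus signs
    a_prev, z_prev = 0.0, -inf
    for k in range(n + 1):
        count += comb(n, k)
        a = count / denom              # a_k = F_n(x_k)
        z = _quantile(a)
        x = (2 * k - n) / sqrt(n)      # atom of the normalized sum
        if a > a_prev:
            mass = a - a_prev
            int_z = _phi(z_prev) - _phi(z)                   # integral of z phi(z)
            int_z2 = mass - (_z_phi(z) - _z_phi(z_prev))     # integral of z^2 phi(z)
            total += x * x * mass - 2.0 * x * int_z + int_z2
        a_prev, z_prev = a, z
    return sqrt(n * total)


if __name__ == "__main__":
    for n in (25, 100, 400):
        print(n, scaled_w2_symmetric_signs(n))
```

If Corollary 1.3 applies as stated, the printed values should approach $1/\sqrt{3} \approx 0.577$ as $n$ grows.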

Throughout the sequel, let $t_l(u) = (1 - u) a_{l-1} + u a_l$. Let $m$ be the largest integer such that $F_n(x_m) \leq 1 - \delta$. It follows from Esseen's estimates that $a_{l-1} < a_l$ for any $l$ in $[1, m]$, for $n$ large enough. Then, for $l$ in $[1, m]$ and $u$ in $[0, 1]$, by Lemma 2.1(b) together with (3.3),

$$\Delta_n(t_l(u)) = \frac{\alpha_3}{6} \bigl( |\Phi^{-1}(t_l(u))|^2 - 1 \bigr) - h (u - 1/2) + o(1)$$

uniformly in $l \in [1, m]$ and $u \in [0, 1]$. It follows that

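$$\int_{a_0}^{a_m} |\Delta_n(t)|^r \, dt = \sum_{l=1}^{m} (a_l - a_{l-1}) \int_0^1 \Bigl| \frac{\alpha_3}{6} \bigl( |\Phi^{-1}(t_l(u))|^2 - 1 \bigr) - h \bigl( u - \tfrac{1}{2} \bigr) \Bigr|^r \cdots$$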

Now $|a_0 - \delta| + |1 - \delta - a_m| = O(n^{-1/2})$ and $|\Delta_n(t)|$ is uniformly bounded on $[\delta, 1 - \delta]$. It follows

which ensures that the upper bound in (3.5) converges to 0 as $n$ tends to $\infty$. It follows that $(I_n - J_n)$ converges to 0 as $n$ tends to $\infty$. Finally

so that, repeating the arguments of the proof of (3.4), we have:

Together, (3.4), (3.6) and the convergence of $(I_n - J_n)$ to 0 then imply (3.1). Theorem 1.2(b) is proved.

Acknowledgment. I would like to thank the referee for carefully reading the paper and for suggestions and comments that improved the style of the paper.

References

[1] Borisov, I.S., Panchenko, D.A. and Skilyagina, G.I., On minimal smoothness conditions for asymptotic expansions of moments in the CLT. Siberian Advances in Mathematics 8 80-95 (1998).

[2] Dall'Aglio, G., Sugli estremi deli momenti delle funzioni di ripartizione doppia. Ann. Scuola Norm. Sup. Pisa (Ser. 3) 10 35-74 (1956).

[3] Esseen, C-G., Fourier analysis of distribution functions. A mathematical study of the Laplace-Gaussian law. Acta Math. 77 1-125 (1945).

[4] Esseen, C-G., On mean central limit theorems. Kungl. Tekn. Högsk. Handl. Stockholm 121 (1958).

[5] Fréchet, M., Recherches théoriques modernes sur le calcul des probabilités, premier livre. Généralités sur les probabilités, éléments aléatoires. Paris, Gauthier-Villars (1950).

[6] Fréchet, M., Sur la distance de deux lois de probabilité. C. R. Acad. Sci. Paris 244 689-692 (1957).

[7] Hall, P., The bootstrap and Edgeworth expansion. Springer Series in Statistics. Springer-Verlag, New York (1992).

[8] Rio, E., Upper bounds for minimal distances in the central limit theorem. Ann. Institut H. Poincaré Probab. Statist. 45 802-817 (2009).