ELECTRONIC COMMUNICATIONS in PROBABILITY
STRONG APPROXIMATION FOR MIXING SEQUENCES
WITH INFINITE VARIANCE
RALUCA BALAN1
Department of Mathematics and Statistics, University of Ottawa, 585 King Edward Avenue, Ottawa, ON, K1N 6N5, Canada.
email: rbalan@uottawa.ca
INGRID-MONA ZAMFIRESCU
Department of Mathematics, Baruch College, City University of New York, One Bernard Baruch Way, New York, NY 10010, USA.
email: Ingrid-Mona Zamfirescu@baruch.cuny.edu
Submitted 20 December 2004, accepted in final form 14 January 2006.
AMS 2000 Subject classification: Primary 60F15, 60F17; Secondary 60E05.
Keywords: mixing sequence; domain of attraction of the normal law; truncation; blocking technique; Skorohod embedding.
Abstract
In this paper we prove a strong approximation result for a mixing sequence with infinite variance and logarithmic decay rate of the mixing coefficient. The result is proved under the assumption that the distribution is symmetric and lies in the domain of attraction of the normal law. Moreover, the function $L(x) = EX^2 1_{\{|X| \le x\}}$ is supposed to be slowly varying with remainder $(\log x)^{-\alpha}(\log\log x)^{-\beta}$ with $\alpha, \beta > 1$.
1 Introduction
The concept of mixing is a natural generalization of independence and can be viewed as "asymptotic independence": the dependence between two random variables in a mixing sequence becomes weaker as the distance between their indices becomes larger. There is an immense amount of literature dedicated to limit theorems for mixing sequences, most of it assuming that the moments of second order or higher are finite (see e.g. the recent survey article [3]). One of the most important results in this area is Shao's strong invariance principle [14], from which one can easily deduce many other limit theorems.
In this paper we prove a strong approximation result for a mixing sequence of identically distributed random variables with infinite variance, whose distribution is symmetric and lies in the domain of attraction of the normal law (DAN). This suggests that it may be possible to obtain a similar result for the self-normalized sequence. Self-normalized limit theorems have become increasingly popular in the past few years, but so far only the case of independent
1 Research supported by NSERC, Canada.
random variables was considered. Therefore, our result may contain the seeds of future research in the promising new area of self-normalized limit theorems for dependent sequences; see e.g. [1], or [12].
Suppose first that $\{X_n\}_{n\ge1}$ is a sequence of i.i.d. random variables with $S_n = \sum_{i=1}^n X_i$ and $EX = 0$, $EX^2 = \infty$ (here $X$ denotes a generic random variable with the same distribution as $X_n$). If $X \in \mathrm{DAN}$ (or equivalently, the function $L(x) = EX^2 1_{\{|X| \le x\}}$ is slowly varying), then the "central limit theorem" continues to hold in the form $S_n/\eta_n \to_d N(0,1)$, where $\{\eta_n\}_n$ is a nondecreasing sequence of positive numbers satisfying
$$\eta_n^2 \sim n L(\eta_n) \qquad (1)$$
(see e.g. [6], IX.8, XVII.5). Moreover, by Theorem 1 of [5], if the distribution of $X$ is symmetric then
$$\limsup_{n\to\infty} \frac{S_n}{(2\eta_n^2 \log\log \eta_n)^{1/2}} = 1 \ \text{or} \ \infty \quad \text{a.s.}$$
depending on whether the integral
$$I_{\log\log} := \int_b^\infty \frac{x^2}{L(x)\log\log x}\, dF(x)$$
converges or diverges (here $b := \inf\{x \ge 1;\ L(x) > 0\}$). Hence $I_{\log\log} < \infty$ is a minimum requirement for the "law of the iterated logarithm" in the case of i.i.d. random variables with infinite variance.
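To make the norming sequence in (1) concrete, consider the symmetric density $f(x) = |x|^{-3}$ for $|x| \ge 1$, a textbook member of DAN with infinite variance; then $L(x) = 2\log x$ is slowly varying, and $\eta_n$ can be found by fixed-point iteration on $\eta^2 = nL(\eta)$. The density and the iteration scheme below are our own illustration, not part of the paper:

```python
import math

def L(x):
    # Truncated second moment for the symmetric density f(x) = |x|^(-3), |x| >= 1:
    # E X^2 1{|X| <= x} = 2 * integral_1^x t^2 * t^(-3) dt = 2 log x  (slowly varying)
    return 2.0 * math.log(x)

def eta(n, iters=200):
    # Solve eta^2 = n * L(eta) by fixed-point iteration eta <- sqrt(n * L(eta)).
    e = math.sqrt(n)  # starting guess
    for _ in range(iters):
        e = math.sqrt(n * L(e))
    return e

n = 10 ** 6
e = eta(n)
ratio = e ** 2 / (n * L(e))  # ~ 1, i.e. relation (1) holds by construction
```

Note that $\eta_n$ exceeds $\sqrt{n}$ by an unbounded slowly varying factor, which is exactly the infinite-variance effect.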
In the 1971 Rietz Lecture, Kesten discussed Feller's result and raised the question of its correctness; see his Remark 9 in [8]. Fortunately, he settled this problem by replacing Feller's normalizing constant $(\eta_n^2 \log\log\eta_n)^{1/2}$ with a slightly different constant $\gamma_n$, which behaves roughly as a root of the equation $\gamma_n^2 = CnL(\gamma_n)\log\log\gamma_n$ (see his Theorem 7). A more general form of the law of the iterated logarithm for the "trimmed" sum $S_n^{(r)}$ (i.e. the sum obtained by deleting from $S_n$ the $r$ largest terms) has recently been obtained in [9].
Following these lines, in Theorem 2.1 of [11] it is proved that one can obtain (on a larger probability space) the strong approximation
$$S_n - T_n = o(a_n) \quad \text{a.s.} \qquad (2)$$
where $T_n = \sum_{i=1}^n Y_i$ and $\{Y_n\}_{n\ge1}$ is a zero-mean Gaussian sequence (with $EY_n^2 = \tau_n$ for suitable constants $\tau_n$). The rate $a_n$ is chosen such that
$$a_n^2 \sim nL(a_n)v(a_n), \qquad (3)$$
where $v$ is a nondecreasing slowly varying function with $\lim_{x\to\infty} v(x) = \infty$ and
$$I := I_{v(\cdot)} = \int_b^\infty \frac{x^2}{L(x)v(x)}\, dF(x) < \infty. \qquad (4)$$
In this paper we prove that a strong approximation of type (2) continues to hold in the mixing case.
We recall that a sequence $\{X_n\}_{n\ge1}$ of random variables is called $\rho$-mixing if
$$\rho(n) := \sup_{k\ge1} \rho(\mathcal{M}_1^k, \mathcal{M}_{k+n}^\infty) \to 0, \quad \text{as } n \to \infty,$$
where $\rho(\mathcal{M}_1^k, \mathcal{M}_{k+n}^\infty) := \sup\{|\mathrm{Corr}(U,V)|;\ U \in L^2(\mathcal{M}_1^k),\ V \in L^2(\mathcal{M}_{k+n}^\infty)\}$ and $\mathcal{M}_a^b$ denotes the $\sigma$-field generated by $X_a, X_{a+1}, \ldots, X_b$.
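To see what $\rho(n)$ measures, it helps to look at a toy example where the coefficient is explicit. For a stationary Gaussian AR(1) process with parameter $\phi$, the $\rho$-mixing coefficient is known to equal $\phi^n$ (by the Gaussian maximal-correlation theorem), a geometric decay far stronger than the logarithmic rate (5) assumed below; note this toy process has finite variance, so it illustrates only the mixing condition, not the infinite-variance setting of the paper. A simulation sketch, with all concrete choices ours:

```python
import math, random

random.seed(2)
phi, N = 0.6, 200000

# Stationary Gaussian AR(1). For jointly Gaussian variables the maximal
# correlation between sigma(X_k) and sigma(X_{k+n}) reduces to the plain
# correlation, which here is phi^n.
x = [random.gauss(0.0, 1.0)]
for _ in range(N - 1):
    x.append(phi * x[-1] + math.sqrt(1.0 - phi ** 2) * random.gauss(0.0, 1.0))

mean = sum(x) / N
var = sum((xi - mean) ** 2 for xi in x) / N

def lag_corr(lag):
    # empirical Corr(X_k, X_{k+lag})
    m = N - lag
    cov = sum((x[i] - mean) * (x[i + lag] - mean) for i in range(m)) / m
    return cov / var

pairs = [(lag_corr(n), phi ** n) for n in (1, 2, 4)]  # (empirical, theoretical)
```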
Here is our result.
Theorem 1 Let $\{X_n\}_{n\ge1}$ be a $\rho$-mixing sequence of symmetric identically distributed random variables with $EX = 0$, $EX^2 = \infty$ and $X \in \mathrm{DAN}$, where $X$ denotes a random variable with the same distribution as $X_n$. Assume that
$$\rho(n) \le C(\log n)^{-r} \quad \text{for some } r > 1. \qquad (5)$$
Let $v$ be a nondecreasing slowly varying function such that $v(x) \ge C\log\log x$ for $x$ large; let $\tau = \min(3, r+1)$. Suppose that the function $L(x) = EX^2 1_{\{|X|\le x\}}$ satisfies (4) and is slowly varying with remainder $(\log x)^{-\alpha}v^{-\beta}(x)$ for some $\alpha > \tau/(\tau-2)$, $\beta > \tau/2$, i.e. for any $\lambda \in (0,1)$ there exists $C > 0$ such that
$$(\mathrm{SR}) \qquad 1 - \frac{L(\lambda x)}{L(x)} \le C(\log x)^{-\alpha}v^{-\beta}(x) \quad \text{for } x \text{ large}.$$
Then, without changing its distribution, we can redefine $\{X_n\}_{n\ge1}$ on a larger probability space together with a standard Brownian motion $W = \{W(t)\}_{t\ge0}$ such that for some constants $s_n^2$
$$S_n - W(s_n^2) = o(a_n) \quad \text{a.s.} \qquad (6)$$
where $\{a_n\}_n$ is a nondecreasing sequence of positive numbers satisfying (3).
Condition (SR) specifies the rate of convergence of $L(\lambda x)/L(x)$ to 1 for the slowly varying function $L$ (see p. 185 of [2]). It is used in only one place, namely to ensure the convergence of the sum (28) in the proof of Lemma 11. Unfortunately, we could not avoid it.
We should mention here that a "functional central limit theorem" for $\rho$-mixing sequences with infinite variance was obtained by [15] under the condition $\sum_n \rho(2^n) < \infty$. In order to obtain the strong approximation (6) we needed to impose the logarithmic decay rate of $\rho(n)$.
The remaining part of the paper is dedicated to the proof of Theorem 1: the description of the general method is given in Section 2, while the technical details are discussed in Sections 3 and 4. Among other ingredients, the proof uses the blocking technique introduced in [13], according to which the original random variables are replaced by their sums over progressively larger blocks of integers (separated by smaller blocks, whose length is also progressively larger). Throughout this work, $C$ denotes a generic constant that does not depend on $n$ but may be different from place to place. We denote by $I(a,b]$ the measure attributed by the integral $I$ to the interval $(a,b]$. We let $A(x) = L(x)v(x)$.
2 Sketch of Proof
As in [4] we may take
$$\eta_n = \inf\Big\{s \ge b+1;\ \frac{L(s)}{s^2} \le \frac{1}{n}\Big\}, \qquad a_n = \inf\Big\{s \ge b+1;\ \frac{A(s)}{s^2} \le \frac{1}{n}\Big\}.$$
Clearly (1) and (3) hold. We have $a_n \ge \eta_n$ and
$$a_n^2 \ge C\eta_n^2 v(\eta_n) \ge C\eta_n^2 \log\log\eta_n. \qquad (7)$$
Without loss of generality we will assume that $\eta_n^2 = nL(\eta_n)$ and $a_n^2 = nA(a_n)$.
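The first-crossing definitions of $\eta_n$ and $a_n$ can be traced numerically. The sketch below uses the concrete illustration $L(x) = 2\log x$ (arising from the symmetric density $|x|^{-3}$ on $|x| \ge 1$) together with the ad hoc choice $v(x) = \max(1, \log\log x)$, and checks $a_n \ge \eta_n$ and inequality (7) on a geometric grid; all concrete choices are ours:

```python
import math

def L(x):
    return 2.0 * math.log(x)  # truncated second moment for density |x|^(-3)

def v(x):
    return max(1.0, math.log(math.log(x)))  # satisfies v(x) >= C log log x

def A(x):
    return L(x) * v(x)

def first_crossing(g, n, s0=3.0, step=1.001):
    # Smallest s on a geometric grid with g(s)/s^2 <= 1/n -- a discretized
    # version of the infima defining eta_n and a_n.
    s = s0
    while g(s) / s ** 2 > 1.0 / n:
        s *= step
    return s

n = 10 ** 6
eta_n = first_crossing(L, n)
a_n = first_crossing(A, n)
# a_n >= eta_n, and a_n^2 exceeds eta_n^2 * v(eta_n) up to a constant, as in (7)
```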
The proof is based on a double truncation technique at levels $b_n := v^{-p}(a_n)a_n$ and $a_n$ (which is due to [5]), and a repeated application of the method of [14] on each of the "truncation" intervals $[0, b_n]$, $(b_n, a_n]$. We assume that $p > 1/2$. Let
$$\hat{X}_n = X_n 1_{\{|X_n|\le b_n\}}, \qquad X'_n = X_n 1_{\{b_n < |X_n| \le a_n\}}, \qquad \bar{X}_n = X_n 1_{\{|X_n| > a_n\}}.$$
By the symmetry assumption $E\hat{X}_n = EX'_n = 0$; since $EX_n = 0$, it follows that $E\bar{X}_n = 0$. We have $X_n = \hat{X}_n + X'_n + \bar{X}_n$ and hence
$$S_n = \hat{S}_n + S'_n + \bar{S}_n \qquad (8)$$
where $\hat{S}_n$, $S'_n$, $\bar{S}_n$ denote the partial sums of $\hat{X}_i$, $X'_i$, respectively $\bar{X}_i$.
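Since the three indicator events partition the real line, the decomposition (8) is an exact pathwise identity, not merely distributional. A small simulation sketch; the heavy-tailed sampler $|X| = U^{-1/2}$ (which has density $|x|^{-3}$, hence $EX = 0$, $EX^2 = \infty$, $X \in$ DAN) and the concrete truncation levels are ours, purely for illustration:

```python
import math, random

random.seed(0)
n = 1000

# Symmetric heavy-tailed sample: |X| = U^(-1/2) gives P(|X| > x) = x^(-2), x >= 1.
X = [random.choice([-1.0, 1.0]) * random.random() ** -0.5 for _ in range(n)]

# Illustrative truncation levels (ad hoc stand-ins for a_j and b_j = v^(-p)(a_j) a_j,
# with p = 1 and v floored at 1 so that b_j <= a_j always holds):
def a_level(j):
    return math.sqrt(2.0 * j * math.log(j + 2.0))

def b_level(j):
    aj = a_level(j)
    return aj / max(1.0, math.log(math.log(aj + 9.0)))

S_hat = S_prime = S_bar = 0.0
for j, xj in enumerate(X, start=1):
    aj, bj = a_level(j), b_level(j)
    if abs(xj) <= bj:
        S_hat += xj      # central part  X_hat_j
    elif abs(xj) <= aj:
        S_prime += xj    # middle part   X'_j
    else:
        S_bar += xj      # extreme part  X_bar_j

err = abs(sum(X) - (S_hat + S_prime + S_bar))  # decomposition (8) is exact
```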
By Lemmas 3.2 and 3.3 of [5] (under the symmetry assumption), (4) is equivalent to $\sum_{n\ge1} P(|X| > \epsilon a_n) < \infty$ for all $\epsilon > 0$. Hence
$$\bar{S}_n = o(a_n) \quad \text{a.s.} \qquad (9)$$
In Section 3, we show that the central part $\hat{S}_n$ gives us the approximation
$$\hat{S}_n - W(s_n^2) = o((\eta_n^2\log\log\eta_n)^{1/2}) \quad \text{a.s.} \qquad (10)$$
for some constants $s_n^2$. In Section 4 we show that between the two truncations we have
$$S'_n = o(a_n) \quad \text{a.s.} \qquad (11)$$
The conclusion (6) follows immediately by (7)-(11).
3 The Central Part
The goal of this section is to prove relation (10) on a possibly larger probability space on which the sequence $\{\hat{X}_n\}_n$ is redefined (without changing its distribution). In order to do this, we introduce the blocks $H_1, I_1, H_2, I_2, \ldots$ of consecutive integers and we decompose the sum $\hat{S}_n$ into three terms containing the sums over the "small" blocks $I_i$, the sums over the "big" blocks $H_i$, and the remaining $\hat{X}_j$'s (whose sum is shown to be negligible). The idea is to construct the blocks $I_i$ small enough to make the term depending on these blocks negligible, but large enough to give sufficient space between the blocks $H_i$. The sums $u_i$ over the blocks $H_i$ will provide us with the desired approximation (10), by applying an almost sure invariance principle (due to [14]) to the martingale differences $\xi_i = u_i - E(u_i | u_1, \ldots, u_{i-1})$, after proving that the sum of the terms $E(u_i | u_1, \ldots, u_{i-1})$ is negligible as well.
We define the blocks $H_1, I_1, H_2, I_2, \ldots$ of consecutive integers, with $\mathrm{card}(H_i)$ of order $i^{a-1}\exp(i^a)$, where $a = 1/\alpha$. Note that $(1-a)\tau > 2$. Let $N_m := \sum_{i=1}^m \mathrm{card}(H_i \cup I_i) \sim \exp(m^a)$ and $m_n := \max\{m;\ N_m \le n\}$, so that $m_n \sim (\log n)^{1/a}$. Writing $v_i$ for the sum of the $\hat{X}_j$'s over $I_i$, $u_i$ for the sum over $H_i$, $\mathcal{G}_i := \sigma(u_1, \ldots, u_i)$ and $\xi_i := u_i - E(u_i | \mathcal{G}_{i-1})$, we obtain the decomposition
$$\hat{S}_n = \sum_{i=1}^{m_n} v_i + (\hat{S}_n - \hat{S}_{N_{m_n}}) + \sum_{i=1}^{m_n} E(u_i | \mathcal{G}_{i-1}) + \sum_{i=1}^{m_n} \xi_i. \qquad (12)$$
The first three terms will be of order $o(\eta_n)$. The last term will give us the desired approximation with rate $o((\eta_n^2\log\log\eta_n)^{1/2})$.
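The growth rates behind the blocking scheme can be sanity-checked numerically. In the sketch below we take $\mathrm{card}(H_i) = \lceil i^{a-1}e^{i^a}\rceil$ and, as a stand-in for the unspecified small blocks, $\mathrm{card}(I_i) = \lceil e^{i^a/2}\rceil$; both concrete sizes are our assumption, for illustration only. With $a \in (0,1)$, $N_m$ then tracks $\exp(m^a)$ up to a bounded factor, and $m_n = \max\{m;\ N_m \le n\}$ grows like $(\log n)^{1/a}$:

```python
import math

a = 0.5  # plays the role of a = 1/alpha, in (0, 1)

def card_H(i):
    # "big" block size, of order i^(a-1) * exp(i^a)
    return math.ceil(i ** (a - 1) * math.exp(i ** a))

def card_I(i):
    # hypothetical "small" separating block, negligible next to H_i
    return math.ceil(math.exp(i ** a / 2))

M = 60
N = [0]  # N[m] = N_m = total length of the first m pairs of blocks
for i in range(1, M + 1):
    N.append(N[-1] + card_H(i) + card_I(i))

# N_m / exp(m^a) should stay within a bounded band
ratios = [N[m] / math.exp(m ** a) for m in range(10, M + 1)]

def m_of(n):
    # m_n = max{m : N_m <= n}
    m = 0
    while m + 1 <= M and N[m + 1] <= n:
        m += 1
    return m
```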
We begin with two elementary lemmas.
Lemma 2 There exists $C > 0$ such that $b_n \le C\eta_n$ for $n$ large, and hence
$$nL(b_n) \le C\eta_n^2 \quad \text{for } n \text{ large}. \qquad (13)$$
Proof. The relation $b_n \le C\eta_n$ for $n$ large can be written as $a_n/\eta_n \le Cv^p(a_n)$ for $n$ large; using the definitions of $a_n$ and $\eta_n$, this in turn is equivalent to:
$$\frac{L(a_n)}{L(\eta_n)} \le Cv^{2p-1}(a_n) \quad \text{for } n \text{ large}. \qquad (14)$$
Since $L$ is slowly varying, it follows by Potter's Theorem (Theorem 1.5.6.(i) of [2]) that for any $C > 1$, $\delta > 0$ we have, for $n$ large,
$$\frac{L(a_n)}{L(\eta_n)} \le C\Big(\frac{a_n}{\eta_n}\Big)^\delta = C\Big(v(a_n)\frac{L(a_n)}{L(\eta_n)}\Big)^{\delta/2}, \quad \text{hence} \quad \frac{L(a_n)}{L(\eta_n)} \le Cv^{\delta/(2-\delta)}(a_n).$$
This is exactly relation (14) with $\delta = 2 - 1/p$. Relationship (13) follows using the fact that $L$ is nondecreasing and slowly varying, and the definition of $\eta_n$: $nL(b_n) \le nL(C\eta_n) \le CnL(\eta_n) = C\eta_n^2$. □
Lemma 3 For any integer $\lambda > 0$ there exists $C = C_\lambda > 0$ such that $a_{\lambda n} \le Ca_n$ and $b_{\lambda n} \le Cb_n$ for $n$ large, and hence $L(b_{\lambda n}) \le CL(b_n)$ for $n$ large.
Proof. By the definition of $a_n$ and Potter's theorem, $(a_{\lambda n}/a_n)^2 = \lambda A(a_{\lambda n})/A(a_n) \le C\lambda(a_{\lambda n}/a_n)^\delta$, and hence $a_{\lambda n}/a_n \le C\lambda^{1/(2-\delta)}$ for $n$ large. By the definition of $b_n$ and Potter's theorem, we obtain $b_{\lambda n} \le Cb_n$ in the same manner. The last statement in the lemma follows since $L$ is slowly varying. □
We are now ready to treat the first three terms in the decomposition (12).
Lemma 4 We have $\sum_{i=1}^m v_i = o(m^2\exp(\frac{1}{3}m^a)L^{1/2}(b_{N_m}))$ a.s., and hence $\sum_{i=1}^{m_n} v_i = o(\eta_n)$ a.s.
Proof. The first statement in the lemma follows by Chebyshev's inequality and the Borel-Cantelli lemma. The second statement follows using $m_n \sim (\log n)^{1/a}$ and relation (13). □
Proof. The second statement follows by (13). For the first part, it is enough to prove that, for any $\varepsilon > 0$, a suitable series converges. For this we apply Lemma 2.4 of [14] with $q = \tau$, $B = k^{-a(\tau+2)/(\tau-2)}c_k^{1/2}$, $x = \varepsilon c_k^{1/2}$, as in (2.20) of [14]; the required moment bound follows by Potter's Theorem (Theorem 1.5.6.(i) of [2]), applied to the slowly varying function $v(b_n)$. From here we conclude that the series converges for all $\varepsilon > 0$, and the first statement follows for $T_{m_k}$ by the Borel-Cantelli lemma. □
Our last theorem gives us the desired approximation for the last term in (12). To prove this theorem we need two lemmas. Let $\sigma_i^{*2} = E\xi_i^2$, $s_m^{*2} = \sum_{i=1}^m \sigma_i^{*2}$.
Lemma 7 We have $\sum_{i\ge1} d_i^{-\tau/2}E|\xi_i|^\tau < \infty$.
Proof. It is enough to prove the lemma with $c_i$ instead of $d_i$, and $u_i$ instead of $\xi_i$. By Lemma 2.3 of [14] we have
$$E|u_i|^\tau \le C\Big\{(\mathrm{card}(H_i))^{\tau/2}\cdot\max_{j\in H_i}(E\hat{X}_j^2)^{\tau/2} + \mathrm{card}(H_i)\cdot\max_{j\in H_i}E|\hat{X}_j|^\tau\Big\}$$
$$\le C\Big\{(i^{a-1}\exp(i^a))^{\tau/2}L^{\tau/2}(b_{[\exp(i^a)]}) + i^{a-1}\exp(i^a)E|X|^\tau 1_{\{|X|\le 2b_{[\exp(i^a)]}\}}\Big\}$$
$$= Cc_i^{\tau/2}\Big\{i^{-(1-a)\tau/2} + i^{a-1}e^{-(\tau-2)i^a/2}L^{-\tau/2}(b_{[\exp(i^a)]})E|X|^\tau 1_{\{|X|\le 2b_{[\exp(i^a)]}\}}\Big\}. \qquad (21)$$
The lemma follows by (17). □
Lemma 8 We have $\sum_{i=1}^m (E(\xi_i^2|\mathcal{G}_{i-1}) - E\xi_i^2) = o(d_m)$ a.s.
Proof. It is enough to prove the lemma with $c_m$ instead of $d_m$. Let $u_i^* = u_i^2 1_{\{|u_i|\le c_i^{1/2}\}}$ and $u_i^{**} = u_i^2 1_{\{|u_i| > c_i^{1/2}\}}$. The conclusion will follow from:
$$\sum_{i=1}^m (E(u_i^{**}|\mathcal{G}_{i-1}) + Eu_i^{**}) = o(c_m) \quad \text{a.s.} \qquad (22)$$
$$U_m := \sum_{i=1}^m (E(u_i^*|\mathcal{G}_{i-1}) - Eu_i^*) = o(m^{-(r-1/2)a}(\log m)^3 c_m) \quad \text{a.s.} \qquad (23)$$
$$\sum_{i=1}^m (E^2(u_i|\mathcal{G}_{i-1}) + EE^2(u_i|\mathcal{G}_{i-1})) = o(m^{-(2r-1)a}(\log m)^2 c_m) \quad \text{a.s.} \qquad (24)$$
To prove (22), note that $E|u_i|^\tau \ge E|u_i|^\tau 1_{\{|u_i|>c_i^{1/2}\}} \ge c_i^{(\tau-2)/2}Eu_i^{**}$. Relationship (22) follows by Kronecker's lemma, (21) and (17).
To prove (23), let $\beta_m = m^{-(r-1/2)a}(\log m)^3 c_m$. For any $i \le m$
$$Eu_i^{*2} = Eu_i^4 1_{\{|u_i|\le c_i^{1/2}\}} \le c_i Eu_i^2 \le Ci^{a-1}\exp(i^a)L(b_{[\exp(m^a)]})c_m,$$
where we used (20) in the last inequality. By (2.34) of [14] and (5), we get: $E(\max_{l\le m}U_l^2) \le C(\log m)^4 m^{-2ar}\exp(m^a)L(b_{[\exp(m^a)]})c_m$. Let $m_k = [k^{1/a}]$. Using the same argument based on a subsequence convergence criterion as in the proof of Lemma 6, we get $U_m = o(\beta_m)$ a.s.
It remains to prove (24). By the mixing property, (20) and (5), we have $EE^2(u_i|\mathcal{G}_{i-1}) \le Ci^{-(2r-1)a-1}\exp(i^a)L(b_{[\exp i^a]}) = Ci^{-(2r-1)a-1}c_i$. Relation (24) follows by Kronecker's lemma. □
Here is the main result of this section.
Theorem 9 Without changing its distribution, we can redefine the sequence $\{\xi_i\}_{i\ge1}$ on a larger probability space together with a standard Brownian motion $W = \{W(t)\}_{t\ge0}$ such that
$$\sum_{i=1}^{m_n}\xi_i - W(s_{m_n}^{*2}) = o((\eta_n^2\log\log\eta_n)^{1/2}) \quad \text{a.s.}$$
Proof. By Theorem 2.1 of [14], Lemma 7 and Lemma 8, we can redefine the sequence $\{\xi_i\}_{i\ge1}$ on a larger probability space together with a standard Brownian motion $W = \{W(t)\}_{t\ge0}$ such that the stated approximation holds. Using the mixing property, (20) and (13), we obtain the required bound on $s_m^{*2}$. □
4 Between the Two Truncations
This section is dedicated to the proof of relation (11): $S'_n/a_n \to 0$ a.s. For this we consider the same blocks $H_i, I_i$ as in Section 3 and we decompose the sum $S'_n$ into three components, depending on the sums over the blocks $I_i$, the sums over the blocks $H_i$, and the remaining terms $X'_j$. The sums $u'_i$ over the blocks $H_i$ are once again approximated by the corresponding martingale differences $\xi'_i$, and relation (11) follows by a martingale subsequence convergence criterion.
We will prove that all four terms in the above decomposition are of order $o(a_n)$.
We begin by treating the first three terms. Note that $EX_j'^2 = L(a_j) - L(b_j) \le L(a_j)$.
Proof. Same argument as in Lemma 4. □
Lemma 11 We have $\max_{N_m < n \le N_{m+1}} |S'_n - S'_{N_m}| = o(a_{N_m})$ a.s.
Proof. Using the same argument as in Lemma 5, it suffices to show that a certain series (27), indexed over $k \ge 1$, converges. Let $n_j = [\exp(j^a)]$ and $\alpha_j = a_{n_j}$. Note that the sum in (27) is smaller than
$$C\sum_{j\ge1}E|X|^\tau 1_{\{2\alpha_{j-1}<|X|\le2\alpha_j\}}L^{-\tau/2}(\alpha_j)e^{-(\tau-2)j^a/2} \le C\sum_{j\ge1}(L(2\alpha_j)-L(2\alpha_{j-1}))\cdot\alpha_j^{\tau-2}L^{-\tau/2}(\alpha_j)e^{-(\tau-2)j^a/2},$$
where we used the inequality $E|X|^\tau 1_{\{a<|X|\le b\}} \le (L(b)-L(a))b^{\tau-2}$. Note that
$$\alpha_j^{\tau-2}L^{-\tau/2}(\alpha_j)e^{-(\tau-2)j^a/2} = L^{-1}(\alpha_j)\left(\frac{\alpha_j^2}{\exp(j^a)L(\alpha_j)}\right)^{(\tau-2)/2} \le CL^{-1}(2\alpha_j)\cdot v^{(\tau-2)/2}(\alpha_j).$$
Since $\alpha_j \sim \alpha_{j-1}$, we have $2\alpha_{j-1} \ge \alpha_j$ for $j$ large. We conclude that the sum in (27) is smaller than
$$C\sum_{j\ge1}\left[1 - \frac{L(\alpha_j)}{L(2\alpha_j)}\right]v^{(\tau-2)/2}(\alpha_j). \qquad (28)$$
Using (SR) and the fact that $\alpha_j \ge Cn_j^{1/2}$ and $v(x) \ge C\log\log x$, we get
$$\sum_{j\ge1}\left(1 - \frac{L(\alpha_j)}{L(2\alpha_j)}\right)v^{(\tau-2)/2}(\alpha_j) \le C\sum_{j\ge1}(\log\alpha_j)^{-1/a}v^{-d}(\alpha_j) \le C\sum_{j\ge1}(\log n_j)^{-1/a}(\log\log n_j)^{-d} \le C\sum_{j\ge1}j^{-1}(\log j)^{-d} < \infty,$$
where $d := \beta - (\tau-2)/2 > 1$ (and we recall that $a = 1/\alpha$). This concludes the proof of (27). □
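For a concrete slowly varying function, condition (SR) can be checked in closed form. Take $L(x) = 2\log x$, which arises from the symmetric density $|x|^{-3}$ on $|x| \ge 1$ (our illustration, not the paper's generality): then $1 - L(\lambda x)/L(x) = \log(1/\lambda)/\log x$ exactly, so this $L$ is slowly varying with remainder $(\log x)^{-1}$, and sums like (28) converge once the $v$-factor contributes an extra $(\log\log x)^{-d}$ with $d > 1$. A quick numerical check:

```python
import math

def L(x):
    # truncated second moment for the symmetric density |x|^(-3), |x| >= 1
    return 2.0 * math.log(x)

lam = 0.5
xs = [10.0 ** k for k in range(2, 9)]
# remainder 1 - L(lam * x)/L(x) versus the predicted rate log(1/lam)/log(x)
rem = [1.0 - L(lam * x) / L(x) for x in xs]
pred = [math.log(1.0 / lam) / math.log(x) for x in xs]
max_rel_err = max(abs(r - p) / p for r, p in zip(rem, pred))
```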
Lemma 12 We have $\sum_{i=1}^m E(u'_i|\mathcal{G}'_{i-1}) = o(m^{-(r-1/2)a}\cdot(\log m)^3\cdot\exp(\frac{1}{2}m^a)\cdot L^{1/2}(a_{N_m}))$ a.s., and hence $\sum_{i=1}^{m_n}E(u'_i|\mathcal{G}'_{i-1}) = o(a_n)$ a.s.
Proof. Same argument as in Lemma 6. □
For our last result, we will need the following martingale subsequence convergence criterion (which is probably well known).
Lemma 13 Let $\{S_n, \mathcal{F}_n\}_{n\ge1}$ be a zero-mean martingale and $\{a_n\}_{n\ge1}$ a nondecreasing sequence of positive numbers with $\lim_n a_n = \infty$. If there exists a subsequence $\{n_k\}_k$ such that $a_{n_k}/a_{n_{k-1}} \le C$ for all $k$, and
$$\sum_{k\ge1}\frac{E|S_{n_k} - S_{n_{k-1}}|^p}{a_{n_k}^p} < \infty \quad \text{for some } p\in[1,2], \qquad (29)$$
then $S_n/a_n \to 0$ a.s.
Proof. Note that $\{S_{n_k}, \mathcal{F}_{n_k}\}_{k\ge1}$ is a martingale. From (29) it follows that $S_{n_k}/a_{n_k} \to 0$ a.s. (see Theorem 2.18 of [7]). By the extended Kolmogorov inequality (see p. 65 of [10]), the maximum of $|S_n - S_{n_{k-1}}|$ over $n_{k-1} < n \le n_k$, normalized by $a_{n_k}$, tends to $0$ a.s. as well; since $a_{n_k} \le Ca_{n_{k-1}} \le Ca_n$ for $n_{k-1} < n \le n_k$, the conclusion follows. □
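Lemma 13 can be sanity-checked on the simplest case: a Rademacher random walk with $a_n = n$, the subsequence $n_k = 2^k$, and $p = 2$. Then $a_{n_k}/a_{n_{k-1}} = 2$, $E|S_{n_k} - S_{n_{k-1}}|^2 = n_k - n_{k-1}$, so the series (29) is $\sum_k (n_k - n_{k-1})/n_k^2 = \sum_k 2^{-k-1} < \infty$, and indeed $S_n/n \to 0$ along the whole sequence. A simulation sketch, with all concrete choices ours:

```python
import random

random.seed(1)

K = 14
nk = [2 ** k for k in range(K + 1)]  # subsequence with a_{n_k}/a_{n_{k-1}} = 2
# condition (29) with p = 2 and a_n = n: sum_k (n_k - n_{k-1}) / n_k^2
series = sum((nk[k] - nk[k - 1]) / nk[k] ** 2 for k in range(1, K + 1))

N = nk[-1]
S = 0.0
vals = []
for n in range(1, N + 1):
    S += random.choice([-1.0, 1.0])  # Rademacher martingale differences
    vals.append(abs(S) / n)          # |S_n| / a_n with a_n = n

tail_max = max(vals[N // 2:])        # |S_n|/n over the last half of the range
```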
Finally, we treat the last term in the decomposition (26).
Theorem 14 We have $\sum_{i=1}^{m_n}\xi'_i = o(a_n)$ a.s.
Proof. By Lemma 13, it is enough to prove that for a suitable subsequence $\{n_k\}_k$ we have
$$\sum_{k\ge1}\frac{EZ_k^2}{a_{n_k}^2} < \infty, \qquad (30)$$
where $Z_k$ denotes the sum of the $\xi'_i$'s over $m_{n_{k-1}} < i \le m_{n_k}$. We proceed now with the proof of (30). Let the function $\phi$ and the subsequence $\{n_k\}_k$ be chosen as in (31). Using (34) and (33) we get: for every $m_{n_{k-1}} < i \le m_{n_k}$,
$$E\xi_i'^2 \le C(\log n_k)^{(a-1)/a}n_k\cdot A(a_{n_k})I(b_{n_k}, a_{n_k}] = C(\log n_k)^{(a-1)/a}a_{n_k}^2 I(b_{n_k}, a_{n_k}]. \qquad (35)$$
From (32) and (35), and recalling that $m_n \sim (\log n)^{1/a}$, we get
$$\frac{EZ_k^2}{a_{n_k}^2} \le C[(\log n_k)^{1/a} - (\log n_{k-1})^{1/a}]\cdot(\log n_k)^{(a-1)/a}I(b_{n_k}, a_{n_k}]$$
$$\le C(\log n_{k-1})^{(1-a)/a}\frac{1}{n_{k-1}}(n_k - n_{k-1})\cdot(\log n_k)^{(a-1)/a}I(b_{n_k}, a_{n_k}]$$
$$= C\frac{n_k - n_{k-1}}{n_{k-1}}I(b_{n_k}, a_{n_k}] \le C\frac{1}{\phi(k)+1}I(b_{n_k}, a_{n_k}] \le CI(a_{n_{k-1}}, a_{n_k}],$$
where we used the inequality $f(y) - f(x) \le f'(x)(y-x)$ for the concave function $f(x) = (\log x)^{1/a}$ for the second inequality, and the choice (31) of the function $\phi$ for the last inequality. Relationship (30) follows by (4). □
Acknowledgement. The first author would like to thank Professor Miklós Csörgő for a series of wonderful lectures given at Carleton University throughout the academic year 2003-2004, which stimulated her interest in the area of self-normalized limit theorems. The authors are grateful to the Editor (Professor Martin Barlow) and the two anonymous referees for their helpful suggestions.
References
[1] R.M. Balan and R. Kulik. Self-normalized weak invariance principle for mixing sequences. Tech. Rep. Series, Lab. Research Stat. Probab., Carleton University-University of Ottawa 417 (2005).
[2] N.H. Bingham, C.M. Goldie and J.L. Teugels. Regular Variation (1987). Cambridge University Press, Cambridge.
[3] R.C. Bradley. Basic properties of strong mixing conditions. A survey and some open questions. Probability Surveys 2 (2005), 107-144.
[4] M. Csörgő, B. Szyszkowicz and Q. Wang. Donsker's theorem for self-normalized partial sum processes. Ann. Probab. 31 (2003), 1228-1240.
[5] W. Feller. An extension of the law of the iterated logarithm to variables without variance. J. Math. Mech. 18 (1968), 343-355.
[6] W. Feller. An Introduction to Probability Theory and Its Applications, Volume II, Second Edition (1971). John Wiley, New York.
[7] P. Hall and C.C. Heyde. Martingale Limit Theory and Its Applications (1980). Academic Press, New York.
[8] H. Kesten. Sums of independent random variables - without moment conditions. Ann. Math. Stat. 43 (1972), 701-732.
[10] M. Loève. Probability Theory, Volume II, Fourth Edition (1978). Springer, New York.
[11] J. Mijnheer. A strong approximation of partial sums of i.i.d. random variables with infinite variance. Z. Wahrsch. verw. Gebiete 52 (1980), 1-7.
[12] S.Y. Novak. On self-normalized sums and Student's statistic. Theory Probab. Appl. 49 (2005), 336-344.
[13] W. Philipp and W.F. Stout. Almost sure invariance principles for partial sums of weakly dependent random variables. Mem. AMS 2 (1975), no. 161.
[14] Q.-M. Shao. Almost sure invariance principles for mixing sequences of random variables. Stoch. Proc. Appl. 48 (1993), 319-334.