Electronic Journal of Probability
Vol. 14 (2009), Paper no. 35, pages 978–1011. Journal URL
http://www.math.washington.edu/~ejpecp/
Rates of convergence for minimal distances in the central limit theorem under projective criteria

Jérôme Dedecker∗, Florence Merlevède†, Emmanuel Rio‡

Abstract. In this paper, we give estimates of ideal or minimal distances between the distribution of the normalized partial sum and the limiting Gaussian distribution for stationary martingale difference sequences or stationary sequences satisfying projective criteria. Applications to functions of linear processes and to functions of expanding maps of the interval are given.

Key words: minimal and ideal distances, rates of convergence, martingale difference sequences, stationary sequences, projective criteria, weak dependence, uniform mixing.

AMS 2000 Subject Classification: Primary 60F05.

Submitted to EJP on November 5, 2007, final version accepted June 6, 2008.

∗ Université Paris 6, LSTA, 175 rue du Chevaleret, 75013 Paris, France. E-mail: jerome.dedecker@upmc.fr
† Université Paris Est, Laboratoire de mathématiques, UMR 8050 CNRS, Bâtiment Copernic, 5 Boulevard Descartes, 77435 Champs-sur-Marne, France. E-mail: florence.merlevede@univ-mlv.fr
‡ Université de Versailles, Laboratoire de mathématiques, UMR 8100 CNRS, Bâtiment Fermat, 45 Avenue des

1 Introduction and Notations
Let $X_1, X_2, \ldots$ be a strictly stationary sequence of real-valued random variables (r.v.) with mean zero and finite variance. Set $S_n = X_1 + X_2 + \cdots + X_n$. We denote by $P_{n^{-1/2}S_n}$ the law of $n^{-1/2}S_n$ and by $G_{\sigma^2}$ the normal distribution $N(0,\sigma^2)$. In this paper, we shall give quantitative estimates of the approximation of $P_{n^{-1/2}S_n}$ by $G_{\sigma^2}$ in terms of minimal or ideal metrics.
Let $\mathcal{L}(\mu,\nu)$ be the set of probability laws on $\mathbb{R}^2$ with marginals $\mu$ and $\nu$. Let us consider the following minimal distances (sometimes called Wasserstein distances of order $r$):
$$
W_r(\mu,\nu) = \begin{cases}
\inf\Big\{\displaystyle\int |x-y|^r\,P(dx,dy) : P\in\mathcal{L}(\mu,\nu)\Big\} & \text{if } 0 < r < 1,\\[4pt]
\inf\Big\{\Big(\displaystyle\int |x-y|^r\,P(dx,dy)\Big)^{1/r} : P\in\mathcal{L}(\mu,\nu)\Big\} & \text{if } r \ge 1.
\end{cases}
$$
It is well known that for two probability measures $\mu$ and $\nu$ on $\mathbb{R}$ with respective distribution functions (d.f.) $F$ and $G$,
$$
W_r(\mu,\nu) = \Big(\int_0^1 |F^{-1}(u) - G^{-1}(u)|^r\,du\Big)^{1/r} \quad \text{for any } r \ge 1. \tag{1.1}
$$
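On the real line, the quantile representation (1.1) makes $W_r$ straightforward to evaluate for empirical measures: the optimal coupling pairs the order statistics of the two samples. The following minimal sketch is ours, not from the paper (the helper name `wasserstein_r` is an assumption):

```python
import numpy as np

def wasserstein_r(x, y, r):
    """W_r between the empirical laws of two samples of equal size,
    via the quantile representation: on the real line the optimal
    coupling pairs sorted values."""
    qx, qy = np.sort(x), np.sort(y)
    cost = np.mean(np.abs(qx - qy) ** r)
    # For r >= 1 the distance carries the outer 1/r power; for 0 < r < 1 it does not.
    return cost ** (1.0 / r) if r >= 1 else cost

# Shifting a sample by a constant c gives W_r = c for every r >= 1.
x = np.array([0.0, 1.0, 2.0, 5.0])
print(wasserstein_r(x, x + 1.0, 1.0), wasserstein_r(x, x + 1.0, 2.0))
```

For a shift by $c$, $|F^{-1}(u) - G^{-1}(u)| = c$ for all $u$, so every $W_r$ with $r \ge 1$ returns exactly $c$, which is what the sketch illustrates.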
We also consider the following ideal distances of order $r$ (Zolotarev distances of order $r$). For two probability measures $\mu$ and $\nu$, and $r$ a positive real, let
$$
\zeta_r(\mu,\nu) = \sup\Big\{\int f\,d\mu - \int f\,d\nu : f\in\Lambda_r\Big\},
$$
where $\Lambda_r$ is defined as follows: denoting by $l$ the natural integer such that $l < r \le l+1$, $\Lambda_r$ is the class of real functions $f$ which are $l$-times continuously differentiable and such that
$$
|f^{(l)}(x) - f^{(l)}(y)| \le |x-y|^{r-l} \quad \text{for any } (x,y)\in\mathbb{R}\times\mathbb{R}. \tag{1.2}
$$
It follows from the Kantorovich–Rubinstein theorem (1958) that for any $0 < r \le 1$,
$$
W_r(\mu,\nu) = \zeta_r(\mu,\nu). \tag{1.3}
$$
For probability laws on the real line, Rio (1998) proved that for any $r > 1$,
$$
W_r(\mu,\nu) \le c_r\big(\zeta_r(\mu,\nu)\big)^{1/r}, \tag{1.4}
$$
where $c_r$ is a constant depending only on $r$.
For independent random variables, Ibragimov (1966) established that if $X_1\in\mathbb{L}^p$ for $p\in]2,3]$, then $W_1(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{1-p/2})$ (see his Theorem 4.3). Still in the case of independent r.v.'s, Zolotarev (1976) obtained the following upper bound for the ideal distance: if $X_1\in\mathbb{L}^p$ for $p\in]2,3]$, then $\zeta_p(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{1-p/2})$. From (1.4), the result of Zolotarev entails that, for $p\in]2,3]$, $W_p(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{1/p-1/2})$ (which was obtained by Sakhanenko (1985) for any $p > 2$). From (1.1) and Hölder's inequality, we easily get that for independent random variables in $\mathbb{L}^p$ with $p\in]2,3]$,
$$
W_r(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O\big(n^{-(p-2)/2r}\big) \quad \text{for any } 1 \le r \le p. \tag{1.5}
$$
In this paper, we are interested in extensions of (1.5) to sequences of dependent random variables. More precisely, for $X_1\in\mathbb{L}^p$ and $p$ in $]2,3]$, we shall give $\mathbb{L}^p$-projective criteria under which, for $r\in[p-2,p]$ and $(r,p)\neq(1,3)$,
$$
W_r(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O\big(n^{-(p-2)/2\max(1,r)}\big). \tag{1.6}
$$
As we shall see in Remark 2.4, (1.6) applied to $r = p-2$ provides the rate of convergence $O(n^{-\frac{p-2}{2(p-1)}})$ in the Berry–Esseen theorem.
When $(r,p) = (1,3)$, Dedecker and Rio (2008) obtained that $W_1(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{-1/2})$ for stationary sequences of random variables in $\mathbb{L}^3$ satisfying $\mathbb{L}^1$ projective criteria or weak dependence assumptions (a similar result was obtained by Pène (2005) in the case where the variables are bounded). In this particular case our approach provides a new criterion under which $W_1(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{-1/2}\log n)$.
Our paper is organized as follows. In Section 2, we give projective conditions for stationary martingale difference sequences to satisfy (1.6) in the case $(r,p)\neq(1,3)$. To be more precise, let $(X_i)_{i\in\mathbb{Z}}$ be a stationary sequence of martingale differences with respect to some $\sigma$-algebras $(\mathcal{F}_i)_{i\in\mathbb{Z}}$ (see Section 1.1 below for the definition of $(\mathcal{F}_i)_{i\in\mathbb{Z}}$). As a consequence of our Theorem 2.1, we obtain that if $(X_i)_{i\in\mathbb{Z}}$ is in $\mathbb{L}^p$ with $p\in]2,3]$ and satisfies
$$
\sum_{n=1}^{\infty} \frac{1}{n^{2-p/2}}\Big\|\mathbb{E}\Big(\frac{S_n^2}{n}\Big|\mathcal{F}_0\Big) - \sigma^2\Big\|_{p/2} < \infty, \tag{1.7}
$$
then the upper bound (1.6) holds provided that $(r,p)\neq(1,3)$. In the case $r = 1$ and $p = 3$, we obtain the upper bound $W_1(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{-1/2}\log n)$.
In Section 3, starting from the coboundary decomposition going back to Gordin (1969), and using the results of Section 2, we obtain $\mathbb{L}^p$-projective criteria ensuring (1.6) (if $(r,p)\neq(1,3)$). For instance, if $(X_i)_{i\in\mathbb{Z}}$ is a stationary sequence of $\mathbb{L}^p$ random variables adapted to $(\mathcal{F}_i)_{i\in\mathbb{Z}}$, we obtain (1.6) for any $p\in]2,3[$ and any $r\in[p-2,p]$ provided that (1.7) holds and the series $\mathbb{E}(S_n|\mathcal{F}_0)$ converges in $\mathbb{L}^p$. In the case where $p = 3$, this last condition has to be strengthened. Our approach also makes it possible to treat non-adapted sequences.

Section 4 is devoted to applications. In particular, we give sufficient conditions for some functions of Harris recurrent Markov chains and for functions of linear processes to satisfy the bound (1.6) in the case $(r,p)\neq(1,3)$, and the rate $O(n^{-1/2}\log n)$ when $r = 1$ and $p = 3$. Since projective criteria are verified under weak dependence assumptions, we give an application to functions of $\varphi$-dependent sequences in the sense of Dedecker and Prieur (2007). These conditions apply to unbounded functions of uniformly expanding maps.
1.1 Preliminary notations

Throughout the paper, $Y$ is a $N(0,1)$-distributed random variable. We shall also use the following notations. Let $(\Omega,\mathcal{A},\mathbb{P})$ be a probability space, and let $T:\Omega\mapsto\Omega$ be a bijective bimeasurable transformation preserving the probability $\mathbb{P}$. For a $\sigma$-algebra $\mathcal{F}_0$ satisfying $\mathcal{F}_0\subseteq T^{-1}(\mathcal{F}_0)$, we define the nondecreasing filtration $(\mathcal{F}_i)_{i\in\mathbb{Z}}$ by $\mathcal{F}_i = T^{-i}(\mathcal{F}_0)$. Let $\mathcal{F}_{-\infty} = \bigcap_{k\in\mathbb{Z}}\mathcal{F}_k$ and $\mathcal{F}_\infty = \bigvee_{k\in\mathbb{Z}}\mathcal{F}_k$. We shall sometimes denote by $\mathbb{E}_i$ the conditional expectation with respect to $\mathcal{F}_i$. Let $X_0$ be a zero mean square integrable random variable, and define the stationary sequence $(X_i)_{i\in\mathbb{Z}}$ by $X_i = X_0\circ T^i$.

2 Stationary sequences of martingale differences

In this section we give bounds for the ideal distance of order $r$ in the central limit theorem for stationary martingale difference sequences $(X_i)_{i\in\mathbb{Z}}$ under projective conditions.
Notation 2.1. For any $p > 2$, define the envelope norm $\|\cdot\|_{1,\Phi,p}$ by
$$
\|X\|_{1,\Phi,p} = \int_0^1 \big(1\vee\Phi^{-1}(1-u/2)\big)^{p-2}Q_X(u)\,du,
$$
where $\Phi$ denotes the d.f. of the $N(0,1)$ law, and $Q_X$ denotes the quantile function of $|X|$, that is the càdlàg inverse of the tail function $x\to\mathbb{P}(|X|>x)$.
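Both ingredients of this norm are elementary to approximate numerically. The sketch below is ours (helper names `phi_inv` and `envelope_norm` are assumptions, not the paper's notation): $\Phi^{-1}$ is obtained by bisection on `math.erf`, and the integral by a midpoint rule. As a degenerate sanity check, for $p = 2$ the weight $(1\vee\Phi^{-1}(1-u/2))^{p-2}$ is identically $1$, so the norm reduces to $\mathbb{E}|X|$:

```python
import math

def phi_inv(q):
    """Inverse of the standard normal d.f. Phi, by bisection on erf."""
    phi = lambda x: 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))
    lo, hi = -10.0, 10.0
    for _ in range(80):
        mid = 0.5 * (lo + hi)
        if phi(mid) < q:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def envelope_norm(Q, p, m=2000):
    """Midpoint-rule approximation of
    int_0^1 (1 v Phi^{-1}(1 - u/2))^(p-2) Q(u) du,
    where Q is the quantile function of |X|."""
    total = 0.0
    for k in range(m):
        u = (k + 0.5) / m
        w = max(1.0, phi_inv(1.0 - u / 2.0)) ** (p - 2.0)
        total += w * Q(u)
    return total / m

# |X| = 3 a.s.: Q is constant, and with p = 2 the weight is 1, so the norm is 3.
print(envelope_norm(lambda u: 3.0, 2.0))
```

Since the weight is always at least $1$, the norm dominates $\mathbb{E}|X|$ for every $p > 2$, consistent with Remark 2.1 below.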
Theorem 2.1. Let $(X_i)_{i\in\mathbb{Z}}$ be a stationary martingale difference sequence with respect to $(\mathcal{F}_i)_{i\in\mathbb{Z}}$. Let $\sigma$ denote the standard deviation of $X_0$. Let $p\in]2,3]$. Assume that $\mathbb{E}|X_0|^p < \infty$ and that
$$
\sum_{n=1}^{\infty}\frac{1}{n^{2-p/2}}\Big\|\mathbb{E}\Big(\frac{S_n^2}{n}\Big|\mathcal{F}_0\Big) - \sigma^2\Big\|_{1,\Phi,p} < \infty, \tag{2.1}
$$
and
$$
\sum_{n=1}^{\infty}\frac{1}{n^{2/p}}\Big\|\mathbb{E}\Big(\frac{S_n^2}{n}\Big|\mathcal{F}_0\Big) - \sigma^2\Big\|_{p/2} < \infty. \tag{2.2}
$$
Then, for any $r\in[p-2,p]$ with $(r,p)\neq(1,3)$, $\zeta_r(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{1-p/2})$, and for $p = 3$, $\zeta_1(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{-1/2}\log n)$.
Remark 2.1. Let $a > 1$ and $p > 2$. Applying Hölder's inequality, we see that there exists a positive constant $C(p,a)$ such that $\|X\|_{1,\Phi,p} \le C(p,a)\|X\|_a$. Consequently, if $p\in]2,3]$, the two conditions (2.1) and (2.2) are implied by the condition (1.7) given in the introduction.

Remark 2.2. Under the assumptions of Theorem 2.1, $\zeta_r(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{-r/2})$ if $r < p-2$. Indeed, let $p' = r+2$. Since $p' < p$, if the conditions (2.1) and (2.2) are satisfied for $p$, they also hold for $p'$. Hence Theorem 2.1 applies with $p'$.
From (1.3) and (1.4), the following result holds for the Wasserstein distances of order $r$.

Corollary 2.1. Under the conditions of Theorem 2.1, $W_r(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{-(p-2)/2\max(1,r)})$ for any $r$ in $[p-2,p]$, provided that $(r,p)\neq(1,3)$.

Remark 2.3. For $p$ in $]2,3]$, $W_p(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{-(p-2)/2p})$. This bound was obtained by Sakhanenko (1985) in the independent case. For $p < 3$, we have $W_1(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{1-p/2})$. This bound was obtained by Ibragimov (1966) in the independent case.
Remark 2.4. Recall that for two real valued random variables $X, Y$, the Ky Fan metric $\alpha(X,Y)$ is defined by $\alpha(X,Y) = \inf\{\epsilon > 0 : \mathbb{P}(|X-Y| > \epsilon) \le \epsilon\}$. Let $\Pi(\mu,\nu)$ be the Prokhorov distance between $\mu$ and $\nu$. By Theorem 11.3.5 in Dudley (1989) and Markov's inequality, one has, for any $r > 0$,
$$
\Pi(\mu,\nu) \le \alpha(X,Y) \le \big(\mathbb{E}|X-Y|^r\big)^{1/(r+1)}.
$$
Taking the minimum over the random couples $(X,Y)$ with law in $\mathcal{L}(\mu,\nu)$, we obtain that, for any $0 < r \le 1$, $\Pi(\mu,\nu) \le (W_r(\mu,\nu))^{1/(r+1)}$. Hence, if $\Pi_n$ is the Prokhorov distance between the law of $n^{-1/2}S_n$ and the normal distribution $N(0,\sigma^2)$,
$$
\Pi_n \le \big(W_r(P_{n^{-1/2}S_n}, G_{\sigma^2})\big)^{1/(r+1)} \quad \text{for any } 0 < r \le 1.
$$
Taking $r = p-2$, it follows that under the assumptions of Theorem 2.1,
$$
\Pi_n = O\big(n^{-\frac{p-2}{2(p-1)}}\big) \ \text{if } p < 3, \quad \text{and} \quad \Pi_n = O\big(n^{-1/4}\sqrt{\log n}\big) \ \text{if } p = 3. \tag{2.3}
$$
For $p$ in $]2,4]$, under (2.2), we have that $\|\sum_{i=1}^n\mathbb{E}(X_i^2-\sigma^2|\mathcal{F}_{i-1})\|_{p/2} = O(n^{2/p})$ (apply Theorem 2 in Wu and Zhao (2006)). Applying then the result in Heyde and Brown (1970), we get that if $(X_i)_{i\in\mathbb{Z}}$ is a stationary martingale difference sequence in $\mathbb{L}^p$ such that (2.2) is satisfied, then
$$
\|F_n - \Phi_\sigma\|_\infty = O\big(n^{-\frac{p-2}{2(p+1)}}\big),
$$
where $F_n$ is the distribution function of $n^{-1/2}S_n$ and $\Phi_\sigma$ is the d.f. of $G_{\sigma^2}$. Now
$$
\|F_n - \Phi_\sigma\|_\infty \le \big(1 + \sigma^{-1}(2\pi)^{-1/2}\big)\,\Pi_n.
$$
Consequently the bounds obtained in (2.3) improve the one given in Heyde and Brown (1970), provided that (2.1) holds.
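For two Dirac masses the Prokhorov/Wasserstein comparison of Remark 2.4 can be checked by hand: the only coupling of $\delta_0$ and $\delta_a$ is the pair $(0,a)$, so $W_r(\delta_0,\delta_a) = a^r$ for $0 < r \le 1$ (no outer $1/r$ power in this regime), while $\Pi(\delta_0,\delta_a) = \min(a,1)$. A small check (helper names are ours):

```python
# Only coupling of delta_0 and delta_a is (0, a): W_r = a^r for 0 < r <= 1,
# and the Prokhorov distance between the two point masses is min(a, 1).
def w_r(a, r):
    return a ** r

def prokhorov(a):
    return min(a, 1.0)

checked = 0
for a in (0.01, 0.25, 1.0, 4.0):
    for r in (0.25, 0.5, 1.0):
        # the bound Pi <= W_r^{1/(r+1)} from Remark 2.4
        assert prokhorov(a) <= w_r(a, r) ** (1.0 / (r + 1.0)) + 1e-12
        checked += 1
print("verified on", checked, "cases")
```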
Remark 2.5. If $(X_i)_{i\in\mathbb{Z}}$ is a stationary martingale difference sequence in $\mathbb{L}^3$ such that $\mathbb{E}(X_0^2) = \sigma^2$ and
$$
\sum_{k>0} k^{-1/2}\|\mathbb{E}(X_k^2|\mathcal{F}_0) - \sigma^2\|_{3/2} < \infty, \tag{2.4}
$$
then, according to Remark 2.1, the conditions (2.1) and (2.2) hold for $p = 3$. Consequently, if (2.4) holds, then Remark 2.4 gives $\|F_n - \Phi_\sigma\|_\infty = O(n^{-1/4}\sqrt{\log n})$. This result has to be compared with Theorem 6 in Jan (2001), which states that $\|F_n - \Phi_\sigma\|_\infty = O(n^{-1/4})$ if $\sum_{k>0}\|\mathbb{E}(X_k^2|\mathcal{F}_0) - \sigma^2\|_{3/2} < \infty$.
Remark 2.6. Notice that if $(X_i)_{i\in\mathbb{Z}}$ is a stationary martingale difference sequence, then the conditions (2.1) and (2.2) are respectively equivalent to
$$
\sum_{j\ge 0} 2^{j(p/2-1)}\|2^{-j}\mathbb{E}(S_{2^j}^2|\mathcal{F}_0) - \sigma^2\|_{1,\Phi,p} < \infty, \quad \text{and} \quad \sum_{j\ge 0} 2^{j(1-2/p)}\|2^{-j}\mathbb{E}(S_{2^j}^2|\mathcal{F}_0) - \sigma^2\|_{p/2} < \infty.
$$
To see this, let $A_n = \|\mathbb{E}(S_n^2|\mathcal{F}_0) - \mathbb{E}(S_n^2)\|_{1,\Phi,p}$ and $B_n = \|\mathbb{E}(S_n^2|\mathcal{F}_0) - \mathbb{E}(S_n^2)\|_{p/2}$. We first show that $A_n$ and $B_n$ are subadditive sequences. Indeed, by the martingale property and the stationarity of the sequence, for all positive $i$ and $j$,
$$
A_{i+j} = \big\|\mathbb{E}\big(S_i^2 + (S_{i+j}-S_i)^2\big|\mathcal{F}_0\big) - \mathbb{E}\big(S_i^2 + (S_{i+j}-S_i)^2\big)\big\|_{1,\Phi,p} \le A_i + \big\|\mathbb{E}\big((S_{i+j}-S_i)^2 - \mathbb{E}(S_j^2)\big|\mathcal{F}_0\big)\big\|_{1,\Phi,p}.
$$
Proceeding as in the proof of (4.6), p. 65 in Rio (2000), one can prove that, for any $\sigma$-field $\mathcal{A}$ and any integrable random variable $X$, $\|\mathbb{E}(X|\mathcal{A})\|_{1,\Phi,p} \le \|X\|_{1,\Phi,p}$. Hence
$$
\big\|\mathbb{E}\big((S_{i+j}-S_i)^2 - \mathbb{E}(S_j^2)\big|\mathcal{F}_0\big)\big\|_{1,\Phi,p} \le \big\|\mathbb{E}\big((S_{i+j}-S_i)^2 - \mathbb{E}(S_j^2)\big|\mathcal{F}_i\big)\big\|_{1,\Phi,p},
$$
and by stationarity the last norm is exactly $A_j$, so that $A_{i+j} \le A_i + A_j$.
3 Rates of convergence for stationary sequences

In this section, we give estimates for the ideal distances of order $r$ for stationary sequences which are not necessarily adapted to $(\mathcal{F}_i)_{i\in\mathbb{Z}}$.
Theorem 3.1. Let $(X_i)_{i\in\mathbb{Z}}$ be a stationary sequence of centered random variables in $\mathbb{L}^p$ with $p\in]2,3[$, and let $\sigma_n^2 = n^{-1}\mathbb{E}(S_n^2)$. Assume that
$$
\sum_{n>0}\mathbb{E}(X_n|\mathcal{F}_0) \quad \text{and} \quad \sum_{n>0}\big(X_{-n} - \mathbb{E}(X_{-n}|\mathcal{F}_0)\big) \quad \text{converge in } \mathbb{L}^p, \tag{3.1}
$$
and
$$
\sum_{n\ge 1} n^{-2+p/2}\|n^{-1}\mathbb{E}(S_n^2|\mathcal{F}_0) - \sigma_n^2\|_{p/2} < \infty. \tag{3.2}
$$
Then the series $\sum_{k\in\mathbb{Z}}\mathrm{Cov}(X_0,X_k)$ converges to some nonnegative $\sigma^2$, and

1. $\zeta_r(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{1-p/2})$ for $r\in[p-2,2]$,
2. $\zeta_r(P_{n^{-1/2}S_n}, G_{\sigma_n^2}) = O(n^{1-p/2})$ for $r\in]2,p]$.
Remark 3.1. According to the bound (5.40), we infer that, under the assumptions of Theorem 3.1, the condition (3.2) is equivalent to
$$
\sum_{n\ge 1} n^{-2+p/2}\|n^{-1}\mathbb{E}(S_n^2|\mathcal{F}_0) - \sigma^2\|_{p/2} < \infty. \tag{3.3}
$$
The same remark applies to the next theorem with $p = 3$.

Remark 3.2. The result of item 1 is valid with $\sigma_n$ instead of $\sigma$. On the contrary, the result of item 2 is no longer true if $\sigma_n$ is replaced by $\sigma$, because for $r\in]2,3]$, a necessary condition for $\zeta_r(\mu,\nu)$ to be finite is that the first two moments of $\mu$ and $\nu$ are equal. Note that under the assumptions of Theorem 3.1, both $W_r(P_{n^{-1/2}S_n}, G_{\sigma^2})$ and $W_r(P_{n^{-1/2}S_n}, G_{\sigma_n^2})$ are of the order of $n^{-(p-2)/2\max(1,r)}$. Indeed, in the case where $r\in]2,p]$, one has that
$$
W_r(P_{n^{-1/2}S_n}, G_{\sigma^2}) \le W_r(P_{n^{-1/2}S_n}, G_{\sigma_n^2}) + W_r(G_{\sigma_n^2}, G_{\sigma^2}),
$$
and the second term is of order $|\sigma - \sigma_n| = O(n^{-1/2})$.
In the case where $p = 3$, the condition (3.1) has to be strengthened.

Theorem 3.2. Let $(X_i)_{i\in\mathbb{Z}}$ be a stationary sequence of centered random variables in $\mathbb{L}^3$, and let $\sigma_n^2 = n^{-1}\mathbb{E}(S_n^2)$. Assume that (3.1) holds for $p = 3$ and that
$$
\sum_{n\ge 1}\frac{1}{n}\Big\|\sum_{k\ge n}\mathbb{E}(X_k|\mathcal{F}_0)\Big\|_3 < \infty \quad \text{and} \quad \sum_{n\ge 1}\frac{1}{n}\Big\|\sum_{k\ge n}\big(X_{-k} - \mathbb{E}(X_{-k}|\mathcal{F}_0)\big)\Big\|_3 < \infty. \tag{3.4}
$$
Assume in addition that
$$
\sum_{n\ge 1} n^{-1/2}\|n^{-1}\mathbb{E}(S_n^2|\mathcal{F}_0) - \sigma_n^2\|_{3/2} < \infty. \tag{3.5}
$$
Then the series $\sum_{k\in\mathbb{Z}}\mathrm{Cov}(X_0,X_k)$ converges to some nonnegative $\sigma^2$, and

1. $\zeta_1(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{-1/2}\log n)$,
2. $\zeta_r(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{-1/2})$ for $r\in]1,2]$,
3. $\zeta_r(P_{n^{-1/2}S_n}, G_{\sigma_n^2}) = O(n^{-1/2})$ for $r\in]2,3]$.
4 Applications

4.1 Martingale difference sequences and functions of Markov chains
Recall that the strong mixing coefficient of Rosenblatt (1956) between two $\sigma$-algebras $\mathcal{A}$ and $\mathcal{B}$ is defined by $\alpha(\mathcal{A},\mathcal{B}) = \sup\{|\mathbb{P}(A\cap B) - \mathbb{P}(A)\mathbb{P}(B)| : (A,B)\in\mathcal{A}\times\mathcal{B}\}$. For a strictly stationary sequence $(X_i)_{i\in\mathbb{Z}}$, let $\mathcal{F}_i = \sigma(X_k, k\le i)$. Define the mixing coefficients $\alpha_1(n)$ of the sequence $(X_i)_{i\in\mathbb{Z}}$ by
$$
\alpha_1(n) = \alpha(\mathcal{F}_0, \sigma(X_n)).
$$
For the sake of brevity, let $Q = Q_{X_0}$ (see Notation 2.1 for the definition). According to the results of Section 2, the following proposition holds.
Proposition 4.1. Let $(X_i)_{i\in\mathbb{Z}}$ be a stationary martingale difference sequence in $\mathbb{L}^p$ with $p\in]2,3]$. Assume moreover that the series
$$
\sum_{k\ge 1}\frac{1}{k^{2-p/2}}\int_0^{\alpha_1(k)}\big(1\vee\log(1/u)\big)^{(p-2)/2}Q^2(u)\,du \quad \text{and} \quad \sum_{k\ge 1}\frac{1}{k^{2/p}}\Big(\int_0^{\alpha_1(k)}Q^p(u)\,du\Big)^{2/p} \tag{4.1}
$$
are convergent. Then the conclusions of Theorem 2.1 hold.
Remark 4.1. From Theorem 2.1(b) in Dedecker and Rio (2008), a sufficient condition to get $W_1(P_{n^{-1/2}S_n}, G_{\sigma^2}) = O(n^{-1/2}\log n)$ is
$$
\sum_{k\ge 0}\int_0^{\alpha_1(k)}Q^3(u)\,du < \infty.
$$
This condition is always strictly stronger than the condition (4.1) when $p = 3$.
We now give an example. Consider the homogeneous Markov chain $(Y_i)_{i\in\mathbb{Z}}$ with state space $\mathbb{Z}$ described at page 320 in Davydov (1973). The transition probabilities are given by $p_{n,n+1} = p_{-n,-n-1} = a_n$ for $n\ge 0$, $p_{n,0} = p_{-n,0} = 1-a_n$ for $n > 0$, $p_{0,0} = 0$, $a_0 = 1/2$ and $1/2 \le a_n < 1$ for $n\ge 1$. This chain is irreducible and aperiodic. It is Harris positively recurrent as soon as $\sum_{n\ge 2}\prod_{k=1}^{n-1}a_k < \infty$. In that case the stationary chain is strongly mixing in the sense of Rosenblatt (1956).

Denote by $K$ the Markov kernel of the chain $(Y_i)_{i\in\mathbb{Z}}$. The functions $f$ such that $K(f) = 0$ almost everywhere are obtained by linear combinations of the two functions $f_1$ and $f_2$ given by $f_1(1) = 1$, $f_1(-1) = -1$ and $f_1(n) = f_1(-n) = 0$ if $n\neq 1$, and $f_2(0) = 1$, $f_2(1) = f_2(-1) = 0$ and $f_2(n+1) = f_2(-n-1) = 1 - a_n^{-1}$ if $n > 0$. Hence the functions $f$ such that $K(f) = 0$ are bounded.
If $(X_i)_{i\in\mathbb{Z}}$ is defined by $X_i = f(Y_i)$ with $K(f) = 0$, then Proposition 4.1 applies if (4.2) holds, which is the case as soon as $\mathbb{P}_0(\tau = n) = O(n^{-1-p/2}(\log n)^{-p/2-\varepsilon})$, where $\mathbb{P}_0$ is the probability of the chain starting from 0, and $\tau = \inf\{n > 0 : X_n = 0\}$. Now $\mathbb{P}_0(\tau = n) = (1-a_n)\prod_{i=1}^{n-1}a_i$ for $n\ge 2$. Consequently, if
$$
a_i = 1 - \frac{p}{2i}\Big(1 + \frac{1+\varepsilon}{\log i}\Big) \quad \text{for } i \text{ large enough},
$$
the condition (4.2) is satisfied and the conclusion of Theorem 2.1 holds.
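The identity $K(f_1) = K(f_2) = 0$ can be checked mechanically from the transition probabilities: $(Kf)(n) = a_{|n|}f(\mathrm{sign}(n)(|n|+1)) + (1-a_{|n|})f(0)$ for $n\neq 0$ and $(Kf)(0) = \frac{1}{2}(f(1)+f(-1))$. The sketch below is ours (function names are assumptions); the weights use the displayed choice of $a_i$ for large $i$, clipped to $[1/2,1)$ for small $i$ where the formula is not meant to apply:

```python
import math

def a(n, eps=0.1, p=3.0):
    """Transition weights: a_0 = 1/2; small n clipped so 1/2 <= a_n < 1;
    larger n use a_n = 1 - (p/2n)(1 + (1+eps)/log n) as in the example."""
    if n <= 2:
        return 0.5
    val = 1.0 - (p / (2.0 * n)) * (1.0 + (1.0 + eps) / math.log(n))
    return min(max(val, 0.5), 1.0 - 1e-9)

def K(f, n):
    """One step of the chain: from n != 0 move away from 0 w.p. a_|n|,
    fall back to 0 w.p. 1 - a_|n|; from 0 go to +1 or -1 w.p. 1/2."""
    if n == 0:
        return 0.5 * (f(1) + f(-1))
    m, s = abs(n), (1 if n > 0 else -1)
    return a(m) * f(s * (m + 1)) + (1.0 - a(m)) * f(0)

f1 = lambda n: 1.0 if n == 1 else (-1.0 if n == -1 else 0.0)

def f2(n):
    m = abs(n)
    if m == 0:
        return 1.0
    if m == 1:
        return 0.0
    return 1.0 - 1.0 / a(m - 1)   # f2(n+1) = f2(-n-1) = 1 - 1/a_n

# K annihilates both functions at every tested state.
for n in range(-8, 9):
    assert abs(K(f1, n)) < 1e-12 and abs(K(f2, n)) < 1e-12
print("K(f1) = K(f2) = 0 on all tested states")
```

For $f_2$, the cancellation is exactly $a_n(1 - a_n^{-1}) + (1-a_n) = 0$, which is why the value $1 - a_n^{-1}$ appears in the text.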
Remark 4.2. If $f$ is bounded and $K(f)\neq 0$, the central limit theorem may fail to hold for $S_n = \sum_{i=1}^n(f(Y_i) - \mathbb{E}(f(Y_i)))$. We refer to Example 2, page 321, in Davydov (1973), where $S_n$ properly normalized converges to a stable law with exponent strictly less than 2.
Proof of Proposition 4.1. Let $B_p(\mathcal{F}_0)$ be the set of $\mathcal{F}_0$-measurable random variables $Z$ such that $\|Z\|_p\le 1$. We first notice that
$$
\|\mathbb{E}(X_k^2|\mathcal{F}_0) - \sigma^2\|_{p/2} = \sup_{Z\in B_{p/(p-2)}(\mathcal{F}_0)}\mathrm{Cov}(Z, X_k^2).
$$
Applying Rio's covariance inequality (1993), we get that
$$
\|\mathbb{E}(X_k^2|\mathcal{F}_0) - \sigma^2\|_{p/2} \le 2\Big(\int_0^{\alpha_1(k)}Q^p(u)\,du\Big)^{2/p},
$$
which shows that the convergence of the second series in (4.1) implies (2.2). Now, from Fréchet (1957), we have that
$$
\|\mathbb{E}(X_k^2|\mathcal{F}_0) - \sigma^2\|_{1,\Phi,p} = \sup\big\{\mathbb{E}\big((1\vee|Z|^{p-2})\,|\mathbb{E}(X_k^2|\mathcal{F}_0) - \sigma^2|\big) : Z \ \mathcal{F}_0\text{-measurable},\ Z\sim N(0,1)\big\}.
$$
Hence, setting $\epsilon_k = \mathrm{sign}(\mathbb{E}(X_k^2|\mathcal{F}_0) - \sigma^2)$,
$$
\|\mathbb{E}(X_k^2|\mathcal{F}_0) - \sigma^2\|_{1,\Phi,p} = \sup\big\{\mathrm{Cov}\big(\epsilon_k(1\vee|Z|^{p-2}), X_k^2\big) : Z \ \mathcal{F}_0\text{-measurable},\ Z\sim N(0,1)\big\}.
$$
Applying again Rio's covariance inequality (1993), we get that
$$
\|\mathbb{E}(X_k^2|\mathcal{F}_0) - \sigma^2\|_{1,\Phi,p} \le C\int_0^{\alpha_1(k)}\big(1\vee\log(u^{-1})\big)^{(p-2)/2}Q^2(u)\,du,
$$
which shows that the convergence of the first series in (4.1) implies (2.1).
4.2 Linear processes and functions of linear processes

In what follows we say that the series $\sum_{i\in\mathbb{Z}}a_i$ converges if the two series $\sum_{i\ge 0}a_i$ and $\sum_{i<0}a_i$ converge.
Theorem 4.1. Let $(a_i)_{i\in\mathbb{Z}}$ be a sequence of real numbers in $\ell^2$ such that $\sum_{i\in\mathbb{Z}}a_i$ converges to some real $A$. Let $(\epsilon_i)_{i\in\mathbb{Z}}$ be a stationary sequence of martingale differences in $\mathbb{L}^p$ for $p\in]2,3]$. Let $X_k = \sum_{j\in\mathbb{Z}}a_j\epsilon_{k-j}$, and $\sigma_n^2 = n^{-1}\mathbb{E}(S_n^2)$. Let $b_0 = a_0 - A$ and $b_j = a_j$ for $j\neq 0$. Let $A_n = \sum_{j\in\mathbb{Z}}\big(\sum_{k=1}^n b_{k-j}\big)^2$. If $A_n = o(n)$, then $\sigma_n^2$ converges to $\sigma^2 = A^2\mathbb{E}(\epsilon_0^2)$. If moreover
$$
\sum_{n=1}^{\infty}\frac{1}{n^{2-p/2}}\Big\|\mathbb{E}\Big(\frac{1}{n}\Big(\sum_{j=1}^n\epsilon_j\Big)^2\Big|\mathcal{F}_0\Big) - \mathbb{E}(\epsilon_0^2)\Big\|_{p/2} < \infty, \tag{4.3}
$$
then:
1. If $A_n = O(1)$, then $\zeta_1(P_{n^{-1/2}S_n},\ldots)$
Remark 4.3. If the condition given by Heyde (1975) holds, that is, $\cdots$
Proof of Theorem 4.1. We start with the following decomposition: $S_n = M_n + R_n$, where $M_n = A\sum_{k=1}^n\epsilon_k$. From Burkholder's inequality, there exists a constant $C$ such that
$$
\|R_n\|_p \le C A_n^{1/2}\|\epsilon_0\|_p. \tag{4.7}
$$
According to Remark 2.1, since (4.3) holds, the two conditions (2.1) and (2.2) of Theorem 2.1 are satisfied by the martingale $M_n = A\sum_{k=1}^n\epsilon_k$. To conclude the proof, we use Lemma 5.2 given in Section 5.2, with the upper bound (4.7).
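The limit $\sigma^2 = A^2\mathbb{E}(\epsilon_0^2)$ from Theorem 4.1 can be checked numerically on a concrete kernel, since for stationary martingale differences with unit variance (uncorrelated increments) one has exactly $\mathbb{E}(S_n^2) = \sum_j\big(\sum_{k=1}^n a_{k-j}\big)^2$. The choice $a_i = 2^{-|i|}$ below is purely illustrative (so $A = 3$ and $\sigma^2 = 9$); helper names are ours:

```python
# Variance of S_n for X_k = sum_j a_j eps_{k-j} with unit-variance uncorrelated
# eps: E(S_n^2) = sum_j (sum_{k=1}^n a_{k-j})^2.  Illustrative kernel
# a_i = 2^{-|i|}, for which A = sum_i a_i = 3 and sigma^2 = A^2 = 9.
def a(i):
    return 2.0 ** (-abs(i))

def sigma2_n(n, J=200):
    # j outside [1-J, n+J) contributes less than 2^{-J}: ignored
    total = 0.0
    for j in range(1 - J, n + J):
        total += sum(a(k - j) for k in range(1, n + 1)) ** 2
    return total / n

print(sigma2_n(20), sigma2_n(200))   # approaches 9 as n grows
```

Here $\sigma_n^2 = 9 - c/n + o(1/n)$ for a fixed $c > 0$, matching the $A_n = O(1)$ regime of the theorem.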
Proof of Remarks 4.3 and 4.4. To prove Remark 4.3, note first that $\cdots$ Let $T_i = \cdots$

In the next result, we shall focus on functions of real-valued linear processes,
$$
X_k = h\Big(\sum_{j\in\mathbb{Z}}a_j\epsilon_{k-j}\Big).
$$
Proof of Theorem 4.2. Theorem 4.2 is a consequence of the following proposition.

Proposition 4.2. Let $(a_i)_{i\in\mathbb{Z}}$, $(\epsilon_i)_{i\in\mathbb{Z}}$ and $(X_i)_{i\in\mathbb{Z}}$ be as in Theorem 4.2. Let $(\epsilon'_i)_{i\in\mathbb{Z}}$ be an independent copy of $(\epsilon_i)_{i\in\mathbb{Z}}$. $\cdots$

To prove Theorem 4.2, it remains to check (4.10). We only check the first condition. Since $\cdots$ From Burkholder's inequality, for any $\beta > 0$, $\cdots$ which gives (4.9). The second part can be handled in the same way.
Proof of Proposition 4.2. Let $\mathcal{F}_i = \sigma(\epsilon_k, k\le i)$. We shall first prove that the condition (3.2) of Theorem 3.1 holds. We write
$$
\|\mathbb{E}(S_n^2|\mathcal{F}_0) - \mathbb{E}(S_n^2)\|_{p/2} \le 2\,\cdots
$$
where $\mathbb{E}_\epsilon$ denotes the conditional expectation with respect to $\epsilon$. Define $Y_i = \cdots$ and $Z_i = \cdots$
Applying first the triangle inequality, and next Hölder's inequality, we get that
$$
\|\mathbb{E}(X_iX_{k+i}|\mathcal{F}_0) - \mathbb{E}(X_iX_{k+i})\|_{p/2} \le \|h_0(Y'_{k+i}+Z_{k+i})\|_p\,\|h_0(Y'_i+Z_i) - h_0(Y'_i+Z'_i)\|_p + \cdots
$$
Let $m_{1,i} = |Y'_i+Z_i|\vee|Y'_i+Z'_i|$. Since $w_h$ $\cdots$ By subadditivity, we obtain that $\cdots$ Since $\cdots$ are identically distributed, it follows that $\cdots$ In the same way, $\cdots$ provided that the first condition in (4.10) holds.
We turn now to the control of $\sum_{i=1}^n\sum_{k=i}^n\|\mathbb{E}(X_iX_{k+i}|\mathcal{F}_0)\|_{p/2}$. We first write that $\cdots$ Using the same arguments as before, we get that
$$
\|\mathbb{E}(X_{k+i}|\mathcal{F}_{i+[k/2]})\|_p = \|\mathbb{E}(X_{b(k)}|\mathcal{F}_0)\|_p \le 2\,\cdots \tag{4.11}
$$
In the same way,
Let
$$
m_{2,i,k} = \Big|\sum_{j\in\mathbb{Z}}a_j\epsilon_{i-j}\Big| \vee \Big|\sum_{j<-[k/2]}a_j\epsilon'_{i-j} + \sum_{j\ge-[k/2]}a_j\epsilon_{i-j}\Big|.
$$
Using again the subadditivity of $t\to w_h(t,M)$, and the fact that $\big(\sum_{j<-[k/2]}a_j\epsilon_{i-j}, m_{2,i,k}\big)$, $\big(\sum_{j<-[k/2]}a_j\epsilon'_{i-j}, m_{2,i,k}\big)$ and $\big(\sum_{j<-[k/2]}a_j\epsilon_{-j}, M_{2,-[k/2]}\big)$ are identically distributed, we obtain that
$$
\big\|X_i - \mathbb{E}(X_i|\mathcal{F}_{i+[k/2]})\big\|_p = \big\|X_{-[k/2]} - \mathbb{E}(X_{-[k/2]}|\mathcal{F}_0)\big\|_p \le 2\Big\|w_h\Big(\sum_{j<-[k/2]}a_j\epsilon_{-j}, M_{2,-[k/2]}\Big)\Big\|_p. \tag{4.12}
$$
Consequently
$$
\sum_{n\ge 1}\frac{1}{n^{3-p/2}}\sum_{i=1}^n\sum_{k=i}^n\|\mathbb{E}(X_iX_{k+i}|\mathcal{F}_0)\|_{p/2} < \infty
$$
provided that (4.10) holds. This completes the proof of (3.2).

Using the bounds (4.11) and (4.12) (taking $b(k) = n$ in (4.11) and $[k/2] = n$ in (4.12)), we see that the condition (3.1) of Theorem 3.1 (and also the condition (3.4) of Theorem 3.2 in the case $p = 3$) holds under (4.10).
4.3 Functions of $\varphi$-dependent sequences

In order to include examples of dynamical systems satisfying some correlation inequalities, we introduce a weak version of the uniform mixing coefficients (see Dedecker and Prieur (2007)).
Definition 4.1. For any random variable $Y = (Y_1,\ldots,Y_k)$ with values in $\mathbb{R}^k$, define the function $g_{x,j}(t) = \mathbf{1}_{t\le x} - \mathbb{P}(Y_j\le x)$. For any $\sigma$-algebra $\mathcal{F}$, let
$$
\varphi(\mathcal{F}, Y) = \sup_{(x_1,\ldots,x_k)\in\mathbb{R}^k}\Big\|\mathbb{E}\Big(\prod_{j=1}^k g_{x_j,j}(Y_j)\Big|\mathcal{F}\Big) - \mathbb{E}\Big(\prod_{j=1}^k g_{x_j,j}(Y_j)\Big)\Big\|_\infty.
$$
For a sequence $\mathbf{Y} = (Y_i)_{i\in\mathbb{Z}}$, where $Y_i = Y_0\circ T^i$ and $Y_0$ is an $\mathcal{F}_0$-measurable and real-valued r.v., let
$$
\varphi_{k,\mathbf{Y}}(n) = \max_{1\le l\le k}\ \sup_{i_l>\cdots>i_1\ge n}\varphi\big(\mathcal{F}_0, (Y_{i_1},\ldots,Y_{i_l})\big).
$$
Definition 4.2. For any $p\ge 1$, let $\mathcal{C}(p,M,P_X)$ be the closed convex envelope of the set of functions $f$ which are monotonic on some open interval of $\mathbb{R}$ and null elsewhere, and such that $\mathbb{E}(|f(X)|^p) < M$.
Proposition 4.3. Let $p\in]2,3]$ and $s\ge p$. Let $X_i = f(Y_i) - \mathbb{E}(f(Y_i))$, where $Y_i = Y_0\circ T^i$ and $f$ belongs to $\mathcal{C}(s,M,P_{Y_0})$. Assume that
$$
\sum_{i\ge 1} i^{(p-4)/2+(s-2)/(s-1)}\varphi_{2,\mathbf{Y}}(i)^{(s-2)/s} < \infty. \tag{4.13}
$$
Then the conclusions of Theorem 4.2 hold.

Remark 4.5. Notice that if $s = p = 3$, the condition (4.13) becomes $\sum_{i\ge 1}\varphi_{2,\mathbf{Y}}(i)^{1/3} < \infty$.
Proof of Proposition 4.3. Let $B_p(\mathcal{F}_0)$ be the set of $\mathcal{F}_0$-measurable random variables $Z$ such that $\|Z\|_p\le 1$. We first notice that
$$
\|\mathbb{E}(X_k|\mathcal{F}_0)\|_p \le \|\mathbb{E}(X_k|\mathcal{F}_0)\|_s = \sup_{Z\in B_{s/(s-1)}(\mathcal{F}_0)}\mathrm{Cov}(Z, f(Y_k)).
$$
Applying Corollary 6.2 with $k = 2$ to the covariance on the right hand side (take $f_1 = \mathrm{Id}$ and $f_2 = f$), we obtain that
$$
\|\mathbb{E}(X_k|\mathcal{F}_0)\|_s \le \sup_{Z\in B_{s/(s-1)}(\mathcal{F}_0)} 8\big(\varphi(\sigma(Z),Y_k)\big)^{(s-1)/s}\|Z\|_{s/(s-1)}\big(\varphi(\sigma(Y_k),Z)\big)^{1/s}M^{1/s} \le 8\big(\varphi_{1,\mathbf{Y}}(k)\big)^{(s-1)/s}M^{1/s}, \tag{4.14}
$$
the last inequality being true because $\varphi(\sigma(Z),Y_k)\le\varphi_{1,\mathbf{Y}}(k)$ and $\varphi(\sigma(Y_k),Z)\le 1$. It follows that the conditions (3.1) (for $p\in]2,3]$) and (3.4) (for $p = 3$) are satisfied under (4.13). The condition (3.2) follows from the following lemma by taking $b = (4-p)/2$.
Lemma 4.1. Let $X_i$ be as in Proposition 4.3, and let $b\in]0,1[$. If
$$
\sum_{i\ge 1} i^{-b+(s-2)/(s-1)}\varphi_{2,\mathbf{Y}}(i)^{(s-2)/s} < \infty, \quad \text{then} \quad \sum_{n>1}\frac{1}{n^{1+b}}\|\mathbb{E}(S_n^2|\mathcal{F}_0) - \mathbb{E}(S_n^2)\|_{p/2} < \infty.
$$
Proof of Lemma 4.1. Since
$$
\|\mathbb{E}(S_n^2|\mathcal{F}_0) - \mathbb{E}(S_n^2)\|_{p/2} \le 2\sum_{i=1}^n\sum_{k=0}^{n-i}\|\mathbb{E}(X_iX_{k+i}|\mathcal{F}_0) - \mathbb{E}(X_iX_{k+i})\|_{p/2},
$$
we infer that there exists $C > 0$ such that
$$
\sum_{n>1}\frac{1}{n^{1+b}}\|\mathbb{E}(S_n^2|\mathcal{F}_0) - \mathbb{E}(S_n^2)\|_{p/2} \le C\sum_{i>0}\sum_{k\ge 0}\frac{1}{(i+k)^b}\|\mathbb{E}(X_iX_{k+i}|\mathcal{F}_0) - \mathbb{E}(X_iX_{k+i})\|_{p/2}. \tag{4.15}
$$
We shall bound $\|\mathbb{E}(X_iX_{k+i}|\mathcal{F}_0) - \mathbb{E}(X_iX_{k+i})\|_{p/2}$ in two ways. First, using the stationarity and the upper bound (4.14), we have that
$$
\|\mathbb{E}(X_iX_{k+i}|\mathcal{F}_0) - \mathbb{E}(X_iX_{k+i})\|_{p/2} \le 2\|X_0\mathbb{E}(X_k|\mathcal{F}_0)\|_{p/2} \le 16\|X_0\|_p M^{1/s}\big(\varphi_{1,\mathbf{Y}}(k)\big)^{(s-1)/s}. \tag{4.16}
$$
Next, note that
$$
\|\mathbb{E}(X_iX_{k+i}|\mathcal{F}_0) - \mathbb{E}(X_iX_{k+i})\|_{p/2} = \sup_{Z\in B_{s/(s-2)}(\mathcal{F}_0)}\mathrm{Cov}(Z, X_iX_{k+i}) = \sup_{Z\in B_{s/(s-2)}(\mathcal{F}_0)}\mathbb{E}\big((Z-\mathbb{E}(Z))X_iX_{k+i}\big).
$$
Applying Corollary 6.2 with $k = 3$ to the term $\mathbb{E}((Z-\mathbb{E}(Z))X_iX_{k+i})$ (take $f_1 = \mathrm{Id}$, $f_2 = f_3 = f$), and since $\varphi(\sigma(Z),Y_i,Y_{k+i})\le\varphi_{2,\mathbf{Y}}(i)$ and $\varphi(\sigma(Y_i),Z,Y_{k+i})\le 1$, $\varphi(\sigma(Y_{k+i}),Z,Y_i)\le 1$, we infer that
$$
\|\mathbb{E}(X_iX_{k+i}|\mathcal{F}_0) - \mathbb{E}(X_iX_{k+i})\|_{p/2} \le 32\big(\varphi_{2,\mathbf{Y}}(i)\big)^{(s-2)/s}M^{2/s}. \tag{4.17}
$$
From (4.15), (4.16) and (4.17), we infer that the conclusion of Lemma 4.1 holds provided that
$$
\sum_{i>0}\sum_{k=1}^{[i^{(s-2)/(s-1)}]}\frac{1}{(i+k)^b}\big(\varphi_{2,\mathbf{Y}}(i)\big)^{(s-2)/s} + \sum_{k\ge 0}\sum_{i=1}^{[k^{(s-1)/(s-2)}]}\frac{1}{(i+k)^b}\big(\varphi_{1,\mathbf{Y}}(k)\big)^{(s-1)/s} < \infty.
$$
Here, note that
$$
\sum_{k=1}^{[i^{(s-2)/(s-1)}]}\frac{1}{(i+k)^b} \le i^{-b+\frac{s-2}{s-1}} \quad \text{and} \quad \sum_{i=1}^{[k^{(s-1)/(s-2)}]}\frac{1}{(i+k)^b} \le \sum_{m=1}^{[2k^{(s-1)/(s-2)}]}\frac{1}{m^b} \le D\,k^{\frac{(1-b)(s-1)}{s-2}},
$$
for some $D > 0$. Since $\varphi_{1,\mathbf{Y}}(k)\le\varphi_{2,\mathbf{Y}}(k)$, the conclusion of Lemma 4.1 holds provided
$$
\sum_{i\ge 1} i^{-b+\frac{s-2}{s-1}}\varphi_{2,\mathbf{Y}}(i)^{\frac{s-2}{s}} < \infty \quad \text{and} \quad \sum_{k\ge 1} k^{\frac{(1-b)(s-1)}{s-2}}\varphi_{2,\mathbf{Y}}(k)^{\frac{s-1}{s}} < \infty.
$$
To complete the proof, it remains to prove that the second series converges provided the first one does. If the first series converges, then
$$
\lim_{n\to\infty}\sum_{i=n+1}^{2n} i^{-b+\frac{s-2}{s-1}}\varphi_{2,\mathbf{Y}}(i)^{\frac{s-2}{s}} = 0. \tag{4.18}
$$
Since $\varphi_{2,\mathbf{Y}}(i)$ is nonincreasing, we infer from (4.18) that $\varphi_{2,\mathbf{Y}}(i)^{1/s} = o\big(i^{-1/(s-1)-(1-b)/(s-2)}\big)$. It follows that $\varphi_{2,\mathbf{Y}}(k)^{(s-1)/s} \le C\varphi_{2,\mathbf{Y}}(k)^{(s-2)/s}k^{-1/(s-1)-(1-b)/(s-2)}$ for some positive constant $C$, and the second series converges.
4.3.1 Application to expanding maps

Let $BV$ be the class of bounded variation functions from $[0,1]$ to $\mathbb{R}$. For any $h\in BV$, denote by $\|dh\|$ the variation norm of the measure $dh$. Let $T$ be a map from $[0,1]$ to $[0,1]$ preserving a probability $\mu$ on $[0,1]$, and let
$$
S_n(f) = \sum_{k=1}^n\big(f\circ T^k - \mu(f)\big).
$$
Define the Perron–Frobenius operator $K$ from $L^2([0,1],\mu)$ to $L^2([0,1],\mu)$ via the equality
$$
\int_0^1 (Kh)(x)f(x)\,\mu(dx) = \int_0^1 h(x)(f\circ T)(x)\,\mu(dx). \tag{4.19}
$$
A Markov kernel $K$ is said to be $BV$-contracting if there exist $C > 0$ and $\rho\in[0,1[$ such that
$$
\|dK^n(h)\| \le C\rho^n\|dh\| \quad \text{for any } h\in BV.
$$
The map $T$ is said to be $BV$-contracting if its Perron–Frobenius operator is $BV$-contracting.
Let us present a large class of $BV$-contracting maps. We shall say that $T$ is uniformly expanding if it belongs to the class $\mathcal{C}$ defined in Broise (1996), Section 2.1, page 11. Recall that if $T$ is uniformly expanding, then there exists a probability measure $\mu$ on $[0,1]$, whose density $f_\mu$ with respect to the Lebesgue measure is a bounded variation function, and such that $\mu$ is invariant by $T$. Consider now the more restrictive conditions:

(a) $T$ is uniformly expanding.
(b) The invariant measure $\mu$ is unique and $(T,\mu)$ is mixing in the ergodic-theoretic sense.
(c) $\frac{1}{f_\mu}\mathbf{1}_{f_\mu>0}$ is a bounded variation function.

Starting from Proposition 4.11 in Broise (1996), one can prove that if $T$ satisfies the assumptions (a), (b) and (c) above, then it is $BV$-contracting (see for instance Dedecker and Prieur (2007), Section 6.3). Some well known examples of maps satisfying the conditions (a), (b) and (c) are:

1. $T(x) = \beta x - [\beta x]$ for $\beta > 1$. These maps are called $\beta$-transformations.
2. $[0,1]$ is the finite union of disjoint intervals $(I_k)_{1\le k\le n}$, and $T(x) = a_kx + b_k$ on $I_k$, with $|a_k| > 1$.
3. $T(x) = a(x^{-1}-1) - [a(x^{-1}-1)]$ for some $a > 0$. For $a = 1$, this transformation is known as the Gauss map.
Proposition 4.4. Let $\sigma_n^2 = n^{-1}\mathbb{E}(S_n^2(f))$. If $T$ is $BV$-contracting, and if $f$ belongs to $\mathcal{C}(p,M,\mu)$ with $p\in]2,3]$, then the series $\mu((f-\mu(f))^2) + 2\sum_{n>0}\mu(f\circ T^n\cdot(f-\mu(f)))$ converges to some nonnegative $\sigma^2$, and

1. $\zeta_1(P_{n^{-1/2}S_n(f)}, G_{\sigma^2}) = O(n^{-1/2}\log n)$ for $p = 3$,
2. $\zeta_r(P_{n^{-1/2}S_n(f)}, G_{\sigma^2}) = O(n^{1-p/2})$ for $r\in[p-2,2]$ and $(r,p)\neq(1,3)$,
3. $\zeta_r(P_{n^{-1/2}S_n(f)}, G_{\sigma_n^2}) = O(n^{1-p/2})$ for $r\in]2,p]$.
Proof of Proposition 4.4. Let $(Y_i)_{i\ge 1}$ be the Markov chain with transition kernel $K$ and invariant measure $\mu$. Using the equation (4.19), it is easy to see that $(Y_0,\ldots,Y_n)$ is distributed as $(T^{n+1},\ldots,T)$. Consequently, to prove Proposition 4.4, it suffices to prove that the sequence $X_i = f(Y_i) - \mu(f)$ satisfies the condition (4.13) of Proposition 4.3.

According to Lemma 1 in Dedecker and Prieur (2007), the coefficients $\varphi_{2,\mathbf{Y}}(i)$ of the chain $(Y_i)_{i\ge 0}$ with respect to $\mathcal{F}_i = \sigma(Y_j, j\le i)$ satisfy $\varphi_{2,\mathbf{Y}}(i)\le C\rho^i$ for some $\rho\in]0,1[$ and some positive constant $C$. It follows that (4.13) is satisfied for $s = p$.
5 Proofs of the main results

From now on, we denote by $C$ a numerical constant which may vary from line to line.

Notation 5.1. For $l$ integer, $q$ in $]l,l+1]$ and $f$ an $l$-times continuously differentiable function, we set
$$
|f|_{\Lambda_q} = \sup\big\{|x-y|^{l-q}\,|f^{(l)}(x) - f^{(l)}(y)| : x\neq y\big\}.
$$
5.1 Proof of Theorem 2.1

We prove Theorem 2.1 in the case $\sigma = 1$. The general case follows by dividing the random variables by $\sigma$. Since $\zeta_r(P_{aX}, P_{aY}) = |a|^r\zeta_r(P_X, P_Y)$, it is enough to bound $\zeta_r(P_{S_n}, G_n)$. We first give an upper bound for $\zeta_{p,N} := \zeta_p(P_{S_{2^N}}, G_{2^N})$.
Proposition 5.1. Let $(X_i)_{i\in\mathbb{Z}}$ be a stationary martingale difference sequence in $\mathbb{L}^p$ for $p$ in $]2,3]$. Let $M_p = \mathbb{E}(|X_0|^p)$. Then for any natural integer $N$,
$$
2^{-2N/p}\zeta_{p,N}^{2/p} \le \Big(M_p + \frac{1}{2p^2}\sum_{K=0}^{N}2^{K(p/2-2)}\|Z_K\|_{1,\Phi,p}\Big)^{2/p} + \frac{2}{p}\Delta_N, \tag{5.1}
$$
where $Z_K = \mathbb{E}(S_{2^K}^2|\mathcal{F}_0) - \mathbb{E}(S_{2^K}^2)$ and $\Delta_N = \sum_{K=0}^{N-1}2^{-2K/p}\|Z_K\|_{p/2}$.
Proof of Proposition 5.1. The proof is done by induction on $N$. Let $(Y_i)_{i\in\mathbb{N}}$ be a sequence of $N(0,1)$-distributed independent random variables, independent of the sequence $(X_i)_{i\in\mathbb{Z}}$. For $m > 0$, let $T_m = Y_1 + Y_2 + \cdots + Y_m$. Set $S_0 = T_0 = 0$. For any numerical function $f$ and $m\le n$, set
$$
f_{n-m}(x) = \mathbb{E}\big(f(x + T_n - T_m)\big).
$$
Then, from the independence of the above sequences,
$$
\mathbb{E}\big(f(S_n) - f(T_n)\big) = \sum_{m=1}^n D_m \quad \text{with} \quad D_m = \mathbb{E}\big(f_{n-m}(S_{m-1}+X_m) - f_{n-m}(S_{m-1}+Y_m)\big). \tag{5.2}
$$
For any two-times differentiable function $g$, the Taylor integral formula at order two writes
$$
g(x+h) - g(x) = g'(x)h + \frac{1}{2}h^2g''(x) + h^2\int_0^1(1-t)\big(g''(x+th) - g''(x)\big)\,dt. \tag{5.3}
$$
Hence, for any $q$ in $]2,3]$,
$$
\Big|g(x+h) - g(x) - g'(x)h - \frac{1}{2}h^2g''(x)\Big| \le h^2\int_0^1(1-t)|th|^{q-2}|g|_{\Lambda_q}\,dt \le \frac{1}{q(q-1)}|h|^q|g|_{\Lambda_q}. \tag{5.4}
$$
Let
$$
D'_m = \mathbb{E}\big(f''_{n-m}(S_{m-1})(X_m^2 - 1)\big) = \mathbb{E}\big(f''_{n-m}(S_{m-1})(X_m^2 - Y_m^2)\big).
$$
From (5.4) applied twice with $g = f_{n-m}$, $x = S_{m-1}$ and $h = X_m$ or $h = Y_m$, together with the martingale property,
$$
\Big|D_m - \frac{1}{2}D'_m\Big| \le \frac{1}{p(p-1)}|f_{n-m}|_{\Lambda_p}\mathbb{E}\big(|X_m|^p + |Y_m|^p\big).
$$
Now $\mathbb{E}(|Y_m|^p) \le p-1 \le (p-1)M_p$. Hence
$$
|D_m - (D'_m/2)| \le M_p|f_{n-m}|_{\Lambda_p}. \tag{5.5}
$$
Assume now that $f$ belongs to $\Lambda_p$. Then the smoothed function $f_{n-m}$ belongs to $\Lambda_p$ also, so that $|f_{n-m}|_{\Lambda_p}\le 1$. Hence, summing on $m$, we get that
$$
\mathbb{E}\big(f(S_n) - f(T_n)\big) \le nM_p + (D'/2), \quad \text{where } D' = D'_1 + D'_2 + \cdots + D'_n. \tag{5.6}
$$
Notation 5.2. Set $m_0 = m-1$ and write $m_0$ in basis 2: $m_0 = \sum_{i=0}^N b_i2^i$ with $b_i = 0$ or $b_i = 1$ (note that $b_N = 0$). Set $m_L = \sum_{i=L}^N b_i2^i$, so that $m_N = 0$. Let $I_{L,k} = ]k2^L, (k+1)2^L]\cap\mathbb{N}$ (note that $I_{N,1} = ]2^N, 2^{N+1}]$), $U_L^{(k)} = \sum_{i\in I_{L,k}}X_i$ and $\tilde{U}_L^{(k)} = \sum_{i\in I_{L,k}}Y_i$. For the sake of brevity, let $U_L^{(0)} = U_L$ and $\tilde{U}_L^{(0)} = \tilde{U}_L$.
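The dyadic bookkeeping of Notation 5.2 can be verified directly: truncating the binary expansion of $m_0 = m-1$ below level $L$ is the map $m\mapsto\lfloor(m-1)/2^L\rfloor 2^L$, and the set $\{m : m_L = k2^L\}$ is exactly the block $I_{L,k}$, a fact used repeatedly below. A short check (helper name `m_L` is ours):

```python
# m_L clears the L lowest binary digits of m0 = m - 1; the claim used in the
# proof is that {m : m_L = k 2^L} = I_{L,k} = (k 2^L, (k+1) 2^L].
N = 6

def m_L(m, L):
    return ((m - 1) >> L) << L

for L in range(N + 1):
    for k in range(2 ** (N - L)):
        block = {m for m in range(1, 2 ** N + 1) if m_L(m, L) == k * 2 ** L}
        interval = set(range(k * 2 ** L + 1, (k + 1) * 2 ** L + 1))
        assert block == interval
assert all(m_L(m, N) == 0 for m in range(1, 2 ** N + 1))   # m_N = 0
print("blocks match I_{L,k} and m_N = 0")
```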
Since $m_N = 0$, the following elementary identity is valid:
$$
D'_m = \sum_{L=0}^{N-1}\mathbb{E}\Big(\big(f''_{n-1-m_L}(S_{m_L}) - f''_{n-1-m_{L+1}}(S_{m_{L+1}})\big)(X_m^2 - 1)\Big).
$$
Now $m_L\neq m_{L+1}$ only if $b_L = 1$, and in this case $m_L = k2^L$ with $k$ odd. It follows that
$$
D' = \sum_{L=0}^{N-1}\sum_{\substack{k\in I_{N-L,0}\\ k\ \mathrm{odd}}}\mathbb{E}\Big(\big(f''_{n-1-k2^L}(S_{k2^L}) - f''_{n-1-(k-1)2^L}(S_{(k-1)2^L})\big)\sum_{\{m:\,m_L = k2^L\}}(X_m^2 - \sigma^2)\Big). \tag{5.7}
$$
Note that $\{m : m_L = k2^L\} = I_{L,k}$. Now, by the martingale property,
$$
\mathbb{E}_{k2^L}\Big(\sum_{i\in I_{L,k}}(X_i^2 - \sigma^2)\Big) = \mathbb{E}_{k2^L}\big((U_L^{(k)})^2\big) - \mathbb{E}\big((U_L^{(k)})^2\big) := Z_L^{(k)}.
$$
Consequently
$$
\begin{aligned}
D' &= \sum_{L=0}^{N-1}\sum_{\substack{k\in I_{N-L,0}\\ k\ \mathrm{odd}}}\mathbb{E}\Big(\big(f''_{n-1-k2^L}(S_{k2^L}) - f''_{n-1-(k-1)2^L}(S_{(k-1)2^L})\big)Z_L^{(k)}\Big)\\
&= \sum_{L=0}^{N-1}\sum_{\substack{k\in I_{N-L,0}\\ k\ \mathrm{odd}}}\mathbb{E}\Big(\big(f''_{n-1-k2^L}(S_{k2^L}) - f''_{n-1-k2^L}(S_{(k-1)2^L} + T_{k2^L} - T_{(k-1)2^L})\big)Z_L^{(k)}\Big), 
\end{aligned}\tag{5.8}
$$
since $(X_i)_{i\in\mathbb{N}}$ and $(Y_i)_{i\in\mathbb{N}}$ are independent. By using (1.2), we get that
$$
D' \le \sum_{L=0}^{N-1}\sum_{\substack{k\in I_{N-L,0}\\ k\ \mathrm{odd}}}\mathbb{E}\big(|U_L^{(k-1)} - \tilde{U}_L^{(k-1)}|^{p-2}|Z_L^{(k)}|\big).
$$
From the stationarity of $(X_i)_{i\in\mathbb{N}}$ and the above inequality,
$$
D' \le \frac{1}{2}\sum_{K=0}^{N-1}2^{N-K}\mathbb{E}\big(|U_K - \tilde{U}_K|^{p-2}|Z_K^{(1)}|\big). \tag{5.9}
$$
Now let $V_K$ be the $N(0,2^K)$-distributed random variable defined from $U_K$ via the quantile transformation, that is,
$$
V_K = 2^{K/2}\Phi^{-1}\big(F_K(U_K-0) + \delta_K(F_K(U_K) - F_K(U_K-0))\big),
$$
where $F_K$ denotes the d.f. of $U_K$, and $(\delta_K)$ is a sequence of independent r.v.'s uniformly distributed on $[0,1]$, independent of the underlying random variables. Now, from the subadditivity of $x\to x^{p-2}$, $|U_K - \tilde{U}_K|^{p-2} \le |U_K - V_K|^{p-2} + |V_K - \tilde{U}_K|^{p-2}$. Hence
$$
\mathbb{E}\big(|U_K - \tilde{U}_K|^{p-2}|Z_K^{(1)}|\big) \le \|U_K - V_K\|_p^{p-2}\|Z_K^{(1)}\|_{p/2} + \mathbb{E}\big(|V_K - \tilde{U}_K|^{p-2}|Z_K^{(1)}|\big). \tag{5.10}
$$
By definition of $V_K$, the real number $\|U_K - V_K\|_p$ is the so-called Wasserstein distance of order $p$ between the law of $U_K^{(0)}$ and the $N(0,2^K)$ normal law. Therefore, by Theorem 3.1 of Rio (2007) (which improves the constants given in Theorem 1 of Rio (1998)), we get that, for $p\in]2,3]$,
$$
\|U_K - V_K\|_p \le 2\big(2(p-1)\zeta_{p,K}\big)^{1/p} \le 2\big(4\zeta_{p,K}\big)^{1/p}. \tag{5.11}
$$
Now, since $V_K$ and $\tilde{U}_K$ are independent, their difference has the $N(0,2^{K+1})$ distribution. Note that if $Y$ is a $N(0,1)$-distributed random variable, $Q_{|Y|^{p-2}}(u) = (\Phi^{-1}(1-u/2))^{p-2}$. Hence, by Fréchet's inequality (1957) (see also Inequality (1.11b), page 9, in Rio (2000)), and by definition of the norm $\|\cdot\|_{1,\Phi,p}$,
$$
\mathbb{E}\big(|V_K - \tilde{U}_K|^{p-2}|Z_K^{(1)}|\big) \le 2^{(K+1)(p/2-1)}\|Z_K\|_{1,\Phi,p}. \tag{5.12}
$$
From (5.10), (5.11) and (5.12), we get that
$$
\mathbb{E}\big(|U_K - \tilde{U}_K|^{p-2}|Z_K^{(1)}|\big) \le 2^{p-4/p}\zeta_{p,K}^{(p-2)/p}\|Z_K\|_{p/2} + 2^{(K+1)(p/2-1)}\|Z_K\|_{1,\Phi,p}. \tag{5.13}
$$
Then, from (5.6), (5.9) and (5.13), we get
$$
2^{-N}\zeta_{p,N} \le M_p + 2^{p/2-3}\Delta'_N + 2^{p-2-4/p}\sum_{K=0}^{N-1}2^{-K}\zeta_{p,K}^{(p-2)/p}\|Z_K\|_{p/2},
$$
where $\Delta'_N = \sum_{K=0}^{N-1}2^{K(p/2-2)}\|Z_K\|_{1,\Phi,p}$. Consequently we get the induction inequality
$$
2^{-N}\zeta_{p,N} \le M_p + \frac{1}{2p^2}\Delta'_N + \sum_{K=0}^{N-1}2^{-K}\zeta_{p,K}^{(p-2)/p}\|Z_K\|_{p/2}. \tag{5.14}
$$
We now prove (5.1) by induction on $N$. First, by (5.6) applied with $n = 1$, one has $\zeta_{p,0}\le M_p$, since $D'_1 = f''(0)\mathbb{E}(X_1^2-1) = 0$. Assume now that $\zeta_{p,L}$ satisfies (5.1) for any $L$ in $[0,N-1]$. Starting from (5.14), using the induction hypothesis and the fact that $\Delta'_K\le\Delta'_N$, we get that
$$
2^{-N}\zeta_{p,N} \le M_p + \frac{1}{2p^2}\Delta'_N + \sum_{K=0}^{N-1}2^{-2K/p}\|Z_K\|_{p/2}\Big(\Big(M_p + \frac{1}{2p^2}\Delta'_N\Big)^{2/p} + \frac{2}{p}\Delta_K\Big)^{p/2-1}.
$$
Now $2^{-2K/p}\|Z_K\|_{p/2} = \Delta_{K+1} - \Delta_K$. Consequently
$$
2^{-N}\zeta_{p,N} \le M_p + \frac{1}{2p^2}\Delta'_N + \int_0^{\Delta_N}\Big(\Big(M_p + \frac{1}{2p^2}\Delta'_N\Big)^{2/p} + \frac{2}{p}x\Big)^{p/2-1}dx,
$$
which implies (5.1) for $\zeta_{p,N}$.
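The closing integral is explicit: since $\frac{d}{dx}\big(a + \frac{2}{p}x\big)^{p/2} = \big(a + \frac{2}{p}x\big)^{p/2-1}$, the integral from $0$ to $\Delta_N$ equals $\big(a + \frac{2}{p}\Delta_N\big)^{p/2} - a^{p/2}$ with $a = (M_p + \frac{1}{2p^2}\Delta'_N)^{2/p}$, and raising (5.1) to the power $p/2$ recovers exactly this bound. A quick numerical confirmation of the antiderivative identity (helper names ours):

```python
# Check: integral_0^D (a + (2/p) x)^(p/2 - 1) dx = (a + (2/p) D)^(p/2) - a^(p/2),
# comparing a midpoint-rule sum with the closed form.
def integral_midpoint(a, p, D, m=20000):
    h = D / m
    return h * sum((a + (2.0 / p) * (k + 0.5) * h) ** (p / 2.0 - 1.0)
                   for k in range(m))

def closed_form(a, p, D):
    return (a + (2.0 / p) * D) ** (p / 2.0) - a ** (p / 2.0)

for p in (2.5, 3.0):
    lhs, rhs = integral_midpoint(1.3, p, 2.0), closed_form(1.3, p, 2.0)
    assert abs(lhs - rhs) < 1e-6
print("antiderivative identity confirmed")
```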
In order to prove Theorem 2.1, we will also need a smoothing argument. This is the purpose of the lemma below.

Lemma 5.1. Let $S$ and $T$ be two centered and square integrable random variables with the same variance. For any $r$ in $]0,p]$,
$$
\zeta_r(P_S, P_T) \le 2\zeta_r(P_S*G_1, P_T*G_1) + 4\sqrt{2}.
$$
Proof of Lemma 5.1. Throughout the sequel, let $Y$ be a $N(0,1)$-distributed random variable, independent of the $\sigma$-field generated by $(S,T)$.

For $r\le 2$, since $\zeta_r$ is an ideal metric with respect to the convolution,
$$
\zeta_r(P_S,P_T) \le \zeta_r(P_S*G_1, P_T*G_1) + 2\zeta_r(\delta_0, G_1) \le \zeta_r(P_S*G_1, P_T*G_1) + 2\mathbb{E}|Y|^r,
$$
which implies Lemma 5.1 for $r\le 2$. For $r > 2$, from (5.4), for any $f$ in $\Lambda_r$,
$$
\Big|f(S) - f(S+Y) + f'(S)Y + \frac{1}{2}f''(S)Y^2\Big| \le \frac{1}{r(r-1)}|Y|^r.
$$
Taking the expectation and noting that $\mathbb{E}|Y|^r\le r-1$ for $r$ in $]2,3]$, we infer that
$$
\Big|\mathbb{E}\Big(f(S) - f(S+Y) + \frac{1}{2}f''(S)\Big)\Big| \le \frac{1}{r}.
$$
Obviously this inequality still holds for $T$ instead of $S$ and $-f$ instead of $f$, so that, adding the two inequalities so obtained,
$$
\mathbb{E}\big(f(S) - f(T)\big) \le \mathbb{E}\big(f(S+Y) - f(T+Y)\big) + \frac{1}{2}\mathbb{E}\big(f''(T) - f''(S)\big) + 1.
$$
Since $f''$ belongs to $\Lambda_{r-2}$, it follows that
$$
\zeta_r(P_S,P_T) \le \zeta_r(P_S*G_1, P_T*G_1) + \frac{1}{2}\zeta_{r-2}(P_S,P_T) + 1.
$$
Now $r-2\le 1$. Hence
$$
\zeta_{r-2}(P_S,P_T) = W_{r-2}(P_S,P_T) \le \big(W_r(P_S,P_T)\big)^{r-2}.
$$
Next, by Theorem 3.1 in Rio (2007), $W_r(P_S,P_T) \le (32\zeta_r(P_S,P_T))^{1/r}$. Furthermore,
$$
\big(32\zeta_r(P_S,P_T)\big)^{1-2/r} \le \zeta_r(P_S,P_T)
$$
as soon as $\zeta_r(P_S,P_T)\ge 2^{(5r/2)-5}$. This condition holds for any $r$ in $]2,3]$ if $\zeta_r(P_S,P_T)\ge 4\sqrt{2}$. Then, from the above inequalities,
$$
\zeta_r(P_S,P_T) \le \zeta_r(P_S*G_1, P_T*G_1) + \frac{1}{2}\zeta_r(P_S,P_T) + 1,
$$
which implies Lemma 5.1.
We go back to the proof of Theorem 2.1. Let $n\in]2^N, 2^{N+1}]$ and $\ell = n - 2^N$. The main step is then to prove the inequalities below: for $r\ge p-2$ and $(r,p)\neq(1,3)$, for some $\varepsilon(N)$ tending to zero as $N$ tends to infinity,
$$
\zeta_r(P_{S_n}, G_n) \le c_{r,p}\,2^{N(r-p)/2}\zeta_p(P_{S_\ell}, G_\ell) + C\Big(2^{N(r+2-p)/2} + 2^{N((r-p)/2+2/p)}\varepsilon(N)\big(\zeta_p(P_{S_\ell}, G_\ell)\big)^{(p-2)/p}\Big), \tag{5.15}
$$
and, for $r = 1$ and $p = 3$, $\zeta_1(P_{S_n}, G_n) \le C(N + 2^{-N}\zeta\cdots)$