ELECTRONIC COMMUNICATIONS in PROBABILITY
RANDOM WALKS WITH k-WISE INDEPENDENT INCREMENTS
ITAI BENJAMINI
Department of Mathematics, The Weizmann Institute of Science, Rehovot 76100, Israel email: itai@wisdom.weizmann.ac.il
GADY KOZMA
Department of Mathematics, The Weizmann Institute of Science, Rehovot 76100, Israel email: gady@ias.edu
DAN ROMIK
Department of Statistics, 367 Evans Hall, University of California, Berkeley, CA 94720-3860, USA
email: romik@stat.berkeley.edu
Submitted July 13, 2005, accepted in final form May 18, 2006.
AMS 2000 Subject classification: 60G50, 60K99.
Keywords: Random walk, pseudo-randomness, quasi-randomness, pairwise independence.
Abstract
We construct examples of a random walk with pairwise-independent steps which is almost surely bounded, and, for any $m$ and $k$, a random walk with $k$-wise independent steps which has no stationary distribution modulo $m$.
1 Introduction
Consider a simulation of a simple random walk on a graph. How will the simulation be affected if the source of randomness is not truly random but only pseudo-random in the following specific sense: the random bits are $k$-wise independent for some $k > 1$, and not independent as a family? The first question to ask is: does the pseudo walk converge to the same stationary measure? The simplest graph to consider might be a cycle of size $m$. This suggests the following problem: given a random walk $S_n = \sum_{i=1}^n X_i$, where the $X_i$ are random signs, plus or minus 1 with equal probability, and the $X_i$'s are $k$-wise independent, what can be said about the behavior of the partial sums, and in particular modulo some fixed number $m$? It turns out that there is a fundamental difference between the cases $k \le 3$ and $k > 3$.
Examine first the case $k = 2$. For this case we shall show
Theorem 1. There exists a sequence of random variables $\{X_i\}_{i=1}^\infty$ taking values $\pm 1$, pairwise independent, such that $S_n$ is bounded almost surely.
This result should be contrasted against the fact that we know that $\mathbb{E}S_n = 0$ and $\mathbb{E}S_n^2 = n$, just like in the completely independent case. In other words, $S_n$ for large $n$ must have extreme "fat tail" behavior. Naturally, $M := \max_n S_n$ satisfies $\mathbb{E}M = \infty$. The example we will demonstrate is essentially the Walsh system, which is pairwise independent. We will discuss this in section 2.
Such behavior cannot occur when $k \ge 4$ for the simple reason that in this case we know that $\mathbb{E}S_n^4 = 3n^2 - 2n$ and this gives
$$\mathbb{P}(S_n^2 > M) \ \ge\ \frac{(\mathbb{E}S_n^2 - M)^2}{\mathbb{E}S_n^4} \ \to\ \frac{1}{3} \qquad \forall M.$$
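To spell out where these numbers come from (a routine computation, not spelled out in the text): expanding the fourth moment and using 4-wise independence,
$$\mathbb{E}S_n^4 \;=\; \sum_{i,j,k,l=1}^n \mathbb{E}[X_iX_jX_kX_l] \;=\; n + 3n(n-1) \;=\; 3n^2 - 2n,$$
since only the terms with $i=j=k=l$ (there are $n$ of them) and the terms splitting into two distinct pairs (there are $3n(n-1)$ of them) survive; every other term contains some $X_i$ raised to an odd power and vanishes because $\mathbb{E}X_i = \mathbb{E}X_i^3 = 0$. Applying the Paley–Zygmund inequality to $Z = S_n^2$, for any fixed $M \le n$,
$$\mathbb{P}(S_n^2 > M) \;\ge\; \frac{(\mathbb{E}S_n^2 - M)^2}{\mathbb{E}S_n^4} \;=\; \frac{(n-M)^2}{3n^2-2n} \;\longrightarrow\; \frac13 .$$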
The case $k = 3$ was explained to us by Ori Gurel-Gurevich and Ron Peled. Simply multiplying the sequence we will construct in section 2 by an independent random $\pm 1$ will render it 3-wise independent and of course will retain its boundedness.
The higher $k$ is, the more moments we know, and we can approximate the large scale shape of $S_n$. However, this does not necessarily mean we can say something about $S_n \bmod m$. And indeed, in section 3 we show the following
Theorem 2. Let $m$ and $k$ be some natural numbers, and let $\epsilon > 0$. Then there exists a sequence of random variables $\{X_i\}_{i=1}^\infty$ taking values $\pm 1$, $k$-wise independent, and a sequence $I_j$ such that
$$\mathbb{P}\big(S_{I_j} \equiv 0 \ (\mathrm{mod}\ m)\big) \ > \ 1 - \epsilon \qquad \forall j. \tag{1}$$
Note that the requirement that the condition holds only for some $I_j$ is unavoidable, since, say, if $k \ge 10m^2$ then the distribution of $S_{I_j + 10m^2}$ is approximately uniform, since $X_{I_j+1}, \ldots, X_{I_j+10m^2}$ are independent.
Explicit constructions of $k$-wise independent 0-1 probability spaces and estimates on their sizes are the focus of several papers in combinatorics and complexity; see e.g. [3] for the first construction and [1] for a recent discussion with additional references. Sums of pairwise independent random variables were extensively studied; see e.g. [2] for some interesting examples. Thus there are already many interesting examples of pairwise independent processes, but it seems behavior modulo $m$ was not studied.
2 Pairwise independent processes
We term the construction we use a "gray walk", as it is based on the well-known Gray code construction in combinatorics. The $n$th Gray code is a Hamiltonian path on the discrete $n$-cube, or a listing of all $2^n$ binary strings of length $n$, starting with the string $00\ldots 0$, where every string appears exactly once, and any two consecutive strings differ in exactly one place. The construction (and hence also the proof that this is possible) is done recursively, namely: to construct the $n$th Gray code, write down two copies of the $(n-1)$th code, where the order of the strings in the second copy is reversed, and add to the first $2^{n-1}$ strings a zero at the $n$th place, and to the second $2^{n-1}$ strings a one at the $n$th place.
The infinite Gray code is constructed analogously and lists all infinite binary strings which end with an infinite sequence of zeros. The first few strings in the infinite code are:
A_0 = 000000...
A_1 = 100000...
A_2 = 110000...
A_3 = 010000...
A_4 = 011000...
A_5 = 111000...
A_6 = 101000...
A_7 = 001000...
A_8 = 001100...
...
To construct the gray walk, we now consider each string $A_i$ as specifying a finite subset (also denoted $A_i$) of the natural numbers $\mathbb{N}$, where a 1 in the $j$th place signifies that $j \in A_i$, and define
$$X_i = \prod_{j \in A_i} \xi_j$$
where $\xi_1, \xi_2, \ldots$ is a seed sequence of independent $\pm 1$ variables. It is easy to verify that the $X_i$'s are pairwise independent. Therefore to finish theorem 1 we only need
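The verification takes one line (a sketch, not in the original text): for $i \ne j$, both $\ge 1$, the sets $A_i, A_j$ are distinct and nonempty, so the symmetric difference $A_i \,\triangle\, A_j$ is nonempty and, using $\xi_l^2 = 1$ on $A_i \cap A_j$,
$$\mathbb{E}[X_iX_j] = \mathbb{E}\Big[\prod_{l \in A_i \triangle A_j} \xi_l\Big] = \prod_{l \in A_i \triangle A_j} \mathbb{E}\xi_l = 0 .$$
Since each $X_i$ is itself a product over a nonempty set, $\mathbb{E}X_i = 0$ as well, and for $\pm 1$-valued variables
$$\mathbb{P}(X_i = \delta,\, X_j = \delta') = \tfrac14\big(1 + \delta\,\mathbb{E}X_i + \delta'\,\mathbb{E}X_j + \delta\delta'\,\mathbb{E}[X_iX_j]\big) = \tfrac14 ,$$
which is exactly pairwise independence.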
Proposition. The gray walk $S_n$ is bounded almost surely. More precisely,
$$\sup_n |S_n| \ = \ 2^{\min\{j \ge 1 \,:\, \xi_j = -1\} - 1} + 1 \ = \ \sup_n S_n + 2 \ = \ -\inf_n S_n .$$
Proof. For simplicity we add the element $X_0 \equiv 1$, and prove that for $S'_n = \sum_{i=0}^n X_i$ we have $\sup S'_n = -\inf S'_n = 2^{\min\{j \ge 1 : \xi_j = -1\} - 1}$. The recursive definition of the Gray code is reflected in the path: assume we have defined the first steps $S'_0, S'_1, S'_2, \ldots, S'_{2^{j-1}-1}$ of the walk; then the next $2^{j-1}$ steps will be the previous steps listed in reverse order, and multiplied by the random sign $\xi_j$. This implies that the path up to time $2^j - 1$, where $j$ is the first value for which $\xi_j = -1$, is $0, 1, 2, 3, 4, \ldots, 2^{j-1}, 2^{j-1}-1, 2^{j-1}-2, \ldots, 3, 2, 1, 0$, and all the subsequent values are bounded between $-2^{j-1}$ and $2^{j-1}$.
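As a sanity check, the construction is easy to simulate: the $i$th Gray codeword is $i \oplus \lfloor i/2 \rfloor$, and its set bits give $A_i$. The sketch below (function and variable names are ours) verifies exhaustively, over all length-4 seeds, both exact pairwise independence and the bound $\max_n |S_n| \le 2^{T-1} + 1$ with $T = \min\{j \ge 1 : \xi_j = -1\}$:

```python
from itertools import product

def gray_subset(i):
    """A_i: set bits of the i-th Gray codeword i ^ (i >> 1); bit j-1 set means j is in A_i."""
    g = i ^ (i >> 1)
    return [j + 1 for j in range(g.bit_length()) if (g >> j) & 1]

def gray_steps(xi):
    """The increments X_1, ..., X_{2^m - 1} driven by the seed signs xi = (xi_1, ..., xi_m)."""
    out = []
    for i in range(1, 2 ** len(xi)):
        x = 1
        for j in gray_subset(i):
            x *= xi[j - 1]
        out.append(x)
    return out

def partial_sums(steps):
    s, out = 0, []
    for x in steps:
        s += x
        out.append(s)
    return out

m = 4
seeds = list(product((-1, 1), repeat=m))          # all 2^4 equally likely seed sequences
table = {xi: gray_steps(xi) for xi in seeds}
n = 2 ** m - 1

# Exact pairwise independence: P(X_a = 1, X_b = 1) = 1/4 for every pair a != b.
pairwise_ok = all(
    sum(table[xi][a] == 1 and table[xi][b] == 1 for xi in seeds) * 4 == len(seeds)
    for a in range(n) for b in range(a + 1, n)
)

# Boundedness: max |S_n| <= 2^(T-1) + 1, where T is the first j with xi_j = -1.
bounded_ok = all(
    max(abs(v) for v in partial_sums(table[xi])) <= 2 ** xi.index(-1) + 1
    for xi in seeds if -1 in xi
)
```

Here `2 ** xi.index(-1)` is $2^{T-1}$, since Python indexes the seed from 0.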
Remark. Theorem 1 holds also if one takes the strings $A_i$ in lexicographic order, which is simpler and is also the standard ordering of the Walsh system (which consists of all finite products $\prod_{j \in J} r_j$, where $(r_j)_{j=1}^\infty$ are the Rademacher functions). However, we believe the gray walk is interesting in its own right. For example, it satisfies $X_{i+1}/X_i = \xi_{j(i)}$, i.e. the seed sequence is exposed directly in the quotients of the $X_i$'s.
On the other hand, there is a vast body of knowledge about the behavior of "weighted walks" with the lexicographical order, which is simply the question of when
$$\sum_{i=0}^\infty c_i W_i$$
converges, where $W_i$ is the Walsh system. The general philosophy is that it is similar to the behavior of the Fourier system $\{e^{inx}\}_{n=-\infty}^\infty$, with the exception of the symmetry of the Fourier system, which has no equivalent in the Walsh system. Of course, this is very different from the simple behavior of independent variables. See [5] for a survey, or the book [4].
3 Proof of theorem 2
new variables by conditioning the $\xi_i$'s:
is actually satisfied combinatorially and not only with high probability.
How can we generalize this to $m \ne 4$? The remaining ingredient in the proof is based on the following well known fact:

It is possible to construct variables with probabilities 1/3, 1/3, 1/3 given a sequence of independent variables with probabilities 1/2, 1/2.

The algorithm is simple: take two such variables; if they give $1,1$, "output" 1, if they give $1,-1$, "output" 2, and if they give $-1,1$, "output" 3. If they give $-1,-1$, take a new set of two variables and repeat the process. The output is 1, 2 or 3, each with probability 1/3. The time required for each step is unbounded, but the probability that it is large is exponentially small. The proof below combines these two ideas, which we nickname "generating hidden symmetries" and "simulating uniform variables", to get the result.
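This is ordinary rejection sampling; a minimal sketch (function names are ours):

```python
import random

def uniform_three(coin):
    """Return a uniform value in {1, 2, 3} given coin(), a fair +-1 flip.
    The outcome (-1, -1) is rejected and a fresh pair is drawn."""
    while True:
        pair = (coin(), coin())
        if pair == (1, 1):
            return 1
        if pair == (1, -1):
            return 2
        if pair == (-1, 1):
            return 3
        # pair == (-1, -1): reject; each round fails with probability 1/4,
        # so the number of rounds has exponentially small tails.

rng = random.Random(0)
counts = {1: 0, 2: 0, 3: 0}
for _ in range(30000):
    counts[uniform_three(lambda: rng.choice((-1, 1)))] += 1
```

Each of the three counts should be close to 10000.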
Proof of theorem 2, general $m$. We may assume without loss of generality that $m$ is even (otherwise take $2m$ instead). Let $\lambda = \lambda(k, \epsilon)$ be some sufficiently large parameter which we shall fix later. Let $L := 2(k+1)\lceil \lambda m^2 \rceil$, where $\lceil \cdot \rceil$ stands, as usual, for the upper integer value.

Lemma. For every $\mu = 0, 2, \ldots, m-2$ there exist variables $L_\mu$ and $Y_\mu$ with the following
$\{y_i^\mu\}_{i=1}^\infty$ is a sequence of $k$-wise independent variables taking values $\pm 1$ with probabilities $1/2$.
$m$, and trim the resulting sets a little so that they all have the same size. Precisely, let
$$A := \min$$
We note that the distribution of $\sum^N$
and let $S := \{\pm 1\}^N \setminus \bigcup_j G_j$.
So far we have constructed some general objects having very little to do with probability. Next, let $\{\xi_i\}_{i=1}^\infty$ be i.i.d. variables taking values in $\pm 1$, and let $\{\Xi_i\}_{i=1}^\infty$ be a division of the $\xi_i$'s into
Properties 3 and 4 are obvious from the construction. Property 1 is a direct consequence of the estimate $\mathbb{P}(\Xi_i \in S) \le Ce^{-c\lambda}$, if only $\lambda$ is large enough. Define $\lambda$ so as to satisfy this condition. Therefore we need only prove property 2.
Let therefore $i_1 < \cdots < i_k$ and $\delta_1, \ldots, \delta_k \in \{\pm 1\}$, and examine the events
The point of the proof is that the event $Y$ is independent of $X \cap S$. This is because $X$ depends only on $k$ places, but $Y$ depends on $k+1$ $\Xi_i$'s, each of which is distributed uniformly. In other words
Returning to the proof of the theorem, we let
$$\{Y_{\mu,\nu}, L_{\mu,\nu}\}_{(\mu,\nu) \in \{0,2,\ldots,m-2\} \times \mathbb{N}}$$
be an independent family of couples of variables constructed using the lemma. Next, we fix some arbitrary number $\mu_0 \in \{0, 2, \ldots, m-2\}$ (in the final construction we will set $\mu_0 = 0$, but for now we want to keep its value free). We now construct our sequence $X_i = X_i(\mu_0)$ inductively as follows: assume that at the $\nu$th step we have defined $X_1, \ldots, X_\rho$, where $\rho = \rho(\nu)$ is random. Define $\mu = \mu(\nu) \equiv \mu_0 - \sum_{i=1}^\rho X_i \bmod m$ and then define $X_{\rho+1}, \ldots, X_{\rho+L_{\mu,\nu}}$ using
and $\rho(\nu+1) = \rho(\nu) + L_{\mu,\nu}$. This creates a sequence of $\pm 1$ variables. Now set $I_j = jL$. We have therefore constructed all objects required for the theorem.
First we wish to see that (1) holds when we set $\mu_0 = 0$. We define $r(j) = \max\{\nu : \rho(\nu) < jL\}$, and properties 1 and 4 of the lemma will give that
$$\mathbb{P}$$
Therefore we need only show that the $X_i$'s are $k$-wise independent. While being strongly related to the $k$-wise independence of the $y_i^{\mu,\nu}$, it is not an immediate consequence of it, because of the dependence between the $y_i^{\mu,\nu}$ and the $L_{\mu,\nu}$'s.
We prove that the $X_i$'s are $k$-wise independent inductively. Let $l \le k$ and $I$ be some integers, and assume that we have shown that $X_{i_1}, \ldots, X_{i_n}$ are independent for every $\mu_0$, every $n < l$, and for $n = l$ and $i_1 < I$. Let $I = i_1 < i_2 < \cdots < i_l$ and let $\delta_1, \ldots, \delta_l \in \{\pm 1\}$. Define
$$L = L_{\mu_0,1}, \qquad q = q(L) = \#\{n : i_n \le L\}$$
and we want to see that
$$X_{i_{q+1}}, \ldots, X_{i_l} \mid L, X_{i_1}, \ldots, X_{i_q} \ \text{ are } \ l - q \ \text{ i.i.d. } \pm 1 \text{ variables.} \tag{2}$$
and summing over the possible values of $\mu'_0$ and noting that $X_{i_1}, \ldots, X_{i_q}$ can affect the rest
and below $q$ we have simple equality:
$$X_{i_n} = y_{i_n} \quad \forall n \le q. \tag{4}$$
and summing over $L$ we get the required result:
$$\mathbb{P}(X_{i_1} = \delta_1, \ldots, X_{i_l} = \delta_l) = \mathbb{P}(y_{i_1} = \delta_1, \ldots, y_{i_l} = \delta_l) = 2^{-l}.$$
Remarks.
1. It is easy to generalize the theorem to force $\sum_{i=1}^{I_j} X_i$ to have almost arbitrary distributions. The only obstacle is the even-odd symmetry of the walk. Thus if $m$ is odd, any distribution whatsoever on the congruence classes modulo $m$ might be achieved, while if $m$ is even, any distribution supported on the set of even or odd congruence classes can be achieved.
2. It is easy to see from the proof that $L \approx km^2 \log(k/\epsilon)$. In other words, the precision is asymptotically exponential in the stepping of the subsequence $I_j$. We remind the reader of a fact that was discussed in the introduction: if $k > m^2 \log 1/\epsilon$ then necessarily $I_{j+1} - I_j > cm^2 \log 1/\epsilon$, because immediately after $I_j$ we get a sequence of independent variables.
3. Another direction in which $k$-wise independent random variables may be very different from truly independent variables is their entropy. It is known that the entropy of $n$ such variables may be as low as $k \log n$; see [3] and [1]. Our pairwise example is optimal in this respect: $n$ truly random bits create $2^n$ pairwise independent random variables. The example above is far from it: the entropy is linear in the number of bits. However, it turns out that it is possible to create an example of $k$-wise independent variables with low entropy satisfying the conditions of theorem 2. The example, while amusing, is a bit off topic, hence we will skip it.
Theorem 3. Let $m$ and $k$ be some natural numbers. Then there exists a sequence of random variables $\{\epsilon_i\}_{i=1}^\infty$ taking values $\pm 1$, $k$-wise independent, and a sequence $I_j$ such that
$$\sum_{i=1}^{I_j} \epsilon_i \equiv 0 \ (\mathrm{mod}\ m) \quad \text{for } j \text{ sufficiently large, with probability } 1.$$
The proof is similar to that of theorem 2, but using a different $L$ in each step. However, there are some additional technical subtleties. Essentially, if in the main lemma of the previous theorem we defined a variable $Y_\mu$, where the parameter $\mu$ was used to "return to sync" the series if ever the congruence is no longer 0, here we would need variables $Y_{\mu,\tau}$, where $\tau$ is some additional parameter needed to "return to sync" in the $L$ domain. While being only moderately more complicated, we felt it better to present the simpler proof of the previous theorem.
4 Further related problems
We end by suggesting further problems related to k-wise independent random variables.
4.1 The random sign Pascal triangle
Let $\xi^+_{n,k}, \xi^-_{n,k}$ be a family of i.i.d. random ($1/2$-$1/2$) signs for $-1 \le k \le n+1$. Define random variables $X_{n,k}$ for $-1 \le k \le n+1$ by the recurrence
$$X_{0,0} = 1, \qquad X_{n,-1} = 0, \qquad X_{n,n+1} = 0,$$
$$X_{n,k} = \xi^+_{n-1,k-1} X_{n-1,k-1} + \xi^-_{n-1,k} X_{n-1,k} \qquad (0 \le k \le n).$$
Problems: Study the behavior of the central coefficient $X_{2n,n}$. Find an interpretation of
4.2 Two dimensional percolation
What about other functions of $k$-wise independent random variables? For example, consider Bernoulli percolation where the random variables determining the configuration are only $k$-wise independent. Here is an "unimpressive" example: $k$-wise independent bond percolation on an $n \times n(k+1)$ box in $\mathbb{Z}^2$ such that the probability to have a crossing of the narrow side is 1. Of course, even in usual percolation this probability¹ is $1 - e^{-ck}$, so the improvement is not exciting.

The construction is as follows: divide into $k+1$ disjoint boxes of size $n \times n$. Take the usual independent percolation, conditioned such that the number of boxes with crossings is odd. This is like conditioning $k+1$ Bernoulli variables to have an odd sum, so that you get a $k$-wise independent structure. If the number is odd, then it is nonzero. Can it be improved, say to guarantee crossing of the $n \times n$ box?
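The parity trick at the heart of this example can be checked exhaustively (a sketch; names are ours): projecting the odd-parity space onto any $k$ of the $k+1$ coordinates is a bijection onto $\{0,1\}^k$, which is exactly $k$-wise independence, while the full $(k+1)$-tuple is clearly dependent.

```python
from itertools import combinations, product

def odd_parity_space(k):
    """All 0/1 vectors of length k+1 with odd sum, each equally likely under
    the conditioned measure (there are exactly 2^k of them)."""
    return [v for v in product((0, 1), repeat=k + 1) if sum(v) % 2 == 1]

k = 4
space = odd_parity_space(k)

# Any k of the k+1 coordinates take every value in {0,1}^k exactly once,
# i.e. they are k i.i.d. fair bits...
kwise_ok = all(
    sorted(tuple(v[c] for c in coords) for v in space)
    == sorted(product((0, 1), repeat=k))
    for coords in combinations(range(k + 1), k)
)

# ...but the full (k+1)-tuple is constrained: its parity is always odd,
# so the family is not (k+1)-wise independent.
full_dependent = all(sum(v) % 2 == 1 for v in space)
```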
References
[1] N. Alon, J. Spencer. The Probabilistic Method, second edition. Wiley-Interscience Series in Discrete Mathematics and Optimization. Wiley-Interscience, New York, 2000. MR1885388
[2] S. Janson. Some pairwise independent sequences for which the central limit theorem fails. Stochastics 23 (1988), 439-448. MR943814
[3] A. Joffe. On a set of almost deterministic $k$-independent random variables. Ann. Probab. 2 (1974), 161-162. MR356150
[4] F. Schipp, W. R. Wade, P. Simon (with assistance by J. Pál). Walsh Series: An Introduction to Dyadic Harmonic Analysis. Adam Hilger Ltd., Bristol and New York, 1990. MR1117682
[5] W. R. Wade. Dyadic harmonic analysis. In Contemporary Mathematics 208, Harmonic Analysis and Nonlinear Differential Equations (Riverside, CA, 1995), 313-350. MR1467014
¹This is obvious if you are willing to use the conformal invariance of percolation. Otherwise, this fact can