getdoce1f0. 213KB Jun 04 2011 12:05:06 AM

(1)

El e c t ro n ic

Jo ur

n a l o

f P

r o

b a b i_{l i t y}

Vol. 14 (2009), Paper no. 13, pages 365–384. Journal URL

http://www.math.washington.edu/~ejpecp/

Small counts in the infinite occupancy scheme

∗

A. D. Barbour

†

and A. V. Gnedin

‡

University of Zürich and Utrecht University

Abstract

The paper is concerned with the classical occupancy scheme with infinitely many boxes, in whichn balls are thrown independently into boxes 1, 2, . . ., with probability pj of hitting the

box j, wherep1≥ p2≥. . .>0 and

P_∞

j=1pj =1. We establish joint normal approximation as n_{→ ∞}for the numbers of boxes containingr1,r2, . . . ,rmballs, standardized in the natural way,

assuming only that the variances of these counts all tend to infinity. The proof of this approx-imation is based on a de-Poissonization lemma. We then review sufficient conditions for the variances to tend to infinity. Typically, the normal approximation does not mean convergence. We show that the convergence of the full vector ofr-counts only holds under a condition of reg-ular variation, thus giving a complete characterization of possible limit correlation structures.

Key words:occupancy problem, normal approximation, poissonization, regular variation. AMS 2000 Subject Classification:Primary 60F05, 60C05.

Submitted to EJP on September 25, 2008, final version accepted January 14, 2009.

∗_{A.D. Barbour gratefully acknowledges financial support from Schweizerischer Nationalfonds Projekt Nr. 20-117625}_/_1. †_{Angewandte Mathematik, Winterthurerstrasse 190, CH–8057 Zürich, Switzerland:}_{[email protected]} ‡_{Department of Mathematics,} _{Utrecht University,} _{PO Box 80010,} _{3508 TA Utrecht,} _{The Netherlands:}

(2)

1 Introduction

In the classical occupancy scheme with infinitely many boxes, balls are thrown independently into boxes 1, 2, . . ., with probability p_j of hitting the box j, where p₁ ≥ p₂ ≥ . . .> 0 andP∞_j₌₁p_j =1. The most studied quantity is the number of boxes K_n occupied by at least one out of the first n

balls thrown. It is known that for large n the law of Kn is asymptotically normal, provided that

Var[K_n]_{→ ∞}; see[6; 7]for references and a survey of this and related results. In this paper, we investigate the behaviour of the quantitiesX_n_,_r, the numbers of boxes hit by exactly r out of then

balls,r≥1.

Under a condition of regular variation, a multivariate CLT for the Xn,r’s was proved by Karlin[8].

Mikhailov[12]also studied theXn,r’s, but in a situation where the pj’s vary withn. In this paper,

we establish joint normal approximation as n_{→ ∞} for the variables X_n_,_r1, . . . ,X_n_,_r

m, centred and

normalized, assuming only that lim_n_→∞VarX_n_,_r

i =∞for eachi. We also give examples to show that

this condition is not enough to ensureconvergence, since the correlation matrices need not converge asn_{→ ∞}. The asymptotic behaviour of the moments of theX_n_,_r is thus of key importance, and we discuss this under a number of simplifying assumptions.

The behaviour of these moments, as also of those ofK_n=P∞_r₌₁X_n_,_r, depends on the way in which the frequencies p_j decay to 0. In the case of power-like decay, p_j _∼ c j−1/α with 0 < α <1, it is known that, for each fixedk, the momentsE_Xk

n,rhave the same order of growth withnfor everyr,

and this is the same order of growth as that ofE_Kk

n; moreover, the limit distributions of Kn and of

Xn:= (Xn,1,Xn,2, . . .)are normal[6; 8]. In contrast, for a sequence of geometric frequenciespj =cqj

(0< q <1), there is no way to scale the X_n,r’s to obtain a nontrivial limit distribution [10], and

the moments ofK_n have oscillatory asymptotics. In a more general setting such that thep_j’s have exponential decay, the oscillatory behaviour of Var[K_n]is typical [3]. The spectrum of interesting possibilities is, however, much wider: for instance, frequenciespj ∼ce−j

β

, with 0< β <1, exhibit a decay intermediate between power and exponential.

Karlin’s [8] multivariate CLT for X_n applies when the index of regular variation is in the range 0 < α < 1. We complement this by the analysis of the cases α = 0 and α = 1, showing that for each α _∈ [0, 1] there is exactly one possible normal limit. Finally, we prove that these one-parameter normal laws are the only possible limits of naturally scaled and centredX_n. Specifically, we show that a regular variation condition holds if VarX_n_,_r _{→ ∞}for allr and if all the correlations {Corr(X_n_,_r,Xn,s), r,s≥1}converge.

2 Poissonization

As in much previous work, we shall rely on a closely related occupancy scheme, in which the balls are thrown into the boxes at the times of a unit Poisson process. The advantage of this model is that, for every t >0, the processes (N_j(t), t ≥0), counting the numbers of balls in boxes j= 1, 2, . . ., are independent. LetY_r(t)be the number of boxes occupied by exactlyr balls at time t. In view of the representation

Yr(t) =

∞

X

j=1

(3)

with independent Bernoulli terms, it follows that

Y_r′(t) := (Y_r(t)₋E[Y_r(t)])/pVar[Y_r(t)] _→_d _N(0, 1) as t→ ∞ (2.2)

if and only if Var[Y_r(t)] _{→ ∞}. This suggests that normal approximation can be approached most easily through theY_r(t), provided that the de-Poissonization can be accomplished. We now show that this is indeed the case.

LetL(_·)denote the probability law of a random element,d_TVthe distance in total variation.

Lemma 2.1.

For any m,k_∈N_{satisfying m}_≤ 1

2npk, we have

d_TV(_L(X_n_,1, . . . ,X_n_,_m),_L(Y₁(n), . . . ,Y_m(n))) _≤ π_k+2ke−npk/10_,

whereπ_j:=P∞_i₌_j₊₁pi.

Proof.We begin by noting that, in parallel to (2.1),

X_n_,_r:=

∞

X

j=1

1[M_n_,_j=r], (2.3)

where M_n_,_j represents the number of balls out of the firstnthrown that fall into box j. Our proof uses lower truncation of the sums (2.1) and (2.3) that defineYr(n)andXn,r.

Since Mn,j ∼Binomial(n,pj), it follows from the Chernoff inequalities[5]that, if m≤ 1₂npk, then

for j_≤k

P_[M_n_,_j_≤_m] _≤ P_[M_n_,_j_≤1

2npj] ≤ exp{−npj/10} ≤ exp{−npk/10},

since thepjare decreasing, andm≤ 1₂npk; and the same bound holds also forNj(n)∼Poisson(npj).

Hence, defining

X_n_,_k_,_r:=

∞

X

j=k+1

1[M_n_,_j=r], Y_k_,_r(t):=

∞

X

j=k+1

1[N_j(t) =r],

it follows that

d_TV(_L(X_n_,1, . . . ,X_n_,_m),_L(X_n_,_k_,1, . . . ,X_n_,_k_,_m)) _≤ ke−npk/10_; _(2.4)

d_TV(_L(Y₁(t), . . . ,Y_m(t)),_L(Y_k_,1(t), . . . ,Y_k_,_m(t))) _≤ ke−t pk/10_. _(2.5)

But now, from an inequality of Le Cam[4]and Michel[11], we have

d_TV(_L(N_j(n), j_≥k+1),_L(M_n_,_j, j_≥k+1)) _≤ π_k, (2.6)

and theX_n_,_k_,_rare functions of_{M_n_,_j, j_≥k+1_}, the Y_k_,_r(n)of_{N_j(n), j_≥k+1_}. The lemma now

follows from (2.4),(2.5) and (2.6).

Proposition 2.2.

Let k(n)be any sequence satisfying

k(n)_{→ ∞} and k(n)e−npk(n)/10_→_0.

Then, for any sequence m(n)satisfying m(n)_≤ 1

2npk(n)for each n, it follows that

(4)

Proof.Sincem(n)_≤ 1₂np_k₍_n₎for eachn, it follows that Lemma 2.1 can be applied for eachn. Since

k(n)_{→ ∞}, it follows that π_k₍_n₎_→0, so that the first element in its bound converges to zero; the

second converges to zero also, by assumption.

Remark.Such sequencesk(n)always exist. For instance, one can take

k(n) =max_{k: 20 logk/p_k_≤n_}.

For this choice, it is immediate that k(n)_{→ ∞}, and that np_k(n) ≥20 logk(n) → ∞, entailing also thatk(n)e−npk(n)/10_≤₁_/k(n)_→_{0. Hence there are always sequences}_m(n)_{→ ∞}_{for which (2.7) is}

satisfied.

Hence, in particular, any approximation to the distribution of a finite subset of the components ofY(n) = (Y₁(n),Y₂(n), . . .)(suitably scaled) remains valid for the corresponding components ofX_n, at the cost of introducing an extra, asymptotically negligible, error in total variation of at most

π_k₍_n₎+2k(n)e−npk(n)/10_, _(2.8)

wherek(n)is any sequence satisfying the conditions of Proposition 2.2.

3 Normal approximation

As noted above, the distribution ofY_r(t)is asymptotically normal as t _{→ ∞}whenever VarY_r(t)_→

∞. Here, we consider the joint normal approximation of any finite set of counts Yr1(t), . . . ,Yrm(t)

such thatr_i ≥1 and lim_t_→∞VarY_r_i(t) =_∞for each 1≤ i≤m. We measure the closeness of two probability measures PandQ onRm _{in terms of differences between the probabilities assigned to}

arbitrary convex sets:

d_c(P,Q) := sup

A∈C|

P(A)₋Q(A)_|,

whereC denotes the class of convex subsets ofRm_{. Let}

Φ_r(t) := E_Y_r₍_t)_, _V_r_(t₎ _:₌ _Var_Y_r_(t₎_, _C_rs_(t₎ _:₌ _Cov_(Y_r₍_t)_,_Y_s_(t))

denote the moments of theY_r(t), and let

Σ_rs(t) := C_rs(t)/pV_r(t)V_s(t) = Cov(Y_r′(t),Y_s′(t))

denote the covariance matrix of the standardized random variablesY_r′(t)as in (2.2).

Now the random vector (Y_r′

1(t), . . . ,Y

′

rm(t)) is a sum of independent mean zero random vectors (Y_l′_,_r1(t), . . . ,Y_l′_,_r

m(t)),l≥1, whereY ′

l,r(t):= (1[Nl(t) =r]−pl,r(t))/

p

Vr(t), and

pl,r(t):=P[Nl(t) =r] =e−t pl

(t p_l)r

r! . (3.1)

A theorem of Bentkus[1, Thm. 1.1]then shows that

dc(L(Yr′1(t), . . . ,Y

′

rm(t)), MVNm(0,ΣR(t))) ≤ C m

(5)

for an absolute constantC, where

result, we obtain the following theorem.

Theorem 3.1.

If limt→∞Vri(t) =∞for each1≤i≤m, where1≤r1<. . .<rm, then, as t and n

Proof. All that we need to do is to control the quantity β_t. This in turn involves bounding the smallest eigenvalue ofΣ_R(t)away from 0. Now direct calculation shows that, for any column vector

(6)

sinceV_r(t)_≤P_l_≥₁p_l_,_r(t). However, for eachl,pl,R(t)_≤1₋p_l_,0(t)₋P_j_>_r

statement of the theorem. The second follows in view of (2.8).

Thus multivariate normal approximation is always good if the variances of the (unstandardized) components Y_r(t) are large. However, convergence typically does not take place: see a series of examples in Proposition 4.4 below.

4 Moments

For normal approximation, in view of Theorem 3.1, we are particularly interested in conditions under whichV_r(t)_{→ ∞}.

For the moments we have the formulas

Φ_r(t) =

From (4.1) and (4.2) we obtain

(7)

It follows that

Vr(t)→ ∞ ⇐⇒ Φr(t)→ ∞;

hence, as long as only the convergence to infinity ofVr(t)is concerned, we can deal with the simpler

quantity Φ_r(t). This facilitates the proof of the following theorem, showing how the asymptotic behaviour ofV_r(t)for different values of ris structured.

Theorem 4.1.

The asymptotic behaviour of the quantities V_r(t)as t_{→ ∞}follows one of the following four regimes:

1. limt→∞Vr(t) =∞for all r≥1;

2. lim sup_t_→∞Vr(t) =∞for all r ≥1, and there exists an r0≥1such thatlim inft→∞Vr(t) =∞

for all1≤r≤r0, andlim inft→∞Vr(t)<∞for all r >r0;

3. lim sup_t_→∞Vr(t) =∞andlim inft→∞Vr(t)<∞for all r ≥1;

4. sup_tV_r(t)<_∞for all r_≥1.

Proof.ReplacingV_rwithΦ_r for the argument, the formula (4.1) yields

Φ_r(t) = X

j≥1

e−t pj(t pj)

r

r! ; Φs(t/2) =

X

j≥1

e−t pj/2(t pj/2)

s

s! .

Fors<r, the ratio of the individual terms is given by

e−t pj/2₍_{t p}

j/2)s/s!

e−t pj_{(t p}

j)r/r!

≥ min

y>0{e

y/2_y−(r−s) } r!

s!2s =

e

r−s

r−s r!

s!2r.

Hence, for alls<r,

Φ_s(t/2) _≥ Φ_r(t)

e

r₋s

r−s r!

s!2r. (4.4)

It now follows that if, for somer, lim_t_→∞V_r(t) =_∞, then lim_t_→∞V_s(t) =_∞for all 1_≤s_≤r also; and that, if sup_tV_r(t)<_∞ for some r, then sup_tV_s(t)< _∞for alls> r. Hence, to complete the proof, we just need to show that, if sup_tVr(t)<∞for somer≥1, then suptV1(t)<∞.

For this last part, writeΦ_r(t) =Lr(t) +Rr(t), where

L_r(t) := X

j:t pj≥1

e−t pj_{(t p}

j)r/r! ; Rr(t) :=

X

j:t pj<1

e−t pj_{(t p}

j)r/r! . (4.5)

Suppose that sup_tΦ_r(t) =K<_∞. Then, for every t>0,

L₁(t) _≤ r!L_r(t) _≤ r!Φ_r(t) _≤ K r! . (4.6)

It thus remains to boundR₁(t), which in turn can be reduced to finding a bound for

S(t):= X

(8)

Leta₀_≥a₁_≥. . ._≥0 be any decreasing sequence such that a_j/a_j₊_h _≥2 holds for someh_≥1 and all j_≥1. Thena_ih+m≤am2−i for everyi≥0 and 0≤m<h. Splitting theaj’s intohsubsequences

that are dominated by the geometric series, we thus have

X

Now if, for someh_≥1, the frequenciesp_j satisfy

p_j/p_j₊_h_≥2 for all j_≥1, (4.7)

In particular, in Theorem 3.1, the quantity min1≤i≤m

p

Vri(t) can thus be replaced in the error

estimates byΦ_r_m(2t).

We now turn to finding conditions sufficient for distinguishing the asymptotic behaviour of theV_r(t). To do so, introduce the measures

ν_r(dx) =

Two special cases areν₀, a counting measure, andν₁, the probability distribution of a size-biased pick from thepj’s. Forr>0 write (4.1) as

Comparing with standard gamma integrals, it is then immediate that

lim inf

(9)

Lemma 4.2.

(a) sup_t_≥₀Φ_s(t)<_∞for all s_≥1if and only if, for some (and then for all) r_≥1, sup_jρ_j_,_r<_∞.

(b) If, for some r_≥1,lim_j_→∞ρ_j_,_r=_∞, then lim_t_→∞Φ_s(t) =_∞for all1_≤s_≤r.

Proof.Ifpj+1≤x <pj then

ρ_j_,_r = νr[0,pj+1]

pr_j =

ν_r[0,x]

pr_j <

ν_r[0,x]

xr ≤

ν_r[0,pj+1]

pr_j₊₁ = 1+ρj+1,r.

Hence (4.9) can be replaced by the inequalities

lim inf

j→∞ ρj,r ≤ lim inft→∞ Φr(t) ≤ lim supt→∞

Φ_r(t) _≤ 1+lim sup

j→∞

ρ_j_,_r. (4.10)

Part (b) of the lemma now follows directly from Theorem 4.1.

For part (a), much as for the last part of the proof of Theorem 4.1, define

h(j):=max{l ≥0: pj+l/pj≥1/2}; h∗:=sup j

h(j).

Then it is immediate that

2−rh(j) _≤ ρ_j_,_r _≤ h∗X

l≥1

2−(l−1) = 2h∗,

so thath∗<_∞if and only if sup_jρ_j_,_r<_∞for some, and then for all, r ≥1. We now conclude the proof by showing that sup_t_≥₀Φ_s(t)<_∞for alls_≥1 if and only ifh∗<_∞. DefiningL_r(t)andR_r(t)

as in (4.5), we observe that, ifh∗<_∞, then

R_r(t) _≤ h∗X

l≥1

2−r(l−1) _≤ 2h∗ and L_r(t) _≤ h∗X

l≥1

e−2l−1 2

l r

r! ,

so thatΦ_r(t) =Lr(t) +Rr(t)<∞for allr≥1. On the other hand,

Lr(1/pj+h(j)) ≥ e−2h(j)/r! ,

implying that, ifh∗=_∞, then lim sup_t_→∞Φ_r(t) =_∞for allr_≥1.

The familiar ratio test yields simpler sufficient conditions. Thus sup_tΦ_r(t)<_∞for allr_≥1 if

lim sup

j→∞

pj+1/pj < 1,

while lim_t_→∞Φ_r(t) =_∞for allr_≥1 if

lim

j→∞pj+1/pj = 1.

For instance, forpj =cqj, the geometric distribution with 0<q<1, we havepj+1/pj =q; hence

sup_tΦ_r(t) < _∞ for all r, and normal approximation is not adequate for any r. This illustrates possibility 4 in Theorem 4.1. For the Poisson distribution p_j =cλj/j! , we even have p_j₊₁/p_j _→0, and so normal approximation is no good here, either.

(10)

Lemma 4.3.

(a) Suppose for some0< λ <1

lim inf

j→∞

p_j₊_h

pj

> λ (4.11)

for every h≥1. ThenΦ_r(t)_{→ ∞}as t→ ∞for all r≥1.

(b) The conditionlim sup_t_→∞Φ_r(t) < _∞holds for some (hence for all) r _≥ 1if and only if there exists h≥1such that

lim sup

j→∞

p_j+h

p_j ≤

1

2. (4.12)

Proof.For part (a), assume thatν₀(λx,x) =#{j:λx <pj <x} → ∞as x →0. Then also

Φ_r(1/x)_≥ X

{j:λx<pj<x}

e−pj/x_(p

j/x)r/r!≥ν0(λx,x) min

{y:λ<y<1}[e

−y_yr_/r_!_]

→ ∞.

As x decreases, the piecewise-constant function ν₀(λx,x) may have downward jumps only at the values x ∈ {pj}, hence the assumption is equivalent toν0(λpj,pj)→ ∞(as j→ ∞), which in turn

is readily translated into (4.11).

For part (b), the same estimate with any 0< λ <1/2 shows that the condition (4.12) is necessary. In the other direction, suppose thatp_j+h/pj<3/4 for all j≥J. Split(pj,j≥J)intohsubsequences

(p_J₊_s₊_ih,i _≥ 0), with 0_≤ s _≤ h₋1. Each of the subsequences has the property that the ratio of any two consecutive elements is at most 3/4. Hence, as above, the sum of the termse−pjt₍_{t p}

j)r/r!

along a subsequence yields a uniformly bounded contribution toΦ_r.

Examples of irregular behaviour of moments may be constructed by breaking the sequence(p_j, j_≥

1)into finite blocks of sizes m₁,m₂, . . ., and setting the p_j’s within the i’th block all equal to some

qi. We use the notationV(t):=Var

P

r≥1Yr(t)

to denote the variance of the number of occupied boxes.

Example 1.[8, p. 384]. Takemi=iandqi=c2−2

i

, withca normalizing factor1to achieveP_jpj=

1. Then both V(t) and Φ₁(t) oscillate between 0 and _∞, approaching the extremes arbitrarily closely. This illustrates possibility 3 in Theorem 4.1.

Example 2. As in [3, Example 4.4], take m_i = 22i, q_i = c2−2i+1. Then Φ₁(t) _{→ ∞}, but Φ₂(t)

oscillates between 0 and_∞ast varies; thusY₁(t)is asymptotically normal, butY₂(t)is not, and the ratiospj+1/pj have accumulation points at 0 and 1. This illustrates possibility 2 in Theorem 4.1.

We now extend this example, showing among other things that one can have any value for r0 in behaviour 2 in Theorem 4.1.

Proposition 4.4.

Fix0< β <1andα >0, and take the blocks construction with mi =⌊2(1−β) −i

⌋, qi=cm−

(1+α)

i , where c is the appropriate normalizing constant. Then we have

1_{In fact, the Poisson sampling model makes sense for arbitrary}_p

j’s, and the enumeration of small counts makes sense

ifP

(11)

(i) lim sup_t_→∞V_r(t) = _∞for all r _≥1;

(ii) lim_t_→∞V_r(t) = _∞if and only if rβ(1+α)_≤1;

(iii) lim_j_→∞ρ_r_,_j=_∞if and only if rβ(1+α)<1;

(iv) The quantitiesΣ_rs(t)do not converge for any r6=s.

Proof.Once again, we work withΦ_r instead ofV_r, now writing

Φ_r(t) =X

for someK>0. It is easily checked that the minimum value of this sum forφ >1 goes to_∞with

l, hence, once again, lim_l_→∞inf_t_∈_J

lΦr(t) =∞.

Forrβ(1+α)>1, these two terms contribute an amount of order

φr_{m(_l1₊−₁β)e−φ+m1_l₊−₁rβ(1+α)}, (4.14)

(12)

asymptotically small asl_{→ ∞}. Hence, fort′_l=2q−_l 1logm_l₊₁, it follows that lim_l_→∞Φ_r(t_l′) =0, and

behaves asymptotically, asl becomes large, in the same way as for the Poisson occupancy scheme with a single block ofml boxes with equal frequenciesql. Computing the limit,

lim

wherem_l cancels because of the additivity of the moments. Asφ varies, this limit value varies too, and hence, forr6=s, the quantitiesΣ_rs(t)do not converge ast→ ∞.

It follows from parts (ii) and (iii) of Proposition 4.4 that the implication in part (b) of Lemma 4.2 cannot be reversed, and from part (iv) that the correlations between different components ofY(t)

need not converge, even when their variances tend to infinity. Hence the approximation in Theo-rem 3.1 does not necessarily imply convergence. Yet another kind of pathology appears whenY1(t) is asymptotically independent of(Y_r(t),r>1), as in the following example.

Example 3. Suppose that the frequencies in the block construction satisfy q_i =1/i!, m_i = (i₋2)! normal distribution with independent components. Because the variances go to ∞, Theorem 3.1 guarantees increasing quality of the normal approximation for any finite collection of components

Y_r′

i(t). However, the full vector (Y ′

r,r = 1, 2, . . .)does not converge: see more on this example in

(13)

Part (ii) of Proposition 4.4 also demonstrates that lim inf_j_→∞p_j₊₁/p_j = 0 does not exclude that

Φ_r(t)_{→ ∞}, hence the condition (4.11) in Lemma 4.3 is not necessary. Finally, by[3, Eqn. 3.1], we have

1

2Φ1(2t) < V(t) < Φ1(t),

meaning that Φ₁(t)is always of the same order as the variance of the number of occupied boxes

V(t). The examples above show that this need not be the case forΦ_r(t), when r_≥2.

5 Regular variation

We now henceforth assume that Φ_r(t) _{→ ∞} for all r _≥1. The CLT for each component of Y(t)

then holds, as observed above, and normal approximation becomes progressively more accurate for the joint distribution of any finite collection of components. A joint normal limit for any collection of the standardized components also holds, provided that the corresponding covariances converge. From (4.3) we have

Cov(Y_r′(t),Y_s′(t)) = Σ_rs(t) = c(r,s)_pΦr+s(2t) V_r(t)V_s(t)

, r₆=s. (5.1)

The RHS converges to a nonzero limit for each pairr,sif, for eachr,Φ_r_≈ f _∈Rα, whereRαdenotes the class of functions regularly varying at_∞with indexα, and where, here and subsequently, we write a≈ bif a(t)/b(t)_→c as t → ∞with 0<c <_∞. IfΦ_r_∈Rα, then the index belongs to the range 0_≤α_≤1, becauseΦ_r(t)cannot converge to 0, and becauseΦ_r(t)/t_→0.

The results in the next section show that, if the covariances converge for a sufficiently large set of pairs r,s, then this is in fact the only possibility. More formally, we say that then regular variation holds in the occupancy problem, meaning that, for some 0≤α_≤1 and somerate function f ∈Rα,

Φ_r _≈ f for all r_≥2 . (5.2)

This setting of regular variation extends the original approach by Karlin[8]in the special caseα=0, and, moreover, it covers all possible limiting covariance structures (Theorem 6.4).

Observe that the functionst−rΦ_r satisfy

dr dtr

¦

t−1Φ₁(t)©= (₋1)rr!¦t−rΦ_r(t)©, (5.3)

thus, in particular, they are completely monotone. This taken together with the standard properties of regularly varying functions[2]implies that, ifΦ_r _∈Rα for some 0≤α <1 andr ≥1, then the same is true for all r ≥1, and we can choose the rate function f = Φ₁. The caseα=1 is special. IfΦ_r_∈R₁ for some r _≥2, then allΦ_r forr _≥2 are of the same order of growth andΦ₁ _∈R₁, but

Φ₁_≫Φ₂ (this motivates the choicer≥2 in (5.2)).

A necessary condition for (5.2) is lim_j_→∞pj+1/pj =1, as follows from the next lemma.

(14)

Proof.We have

t−2Φ₂(t) =

∞

X

j=1

e−t pj_p2

j =

Z 1

0

e−t xν₂(dx)

withν₂[0,x] := P∞_j₌₁p2_j 1[p_j _≤ x]. Suppose t−2Φ_t _∈ R₋β, then 1 ≤ β ≤ 2 and, by Karamata’s Tauberian theorem, alsoν₂[0,t−1]_∈R₋_β. Becauseβ₆=0, the latter implies thatν₂[at−1,bt−1]_∈ R₋β, i.e. that

ν₂[at−1,bt−1]_∼(bβ₋aβ)ℓ(t)t−β, t _{→ ∞} (5.4)

for any positive a < b. However, the assumption of the lemma allows to choose a < b < 1 such thatν₂[ap_j,bpj] =0 for infinitely many j= jk, so (5.4) fails fort =1/pjk → ∞. The contradiction

shows that t−2Φ₂(t)cannot be regularly varying. The assertions regarding r6=2 can be derived in

the same way.

The example below shows thatΦ_r may be regularly varying forr=1 alone.

Example 3(continued). Let g(t) = ν₁[0,t−1] = P∞_j₌₁pj1[pj ≤ t−1]. We have the general

esti-mates

t−1Φ₁(t) _≥ e−1g(t)

and, fora>1 and anyε >0,

t−1Φ₁(t)₋(at)−1Φ₁(at)

≤ εg(at/ε) +_{g(t/log_{1/εg(t)_})₋g(at/ε)_}+

∞

X

j=1

pje−t pj1[pj>t−1log{1/εg(t)}]

≤ 2εg(t) +_{g(t/log_{1/εg(t)_})₋g(at/ε)_}.

Applying these to the block construction withq_i =1/i! andm_i = (i₋2)!, we observe thatg(t)_≍ I(t)−1 _{and that} _g(t_/_log_{₁_/εg(t₎_}₎₋_g(at/ε) _{involves at most two} _q

i, each of the corresponding

terms being of the order of I(t)−2, where I(t) :=min_{i : i! _≥ t_}. It follows that t−1Φ₁(t)_∈R₀, whenceΦ₁ _∈R₁ andΦ₁_≫Φ_r for r_≥2. However, q_i₊₁/q_i _→0, therefore Lemma 5.1 implies that

Φ_r_∈/R1 forr≥2.

Thepropercase of regular variation with index 0< α <1 can be characterized by Karlin’s condition [8, Equation 5]

ν₀[x, 1]:=#_{j:p_j_≥ x_{} ∼} ℓ(1/x)x−α, x _↓0, (5.5)

where and henceforth the symbolℓstands for a function of slow variation at∞. Other equivalent conditions are (see[6])

Φ(t) :=

Z 1

0

(1₋e−t x)ν₀(dx)_∼Γ(1₋α)tαℓ(t),

ν_r[0,x] _∼ α r−α x

r−α_ℓ(₁_/_x) _{for some}_r

≥1,

Φ_r(t) _∼ αΓ(r−α)

r! t

α_ℓ(_t) _{for some}_r

≥1,

(15)

whereℓ∗(y) =1/_{ℓ1/α(y1/α)_}#, and # denotes the de Bruijn conjugate of a slowly varying function [2]. Note that Vr(t) then has the same order of growth, in view of (4.2), yielding behaviour as in

possibility 1 of Theorem 4.1.

We describe next all possible forms of multivariate convergence under the regular variation.

The proper case0< α <1.The joint CLT for

Y_r(t)₋Φ_r(t) p

tαℓ(t)

, r=1, 2, . . .

inR∞_{holds with the limiting covariance matrix}_S_{computed from (4.3) as}

S_rs = ₋αΓ(r+s−α)

r!s!2r+s−α , r 6=s (5.6)

S_{r r} = α

r!

Γ(r₋α)₋Γ(2r−α) r!22r−α

. (5.7)

This agrees with Karlin[8, Theorem 5].

The case of ‘rapid’ regular variation withα=1.If (5.5) holds withα=1 thenℓ(t)must approach 0 ast_{→ ∞}sufficiently fast to havePp_j<_∞2. In this situation we haveΦ_r(t)_∼(r2₋r)−1ℓ(t)t for

r>1 butΦ₁(t)_∼ℓ₁(t)t with someℓ₁_≫ℓ. In fact,X_n_,1_∼K_nasn_{→ ∞}almost surely. Because the scaling ofY1(t)is faster than that for otherYr(t)’s, it follows from (4.2) and (4.3) thatΣ1r(t)→0

for all r _≥ 2, so that the CLT holds withY₁′(t)asymptotically independent of (Y_r′(t),r _≥ 2). The limiting covariance matrix of

Y_r(t)₋Φ_r(t)) p

tℓ(t)

, r_≥2

is obtained by settingα=1 in the above formulas (5.6), (5.7) forS. Our multivariate result extends in this case the marginal convergence that was stated in[8, Thm 5′]3.

The case of slow variationα=0. Karlin’s condition (5.5) withα=0 is too weak to control the

Φ_r(t)’s. We impose therefore a slightly stronger condition

ν₁[0,x]:= X

{j:pj≤x}

p_j _∼ xℓ₁(1/x), (5.8)

which is equivalent to Φ_r _∈ R₀ for any (and hence for all) r _≥ 1. To illustrate the difference, note that in the geometric case, with p_j = (1₋q)qj−1, 0< q < 1, we haveℓ(1/x) _∼log_q(1/x), whereasν₁[0,x] =q⌈logq(x/(1−q))⌉ _{is not regularly varying, since} _ν

1[0,x]/x jumps infinitely often from (1₋q)−1 to q(1₋q)−1 as x _→ 0. The geometric case can be contrasted to the one with

frequencies pj = ce−j

β

(0 < β < 1), for which we have ℓ(1/x) _∼ c|logx|

1

β _and _ν

1[0,x]/x ∼

c_|logx_|

1

β−1_.

2_{One example is}_p

j=c/j{log(j+1)}β+1,β >0, in which caseℓ(t)∼1/c(logt)β+1.

3_Mikhailov_[₁₂_]_{indicated yet other situation where the}_X

n,r’s for r>1 all behave similarly, but their behaviour is

(16)

By[6, Prop. 15], the general connection betweenℓ₁in (5.8) andℓin (5.5) is

ℓ(1/x) =

Z 1

x

u−1ℓ₁(1/u)du, 0<x <1.

Adopting (5.8) we have ν_r[0,x]_∼ r−1xrℓ₁(1/x), r ≥ 1, and the situation is then very similar to that in the proper case: we haveΦ_r(t)_∼r−1ℓ₁(t)and

Y_r(t)₋Φ_r(t) p

ℓ₁(t) , r≥1

jointly converge in law to a multivariate Gaussian limit with covariance matrixS given by

S_rs = ₋ 1

(r+s)2r+s

_r₊_s

r

, r₆=s, (5.9)

Sr r =

₁

r −

1

r22r+1

₂_r

r

. (5.10)

This applies, for instance, to the frequencies p_j _∼ ce−jβ (0< β < 1). This case of slow variation seems not to have been considered before.

6 Convergence of the covariances

We will show in this section that regular variation is essential for the multivariate convergence of the whole standardized vector of counts, so that all possible limit covariance structures are those characterized in the previous section. Our starting point is the following lemma, which asserts that the regular variation is forced by the convergence of the ratios ofΦ_r’s.

Lemma 6.1.

Suppose for some r≥1

lim

t→∞Φr+1(t)/Φr(t) = c. (6.1)

Then(r₋1)/(r+1)_≤c≤r/(r+1)andΦ_r_∈Rα withα:=r−c(r+1). Moreover, we then always

have

lim

t→∞

Φ_s(t) Φ_r(t) =

r!Γ(s₋α)

s!Γ(r₋α) (6.2)

andΦ_s_∈Rα for all s≥1, unlessα=1. If(6.1)holds with r>1and c= (r−1)/(r+1), thenΦs∈R1

for s_≥2, and(6.2)is still true (in particular,Φ₁_≫Φ₂).

Proof. A monotone density result which dates back to von Mises and Lamperti [9] says that the convergencet g′(t)/g(t)_→β impliesg_∈R_β (this holds for arbitraryβ, including_±∞). This result applied to g(t) = t−rΦ_r(t) yields the regular variation Φ_r _∈Rα, with some 0≤ α≤ 1. The rest follows from (5.3), monotonicity and the general behaviour of the regularly varying functions under

integration and differentiation[2].

(17)

Lemma 6.2.

Iflim sup_tΦ_s(t) =_∞for any s_≥1, then no correlation Σ_r_,_r′(t) with2≤ r < r′ can

(iii) Now suppose that jsatisfies (6.3). Then, much as in part (i), for anyr≥2,

r!Φ_r(2j) _≤ X

(18)

Note that the correlationsΣ_1,_s(t),s>1, converge to zero in the case of regular variation with index

α=1. Example 3 illustrates that Σ_1,_s(t)may also converge to zero when regular variation in the sense of (5.2) does not hold.

Lemma 6.3.

If g is continuous and positive, and g(2t)/pg(t)_→k as t_{→ ∞}, with0<k<_∞, then g(t)_→k2.

Proof.Givenǫ >0, let t_ǫbe such thatg(2t)_≤kp(1+ǫ)g(t)for allt _≥t_ǫ. LetK_ǫ:=sup_t_∈_J

ǫg(t),

whereJǫ:= [tǫ, 2tǫ]. Then, for all t∈Jǫ and alln≥0, we have

g(2nt) _{≤ {}k2(1+ǫ)_}1−2−n_{g(t)_}2−n _≤ k2(1+ǫ)K_ǫ2−n.

Thus lim sup_tg(t)_≤k2. A similar argument shows that lim inf_tg(t)_≥k2, proving the lemma.

Theorem 6.4.

Suppose the correlationsΣ_r_,_s(t)converge, as t_{→ ∞}, for r,s satisfying2_≤r<s and r+s≤12. Then the following is true:

(i) (5.2)holds with some0≤α_≤1,

(ii) the correlationsΣ_r_,_s(t)converge for all r,s,

(iii) (Y_r′(t),r=1, 2, . . .)converges weakly to one of the multivariate normal laws described inSection 5for the cases 0< α < 1, α =1 and α= 0, with the obvious adjustment of (5.6),(5.7) and

(5.9),(5.10)in order to have standard normal marginals,

(iv) the same multivariate normal limit holds for the normalized and centred X_n.

Proof.For short, writeV_j=V_j(t), f_j= Φ_j(t)andF_j= Φ_j(2t).

By Lemma 6.2, theΣ_r_,_s(t)converge to nonzero limits, whence, forr,sin the required range,

Fr+s

p V_rV_s ≈

Fr+s

p

V_r₊₁V_s₋₁

and henceVrVs≈Vr+1Vs−1. From this,V5≈V3V4/V2, V6≈V3V5/V2≈V32V4/V22, and substituting in

V₂V₆_≈V₃V₅we get V₄/V₂_≈(V₃/V₂)2. Continuing in this way yields

Vj

V₂ ≈

_V

3

V₂ j−2

for 2_≤ j≤10. (6.4)

From this and F2_j _≈V₂V_j₋₂, we obtain

Fj

V₂ ≈

_V

3

V₂ j/2−2

for 5_≤ j≤12. (6.5)

Substituting (6.4) and (6.5) in fj=Vj+cjF2j (recall (4.2)) yields

f_j

V₂ ≈

_V

3

V₂ j/2−2

(19)

This offers two ways of expressingF_jfor j=5, 6: using (6.5) or (6.6), but with the argument 2t for the latter. The first gives

F₅ _≈ V₂(t)

_V

3(t)

V₂(t) 1/2

, F₆ _≈ V₂(t)

_V

3(t)

V₂(t)

,

and the second gives

F5 ≈ V2(2t)

_V

3(2t)

V₂(2t)

3

, F6 ≈ V2(2t)

_V

3(2t)

V₂(2t)

4

.

It follows that

F6

F5 ≈

_V

3(t)

V2(t)

1/2

≈ V3(2t)

V2(2t) .

Applying Lemma 6.3 to g(t) = V3(t)/V2(t) shows that this must converge, hence from (6.6) the ratioΦ₄(t)/Φ₃(t) must converge too. Parts (i), (ii), (iii) of the theorem now follow from Lemma

6.1, and part (iv) follows by de-Poissonization.

Combining Theorem 6.4 and Lemma 5.1 we arrive at a very simple test for the convergence, which is easy to check in the examples of Section 4:

Corollary 6.5.

The conditionlim_j_→∞p_j₊₁/p_j=1is necessary for the convergence of the (normalized and centred) Xn to a multivariate normal law.

It should be stressed that the condition is by no means sufficient. For instance, the frequencies

pj =c{2+sin(logj)}/j2 satisfypj+1/pj →1 but do not have the property of regular variation due

to the oscillating sine factor. Thus in this caseX_nhas no distributional limit.

AcknowledgementThe authors would like to thank Adrian Röllin and Bero Roos for helpful discus-sions.

References

[1] Bentkus, V. A Lyapunov-type bound in Rd_. _{Theory Probab. Appl.}_{, 49:} _{311-323, 2004.}

MR2144310

[2] Bingham, N., Goldie, C.M. and Teugels, J. L., Regular Variation, Cambridge Univ. Press, Cam-bridge, 1987. MR0898871

[3] Bogachev, L., Gnedin, A. and Yakubovich Y. On the variance of the number of occupied boxes,

Adv. Appl. Math., 40: 401-432, 2008. MR2412154

[4] Le Cam, L. An approximation theorem for the Poisson binomial distribution.Pacific J. Math., 10: 1181-1197, 1960. MR0142174

[5] Chung, F. and Lu, L. Concentration inequalities and martingale inequalities: a survey,Internet

(20)

[6] Gnedin, A., Hansen, B. and Pitman, J. Notes on the occupancy problem with infinitely many boxes: general asymptotics and power laws, Probability Surveys, 4: 146-171, 2007. MR2318403

[7] Hwang, H.-K. and Janson, S. Local limit theorems for finite and infinite urn models, Ann.

Probab.38: 992-1022, 2008. MR2408581

[8] Karlin, S. Central limit theorems for certain infinite urn schemes,J. Math. Mech.17: 373-401, 1967. MR0216548

[9] Lamperti, J. An occupation time theorem for a class of stochastic processes, Trans. AMS 88: 380-387, 1958. MR0094863

[10] Louchard, G., Prodinger, H. and Ward, M. The number of distinct values of some multiplicity in sequences of geometrically distributed random variables,2005 Conf. Anal. Algorithms, DMTCS

Proc.AD: 231-256, 2005. MR2193122

[11] Michel, R. An improved error bound for the compound Poisson approximation of a nearly homogeneous portfolio.ASTIN Bulletin, 17: 165-169, 1988.