

Electronic Journal of Probability

Vol. 16 (2011), Paper no. 29, pages 880–. Journal URL

http://www.math.washington.edu/~ejpecp/

Approximation by the Dickman distribution and quasi-logarithmic combinatorial structures

A. D. Barbour and Bruno Nietlispach

Institut für Mathematik, Universität Zürich, Winterthurerstrasse 190, CH-8057 ZÜRICH. E-mails: [email protected]; [email protected]

Abstract

Quasi-logarithmic combinatorial structures are a class of decomposable combinatorial structures which extend the logarithmic class considered by Arratia, Barbour and Tavaré (2003). In order to obtain asymptotic approximations to their component spectrum, it is necessary first to establish an approximation to the sum of an associated sequence of independent random variables in terms of the Dickman distribution. This in turn requires an argument that refines the Mineka coupling by incorporating a blocking construction, leading to exponentially sharper coupling rates for the sums in question. Applications include distributional limit theorems for the size of the largest component and for the vector of counts of the small components in a quasi-logarithmic combinatorial structure.

Key words: Logarithmic combinatorial structures, Dickman's distribution, Mineka coupling.

AMS 2000 Subject Classification: Primary 60C05, 60F05, 05A16.

Submitted to EJP on August 4, 2010, final version accepted April 14, 2011.


1 Introduction

Many of the classical random decomposable combinatorial structures, such as random permutations and random polynomials over a finite field, have component structure satisfying a conditioning relation: if $C_i^{(n)}$ denotes the number of components of size $i$, the distribution of the vector of component counts $(C_1^{(n)}, \ldots, C_n^{(n)})$ of a structure of size $n$ can be expressed as

$$\mathcal{L}\bigl(C_1^{(n)}, \ldots, C_n^{(n)}\bigr) = \mathcal{L}\bigl(Z_1, \ldots, Z_n \,\big|\, T_{0,n} = n\bigr), \tag{1.1}$$

where $(Z_i, i \ge 1)$ is a fixed sequence of independent non-negative integer valued random variables, and $T_{a,n} := \sum_{i=a+1}^{n} i Z_i$, $0 \le a < n$. If, as in the examples above, the $Z_i$ also satisfy

$$i\,\mathbb{P}[Z_i = 1] \to \theta \quad\text{and}\quad i\,\mathbb{E}Z_i \to \theta, \tag{1.2}$$

the combinatorial structure is called logarithmic. It is shown in Arratia, Barbour and Tavaré (2003) [ABT] that combinatorial structures satisfying the conditioning relation and slight strengthenings of the logarithmic condition share many common properties. For instance, if $L^{(n)}$ is the size of the largest component, then $n^{-1}L^{(n)} \to_d L$, where $L$ has probability density function $f_\theta(x) := e^{\gamma\theta}\Gamma(\theta+1)x^{\theta-2}p_\theta((1-x)/x)$, $x \in (0,1]$, and $p_\theta$ is the density of the Dickman distribution $P_\theta$ with parameter $\theta$, given in Vervaat (1972, p. 90). Furthermore, for any sequence $(a_n, n \ge 1)$ with $a_n = o(n)$,

$$\lim_{n\to\infty} d_{TV}\bigl(\mathcal{L}(C_1^{(n)}, \ldots, C_{a_n}^{(n)}),\ \mathcal{L}(Z_1, \ldots, Z_{a_n})\bigr) = 0.$$

Both of these convergence results can be complemented by estimates of the approximation error, under appropriate conditions. The asymptotic behaviour is quite different if $\mathbb{E}Z_i \sim i^{\alpha}$ for $\alpha \neq 1$: see, for example, Freiman and Granovsky (2002) and Barbour and Granovsky (2005).
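As a concrete check of the conditioning relation (1.1), the following sketch compares, by exact enumeration, the law of the cycle counts of a uniform random permutation with the law of $(Z_1, \ldots, Z_n)$ given $T_{0,n} = n$, taking $Z_i \sim \mathrm{Po}(1/i)$. The choice $n = 5$ and all function names are illustrative assumptions, not part of the original text; exact rational arithmetic is used, and the common factor $\exp(-\sum_i 1/i)$ is dropped because it cancels under the conditioning.

```python
from collections import Counter
from fractions import Fraction
from itertools import permutations
from math import factorial


def cycle_counts(perm):
    """Return (C_1, ..., C_n) for a permutation given as a tuple of images."""
    n = len(perm)
    seen = [False] * n
    counts = [0] * n
    for start in range(n):
        if seen[start]:
            continue
        length, j = 0, start
        while not seen[j]:
            seen[j] = True
            j = perm[j]
            length += 1
        counts[length - 1] += 1
    return tuple(counts)


def permutation_law(n):
    """Exact law of the cycle counts of a uniform random permutation of size n."""
    tally = Counter(cycle_counts(p) for p in permutations(range(n)))
    return {c: Fraction(m, factorial(n)) for c, m in tally.items()}


def conditioned_poisson_law(n):
    """Exact law of (Z_1, ..., Z_n) given sum_i i*Z_i = n, with Z_i ~ Po(1/i)."""
    weights = {}

    def extend(i, remaining, prefix, weight):
        # Enumerate all count vectors with sum_i i*c_i = n.
        if i > n:
            if remaining == 0:
                weights[tuple(prefix)] = weight
            return
        for c in range(remaining // i + 1):
            w = weight * Fraction(1, i) ** c / factorial(c)
            extend(i + 1, remaining - i * c, prefix + [c], w)

    extend(1, n, [], Fraction(1))
    total = sum(weights.values())
    return {c: w / total for c, w in weights.items()}


if __name__ == "__main__":
    n = 5  # illustrative size; enumeration cost grows like n!
    print(permutation_law(n) == conditioned_poisson_law(n))
```

The enumeration is only feasible for small $n$, but it makes the structure of (1.1) explicit: the independent sequence is fixed once and for all, and only the conditioning event depends on $n$.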

Knopfmacher (1979) introduced the notion of additive arithmetic semigroups, which give rise to decomposable combinatorial structures satisfying the conditioning relation, with negative binomially distributed $Z_i$. For these structures, $i\,\mathbb{P}[Z_i = 1] \sim i\,\mathbb{E}Z_i = \theta_i$, where the $\theta_i$ do not always converge to a limit as $i \to \infty$. In those cases in which they do not, they become close to the integer skeleton of a sum of cosine functions with differing frequencies:

$$\theta'_t := \theta + \sum_{l=1}^{L} \lambda_l \cos(2\pi f_l t - \varphi_l), \quad t \in \mathbb{R}, \tag{1.3}$$

with $\sum_{l=1}^{L} \lambda_l \le \theta$, and thus exhibit quasi-periodic behaviour. It is therefore natural to ask whether the asymptotic behaviour that holds generally for logarithmic combinatorial structures also holds for such structures, in which the $\theta_i$ do not converge, but certain averages of the $\theta_i$ do, and, if so, to find weaker forms of the logarithmic condition that are enough to imply the same asymptotic behaviour.

In this paper, we define a family of combinatorial structures, the quasi-logarithmic class, that includes the logarithmic structures as a special case, as well as those of Zhang (1996). For such structures, we give conditions under which $n^{-1}L^{(n)} \to_d L$ (Theorem 4.1) and $\lim_{n\to\infty} d_{TV}\bigl(\mathcal{L}(C_1^{(n)}, \ldots, C_{a_n}^{(n)}),\ \mathcal{L}(Z_1, \ldots, Z_{a_n})\bigr) = 0$ (Theorem 4.3), just as in the logarithmic case. A key step in the proofs is to be able to show that, for sequences $a_n = o(n)$, the normalized sum $n^{-1}T_{a_n,n}$ is close in distribution to the Dickman distribution $P_\theta$ (Theorems 3.4 and 3.5), and that the error rates in these approximations can be controlled. To do so, it is in turn necessary to be able to show that, under suitable conditions,

$$\lim_{n\to\infty} d_{TV}\bigl(\mathcal{L}(T_{a_n,n}),\ \mathcal{L}(T_{a_n,n}+1)\bigr) = 0, \quad\text{for all } a_n = o(n), \tag{1.4}$$

and that the error rate can be bounded by a power of $(a_n+1)/n$.

A number of the arguments used are adapted to the more general context from those presented in [ABT]. There, the distribution of the sum $T_{0,n} := \sum_{i=1}^{n} i Z_i$ is close to that of $T^*_{0,n} := \sum_{i=1}^{n} i Z_i^*$, where $Z_i^* \sim \mathrm{Po}(\theta i^{-1})$, and the latter sum has a compound Poisson distribution $\mathrm{CP}(\theta,n)$ whose properties are tractable. In the current situation, with the $\theta_i$'s not all asymptotically equal, it is first necessary to show that $\mathrm{CP}(\theta,n)$ is still a good approximation to the distribution of the sum $T_{0,n}$. This is by no means obviously the case. The intuition is nonetheless that, if the distributions of $T_{0,n}$ and $T_{0,n}+1$ are not too different, then having $\theta_i = 2\theta$ and $\theta_{i+1} = 0$ instead of $\theta_i = \theta_{i+1} = \theta$ should leave the distribution $\mathcal{L}(T_{0,n})$ more or less unchanged; only the average behaviour of the $\theta_i$ should be important. Thus we first want to establish (1.4). Once we have done so, we are able to show, by way of Stein's method, that $\mathcal{L}(T_{0,n})$ is indeed close to $\mathrm{CP}(\theta,n)$.

Proving that (1.4) holds under conditions appropriate for our quasi-logarithmic structures turns out in itself to be an interesting problem. The standard Mineka coupling, used to bound the total variation distance between a sum of independent, integer valued random variables and its unit translate, gives a very poor approximation in this context. To overcome the difficulty, we introduce a new coupling strategy, which yields a much more precise statement in a rather general setting (Theorem 2.1). This is the substance of the next section. We then show in Section 3 that the distributions $\mathcal{L}(T_{0,n})$ and $\mathrm{CP}(\theta,n)$ are close, and conclude in Section 4 that quasi-logarithmic combinatorial structures behave like logarithmic structures.

As observed by Manstavičius (2009), when considering only the small components, the distances $d_{TV}\bigl(\mathcal{L}(C_1^{(n)}, \ldots, C_{a_n}^{(n)}),\ \mathcal{L}(Z_1, \ldots, Z_{a_n})\bigr)$ can be bounded, even without assuming that the $\theta_j$'s converge on average to any fixed $\theta$, as long as they are bounded and bounded away from 0 (we do not require the latter condition). He considers only the case of Poisson distributed $Z_i$, for which, inspecting the proof of Theorem 4.3, it is enough to obtain an estimate of the form

$$n\,\bigl|\mathbb{P}[T_{a_n,n} = n-k] - \mathbb{P}[T_{a_n,n} = n-l]\bigr| \le C\{|k-l|/n\}^{\gamma}, \quad 0 \le k,l \le n/2,$$

for some $\gamma > 0$. This he achieves by using his refined characteristic function arguments. Since we are also interested in approximating the distribution of the largest components, for which some form of convergence to a $\theta$ seems necessary, we do not attempt this refinement.

2 An alternative to the Mineka coupling

Let $\{X_i\}_{i\in\mathbb{N}}$ be mutually independent $\mathbb{Z}$-valued random variables, and let $S_n := \sum_{i=1}^{n} X_i$. The Mineka coupling, developed independently by Mineka (1973) and Rösler (1977) (see also Lindvall (2002, Section II.14)), yields a bound of the form

$$d_{TV}\bigl(\mathcal{L}(S_n),\ \mathcal{L}(S_n+1)\bigr) \le \Bigl(\frac{\pi}{2}\sum_{i=1}^{n} u_i\Bigr)^{-1/2}, \tag{2.1}$$

where

$$u_i := 1 - d_{TV}\bigl(\mathcal{L}(X_i),\ \mathcal{L}(X_i+1)\bigr);$$

see Mattner & Roos (2007, Corollary 1.6). The proof is based on coupling copies $\{X'_i\}_{i\in\mathbb{N}}$ and $\{X''_i\}_{i\in\mathbb{N}}$ of $\{X_i\}_{i\in\mathbb{N}}$ in such a way that

$$V_n := \sum_{i=1}^{n} (X''_i - X'_i), \quad n \in \mathbb{N},$$

is a symmetric random walk with steps in $\{-1, 0, 1\}$; the coupling inequality (Lindvall, 2002, Section I.2) then shows that

$$d_{TV}\bigl(\mathcal{L}(S_n),\ \mathcal{L}(S_n+1)\bigr) \le \mathbb{P}[\tau > n] = \mathbb{P}[V_n \in \{-1, 0\}],$$

where $\tau$ is the time at which $\{V_n\}_{n\in\mathbb{Z}_+}$ first hits level 1, the last equality following from the reflection principle. However, this inequality gives slow convergence rates, if $X_i = i Z_i$ and the $Z_i$ are as described in the Introduction; typically, $d_{TV}\bigl(\mathcal{L}(iZ_i),\ \mathcal{L}(iZ_i+1)\bigr)$ is extremely close to 1, and, if $X_i$ is taken instead to be $(2i-1)Z_{2i-1} + 2iZ_{2i}$, we still expect to have $1 - d_{TV}\bigl(\mathcal{L}(X_i),\ \mathcal{L}(X_i+1)\bigr) \asymp i^{-1}$, leading to bounds of the form

$$d_{TV}\bigl(\mathcal{L}(S_n),\ \mathcal{L}(S_n+1)\bigr) = O\bigl((\log n)^{-1/2}\bigr).$$

In this section, by modifying the Mineka approach in the spirit of Rogers (1999) to allow the random walk $V$ to make larger jumps, we show that error bounds of order $n^{-\gamma}$ for some $\gamma > 0$ can be achieved, representing an exponential improvement over the Mineka bounds.
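To see how slowly a bound of the Mineka type decays when the overlaps are of order $i^{-1}$, the following sketch evaluates the right-hand side of (2.1), read here as $\bigl(\tfrac{\pi}{2}\sum_i u_i\bigr)^{-1/2}$, for the hypothetical overlaps $u_i = \min(1, c/i)$. The constant $c$ and the function names are assumptions made only for illustration.

```python
from math import pi


def mineka_bound(u):
    """Right-hand side of (2.1), ((pi/2) * sum(u)) ** (-1/2), capped at 1."""
    total = sum(u)
    if total <= 0:
        return 1.0
    return min(1.0, (pi / 2 * total) ** -0.5)


def harmonic_overlaps(n, c=1.0):
    """Hypothetical overlaps u_i = min(1, c / i), i = 1, ..., n."""
    return [min(1.0, c / i) for i in range(1, n + 1)]


if __name__ == "__main__":
    # The bound shrinks only like (log n) ** (-1/2).
    for n in (10**2, 10**4, 10**6):
        print(n, mineka_bound(harmonic_overlaps(n)))
```

Multiplying $n$ by a factor of $10^4$ changes the bound only modestly, whereas a bound of order $n^{-\gamma}$ would shrink by the factor $10^{-4\gamma}$.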

Let $(X_i, i \ge 1)$ be independent $\mathbb{Z}_+$-valued random variables, set $S_{a,n} := \sum_{i=a+1}^{n} X_i$, and define

$$q(i,d) := \min\{\mathbb{P}[X_i = 0]\,\mathbb{P}[X_{i+d} = i+d],\ \mathbb{P}[X_i = i]\,\mathbb{P}[X_{i+d} = 0]\}, \quad i,d \in \mathbb{N}.$$

Then it is possible to couple copies $(X'_i, X'_{i+d})$ and $(X''_i, X''_{i+d})$ of $(X_i, X_{i+d})$ for any $i,d$ in such a way that

$$\begin{aligned}
\mathbb{P}[(X'_i, X'_{i+d}) = (0, i+d),\ (X''_i, X''_{i+d}) = (i, 0)]
&= \mathbb{P}[(X'_i, X'_{i+d}) = (i, 0),\ (X''_i, X''_{i+d}) = (0, i+d)] = q(i,d); \\
\mathbb{P}[(X'_i, X'_{i+d}) = (X''_i, X''_{i+d})] &= 1 - 2q(i,d).
\end{aligned} \tag{2.2}$$

Note that then

$$(X'_i + X'_{i+d}) - (X''_i + X''_{i+d}) =
\begin{cases}
d & \text{with probability } q(i,d); \\
0 & \text{with probability } 1 - 2q(i,d); \\
-d & \text{with probability } q(i,d),
\end{cases} \tag{2.3}$$

so that sums of such differences, with non-overlapping indices, can be constructed so as to perform a symmetric random walk on $d\mathbb{Z}$. By successively coupling pairs in this way, and by using different values of $d$, it may thus be possible to couple the sums $S'_{a,n} := 1 + \sum_{i=a+1}^{n} X'_i$ and $S''_{a,n} := \sum_{i=a+1}^{n} X''_i$ quickly, even when many of the overlaps $q(i,d)$ are zero. The following theorem is typical of what can be achieved.
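The pair coupling in (2.2) can be written down explicitly. The sketch below builds one such joint law for $X_i = iZ_i$ with $Z_i \sim \mathrm{Po}(\theta/i)$, checks that both copies have the product marginal, and recovers the difference distribution (2.3). The Poisson choice, the truncation at `kmax` (with renormalisation) and all function names are assumptions made for the illustration.

```python
from itertools import product
from math import exp, factorial


def scaled_poisson_pmf(i, theta, kmax=12):
    """Truncated, renormalised pmf of X_i = i * Z_i with Z_i ~ Po(theta / i)."""
    lam = theta / i
    pmf = {i * k: exp(-lam) * lam**k / factorial(k) for k in range(kmax)}
    total = sum(pmf.values())
    return {x: p / total for x, p in pmf.items()}


def couple_pair(p_i, p_j, i, d):
    """Joint law of ((X'_i, X'_{i+d}), (X''_i, X''_{i+d})) as in (2.2)."""
    q = min(p_i[0] * p_j[i + d], p_i[i] * p_j[0])
    a, b = (0, i + d), (i, 0)
    # Two off-diagonal atoms of mass q swap the states a and b.
    joint = {(a, b): q, (b, a): q}
    # All remaining mass sits on the diagonal, where the copies agree.
    for x, y in product(p_i, p_j):
        removed = q if (x, y) in (a, b) else 0.0
        joint[((x, y), (x, y))] = p_i[x] * p_j[y] - removed
    return q, joint


def marginal(joint, which):
    """Law of the first (which=0) or second (which=1) copy of the pair."""
    out = {}
    for states, mass in joint.items():
        out[states[which]] = out.get(states[which], 0.0) + mass
    return out


def difference_pmf(joint):
    """Law of (X'_i + X'_{i+d}) - (X''_i + X''_{i+d})."""
    out = {}
    for (first, second), mass in joint.items():
        diff = sum(first) - sum(second)
        out[diff] = out.get(diff, 0.0) + mass
    return out


if __name__ == "__main__":
    i, d, theta = 3, 2, 1.0
    q, joint = couple_pair(scaled_poisson_pmf(i, theta),
                           scaled_poisson_pmf(i + d, theta), i, d)
    print(q, difference_pmf(joint))
```

Because $q(i,d)$ is the minimum of the two product probabilities, the diagonal masses are never negative, which is exactly what makes the construction a valid coupling.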

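For $d \in \mathbb{N}$ and $\psi > 0$, define $E(d,\psi) := \{i:\ q(i,d) \ge \psi/(i+d)\}$. For $D$ a subset of $\mathbb{N}$, suppose that there are $k \in \mathbb{N}$ and $\psi > 0$ such that

$$E(d,\psi) \cap \{jk+1, \ldots, (j+1)k\} \neq \emptyset \quad\text{for all } d \in D,\ j \ge 1. \tag{2.4}$$

In particular, if $X_i = iZ_i$ with $Z_i \sim \mathrm{Po}(\theta_i/i)$, and if $0 < \theta_- \le \theta_i$ for all $i$, then clearly $q(i,d) \ge \theta_-/(i+d)$ for all $i$ and $d \in \mathbb{N}$, and so (2.4) holds for any $D$ with $k = 1$ and $\psi = \theta_-$. However, (2.4) also holds for any $D$ if, for instance, $0 < \theta_- \le \theta_i$ is only given for $i \in 3\mathbb{N} \cup \{7\mathbb{N}+2\}$, now with $k = 21$ and $\psi = \theta_-$.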

Theorem 2.1. Let $r,s \in \mathbb{N}$ be co-prime, and set

$$D := \{r\} \cup \{s2^g,\ g \ge 0\}. \tag{2.5}$$

Suppose that, for some $k,\psi$, (2.4) is satisfied with $D$ as above. Then there exist $C,\gamma > 0$, depending on $r,s,k$ and $\psi$, such that

$$d_{TV}\bigl(\mathcal{L}(S_{a,n}),\ \mathcal{L}(S_{a,n}+1)\bigr) \le 6\{(a+1)/n\}^{\gamma},$$

for all $0 \le a < n$ for which $a+1 \le Cn$.

Proof. We take $\tilde S'_0 = 1$, $\tilde S''_0 = 0$, and then successively define $\tilde S'_j := \tilde S'_0 + \sum_{i\in I_j} X'_i$, $\tilde S''_j := \tilde S''_0 + \sum_{i\in I_j} X''_i$, $T_j := \tilde S'_j - \tilde S''_j$, $j \ge 1$. Here, the sequences $(X'_i, 1 \le i \le n)$ and $(X''_i, 1 \le i \le n)$ are two copies of the sequence $(X_i, 1 \le i \le n)$ of independent random variables, constructed by successively coupling pairs $(X'_{i_j}, X'_{i_j+d_j})$ and $(X''_{i_j}, X''_{i_j+d_j})$, for suitable $i_j$ and $d_j$, realized independently of the random variables $(X'_i, X''_i,\ i \in I_{j-1})$, where $I_{j-1} := \bigcup_{l=1}^{j-1}\{i_l, i_l+d_l\}$. This coupling of pairs typically omits some indices $i \in \{1, 2, \ldots, n\}$; for such $i$, we set $X'_i = X''_i$, chosen independently from $\mathcal{L}(X_i)$. The coupling of the pairs $(X'_i, X'_{i+d})$ and $(X''_i, X''_{i+d})$ is accomplished by arranging that $\mathcal{L}((X'_i, X'_{i+d})) = \mathcal{L}((X''_i, X''_{i+d})) = \mathcal{L}(X_i) \times \mathcal{L}(X_{i+d})$ and that $(X'_i + X'_{i+d}) - (X''_i + X''_{i+d}) \in \{-d, 0, d\}$, as described in (2.2). The indices are defined by taking $i_1 = \min\{i > a:\ i \in E(r,\psi)\}$, and then taking $i_{j+1} := \min\{i > i_j:\ i \in E(d_{j+1},\psi),\ i, i+d_{j+1} \notin I_j\}$, where

$$d_l :=
\begin{cases}
r, & \text{if } T_{l-1} \notin s\mathbb{Z}; \\
s2^{f_2(T_{l-1}/s)}, & \text{if } T_{l-1} \in s\mathbb{Z},\ T_{l-1} \neq 0; \\
0, & \text{if } T_{l-1} = 0,
\end{cases} \tag{2.6}$$

and where $f_2(t)$ is the exponent of 2 in the prime factorization of $|t|$, $t \in \mathbb{Z}$. If $T_{l-1} = 0$, we couple $X'_{i_l} = X''_{i_l}$, and thus $X'_{i_{l'}} = X''_{i_{l'}}$ for all $l' \ge l$, with $i_{l'}$ running through all $i > i_{l-1}$ such that $i \notin I_{l-1}$.

With this construction, the sequence $T_j$ can only change in jumps of size $\pm r$ until it first reaches $s\mathbb{Z}$. Thereafter, at any jump, the exponent $f_2(T_j/s)$ increases by at least 1 until $T_j$ is of the form $\pm s2^l$ for some $l$; after this, the value of $T_j$ is either doubled or set to zero at each jump, in the latter case remaining in zero for ever. If $i_J + d_J \le n$, where $J := \inf\{j:\ T_j = 0\}$, then

$$S'_{a,n} := 1 + \sum_{i=a+1}^{n} X'_i = \sum_{i=a+1}^{n} X''_i =: S''_{a,n},$$

and $\mathcal{L}(S'_{a,n}) = \mathcal{L}(S_{a,n}+1)$, $\mathcal{L}(S''_{a,n}) = \mathcal{L}(S_{a,n})$, so that, from the coupling inequality (Lindvall, 2002, Section I.2),

$$d_{TV}\bigl(\mathcal{L}(S_{a,n}),\ \mathcal{L}(S_{a,n}+1)\bigr) \le \mathbb{P}[i_J + d_J > n]. \tag{2.7}$$

We thus wish to bound this probability.
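The behaviour of the difference process can be explored with a small simulation of its embedded jump chain. The sketch below is an idealisation: it ignores the question of which indices $i_j$ are available, and assumes that every attempted coupling produces a jump of $\pm d_l$ with equal probability, with $d_l$ read from (2.6) as $r$ off $s\mathbb{Z}$ and $s2^{f_2(T/s)}$ on $s\mathbb{Z}\setminus\{0\}$. The values $r = 3$, $s = 4$ and the function names are illustrative assumptions.

```python
import random


def two_adic_exponent(t):
    """f_2(t): exponent of 2 in the prime factorisation of |t|, for t != 0."""
    t = abs(t)
    return (t & -t).bit_length() - 1


def jump_size(T, r, s):
    """Jump size d_l prescribed by (2.6) when the current difference is T."""
    if T == 0:
        return 0
    if T % s != 0:
        return r
    return s * 2 ** two_adic_exponent(T // s)


def simulate_jump_chain(r, s, rng, max_jumps=10_000):
    """Path of T started at 1, jumping by +/- jump_size until it hits 0."""
    T, path = 1, [1]
    while T != 0 and len(path) <= max_jumps:
        d = jump_size(T, r, s)
        T += d if rng.random() < 0.5 else -d
        path.append(T)
    return path


if __name__ == "__main__":
    rng = random.Random(2011)
    lengths = [len(simulate_jump_chain(3, 4, rng)) - 1 for _ in range(1000)]
    print("mean number of jumps until T = 0:", sum(lengths) / len(lengths))
```

In such runs the chain typically needs only a handful of jumps to reach 0, which is why the proof concentrates on showing that enough jump opportunities are available among the indices $a+1, \ldots, n$.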

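Until $T$ first reaches $s\mathbb{Z}$, it makes jumps of $\pm r$ with equal probability, and, since $r$ and $s$ are co-prime, the expected number of jumps needed to reach a multiple of $s$ from any starting point is at most $s^2/4$. Hence, by the Markov property of the simple random walk, the number of jumps $N_1$ until a multiple of $s$ is hit is bounded in distribution by $\tfrac12 s^2 G_1$, where $\mathbb{P}[G_1 > j] = 2^{-j}$, $j \ge 1$; in particular, for any $\gamma > 0$,

$$\mathbb{P}[N_1 > \tfrac12 s^2\gamma\log_2(1/\alpha_n)] \le 2\alpha_n^{\gamma},$$

where $\alpha_n := (a+1)/n$. The remaining number $N_2$ of jumps required for $T$ to reach 0 is then at most $\log_2 r$ (in order to reach the form $\pm s2^l$ for some $l$), together with an independent random number $G_2$ of steps until 0 is reached, having the same distribution as $G_1$; hence,

$$\mathbb{P}[N_2 > 2\gamma\log_2(1/\alpha_n)] \le 2\alpha_n^{\gamma}$$

also, if $n \ge (a+1)r^{1/\gamma}$. It remains to show that the process $T$ has the opportunity to make this many jumps, with high probability, for suitable choice of $\gamma$.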

Now, in view of (2.4), every block of indices $\{jk+1, \ldots, (j+1)k\}$ contains at least one $i \in E(r,\psi)$. Hence, for any $1/2 \le \beta < 1$, we can choose a set $S_1$ of non-overlapping pairs $(i_l, i_l+r)$, $1 \le l \le L$, such that $i_l \in E(r,\psi)$ and $a+1 \le i_l \le k(a+1)\alpha_n^{-\beta} - r$ for each $l$, and such that

$$\sum_{l=1}^{L} \frac{1}{i_l+r} > \frac{1}{2(k\vee r)} \sum_{i=2}^{\lfloor (a+1)\alpha_n^{-\beta}\rfloor} \frac{1}{i + a/(k\vee r)} \ge \frac{\beta}{4(k\vee r)}\log(1/\alpha_n),$$

if $n \ge 3^{2/\beta}(a+1)$. The first factor 2 in the denominator is present because a pair $(i, i+r)$ with $i \in E(r,\psi)$ can be excluded from $S_1$, but only if $i = i_l + r$ for some pair $(i_l, i_l+r)$ already in $S_1$; the other is to yield an inequality, rather than an asymptotic equality. In similar fashion, for any non-decreasing sequence $(\rho_l,\ l \ge 1)$, we can choose a set $S_2$ of non-overlapping pairs $(i'_l, i'_l + s2^{\rho_l\wedge l_n})$, $1 \le l \le L'$, where $l_n := \lfloor \tfrac12\log_2 n\rfloor$, such that $k(a+1)\alpha_n^{-\beta} < i'_l \le n - s\lfloor\sqrt{n}\rfloor$ for each $l$, and such that

$$\sum_{l=1}^{L'} \frac{1}{i'_l + s2^{\rho_l\wedge l_n}} > \frac{1}{2k} \sum_{i=\lceil 2(a+1)\alpha_n^{-\beta}\rceil+1}^{\lfloor n/k\rfloor} i^{-1} \ge \frac{1-\beta}{4k}\log(1/\alpha_n),$$

if also $n \ge (a+1)(4k)^{2/(1-\beta)}$.

We now show that, for suitable choices of $\gamma$ and $\beta$, the pairs in $S_1$ with high probability yield $M_1 \ge \tfrac12 s^2\gamma\log_2 n$ jumps of $T$. We then show that those in $S_2$, with the sequence $\rho_l$ chosen in non-anticipating fashion such that $\rho_1$ is the exponent of 2 in $T_l$ at the first $l$ at which $T_l \in s\mathbb{Z}$, $\rho_{l+1} = \rho_l$ if $T_l = T_{l-1} \neq 0$, $\rho_{l+1} = (\rho_l+1)\wedge l_n$ if $0 < T_l \neq T_{l-1}$ and $\rho_{l+1} = l_n$ otherwise, yield $M_2 \ge 2\gamma\log_2 n$.

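Indeed, by the Chernoff inequalities (Chung & Lu 2006, Theorem 3.1), if $\varphi_1$, $0 < \varphi_1 < 1$, is such that

$$\tfrac12 s^2\gamma\log_2(1/\alpha_n) = \frac{\beta\psi(1-\varphi_1)}{4(k\vee r)}\log(1/\alpha_n), \tag{2.8}$$

then

$$\mathbb{P}[M_1 < \tfrac12 s^2\gamma\log_2(1/\alpha_n)] \le \exp\{-3\varphi_1^2\beta\psi\log(1/\alpha_n)/32(k\vee r)\} \le \alpha_n^{\gamma},$$

if $3\varphi_1^2 s^2/\{16(1-\varphi_1)\log 2\} \ge 1$. Similarly, using a martingale analogue of the Chernoff inequalities (Chung & Lu 2006, Theorem 6.1), for $\varphi_2$ such that $3\varphi_2^2/\{4(1-\varphi_2)\log 2\} \ge 1$ and with

$$2\gamma\log_2(1/\alpha_n) = \frac{(1-\beta)\psi(1-\varphi_2)}{4k}\log(1/\alpha_n), \tag{2.9}$$

we get

$$\mathbb{P}[M_2 < 2\gamma\log_2(1/\alpha_n)] \le \exp\{-3\varphi_2^2(1-\beta)\psi\log(1/\alpha_n)/32k\} \le \alpha_n^{\gamma}.$$

Finally, for such choices of $\varphi_1$ and $\varphi_2$, equations (2.8) and (2.9) can be satisfied with the same choice of $\beta$ if $\gamma$ is chosen such that

$$1 = \frac{2\gamma}{\psi\log 2}\Bigl\{\frac{(k\vee r)s^2}{1-\varphi_1} + \frac{4k}{1-\varphi_2}\Bigr\};$$

then

$$\beta = \frac{2\gamma(k\vee r)s^2}{(1-\varphi_1)\psi\log 2}.$$

Choosing $\varphi_2$ to satisfy $3\varphi_2^2/\{4(1-\varphi_2)\log 2\} = 1$, and then $\varphi_1$ larger than its minimum value, if necessary, to ensure that $\beta \ge 1/2$, this yields the theorem.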
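The final bookkeeping of the proof can be carried out numerically. The sketch below solves the two Chernoff side conditions for $\varphi_1$ and $\varphi_2$, raises $\varphi_1$ if needed so that $\beta \ge 1/2$, and then returns the resulting $\gamma$ and $\beta$, reading (2.9) as $2\gamma\log_2(1/\alpha_n) = (1-\beta)\psi(1-\varphi_2)\log(1/\alpha_n)/(4k)$. The function names and the sample parameter values are assumptions for illustration; the exponent obtained this way is the one delivered by this particular argument, not an optimised value.

```python
from math import log, sqrt


def root_in_unit_interval(c):
    """The root in (0, 1) of 3 * phi**2 = c * (1 - phi), for c > 0."""
    return (-c + sqrt(c * c + 12 * c)) / 6


def coupling_exponent(r, s, k, psi):
    """Return (gamma, beta, phi1, phi2) satisfying (2.8) and (2.9) jointly."""
    kr = max(k, r)
    # Smallest phi2 with 3 phi2^2 / (4 (1 - phi2) log 2) >= 1.
    phi2 = root_in_unit_interval(4 * log(2))
    # Smallest phi1 with 3 phi1^2 s^2 / (16 (1 - phi1) log 2) >= 1.
    phi1 = root_in_unit_interval(16 * log(2) / s**2)
    # Increase phi1, if necessary, so that beta >= 1/2.
    phi1 = max(phi1, 1 - kr * s * s * (1 - phi2) / (4 * k))
    A = kr * s * s / (1 - phi1)
    B = 4 * k / (1 - phi2)
    gamma = psi * log(2) / (2 * (A + B))
    beta = A / (A + B)
    return gamma, beta, phi1, phi2


if __name__ == "__main__":
    # Hypothetical parameter values.
    print(coupling_exponent(r=3, s=4, k=2, psi=0.5))
```

The value of $\gamma$ is proportional to $\psi$ and decreases as $s$, $r$ or $k$ grows, so sparse or weak overlaps translate directly into a slower polynomial rate.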

Clearly, the exponent $\gamma$ could be sharpened; the condition (2.4) could also be weakened to one ensuring a positive density of indices in each $E(d,\psi)$ over longer intervals. The set $D$ could also be constructed in other ways. One natural extension would be to replace $r$ co-prime to $s$ with any $r_1, \ldots, r_m$ satisfying $\gcd\{r_1, \ldots, r_m, s\} = 1$. Note also that if, for some $j_0 \ge 1$,

$$E(d,\psi) \cap \{jk+1, \ldots, (j+1)k\} \neq \emptyset, \quad\text{for all } d \in D,\ j \ge j_0,$$

then (2.4) holds with $k$ replaced by $j_0k$.

The coupling used to establish Theorem 2.1 is not the only possibility. In the example of additive arithmetic semigroups, there is one case in which the set $D$ can be taken to consist of the integers $\{2^{g+1},\ g \ge 0\}$, but no odd integers. Here, the jumps in the process $T$ would always be even, and hence, since $T_0 = 1$, $T$ can never hit 0. However, if we define

$$\tilde q(i) := \min\{\mathbb{P}[X_i = 0],\ \mathbb{P}[X_i = i]\}; \qquad \tilde E(1,\psi) := \{i \in 2\mathbb{Z}+1:\ \tilde q(i) \ge \psi/i\},$$

and if, for all $j \ge 1$,

$$\tilde E(1,\psi) \cap \{jk+1, \ldots, (j+1)k\} \neq \emptyset, \tag{2.10}$$

then one can begin the coupling construction by defining $X'_i = X''_i$ for even $i$ and coupling $X'_i$ and $X''_i$ for odd $i$ in such a way that

$$\mathbb{P}[X'_i = i, X''_i = 0] = \mathbb{P}[X'_i = 0, X''_i = i] = 1 - \mathbb{P}[X'_i = 0, X''_i = 0] = \tilde q(i),$$

until the first time $i$ that $X'_i \neq X''_i$, at which time the difference $T_i$ is even, taking either the value $i+1$ or $i-1$. Thereafter, the coupling is concluded using jumps of sizes $2^{g+1}$, with the second half of the strategy in the previous proof. Now the number of steps required to complete the coupling depends on how big the first even value of $T$ happens to be, but Chernoff bounds are still sufficient to be able to conclude the following theorem, which we state without proof.

Theorem 2.2. Suppose that, for some $k,\psi$, (2.4) is satisfied with $D = \{2^{g+1},\ g \ge 0\}$, and (2.10) is also satisfied. Then there exist $C,\gamma > 0$, depending on $k$ and $\psi$, such that

$$d_{TV}\bigl(\mathcal{L}(S_{a,n}),\ \mathcal{L}(S_{a,n}+1)\bigr) \le C\{(a+1)/n\}^{\gamma},$$

3 Approximation by the Dickman distribution

As in the Introduction, let $(C_1^{(n)}, \ldots, C_n^{(n)})$ be the component counts of a decomposable combinatorial structure of size $n$, related to the sequence of independent random variables $(Z_i, i \ge 1)$ through the Conditioning Relation (1.1). In this section, we wish to bound the distance between the distribution of the normalized sum $n^{-1}T_{a,n} := n^{-1}\sum_{i=a+1}^{n} iZ_i$ and the Dickman distribution $P_\theta$, when the quantities $\theta_i := i\,\mathbb{E}Z_i$ converge in some weak, average sense to $\theta$, and when $i\,\mathbb{P}[Z_i = 1] \sim \theta_i$ also. In order to exploit the divisibility properties of the distributions of the random variables $Z_i$ that occur in many of the classical examples, it is convenient first to introduce some further notation.

As in [ABT], we suppose that the random variables $Z_i$ can be written as sums $Z_i := \sum_{j=1}^{r_i} Z_{ij}$, where the random variables $(Z_{ij},\ i \ge 1,\ 1 \le j \le r_i)$ are all independent, and, for each $i$, the $Z_{ij}$, $1 \le j \le r_i$, are identically distributed. This can always be taken to be the case, by setting $r_i = 1$, but $r_i$ could be chosen arbitrarily large if $Z_i$ were infinitely divisible, and the bounds that we obtain may be smaller if the $r_i$ can be chosen to be large. We then define

$\varepsilon_{ik} :=$

We now specify our analogue of (1.2). Clearly, assuming $\mu_i \to 0$ yields random variables $Z_i$ that mostly only take the values 0 or 1, but we also need some regularity among the $\theta_i$. To make this precise, we define

and assume that it converges to zero, for some $\theta > 0$, as $m \to \infty$. Note that this already implies that

$$\theta_{\sup} := \sup_{i\ge1}\theta_i < \infty. \tag{3.3}$$

In addition, we need to be able to apply Theorem 2.1, with $iZ_i$ for $X_i$ and with $T_{a,n}$ for $S_{a,n}$. For this, we need practical conditions on the $\theta_i$ which imply that (2.4) is satisfied for $D$ as in (2.5), for some $r,s,k$ and $\psi$. So we define $i_0 := \min\{i \ge 3\theta_{\sup}:\ \mu_i \le 1/2\}$, and set $E'(d,\psi) := \{i \ge i_0:\ \min(\theta_i, \theta_{i+d}) \ge \psi\}$. We now show that $E'(d,\psi) \subset E(d,\psi/8) \cap [i_0,\infty)$, so that, if (2.4) is

thus also

$$\mathbb{P}[Z_i = 1] \ge \frac{\theta_i}{2i}\,\mathbb{P}[Z_i = 0] \ge \frac{\theta_i}{4i},$$

and the lemma follows.

Using these preparations, we can now state relatively straightforward conditions under which a decomposable combinatorial structure can be said to be quasi–logarithmic. Our simplest condition is the following.

Definition 3.2. We say that a decomposable combinatorial structure satisfies the quasi–logarithmic condition QLC if it satisfies the Conditioning Relation (1.1), if

$$\lim_{i\to\infty}\mu_i = 0; \qquad \lim_{m\to\infty}\delta(m,\theta) = 0 \quad\text{for some } \theta > 0,$$

and if, for some $r,s$ coprime, $\psi > 0$ and $D$ defined in (2.5), (2.4) is satisfied with $E'$ for $E$.

For quantitative estimates, a slightly stronger assumption is useful.

Definition 3.3. We say that a decomposable combinatorial structure satisfies the quasi–logarithmic condition QLC2 if it satisfies the Conditioning Relation (1.1), if

$$\mu_i = O(i^{-\alpha}); \qquad \delta(m,\theta) = O(m^{-\beta}) \quad\text{for some } \theta,\alpha,\beta > 0,$$

and if, for some $r,s$ coprime, $\psi > 0$ and $D$ defined in (2.5), (2.4) is satisfied with $E'$ for $E$.

Under such conditions, we now prove the close link between $\mathcal{L}(n^{-1}T_{a,n})$ and $P_\theta$. Our method of proof involves showing first that $\mathcal{L}(T_{0,n})$ is close to the compound Poisson distribution $\mathrm{CP}(\theta,n) := \mathcal{L}(\sum_{i=1}^{n} iZ_i^*)$, where the $Z_i^* \sim \mathrm{Po}(i^{-1}\theta)$ are independent; the closeness of $n^{-1}\mathrm{CP}(\theta,n)$ and $P_\theta$ is already known [ABT, Theorems 11.10 and 12.11], and the Wasserstein distance between $\mathcal{L}(n^{-1}T_{0,n})$ and $\mathcal{L}(n^{-1}T_{a,n})$ is at most $n^{-1}\sum_{i=1}^{a}\theta_i$.
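For Poisson distributed $Z_i$, the distribution of $T_{0,n}$ can be computed exactly, which makes it possible to look at the effect of oscillating $\theta_i$ numerically. The sketch below uses the size-biasing recursion $k\,p_k = \sum_{i\le \min(k,n)} \theta_i\,p_{k-i}$ for $p_k = \mathbb{P}[\sum_i iZ_i = k]$ with $Z_i \sim \mathrm{Po}(\theta_i/i)$, and computes both the distance to $\mathrm{CP}(\theta,n)$ and the unit-shift distance $d_{TV}(\mathcal{L}(T), \mathcal{L}(T+1))$. The Poisson assumption, the sample sequence $\theta_i = 1 + \cos(2\pi\cdot 0.3\,i)$, the truncation level and the function names are all illustrative assumptions.

```python
from math import cos, exp, pi


def weighted_poisson_sum_pmf(thetas, kmax):
    """pmf on {0..kmax} of T = sum_i i*Z_i with Z_i ~ Po(thetas[i-1] / i)."""
    n = len(thetas)
    p = [0.0] * (kmax + 1)
    p[0] = exp(-sum(t / i for i, t in enumerate(thetas, start=1)))
    for k in range(1, kmax + 1):
        # Size-biasing recursion: k * p_k = sum_i theta_i * p_{k-i}.
        p[k] = sum(thetas[i - 1] * p[k - i] for i in range(1, min(k, n) + 1)) / k
    return p


def total_variation(p, q):
    """Total variation distance between two pmfs on {0..kmax}."""
    return 0.5 * sum(abs(a - b) for a, b in zip(p, q))


def unit_shift_tv(p):
    """d_TV(L(T), L(T+1)), ignoring the mass of T beyond the truncation."""
    inner = sum(abs(p[k] - p[k - 1]) for k in range(1, len(p)))
    return 0.5 * (p[0] + inner + p[-1])


if __name__ == "__main__":
    n, kmax, theta = 60, 600, 1.0
    constant = [theta] * n
    oscillating = [theta + cos(2 * pi * 0.3 * i) for i in range(1, n + 1)]
    p_const = weighted_poisson_sum_pmf(constant, kmax)
    p_osc = weighted_poisson_sum_pmf(oscillating, kmax)
    print("d_TV to CP(theta, n):", total_variation(p_const, p_osc))
    print("unit-shift distance :", unit_shift_tv(p_osc))
```

The recursion is the probabilistic content of the Stein identity used in this section: it expresses $k\,p_k$ through the $\theta_i$ and the shifted probabilities $p_{k-i}$, so replacing $\theta_i$ by $\theta$ costs little whenever neighbouring point probabilities are close.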

To bound the distance between $\mathcal{L}(T_{a,n})$ and $\mathrm{CP}(\theta,n)$, we use Stein's method (Barbour, Chen & Loh 1992). For any Lipschitz test function $f: \mathbb{Z}_+ \to \mathbb{R}$, one expresses $f$ in the form

$$f(j) - \mathrm{CP}(\theta,n)\{f\} = \theta\sum_{i=1}^{n} g_f(j+i) - j\,g_f(j), \tag{3.4}$$

for an appropriate function $g_f$ [ABT, Chapter 9.1]. Hence, for instance, the Wasserstein distance between $\mathcal{L}(T_{a,n})$ and $\mathrm{CP}(\theta,n)$ can be estimated by bounding

$$\mathbb{E}\Bigl\{\theta\sum_{i=1}^{n} g_f(T_{a,n}+i) - T_{a,n}\,g_f(T_{a,n})\Bigr\}
= \sum_{i=a+1}^{n} \mathbb{E}\bigl\{\theta g_f(T_{a,n}+i) - iZ_i\,g_f(T_{a,n})\bigr\} + \sum_{i=1}^{a} \theta\,\mathbb{E}g_f(T_{a,n}+i), \tag{3.5}$$

uniformly for Lipschitz functions $f \in \mathrm{Lip}_1$, for which functions $\|g_f\| \le 1$ [ABT, (9.14)]. The right

First, we re-express the element

of (3.5) by observing that

$\mathbb{E}\{iZ_{il}\,g_f(T_{a,n})\} = \theta_i$

With the help of these estimates, we can prove the following approximation theorem. We use the notation $D^1(T)$ to denote $d_{TV}(\mathcal{L}(T), \mathcal{L}(T+1))$; $D^1(T_{a,n})$ appears in the bound (3.9) by way of (3.11), and is controlled by applying Theorem 2.1 with $iZ_i$ for $X_i$.

Theorem 3.4. With the definitions above,


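Proof. We first consider $d_W\bigl(\mathcal{L}(T_{a,n}),\ \mathrm{CP}(\theta,n)\bigr)$, for which we bound the quantities appearing in (3.5), as addressed in (3.6)–(3.8). The contribution from (3.8) is immediate. Then, defining

$$\theta^* := \max\{1,\ \theta,\ \sup_{i\ge1}\theta_i\} \quad\text{and}\quad \sigma^*_n := \sum_{i=1}^{n}\max\Bigl\{\mu_i,\ \frac{1}{i r_i}\Bigr\},$$

we can easily bound the third element in (3.6) by $\theta^*\sigma^*_n\|g_f\|$, and the second, using (3.7), contributes at most $4\theta^*\sigma^*_n\|g_f\|$, since also $i r_i \ge 1$. For the first term, we use Lemma 5.2(i) to give

$$\Bigl|\sum_{i=1}^{n}(\theta - \theta_i)\,\mathbb{E}g_f(T_{a,n}+i)\Bigr| \le \bigl\{2\theta^* m + n\delta(m,\theta) + \tfrac14\theta^* mn\,D^1(T_{a,n})\bigr\}\|g_f\|.$$

In all, and using $\|g_f\| \le 1$, this gives the bound

$$d_W\bigl(\mathcal{L}(T_{a,n}),\ \mathrm{CP}(\theta,n)\bigr) \le n\varepsilon_1(n,a,m), \tag{3.10}$$

with

$$\varepsilon_1(n,a,m) := \tfrac14\theta^* m\,D^1(T_{a,n}) + \delta(m,\theta) + n^{-1}\theta^*\{5\theta^*\sigma^*_n + 2m + a\}. \tag{3.11}$$

This bound, together with the inequality

$$d_W\bigl(\mathcal{L}(n^{-1}T_{a,n}),\ P_\theta\bigr) \le n^{-1}d_W\bigl(\mathcal{L}(T_{a,n}),\ \mathrm{CP}(\theta,n)\bigr) + d_W\bigl(n^{-1}\mathrm{CP}(\theta,n),\ P_\theta\bigr),$$

now gives the required estimate, since

$$d_W\bigl(n^{-1}\mathrm{CP}(\theta,n),\ P_\theta\bigr) \le n^{-1}(1+\theta)^2;$$

see [ABT, Theorem 11.10].

If QLC holds, $D^1(T_{a,n}) = O(\{(a+1)/n\}^{\gamma})$ for some $\gamma > 0$, and choosing $m = m_n$ tending to infinity slowly enough ensures that $\varepsilon_1(n,a_n,m_n) \to 0$. If QLC2 holds, choose $m$ to be an appropriate power of $\{(a+1)/n\}$.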

With a little more difficulty, one can prove the analogous local approximation to the distribution of $T_{a,n}$. This is the main tool for establishing the asymptotic behaviour of quasi-logarithmic combinatorial structures.

Theorem 3.5. For any $0 \le a \le n$ and any $r \ge 2a+1$, we have

$$\bigl|n\,\mathbb{P}[T_{a,n} = r] - p_\theta(r/n)\bigr| \le \min_{1\le m\le n}\varepsilon_5(n,a,m;r), \tag{3.12}$$

with $\varepsilon_5(n,a,m;r)$ as defined in (3.24) below. If QLC holds, it follows that $\sup_{r\ge nx}\bigl|n\,\mathbb{P}[T_{a_n,n} = r] - p_\theta(r/n)\bigr| \to 0$ for any $x > 0$ and any sequence $a_n = o(n)$. If QLC2

Proof. With $x := r/n$, we begin by writing

Using these bounds, we can bound the third element in (3.6) by

$\theta^*$

using (3.7), in a very similar way, giving

$2\theta^*$

Finally, the first element in (3.6) is bounded by Lemma 5.2(ii) as

Combining these estimates, we conclude that, for any $l \ge l_0$,

The next step is to bound the difference

$$\Delta_2(r) := \mathbb{P}[r-n \le T_{a,n} < r-a] - \mathrm{CP}(\theta,n)\{[r-n,\ r-a-1]\},$$

which can once again be accomplished by using (3.4) and (3.5). Since, for $f := 1_{[0,s-1]}$,

$$\|g_f\| \le (1+\theta)/(s+\theta),$$

by [ABT, Lemma 9.3], it follows as in the proof of (3.10) in the previous theorem that

$$\bigl|\mathbb{P}[T_{a,n} < s] - \mathrm{CP}(\theta,n)\{[0,\ s-1]\}\bigr| \le s^{-1}(1+\theta)n\varepsilon_1(n,a,m), \tag{3.17}$$

for any $s \ge 1$. For $2a < r \le n$, this gives

$$|\Delta_2(r)| \le (r-a)^{-1}(1+\theta)n\varepsilon_1(n,a,m) \le 2r^{-1}n(1+\theta)\varepsilon_1(n,a,m).$$

For $r > n$, two differences as in (3.17) are needed. The first is just as before; the second is bounded by

$$\varepsilon_3(n,a,m;r) := \min\bigl\{(r-n)^{-1}(1+\theta)n\varepsilon_1(n,a,m),\ \mathbb{P}[T_{a,n} < r-n] + \mathrm{CP}(\theta,n)\{[0,\ r-n-1]\}\bigr\}, \tag{3.18}$$

where the alternative is useful if $r$ is close to $n$. Now

$\mathrm{CP}(\theta,n)\{[0,j]\} \le$

from Lemma 5.3, and this in turn gives

$\mathbb{P}[T_{a,n} \le j] \le 2e^{1+\theta^*}$

optimizing with respect to $r$, gives

$$\varepsilon_3(n,a,m;r) \le 4e^{1+\theta^*}\{\varepsilon_1(n,a,m)\}^{\theta/(4+\theta)} =: \varepsilon_4(n,a,m).$$

Hence

for all $r \ge 2a+1$.

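The remainder of the estimate is concerned with comparing the density $p_\theta(r/n)$ with $nr^{-1}\theta\,\mathrm{CP}(\theta,n)\{[r-n,\ r-a-1]\}$. From [ABT, Theorem 11.12], it follows that

$$\bigl|\mathrm{CP}(\theta,n)\{[r-n,\ r-a-1]\} - P_\theta\{[r/n-1,\ (r-a)/n)\}\bigr| \le c(\theta)n^{-(\theta\wedge1)}, \tag{3.22}$$

for a constant $c(\theta)$, and then, from [ABT, (4.23) and (4.20)],

$$\begin{aligned}
\bigl|nr^{-1}\theta\,P_\theta\{[r/n-1,\ (r-a)/n)\} - p_\theta(r/n)\bigr|
&= nr^{-1}\theta\,\bigl|P_\theta\{[r/n-1,\ (r-a)/n)\} - P_\theta\{[r/n-1,\ r/n)\}\bigr| \\
&\le c'(\theta)\,nr^{-1}(a/n)^{(\theta\wedge1)}
\end{aligned} \tag{3.23}$$

so long as $r \ge 2a$. Combining (3.15), (3.21), (3.22) and (3.23), the theorem follows with

$$\varepsilon_5(n,a,m;r) := \frac{n}{r}\Bigl\{2\theta(1+\theta)nr^{-1}\varepsilon_1(n,a,m) + \varepsilon_2(n,a,m) + \theta\,\varepsilon_4(n,a,m) + c''(\theta)((a+1)/n)^{(\theta\wedge1)}\Bigr\}; \tag{3.24}$$

note that, for $2a \le n/2 \le r \le n$, the bound can be replaced by the uniform

$$\varepsilon'_5(n,a,m) := 2\bigl\{\varepsilon_2(n,a,m) + 4n^{-1}\theta(1+\theta)\varepsilon_1(n,a,m) + c''(\theta)((a+1)/n)^{(\theta\wedge1)}\bigr\}. \tag{3.25}$$

If QLC holds, $D^1(T_{a,n}) = O(\{(a+1)/n\}^{\gamma})$ for some $\gamma > 0$, and choosing $m = m_n$ tending to infinity slowly enough ensures that $\varepsilon_l(n,a_n,m_n) \to 0$ for $l = 1, 2$ and 4; this implies that $\varepsilon_5(n,a_n,m_n;r) \to 0$ uniformly in $r \ge nx$, for any $x > 0$. If QLC2 holds, choose $m$ to be an appropriate power of $\{(a+1)/n\}$.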

4 Quasi-logarithmic structures

In this section, we consider the two common properties shared by logarithmic combinatorial structures that were discussed in the Introduction, and show that they are also true for quasi-logarithmic structures. For each of the properties, the local approximation of $\mathbb{P}[T_{a,n} = r]$ in Theorem 3.5 is the fundamental relation from which everything else follows. Other aspects of the asymptotic behaviour of logarithmic combinatorial structures could be extended to quasi-logarithmic structures by analogous methods.

4.1 The size of the largest component

The following theorem is an extension of a result proved by Kingman (1977) in the case of $\theta$-tilted random permutations. A version for logarithmic structures can be found in [ABT, Theorem 7.13].

Theorem 4.1. Let

$$L^{(n)} := \max\{1 \le i \le n:\ C_i^{(n)} > 0\}$$

be the size of the largest component. Then, if QLC holds,

$\lim_{n\to\infty}\mathcal{L}\bigl(n^{-1}L^{(n)}\bigr)$

where $L$ is a random variable concentrated on $(0,1]$, whose distribution is given by the density function

where $\rho$ is Dickman's function (Dickman, 1930).

One can also prove local versions of the convergence theorem. However, they have to involve the particular sequence $\theta_i$, since, for instance, $\mathbb{P}[L^{(n)} = r] = 0$ if $\theta_r = 0$, because then $Z_r$, and hence also $C_r^{(n)}$, are zero a.s. A typical result is as follows.

Theorem 4.2. If QLC holds, then, for any $0 < x \le 1$ such that $1/x$ is not an integer, it follows that

$$\lim_{n\to\infty}\bigl|n\,\mathbb{P}[L^{(n)} = \lfloor nx\rfloor] - (\theta_{\lfloor nx\rfloor}/\theta)f_\theta(x)\bigr| = 0. \tag{4.5}$$

Under QLC2, the convergence rate is of order $O(n^{-\eta_4})$, for some $\eta_4 > 0$.

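Proof. Arguing as in the proof of the previous theorem,

$$\mathbb{P}[L^{(n)} = \lfloor nx\rfloor] = \prod_{i=\lfloor nx\rfloor+1}^{n}\mathbb{P}[Z_i = 0]\ \sum_{l=1}^{\lfloor n/\lfloor nx\rfloor\rfloor}\mathbb{P}[Z_{\lfloor nx\rfloor} = l]\,\frac{\mathbb{P}[T_{0,\lfloor nx\rfloor-1} = n - l\lfloor nx\rfloor]}{\mathbb{P}[T_{0,n} = n]}. \tag{4.6}$$

Now, from Theorem 3.5, the ratios

$$\frac{\mathbb{P}[T_{0,\lfloor nx\rfloor-1} = n - l\lfloor nx\rfloor]}{\mathbb{P}[T_{0,n} = n]}$$

are bounded as $n \to \infty$, uniformly for all $2 \le l \le \lfloor n/\lfloor nx\rfloor\rfloor$, provided that $1/x$ is not an integer, so that $1 - x\lfloor 1/x\rfloor > 0$. Then

$$\sum_{l\ge2}\mathbb{P}[Z_{\lfloor nx\rfloor} = l] \le r_{\lfloor nx\rfloor}\,\mathbb{P}[Z_{\lfloor nx\rfloor,1} \ge 2] + \binom{r_{\lfloor nx\rfloor}}{2}\bigl(\mathbb{P}[Z_{\lfloor nx\rfloor,1} = 1]\bigr)^2 \le \frac{1}{\lfloor nx\rfloor}\mu_{\lfloor nx\rfloor} + \frac{(\theta^*)^2}{2\lfloor nx\rfloor^2},$$

implying that $\lim_{n\to\infty} n\sum_{l\ge2}\mathbb{P}[Z_{\lfloor nx\rfloor} = l] = 0$. Hence the sum of the terms for $l \ge 2$ on the right hand side of (4.6) contributes an asymptotically negligible amount to the quantity $n\,\mathbb{P}[L^{(n)} = \lfloor nx\rfloor]$ as $n \to \infty$. For the $l = 1$ term in (4.6), both the product and the ratio of point probabilities are treated as in the proof of Theorem 4.1, giving the limit $x^{\theta-1}p_\theta((1-x)/x)/p_\theta(1)$, and

$$\bigl|\mathbb{P}[Z_{\lfloor nx\rfloor} = 1] - \theta_{\lfloor nx\rfloor}/\lfloor nx\rfloor\bigr| \le \mu_{\lfloor nx\rfloor}/\lfloor nx\rfloor,$$

so that

$$\lim_{n\to\infty}\bigl|n\,\mathbb{P}[L^{(n)} = \lfloor nx\rfloor] - x^{\theta-1}\{p_\theta((1-x)/x)/p_\theta(1)\}\,x^{-1}\theta_{\lfloor nx\rfloor}\bigr| = 0.$$

This completes the proof of (4.5). The remaining statement follows as usual, by taking greater care of the magnitudes of the errors in the various approximation steps.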

In order to relax the condition that $1/x$ is not integral, it is necessary to strengthen the assumptions a little; for example, if $x = 1/2$ and $n$ is even, the contribution from the $l = 2$ term in (4.6) is of order $O(\varepsilon_{n/2,2}\,n^{1-\theta})$, which could be large for $\theta < 1$. In order to get a limit of $f_\theta(x)$ without involving the individual values $\theta_i$, it is necessary to average the point probabilities over an interval

4.2 The spectrum of small components

We prove an analogue of the Kubilius fundamental lemma (Kubilius, 1964) for quasi-logarithmic structures, and thus extend results of Arratia et al. (1995) and [ABT, Theorem 7.7]; see also the corresponding result of Manstavičius (2009), proved under different conditions.

Theorem 4.3. For $a/n \le \alpha_0$, where $\alpha_0$ is small enough that $\min_{m\ge1}\varepsilon'_5(n,a,m) \le$

where the order of magnitude of $\varepsilon_6(n,a)$ is given in (4.16) below.

If QLC holds, then

for every non-negative integer sequence $a_n = o(n)$. If QLC2 holds, the convergence rate is of order $\{(a+1)/n\}^{\eta_5}$ for some $\eta_5 > 0$.

Proof. The proof is similar to that of [ABT, Theorem 5.2]. We fix an $n$ with the required properties, and we set $p_k := \mathbb{P}[T_{a,n} = k]$. Then the conditioning relation entails

We now separately bound the three terms in (4.8).

The first term is just

Now, from Theorem 3.5, using the bound given in (3.25), we have

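For the second term in (4.8), we have two bounds. First,

$$\sum_{l=\lfloor n/2\rfloor+1}^{n}\mathbb{P}[T_{0,a} = l]\,\frac{\mathbb{P}[T_{a,n} = n-l]}{\mathbb{P}[T_{0,n} = n]} \le \frac{n\max_{n/2\le l\le n}\mathbb{P}[T_{0,a} = l]}{n\,\mathbb{P}[T_{0,n} = n]},$$

where the denominator is uniformly bounded below whenever $\varepsilon'_5(n,a,m) \le \tfrac12 p_\theta(1)$, and, for $n/2 \le l \le n$ and $a \le n/4$,

$$n\,\mathbb{P}[T_{0,a} = l] \le 2l\,\mathbb{P}[T_{0,a} = l] \le 2\theta^*\,\mathbb{P}[T_{0,a} \ge n/4] + 2\varepsilon_2(a,0,m) \le 8(\theta^*)^2(a/n) + 2\varepsilon_2(a,0,m), \tag{4.12}$$

from (3.15), with the last step following because $\mathbb{E}T_{0,a} \le a\theta^*$. The second bound is given by

$$\sum_{l=\lfloor n/2\rfloor+1}^{n}\mathbb{P}[T_{0,a} = l]\,\frac{\mathbb{P}[T_{a,n} = n-l]}{\mathbb{P}[T_{0,n} = n]} \le \mathbb{P}[T_{0,a} \ge n/2]\,\frac{\sup_{j\in\mathbb{Z}_+}\mathbb{P}[T_{a,n} = j]}{\mathbb{P}[T_{0,n} = n]} \le \frac{2a\theta^* D^1(T_{a,n})}{n\,\mathbb{P}[T_{0,n} = n]}, \tag{4.13}$$

again using Markov's inequality, and the asymptotically important part is $aD^1(T_{a,n})$. Thus the second term in (4.8) is of order

$$O\Bigl(n^{-1}a + \min\bigl\{\min_{m\ge1}\varepsilon_2(a,0,m),\ aD^1(T_{a,n})\bigr\}\Bigr). \tag{4.14}$$

Finally, the third term in (4.8) can be simply bounded from above by

$$2n^{-1}\mathbb{E}T_{0,a} \le 2\theta^* n^{-1}a. \tag{4.15}$$

Combining (4.11), (4.14) and (4.15) proves the first part of the theorem, with

$$\varepsilon_6(n,a) = O\Bigl(n^{-1}a + \min_{m\ge1}\varepsilon'_5(n,a,m) + \min\bigl\{\min_{m\ge1}\varepsilon_2(a,0,m),\ aD^1(T_{a,n})\bigr\}\Bigr). \tag{4.16}$$

The remaining statements follow as usual.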

4.3 Additive arithmetic semigroups

We now return to the example given in the Introduction, of a quasi-logarithmic combinatorial structure that is not logarithmic. In Knopfmacher's (1979) additive arithmetic semigroups, the elements of norm $n$ can be decomposed into prime elements, with $C_i^{(n)}$ the number of prime elements of norm $i$ in the decomposition of a randomly chosen element of norm $n$. The joint distribution of $(C_1^{(n)}, \ldots, C_n^{(n)})$ satisfies the conditioning relation, with $Z_i \sim \mathrm{NB}(p(i), q^{-i})$, so that

$$\mathbb{P}[Z_i = k] := \binom{p(i)+k-1}{k}q^{-ik}(1-q^{-i})^{p(i)}, \quad k \in \mathbb{Z}_+,$$

with the convention that $Z_i = 0$ if $p(i) = 0$. Here, $p(i)$ denotes the total number of prime elements of norm $i$, and $q > 1$ enters through the assumption that the number $g(n)$ of elements of size $n$ satisfies

$g(n) = q^n\sum_{j=1}^{r}$

for real numbersρ1<· · ·< ρrandc1, . . . ,cr, withρr>0,cr>0, and withγ >1, an analogue of a

condition under which Beurling (1937) examined prime number theorems of so called generalized integers. In particular, Zhang (1996, Theorem 6.2) shows, under condition (4.17) withγ >2, that

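$\theta_i := i\,\mathbb{E}Z_i = i\,p(i)\,q^{-i}/(1-q^{-i}) = \theta'_i + o(1)$, where $\{\theta'_i\}_{i\in\mathbb{N}}$ is the integer skeleton of a sinusoidal mixture function

\[
\theta'_t \;:=\; \theta + \sum_{l=1}^{L}\lambda_l\cos(2\pi f_l t - \varphi_l), \qquad t\in\mathbb{R},
\tag{4.18}
\]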

with $\theta := \rho_r > 0$, amplitudes $\lambda_l > 0$ such that $\sum_{l=1}^{L}\lambda_l \le \theta$, non-integral frequencies $f_l \in [0,\infty)\setminus\mathbb{Z}_+$ and phases $0 \le \varphi_l < 2\pi$; note that his arguments and notation are non-probabilistic. In many examples, the sum of cosines is empty ($L = 0$), and the structure is logarithmic. When this is the case, the asymptotic behaviour of the small and large components is as described in the Introduction: see Arratia et al. (2005) for these and other results. Here, we are interested in establishing asymptotics in the case when $L \ge 1$.
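As a numerical illustration of the quantities just introduced, the following sketch evaluates the negative binomial probabilities $P[Z_i = k]$, checks that $i\,\mathbb{E}Z_i$ agrees with $i\,p(i)q^{-i}/(1-q^{-i})$, and evaluates the sinusoidal mixture (4.18) at integer arguments. The function names and all numerical values ($p(i) = 5$, $q = 2$, the amplitudes, frequencies and phases) are hypothetical choices made here for illustration; they are not taken from any particular semigroup.

```python
import math

def nb_pmf(k, p, x):
    """P[Z = k] for Z ~ NB(p, x); Z is identically 0 when p == 0."""
    if p == 0:
        return 1.0 if k == 0 else 0.0
    return math.comb(p + k - 1, k) * x**k * (1.0 - x)**p

def theta_from_primes(i, p_i, q):
    """theta_i = i * E[Z_i] = i * p(i) * q**(-i) / (1 - q**(-i))."""
    x = q ** (-i)
    return i * p_i * x / (1.0 - x)

def theta_prime(t, theta, lams, freqs, phases):
    """Sinusoidal mixture (4.18) evaluated at t."""
    return theta + sum(lam * math.cos(2.0 * math.pi * f * t - ph)
                       for lam, f, ph in zip(lams, freqs, phases))

# Hypothetical values, for illustration only.
i, p_i, q = 3, 5, 2.0
x = q ** (-i)
# The series are truncated at k = 199; the neglected tail is far below rounding error here.
total_mass = sum(nb_pmf(k, p_i, x) for k in range(200))
mean_z = sum(k * nb_pmf(k, p_i, x) for k in range(200))
print(total_mass, i * mean_z, theta_from_primes(i, p_i, q))
print([round(theta_prime(t, 1.0, [0.5], [0.25], [0.0]), 6) for t in range(1, 5)])
```

The first printed line compares the total mass and the two expressions for $\theta_i$; the second lists a few values of the integer skeleton for a single cosine term.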

First, note that the same sequence $\theta'_i$, for integral $i$, is obtained if each $f_l$ is replaced by its fractional part $f_l - \lfloor f_l\rfloor$, so that the values of $f_l$ can be taken to lie in $(0,1)$; and then that, if $f_l > 1/2$, it can first be replaced by $f_l - 1$, and then by $1 - f_l$ if also $\varphi_l$ is replaced by $-\varphi_l$, again without changing the $\theta'_i$. Hence we may assume that $f_l \in (0,1/2]$ for all $l$.
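The reduction of the frequencies to $(0,1/2]$ can be checked numerically. The sketch below, with hypothetical parameter values and helper names chosen here, maps each pair $(f_l,\varphi_l)$ to its reduced form and confirms that the integer skeleton is unchanged up to rounding error.

```python
import math

def theta_prime(i, theta, lams, freqs, phases):
    """Sinusoidal mixture (4.18) evaluated at the integer i."""
    return theta + sum(lam * math.cos(2.0 * math.pi * f * i - ph)
                       for lam, f, ph in zip(lams, freqs, phases))

def reduce_frequency(f, phase):
    """Return an equivalent (frequency, phase) pair with frequency in (0, 1/2]."""
    f = f - math.floor(f)            # fractional part; f is non-integral, so f is in (0, 1)
    if f > 0.5:
        # cos(2*pi*(f-1)*i - phase) = cos(2*pi*(1-f)*i + phase) for integer i
        f, phase = 1.0 - f, -phase
    return f, phase % (2.0 * math.pi)  # keep the phase in [0, 2*pi)

# Hypothetical parameters, for illustration only.
theta = 2.0
lams = [0.7, 0.5, 0.3]
freqs = [2.7, 0.2, 5.75]
phases = [1.0, 4.0, 0.3]

reduced = [reduce_frequency(f, ph) for f, ph in zip(freqs, phases)]
red_freqs = [f for f, _ in reduced]
red_phases = [ph for _, ph in reduced]

max_diff = max(abs(theta_prime(i, theta, lams, freqs, phases)
                   - theta_prime(i, theta, lams, red_freqs, red_phases))
               for i in range(1, 51))
print(red_freqs, max_diff)
```

The equality holds only at integer arguments, which is all that the skeleton $\{\theta'_i\}$ requires.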

Clearly, for $L \ge 1$, the sequence $\theta'_i$ is in general not convergent in the usual sense, but, in view of the properties of trigonometric functions,

\[
\Bigl|\sum_{t=i_1+1}^{i_2}\cos(2\pi f_l t - \varphi_l)\Bigr| \;\le\; \frac{1}{\sin \pi f_l},
\]

whatever the values of $i_1, i_2$, so that $\delta(m,\theta) = O(m^{-1}) \to 0$ as $m \to \infty$; and $\mu_i = O(q^{-i})$ as $i \to \infty$.
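The bound on the cosine sums, and the resulting $O(m^{-1})$ decay of the block averages, can be illustrated numerically. The sketch below works with the skeleton $\theta'_i$ rather than with $\theta_i$ itself, truncates the supremum over $j$ to finitely many blocks, and uses hypothetical parameter values; it compares the truncated $\delta(m,\theta)$ with the envelope $m^{-1}\sum_l \lambda_l/\sin(\pi f_l)$ that follows from the displayed inequality.

```python
import math

def cos_block_sum(f, phase, i1, i2):
    """Sum of cos(2*pi*f*t - phase) over t = i1+1, ..., i2."""
    return sum(math.cos(2.0 * math.pi * f * t - phase) for t in range(i1 + 1, i2 + 1))

def theta_prime(i, theta, lams, freqs, phases):
    """Sinusoidal mixture (4.18) evaluated at the integer i."""
    return theta + sum(lam * math.cos(2.0 * math.pi * f * i - ph)
                       for lam, f, ph in zip(lams, freqs, phases))

def delta_truncated(m, theta, lams, freqs, phases, blocks=200):
    """Max over j = 0..blocks-1 of |(1/m) * sum_{i=1}^m theta'_{jm+i} - theta|."""
    return max(
        abs(sum(theta_prime(j * m + i, theta, lams, freqs, phases)
                for i in range(1, m + 1)) / m - theta)
        for j in range(blocks))

# Hypothetical parameters, for illustration only.
theta, lams, freqs, phases = 1.0, [0.6, 0.4], [0.3, 0.2], [0.5, 2.0]
envelope = sum(lam / math.sin(math.pi * f) for lam, f in zip(lams, freqs))

for m in (5, 10, 20, 40):
    print(m, delta_truncated(m, theta, lams, freqs, phases), envelope / m)
```

Each printed row shows the truncated block deviation next to its envelope, so the $1/m$ decay can be read off directly.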

Hence the condition QLC2 is satisfied by these structures if, for some coprime $r, s$, some $k \in \mathbb{N}$ and some $\psi > 0$, the set $\{i : \min(\theta_i, \theta_{i+d}) \ge \psi\}$ has at least one element in each $k$-interval $\{jk+1,\dots,(j+1)k\}$ for all $d \in D := \{r\}\cup\{s2^{g},\ g\in\mathbb{Z}_+\}$, for all $j$ sufficiently large: see the comment following Theorem 2.1.

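Now, if $\sum_{l=1}^{L}\lambda_l < \theta$, all the $\theta'_i$ are uniformly bounded below by $\psi_1 := \theta - \sum_{l=1}^{L}\lambda_l > 0$, and the condition QLC2 is clearly satisfied with any choice of $r$, $s$ and $k$ if $\psi = \tfrac12\psi_1$. If $\sum_{l=1}^{L}\lambda_l = \theta$, define

\[
V_l(i) \;:=\; \min_{n\in\mathbb{Z}}\,\bigl|2\pi f_l i - \varphi_l - (2n+1)\pi\bigr|,
\tag{4.19}
\]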

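and suppose that

\[
V_l(i) \;\le\; \delta_l := \pi\min\{f_l,\,1-2f_l\}
\]

for some pair $l, i$. Then, if $n_* := n_*(l,i)$ attains the minimum in (4.19), it is immediate that

\[
2\pi f_l(i+r) - \varphi_l - (2n_*+1)\pi \;\ge\; \pi f_l \;\ge\; \delta_l
\]

for all $r \ge 1$. On the other hand,

\[
(2(n_*+1)+1)\pi - \bigl(2\pi f_l(i+r) - \varphi_l\bigr) \;\ge\; 2\pi - 2r\pi f_l - \pi(1-2f_l) \;\ge\; \pi(1-2f_l) \;\ge\; \delta_l,
\]

if $r \le 2$. Hence, if $V_l(i) \le \delta_l$, then $V_l(i+r) \ge \delta_l$ for $r = 1, 2$. Setting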

\[
L_i \;:=\; \{\,l : 1\le l\le L,\ V_l(i) \ge \delta_l\,\},
\]

this implies that, for each $i$,

\[
L_i^{c} \;\subset\; L_{i+1}\cap L_{i+2}.
\tag{4.20}
\]

Now it follows from the inequality $1 - \cos x \ge c\,x^{2}$ in $|x| \le \pi/3$, with $2c = 1 - \pi^{2}/108$, that

\[
\theta'_i \;\ge\; c\sum_{l\in L_i}\lambda_l\delta_l^{2}.
\tag{4.21}
\]
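The elementary inequality used here can be verified by a routine Taylor estimate, given as a sketch. Since $\cos x \le 1 - x^{2}/2 + x^{4}/24$ for all real $x$, and $x^{2} \le \pi^{2}/9$ when $|x| \le \pi/3$,

\[
1-\cos x \;\ge\; \frac{x^{2}}{2}-\frac{x^{4}}{24}
\;=\; \frac{x^{2}}{2}\Bigl(1-\frac{x^{2}}{12}\Bigr)
\;\ge\; \frac{x^{2}}{2}\Bigl(1-\frac{\pi^{2}}{108}\Bigr)
\;=\; c\,x^{2}, \qquad |x|\le \pi/3 .
\]

The inequality is applicable with $x = \delta_l$ because $\delta_l = \pi\min\{f_l, 1-2f_l\} \le \pi/3$ for every $f_l \in (0,1/2]$.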

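Define $\psi_2 := \tfrac12 c\sum_{l=1}^{L}\lambda_l\delta_l^{2}$. Then if, for some $i$, $\theta'_i \le \psi_2$, we have $c\sum_{l\notin L_i}\lambda_l\delta_l^{2} \ge \psi_2$, and it follows from the above considerations that

\[
\theta'_{i+r} \;\ge\; c\sum_{l\notin L_i}\lambda_l\delta_l^{2} \;\ge\; \psi_2, \qquad r = 1, 2,
\]

in view of (4.20) and (4.21). Thus every interval of length $k = 3$ far enough from the origin contains an index $i$ with $\min(\theta_i,\theta_{i+d}) \ge \tfrac12\psi_2$, whatever the value of $d$. Hence the condition QLC2 is satisfied for any choice of coprime $r, s$, provided that $\psi_2 > 0$.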
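The key implication of this argument, that $\theta'_i \le \psi_2$ forces $\theta'_{i+1} \ge \psi_2$ and $\theta'_{i+2} \ge \psi_2$ in the boundary case $\sum_l \lambda_l = \theta$, can be checked numerically. The sketch below uses hypothetical amplitudes, frequencies and phases chosen here, works with the skeleton $\theta'_i$ only, and counts violations over a finite range; it is an illustration, not a substitute for the proof.

```python
import math

C = (1.0 - math.pi**2 / 108.0) / 2.0   # the constant c, with 2c = 1 - pi^2/108

def theta_prime(i, lams, freqs, phases):
    """Mixture (4.18) in the boundary case theta = sum of the amplitudes."""
    theta = sum(lams)
    return theta + sum(lam * math.cos(2.0 * math.pi * f * i - ph)
                       for lam, f, ph in zip(lams, freqs, phases))

def psi2(lams, freqs):
    """psi_2 = (c/2) * sum_l lambda_l * delta_l^2, delta_l = pi * min(f_l, 1 - 2 f_l)."""
    deltas = [math.pi * min(f, 1.0 - 2.0 * f) for f in freqs]
    return 0.5 * C * sum(lam * d * d for lam, d in zip(lams, deltas))

def count_violations(lams, freqs, phases, n_max, tol=1e-12):
    """Count i <= n_max with theta'_i <= psi_2 but theta'_{i+1} or theta'_{i+2} < psi_2."""
    p2 = psi2(lams, freqs)
    th = [theta_prime(i, lams, freqs, phases) for i in range(1, n_max + 3)]
    bad = 0
    for idx in range(n_max):
        if th[idx] <= p2 and min(th[idx + 1], th[idx + 2]) < p2 - tol:
            bad += 1
    return bad

# Hypothetical parameters with frequencies in (0, 1/2], for illustration only.
lams = [0.6, 0.4]
freqs = [1.0 / math.pi, math.sqrt(2.0) / 4.0]
phases = [0.5, 2.0]

violations = count_violations(lams, freqs, phases, 2000)
print(psi2(lams, freqs), violations)
```

When every $f_l$ equals $1/2$, `psi2` returns zero, which corresponds to the degenerate situation that has to be treated separately.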

There remains the possibility that $f_l = 1/2$ for all $1 \le l \le L$, in which case $\psi_2 = 0$. If any of the $\varphi_l$ are not multiples of $\pi$, the function $\theta'_i$ is once again uniformly bounded away from 0, and the same is true if one is an even multiple of $\pi$ and another an odd one. Hence there are only two cases in which QLC2 is not satisfied:

\[
\theta'_t := \theta\{1+\cos(\pi t)\} \quad\text{and}\quad \theta'_t := \theta\{1+\cos(\pi(t-1))\}.
\]

In the former case, $\theta'_i > 0$ only for even $i$, and if $P[Z_i = 0] = 1$ for all odd $i$ then $d_{TV}(\mathcal{L}(T_{0,n}),\mathcal{L}(T_{0,n}+1)) = 1$ for all $n$; hence, for instance, Theorem 3.5 cannot be expected to be true. In the latter case, we can take $D := \{2g+1,\ g \ge 0\}$, and use Theorem 2.2 to show that $d_{TV}(\mathcal{L}(T_{a,n}),\mathcal{L}(T_{a,n}+1)) = O(\{(a+1)/n\}^{\eta})$ for some $\eta > 0$; the rest of the argument is then as before.

5 Technical bounds

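Here we collect some technical results that are needed to smooth out the irregularities in the sequence of $\theta_i$'s. Let $\theta > 0$, and let $\{\theta_i\}_{i\in\mathbb{N}}$ be any non-negative sequence. Let $\theta^{*} := \sup_{i\in\mathbb{N}}\theta_i$ and $\theta^{*\prime} := \theta\vee\theta^{*}$. For every $m, n \in \mathbb{N}$ we set

\[
\delta(m,\theta) \;:=\; \sup_{j\ge 0}\,\Bigl|\frac{1}{m}\sum_{i=1}^{m}\theta_{jm+i} - \theta\Bigr|.
\]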

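Lemma 5.1. Let $\{y_i\}_{i\in\mathbb{N}}$ be a real-valued sequence, and $0 \le l < n$. Then

\[
\Bigl|\sum_{i=l+1}^{n}(\theta_i-\theta)\,y_i\Bigr| \;\le\;
\begin{cases}
\bigl(2m\theta^{*\prime} + n\,\delta(m,\theta)\bigr)\,\|y\| + \theta^{*}(nm/8)\,\|\Delta y\|; \\[4pt]
2m\theta^{*\prime}\|y\| + \delta(m,\theta)\sum_{i=1}^{n}|y_i|
 + m^{-1}\theta^{*}\sum_{l=1}^{m}\sum_{l'=1}^{m}\sum_{j=1}^{\lfloor n/m\rfloor-1}|y_{jm+l}-y_{jm+l'}|,
\end{cases}
\]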


Proof. For any $0 \le j \le \lfloor n/m\rfloor$, we have

as for (3.14), and that


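Proof. Choosing $y_i := 1/i$, the second inequality in Lemma 5.1 gives

\[
\Bigl|\sum_{i=l+1}^{n}\frac{\theta_i-\theta}{i}\Bigr|
\;\le\; \frac{2m}{l} + \delta(m,\theta)\sum_{i=l+1}^{n}\frac{1}{i}
 + \theta^{*}\sum_{j=\lfloor l/m\rfloor}^{\lfloor n/m\rfloor-1}\frac{1}{j(j+1)}
\;\le\; 2(1+\theta^{*})\frac{m}{l} + \delta(m,\theta)\sum_{i=l+1}^{n}\frac{1}{i},
\]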

and the lemma follows.
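The last inequality in the display rests on a telescoping sum. As a sketch, and assuming $l \ge m$ (an assumption made here so that $\lfloor l/m\rfloor \ge 1$), write $J := \lfloor l/m\rfloor$ and $K := \lfloor n/m\rfloor$; then

\[
\sum_{j=J}^{K-1}\frac{1}{j(j+1)} \;=\; \sum_{j=J}^{K-1}\Bigl(\frac{1}{j}-\frac{1}{j+1}\Bigr) \;=\; \frac{1}{J}-\frac{1}{K} \;\le\; \frac{1}{\lfloor l/m\rfloor} \;\le\; \frac{2m}{l},
\]

where the final step uses $\lfloor x\rfloor \ge x/2$ for $x \ge 1$.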

The authors wish to thank the referees for a number of helpful suggestions.

References

Arratia, R., A. D. Barbour, and S. Tavaré. 2003. Logarithmic combinatorial structures: a probabilistic approach, EMS Monographs in Mathematics, European Mathematical Society (EMS), Zürich. MR2032426

Arratia, R., A. D. Barbour, and S. Tavaré. 2005. A probabilistic approach to analytic arithmetic on algebraic function fields, Math. Proc. Cambridge Philos. Soc. 139, no. 1, 1–26.

Arratia, R., D. Stark, and S. Tavaré. 1995. Total variation asymptotics for Poisson process approximations of logarithmic combinatorial assemblies, Ann. Probab. 23, no. 3, 1347–1388.

Barbour, A. D., L. H. Y. Chen, and W.-L. Loh. 1992. Compound Poisson approximation for nonnegative random variables via Stein's method, Ann. Probab. 20, no. 4, 1843–1866.

Barbour, A. D. and B. L. Granovsky. 2005. Random combinatorial structures: the convergent case, J. Combin. Theory Ser. A 109, no. 2, 203–220.

Beurling, A. 1937. Analyse de la loi asymptotique de la distribution des nombres premiers généralisés, Acta Math. 68, 255–291.

Chung, F. and L. Lu. 2006. Concentration inequalities and martingale inequalities: a survey, Internet Mathematics 3, 79–127.

Dickman, K. 1930. On the frequency of numbers containing prime factors of a certain relative magnitude, Ark. Math. Astr. Fys. 22, no. 10, 1–14.

Freiman, G. A. and B. L. Granovsky. 2002. Asymptotic formula for a partition function of reversible coagulation-fragmentation processes, Israel J. Math.130, 259–279.

Kingman, J. F. C. 1977. The population structure associated with the Ewens sampling formula, Theoret. Population Biology 11, no. 2, 274–283.

Knopfmacher, J. 1979. Analytic arithmetic of algebraic function fields, Lecture Notes in Pure and Applied Mathematics, vol. 50, Marcel Dekker Inc., New York.

Kubilius, J. 1964. Probabilistic methods in the theory of numbers, Translations of Mathematical Monographs, Vol. 11, American Mathematical Society, Providence, R.I.

Lindvall, T. 2002. Lectures on the coupling method, Dover Publications Inc., Mineola, NY.

Manstavičius, E. 2009. Strong convergence on weakly logarithmic combinatorial assemblies, arXiv:0903.1051.

Mattner, L. and B. Roos. 2007. A shorter proof of Kanter's Bessel function concentration bound, Prob. Theory Rel. Fields 139, 191–205.

Mineka, J. 1973. A criterion for tail events for sums of independent random variables, Z. Wahrscheinlichkeitstheorie und Verw. Gebiete 25, 163–170.


Rogers, L. C. G. 1999. Fastest coupling of random walks, J. London Math. Soc. 60, 630–640.

Vervaat, W. 1972. Success epochs in Bernoulli trials with applications in number theory, Mathematical Centre Tracts, vol. 42, Mathematisch Centrum, Amsterdam.