getdoc1642. 142KB Jun 04 2011 12:04:08 AM

(1)

in PROBABILITY

LARGE DEVIATIONS AND QUASI-POTENTIAL

OF A FLEMING-VIOT PROCESS

SHUI FENG1

Department of Mathematics and Statistics, McMaster University, Hamilton, ONT, Canada L8S 4K1

email: [email protected]

JIE XIONG2

Department of Mathematics, University of Tennessee, Knoxville, TN 37996-1300 email: [email protected]

submitted December 13, 2001Final version accepted January 7, 2002 AMS 2000 Subject classification: Primary 60F10; secondary 92D10

Fleming-Viot process, large deviations, quasi-potential

Abstract

The large deviation principle is established for the Fleming-Viot process with neutral mutation when the process starts from a point on the boundary. Since the diffusion coefficient is de-generate on the boundary, the boundary behavior of the process is investigated in detail. This leads to the explicit identification of the rate function, the quasi-potential, and the structure of the effective domain of the rate function.

1 Introduction

Let E = [0,1], and M1(E) be the set of all probability measures on E equipped with the weak topology. C(E) is the set of all continuous functions on E. The setC2

b(R) contains all

functions on the real line R that possess bounded continuous derivatives upto second order. For anyθ >0 andν0 inM1(E), define

Af(x) = θ 2

Z

(f(y)−f(x))ν0(dy), f ∈C(E).

ThenAis clearly the generator of a Markov jump process on E. Let

D={F(µ) =f(hµ, ϕi) :f ∈Cb2(R), ϕ∈C(E), µ∈M1(E)}.

For anyǫ >0, consider the following operator on setD:

Lǫ_F_{(µ) =}_f′₍_h_{µ, ϕ}_i₎_h_{µ, Aϕ}_i₊ǫ

2

Z Z

f′′₍_h_{µ, ϕ}_i_{)ϕ(x)ϕ(y)Q(µ;}_{dx, dy),} _(1.1)

1RESEARCH SUPPORTED BY THE NATURAL SCIENCE AND ENGINEERING RESEARCH COUN-CIL OF CANADA

2RESEARCH SUPPORTED PARTIALLY BY NSA

(2)

whereQ(µ;dx, dy) =µ(dx)δx(dy)−µ(dx)µ(dy) andδxstands for the Dirac measure atx∈E.

The measure-valued process generated by Lǫ _{is called the Fleming-Viot process with neutral}

mutation. It models the evolution of the distribution of the genotypes in a population under the influence of mutation and replacement sampling. The setEis called the type space or the space of alleles, the operator A describes the generation independent mutation process with mutation rateθand mutation measureν0, and the last term describes the continuous sampling with sampling rateǫ.

WhenA is replaced by the generator of any other Feller process, the resulting process is still called a Fleming-Viot process. In fact in the original Fleming-Viot process,Ais the Laplacian operator. In the sequel, we will use the term Fleming-Viot (henceforth, FV) process solely for the neutral mutation case.

Let C([0,+∞), M1(E)) be the set of all continuous maps from [0,+∞) to M1(E). For any µ(·) in C([0,+∞), M1(E)), any fixed n ≥ 2, and 0 = x0 < x1,· · · < xn = 1, let Xk(t) =

µ(t)([xk−1, xk)), pk = ν0([xk−1, xk)) for 1 ≤ k ≤ n−1 and Xn(t) = µ(t)([xn−1, xn]), pn =

ν0([xn−1, xn]). It is well-known (cf. Ethier and Kurtz [4]) that under the law of the FV

process, X(t) = (X1(t),· · ·, Xn−1(t)) solves the following degenerate stochastic differential

equation:

dXk(t) =

θ

2(pk−Xk(t))d t+ √

ǫ

n−1

X

j=1

σkj(X(t))dBj(t),1≤k≤n−1,

where for any 1≤i, j≤n−1,Bj(t) are independent Brownian motions, and

n−1

X

k=1

σik(x)σkj(x) =xi(δij−xj).

Because of this partition property, it becomes natural to study the FV process through its finite dimensional analogue.

The first objective of this article is to establish the large deviation principle at the path level for the FV process when the sampling rate ǫ approaches 0. This problem has been studied in [1] and [2]. The only unresolved case is the lower bound when the mutation measure ν0 is not absolutely continuous with respect to the initial point µ. In the finite dimensional case, this corresponds to the case when the process starts from a point on the boundary of the simplex {(x1,· · ·, xn−1) :Pkn−=11xk = 1, xk ≥0, k = 1,· · ·, k}. The difficulty in resolving the

issue comes from the degeneracy and non-Lipschitz property ofσ(x).Our proof of this result reveals that the derivatives of all possible large deviation paths at time zero have to be the same as the derivative of the solution of the limiting dynamic. This is very different from the non-degenerate case. An explicit expression of the rate function is also obtained. All definitions and terminology of large deviations follow those in [3].

Our second objective is to identify the quasi-potential of the process. Consider the following infinite dimensional dynamic:

dµ(t) dt =

θ

(3)

exists, is the minimal energy needed for the transition fromν0 toν. It is known that the FV process has a unique reversible measure Πǫ which satisfies a full large deviation principle ([2]).

Roughly speaking for suitable subsetAofM1(E), we have

Πǫ{A}= exp{−

1

ǫνinf∈AI(ν)(1 +◦(1))},

where I(ν) =θH(ν0|ν) andH(ν0|ν) is the relative entropy ofν0 with respect to ν. If we set F(ν) = H(ν0|ν), then I(ν) = θ(F(ν)−F(ν0)). In physical terms, F(ν) is the free energy functional and θ plays the role of reciprocal temperature. Our second result shows that the quasi-potential equals toI(ν) and the minimal energy is attained by a time-reversed path of (1.2) connecting ν0with ν.

Large deviation lower bounds are obtained in Section 2, and a detailed description of the effective domain is given in Section 3. Result on quasi-potential is in Section 4.

2 Large Deviation at Path Level

For any fixedT >0, letC([0, T], M1(E)) be the space of all continuousM1(E)-valued functions on [0, T] equipped with the uniform convergence topology. In this section, we prove the large deviation lower bound for FV process at the path level, i.e. on spaceC([0, T], M1(E)), under the assumption that the mutation measureν0is not absolutely continuous with respect to the starting pointµof the process. Because of the partition property, we obtain the result through the study of the finite dimensional case.

2.1 Finite Dimension

For any fixedn≥1, let

Sn =

(

x= (x1,· · ·, xn−1) : xi≥0, n−1

X

i=1 xi≤1

)

.

For 1≤k≤n−1, consider

dXkǫ(t) =c(pk−Xkǫ(t))dt+

√ ǫ

n−1

X

j=1

σkj(Xǫ(t))dBj(t), Xǫ(0) =x,

wherec=θ₂, and (σkj(x)) is such that

σ(x)σ(x)′ = (xk(δkj−xj))1≤k,j≤n−1.

Denotepn= 1−Pnj=1−1pj andxn= 1−Pnj=1−1xj. Let

Lp={x∈ Sn : 0< xk+pk <2, 1≤k≤n; there exists 1≤i≤n, such thatxi= 0}.

In the remaining of this subsection, we assume thatxis inLp.

Let

Hxα,β =

φ∈C([α, β],Sn) : φ(t) =x+

Z t

α

˙ φ(s)ds

(4)

Define

Ixα,β(φ) =

(

1 2

Rβ

α

Pn

i=1

( ˙φi(t)−c(pi−φi(t)))2

φi(t) dt, φ∈H

α,β x

∞, φ /∈Hα,β x

where φn(t) = 1−Pnj=1−1φj(t). DenoteIx0,T andHx0,T byIxandHxrespectively.

Lemma 2.1 Suppose thatIx(φ)<∞.

i) If φi(0) = 0, thenφi(t)6= 0,t∈(0, T].

ii) If φi(0) = 1, thenφi(t)6= 1,t∈(0, T].

Proof: i) Suppose that there existst0>0 such thatφi(t0) = 0. If φi(t) = 0 for allt∈[0, t0],

then

φi(t)

=∞,

and hence Ix(φ) = ∞. This contradicts the assumption. Therefore, we may choose t0 < t′0 such thatφi(t0) =φi(t′0) = 0 andφi(t)6= 0 for allt∈(t0, t′0).

Lett0< t1< t2< t′

0. We have

∞ > Ix(φ)≥

Z t2

t1

φi(t) dt

≥ −2c

Z t2

t1

˙

φi(t)(pi−φi(t))

φi(t) dt

= −2c

Z φi(t2)

φi(t1)

pi−x

x dx → ∞

as t2 increases tot′

0.

ii) Forj6=i, we haveφj(0) = 0 and hence, for alltin (0, T],φj(t)6= 0. Therefore, for everyt

in (0, T],φi(t)6= 1.

Lemma 2.2 IfIx(φ)<∞andφi(0) = 0, then there ist0>0 such that for0≤t≤t0

φi(t)≤2(1 +c)t. (2.1)

Proof: Note that

φi(t) =

Z t

0

( ˙φi(s)−c(pi−φi(s)))ds+cpit−c

Z t

0

φi(s)ds

≤

Z t

0

( ˙φi(s)−c(pi−φi(s)))2

φi(s)

ds

!1/2_Z _t

0

φi(s)ds

1/2

+ct.

We can chooset0>0 such that for allt in [0, t0]

φi(t)≤

√

t, φi(t)≤

1 2

s Z t

0

(5)

By induction, it is easy to show that

The next lemma is the key in deriving the large deviation lower bound.

Lemma 2.3 Suppose Ix(φ)<∞. If φi(0) = 0, then φ˙i(0) =cpi; If φi(0) = 1, then φ˙i(0) =

The conclusion follows easily.

(6)

where Wi _{are Brownian motions such that} n−1

X

j=1

Z t

0 √

N σij(Xǫ(s))dBsj =Wτii

t, τ

i t =

Z t

0

N Xǫ,i(s)(1−Xǫ,i(s))ds,

λis a constant determined by Fernique’s theorem (cf. Kuo [5]) such thatc1is finite. Therefore

lim sup

ǫ→0

ǫlogP sup 0≤t≤1/N|

Xǫ_(t)₋_φ(t)_|_{> δ}

!

≤ −λN δ2_. _(2.2)

The conclusion then follows by takingN → ∞.

We are now ready to prove the lower bound.

Theorem 2.5 Forδ0>0 and anyφ satisfyingIx(φ)<∞, we have

lim inf

ǫ→0 ǫlogP{₀_≤sup_t_≤_T|X

ǫ

t−φ(t)|<2δ0} ≥ −Ix(φ) (2.3)

Proof: Letψi(t) =xie−ct+pi(1−e−ct). Then it is clear thatXǫ(t) converges to ψ(t) for all

t asǫgoes to zero. For anyN≥1, define

ϕ(t) =







φ(t), 1/N≤t≤T

ψ(1/2N) + 2N(φ(1/N)−ψ(1/2N))(t−1/2N), 1/2N ≤t≤1/N

ψ(t), 0≤t≤1/2N

Choosing N large enough such thatϕ(t)∈ S◦

n, the interior ofSn, fort∈[1/2N,1/N], and

{ sup 0≤t≤T|

Xǫ(t)−ϕ(t)|< δ0} ⊂ { sup 0≤t≤T|

Xǫ(t)−φ(t)|<2δ0}.

For anyδ < δ0, we have

2 max P{ sup 0≤t≤T|

Xǫ_(t)₋_ϕ(t)_|_{< δ0}}_{, P}_{ _sup

0≤t≤1/2N|

Xǫ_(t)₋_ϕ(t)_{| ≥}_δ0}

!

≥P{ sup 0≤t≤T|

Xǫ_(t)₋_ϕ(t)_|_{< δ0}}₊_P_{ _sup

0≤t≤1/2N|

Xǫ_(t)₋_ϕ(t)_{| ≥}_δ0}

≥P{ sup 1/2N≤t≤T|

Xǫ(t)−ϕ(t)|< δ}

≥P{|Xǫ_(1/2N)₋_ψ(1/2N₎_|_{< δ/2}_}

× inf

|y−ϕ(1/2N)|≤δ/2Py{₀_≤_t_≤sup_T₋₁_/₂_N|X

ǫ_(t)

−ϕ(t+ 1/2N)|< δ}

where δ is small enough such that |y−ϕ(1/2N)| ≤ δ/2 implies y ∈ S◦

n. Since Xǫ(1/2N)

converges to ψ(1/2N) asǫgoes to zero, by Lemma 2.4, we get

lim inf

ǫ_(t)

−φ(t)|<2δ0}

≥lim inf

ǫ_(t)

−ϕ(t)|< δ0} (2.4)

(7)

where the next to last inequality follows from Theorem 3.3 in [1]. By direct calculation, we have

I_ψ1/₍₁2N,_/₂_N1/N₎ (ϕ) =1 2

n

X

i=1

Z 1/N

1/2N

( ˙ϕi(t)−c(pi−ϕi(t))2

ϕi(t)

dt.

Ifxi 6= 0, thenϕi(0)6= 0 and it is clear that

Z 1/N

1/2N

ϕi(t)

dt→0

as N → ∞. Ifxi = 0, then ψi(0) =φi(0) = 0. By lemma 2.3, ˙φi(0) = cpi. It is clear that

˙

ψi(0) =cpi. Hence

lim

N→∞2N ψi(1/(2N)) = limN→∞N ϕi(1/N) =cpi.

Then

Z 1/N

1/2N

ϕi(t)

dt

=

Z 1/N

1/2N

( ˙ϕi−cpi)2

ϕi(t) dt+

Z 1/N

1/2N

(2cϕ˙i+c2ϕi(t)−2pic2)dt

= ( ˙ϕi−cpi) 2 ˙ ϕi

logϕ˙i+ 2N ψi(1/2N) 2N ψi(1/2N)

+

Z 1/N

1/2N

(2cϕ˙i+c2ϕi(t)−2pic2)dt

→ 0

asN → ∞.

2.2 Infinite Dimension

LetC1,0_{([0, T}_]_×_{E) denote the set of all continuous functions on [0, T}_]_×_E _{with continuous} first order derivative in time t. For any µ ∈ M1(E), let Cµ([0, T], M1(E)) be the set of all

M1(E)-valued continuous functions on [0, T] starting atµ.

In this subsection, we will assume that ν0 is not absolutely continuous with respect to µ. Without loss of generality, we also assume that the support ofν0 isE.

For anyµ(·) inCµ([0, T], M1(E)), define

Sµ(µ(·)) = sup g∈C1,0_([0_,T_]_×_E₎

J0µ,T(g) (2.5)

where

J0µ,T(g) = hµ(T), g(T)i − hµ(0), g(0)i

−

Z T

0 h

µ(s),(∂

∂s+A)g(s)ids− 1 2

Z T

0

Z Z

g(s, x)g(s, y)Q(µ(s);dx, dy)ds.

Recall that for anyµinM1(E), we haveQ(µ;dx, dy) =µ(dx)δx(dy)−µ(dx)µ(dy).

LetB1,0_{([0, T}_]_×_{E) be the set of all bounded measurable functions on [0, T}_]_×_E_{with continuous} first order derivative in time. Then by approximation, we have

Sµ(µ(·)) = sup g∈B1,0_([0_,T_])_×_E₎

(8)

Definition 2.1 Let C∞_(E) _{denote the set of all continuous functions on} _E _possessing

con-tinuous derivatives of all order. An element µ(·) in C([0, T], M1(E))is said to be absolutely continuous as a distribution-valued function if there exist M >0and an absolutely continuous function hM : [0, T]→R such that for all t, s∈[0, T]

sup

f |h

µ(t), fi − hµ(s), fi| ≤ |hM(t)−hM(s)|,

where the supremum is taken over all f in C∞_(E)_{with absolute value bounded by} _M_.

Let

Hµ={µ(·)∈C([0, T], M1(E)) :Sµ(µ(·))<∞}

be the effective domain ofSµ(·). Then anyµ(·) inHµis absolutely continuous, and by theorem

5.1 of [2]

Sµ(µ(·)) =

Z T

0 || ˙

µ(s)−A∗(µ(s))||2µ(s)ds, (2.7) whereA∗ _{is the formal adjoint of}_A_{defined through the equality}_h_A∗_{(µ), f}_i₌_h_{µ, Af}_i_{, and for}

any linear functionalϑon spaceC∞_(E)

||ϑ||2µ= sup f∈C∞

(E)

[hϑ, fi −1₂

Z

E

Z

E

f(x)f(y)Q(µ;dx, dy)].

Theorem 2.6 For anyµinM1(E),µ(·)inHµ, and any open neighborhoodU ofµ(·)in space

Cµ([0, T], M1(E)), we have

lim inf

ǫ→0 ǫlogPµ{U} ≥ −Sµ(µ(·)), (2.8) where Pµ is the law of the FV process starting atµ.

Proof:The result is clear when µ(·) is not in the effective domain Hµ. For µ(·) inHµ, both

µ(t,[a, b)) andµ(t,[a, b]) are absolutely continuous int fora, b∈E.

Let {fn ∈ C(E) : n ≥ 1} be a countable dense subset of C(E). Define a metric d on

Cµ([0, T], M1(E)) as follows:

d(ν(·), µ(·)) = sup

t∈[0,T]

∞ X

n=1 1

2n(|hν(t), fni − hµ(t), fni| ∧1). (2.9)

Then d generates the uniform convergence topology on Cµ([0, T], M1(E)). Clearly for any

δ >0, there exists ak≥1 such that

{µ(·)∈Cµ([0, T], M1(E)) : sup t∈[0,T]{|h

ν(t), fni − hµ(t), fni|} ≤δ/2,

n= 1,· · ·, k} ⊂ {d(ν(·), µ(·))< δ}.

Let

P ={A1,· · ·, An:n≥1, A1,· · ·, An is a partition of E},

and element of P is denoted byı, , etc. For any ν in M1(E),µ(·) in C([0, T], M1(E)), and ı={A1,· · ·An}and={[x0, x1),· · ·,[xm, xm+1]}}in P, set

(9)

and

π(µ(·)) = (µ(·)([0, x1)),· · ·, µ(·)([xm,1])).

Next we choosewithx0= 0, xm+1= 1 such that max

0≤i≤m,x,y∈[xi,xi+1]

{|fn(y)−fn(x)|: 1≤n≤k} ≤ δ

6,

and let

U(µ(·), δ

6Γ) =

n

µ(·)∈Cµ([0, T], M1(E)) :

sup

t∈[0,T],0≤i≤m|

ν(t)([xi, xi+1))−µ(t)([xi, xi+1))| ≤

δ 6Γ

o

,

where Γ = supx∈E,1≤n≤k|fn(x)|. Then we have

U(µ(·),

δ

6Γ)⊂ {d(ν(·), µ(·))< δ}. (2.10) The partition property of the FV process implies thatPµ◦π−1is the law of a FV process with

finite type space. This combined with Theorem 2.5, implies

lim inf

ǫ→0 ǫlogPµ{U} ≥lim infδ→0 lim infǫ→0 ǫlogPµ{d(ν(·), µ(·))< δ} ≥lim inf

δ→0 lim infǫ→0 ǫlogPµ{U(µ(·), δ 6Γ)}

≥ −Iπ(µ)(π(µ(·)))≥ −Sµ(µ(·)), (2.11)

where the last inequality follows from Theorem 3.4 in [1].

3 Effective Domain

In this subsection, we will analyze the structure of the effective domain. We start with another representation of the rate function. For anyµ(·)∈Cµ([0, T], M1(E)), let

L2µ(·)([0, T]×E) ={f :kfk 2

L2= Z T

0

Z

E

f2(t, x)µ(t, d x)d t <∞}.

LetH0

µbe the collection of all absolutely continuousµ(·) inCµ([0, T], M1(E)) such that ˙µ(t)−

A∗_{(µ(t)) is absolutely continuous with respect to}_{µ(t) as Schwartz distribution with derivative}

h(t) =d( ˙µ(t)−_dµA₍∗_t₎(µ(t))) in spaceL2

µ(·)([0, T]×E) satisfying

µ(t),d( ˙µ(t)−A

∗_(µ(t)))

dµ(t)

= 0, t∈[0, T].

Define

Fµ(µ(·)) =

(

1 2

d( ˙µ(t)−A∗

(µ(t)))

dµ(t)

2

L2 ifµ(·)∈ H

0

µ

(10)

Theorem 3.1

Sµ(µ(·)) =Fµ(µ(·)).

Proof: First we assume that µ(·)∈ H0

µ. Using integration by parts, we have

J0µ,T(g) = Similar to the Appendix in [1], we have

ℓµs,t(g) =

Letφbe any measurable function onE taking values between 0 and 1.

(11)

Then

The conclusion of the theorem follows from the same arguments as in the proof of Lemma 2.3.

4 Quasi-Potential

For anyν in M1(E), the quasi-potentialV(ν0;ν) is defined as

V(ν0;ν) = inf{ST1,T2

ν0 (µ(·)) :µ(T1) =ν0, µ(T2) =ν,−∞ ≤T1< T2≤ ∞}.

It is the least amount of energy needed for a transition fromν0 toν under large deviations. In this section the quasi-potential of the FV process is identified explicitly.

Theorem 4.1 The quasi-potential of the FV process is given by

V(ν0;ν) =θH(ν0|ν). (4.1)

Thus it follows from (2.5) that

(12)

which implies thatV(ν0;ν)≥θH(ν0|ν).

Next we consider the general case. Note that for any partitionıofE,

ST1,T2

πı(ν0),πı(ν)(πı(µ(·)))≥V(πı(ν0), πı(ν)),

and hence

sup

ı {S T1,T2

πı(ν0),πı(ν)(πı(µ(·)))} ≥sup

ı {V(πı(ν0), πı(ν))}.

Therefore

V(ν0, ν) = inf{ST1,T2

ν0,ν (µ(·)) :µ(T1) =ν0, µ(T2) =ν,−∞ ≤T1< T2≤ ∞}

≥ inf

sup

ı {S T1,T2

πı(ν0),πı(ν)(πı(µ(·)))}:µ(T1) =ν0, µ(T2) =ν,−∞ ≤T1< T2≤ ∞

≥ sup

ı {V(πı(ν0), πı(ν))}

= sup

ı {

θH(πı(ν0)|πı(ν))

= θH(ν0|ν).

Finally we derive the upper bound. Letµ(·) be the solution of the equation on [T1, T2].

dµ(t) dt =

θ

2(µ(t)−ν0), µ(T2) =ν. (4.2)

Then

µ(t) =e−.5θ(T2−t)_ν_{+ (1}

−e−.5θ(T2−t)_)ν0.

It is not hard to see thatµ(T1)→ν0asT1approaches−∞. From the expression (2.7), we get

ST1,T2

µ(T1),ν(µ(·)) = Z T2

T1

sup

f∈B1,0_([0_,T_]_×_E₎{

θhµ(s)−ν0, fi −1₂

µ(s), f2

− hµ(s), fi2}ds. (4.3)

Forf in B1,0_{([0, T}_]_×_{E), let}

M(f)(s) =θhµ(s)−ν0, fi −1₂

µ(s), f2

− hµ(s), fi2. Define g(s, x) =−θ dν0

dµ(s)(x). It is clear thatg is inB

1,0_{([0, T}_]_×_{E), and}

M(f)(s) =M(g+f−g)(s) =M(g)(s)−1₂[

µ(s),(f −g)2

− hµ(s), f−gi2]≤M(g)(s). Hence the sup in (4.3) is attained atg, and we have

ST1,T2

µ(T1),ν(µ(·)) = Z T2

T1

{θhµ(s)−ν0, gi −1₂

µ(s), g2

− hµ(s), gi2}ds=

Z T2

T1

hµ(s), g˙ ids. (4.4) By integration by parts, we get

Z T2

T1

hµ(s), g˙ ids=hµ(T2), g(T2)i − hµ(T1), g(T1)i −

Z T2

T1

hµ(s),g˙ids

=−

Z T2

T1

hµ(s),g˙ids=θ

Z T2

T1

(13)

The upper bound then follows by lettingT1go to−∞.

Acknowledgements. We would like to thank the referee and the associate editor for helpful suggestions and comments.

References

[1] D. Dawson and S. Feng (1998). Large deviations for the Fleming-Viot process with neutral mutation and selection.Stochastic Process. Appl. 77, 207–232.

[2] D. Dawson and S. Feng (2001). Large deviations for the Fleming-Viot process with neutral mutation and selection, II.Stochastic Process. Appl. 92, 131–162.

[3] A. Dembo and O. Zeitouni. (1998).Large deviations techniques and applications. Second edition. Applications of Mathematics,38. Springer-Verlag, New York.

[4] Ethier, S.N. and Kurtz, T.G.(1994). Convergence to Fleming-Viot processes in the weak atomic topology. Stoch. Proc. Appl., 54, 1–27.