getdocb26b. 318KB Jun 04 2011 12:04:51 AM

(1)

in PROBABILITY

FRACTIONAL BROWNIAN MOTION

AND THE MARKOV PROPERTY

Philippe CARMONA and Laure COUTIN Laboratoire de Statistique et Probabilit´es Universit´e Paul Sabatier

118 route de Narbonne

31062 Toulouse Cedex 4, FRANCE e-mail (PC): carmona@cict.fr

submitted September 21, 1997;revised November 2, 1998 AMS 1991 Subject classification: 60FXX,60J25,60G15,65U05,26A33,60A10.

Keywords and phrases: Gaussian processes, Markov Processes, Numerical Approximation, Ergodic Theorem.

Abstract

Fractional Brownian motion belongs to a class of long memory Gaussian processes that can be represented as linear functionals of an infinite dimensional Markov process. This leads naturally to :

• An efficient algorithm to approximate the process.

• An ergodic theorem which applies to functionals of the typeRt

0φ(Vh(s))dswhereVh(s) =

Rs

0 h(s−u)dBu. and B is a real Brownian motion.

1 Introduction

Fractional Brownian motion of Hurst parameterH _∈(0,1) is a real centered Gaussian process (WH

t ;t≥0 ) of covariance

EW_tHW_sH

= c(H) 2 (s

2H₊_t2H_{− |}_t₋_s_|2H₎ ₍_{s, t}

≥0). (1)

One easily sees that it starts from zero, WH

0 = 0 almost surely , it has stationary increments

(WH

t+s−WsH;t≥0 ) d

= (WH

t ;t≥0 ) since

E

(W_tH₋W_sH)2

=c(H)_|t₋s_|2H (s, t_≥0). (2) Fractional Brownian motion enjoys the following scaling (self-similarity) property

(2)

( 1

aHW H

at;t≥0 ) d

= (W_tH;t_≥0 ) (a >0). (3) The increments are independent if and only ifH =1

2 (Brownian motion ). IfH < 12, they are

negatively correlated, and if H > 1

2 they are positively correlated. Furthermore it is a long

memory process, since, for H 6= 1₂, the covariance between far apart increments decrease to zero as a power law:

Cov(WH₍_t₎₋_WH₍₀₎_{, W}H₍_t₊_u₎₋_WH₍_u₎₎_∼

u→+∞constant×u2H−2.

Fractional Brownian motion has been used as a model of signal or noise in various domains: geophysical data [10, 3], communication processes [12, 11], see also the references therein. Therefore it is important to provide good numerical approximating schemes.

Problem I: approximation of fractional Brownian motion

We shall not linger on the first (naive) idea that may cross our mind. We simulate an n -dimensional Gaussian random vector with the same disribution as (WH

s1, . . . , W

H

sn) and we

perform a linear or spline interpolation. This amounts to computing the square root of the

n×ncovariance matrix and is computationally inefficient.

A second solution is to use the representation introduced by Mandelbrot and Van Ness [13]: ifα=H+1₂,

WH t =

1 Γ(α)

Z t −∞

((t₋u)α−1₋₍₋_u₎α−1 + )dBu

where B is a standard Brownian motion on the real line. Without any loss in generality, we can say that the problem is to approximate the finite, but long, memory part:

V(t) =Vh(t) =

Z t

0

h(t₋s)dBs

where (Bt;t≥0 ) is a standard Brownian motion . The naive way to do this is to fix a small

step _|∆n|=_n1 for the partitions we use

∆n= 0< s1=

1

n < s2=

2

n <· · ·.

Hence, ift=r/n, thenV(t) is approximated by

V(t)_≃Xh(t₋si) (Bsi+1−Bsi)

= _√1

n

X

h(t₋si)ξi

= √1_n(h(1

n)ξr+h(2/n)ξr−1+· · ·)

where ξi=√n(Bsi+1−Bsi) are independent identically distributed standard normal random

variables.

(3)

t h(t-s)

1/n

0

Figure 1: The functions_→h(t₋s) whenh(u) = _Γ(1_α₎uα−1₁ (u>0)

V(t+ ∆t)≃ √1_n(h(1/n)ξr+1+h(2/n)ξr+· · ·)

Therefore, all we have to do is to compute the h(i/n), generate theξi, and keep them all in

memory for the next step.

Unfortunately, if we want to keep the approximation error low (that isn big), this amounts to a lot of data to keep in memory.

The natural idea is to impose a threshold, that is to computeh(i/n) and generateξ only for 1_≤i_≤Mn. This may work well for smooth functionsh, but not for

h(u) = 1 Γ(α)u

α−1₁ (u>0)

where 1₂ < α <1, sincehexplodes near zero, and thus our partition is not dense enough near

t!

Unfortunately this example is important since it is in essence fractional Brownian motion of indexH =α−1

2.

In [7],Carmona, Coutin and Montseny derived an approximation scheme from a representation formula inspired by the diffusive input-output model of fractional integration developed by Audounet, Montseny and Mbodje [1, 2].

More precisely Fubini’s stochastic theorem implies that ifhhas the “spectral” representation

h(u) = Z

(4)

where µ is a positive measure on (0,+_∞), finite on compact sets, and such that R (1_∧

x−12)µ(dx)<+∞, then

Vh(t) =

Z ∞

0

Y_txµ(dx) where

Y_tx= Z t

0

e−x(t−s)dBs.

Observe that for fixed x _≥ 0, (Yx

t ;t≥0 ) is an Ornstein-Uhlenbeck process of parameter

x, that is a Gaussian semi-martingale Markov process solution of the stochastic differential equation

dY_tx=dBt−x Ytxdt .

Therefore, Vh=hYiµ is a linear functional of the infinite dimensional Markov process

Y= (_Yt;t≥0 ), Yt= (Ytx, x≥0).

The idea behind the approximation scheme is to first do a spatial approximation

Vh(t)≃Vhπ(t) =

X

π

ciYtηi,

where π = {ηi}i is a chosen finite partition converging to identity, and ci are coefficients

depending only on h. This approximation is uniformly good with respect to time, that is sup_t_≤_T|Vh(t)−Vhπ(t)|goes to zero whenπgrows.

The second step is the time discretization, which can be performed in the following way :

Yηi₍_t_{+ ∆}_t_{) =}_Yηi₍_t_{) + ∆}_B

t−ηiYηi(t). (4)

This second step is a classical one : see e.g. the book of Bouleau and Lepingle [5], chapter V, section B, pages 271–278, where it is stated that to get a precision of ǫ, one has to consider at most a constant C times N = log(1/ǫ) time points. Nevertheless, when one needs to do the whole family of approximations (4), one needs to control precisely the dependence of the constant C = C(ηi, T) with respect to the space and time variables. The classical method,

combined with Gronwall’s lemma, gives an exponential upper bound for the Lp_{-norm of the}

approximation error C(η, T)_≤ c1(T, p)ηexp(c2(T, p)ηp). Therefore, we have to use another

time discretization scheme for which we have polynomial constants (see e.g. [6] or [8]). The first step amounts to approximate a linear functional of an infinite dimensional Markov process by a linear functional of a finite dimensional Markov process. The secret of the success of this method is that for a large class of functionsh, this dimension can be kept low : for a precision of ǫ, the cardinal of the partitionπ(ǫ) is roughly

N(ǫ) =|π(ǫ)| ∼c

r 1

ǫ log(

1

ǫ).

(5)

The example of fractional Brownian motion

We can take advantage not only of the Gaussian nature of the processes Yx_{, but also of their}

semi-martingale nature (via Itˆo’s formula). We have been able to give exact L2 _{and almost}

sure convergence rates for fractional Brownian motion of indexH =α₋1

2 ∈(0,12).

Indeed, to obtain the precision sup

t≤Tk

Vh(t)−Vhπ(t)kL2 ≤ǫ ,

we may choose the compact

K(ǫ)_≃

ǫ1−1α,(1

ǫ)

2 2α+3

and a geometric partition

π(ǫ) =

r−m, r−m+1, . . . , rn

of ratior=r(ǫ) = 1 +c(α)√ǫ.

Furthermore, ifǫn decreases fast enough to 0,

ǫn=o

(logn)−8(22α+3−_α)

(5) then, we have the uniform almost sure convergence

sup

t≤T

Vh(t)−V

π(ǫn)

h (t)

_n7−→_→∞0 almost surely.

Theoretical results (see [7] or [8]) show then that to obtain a precision of ǫ = 10−3_{, we}

need to consider a few hundred points. Practical results show that forty points are enough. Furthermore, we have been able to check that (almost surely) the numerical approximations stick exponentially fast to the actual random process (and this happens uniformly on compact sets), whenǫn goes faster to 0 than indicated in (5).

Problem II : an ergodic theorem

It is well known (Birkhoff’s ergodic theorem) that for a stationary process U, we have the convergence, almost sure and inL1,

1

t

Z t

0

φ(U(s))ds_t7−→

→∞E[φ(U(0))], (6)

as soon asφis an integrable Borel function: E[_|φ(U(0))_|]<+_∞.

Of course, we first have to check the ergodicity, that is that the invariantσ-field is almost surely trivial, which is not so easy to do in general (see e.g. Rozanov [14], page 25, Theorem 6.1). But what we actually want to do is to answer the following question. Given (V(t) ;t_≥0 ) a real Gaussian process, a priori non stationary, such thatVr_{= (}_V₍_t₊_r_{) ;}_t_≥_{0 ) converges in}

distribution to the stationary processU. We want to show that, as soon asφis an integrable Borel function: E[|φ(U(0))|]<+∞, we have the convergence, almost sure and inL1_,

1

t

Z t

0

φ(V(s))ds _7−→

(6)

To be more precise, givenh_∈L2₍_R

+) and (Bt, t∈R) a standard Brownian motion, we let

V(t) = Z t

0

h(t−s)dBs, and U(t) =

Z t −∞

h(t−s)dBs. (8)

We shall show in Proposition 5 that (7) holds ifφis Lipschitz andhis the Laplace transform

h(u) = Z

e−ux_µ₍_dx₎_, ₍₉₎

of a positive measureµ, finite on compact sets and such that Z ₁

√_xµ(dx)<+∞. (10)

2 The Markov and ergodic properties of the underlying

infinite dimensional process

LetB = (Bt;t≥0 ) be a standard Brownian motion . Givenx≥0,Yx= (Ytx;t≥0 ) is the

Ornstein-Uhlenbeck process driven byB of parameter x(starting from 0):

Yx t =

Z t

0

e−x(t−s)_dB

s.

It is also the solution of the stochastic differential equation

dY_tx=dBt−xYtxdt .

For every t≥0, we considerYt= (Ytx, x >0); it is a centered Gaussian process of covariance

Γ_Yt(x, y) =

1₋e−t(x+y)

x+y (x, y >0).

The natural candidate for a limiting process isY∞the centered Gaussian process of covariance Γ_Y∞(x, y) =

1

x+y (x, y >0).

Proposition 1. Suppose that R

x−1

2dµ(x) <+∞. Then, the process Y = (Y_t;t≥0 ) is an

infinite dimensional Markov process with values inL1₍_µ_);_Y

∞is a random variable with values in L1₍_µ_).

Proof. SinceE =L1₍_µ_{) is a separable Banach space, the Borel sigma-fields of weak and strong}

topologies onE coincide with

E =σ(χ:χ_∈L1(µ)∗) =σ(χ:χ_∈L∞(µ)),

(see e.g. Borkar [4], Lemma 2.5.2).

Therefore we first have to check that_Yttakes its values in L1(µ). This is the case since

E Z

|Y_tx_|dµ(x)

=c

Z

dµ(x) ₁

−e−2xt

2x

12

(7)

Similarly,_Y_∞takes its values inL1₍_µ_{) since}

the case since it is a centered Gaussian random variable living in the Gaussian space generated by the Brownian motionB, with covariance, given by an application of Fubini’s Theorem,

E

This last integral is finite, Indeed, since ab_≤ 1

2(a2+b2),we can majorize it by

Therefore, _hχ,_Yticonverges in distribution to a centered Gaussian variable with variance

Z Z

χ(x)χ(y)µ(dx)µ(dy)

x+y ,

and this is easily seen to be the variance ofhχ,Y∞i.

The strong Markov property is quite easy to prove. Indeed, letT be an almost surely finite stopping time (with respect to (_Ft;t≥0 ) the completed, hence right-continuous, filtration

generated by the Brownian motionB). Then, ¯B = ( ¯Bt = Bt+T −BT;t≥0 ) is a

stan-Hence, for any bounded measurable functionψ,

E[ψ(Yt+T;t≥0 )| FT] =EYT[ψ(Yt;t≥0 )],

with the family (Py, y∈L1(µ)) of probabilities defined by

Ey[ψ(Yt;t≥0 )] =E

h

ψ((Yx,y(x), x≥0) ;t≥0 )i,

where (Y_tx,y;t_≥0 ) denotes the Ornstein-Uhlenbeck process driven by B, of parameter x, starting fromy:

Y_tx,y=e−xt_y₊Z t

0

e−x(t−u)_dB

(8)

Proposition 2. Suppose that µis a sigma-finite measure on(0,+_∞)such that, Z

µ(dx)x−12 <+∞.

Then, for everyy_∈L1₍_µ_{), under} _P

y, we have the convergence in distribution

Yt 7−→d t→∞Y

∞_.

Proof. We shall first prove the convergence of characteristic functions, since this will charac-terize the distribution of a possible limit (see [4]), and then prove the tightness of the family of distributions of (_Yt;t≥0 ).

The characteristic function is defined by

φYt(χ) =Ey

h

eihχ,Ytii ₍_χ_∈_L∞₎_.

Observe thathχ,Ytiis, underPy, a Gaussian random variable of mean

Z

χ(x)y(x)e−xtµ(dx) _7−→

t→∞0 by dominated convergence and of variance

E "

Z

χ(x)Y_txµ(dx) 2#

= Z Z

χ(x)χ(y)µ(dx)µ(dy)E[Y_txY_ty]

= Z Z

χ(x)χ(y)µ(dx)µ(dy)1−e−

t(x+y)

x+y

7−→

t→∞ Z Z

x+y

Therefore, hχ,Yticonverges in distribution to a centered Gaussian variable with variance

Z Z

x+y ,

and this is easily seen to be the variance ofhχ,Y∞i.

We shall now prove the tightness of the family of laws of the processes (_Yt, t≥0). We need to

introduce a little topology: we consider that for every t >0, Ytis a random variable taking

its values inL1₍_µ_{) endowed with the weak topology}_σ₍_L1_{, L}∞_{); this is the coarsest topology} for which the functions

y_{→ h}χ, y_i (χ_∈L∞₍_µ₎₎ are continuous.

Letφ(x) = _√1

x. GivenM >0, Lemma 4 implies that, givenp >1, if 1 = 1/p+ 1/q, then

K=

y_∈L1_:Z |y|

p

φp/qdµ≤M

(9)

Let us notice that, by construction, underPy the law ofYtis the law , underP0 ofYt+ (x→

e−xt_y₍_x_{)). Since this last function converges weakly to the null function, we can suppose}_y_{= 0,}

without any loss in generality. We have the sequence of inequalities, P0(Yt∈/K) =P0

Z

|Ytx|p

1

φ(x)p/qµ(dx)> M

≤ _M1E0

Z

|Yx t |p

1

φ(x)p/qµ(dx)

≤ _Mc

Z

1−e−2xt

2x

p/2 1

φ(x)p/qµ(dx)

≤ _Mc

Z ₁ 2x

p/2 ₁

φ(x)p/qµ(dx).

This last quantity goes uniformly to zero asM _{→ ∞}, since the integral is finite.

Proposition 3. Assume thatµis a sigma finite measure on(0,+_∞)such thatR

x−1

2dµ(x)<

+_∞, and letν_∞ denote the law of_Y_∞. Letf be a function depending only on a finite number of coordinates. Then

1. Ift >0, theny_→Ptf(y) is continuous for the weak topology.

2. for everyy∈L1₍_µ_),_lim

t→∞Ptf(y) =ν∞(f).

Consequently,ν∞is an invariant probability measure for the semi group(Pt;t≥0 )of(Yt;t≥0 );

that is, for every bounded (or positive) measurable functiong,ν∞(Ptg) =ν∞(g).

Proof. Indeed, we have, for a bounded measurable functiong:Rn→R, and{χi,1≤i≤n} ⊂

L∞₍_µ₎

f(y) =g(hχi, yi,1≤i≤n).

Hence,

Ptf(y) =Ey[g(hχi,Yti,1≤i≤n)]

=E0g(χi, y(.)e−t.+hχi,Yti,1≤i≤n).

But we know that ifX = (X1, . . . , Xn) is a non degenerate centered Gaussian vector, then we

have the absolute continuity relationship

E[g(z+X)] =E[Λ(z)g(X)] (z∈Rn),

with Λ a continuous bounded function. Applying this to (_hχi,Yti,1≤i≤n), fort >0, yields

Ptf(y) =EΛ(χi, y(.)e−t.,1≤i≤n)g(hχi,Yti,1≤i≤n).

Now if then net yp→y weakly inL1(µ), then, for each i,

χi, yp(.)e−t.

→

χi, y(.)e−t.

(10)

andPtf(yp)→Ptf(y). Therefore, ift >0, then Ptf is a continuous bounded function. Since,

element of H. Since Hcontains constants and the class C of bounded functions depending only on a finite number of coordinates, then, by the monotone class theorem,Hcontains every bounded function measurable with respect to the sigma fieldσ(C) =E.

Lemma 4. Let µ be a sigma-finite measure, and φ be a strictly positive (φ > 0), integrable function. Then, for everyM _≥0,p, q >1such that1 = 1/p+ 1/q, Proof. Following Theorem IV.8.9 of Dunford–Schwarz [9] we have to prove that

1. K is a bounded subset ofL1 . By H¨older’s inequality, forf inK

Z

TakingE0= Ω the whole space, we see thatKis bounded inL1. The dominated convergence

(11)

3. φ is a Lipschitz continuous function such that E[_|φ(N)_|] < +_∞ where N denotes a Without any loss in generality, we may assume thatR∞

0 h2(u)du= 1.

It is not very hard to use Kolmogorov’s continuity criterion to construct, up to a null set, a bicontinuous version of the process (Yx

t , x > 0 ;t≥0 ). Therefore, the law Py may be

considered as a probability measure on the canonical space of continuous functionsw:R+→

(L1₍_µ₎_{, σ}₍_L1_{, L}∞_{). Since}_ν

Birkhoff’s ergodic theorem states then that if F is a measurable function in L1₍_P

ν∞) and

if G(w) = E[F _{| I}](w) is a version of the conditional expectation of F with respect to the sigma-field of shift-invariant sets, then

1

such that, under P0, almost surely,

1

t

Z t

0

φ(Vh(s) +y(.)e−s.)ds→Z .

(12)

Observe first that

Indeed, since φis Lipschitz continuous with, say, constantc, 1

Let us introduce the random variables, fort_≥a,

Xt=

Then, φbeing Lipschitz with constant c,

E[_|Xt−Xta|]≤c

tna.s., is equal, almost surely, to a random variable which is measurable

with respect to the sigma field σ(Bu, u ≥ a). Since a > 0 is arbitrary, we obtain that Z is

a.s. equal to a random measurable measurable with respect to the tail sigma field _T =

(13)

References

[1] J.Audounetand G.Montseny, Modèles non héréditaires pour l’analyse et le contrôle de systèmes à mémoire longue, April 1995, Journes d’etude: les systemes non entiers en automatique, Bordeaux, France.

[2] J.Audounet, G.Montseny, and B.Mbodje,Optimal models of fractional integrators and application to systems with fading memory, 1993, IEEE SMC’s conference, Le Touquet (France).

[3] J.Beranand N.Terrin,Testing for a change of the long-memory parameter, Biometrika 83(1996), no. 3, 627–638.

[4] V.S.Borkar,Probability theory: an advanced course, Universitext, Springer, 1995. [5] N.Bouleauand D.L´epingle,Numerical methods for stochastic processes, Wiley series

in Probability and Mathematical Statistics, John Wiley & Sons, Inc., 1994, ISBN 0-471-54641-0.

[6] Ph.Carmona and L.Coutin, Simultaneous approximation of a family of (stochastic) differential equations, Unpublished, June 1998.

[7] Ph.Carmona, L.Coutin, and G.Montseny, Applications of a representation of long memory Gaussian processes, Submitted to Stochastic Processes and their Applications, June 1997.

[8] ,A diffusive Markovian representation of fractional Brownian motion with Hurst parameter less than1/2, Submitted to Statistic and Probability Letters, November 1998. [9] N. Dunford and J.T. Schwartz, Linear Operators, Part I: General Theory, Wiley

Classics Library Edition Published 1988, John Wiley and Sons, New York, 1988.

[10] H.Graf,Long range correlations and estimation of the self-similarity parameter, Ph.D. thesis, ETH Z¨urich, 1983.

[11] W.E. Leland, M.S. Taqqu, W. Willinger, and D.V. Wilson, On the self-similar nature of Ethernet traffic, IEEE/ACM Trans. Networking 2(1994), no. 1, 1–15.

[12] B.Mandelbrot,Self-similar error clusters in communication and the concept of condi-tional stationarity, IEEE Trans. Commun. Technol. COM–13(1965), 71–90.

[13] B.B. Mandelbrot and Van Ness, Fractional brownian motions, fractionalnoises and applications, SIAM Review10 (1968), no. 4, 422–437.