ELECTRONIC COMMUNICATIONS in PROBABILITY

MEASURABILITY OF OPTIMAL TRANSPORTATION AND STRONG COUPLING OF MARTINGALE MEASURES

JOAQUIN FONTBONA¹

DIM-CMM, UMI(2807) UCHILE-CNRS, Universidad de Chile, Casilla 170-3, Correo 3, Santiago-Chile.

HÉLÈNE GUÉRIN²

IRMAR, Université Rennes 1, Campus de Beaulieu, 35042 Rennes-France.

SYLVIE MÉLÉARD³

CMAP, Ecole Polytechnique, CNRS, route de Saclay, 91128 Palaiseau Cedex-France.

Submitted August 5, 2008, accepted in final form March 22, 2010

AMS 2000 Subject classification: 49Q20; 60G57

Keywords: Measurability of optimal transport. Coupling between orthogonal martingale measures. Predictable transport process.

Abstract

We consider the optimal mass transportation problem in $\mathbb{R}^d$ with measurably parameterized marginals under conditions ensuring the existence of a unique optimal transport map. We prove a joint measurability result for this map, with respect to the space variable and to the parameter. The proof needs to establish the measurability of some set-valued mappings, related to the support of the optimal transference plans, which we use to perform a suitable discrete approximation procedure. A motivation is the construction of a strong coupling between orthogonal martingale measures. By this we mean that, given a martingale measure, we construct in the same probability space a second one with a specified covariance measure process. This is done by pushing forward the first martingale measure through a predictable version of the optimal transport map between the covariance measures. This coupling allows us to obtain quantitative estimates in terms of the Wasserstein distance between those covariance measures.

1 Introduction

We consider the optimal mass transportation problem in $\mathbb{R}^d$ with measurably parameterized marginals, for general cost functions and under conditions ensuring the existence of a unique optimal transport map. The aim of this note is to prove a joint measurability result for this map, with respect to the space variable and the parameter. One of our motivations is the construction of couplings between martingale measures (in the sense of Walsh [15]). This problem classically arises in the literature, especially in cases where the martingale measures are compensated Poisson point measures or space-time white noises, for which the covariance measures are deterministic (cf. Grigelionis [5], El Karoui-Lepeltier [1], Tanaka [12], El Karoui-Méléard [2], Méléard-Roelly [7], Guérin [6]). A classical approach for constructing such couplings is to use the Skorokhod representation theorem. However, this does not give any quantitative estimate.

¹ Supported by Fondecyt Project 1070743, ECOS-CONICYT C05E02 and BASAL-CONICYT.
² Supported by ECOS-CONICYT C05E02.
³ Supported by ECOS-CONICYT C05E02.

Here, we aim at a method to construct what we call a “strong coupling” between martingale measures. By this we mean that, given a martingale measure, we seek to construct in the same probability space a second one with a specified covariance measure process. We will do this by pushing forward the given martingale measure through the optimal transport map between the two covariance processes. The interest of such a construction is that it provides quantitative estimates in terms of the Wasserstein distance between the covariance measures. But in order to make this construction rigorous, the existence of a predictable version of the above described transport map is needed. To our knowledge, available measurability results on the mass transportation problem with respect to some parameter require a topological structure on the space of parameters, or concern measurability of transference plans, but not of transport maps (see e.g. [10], or Corollaries 5.22 and 5.23 in [14]). Our main result provides the existence of the jointly measurable optimal transport map which is required.

Let us establish notations and recall basic facts. We denote the space of Borel probability measures in $\mathbb{R}^d$ by $\mathcal{P}(\mathbb{R}^d)$, endowed with the usual weak topology, and by $\mathcal{P}_p(\mathbb{R}^d)$ the subspace of probability measures having finite $p$-th order moment. Given $\pi\in\mathcal{P}(\mathbb{R}^{2d})$, we write

$$\pi <^{\mu}_{\nu}$$

if $\mu,\nu\in\mathcal{P}(\mathbb{R}^d)$ are respectively its first and second marginals. Such $\pi$ is referred to as a “transference plan” between $\mu$ and $\nu$.

Let $c:\mathbb{R}^{2d}\to\mathbb{R}_+$ be a continuous function. The mapping

$$\pi\mapsto I(\pi) := \int_{\mathbb{R}^{2d}} c(x,y)\,\pi(dx,dy)$$

is then lower semicontinuous. The Monge-Kantorovich or optimal mass transportation problem with cost $c$ and marginals $\mu,\nu$ consists in finding

$$\inf_{\pi <^{\mu}_{\nu}} I(\pi).$$

It is well known that the infimum is attained as soon as it is finite, see [13], Ch. 1. In this case, we denote by $\Pi^*_c(\mu,\nu)$ the subset of $\mathcal{P}(\mathbb{R}^{2d})$ of minimizers. If otherwise $I(\pi)=+\infty$ for all $\pi <^{\mu}_{\nu}$, then by convention we set $\Pi^*_c(\mu,\nu)=\emptyset$.
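As a small illustration of the minimization problem just stated, consider the special case where $\mu$ and $\nu$ are uniform measures on $n$ atoms each. By Birkhoff's theorem on doubly stochastic matrices, the infimum of $I(\pi)$ is then attained at a plan induced by a permutation, so for very small $n$ it can be found by exhaustive search. The Python sketch below does this; the function name `optimal_assignment`, the atoms and the choice of quadratic cost are illustrative assumptions, not taken from the text.

```python
from itertools import permutations

def optimal_assignment(xs, ys, cost):
    """Minimize I(pi) over plans between uniform measures on xs and ys.

    Assumes len(xs) == len(ys) == n with mass 1/n on each atom, so that it
    suffices to search over permutations (only feasible for tiny n).
    Returns (minimal value of I, optimal permutation as a tuple of indices).
    """
    n = len(xs)
    best_value, best_perm = None, None
    for perm in permutations(range(n)):
        value = sum(cost(xs[i], ys[perm[i]]) for i in range(n)) / n
        if best_value is None or value < best_value:
            best_value, best_perm = value, perm
    return best_value, best_perm

# Quadratic cost c(x, y) = |x - y|^2 on the real line (illustrative choice).
quadratic = lambda x, y: (x - y) ** 2

xs = [0.0, 1.0, 3.0]
ys = [2.0, 5.0, 0.5]
value, perm = optimal_assignment(xs, ys, quadratic)
# perm[i] is the index of the atom of nu that receives the mass of xs[i].
print(value, perm)  # 1.75 (2, 0, 1)
```

Here the optimal permutation matches the atoms in increasing order, as expected for a convex cost on the line.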

We shall say that Assumption $H(\mu,\nu,c)$ holds if there exists a unique optimal transference plan $\pi\in\Pi^*_c(\mu,\nu)$, and it has the form

$$\pi(dx,dy)=\mu(dx)\otimes\delta_{T(x)}(dy)$$

for a $\mu(dx)$-a.s. unique mapping $T:\mathbb{R}^d\to\mathbb{R}^d$.


We recall that the condition that $\mu$ does not give mass to sets with Hausdorff dimension smaller than or equal to $d-1$ is optimal both for existence and uniqueness of $T$, see Remark 9.5 in [14]. Moreover, if $\Pi^*_c(\mu,\nu)\neq\emptyset$, then the latter condition implies $H(\mu,\nu,c)$ in the following situations (see Gangbo and McCann [4]):

i) $c(x,y)=\tilde c(|x-y|)$ with $\tilde c:\mathbb{R}_+\to\mathbb{R}_+$ strictly convex, superlinear and differentiable with locally Lipschitz gradient.

ii) $c(x,y)=\tilde c(|x-y|)$ with $\tilde c$ strictly concave, and $\mu$ and $\nu$ are mutually singular.

Condition $H(\mu,\nu,c)$ also holds if

iii) $c(x,y)=\tilde c(|x-y|)$ with $\tilde c$ strictly convex and superlinear, and moreover $\mu$ is absolutely continuous with respect to Lebesgue measure.

When $\mu,\nu\in\mathcal{P}_p(\mathbb{R}^d)$, fundamental examples are the cost function $c(x,y)=|x-y|^p$ with $p\ge 2$ for case i), $p>1$ for case iii), and $p\in(0,1)$ for case ii).
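To give a feeling for how the convex and concave regimes differ, the following Python sketch compares the optimal matchings for $c(x,y)=|x-y|^p$ with $p=2$ and $p=1/2$ between two uniform measures on two atoms of the real line. This is only a toy computation: discrete measures charge points, and the two measures below share an atom, so none of the hypotheses of cases i) to iii) is satisfied. The function name `best_matchings` and the data are hypothetical.

```python
from itertools import permutations

def best_matchings(xs, ys, p):
    """All optimal matchings for c(x, y) = |x - y|**p between the uniform
    measures on the atoms xs and ys (brute force, tiny inputs only)."""
    n = len(xs)
    totals = {
        perm: sum(abs(xs[i] - ys[perm[i]]) ** p for i in range(n))
        for perm in permutations(range(n))
    }
    m = min(totals.values())
    return [perm for perm, t in totals.items() if abs(t - m) < 1e-12]

xs = [0.0, 1.0]
ys = [1.0, 2.0]

# p = 2 (strictly convex): the increasing matching 0 -> 1, 1 -> 2 is optimal.
print(best_matchings(xs, ys, 2))    # [(0, 1)]
# p = 1/2 (strictly concave): the shared atom at 1 stays put and 0 -> 2.
print(best_matchings(xs, ys, 0.5))  # [(1, 0)]
```

With a concave cost it is cheaper to leave common mass in place and move the remainder far, which suggests why case ii) is stated for mutually singular marginals.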

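Our main result is

Theorem 1.1. Let $(E,\Sigma,m)$ be a $\sigma$-finite measure space and consider a measurable function $\lambda\in E\mapsto(\mu_\lambda,\nu_\lambda)\in(\mathcal{P}(\mathbb{R}^d))^2$ such that for $m$-almost every $\lambda$, $H(\mu_\lambda,\nu_\lambda,c)$ holds, with optimal transport map $T_\lambda:\mathbb{R}^d\to\mathbb{R}^d$. Then, there exists a function $(\lambda,x)\mapsto T(\lambda,x)$ which is measurable with respect to $\Sigma\otimes\mathcal{B}(\mathbb{R}^d)$ and such that, $m(d\lambda)$-almost everywhere,

$$T(\lambda,x)=T_\lambda(x)\quad \mu_\lambda(dx)\text{-almost surely.}$$

In particular, $T_\lambda(x)$ is measurable with respect to the completion of $\Sigma\otimes\mathcal{B}(\mathbb{R}^d)$ with respect to $m(d\lambda)\mu_\lambda(dx)$.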
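In simple cases the joint measurability asserted by Theorem 1.1 can be read off an explicit formula. The Python sketch below treats a hypothetical one-dimensional Gaussian family, $\mu_\lambda=N(0,1)$ and $\nu_\lambda=N(\lambda,(1+\lambda^2)^2)$, with quadratic cost: the optimal map is then the increasing affine map $T(\lambda,x)=\lambda+(1+\lambda^2)x$, which is jointly continuous in $(\lambda,x)$. The family and the function names are assumptions made for illustration; the theorem is needed precisely when no such formula is available.

```python
from statistics import NormalDist

def target_parameters(lam):
    # Hypothetical measurable parametrization lam -> (mean, std), std > 0.
    return (lam, 1.0 + lam * lam)

def transport_map(lam, x):
    """T(lam, x): increasing map sending mu_lam = N(0, 1) to
    nu_lam = N(mean(lam), std(lam)^2) for the family above."""
    mean, std = target_parameters(lam)
    return mean + std * x

def pushforward_defect(lam, x):
    """|F_nu(T(lam, x)) - F_mu(x)|; zero iff T matches the quantiles at x."""
    mean, std = target_parameters(lam)
    return abs(NormalDist(mean, std).cdf(transport_map(lam, x))
               - NormalDist().cdf(x))

for lam in (-1.0, 0.0, 2.5):
    worst = max(pushforward_defect(lam, x) for x in (-2.0, -0.3, 0.0, 1.7))
    print(lam, worst)  # defects are at the level of rounding error
```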

Theorem 1.1 generalizes Theorem 1.2 in [3], where a predictable version of the quadratic transport map between a time-varying law and empirical samples of it was constructed. This allowed us to exhibit the convergence rate of a Brownian motion driven interacting particle system towards a white-noise driven nonlinear process. More precisely, we considered $n$ independent copies $(X^i)_{i=1}^n$ of the process in $\mathbb{R}^d$:

$$X^i_t = X^i_0 + \int_0^t\int_{\mathbb{R}^d}\sigma(X^i_s-y)\,W^i(ds,dy) + \int_0^t\int_{\mathbb{R}^d} b(X^i_s-y)\,P_s(y)\,dy\,ds \qquad (1)$$

where $\sigma$ and $b$ satisfy usual Lipschitz assumptions, $X^i_t\sim P_t(y)\,dy$ and $W^i$ is an $\mathbb{R}^d$-valued space-time white noise on $[0,T]\times\mathbb{R}^d$ with independent coordinates of covariance measures $P_t(y)\,dy\otimes dt$. By pushing forward each $W^i(dt,dx)$ through the predictable version of the optimal transport map from $P_t$ to $\frac{1}{n}\sum_{j=1}^n\delta_{X^j_t}$, we constructed in the same probability space $n^2$ independent $\mathbb{R}^d$-Brownian motions $(B^{ik})$, suitably correlated with the martingale measures $(W^i)$. This provided us a coupling between the processes $(X^i_t)$ and the particles $(X^{i,n}_t)$ governed by

$$X^{i,n}_t = X^i_0 + \frac{1}{\sqrt{n}}\int_0^t\sum_{k=1}^n\sigma(X^{i,n}_s-X^{k,n}_s)\,dB^{ik}_s + \frac{1}{n}\int_0^t\sum_{k=1}^n b(X^{i,n}_s-X^{k,n}_s)\,ds,\qquad i=1,\dots,n,$$

yielding the estimate $W_2^2(\mathrm{law}(X^{1,n}),\mathrm{law}(X^1))\le C_{T,d}\,n^{-\frac{2}{d+4}}$ for the Wasserstein-2 distance $W_2$ on [...] the (squared) expected Wasserstein-2 distance $W_2^2\big(\frac{1}{n}\sum_{j=1}^n\delta_{X^j_t},P_t\big)$ in $\mathcal{P}_2(\mathbb{R}^d)$, as $n$ goes to infinity. (For details and precise statements, see Theorem 1.1 and Corollary 6.2 in [3].)

The proof of Theorem 1.1 is developed in Section 2. We first establish a type of measurable dependence on $\lambda$ of the support of the optimizers. From this result, we can define measurable (w.r.t. $\lambda$) partitions of $E\times\mathbb{R}^d$ induced by a dyadic partition of $\mathbb{R}^d$, and construct bi-measurable discrete approximations of $T(\lambda,x)$. This approximation procedure was not needed in the simpler case studied in [3], where one of the marginals (the empirical measure) had finite support. In Section 3 we address the construction of strong couplings between orthogonal martingale measures.
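The discretization idea can be sketched in Python as follows: a map with values in $\mathbb{R}^d$ is replaced by the center of the half-open dyadic cube containing its value. The side length $2^{-k}$, the function names and the test map `T` are assumptions made for illustration. In particular, the sketch takes a measurable `T` as given, whereas the actual proof has to build the partition of $E\times\mathbb{R}^d$ from the supports of the optimal plans, before any measurable version of the map is known.

```python
import math

def dyadic_index(y, k):
    """Index n in Z^d of the half-open dyadic cube of side 2**-k containing y."""
    return tuple(math.floor(c * 2 ** k) for c in y)

def dyadic_center(n, k):
    """Center of the cube prod_j [n_j 2^-k, (n_j + 1) 2^-k)."""
    return tuple((nj + 0.5) / 2 ** k for nj in n)

def discretize(T, k):
    """Return the step map (lam, x) -> center of the cube containing T(lam, x)."""
    return lambda lam, x: dyadic_center(dyadic_index(T(lam, x), k), k)

# Hypothetical jointly measurable map on E x R^2, used only as a test case.
def T(lam, x):
    return (lam + x[0], math.sin(lam) * x[1])

T5 = discretize(T, 5)
y, y5 = T(0.3, (1.0, -2.0)), T5(0.3, (1.0, -2.0))
# Each coordinate moves by at most half the side length, here 2**-6.
print(max(abs(a - b) for a, b in zip(y, y5)) <= 2 ** -6)  # True
```

The step map takes countably many values, one per cube, and converges uniformly to `T` as `k` grows.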

2 Proof of Theorem 1.1

Let us first state an intermediary result concerning measurability properties of minimizers in the general framework. Its formulation and proof require some notions of set-valued analysis, see e.g. Chapter 14 of [9].

Theorem 2.1. The function assigning to $(\mu,\nu)$ the subset of $\mathbb{R}^{2d}$

$$\Psi(\mu,\nu) := Cl\Big(\bigcup_{\pi\in\Pi^*_c(\mu,\nu)}\operatorname{supp}(\pi)\Big), \qquad (2)$$

where $Cl$ stands for the topological closure, is measurable in the sense of set-valued mappings. That is, for any open set $\theta$ in $\mathbb{R}^{2d}$, its inverse image $\Psi^{-1}(\theta)=\{(\mu,\nu)\in(\mathcal{P}(\mathbb{R}^d))^2 : \Psi(\mu,\nu)\cap\theta\neq\emptyset\}$ is a Borel set in $(\mathcal{P}(\mathbb{R}^d))^2$.

Remark 2.2. In the case of a set-valued mapping taking closed-set values, measurability is equivalent to the fact that inverse images of closed sets are measurable (see [9]).
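For intuition, $\Psi(\mu,\nu)$ can be computed by hand when $\mu$ and $\nu$ are uniform on finitely many atoms. In that case every optimal plan is a mixture of optimal permutation plans, so the union of supports in (2) is the union over optimal permutations, and it is already closed. The Python sketch below uses the cost $|x-y|$ on the line, for which the optimal plan need not be unique; the names `psi` and `meets` and the data are hypothetical.

```python
from itertools import permutations

def psi(xs, ys, cost):
    """Union of the supports of all optimal plans between the uniform
    measures on xs and ys (same length), found by brute force.

    For uniform marginals every optimal plan is a mixture of optimal
    permutation plans, so collecting the latter's supports is enough.
    """
    n = len(xs)
    totals = {
        perm: sum(cost(xs[i], ys[perm[i]]) for i in range(n))
        for perm in permutations(range(n))
    }
    m = min(totals.values())
    support = set()
    for perm, t in totals.items():
        if t == m:
            support.update((xs[i], ys[perm[i]]) for i in range(n))
    return support

def meets(support, box):
    """True iff the finite set `support` meets the open box
    (a1, b1) x (a2, b2), given as box = ((a1, b1), (a2, b2))."""
    (a1, b1), (a2, b2) = box
    return any(a1 < x < b1 and a2 < y < b2 for x, y in support)

# Both matchings of {0, 1} to {1, 2} cost 2 for c(x, y) = |x - y|,
# so the optimal plan is not unique and Psi consists of four points.
S = psi([0, 1], [1, 2], lambda x, y: abs(x - y))
print(sorted(S))                             # [(0, 1), (0, 2), (1, 1), (1, 2)]
print(meets(S, ((-0.5, 0.5), (1.5, 2.5))))   # True
```

The function `meets` evaluates the condition $\Psi(\mu,\nu)\cap\theta\neq\emptyset$ that defines the inverse image $\Psi^{-1}(\theta)$.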

Proof. The idea of the proof is similar to the one of Theorem 1.3 in [3], where we considered the quadratic cost and the measurable structure induced by the Wasserstein topology. (In the present case, the topology in $\mathcal{P}(\mathbb{R}^d)$ and $\mathcal{P}(\mathbb{R}^{2d})$ is the usual weak one.)

We observe that $\Psi$ can be written as the topological closure of a set-valued composition,

$$\Psi(\mu,\nu)=Cl\big(U\circ S(\mu,\nu)\big) := Cl\Big(\bigcup_{\pi\in S(\mu,\nu)}U(\pi)\Big),$$

where $S$ and $U$ are the set-valued mappings respectively defined by $S(\mu,\nu):=\Pi^*_c(\mu,\nu)$ and $U(\pi):=\operatorname{supp}(\pi)$.

Measurability of $\Psi$ is equivalent to $U\circ S$ being measurable. The latter will be true as soon as $S$ is measurable and $U^{-1}(\theta)$ is open for every open set $\theta$ (see [9]).

The stability theorem for optimal transference plans of Schachermayer and Teichmann (Theorem 3 in [11]) exactly states that inverse images through $S$ of closed sets in $\mathcal{P}(\mathbb{R}^{2d})$ are closed sets in $(\mathcal{P}(\mathbb{R}^d))^2$. This, together with the fact that the mapping $S$ takes closed-set values (by lower semicontinuity of $I(\pi)$), implies that $S$ is a measurable set-valued mapping.

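On the other hand, the inverse image by $U$ of an open set $\theta$ of $\mathbb{R}^{2d}$ is

$$U^{-1}(\theta)=\{\pi\in\mathcal{P}(\mathbb{R}^{2d}) : \operatorname{supp}(\pi)\cap\theta\neq\emptyset\}=\{\pi\in\mathcal{P}(\mathbb{R}^{2d}) : \pi(\theta)>0\}.$$

It then follows by the Portmanteau Theorem that $U^{-1}(\theta)$ is an open set in $\mathcal{P}(\mathbb{R}^{2d})$, and this concludes the proof.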

Corollary 2.3. Let $(E,\Sigma)$ be a measurable space, and $\lambda\in E\mapsto(\mu_\lambda,\nu_\lambda)\in(\mathcal{P}(\mathbb{R}^d))^2$ a measurable function. We consider the function $\Psi$ defined by (2) and let $F$ be a closed set of $\mathbb{R}^d$. Then, the set [...]

Proof. Without loss of generality, we assume that $F$ is nonempty. Let us first show that for any open set $\theta$ of $\mathbb{R}^{2d}$, the set

$$G=\{z\in\mathbb{R}^d : (\{z\}\times F)\cap\theta\neq\emptyset\}$$

is open. Indeed, for $x\in G$ there exist $y\in F$ and $\epsilon>0$ such that $B(x,\epsilon)\times B(y,\epsilon)\subset\theta$. In particular, for all $z\in B(x,\epsilon)$ one has $(z,y)\in\theta$ and so $B(x,\epsilon)\subset G$. By definition of measurability, the set-valued mappings $(\lambda,x)\mapsto\{x\}\times F$ and $(\lambda,x)\mapsto\Psi(\mu_\lambda,\nu_\lambda)\cap(\{x\}\times F)$ are thus measurable (see [9], Proposition 14.11(c)). The latter mapping being also closed valued, we conclude that

[...]

is a measurable set, which finishes the proof.

Proof of Theorem 1.1. Since any $\sigma$-finite measure is equivalent to a finite one, we can assume without loss of generality that $m$ is finite.

For a fixed $k\ge 1$, we denote by $(A_{n,k})_{n\in\mathbb{Z}^d}$ the partition of $\mathbb{R}^d$ in dyadic half-open rectangles of size [...] Notice that since $A_{n,k}=\bigcup$ [...] and so $B_{n,k}$ is measurable thanks to Corollary 2.3.

Denote now by $a_{n,k}\in A_{n,k}$ the “center” of the set, and define a $\Sigma\otimes\mathcal{B}(\mathbb{R}^d)$-measurable function by

$T^k(\lambda,x)=\sum_{n\in\mathbb{Z}^d}$ [...]

For each $\lambda\in E$, let $\nu^k_\lambda$ be the discrete measure defined by pushing forward $\mu_\lambda$ through $T^k$, that is,

$\nu^k_\lambda(A)=$ [...]

where $T_\lambda$ has been defined in the statement of Theorem 1.1. This implies that [...]

We have for $\lambda\in\tilde E$ that [...] and a subsequence $T^{j_i}$ such that $\lim_{i\to\infty}\Delta'_{j_i}=0$ for all $\lambda\in$ [...] establishing (6). Next, we introduce for each $k\in\mathbb{N}$ an approximation of $h:\mathbb{R}^d\to\mathbb{R}$, a bounded Lipschitz-continuous function, by step functions [...] for all $k\in\mathbb{N}$. Thanks to (6), the sum identically vanishes for all $j\ge k$. This means that $\Delta_j\to 0$ as $j$ goes to $\infty$, and the proof is complete.

3 Application: strong coupling for orthogonal martingale measures

We now develop an application of Theorem 1.1, which is a generalization of what has been used in [3] to estimate the convergence rate of Landau type interacting particle systems. Let $(\Omega,\mathcal{F},\mathcal{F}_t,\mathbb{P})$ be a filtered space and consider $M$ an adapted orthogonal martingale measure on $\mathbb{R}_+\times\mathbb{R}^d$ (in the sense of Walsh [15]). Assume that its covariance measure has the form $q_t(da)\,dk_t$, where $q_t(\omega,da)$ is a predictable random probability measure on $\mathbb{R}^d$ with finite second moment and $k_t$ a predictable increasing process. Let us also consider another predictable random probability measure $\hat q_t(\omega,da)$ on $\mathbb{R}^d$ with finite second moment.

We want to construct in the same probability space a second martingale measure with covariance measure $\hat q_t(\omega,da)\,dk_t$, in such a way that, in some sense, the distance between the martingale measures is controlled by the Wasserstein distance between their covariance measures. This distance is defined for $\mu,\nu\in\mathcal{P}_2(\mathbb{R}^d)$ by $W_2^2(\mu,\nu)=\inf_{\pi<^{\mu}_{\nu}}I(\pi)$ with the quadratic cost $c(x,y)=|x-y|^2$. It makes the set $\mathcal{P}_2(\mathbb{R}^d)$ a Polish space, and strengthens the weak topology with the convergence of second moments (see e.g. [8]).
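On the real line, $W_2^2$ can be computed from quantile functions, $W_2^2(\mu,\nu)=\int_0^1(F_\mu^{-1}(u)-F_\nu^{-1}(u))^2\,du$, and for two Gaussians this reduces to $(m_1-m_2)^2+(s_1-s_2)^2$. The Python sketch below compares a midpoint-rule approximation of the quantile integral with the closed form. The function names, the number of cells and the two Gaussians are illustrative assumptions.

```python
from statistics import NormalDist

def w2_squared_quantiles(mu, nu, cells=20000):
    """Midpoint-rule approximation of int_0^1 (F^{-1}(u) - G^{-1}(u))^2 du,
    which equals W_2^2(mu, nu) for probability measures on the real line."""
    total = 0.0
    for i in range(cells):
        u = (i + 0.5) / cells
        total += (mu.inv_cdf(u) - nu.inv_cdf(u)) ** 2
    return total / cells

def w2_squared_gaussians(mu, nu):
    """Closed form of W_2^2 for two one-dimensional Gaussians."""
    return (mu.mean - nu.mean) ** 2 + (mu.stdev - nu.stdev) ** 2

q, q_hat = NormalDist(0.0, 1.0), NormalDist(1.0, 2.0)
# The two values should agree up to the quadrature error.
print(w2_squared_quantiles(q, q_hat), w2_squared_gaussians(q, q_hat))
```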

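Theorem 3.1. In the previous setting, assume moreover that, $\mathbb{P}(d\omega)\,dk_t(\omega)$-a.e., $q_t$ has a density with respect to Lebesgue measure in $\mathbb{R}^d$.

Then, there exists in $(\Omega,\mathcal{F},\mathcal{F}_t,\mathbb{P})$ a martingale measure $\hat M$ on $\mathbb{R}_+\times\mathbb{R}^d$ with covariance measure $\hat q_t(da)\,dk_t$, such that for all $S>0$ and for every predictable function $\phi:\mathbb{R}_+\times\Omega\times\mathbb{R}^d\to\mathbb{R}$ that is Lipschitz continuous in the last variable with $\mathbb{E}\big(\int_0^S\int\phi^2(s,a)\,(q_s(da)+\hat q_s(da))\,dk_s\big)<\infty$, one has

$$\mathbb{E}\bigg(\sup_{t\le S}\Big|\int_0^t\int\phi(s,\cdot,a)\,M(ds,da)-\int_0^t\int\phi(s,\cdot,a)\,\hat M(ds,da)\Big|^2\bigg)\le\mathbb{E}\bigg(\int_0^S(K^\phi_s)^2\,W_2^2(q_s,\hat q_s)\,dk_s\bigg), \qquad (7)$$

where $K^\phi_s(\omega)$ is a measurable version of a Lipschitz constant of $a\mapsto\phi(s,\omega,a)$ and $W_2^2$ is the quadratic Wasserstein distance in $\mathcal{P}_2(\mathbb{R}^d)$.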

Proof. Since $q_t(\omega,da)$ has a density for almost every $(t,\omega)$, assumption $H(q_t(\omega,da),\hat q_t(\omega,da),c)$ is satisfied. We can therefore apply Theorem 1.1 to $(E,\Sigma,m)=(\Omega\times\mathbb{R}_+,\mathcal{P}red,\mathbb{P}(d\omega)\,dk_t(\omega))$, where $\mathcal{P}red$ is the predictable $\sigma$-field with respect to $\mathcal{F}_t$. Then, there exists a predictable mapping $T:\mathbb{R}_+\times\Omega\times\mathbb{R}^d\to\mathbb{R}^d$ that for $m$-almost every $(t,\omega)$ pushes forward $q_t$ to $\hat q_t$. Moreover, for a.e. $(t,\omega)$, one has

$$\int|a-T(t,\omega,a)|^2\,q_t(\omega,da)=W_2^2(q_t,\hat q_t).$$

One can thus define a martingale measure $\hat M$ by the stochastic integrals

$$\int_0^t\int\psi(s,a)\,\hat M(ds,da) := \int_0^t\int\psi(s,T(s,a))\,M(ds,da)$$

for predictable simple functions $\psi$. Its covariance measure is by construction $\hat q_t(da)\,dk_t$, and by Doob's inequality, the left hand side of (7) is less than

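$$\mathbb{E}\bigg(\int_0^S\int|\phi(s,a)-\phi(s,T(s,a))|^2\,q_s(da)\,dk_s\bigg)\le\mathbb{E}\bigg(\int_0^S(K^\phi_s)^2\int|a-T(s,a)|^2\,q_s(da)\,dk_s\bigg)\le\mathbb{E}\bigg(\int_0^S(K^\phi_s)^2\,W_2^2(q_s,\hat q_s)\,dk_s\bigg),$$

by definition of $T$ and of $W_2^2$.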
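The Lipschitz step in the chain of inequalities above can be checked numerically at a fixed time in a toy case. In the Python sketch below, $q=N(0,1)$, $\hat q=N(1,4)$, the map $T(a)=1+2a$ is the increasing (hence quadratic-optimal) map from $q$ to $\hat q$, and $\phi=\sin$ has Lipschitz constant $K=1$. All of these choices, as well as the helper `integrate`, are assumptions made for illustration only.

```python
import math
from statistics import NormalDist

def integrate(f, q, cells=2000):
    """Approximate int f(a) q(da) by averaging f over midpoint quantiles of q."""
    return sum(f(q.inv_cdf((i + 0.5) / cells)) for i in range(cells)) / cells

# Hypothetical data: q = N(0, 1), q_hat = N(1, 4), phi = sin with K = 1.
q = NormalDist(0.0, 1.0)
T = lambda a: 1.0 + 2.0 * a   # increasing map pushing q forward to q_hat
phi, K = math.sin, 1.0

lhs = integrate(lambda a: (phi(a) - phi(T(a))) ** 2, q)
rhs = K ** 2 * integrate(lambda a: (a - T(a)) ** 2, q)  # approximates K^2 W_2^2
print(lhs <= rhs, lhs, rhs)
```

Since $|\sin a-\sin T(a)|\le|a-T(a)|$ pointwise, the inequality also holds for the discretized integrals, and `rhs` is close to $W_2^2(q,\hat q)=2$.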

Remark 3.2. In general, a coupling satisfying the estimate (7) can be obtained by constructing an orthogonal martingale measure $\tilde M(dt,da,da')$ on $\mathbb{R}_+\times\mathbb{R}^d\times\mathbb{R}^d$ with covariance measure $\pi_t(da,da')\,dk_t$, $\pi_t$ being an optimal transference plan between $q_t$ and $\hat q_t$. Then, the marginal martingale measures $\tilde M(dt,da,\mathbb{R}^d)$ and $\tilde M(dt,\mathbb{R}^d,da')$ are orthogonal martingale measures with the required covariances. Our argument provides a simple way of constructing $\tilde M(dt,da,da')$ when the realization of one of the “marginal” martingale measures is given in advance.
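A discrete, single time step analogue of this construction can be sketched in Python. A noise $M$ gives independent centered Gaussian masses to finitely many atoms, with variances given by a discrete measure $q$. The coupled noise puts the same mass on the pair $(a,T(a))$, and its second marginal is the push-forward of $M$ through $T$. The atoms, the weights and the map `T` are hypothetical; in particular `T` is simply given here, not obtained from an optimal transport problem.

```python
import random

def push_forward_noise(atoms, weights, T, rng):
    """One time step of a discrete analogue of the coupling.

    M gives each atom a an independent centered Gaussian mass of variance
    q({a}); M_tilde puts that same mass on the pair (a, T(a)); M_hat is the
    second marginal of M_tilde, i.e. the push-forward of M through T.
    """
    M = {a: rng.gauss(0.0, w ** 0.5) for a, w in zip(atoms, weights)}
    M_tilde = {(a, T(a)): mass for a, mass in M.items()}
    M_hat, q_hat = {}, {}
    for a, w in zip(atoms, weights):
        b = T(a)
        M_hat[b] = M_hat.get(b, 0.0) + M[a]
        q_hat[b] = q_hat.get(b, 0.0) + w  # variance of M_hat({b})
    return M, M_tilde, M_hat, q_hat

# Hypothetical data: q uniform on four atoms, T merges them in pairs.
atoms, weights = [0, 1, 2, 3], [0.25, 0.25, 0.25, 0.25]
T = lambda a: 10 * (a // 2)
M, M_tilde, M_hat, q_hat = push_forward_noise(atoms, weights, T, random.Random(0))

psi = lambda b: b + 1.0
# Integrating psi against M_hat equals integrating psi(T(.)) against M.
left = sum(psi(b) * mass for b, mass in M_hat.items())
right = sum(psi(T(a)) * mass for a, mass in M.items())
print(abs(left - right) < 1e-12, q_hat)  # True {0: 0.5, 10: 0.5}
```

The dictionary `q_hat` is the image of $q$ under `T`, which plays the role of the covariance measure of the second marginal.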

An immediate consequence is

Corollary 3.3. Let the space $(\Omega,\mathcal{F},\mathcal{F}_t,\mathbb{P})$, the martingale measure $M$ and the random measure $q_t(da)\,dk_t$ be as in Theorem 3.1. Assume that $(q^n_t(\omega,da))_{n\in\mathbb{N}}$ is a sequence of predictable random probability measures on $\mathbb{R}^d$ in the same probability space, such that for all $t>0$, $\int_0^t W_2^2(q_s,q^n_s)\,dk_s$ goes to $0$ in $L^1(\mathbb{P})$ when $n\to\infty$.

Then, there exists a sequence $(M^n)$ of orthogonal martingale measures in $(\Omega,\mathcal{F},\mathcal{F}_t,\mathbb{P})$ with covariance measures $(q^n_t(da))$ such that for all $t>0$ and all $\phi$ with $\|\sup_{s\le t}K^\phi_s\|_{L^\infty(\mathbb{P})}<\infty$, $\int_0^t\int\phi(s,a)\,M^n(ds,da)$ converges to $\int_0^t\int\phi(s,a)\,M(ds,da)$ in $L^2(\mathbb{P})$.

Acknowledgements We thank the anonymous referees for pointing out a gap in the proof of Theorem 1.1 and for valuable comments that allowed us to improve the presentation of this work.

References

[1] EL KAROUI N., LEPELTIER J.P.: Représentation des processus ponctuels multivariés à l’aide d’un processus de Poisson. Z. Wahrsch. Verw. Geb. 39 (1977), 111–133. MR0448546

[2] EL KAROUI N., MÉLÉARD S.: Martingale measures and stochastic calculus. Probab. Th. Rel. Fields 84 (1990), 83–101. MR1027822

[3] FONTBONA J., GUÉRIN H., MÉLÉARD S.: Measurability of optimal transportation and convergence rate for Landau type interacting particle systems. Probab. Th. Rel. Fields 143 (2009), 329–351. MR2475665

[4] GANGBO W., MCCANN R.J.: The geometry of optimal transportation. Acta Math. 177 (1996), 113–161. MR1440931

[6] GUÉRIN H.: Existence and regularity of a weak function-solution for some Landau equations with a stochastic approach. Stochastic Process. Appl. 101 (2002), 303–325. MR1931271

[7] MÉLÉARD S., ROELLY S.: Discontinuous measure-valued branching processes and generalized stochastic equations. Math. Nachr. 154 (1991), 141–156. MR1138376

[8] RACHEV S.T., RÜSCHENDORF L.: Mass Transportation Problems, Vol. I. Probability and its Applications, Springer (1998). MR1619171

[9] ROCKAFELLAR R.T., WETS R.J-B.: Variational Analysis. Grundlehren der mathematischen Wissenschaften 317, Springer (1998). MR1491362

[10] RÜSCHENDORF L.: The Wasserstein distance and approximation theorems. Z. Wahrsch. Verw. Gebiete 70 (1985), 117–129. MR0795791

[11] SCHACHERMAYER W., TEICHMANN J.: Characterization of optimal transport plans for the Monge-Kantorovich-problem. Proc. Amer. Math. Soc. 137 (2009), 519–529. MR2448572

[12] TANAKA H.: Probabilistic treatment of the Boltzmann equation of Maxwellian molecules. Z. Wahrsch. Verw. Geb. 46 (1978), 67–105. MR0512334

[13] VILLANI C.: Topics in Optimal Transportation. Graduate Studies in Mathematics Vol. 58, AMS (2003). MR1964483

[14] VILLANI C.: Optimal transport, old and new. Grundlehren der mathematischen Wissenschaften 338, Springer (2009). MR2459454