Martingales - Conditional Measures and Algebras

Conditional Measures and Algebras

5.2 Martingales

5.2 Martingales 127

εμ(E)

E(fA)

f f1

as required. The next lemma will be a very useful generalization of this simple observation, which is an analog of a maximal inequality (Theorem 2.24).

Lemma 5.6 (Doob’s inequality). Letf ∈L¹(X,B, μ), let A1⊆A2⊆ · · · ⊆AN ⊆B

be an increasing list ofσ-algebras, and ﬁx λ >0. Let E={x| max

1iN

E(fAi)> λ}. Then

μ(E) 1 λf1.

If (An)_n₁ is an increasing (or decreasing) sequence of σ-algebras then the same conclusion holds for the set

E={x|sup

E fAi

> λ}.

Proof.Assume thatf 0 (if necessary replacingfby|f|, which makesμ(E) no smaller). Let

En={x|E(fAn)> λbut E(fAi)λfor 1in−1}. Then E = E1 · · · EN and En ∈ An since A1,A2, . . . ,An−1 ⊆An. (In the decreasing case of ﬁnitely many σ-algebras we may reverse the order of theσ-algebras, since the statement we wish to prove is independent of the order.) It follows that

fdμ= N n=1

fdμ

= N n=1

E(fAn) dμ

N n=1

λμ(E_n) =λμ(E).

TakingN→ ∞shows the ﬁnal remark.

Proof of Theorem5.5. Using Theorem5.1(4), we may replace the functionf byE(fA) without changingE(fAn).

The theorem holds for allf ∈L¹(X,An, μ),n1. Now

n1L¹(X,An, μ) is dense inL¹(X,A, μ). To see this, notice that

{B∈A |for every ε >0 there existm1, A∈Am withμ(A B)< ε} is aσ-algebra by Theorem A.7. Given anyf ∈L¹(X,A, μ) andε >0, ﬁndm andg∈L¹(X,Am, μ) withf−g1< ε, so that

E(fAn)−f1E(fAn)−E(gAn)1+E(gAn)−g1

= 0 fornm

+g−f1<2ε

fornm. It follows that μ

{x|lim sup

n→∞

E(fAn)−f>√ ε}

=μ

{x|lim sup

n→∞

E(f−gAn)−(f −g)>√ ε}

{x|sup

E(f −gAn)> ¹₂√ ε}

+μ

{x| |f−g|> ¹₂√ ε} ^√²_εf−g1+√²εf−g14√

by Lemma5.6, so

lim sup

n→∞

E(fAn)−f= 0

almost everywhere, showing the almost everywhere convergence.

A similar result holds for decreasing sequences of σ-algebras as follows.

The notation An A_∞ used below means that An+1 ⊆An for all n 1 and

A_∞=

An.

Example 5.7.LetBdenote the Borelσ-algebra on [0,1] and let An={B∈B|B+₂¹_n =B (mod 1)} so thatAn N ={∅, X} modulo m (meaning that

n1An =

m{∅, X}, wheremdenotes Lebesgue measure on [0,1]). As before, what is the connec- tion between the convergence ofσ-algebras and the convergence ofE(fAn)?

As mentioned at the start of this section, the kind of convergence sought here resembles an ergodic theorem⁽⁵⁸⁾. Indeed, the proof is similar in some ways to the proofs of the ergodic theorems (Theorems 2.21 and 2.30). The usual proof of the decreasing martingale theorem is somewhat opaque because it takes place in L¹ rather than inL², forcing us to replace the geometric methods available in Hilbert space with more flexible methods from functional analysis. To illuminate the different approaches—and the more geometrical approach that working inL²allows—we give two different arguments for the

5.2 Martingales 129

ﬁrst part of the proof. Of course the theorem itself is an assertion aboutL¹ convergence, so at some point we must work inL¹.

Theorem 5.8 (Decreasing martingale theorem). Let (X,B, μ) be a probability space. If An A∞ is a decreasing sequence of sub-σ-algebras ofB then

E(fAn)−→E(fA∞) almost everywhere and inL¹, for any f ∈L¹(X,B, μ).

First part of proof of Theorem 5.8, using L².Recall from the proof of Theorem 5.1 that in L²(X,B, μ) the conditional expectation with respect toAn (orA_∞) is precisely the orthogonal projection to L²(X,An, μ) (resp.L²(X,A∞, μ)). LetVn=L²(X,An, μ)^⊥ and letV_∗=

n1Vn. Notice that forf ∈L²(X,A_∞, μ) +V_∗ the theorem holds trivially because

E(fAn) =E(fA∞) for suﬃciently largen. We claim that

V =L²(X,A∞, μ) +V_∗

is dense in L²(X,B, μ) with respect to the L²_μ norm. To see this, we may use the Riesz representation theorem (see Sect. B.5). If V is not dense inL²(X,B, μ), then there is a continuous non-zero linear functional

f →

f¯hdμ deﬁned by someh∈L²(X,B, μ) such that

f¯hdμ= 0

for allf ∈V, and this leads to a contradiction as follows. Clearly h−EhAn

∈V_n⊆V_∗,

h−E(hAn)¯hdμ= 0.

Sincef →E(fAn) is the orthogonal projection, we also have h−E(hAn)E(hAn) dμ= 0, which implies that

h−E(hAn)²dμ= 0

and soh=E(hAn)∈L²(X,An, μ) for alln1. We conclude that h∈L²(X,A_∞, μ)⊆V,

and

h¯hdμ = 0, so h = 0. This contradiction shows that V is dense inL²(X,B, μ) with respect to theL²_μ norm.

Now · 1 · 2 andL²(X,B, μ)⊆L¹(X,B, μ) is dense with respect to the L¹_μ norm. It follows that V is also dense inL¹(X,B, μ) with respect

to theL¹_μ norm.

It might seem unsatisfactory to useL²arguments in this way to avoid the more complicated theory of the spaceL¹and its dualL^∞. To give an example of how it is sometimes possible to decompose functions in a way that mimics the orthogonal decomposition available in Hilbert space, we now do the same part of the proof avoidingL².

First part of proof of Theorem5.8, usingL¹ directly.Let Vn ={f ∈L¹(X,B, μ)|E

fAn

= 0}

forn1, soV₁⊆V₂⊆ · · · is an increasing sequence of subspaces ofL¹(X).

We claim thatV_∗ =

n1V_n is L¹-dense in V_∞={f ∈L¹(X,B, μ)|E

fA∞

= 0}.

This claim will be crucial for the proof, since it will allow us to split any functionf into two parts for which the result will be easier to prove.

By the Hahn–Banach theorem (Theorem B.1), V_∗ is dense in V_∞ if any continuous linear functionalΛ:L¹(X)→RwithV_∗⊆kerΛhasV_∞⊆kerΛ.

Any continuous linear functional onL¹(X) has the form Λh(f) =

f hdμ

for some h ∈ L^∞(X), and h is uniquely determined by Λh. So suppose thatVn⊆kerΛh for alln1; it follows that

(f−E(fAn))hdμ= 0

for allf ∈L¹(X) andn1. In particular, we may takef =h(sinceL^∞(X) is a subset ofL¹(X)), so

(h−E(hAn))hdμ= 0.

On the other hand, by Theorem5.1(3),

E(hAn)E(hAn) dμ=

E(hAn)hAn

dμ=

E hAn

hdμ

5.2 Martingales 131

(h−E(hAn))E(hAn) dμ= 0.

Now

(h−E(hAn))h−(h−E(hAn))E(hAn) =

h−E(hAn)2

and therefore

h−E(hAn)2

dμ= 0.

It follows that h = E(hAn) ∈ L^∞(X,An, μ), and so h ∈ L^∞(X,A∞, μ).

Thus

E(fA∞) = 0 implies that

f hdμ=

E(f hA_∞) dμ=

hE(fA_∞) dμ= 0, showing that kerΛh⊇V_∞ whenever kerΛh⊇V_∗ as required.

Clearly the theorem holds for functions in the space V =L¹(X,A∞, μ) +V_∗,

which isL¹-dense inL¹(X) (to see that this space is dense, write anyf ∈L¹ asf =E

fA∞ +

f−E

fA∞

where the second term belongs toV_∞).

The remainder of the proof of Theorem 5.8 of necessity takes place inL¹(X,B, μ).

Second part of proof of Theorem 5.8. Given f ∈ L¹(X) and ε > 0, ﬁndg∈V with

f−g1< ε.

Then

E(fAn)−E(fA∞)dμ E

(f −g)An

−E

(f−g)A∞dμ + E(gAn)−E(gA_∞)dμ

|f−g|dμ+ E(gAn)−E(gA_∞)dμ, so

lim sup

n→∞

E(fAn)−E(fA∞)dμ2

|f−g|dμ2ε, which shows the convergence inL¹.

To see the almost everywhere convergence, notice that μ

{x|lim sup

n→∞

E(fAn)−E(fA∞)>√ ε}

{x|lim sup

n→∞

(f−g)An

−E

(f−g)A∞ + lim sup

n→∞

E gAn

−E

gA_∞>√ ε}

{x|sup

(f−g)An

−E

(f−g)A_∞>√ ε}

{x|sup

(f−g)An ¹₂√ ε +μ

{x|sup

(f −g)A∞>¹₂√ ε} ^√²_εf−g1+√²εf−g14√

ε,

by Doob’s inequality (Lemma5.6), so lim sup

n→∞

E(fAn)−E(fA∞)= 0

almost everywhere.

Exercises for Sect. 5.2

Exercise 5.2.1.Use the increasing martingale theorem (Theorem 5.5) to prove the following version of the Borel–Cantelli lemma (Theorem A.9). Sup- pose that (X,B, μ) is a probability space and (A_n)_n₁ is a completely independent sequence of measurable sets (that is, for any ﬁnite sequence of indices i1 < · · · < i we have μ(Ai₁∩ · · · ∩Ai) = μ(Ai₁)· · ·μ(Ai)). If additionally

∞ n=1

μ(An) =∞,

then almost everyxis contained in inﬁnitely many of the setsAn; equivalently μ

_∞

N=1

∞ n=N

A_n

= 1.

Exercise 5.2.2.Use the martingale theorems to prove the following analog of the Lebesgue density theorem (Theorem A.24). Letmbe Lebesgue measure on the cubeC= [0,1]^d. Forn1 deﬁne the partitionξ_n ofC into boxes

Dalam dokumen Ergodic Theory (Halaman 145-152)