Directory UMM :Journals:Journal_of_mathematics:GMJ:

(1)

ON OPTIMAL STOPPING OF INHOMOGENEOUS STANDARD MARKOV PROCESSES

B. DOCHVIRI

Abstract. The connection between the optimal stopping problems for inhomogeneous standard Markov process and the corresponding homogeneous Markov process constructed in the extended state space is established. An excessive characterization of the value-function and the limit procedure for its construction in the problem of optimal stopping of an inhomogeneous standard Markov process is given. The form ofε-optimal (optimal) stopping times is also found.

1. Introduction and Statement of the Problem

General questions of the theory of optimal stopping of homogeneous standard Markov processes are set forth in the monograph [1]. In various restrictions on the payoff function there are given an excessive characteriza-tion of the value, the methods of its construccharacteriza-tion, and the form ofε-optimal and optimal stopping times.

In the present work the questions of optimal stopping theory for inhomo-geneous (with infinite lifetime) standard Markov processes are studied. By means of extension of the state space and the space of elementary events it is possible to reduce the problems of optimal stopping for the inhomogeneous case to the corresponding problems for homogeneous standard Markov pro-cesses from which the excessive characterization of the value-function, the method of its construction, and the form of ε-optimal (optimal) stopping times for the initial problem are found.

Note that the idea of reducing the inhomogeneous Markov process to the homogeneous one is wellknown. However, it is nontrivial to construct the appropriate homogeneous Markov process in the extended state space, to show the measurability of transition probabilities, and to prove the coinci-dence of the corresponding value-functions.

1991Mathematics Subject Classification. 62L15, 93E20, 60J25.

Key words and phrases. ε-optimal stopping time, excessive function, inhomogeneous

case, extension of state space, universal completion.

335

(2)

It should be noted that using the method of extension of state space in the paper [2], the form ofε-optimal stopping times has been established for the case of optimal stopping of homogeneous Markov processes on bounded time interval.

Consider now the inhomogeneous (with infinite lifetime) standard Markov process

X =Ω,Ms ,Ms

t, Xt, Ps,x, 0≤s≤t <∞, in the state space (S,B), i.e., it is assumed that:

(1) S is a locally compact Hausdorff space with a countable base, B is theσ-algebra of Borel sets of the space;

(2) for everys≥0,x∈S,Ps,xis a probability measure on theσ-algebra

Ms_; _Ms

t, t≥s, is the increasing family of sub-σ-algebras of the σ-algebra

Ms_{, where}

Ms1 _{⊇ M}s2_, _Ms

t⊆ Muv, for s1≤s2, u≤s≤t≤v,

it is assumed as well that

Ms₌_Ms_, _Ms t=M

s t =M

s

t+, 0≤s≤t <∞,

where Ms _{is a completion of} _Ms _{with respect to the family of measures}

{Pu,x, u ≤s, x∈ S}, Mst is the completion of Mst in Ms with respect to the same family of measures ([3], Ch.I, Sect. 5);

(3) the paths of the processX= (Xt(ω)),t≥0, are right continuous on the time interval [0,∞);

(4) for eacht≥0 the random variablesXt(ω) (with values in (S,B)) are

Ms

t-measurable, t≥s, where it is supposed that

Ps,x

ω:Xs(ω) =x

= 1

and the functionPs,x(Xs+h∈B) is measurable in (s, x) for the fixedh≥0, B∈ B (with respect toB[0,∞)⊗ B);

(5) the processX is strong Markov: for every (Ms

t,t≥s)-stopping time τ (i.e.,{ω:τ(ω)≤t} ∈ Ms

t,t≥s) we should have

Ps,x

Xτ+h∈B|Msτ

=Pτ, Xτ, τ+h, B

({τ <∞}, Ps,x-a.s.),

where

Ps, x, s+h, B≡Ps,x

Xs+h∈B

;

(6) the process X is quasi-left-continuous: for every nondecreasing se-quence of (Ms

t,t≥s)-stopping timesτn↑τ should be

Xτn→Xτ ({τ <∞}, Ps,x-a.s.).

Letg(t, x) be the Borel measurable function (i.e., measurable with respect to the productσ-algebraB′ ₌_B_[0,_∞₎_{⊗ B}_{) which is defined on [0,}_∞₎_×_S

(3)

Assume now the following integrability condition of the random process g(t, Xt(ω)),t≥0:

Es,xsup t≥s

g−t, Xt<∞, s≥0, x∈S. (1)

The problem of optimal stopping of the process

X =Ω,Ms,Mst, Xt, Ps,x

, 0≤s≤t <∞,

with the payoff g(t, x) is stated as follows: the value-function v(s, x) is introduced as

v(s, x) = sup τ∈Ms

Es,xg

τ, Xτ

, (2)

where M_s _{is the class of all finite (P}_s,x_{-a.s.) (}_Ms

t, t ≥s)-stopping times and it is required to find the stopping timeτε(for eachε≥0) for which

Es,xg

τε, Xτε

≥v(s, x)−ε

for anyx∈S.

Such a stopping time is calledε-optimal, and in the case of ε = 0 it is called simply optimal stopping time.

To constructε-optimal (optimal) stopping times it is needed to charac-terize the valuev(s, x) and for this purpose the following notion of excessive function turns out to be fundamental.

The functionf(t, x) given on [0,∞)×Sand taking its values in (−∞,+∞] such that it is measurable with respect to the universal completion B′∗ _of

theσ-algebraB′ ₌_B_[0,_∞₎_{⊗ B}_{, is called excessive (relative to}_{X) if}

1) Es,xf−

t, Xt

<∞, 0≤s≤t <∞, x∈S,

2) Es,xft, Xt≤f(s, x), t≥s, x∈S, (3)

3) Es,xf

t, Xt

→f(s, x), if t↓s, x∈S.

2. Construction of the Homogeneous Standard Markov Process in the Extended State Space

Let us introduce now the new space of elementary events Ω′_{= [0,}_∞₎_×_Ω

with elements ω′ = (s, ω), the new state space (the extended state space) S′ = [0,∞)×S with the σ-algebra B′ ₌ _B_[0,_∞₎_{⊗ B}_{, the new random}

processX′ with values in (S′_,_B′₎

Xt′(ω′) =Xt′(s, ω) =

s+t, Xs+t(ω)

, s≥0, t≥0,

and the translation operators Θ′

t:

Θ′

(4)

where it is obvious that

Xu′

Θ′t(ω′)

=Xu′+t(ω′), u≥0, t≥0.

Introduce in the space Ω′ _the_σ-algebra:

N0=σXu′, u≥0

, Nt0=σ

Xu′,0≤u≤t

and on theσ-algebraN0_{introduce the probability measures}

P′

x′(A) =P₍′_s,x₎(A)≡P_s,x(A_s),

whereA∈N0, andAs is the section ofAat the points:

As=ω: (s, x)∈A,

where it is easy to see that As ∈ Fs≡σ(Xu, u=s) and if a∈ Nt0, then As∈ Fss+t≡σ(Xu, s≤u≤s+t).

Consider the function

P′h, x′, B′≡Px′′

Xh′ ∈B′

.

We have to verify that this function is measurable inx′ _{for fixed}_h_≥_{0. For}

the rectanglesB′= Γ×B which generate theσ-algebraB′ _{we have}

P′h, x′, B′=Ps,x

ω: (s+h, Xs+h(ω))∈Γ×B

= =I(s+h∈Γ)Ps,x

Xs+h∈B

;

therefore for the rectangles the function P′_{(h, x}′_{, B}′_{) is measurable in} _x′_.

Consider now the class of all sets B′_, _B′ _{∈ B}′ _{for which the function}

Px′′(X_h′ ∈ B′) is B′-measurable in x′. It is easy to verify that this class of sets satisfies all the requirements of the monotone class theorem; there-fore it coincides with theσ-algebraB′_.

Thus the function P′_{(h, x}′_{, B}′_{) is measurable in} _x′_{, and hence we can}

introduce the measures P′

µ′ on the σ-algebra N0 for every finite measure µ′ on (S′_,_B′_{) by averaging} _P′

x′ with respect toµ′ [3]. Let us perform the completion ofσ-algebraN0 with respect to the family of all measuresPµ′′, denote this completion byN′ and then perform the completion of each σ-algebra Nt0 in N′ with respect to the same family of measures denoting

them byN′

t.

The following key result (in a somewhat different form) was proved in the paper [2].

Theorem 1. _{The random process}

X =Ω′_{, N}′_{, N}′

t, Xt′,Θ′t, Px′′

, t≥0,

(5)

Proof. The main step in the proof is to verify that the process (Ω′_{, N}0_{, N}0

t+,

X′

t,Θ′t, Px′′),t≥0, is strong Markov, i.e., we have to show that

E_x′′

f′(Xτ′′₊_h)·I₍_τ′_<_∞)

=E_x′′

M_X′ ′

τ′f

′_(X′

h)I(τ′_<_∞)

, (4)

where f′_(x′_{) is an arbitrary bounded}_B′_{-measurable function and} _τ′ _{is an}

arbitraryNt0+-stopping time. Using again the monotone class theorem, it

is clear that this relation suffices to be proved for the indicator functions

f′(x′) =I(s∈Γ)·I(x∈B).

Thus it is needed to check that

Es,x

I(s+τ′₍_s,ω₎₊_h_∈Γ)·I₍_X

s+τ′_(s,ω)+h∈B)· · ·I(τ′(s,ω)<∞)

=

=Es,x

I(s+τ′₍_s,ω₎₊_h_∈Γ)P_u,y(X_u₊_h∈B)

u=s+τ′

(s,ω)

y=X_s+τ′_(s,ω)

·I(τ′₍_s,x₎_<_∞)

.

For this purpose we shall use the following

Lemma 1. _If _τ′_(ω′₎_{is an} _N0

t+-stopping time, then τ(ω) =s+τ′(s, ω)

is a(Fs

t+,t≥s)-stopping time, whereFts=σ(Xu, s≤u≤t),t≥s.

Proof. Indeed, we have

ω:τ(ω)< t=ω:τ′(s, ω)< t−s=

=ω′ :τ′(ω′₎_{< t}₋_s

s,

but (ω′ _:_τ′_(ω′₎_{< t}₋_s)_∈_N0

t−s; therefore the section (ω′ :τ′(ω′)< t−s)s belongs toFs

t. Thusτ(ω) is a (Fts+,t≥s)-stopping time.

It follows from this Lemma that since Fs

t+ ⊆ Mst+ =Mst, the variable τ(ω) = s+τ′_{(s, ω) is a (}_Ms

t, t ≥ s)-stopping time. Hence the desired relation to be proved admits the following form:

Es,x

I(τ+h∈Γ)·I(Xτ+h∈B)×I(τ <∞)

=

=Es,x

I(τ+h∈Γ)Pu,y(Xu+h∈B)

u=τ

y=Xτ

·I(τ <∞)

,

which obviously is a consequence of the strong Markov property of the processX.

We know from Proposition 7.3, Ch. I in [3] that the strong Markov pro-perty (4) of the process X′ _{remains true for arbitrary}_N′

t, t ≥ 0-stopping times τ′ _{and from Proposition 8.12, Ch. I in [3] we get that} _N′

t = Nt′+.

Quasi-left-continuity of the process X′ _{now easily follows from the same}

(6)

3. The Optimal Stopping Problem for Processes_X and_X′ _and the Connection Between Them

Letf(x′_{) =}_f_{(s, x) be an arbitrary Borel measurable function (i.e.,} _B′

-measurable) which is given on S′ _{and takes its values in (}_−∞_,₊_∞_].

Con-sider the following sets:

A=ω′ : lim t↓0f(X

′

t) =f(X0′)

,

B=ω′ : the path f(X′

t(ω′)) is right continuous on [0,∞)

.

Obviously, sectionsAsand Bsare of the following form:

As=

ω: lim

t↓sf(t, Xt(ω)) =f(s, Xs(ω))

,

Bs=

ω: the path f(t, Xt(ω)) is right continuous on [s,∞)

.

Lemma 2. _{The sets} _A_and_B _{belong to} _N0∗ ₍_N0∗ _{is the universal}

com-pletion ofN0) and sections AsandBs belong toFs∗ (Fs∗ is the universal completion of Fs₌_σ(X

u, u≥s)). Further we have

Ps,x′ (A) =Ps,x(As), Ps,x′ (B) =Ps,x(Bs). (5)

Proof. The setAcan be written as follows:

A=ω′ : lim k→∞₀_<t<sup1

k

f(X′

t(ω′)) = lim k→∞0<t<inf1

k

f(X′

t(ω′)) =f(X0′(ω′))

.

Since

ω′ : sup

0<t<1 k

f(X′

t(ω′))> a

= pr_Ω′

(t, ω′_{) : 0}_{< t <} 1

k, f(X

′

t(ω′))> a

,

ω′: inf

0<t<_k1 f(X′

t(ω′))< a

= pr_Ω′

(t, ω′_{) : 0}_{< t <} 1

k, f(X

′

t(ω′))< a

,

we get from Theorem 13, Ch. III in [4] that the latter sets areN0_-analytic

and hence they belong to the universal completion of N0_{. Thus the set}

A itself belongs to N0∗. As for the set B, we get from Theorem 34, Ch. IV in [4] that this set is the completion of theN0-analytic set, henceB ∈ N0∗. The same reasoning shows that As and Bs belong to the universal completionFs∗ _{of the}_σ-algebra_Fs_{. For the measure}_P′

s,x and for the sets AandB belonging to the universal completion ofN0 there obviously exist the setsA1,A2,B1,B2 belonging toN0 such that

A1⊆A⊆A2, B1⊆B⊆B2,

Ps,x′ (A1) =Ps,x′ (A) =Ps,x′ (A2),

(7)

But by the definition of the measureP′

s,x we have

Ps,x′ (A1) =Ps,x′ (A1s), Ps,x′ (A2) =Ps,x(A2s),

Ps,x′ (B1) =Ps,x′ (B1s), Ps,x′ (B2) =Ps,x(Bs2).

From these relations and the inclusionsA1s⊆As ⊆A2s andB1 ⊆B ⊆B2 it easily follows that (5) is true.

Now we return to the optimal stopping problem of the process X with

the payoff functiong(t, x) satisfying the integrability condition (1)

Ms,xsup t≥s

g−t, Xt

<∞, s≥0, x∈S.

We get from Lemma 2 that the set

As=ω: lim

t↓sg(t, Xt(ω)) =g(s, Xs(ω))

belongs to the universal completion Fs∗ _{of the} _σ-algebra _Fs_{, hence it has} the probability measure

Ps,xω: lim

t↓sg(t, Xt) =g(s, Xs)

=Ps,xω: lim

t↓sg(t, Xt) =g(s, x)

.

The second condition we need from the random process g(t, Xt),t ≥0, consists in the requirement that these probabilities should be equal to 1

Ps,x

ω: lim

t↓sg(t, Xt) =g(s, x)

= 1, s≥0, x∈S. (6)

Simultaneously with the problem of optimal stopping of the processX let us consider the problem of optimal stopping of the process

X′=Ω′, N′, Nt′, Xt′,Θ′t, Px′′

, t≥0

with the same payoffg(x′_{) =}_{g(s, x) (x}′_{= (s, x)) satisfying the conditions}

E′

x′sup t≥0

g−_X′

t

<∞, x′_∈_S′_, ₍₇₎

Px′′

ω′: lim t↓0g(X

′

t) =g(x′)

= 1, x′ ∈S′, (8)

and with the valuev′_(x′_{) defined by}

v′(x′_{) = sup}

τ′_∈_M′ Mx′′g

Xτ′′

, (9)

(8)

Observe that the condition (8) shows that the function g(x′_{) is finely}

continuous (relative toX′_{) but, as it is well known, the paths}_g(X′

t(ω′)) are then (P′

x′-a.s.) right continuous (Theorem 4.8, Ch. II in [3]), that is,

P_s,x′ (B) = 1, x′∈S′,

where

B=ω′: the path g(X′

t(ω′)) is right continuous on [0,∞)

.

Now again by Lemma 2 we obtain

Ps,xω: the path g(t, Xt(ω)) is right

continuous on [s,∞)= 1, s≥0, x∈S. (10)

Thus we have obtained an interesting fact that condition (6) and condi-tion (10) are equivalent.

Our next step consists in establishing the connection between the value-functionsv(s, x) andv′_{(s, x).}

Lemma 3. _{The values of the initial optimal stopping problem} ₍₉₎

coin-cide

v(s, x) =v′(s, x), s≥0, x∈S. (11)

Proof. Consider first the (N′

t,t≥0)-stopping timeτ′. By Proposition 7.3, Ch. I in [3] for τ′ _{and fixed}_x′ _{= (s, x) there exists (N}0

t+,t ≥0)-stopping

time_eτ′ _{such that}_P′

x′(τ′=eτ′) = 1. We have

E′

x′g

X′

e

τ′

=Es,xg

s+eτ′_{(s, ω), X}

s+eτ′₍_s,ω₎

=

=Es,xg

τ(ω), Xτ(ω)

,

where s+τe′(s, ω)≡ τ(ω) is the (Ms

t, t ≥s)-stopping time by Lemma 1. Whence it is obvious that

v′(s, x)≤v(s, x).

It remains to establish the opposite inequality. Denote byMn

s the class of all (Ms

t,t≥s)-stopping times taking its values from the finite set

s, s+ 2−n

, . . . , s+k·2−n

, . . . , s+n.

Obviously,

Mn_s _⊆Mn_s+1, n= 1,2, . . . .

For everyτ ∈M_s _{define the sequence}_τ_n _{of stopping times}

τn=

(

(9)

It is clear thatτn∈Mns and starting from somen(ω) the sequenceτn(ω) decreases to τ(ω). Using condition (10) and the right continuity of paths g(t, Xt(ω)),t≥s(Ps,x-a.s.), we can write

g(τ, Xτ

= lim n→∞g

τn, Xτn

(Ps,x-a.s.).

Hence by Fatou’s lemma we get

Es,xg

τ, Xτ

≤lim

n Es,xg

τn, Xτn

.

Consequently

v(s, x) = sup τ∈∪nMns

Es,xg

τ, Xτ

= lim n→∞_τsup_∈Mn

s

Es,xg

τ, Xτ

.

Consider separately the expression

sup τ∈Mn

s

Es,xg

τ, Xτ

which represents the value in the problem of optimal stopping of the se-quence

g(s+k2−n

, Xs+k2−n),Ms_s₊_k₂−n

, k= 0,1, . . . , n2−n .

It is wellknown that for this problem there always exists an optimal stopping time which has the following form:

σn= min

s+k2−n_:_γn

k =g(s+k2−n, Xs+k2−n

,

where the sequenceγn

k is constructed recursively:

γkn= max

g(s+k2−n

, Xs+k2−n), M_s,x(γ_kn₊₁|M_s₊_k₂−n).

It easily follows from these recursion relations thatγn

k is a Borel function ofXs+k2−n. Thereforeσ_n has the following form:

σn= min

s+k2−n_:_X

s+k2−n∈B_kn

,

where the setsBn

k belong to theσ-algebra B(of course,Bnn2−n =S).

Thus we get

v(s, x) = lim

n→∞↑Es,xg

σn, Xσn

.

Define now the corresponding (N0

t,t≥0)-stopping times

σn′ = min

k2−n _:

Xk′2−n∈[0,∞)×B_kn

.

We have

Es,x′ g

Xσ′′

n

=Es,xg

Xσ′′

n(s,ω)(s, ω)

=

=Es,xg

s+σ_n′(s, ω), Xs+σ′

n(s,ω)(ω)

=Es,xg

(10)

ass+σ′

n(s, ω) =σn(ω). Therefore

Es,xg

σn, Xσn

=E′

s,xg

X′

σ′

n

≤v′_{(s, x).}

Thusv(s, x)≤v′(s, x), and finally v(s, x) =v′(s, x).

Our next purpose is the excessive characterization of the costv(s, x). Let us note (as can be easily seen) that our definition of the excessive function (relative toX) coincides exactly with the usual definition of the excessive function (relative toX′). Therefore we can directly use Theorem 1, Ch. III in [1] and get the following result.

Theorem 2. _{Suppose that conditions}₍₁₎_and₍₆₎_{are satisfied. Then the}

value v(s, x) is the least excessive majorant of the function g(s, x). The value v(s, x) is the Borel measurable function, which can be found by the following limit procedure:

v(s, x) = lim

n→∞Nlim→∞Q

N

ng(s, x), (12)

where

Qng(s, x) = max

g(s, x), Es,xg(s+ 2−n, Xs+2−n)

andQN

n is theNth power of the operatorQn.

Proof. The assertion is the consequence of the coincidence of the values v(s, x) andv′_{(s, x) and of Lemma 3, Ch. III in [3] which states that}

v′(x′_{) = lim}

n→∞Nlim→∞Q

N ng(x′),

where

Qng(x′) = max

g(x′_{), E}′

x′g(X₂′−n)

.

Note also that Ex′′g(X₂′−n) is B′-measurable in x′, hence the functions

Qng(x′),QNng(x′) and the functionv′(x′), being the limit of these functions, are alsoB′_-measurable.

Thus the valuev′_(x′_{) is the Borel measurable excessive function (relative}

toX′_{) which obviously satisfies the condition}

E′

x′sup t≥0

v−_X′

t

<∞, x′_∈_S′_.

Then it is well-known (Theorem 2.12, Ch. II in [3]) that the pathsv(X′

t(ω′)) are right continuous with the left-hand limits on [0,∞) (P′

x′-a.s.). Using again Lemma 2 we obtain

Ps,x

ω: the path vt, Xt(ω)

is right

continuous on [s,∞)= 1, s≥0, x∈S. (13)

(11)

Theorem 3. _{Let the payoff} _{g(t, x)} _{satisfy (relative to} X) the following conditions:

(1) Ms,xsupt≥s

g(t, Xt)

<∞, s≥0, x∈S;

(2) Ps,x

ω: limt↓sg(t, Xt(ω)) =g(s, x) = 1

, s≥0, x∈S. Then

(i)for everyε >0 the stopping times

τε= inf

t≥s:v(t, Xt)≤g(t, Xt) +ε

(14)

areε-optimal;

(ii)if the functiong(t, x) is upper semi-continuous, that is,

g(s, x)≥ lim t→s y→x

g(t, y)

and the stopping time

τ0(ω) = inft≥s:v(t, Xt) =g(t, Xt)

(15)

is finite (Ps,x-a.s.), thenτ0(ω)is the optimal stopping time.

Proof. From Theorem 3, Ch. III in [1] we know that for every ε > 0 the stopping time

τ′

ε= inf

t:v(X′

t)≤g(Xt′) +ε

isε-optimal:

Ex′′g

Xτ′′

ε

≥v(x′₎₋_ε, _x′_∈_S′_,

that is,

Es,xg(s+τve′ (s, ω), Xs+τ′

ε(s,ω)(ω)

≥v(s, x)−ε.

But it is obvious thats+τ′

ε(s, ω) =τε(ω), hence

Es,xg

τε, Xτ ε

≥v(s, x)−ε.

Assume now the upper semi-continuity of the functiong(x′_{). Then from}

the same theorem we get again that the stopping time

τ0′ = inf{t≥0 :v(Xt′) =g(x′t)

is optimal:

E_x′′g

X_τ′′

0

=v(x′).

(12)

References

1. A. Shiryayev, Optimal stopping rules. Springer-Verlag, Berlin,1978. 2. V. M. Dochviri and M. A. Shashiashvili, On optimal stopping of ho-mogeneous Markov process on finite time interval. (Russian)Math. Nachr.

156_{(1992), 269–281.}

3. R. M. Blumental and R. K. Getoor, Markov processes and potential theory. Academic Press, New York,1968.

4. C. Dellacherie and P. Meyer, Probabilities and potential.

North-Holland Math. Studies,29, Amsterdam,1978.

(Received 12.11.1993)

Author’s address:

Laboratory of Stochastic Analysis and Statistical Decisions I. Javakhishvili Tbilisi State University