A.2 Proof of Theorem 1

We first introduce Lemmas 2–5, which supply the majorization and real induction tools for proving Theorem 1.

A function $f$ majorizes $g$, written $f \succ g$, if and only if for any Borel measurable set $B \in \mathcal{B}_{\mathbb{R}}$ with finite Lebesgue measure, there exists a Borel measurable set $A \in \mathcal{B}_{\mathbb{R}}$ with the same Lebesgue measure such that [2]

$$\int_B g(x)\,dx \le \int_A f(x)\,dx. \tag{A.14}$$
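The definition (A.14) can be checked numerically on a grid. The sketch below is only an illustration under stated assumptions: the two Gaussian densities, the grid bounds, and the helper names `top_mass_curve` and `majorizes` are hypothetical and do not appear in the source. It relies on the observation that, among unions of $k$ grid cells, the largest mass is obtained by taking the $k$ cells with the highest density, so (A.14) reduces to comparing two sorted cumulative sums.

```python
import math

def gaussian(mu, sigma):
    """Return the pdf of a normal distribution as a plain function."""
    norm = sigma * math.sqrt(2.0 * math.pi)
    return lambda x: math.exp(-0.5 * ((x - mu) / sigma) ** 2) / norm

def top_mass_curve(pdf, lo, hi, n):
    """Largest mass that a union of k grid cells can capture, for k = 1..n."""
    dx = (hi - lo) / n
    # Midpoint samples, sorted so that the densest cells come first.
    vals = sorted((pdf(lo + (k + 0.5) * dx) for k in range(n)), reverse=True)
    curve, total = [], 0.0
    for v in vals:
        total += v * dx
        curve.append(total)
    return curve

def majorizes(f, g, lo=-20.0, hi=20.0, n=4000, tol=1e-9):
    """Grid version of (A.14): for every measure k*dx, the best set for f
    must capture at least as much mass as the best set for g."""
    cf = top_mass_curve(f, lo, hi, n)
    cg = top_mass_curve(g, lo, hi, n)
    return all(a >= b - tol for a, b in zip(cf, cg))

# Assumed example densities (not from the source).
narrow = gaussian(0.0, 1.0)
wide = gaussian(3.0, 2.0)  # shifted on purpose: (A.14) does not depend on location

print(majorizes(narrow, wide))
print(majorizes(wide, narrow))
```

Because the sets $A$ and $B$ in (A.14) may be placed anywhere, shifting a density does not affect the comparison; only its concentration matters.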

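A function $f\colon \mathbb{R} \to \mathbb{R}$ is even if $f(x) = f(-x)$ for all $x \in \mathbb{R}$.

A function $f\colon \mathbb{R} \to \mathbb{R}$ is quasi-concave if for all $x, y \in \mathbb{R}$ and $0 \le \lambda \le 1$,

$$f(\lambda x + (1-\lambda) y) \ge \min\{f(x), f(y)\}. \tag{A.15}$$

We denote by $\mathbf{1}_{(a,b)}(x)$ an indicator function that is equal to 1 if and only if $x \in (a, b)$. Lemmas 2–4, stated next, show several majorization properties of pdfs.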

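Lemma 2. ([2, Lemma 2]) Fix two pdfs $f_X$ and $g_X$ such that $f_X$ is even and quasi-concave and $f_X \succ g_X$. Fix a scalar $c > 0$ and a function $h\colon \mathbb{R} \to [0,1]$ such that

$$\int_{\mathbb{R}} f_X(x)\,\mathbf{1}_{(-c,c)}(x)\,dx = \int_{\mathbb{R}} g_X(x)\,h(x)\,dx. \tag{A.16}$$

Then,

$$f_{X|X\in(-c,c)} \succ g'_X, \tag{A.17}$$

where the pdfs $f_{X|X\in(-c,c)}$ and $g'_X$ are given by

$$f_{X|X\in(-c,c)}(x) = \frac{f_X(x)\,\mathbf{1}_{(-c,c)}(x)}{\int_{\mathbb{R}} f_X(x)\,\mathbf{1}_{(-c,c)}(x)\,dx}, \qquad g'_X(x) = \frac{g_X(x)\,h(x)}{\int_{\mathbb{R}} g_X(x)\,h(x)\,dx}. \tag{A.18}$$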

Lemma 3. ([108, Lemma 6.7]) Fix two pdfs $f_X$ and $g_X$ such that $f_X$ is even and quasi-concave and $f_X$ majorizes $g_X$, $f_X \succ g_X$. Fix an even and quasi-concave pdf $r_Y$. Then, the convolution of $f_X$ and $r_Y$ majorizes the convolution of $g_X$ and $r_Y$,

$$f_X * r_Y \succ g_X * r_Y. \tag{A.19}$$

Furthermore, $f_X * r_Y$ is even and quasi-concave.
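The last claim of Lemma 3 can be sanity-checked on a grid. The following is a minimal sketch, not part of the source: the Laplace and uniform densities, the grid, and the helper names are assumptions chosen for illustration. On the real line, quasi-concavity in the sense of (A.15) amounts to the function rising to a peak and then falling, which is what `is_quasi_concave` tests.

```python
import math

def symmetric_grid(half_width, n):
    """n (odd) points (k - m) * dx, so that x and -x are both on the grid exactly."""
    m = (n - 1) // 2
    dx = half_width / m
    return [(k - m) * dx for k in range(n)], dx

def convolve(p, q, dx):
    """Riemann-sum approximation of the convolution of two sampled densities."""
    out = [0.0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b * dx
    return out

def is_even(v, tol=1e-12):
    """Samples on a symmetric grid are even if they read the same reversed."""
    return all(abs(a - b) <= tol for a, b in zip(v, reversed(v)))

def is_quasi_concave(v, tol=1e-12):
    """Non-decreasing up to the first maximum, non-increasing afterwards."""
    peak = v.index(max(v))
    rising = all(v[k + 1] >= v[k] - tol for k in range(peak))
    falling = all(v[k + 1] <= v[k] + tol for k in range(peak, len(v) - 1))
    return rising and falling

xs, dx = symmetric_grid(8.0, 321)
f_x = [0.5 * math.exp(-abs(x)) for x in xs]        # assumed f_X: Laplace pdf
r_y = [0.5 if abs(x) <= 1.0 else 0.0 for x in xs]  # assumed r_Y: uniform on [-1, 1]
conv = convolve(f_x, r_y, dx)

# A two-bump function, used only as a negative control for the check.
bimodal = [math.exp(-(x - 2.0) ** 2) + math.exp(-(x + 2.0) ** 2) for x in xs]

print(is_even(conv), is_quasi_concave(conv), is_quasi_concave(bimodal))
```

The grid is built from integer offsets so that the samples are exactly symmetric; otherwise rounding at the jump of the uniform density could break evenness artificially.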

Lemma 4. ([2, Lemma 4]) Fix two pdfs $f_X$ and $g_X$ such that $f_X$ is even and quasi-concave and $f_X$ majorizes $g_X$, $f_X \succ g_X$. Then,

$$\int_{\mathbb{R}} x^2 f_X(x)\,dx \le \int_{\mathbb{R}} (x-y)^2 g_X(x)\,dx, \quad \forall y \in \mathbb{R}. \tag{A.20}$$
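As a quick numerical illustration of (A.20), the sketch below compares the two sides for a pair of Gaussian densities. The densities, the integration range, and the grid of candidate offsets $y$ are assumptions made for this example only; the majorization $f_X \succ g_X$ is taken for granted here because $g_X$ is a wider Gaussian.

```python
import math

def gaussian(mu, sigma):
    """Return the pdf of a normal distribution as a plain function."""
    norm = sigma * math.sqrt(2.0 * math.pi)
    return lambda x: math.exp(-0.5 * ((x - mu) / sigma) ** 2) / norm

def integrate(func, lo=-40.0, hi=40.0, n=8000):
    """Midpoint-rule approximation of the integral of func over [lo, hi]."""
    dx = (hi - lo) / n
    return sum(func(lo + (k + 0.5) * dx) for k in range(n)) * dx

f_x = gaussian(0.0, 1.0)  # even and quasi-concave
g_x = gaussian(1.5, 2.0)  # assumed to be majorized by f_x (wider, shifted)

# Left side of (A.20): second moment of f_x about zero.
lhs = integrate(lambda x: x * x * f_x(x))

# Right side of (A.20), minimized over a grid of offsets y in [-5, 5].
offsets = [0.25 * k for k in range(-20, 21)]
rhs_min = min(integrate(lambda x, y=y: (x - y) ** 2 * g_x(x)) for y in offsets)

print(lhs, rhs_min)
```

For these densities the right side equals $4 + (1.5 - y)^2$, so even the best offset $y = 1.5$ cannot bring it below the left side.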

Lemma 5, stated next, provides a mathematical proof technique called real induction. We will use it to prove that the assertions in Lemma 6, stated below, hold on a continuous interval.

Lemma 5. (Real induction [109, Thm. 2]) A subset $S \subset [a, b]$, $a < b$, is called inductive if

1) $a \in S$;

2) if $a \le x < b$ and $x \in S$, then there exists $y > x$ such that $[x, y] \subset S$;

3) if $a < x \le b$ and $[a, x) \subset S$, then $x \in S$.

If a subset $S \subset [a, b]$ is inductive, then $S = [a, b]$.

A technical lemma

We define the following notation for two sampling-decision processes $\{P_t\}_{t=0}^T$ and $\{P_t^{\mathrm{sym}}\}_{t=0}^T$ (see Appendix A.1). Fix an arbitrary sampling-decision process $\{P_t\}_{t=0}^T$ (A.1) satisfying (S.1)–(S.2). It gives rise to a sampling policy with stopping times $\tau_1, \tau_2, \ldots$ via (A.1). We recall the definition of the mean-square residual error (MSRE) process $\{\tilde{X}_t\}_{t=0}^T$ in (P.3) and denote the MSRE process under $\{P_t\}_{t=0}^T$ as

\begin{align}
\tilde{X}_t &= \tilde{X}_t(\{P_s\}_{s=0}^T) \tag{A.21a}\\
&\triangleq X_t - E[X_t \mid X_{\tau_i}, \tau_i], \quad t \in [\tau_i, \tau_{i+1}). \tag{A.21b}
\end{align}

We define the residual error estimate (REE) process $\{\bar{\tilde{X}}_t\}_{t=0}^T$ under $\{P_t\}_{t=0}^T$ as

\begin{align}
\bar{\tilde{X}}_t &= \bar{\tilde{X}}_t(\{P_s\}_{s=0}^T) \tag{A.22a}\\
&\triangleq \bar{X}_t - E[X_t \mid X_{\tau_i}, \tau_i] \tag{A.22b}\\
&= E[\tilde{X}_t \mid \{X_{\tau_j}\}_{j=1}^{i}, \tau^i, t < \tau_{i+1}] \tag{A.22c}\\
&= E[\tilde{X}_t \mid \tau_i, t < \tau_{i+1}], \quad t \in [\tau_i, \tau_{i+1}), \tag{A.22d}
\end{align}

where $\bar{X}_t = \bar{X}_t(\{P_s\}_{s=0}^T)$ is the MMSE decoding policy defined in (2.2); the equality in (A.22c) holds since $E[X_t \mid X_{\tau_i}, \tau_i] \in \sigma(\{X_{\tau_j}\}_{j=1}^{i}, \tau^i, t < \tau_{i+1})$; (A.22d) holds because $\tilde{X}_t$ is independent of $\{X_{\tau_j}\}_{j=1}^{i}, \tau^i$ due to (P.3-a), and the event $\{t < \tau_{i+1}\}$ is independent of $\{X_{\tau_j}\}_{j=1}^{i}, \tau^{i-1}$ given $\tau_i$ due to (S.2). We recall that $N(\{P_t\}_{t=0}^T)$, defined above Proposition 5 in Appendix A.1, represents the number of stopping times in $[0, T]$, and we simplify this notation as

$$N \triangleq N(\{P_t\}_{t=0}^T). \tag{A.23}$$

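We denote the left-closed continuous interval

$$\Omega_{\tau_{i+1}}(s) \triangleq \{t \in [s, T] : P[\tau_{i+1} > t \mid \tau_i = s] > 0\} \tag{A.24}$$

for all $s \in \mathrm{Supp}(f_{\tau_i})$, and the left-open continuous interval

$$\bar{\Omega}_{\tau_{i+1}}(s) \triangleq \Omega_{\tau_{i+1}}(s) \setminus \{s\}. \tag{A.25}$$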

Given $\{P_t\}_{t=0}^T$, we construct a sampling-decision process $\{P_t^{\mathrm{sym}}\}_{t=0}^T$ (A.1) of the form (2.9), which via (A.1) is associated with a sampling policy with stopping times $\tau'_1, \tau'_2, \ldots$, such that the symmetric thresholds $\{a_i(r, s)\}_{r=s}^T$ of $\{P_t^{\mathrm{sym}}\}_{t=0}^T$ satisfy, for all $s \in \mathrm{Supp}(f_{\tau_i})$, $t \in [s, T]$,

$$P[\tilde{X}'_r \in (-a_i(r, s), a_i(r, s)),\ \forall r \in [s, t] \mid \tau'_i = s] = P[\tau_{i+1} > t \mid \tau_i = s]. \tag{A.26}$$

This is possible since, by adjusting the thresholds, the left side of (A.26) can be made equal to any non-increasing function of $t$ taking values in $[0, 1]$. Under $\{P_t^{\mathrm{sym}}\}_{t=0}^T$ satisfying (A.26), for all $s \in \mathrm{Supp}(f_{\tau_i})$, $i = 1, 2, \ldots$, it holds that

\begin{align}
\Omega_{\tau_i}(s) &= \Omega_{\tau'_i}(s), \tag{A.27}\\
\bar{\Omega}_{\tau_i}(s) &= \bar{\Omega}_{\tau'_i}(s). \tag{A.28}
\end{align}

We denote the MSRE process, the REE process, and the number of stopping times on $[0, T]$ under $\{P_t^{\mathrm{sym}}\}_{t=0}^T$ respectively by

\begin{align}
\tilde{X}'_t &= \tilde{X}_t(\{P_s^{\mathrm{sym}}\}_{s=0}^T), \tag{A.29}\\
\bar{\tilde{X}}'_t &= \bar{\tilde{X}}_t(\{P_s^{\mathrm{sym}}\}_{s=0}^T) = 0, \tag{A.30}\\
N' &= N(\{P_s^{\mathrm{sym}}\}_{s=0}^T), \tag{A.31}
\end{align}

where (A.30) holds since we can write $\bar{\tilde{X}}'_t$ as in (A.22d) with $\tau_i$ replaced by $\tau'_i$, using the argument that justifies (A.22d); $\tilde{X}'_t$ has an even and quasi-concave pdf due to assumption (P.3-b), and the pdf of $\tilde{X}'_t$ conditioned on $\tau'_i$, $t < \tau'_{i+1}$ under a symmetric threshold sampling-decision process of the form (2.9) is still even and quasi-concave, so its conditional mean is zero.
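The zero-mean property behind (A.30) can be illustrated by simulation. The sketch below is an assumption-laden toy, not the construction used in the source: it takes the residual error to be a discretized Wiener process started at zero, uses a constant symmetric threshold, and the function name `surviving_residuals` and all parameter values are hypothetical. It estimates the mean of the residual error among the paths that have not yet crossed the threshold.

```python
import math
import random

def surviving_residuals(threshold, horizon=1.0, steps=100, paths=5000, seed=0):
    """Simulate discretized Wiener paths from 0 and return the terminal values
    of the paths that stayed inside (-threshold, threshold) at every step."""
    rng = random.Random(seed)
    step_sd = math.sqrt(horizon / steps)
    survivors = []
    for _ in range(paths):
        x = 0.0
        alive = True
        for _ in range(steps):
            x += rng.gauss(0.0, step_sd)
            if abs(x) >= threshold:
                alive = False  # a sample would be taken here; drop the path
                break
        if alive:
            survivors.append(x)
    return survivors

res = surviving_residuals(threshold=1.0)
mean = sum(res) / len(res)
print(len(res), mean)
```

With a symmetric threshold, the surviving paths are distributed symmetrically about zero, so the empirical mean is close to zero up to Monte Carlo error. An asymmetric threshold pair would in general shift this mean away from zero.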

We denote the following probabilities

\begin{align}
Q_i(a, b, c, d) &\triangleq P[\tau_{i+1} > a \mid \tau_{i+1} > b, \tau_i = c, \tilde{X}_a = d], \tag{A.32a}\\
Q'_i(a, b, c, d) &\triangleq P[\tau'_{i+1} > a \mid \tau'_{i+1} > b, \tau'_i = c, \tilde{X}'_a = d]. \tag{A.32b}
\end{align}

We proceed to introduce Lemma 6 using the notation defined in (A.21)–(A.32). We will use the assertions in Lemma 6 to compare the MSEs achieved by $\{P_t\}_{t=0}^T$ and $\{P_t^{\mathrm{sym}}\}_{t=0}^T$.

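Lemma 6. The pdfs $f_{\tilde{X}_t \mid \tau_i = s, \tau_{i+1} > t}$ and $f_{\tilde{X}'_t \mid \tau'_i = s, \tau'_{i+1} > t}$ exist for all $s \in \mathrm{Supp}(f_{\tau_i})$, $t \in \bar{\Omega}_{\tau_{i+1}}(s)$. Furthermore, for all $s \in \mathrm{Supp}(f_{\tau_i})$, $t \in \bar{\Omega}_{\tau_{i+1}}(s)$, it holds that

\begin{align}
& f_{\tilde{X}'_t \mid \tau'_i = s, \tau'_{i+1} > t} \succ f_{\tilde{X}_t \mid \tau_i = s, \tau_{i+1} > t}, \tag{A.33}\\
& f_{\tilde{X}'_t \mid \tau'_i = s, \tau'_{i+1} > t} \text{ is even and quasi-concave.} \tag{A.34}
\end{align}

Proof of Lemma 6. We show that the pdf $f_{\tilde{X}_t \mid \tau_i = s, \tau_{i+1} > t}$ exists; the proof that $f_{\tilde{X}'_t \mid \tau'_i = s, \tau'_{i+1} > t}$ exists is similar. Since $\tilde{X}_t$ at $t \ge \tau_i = s$ is independent of $\mathcal{F}_s$ by (P.3-a) and is equal to $R_t(s, s)$ by (P.3-b), we compute $f_{\tilde{X}_t \mid \tau_i = s, \tau_{i+1} > s}$ using (2.5),

$$f_{\tilde{X}_t \mid \tau_i = s, \tau_{i+1} > s} = f_{R_t(s, s)}. \tag{A.35}$$

Thus, $f_{\tilde{X}_t \mid \tau_i = s, \tau_{i+1} > s}$ exists since $f_{R_t(s, s)}$ is a valid pdf by (P.3-b). To establish that $f_{\tilde{X}_t \mid \tau_i = s, \tau_{i+1} > t}(y)$ exists, we compute

\begin{align}
f_{\tilde{X}_t \mid \tau_i = s, \tau_{i+1} > t}(y) &= f_{\tilde{X}_t \mid \tau_i = s, \tau_{i+1} > s, \tau_{i+1} > t}(y) \tag{A.36a}\\
&= \frac{Q_i(t, s, s, y)\, f_{\tilde{X}_t \mid \tau_i = s, \tau_{i+1} > s}(y)}{P[\tau_{i+1} > t \mid \tau_i = s, \tau_{i+1} > s]}, \tag{A.36b}
\end{align}

where (A.36a) holds since $\tau_{i+1} > t$ implies $\tau_{i+1} > s$. In (A.36b), we observe that for all $t \in \bar{\Omega}_{\tau_{i+1}}(s)$, the pdf $f_{\tilde{X}_t \mid \tau_i = s, \tau_{i+1} > s}$ exists by (A.35), and the denominator of (A.36b) is nonzero by the definition (A.24). We conclude that the pdf $f_{\tilde{X}_t \mid \tau_i = s, \tau_{i+1} > t}$ exists for all $s \in \mathrm{Supp}(f_{\tau_i})$, $t \in \bar{\Omega}_{\tau_{i+1}}(s)$.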
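The step from (A.36a) to (A.36b) is an application of Bayes' rule to a density conditioned on events. Written generically, with $X$, $A$ and $B$ as placeholder symbols that do not appear in the source, it reads:

```latex
% Bayes' rule for a conditional density, valid whenever P[B | A] > 0.
\[
  f_{X \mid A, B}(y)
  = \frac{P[B \mid A,\, X = y]\; f_{X \mid A}(y)}{P[B \mid A]} .
\]
% Substitution that recovers (A.36b):
%   X = \tilde{X}_t,
%   A = \{\tau_i = s,\ \tau_{i+1} > s\},
%   B = \{\tau_{i+1} > t\},
% so that P[B | A, X = y] = Q_i(t, s, s, y) by (A.32a).
```

The same identity, with the normalizing probability written as an integral of the numerator over $y$, is the form in which the conditional density of the residual error is updated as the conditioning event shrinks.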

The assertion (A.33) holds if and only if

(a) for all $s \in \mathrm{Supp}(f_{\tau_i})$, $t \in \bar{\Omega}_{\tau_{i+1}}(s)$ and for any Borel measurable set $B \in \mathcal{B}_{\mathbb{R}}$ with finite Lebesgue measure, there exists a Borel measurable set $A \in \mathcal{B}_{\mathbb{R}}$ with the same Lebesgue measure such that

$$P[\tilde{X}'_t \in A \mid \tau'_i = s, \tau'_{i+1} > t] \ge P[\tilde{X}_t \in B \mid \tau_i = s, \tau_{i+1} > t] \tag{A.37}$$

holds. This is because (A.37) is a rewrite of (A.33) using the definition of majorization (A.14).

The assertion (A.34) holds if and only if for all $s \in \mathrm{Supp}(f_{\tau_i})$, $t \in \bar{\Omega}_{\tau_{i+1}}(s)$, both of the following hold:

(b) the conditional cdf $P[\tilde{X}'_t \le y \mid \tau'_i = s, \tau'_{i+1} > t]$ is convex for $y < 0$ and is concave for $y > 0$;

(c) for all $y > 0$,

$$P[\tilde{X}'_t \in (0, y] \mid \tau'_i = s, \tau'_{i+1} > t] = P[\tilde{X}'_t \in [-y, 0) \mid \tau'_i = s, \tau'_{i+1} > t]. \tag{A.38}$$

This is because $f_{\tilde{X}'_t \mid \tau'_i = s, \tau'_{i+1} > t}$ is quasi-concave if and only if (b) holds, and $f_{\tilde{X}'_t \mid \tau'_i = s, \tau'_{i+1} > t}$ is even if and only if (c) holds.

Items (a)–(c) facilitate proving that the assertions (A.33)–(A.34) hold on the left-open interval $\bar{\Omega}_{\tau_{i+1}}(s)$. Real induction, which must be used on a left-closed interval, does not apply to (A.33)–(A.34) directly, since the densities in (A.33)–(A.34) do not exist at $t = s$. Instead, we apply real induction to show (a)–(c). Using real induction in Lemma 5, we verify one by one that conditions 1), 3), 2) in Lemma 5 hold for (a)–(c) on $t \in \Omega_{\tau_{i+1}}(s)$.

To verify that condition 1) in Lemma 5 holds, we need to show that (a)–(c) hold for $t = s$. This is immediate since

$$P[\tilde{X}'_s = 0 \mid \tau'_i = s, \tau'_{i+1} > s] = P[\tilde{X}_s = 0 \mid \tau_i = s, \tau_{i+1} > s] = 1. \tag{A.39}$$

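Next, we show that condition 3) in Lemma 5 holds, that is, assuming that (a)–(c) hold for all $t \in [s, r)$, $r \in \bar{\Omega}_{\tau_{i+1}}(s)$, we prove that (a)–(c) hold for $t = r$. Equivalently, we show that (A.33)–(A.34) hold for $t = r$. Let $\delta \in (0, r - s]$. At time $t = r$, we calculate the left side of (A.33) as

\begin{align}
f_{\tilde{X}'_r \mid \tau'_i = s, \tau'_{i+1} > r}(y)
&= \lim_{\delta \to 0^+} f_{\tilde{X}'_r \mid \tau'_i = s, \tau'_{i+1} > r - \delta, \tau'_{i+1} > r}(y) \tag{A.40a}\\
&= \lim_{\delta \to 0^+} \frac{Q'_i(r, r - \delta, s, y)\, f_{\tilde{X}'_r \mid \tau'_i = s, \tau'_{i+1} > r - \delta}(y)}{\int_{\mathbb{R}} Q'_i(r, r - \delta, s, y)\, f_{\tilde{X}'_r \mid \tau'_i = s, \tau'_{i+1} > r - \delta}(y)\,dy} \tag{A.40b}\\
&= \lim_{\delta \to 0^+} \frac{\mathbf{1}_{(-a_i(r, s), a_i(r, s))}(y)\, f_{\tilde{X}'_r \mid \tau'_i = s, \tau'_{i+1} > r - \delta}(y)}{\int_{\mathbb{R}} \mathbf{1}_{(-a_i(r, s), a_i(r, s))}(y)\, f_{\tilde{X}'_r \mid \tau'_i = s, \tau'_{i+1} > r - \delta}(y)\,dy}, \tag{A.40c}
\end{align}

where (A.40a) holds since the event $\tau'_{i+1} > r$ implies the event $\tau'_{i+1} > r - \delta$; the pdf $f_{\tilde{X}'_r \mid \tau'_i = s, \tau'_{i+1} > r - \delta}$ in (A.40b) exists since (A.36) holds with $\tilde{X}_t$, $\tau_i = s$, $\tau_{i+1} > s$, $\tau_{i+1} > t$ replaced by $\tilde{X}'_r$, $\tau'_i = s$, $\tau'_{i+1} > s$, $\tau'_{i+1} > r - \delta$, respectively; (A.40c) holds since

$$\lim_{\delta \to 0^+} Q'_i(r, r - \delta, s, y) = \mathbf{1}_{(-a_i(r, s), a_i(r, s))}(y). \tag{A.41}$$

Similarly, replacing $Q'_i$ in (A.40b) by $Q_i$, we calculate the right side of (A.33) as

$$f_{\tilde{X}_r \mid \tau_i = s, \tau_{i+1} > r}(y) = \lim_{\delta \to 0^+} \frac{Q_i(r, r - \delta, s, y)\, f_{\tilde{X}_r \mid \tau_i = s, \tau_{i+1} > r - \delta}(y)}{\int_{\mathbb{R}} Q_i(r, r - \delta, s, y)\, f_{\tilde{X}_r \mid \tau_i = s, \tau_{i+1} > r - \delta}(y)\,dy}, \tag{A.42}$$

where the pdf $f_{\tilde{X}_r \mid \tau_i = s, \tau_{i+1} > r - \delta}(y)$ exists since (A.36) holds with $\tilde{X}_t$, $\tau_{i+1} > t$ replaced by $\tilde{X}_r$, $\tau_{i+1} > r - \delta$, respectively.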

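To check that (A.33) holds at $t = r$, we first prove that $f_{\tilde{X}'_r \mid \tau'_i = s, \tau'_{i+1} > r - \delta}$ majorizes $f_{\tilde{X}_r \mid \tau_i = s, \tau_{i+1} > r - \delta}$. Note that $R_r(r - \delta, s)$ is independent of $\{\tilde{X}_t\}_{t=0}^{r - \delta}$ due to (P.3-a), and thus is independent of the event $\{\tau'_{i+1} > r - \delta, \tau'_i = s\}$. We obtain the pdf of $\tilde{X}'_r$ using (2.5),

$$f_{\tilde{X}'_r \mid \tau'_i = s, \tau'_{i+1} > r - \delta} = f_{q_r(r - \delta)\tilde{X}'_{r - \delta} \mid \tau'_i = s, \tau'_{i+1} > r - \delta} * f_{R_r(r - \delta, s)}. \tag{A.43}$$

By (A.43) and the inductive hypothesis that (a)–(c) hold for $t \in [s, r)$, the assumptions in Lemma 3 are satisfied with $f_X \leftarrow f_{q_r(r - \delta)\tilde{X}'_{r - \delta} \mid \tau'_i = s, \tau'_{i+1} > r - \delta}$, $g_X \leftarrow f_{q_r(r - \delta)\tilde{X}_{r - \delta} \mid \tau_i = s, \tau_{i+1} > r - \delta}$, $r_Y \leftarrow f_{R_r(r - \delta, s)}$. We conclude that

\begin{align}
& f_{\tilde{X}'_r \mid \tau'_i = s, \tau'_{i+1} > r - \delta} \succ f_{\tilde{X}_r \mid \tau_i = s, \tau_{i+1} > r - \delta}, \tag{A.44}\\
& f_{\tilde{X}'_r \mid \tau'_i = s, \tau'_{i+1} > r - \delta} \text{ is even and quasi-concave.} \tag{A.45}
\end{align}

Due to (A.45) and the fact that the indicator function in (A.40c) is over an interval symmetric about zero, we conclude that (A.34) holds for $t = r$. By (A.26), (A.44) and (A.45), the assumptions in Lemma 2 are satisfied with $f_X \leftarrow f_{\tilde{X}'_r \mid \tau'_i = s, \tau'_{i+1} > r - \delta}$, $g_X \leftarrow f_{\tilde{X}_r \mid \tau_i = s, \tau_{i+1} > r - \delta}$, $f_{X \mid X \in (-c, c)} \leftarrow f_{\tilde{X}'_r \mid \tau'_i = s, \tau'_{i+1} > r}$, $g'_X \leftarrow f_{\tilde{X}_r \mid \tau_i = s, \tau_{i+1} > r}$, $c \leftarrow a_i(r, s)$, $h \leftarrow Q_i(r, r - \delta, s, y)$. Thus, we conclude that (A.33) holds for $t = r$. Therefore, (A.33)–(A.34) hold for $t = r$, i.e., (a)–(c) hold for $t = r$.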
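One step of the argument above, a convolution as in (A.43) followed by a symmetric truncation and renormalization as in (A.40c), can be sketched on a grid. Everything concrete below is an assumption made for illustration: the scaling coefficient $q_r(r - \delta)$ is taken to be 1, both densities are Gaussian, the threshold is fixed at 1, and the helper names are hypothetical.

```python
import math

def symmetric_grid(half_width, n):
    """n (odd) points (k - m) * dx, so that x and -x are both on the grid exactly."""
    m = (n - 1) // 2
    dx = half_width / m
    return [(k - m) * dx for k in range(n)], dx

def normalize(v, dx):
    """Rescale samples so that their Riemann sum equals one."""
    total = sum(v) * dx
    return [a / total for a in v]

def convolve_same(p, q, dx):
    """Convolution of two densities sampled on the same symmetric grid,
    evaluated back on that grid (grid points satisfy x_i + x_j = x_k)."""
    n = len(p)
    m = (n - 1) // 2
    out = []
    for k in range(n):
        acc = 0.0
        for i in range(n):
            j = k + m - i
            if 0 <= j < n:
                acc += p[i] * q[j]
        out.append(acc * dx)
    return out

def truncate(v, xs, a, dx):
    """Multiply by the indicator of (-a, a) and renormalize, as in (A.40c)."""
    return normalize([val if abs(x) < a else 0.0 for val, x in zip(v, xs)], dx)

xs, dx = symmetric_grid(6.0, 241)
prev = normalize([math.exp(-0.5 * (x / 0.5) ** 2) for x in xs], dx)   # assumed density at r - delta
noise = normalize([math.exp(-0.5 * (x / 0.3) ** 2) for x in xs], dx)  # assumed innovation density
nxt = truncate(convolve_same(prev, noise, dx), xs, 1.0, dx)

print(sum(nxt) * dx)
```

Since both inputs are even and the truncation interval is symmetric about zero, the output stays even; this mirrors how (A.34) is carried from $r - \delta$ to $r$.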

To prove that condition 2) in Lemma 5 holds, we assume that (a)–(c) hold for $t = r$, and prove that the following holds:

\begin{align}
& \lim_{\delta \to 0^+} f_{\tilde{X}'_{r + \delta} \mid \tau'_i = s, \tau'_{i+1} > r + \delta} \succ \lim_{\delta \to 0^+} f_{\tilde{X}_{r + \delta} \mid \tau_i = s, \tau_{i+1} > r + \delta}, \tag{A.46a}\\
& \lim_{\delta \to 0^+} f_{\tilde{X}'_{r + \delta} \mid \tau'_i = s, \tau'_{i+1} > r + \delta} \text{ is even and quasi-concave.} \tag{A.46b}
\end{align}

The left and the right sides of (A.46a) are equal to (A.40c) and (A.42) respectively, with $r$ replaced by $r + \delta$. Moreover, (A.43)–(A.45) and the assumptions in Lemma 2 hold with $r$ replaced by $r + \delta$. Thus, we conclude that (A.46) holds.

Using real induction in Lemma 5, we have shown that (a)–(c) hold for all $s \in \mathrm{Supp}(f_{\tau_i})$, $t \in \Omega_{\tau_{i+1}}(s)$. Thus, (A.33)–(A.34) hold for all $s \in \mathrm{Supp}(f_{\tau_i})$, $t \in \bar{\Omega}_{\tau_{i+1}}(s)$.

Proof of Theorem 1

The sampling-decision process $\{P_t^{\mathrm{sym}}\}_{t=0}^T$ leads to the same average sampling frequency as $\{P_t\}_{t=0}^T$. This is because (A.26) implies that for all $s \in \mathrm{Supp}(f_{\tau_i})$, $t \in [s, T]$,

$$P[\tau_{i+1} > t \mid \tau_i = s] = P[\tau'_{i+1} > t \mid \tau'_i = s]. \tag{A.47}$$

Together with the Markov property of the stopping times (assumption (S.2)), (A.47) implies that the joint distribution of $\tau_1, \tau_2, \ldots$ is equal to the joint distribution of $\tau'_1, \tau'_2, \ldots$. We conclude that $\{P_t\}_{t=0}^T$ and $\{P_t^{\mathrm{sym}}\}_{t=0}^T$ lead to the same average sampling frequency,

$$E[N] = E[N']. \tag{A.48}$$

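Next, we show that $\{P_t^{\mathrm{sym}}\}_{t=0}^T$ achieves an MSE no larger than that achieved by $\{P_t\}_{t=0}^T$. Due to (A.22d), (A.30), and (A.33)–(A.34) in Lemma 6, we can apply Lemma 4 with $f_X \leftarrow f_{\tilde{X}'_t \mid \tau'_i = s, \tau'_{i+1} > t}$ and $g_X \leftarrow f_{\tilde{X}_t \mid \tau_i = s, \tau_{i+1} > t}$, yielding

$$E\left[(\tilde{X}_t - \bar{\tilde{X}}_t)^2 \mid \tau_i = s, \tau_{i+1} > t\right] \ge E\left[\tilde{X}_t^{\prime 2} \mid \tau'_i = s, \tau'_{i+1} > t\right]. \tag{A.49}$$

Combining (A.47) and (A.49), we conclude by the law of total expectation that $\{P_t^{\mathrm{sym}}\}_{t=0}^T$ achieves an MSE no larger than that achieved by $\{P_t\}_{t=0}^T$.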
