Chapter V: Conclusion
A.2 Proof of Theorem 1
We first introduce Lemmas2–5that supply majorization and real induction tools for proving Theorem1.
Functionf majorizesg,f ≻g, if and only if for any Borel measurable setB ∈ BR with finite Lebesgue measure, there exits a Borel measurable setA ∈ BRwith the
same Lebesgue measure, such that [2]
Z
B
g(x)dx ≤ Z
A
f(x)dx. (A.14)
Functionf :R→Riseveniff(x) =f(−x)for allx∈R.
Functionf :R→Risquasi-concaveif for allx, y ∈R,0≤λ≤1,
f(λx+ (1−λ)y)≥min{f(x), f(y)}. (A.15) We denote by1(a,b)(x)an indicator function that is equal to1if and only ifx∈(a, b). Lemmas2–4, stated next, show several majorization properties of pdfs.
Lemma 2. ([2, Lemma 2]) Fix two pdfsfX andgX, such thatfX is even and quasi- concave and fX ≻ gX. Fix a scalar c > 0, and a function h: R → [0,1], such
that Z
R
fX(x)1(−c,c)(x)dx= Z
R
gX(x)h(x)dx, (A.16) Then,
fX|X∈(−c,c) ≻gX′ , (A.17)
where the pdfsfX|X∈(−c,c)andg′X are given by,
fX|X∈(−c,c)(x) = fX(x)1(−c,c)(x) R
RfX(x)1(−c,c)(x)dx gX′ (x) = gX(x)h(x)
R
RgX(x)h(x)dx.
(A.18)
Lemma 3. ([108, Lemma 6.7]) Fix two pdfsfX andgX, such thatfX is even and quasi-concave and thatfX majorizesgX,fX ≻gX. Fix an even and quasi-concave pdfrY. Then, the convolution offX andrY majorizes the convolution ofgX andrY,
fX ∗rY ≻gX ∗rY, (A.19)
Furthermore,fX ∗rY is even and quasi-concave.
Lemma 4. ([2, Lemma 4]) Fix two pdfs fX and gX such that fX is even and quasi-concave and thatfX majorizesgX,fX ≻gX. Then,
Z
R
x2fX(x)dx≤ Z
R
(x−y)2gX(x)dx, ∀y∈R. (A.20)
Lemma5, stated next, provides a mathematical proof technique calledreal induction. We will use it to prove that the assertions in Lemma 6, stated below, hold on a continuous interval.
Lemma 5. (Real induction [109, Thm. 2]) A subset S ⊂ [a, b], a < b is called inductive if
1) a∈S;
2) Ifa ≤x < b,x∈S, then there existsy > xsuch that[x, y]∈S;
3) Ifa ≤x < b,[a, x)∈S, thenx∈S.
If a subsetS ⊂[a, b]is inductive, thenS = [a, b].
A technical lemma
We define the following notations for two sampling-decision processes{Pt}Tt=0and {Ptsym}Tt=0(see AppendixA.1). Fix an arbitrary sampling-decision process{Pt}Tt=0 (A.1) satisfying (S.1)–(S.2). It gives rise to a sampling policy with stopping times τ1, τ2, . . . via (A.1). We recall the definition of the mean-square residual error (MSRE) process{X˜t}Tt=0in (P.3) and denote the MSRE process under{Pt}Tt=0 as
X˜t = ˜Xt({Ps}Ts=0) (A.21a)
≜Xt−E[Xt|Xτi, τi], t ∈[τi, τi+1). (A.21b) We define the residual error estimate (REE) process{X¯˜t}Tt=0under{Pt}Tt=0as
¯˜
Xt =X¯˜t({Ps}Ts=0) (A.22a)
≜X¯t−E[Xt|Xτi, τi] (A.22b)
=E[ ˜Xt|{Xτj}ij=1, τi, t < τi+1] (A.22c)
=E[ ˜Xt|τi, t < τi+1], t∈[τi, τi+1), (A.22d) where X¯t = ¯Xt({Ps}Ts=0) is the MMSE decoding policy defined in (2.2); the equality in (A.22c) holds sinceE[Xt|Xτi, τi] ∈σ({Xτj}ij=1, τi, t < τi+1); (A.22d) holds because X˜t is independent of {Xτj}ij=1, τi due to (P.3-a), and the event {t < τi+1}is independent of {Xτj}ij=1, τi−1 given τi due to (S.2). We recall that N({Pt}Tt=0)defined above Proposition5in AppendixA.1represents the number of stopping times in[0, T], and we simplify this notation as
N ≜N({Pt}Tt=0). (A.23)
We denote the left-closed continuous interval
Ωτi+1(s)≜{t∈[s, T] :P[τi+1 > t|τi =s]>0}, (A.24) for alls∈Supp(fτi), and the left-open continuous interval
Ω¯τi+1(s)≜Ωτi+1(s)\ {s}. (A.25)
Given {Pt}Tt=0, we construct a sampling-decision process {Ptsym}Tt=0 (A.1) of the form (2.9), which via (A.1) is associated with a sampling policy with stopping times τ1′, τ2′, . . . , such that the symmetric thresholds {ai(r, s)}Tr=s of {Ptsym}Tt=0 satisfy for alls∈Supp(fτi),t∈[s, T],
P[ ˜Xr′ ∈(−ai(r, s), ai(r, s)),∀r ∈[s, t]|τi′ =s]
= P[τi+1 > t|τi =s]. (A.26)
This is possible since by adjusting the thresholds, the left side of (A.26) can be equal to any non-increasing function intbounded between[0,1]. Under{Ptsym}Tt=0 (A.26), for alls ∈Supp(fτi),i= 1,2, . . ., it holds that
Ωτi(s) = Ωτ′
i(s), (A.27)
Ω¯τi(s) = ¯Ωτ′
i(s). (A.28)
We denote the MSRE and the REE processes and the number of stopping times on [0, T]under{Ptsym}Tt=0 respectively by
X˜t′ = ˜Xt({Pssym}Ts=0), (A.29)
¯˜
Xt′ = ˜X¯t({Pssym}Ts=0) = 0, (A.30)
N′ =N({Pssym}Ts=0), (A.31)
where (A.30) holds since we can write X¯˜t′ as (A.22d) with τi replaced byτi′ using the argument that justifies (A.22d); X˜t′ has an even and quasi-concave pdf due to the assumption (P.3-b), and the pdf of X˜t conditioned on τi′, t < τi+1′ under a symmetric threshold sampling-decision process of the form (2.9) is still even and quasi-concave.
We denote the following probabilities
Qi(a, b, c, d)≜P[τi+1 > a|τi+1 > b, τi =c,X˜a =d] (A.32a) Q′i(a, b, c, d)≜P[τi+1′ > a|τi+1′ > b, τi′ =c,X˜a′ =d]. (A.32b)
We proceed to introduce Lemma6using the notations defined in (A.21)–(A.32b).
We will use the assertions in Lemma6to compare the MSEs achieved by{Pt}Tt=0 and{Ptsym}Tt=0.
Lemma 6. The pdfs fX˜t|τi=s,τi+1>t and fX˜t′|τi′=s,τi+1′ >t exist for all s ∈ Supp(fτi), t∈Ω¯τi+1(s). Furthermore, for alls∈Supp(fτi),t∈Ω¯τi+1(s), it holds that
fX˜t′|τi′=s,τi+1′ >t ≻fX˜t|τi=s,τi+1>t, (A.33) fX˜t′|τi′=s,τi+1′ >t is even and quasi-concave. (A.34) Proof of Lemma6. We prove thatfX˜t|τi=s,τi+1>texists. The proof thatfX˜t′|τi′=s,τi+1′ >t
exists is similar. SinceX˜tatt≥τi =s, is independent ofFsby (P.3-a) and is equal toRt(s, s)by (P.3-b), we computefX˜t|τi=s,τi+1>susing (2.5),
fX˜t|τi=s,τi+1>s=fRt(s,s). (A.35) Thus, fX˜t|τi=s,τi+1>s exists since fRt(s,s) is a valid pdf by (P.3-b). To establish that fX˜t|τi=s,τi+1>t(y)exists, we compute
fX˜t|τi=s,τi+1>t(y) = fX˜t|τi=s,τi+1>s,τi+1>t(y) (A.36a)
= Qi(t, s, s, y)fX˜t|τi=s,τi+1>s(y)
P[τi+1 > t|τi =s, τi+1 > s] , (A.36b) where (A.36a) holds sinceτi+1 > timpliesτi+1 > s. In (A.36b), we observe that for allt∈Ω¯τi+1(s), the pdffX˜t|τi+1>s,τi=sexists by (A.35); the denominator of (A.36b) is nonzero. We conclude that the pdf fX˜t|τi=s,τi>t exists for all s ∈ Supp(fτi), t∈Ω¯τi+1(s).
The assertion (A.33) holds if and only if
(a) for alls ∈Supp(fτi),t∈ Ω¯τi+1(s)and for any Borel measurable setB ∈ BR with finite Lebesgue measure, there exists a Borel measurable set A ∈ BR with the same Lebesgue measure, such that
P[ ˜Xt′ ∈ A|τi′ =s, τi+1′ > t]
≥ P[ ˜Xt∈ B|τi =s, τi+1 > t], (A.37) holds. This is because (A.37) is a rewrite of (A.33) using the definition of majoriza- tion (A.14).
The assertion (A.34) holds if and only if for all s∈ Supp(fτi), t ∈Ω¯τi+1(s), all of the following hold:
(b) the conditional cdf P[ ˜Xt′ ≤ y|τi′ = s, τi+1′ > t] is convex for y < 0 and is concave fory >0;
(c) for anyy >0,
P[ ˜Xt′ ∈(0, y]|τi′ =s, τi+1′ > t]
=P[ ˜Xt′ ∈[−y,0)|τi′ =s, τi+1′ > t]. (A.38) This is becausefX˜t′|τi′=s,τi+1′ >tis quasi-concave if and only if (b) holds, andfX˜t′|τi′=s,τi+1′ >t
is even if and only if (c) holds.
Items (a)–(c) facilitate proving that the assertions (A.33)–(A.34) hold on the left- open intervalΩ¯τi+1(s). Real induction, which must be used on a left-closed interval, does not apply to show (A.33)–(A.34) directly, since the densities in (A.33)–(A.34) do not exist att = s. Instead, we apply real induction to show (a)–(c). Using real induction in Lemma 5, we verify that conditions 1), 3), 2) in Lemma 5 hold for (a)–(c) in ont∈Ωτi+1(s)one by one.
To verify that the condition 1) in Lemma5holds, we need to show that (a)–(c) hold fort=s. This is trivial since
P[ ˜Xs′ = 0|τi′ =s, τi+1′ > s]
= P[ ˜Xs = 0|τi =s, τi+1> s]
= 1.
(A.39)
Next, we show that condition 3) in Lemma5holds, that is, assuming that (a)–(c) hold for all t ∈ [s, r), r ∈ Ω¯τi+1(s), we prove that (a)–(c) hold fort =r. Equivalently, we show that (A.33)–(A.34) hold for t =r. Letδ ∈ (0, r−s]. At timet = r, we calculate the left side of (A.33) as
fX˜r′|τi′=s,τi+1′ >r(y)
= lim
δ→0+fX˜r′|τi′=s,τi+1′ >r−δ,τi+1′ >r(y) (A.40a)
= lim
δ→0+
Q′i(r, r−δ, s, y)fX˜r′|τi′=s,τi+1′ >r−δ(y) R
RQ′i(r, r−δ, s, y)fX˜r′|τi′=s,τi+1′ >r−δ(y)dy (A.40b)
= lim
δ→0+
1(−ai(r,s),ai(r,s))(y)fX˜r′|τi′=s,τi+1′ >r−δ(y) R
R1(−ai(r,s),ai(r,s))(y)fX˜r′|τi′=s,τi+1′ >r−δ(y)dy, (A.40c) where (A.40a) holds since the eventτi+1′ > r implies the eventτi+1′ > r−δ; the pdffX˜′r|τi′=s,τi+1′ >r−δin (A.40b) exists since (A.36) holds withX˜t,τi =s,τi+1 > s,
τi+1 > treplaced byX˜r′,τi′ =s,τi+1′ > s,τi+1′ > r−δ, respectively; (A.40c) holds since
lim
δ→0+Q′i(r, r−δ, s, y) =1(−ai(r,s),ai(r,s))(y). (A.41) Similarly, replacingQ′iin (A.40b) byQi, we calculate the right side of (A.33) as
fX˜r|τi=s,τi+1>r(y)
= lim
δ→0+
Qi(r, r−δ, s, y)fX˜r|τi=s,τi+1>r−δ(y) R
RQi(r, r−δ, s, y)fX˜r|τi=s,τi+1>r−δ(y)dy, (A.42) where the pdf fX˜r|τi=s,τi+1>r−δ(y) exists since (A.36) holds with X˜t, τi+1 > t replaced byX˜r,τi+1 > r−δrespectively.
To check that (A.33) holds att = r, we first prove thatfX˜r′|τi′=s,τi+1′ >r−δ majorizes fX˜r|τi=s,τi+1>r−δ. Note that Rr(r−δ, s)is independent of {X˜t}r−δt=0 due to (P.3-a), and thus is independent of the event {τi+1′ > r−δ, τi′ = s}. We obtain X˜r′ using (2.5),
fX˜r′|τi′=s,τi+1′ >r−δ=fq
r(r−δ) ˜Xr−δ′ |τi′=s,τi+1′ >r−δ∗fRr(r−δ,s). (A.43) By (A.43) and the inductive hypothesis that (a)–(c) holds for t ∈ [s, r), the as- sumptions in Lemma 3 are satisfied with fX ← fqr(r−δ) ˜X′
r−δ|τi′=s,τi+1′ >r−δ, gX ← fqr(r−δ) ˜Xr−δ|τ
i=s,τi+1>r−δ,rY ←fRr(r−δ,s). We conclude that
fX˜r′|τi′=s,τi+1′ >r−δ ≻fX˜r|τi=s,τi+1>r−δ, (A.44) fX˜r′|τi′=s,τi+1′ >r−δis even and quasi-concave. (A.45) Due to (A.45) and the fact that the indicator function in (A.40c) is over an interval symmetric about zero, we conclude (A.34) holds for t = r. By (A.26), (A.44) and (A.45), the assumptions in Lemma2are satisfied withfX ←fX˜′r|τi′=s,τi+1′ >r−δ, gX ← fX˜r|τi=s,τi+1>r−δ, fX|X∈(−c,c) ← fX˜r′|τi′=s,τi+1′ >r, and g′X ← fX˜r|τi=s,τi+1>r, c←ai(r, s),h←Qi(r, r−δ, s, y). Thus, we conclude that (A.33) holds fort=r. Therefore, (A.33)–(A.34) hold fort=r, i.e., (a)–(c) hold fort=r.
To prove that the condition 2) in Lemma5holds, we assume (a)–(c) hold fort=r, and prove that the following holds:
δ→0lim+fX˜r+δ′ |τi′=s,τi+1′ >r+δ≻ lim
δ→0+fX˜r+δ|τi=s,τi+1>r+δ, (A.46a) lim
δ→0+fX˜r+δ′ |τi′=s,τi+1′ >r+δis even and quasi-concave. (A.46b)
The right and the left sides of (A.46a) are equal to (A.40c) and (A.42) respectively withrreplaced by r+δ. It is easy to see that (A.43)–(A.45) and the assumptions in Lemma2hold withrreplaced byr+δ. Thus, we conclude that (A.46) holds.
Using the real induction in Lemma 5, we have shown that (a)–(c) hold for all s ∈ Supp(fτi), t ∈ Ωτi+1(s). Thus, (A.33)–(A.34) hold for all s ∈ Supp(fτi), t∈Ω¯τi+1(s).
Proof of Theorem1
The sampling-decision process{Ptsym}Tt=0 leads to the same average sampling fre- quency as {Pt}Tt=0. This is because (A.26) implies that for all s ∈ Supp(fτi), t∈[s, T],
P[τi+1 > t|τi =s] =P[τi+1′ > t|τi′ =s]. (A.47) Together with the Markov property of the stopping times (assumption (S.2)), (A.47) implies that the joint distribution of τ1, τ2, . . . is equal to the joint distribution of τ1′, τ2′, . . . We conclude that {Pt}Tt=0 and {Ptsym}Tt=0 lead to the same average sampling frequency
E[N] =E[N′]. (A.48)
Next, we show{Ptsym}Tt=0achieves an MSE no larger than that achieved by{Pt}Tt=0. Due to (A.22d), (A.30), and (A.33)–(A.34) in Lemma 6, we can apply Lemma 4 withfX ←fX˜t′|τi′=s,τi+1′ >t andgX ←fX˜t|τi=s,τi+1>t, yielding
E h
( ˜Xt−X¯˜t)2|τi =s, τi+1 > ti
≥E
hX˜t′2|τi′ =s, τi+1′ > ti
. (A.49)
Combining (A.47) and (A.49), we conclude by law of total expectation that{Ptsym}Tt=0 achieves an MSE no larger than that achieved by{Pt}Tt=0.