Chapter II: Denoising
2.2 Summary of operator-adapted wavelets
We proceed by reviewingoperator-adapted waveletsas in [153, Sec. 2], also named gambletsin reference to their game theoretic interpretation, and their main properties [99,101,104,119]. They are constructed with a hierarchy of measurement functions and an operator. Theorem2.2.4shows these gamblets are simultaneously associated with Gaussian conditioning, optimal recovery, and game theory. By selecting these measurement functions to bepre-Haar wavelets, the gamblets are localized both in space and in the eigenspace of the operator.
Hierarchy of measurement functions
Letπ β Nβ (used to represent a number of scales). Let(I(π))1β€πβ€π be a hierarchy of labels defined as follows. I(π) is a set of π-tuples consisting of elements π = (π1, . . . , ππ). For 1 β€ π β€ π andπ β I(π),π(π) := (π1, . . . , ππ) andI(π) is the set of
π-tuplesI(π) = {π(π)|π β I(π)}. For 1 β€ π β€ π β€ π and π = (π1, . . . , ππ) β I(π), we write π(π) = (π1, . . . , ππ). We say that π is aI(π) Γ I(π) matrix if its rows and columns are indexed by elements ofI(π) andI(π), respectively.
Let {π(π)
π |π β {1, . . . , π}, π β I(π)} be a nested hierarchy of elements of Hβπ (Ξ©) such that(π(π)
π )πβI(π) are linearly independent and π(π)
π = Γ
πβI(π+1)
π(π , π+1)
π, π π(π+1)
π (2.2.1)
forπ β I(π),π β {1, . . . , πβ1}, whereπ(π , π+1) is anI(π) Γ I(π+1) matrix and
π(π , π+1)π(π+1, π) = πΌ(π). (2.2.2)
In (2.2.2), π(π+1, π) is the transpose of π(π , π+1) and πΌ(π) is the I(π) Γ I(π) identity matrix.
Hierarchy of operator-adapted pre-wavelets Let(π(
π)
π )πβI(π)be the hierarchy of optimal recovery splines associated with(π(
π) π )πβI(π), i.e., forπ β {1, . . . , π}andπ β I(π),
π(
π)
π = Γ
πβI(π)
π΄(
π) π, π Lβ1π(
π)
π , (2.2.3)
where
π΄(π) :=(Ξ(π))β1 (2.2.4)
andΞ(π) is theI(π)Γ I(π) symmetric positive definite Gramian matrix with entries (writing [π, π£] for the duality pairing betweenπ β Hβπ (Ξ©) andπ£ β Hπ
0(Ξ©)) Ξπ, π(π) =[π(
π)
π ,Lβ1π(
π)
π ]. (2.2.5)
Note that π΄(π) is the stiffness matrix of the elements(π(
π)
π )πβI(π) in the sense that π΄(
π) π, π =
π(
π) π , π(
π) π
. (2.2.6)
Writing Ξ¦(π) :=span{π(π)
π | π β I(π)} andπ(π) := span{π(π)
π | π β I(π)}, Ξ¦(π) β Ξ¦(π+1) and Ξ¨(π) = Lβ1Ξ¦(π) imply Ξ¨(π) β Ξ¨(π+1). We further write [π(π), π’] =
[π(π)
π , π’]
πβI(π) βRI
(π). The(π(π)
π )πβI(π) and (π(π)
π )πβI(π) form a bi-orthogonal system in the sense that [π(π)
π , π(π)
π ] =πΏπ, π forπ, π β I(π) (2.2.7) and the
Β·,Β·
-orthogonal projection ofπ’ β Hπ
0(Ξ©)onΞ¨(π) is π’(π) := Γ
πβI(π)
[π(
π) π , π’]π(
π)
π . (2.2.8)
Multiple interpretations of operator adapted pre-wavelets Using operator-adapted pre-wavelets,π(π)
π , we summarize the connections between optimal recovery, game theory, and Gaussian conditioning. First, we define Gaussian fields, a generalization of Gaussian processes.
Definition 2.2.1. The canonical Gaussian field π associated with operator L : Hπ
0(Ξ©) β Hβπ (Ξ©) is defined such that π β¦β [π, π] is the linear isometry from Hβπ (Ξ©) to a Gaussian space characterized by


ο£²

ο£³
[π, π] βΌ N (0,kπkβ2) Cov [π, π],[π, π]
= hπ, πiβ,
(2.2.9)
where kπkβ =supπ’βHπ
0(Ξ©)
β«
Ξ©ππ’
kπ’k is the dual norm ofk Β· k.
Remark 2.2.2. Whenπ > π/2, the evaluation functionalπΏπ₯(π) = π(π₯) is continu- ous. Hence,π|πΏπ₯,π₯βΞ©is naturally isomorphic to a Gaussian process with covariance functionπ(π₯ , π₯0) =hπΏπ₯, πΏπ₯0iβ.
Several notable properties of these pre-wavelets are summarized in the following result. Recall we write[π(π), π’] = [π(π)
π , π’]
πβI(π) βRI
(π). Theorem 2.2.3. Consider pre-wavelets π(
π)
π adapted to operator L constructed with measurement functions π(π)
π . Further, suppose that for π’ β Hπ
0(Ξ©) we define π£β (π’) =π’(π) =Γ
πβI(π)[π(π)
π , π’]π(π)
π . 1. For fixedπ’ β Hπ
0(Ξ©),π£β (π’) is the minimizer of


ο£²

ο£³
Minimizekπk Subject toπ β Hπ
0(Ξ©)and [π(π), π] =[π(π), π’].
(2.2.10)
2. For fixedπ’ β Hπ
0(Ξ©),π£β (π’) is the minimizer of


ο£²

ο£³
Minimizekπ’βπk Subject toπ βspan{π(π)
π :π β I(π)}.
(2.2.11)
3. For canonical Gaussian fieldπ βΌ N (0,Lβ1), π£β (π’) =E
π
[π(π), π] = [π(π), π’]
. (2.2.12)
4. It is true that2
π£β βargminπ£βπΏ(Ξ¦,Hπ
0(Ξ©)) sup
π’βHπ
0(Ξ©)
kπ’βπ£(π’) k
kπ’k (2.2.13)
Proof. (1) is a result of [101, Cor. 3.4] (2) is equivalent to [101, Thm. 12.2] (3) and
(4) are results in [101, Sec. 8.5].
This result shows that the operator-adapted pre-wavelets transform defined by π£β (π’) = π’(π) is an optimal recovery in the sense of Theorem 2.2.3.1-2. Simul- taneously, π£β (π’) are conditional expectations of the canonical Gaussian field with respect to the measurements [π(π),Β·] as in Theorem 2.2.3.3. Another interpreta- tion of the transform is game theoretic as expressed in Theorem2.2.13.4. Equation (2.2.13) represents the adversarial two player game where player I selectsπ’ β Hπ
0(Ξ©) and player II approximatesπ’ withπ£(π’) with measurements [π(π), π’]. Player I and II aim to maximize and minimize the recovery error ofπ£(π’). This game theoretic interpretation inspires the name gamblets, referring to operator-adapted wavelets.
Note that the pre-waveletsπ(π)
π lie on only one level of the hierarchy. The following addresses the construction of a wavelet decomposition ofHπ
0(Ξ©)on all hierarchical levels.
Operator-adapted wavelets
Let(J(π))2β€πβ€πbe a hierarchy of labels such that, writing|J(π)|for the cardinal of J(π),
|J(π)|=|I(π)| β |I(πβ1)|. (2.2.14) Forπ β {2, . . . , π}, letπ(π) be a J(π) Γ I(π) matrix such that3
Ker(π(πβ1, π)) =Im(π(π),π). (2.2.15)
Forπ β {2, . . . , π}andπ β J(π) define π(π)
π := Γ
πβI(π)
π(π)
π, π π(π)
π , (2.2.16)
and write π(π) := span{π(π)
π | π β J(π)}. Then π(π) is the
Β·,Β·
-orthogonal complement ofπ(πβ1) inπ(π), i.e. π(π) = π(πβ1) βπ(π),and
π(π) = π(1) βπ(2) β Β· Β· Β· βπ(π). (2.2.17)
2πΏ(Ξ¦,Hπ
0(Ξ©)) is defined as the set ofHπ
0(Ξ©) β Hπ
0(Ξ©) functions that are of form π£(π’) = Ξ¨ [π(π), π’])
with measureableΨ:RI
(π) β H0π (Ξ©).
3We writeπ(π),π andπ(π),β1for the transpose and inverse of a matrixπ(π).
Forπ β {2, . . . , π}write
π΅(π) :=π(π)π΄(π)π(π),π . (2.2.18) Note thatπ΅(π) is the stiffness matrix of the elements(π(π)
π )πβJ(π), i.e., π΅(π)
π, π = π(π)
π , π(π)
π
. (2.2.19)
Further, forπ β {2, . . . , π}, define
π(π) := π΄(π)π(π),ππ΅(π),β1 (2.2.20)
and, forπ β J(π),
π(π), π
π := Γ
πβI(π)
π(π),π
π, π π(π)
π . (2.2.21)
Then definingπ’(π) as in (2.2.8), it holds true that forπ β {2, . . . , π},π’(π)βπ’(πβ1) is the
Β·,Β·
-orthogonal projection ofπ’ onπ(π) and π’(π) βπ’(πβ1) = Γ
πβJ(π)
[π(
π), π π , π’]π(
π)
π . (2.2.22)
To simplify notations, write J(1) :=I(1), π΅(1) := π΄(1), π(1) := πΌ(1), π(1)
, π
π := π(1)
π
forπ β J(1),J := J(1)βͺ Β· Β· Β· βͺ J(π), ππ := π(
π)
π andπ
π π :=π(
π), π
π forπ β J(π) and4 1β€ π β€ π. Then theπ
π
π and ππ form a bi-orthogonal system, i.e., [π
π
π , ππ] =πΏπ, π forπ, π β J (2.2.23) and
π’(π) =Γ
πβJ
[π
π
π , π’]ππ. (2.2.24)
Simplifying notations further, we will write [ππ, π’] for the J vector with entries [π
π
π , π’]and πfor theJ vector with entries ππso that (2.2.24) can be written π’(π) = [ππ, π’] Β·π . (2.2.25) Further, define the J by J block-diagonal matrix π΅ defined as π΅π, π = π΅(π)
π, π if π, π β J(π) and π΅π, π = 0 otherwise. Note that it holds that π΅π, π =
ππ, ππ
. When π =βandβͺβ
π=1Ξ¦(π) is dense inHβπ (Ξ©), then, writingπ(1) := π(1), Hπ
0(Ξ©) =ββ
π=1π(π), (2.2.26)
4The dependence onπis left implicit to simplify notation, forπ β Jthere exists a uniqueπsuch thatπβ J(π).
π’(π) = π’, and (2.2.24) is the corresponding multi-resolution decomposition of π’. When π < β, π’(π) is the projection of π’ on βπ
π=1π(π) and (2.2.25) is the corresponding multi-resolution decomposition. Note that the optimal recovery, game theory, and Gaussian conditioning results in Theorem 2.2.3 also holds for wavelets.
Theorem 2.2.4. Consider pre-wavelets ππ adapted to operatorL constructed with measurement functionsππ. Further, suppose that forπ’ β Hπ
0(Ξ©), we defineπ£β (π’)= π’(π) = [ππ, π’] Β·π.
1. For fixedπ’ β Hπ
0(Ξ©),π£β (π’) is the minimizer of


ο£²

ο£³
Minimizekπk Subject toπ β Hπ
0(Ξ©) and[ππ, π] =[ππ, π’].
(2.2.27)
2. For fixedπ’ β Hπ
0(Ξ©),π£β (π’) is the minimizer of


ο£²

ο£³
Minimizekπ’βπk
Subject toπ βspan{ππ :π β J }.
(2.2.28)
3. For canonical Gaussian fieldπ βΌ N (0,Lβ1), π£β (π’) =E
π
[ππ, π] = [ππ, π’]
. (2.2.29)
4. It is true that5
π£β βargminπ£βπΏ(Ξ¦,H0π (Ξ©)) sup
π’βHπ
0(Ξ©)
kπ’βπ£(π’) k
kπ’k . (2.2.30)
Pre-Haar wavelet measurement functions
The gamblets used in the subsequent developments will use pre-Haar wavelets (as defined below) as measurement functionsπ(
π)
π and our main near-optimal denoising estimates will be derived from their properties (summarized in Thm.2.2.5).
LetπΏ, β β (0,1). Let(π(
π)
π )πβI(π)be uniformly Lipschitz convex sets forming a nested partition of Ξ©, i.e., such that Ξ© = βͺπβI(π)π(π)
π , π β {1, . . . , π} is a disjoint union except for the boundaries, and π(π)
π = βͺπβI(π+1):π(π)=ππ(π+1)
π , π β {1, . . . , πβ1}.
5Here πΏ(Ξ¦,H0π (Ξ©)) is defined as the set ofH0π (Ξ©) β H0π (Ξ©) functions that are of form π£(π’)= Ξ¨ [ππ, π’])
with measureableΞ¨:RJ β H0π (Ξ©).
Assume that eachπ(π)
π , contains a ball of radiusπΏ βπ, and is contained in the ball of radiusπΏβ1βπ. Writing |π(
π)
π |for the volume ofπ(
π) π , take π(
π)
π :=1
π(π)
π
|π(
π)
π |β12 . (2.2.31)
The nesting relation (2.2.1) is then satisfied with π(π , π+1)
π, π := |π(π+1)
π |12|π(π)
π |β12 for π(π) =πandπ(
π , π+1)
π, π :=0 otherwise.
Forπ β {2, . . . , π}, let J(π) be a finite set of π-tuples of the form π = (π1, . . . , ππ) such that{π(πβ1) | π β J(π)} =I(πβ1), and forπ β I(πβ1), Card{π β J(π) | π(πβ1) = π} =Card{π β I(π) |π (πβ1) =π} β1. Note that the cardinalities of these sets satisfy (2.2.14).
Write π½(π) for the J(π) Γ J(π) identity matrix. For π = 2, . . . , π, let π(π) be a J(π) Γ I(π) matrix such that Im(π(π),π) =Ker(π(πβ1, π)),π(π)(π(π))π =π½(π) and π(π)
π, π =0 forπ(πβ1) β π(πβ1).
Theorem 2.2.5. With pre-Haar wavelet measurement functions, it holds true that 1. Forπ β {1, . . . , π} andπ’ β Lβ1πΏ2(Ξ©),
kπ’βπ’(π)k β€πΆ βπ π kLπ’kπΏ2(Ξ©). (2.2.32) 2. Writing Cond(π) for the condition number of a matrix π, we have for
π β {1,Β· Β· Β· , π}
πΆβ1ββ2(πβ1)π π½(π) β€ π΅(π) β€ πΆ ββ2π π π½(π) (2.2.33) andCond(π΅(π)) β€πΆ ββ2π .
3. Forπ β I(π) andπ₯(
π)
π βπ(
π)
π ,
kππkHπ (Ξ©\π΅(π₯(π)
π ,π β)) β€ πΆ ββπ πβπ/πΆ. (2.2.34)
4. The waveletsπ(
π) π , π(
π)
π and stiffness matrices π΄(π), π΅(π) can be computed to precision π (in k Β· k-energy norm for elements of Hπ
0(Ξ©) and in Frobenius norm for matrices) inO(πlog3π ππ) complexity.
Furthermore the constantπΆ depends only onπΏ,Ξ©, π , π , kL k := sup
π’βH0π (Ξ©)
kLπ’kHβπ (Ξ©) kπ’kHπ
0(Ξ©)
and kLβ1k := sup
π’βH0π (Ξ©)
kπ’kHπ
0(Ξ©)
kLπ’kHβπ (Ξ©) .
(2.2.35)
Proof. (1) and (2) follows from an application of Prop. 4.17 and Theorems 4.14 and 3.19 from [100]. (3) follows from Thm. 2.23 of [100]. 4 follows from the complexity analysis of Alg. 6 of [100]. See [101] for detailed proofs.
Remark 2.2.6. The wavelets π(
π) π , π(
π)
π and stiffness matrices π΄(π), π΅(π) can also be computed in O(πlog2πlog2π ππ) complexity using the incomplete Cholesky factorization approach of [119].
Theorem 2.2.5.2-3 implies that the gamblets are localized both in the eigenspace of operator L and inΞ©space. Further, Theorem2.2.5.1 shows the accuracy of the recovery,π’(π), in L norm is bounded byπΏ2norm of Lπ’. This result is used in the proofs of the denoising result shown in the following section.
2.3 Denoising by truncating the gamblet transform