E9 211: Adaptive Signal Processing
Lecture 7: Linear estimation
Outline
1. Optimal estimator in the vector case (Ch. 2.1)
2. Normal equations (Ch. 3.1 and 3.2)
Vector-valued data
- Suppose x : p×1 and y : q×1 are vector valued. Then we denote the estimator for x as \hat{x} = h(y). Explicitly,

\hat{x} = \begin{bmatrix} \hat{x}_0 \\ \hat{x}_1 \\ \vdots \\ \hat{x}_{p-1} \end{bmatrix} = \begin{bmatrix} h_0(y) \\ h_1(y) \\ \vdots \\ h_{p-1}(y) \end{bmatrix}
- We can then seek optimal functions \{h_k(\cdot)\} that minimize the error in each component of x:

\min_{h_k(\cdot)} \; E(|\tilde{x}_k|^2) = E(|x_k - h_k(y)|^2)
- Therefore, the optimal estimator for x_k given y in the least-mean-square error sense is

\hat{x}_k = E(x_k \mid y)
Vector-valued data
- Suppose \tilde{x} = [x_0 - \hat{x}_0, \; x_1 - \hat{x}_1, \; \ldots, \; x_{p-1} - \hat{x}_{p-1}]^T. Then,

E(\tilde{x}^H \tilde{x}) = E(|\tilde{x}_0|^2) + E(|\tilde{x}_1|^2) + \cdots + E(|\tilde{x}_{p-1}|^2) = \mathrm{Tr}(R_{\tilde{x}})
- Since each term in the sum depends only on the corresponding function h_k(\cdot), minimizing each term separately over its own h_k(\cdot), i.e.,

\min_{h_k(\cdot)} \; E(|\tilde{x}_k|^2) = E(|x_k - h_k(y)|^2)

is equivalent to minimizing the trace of the error covariance matrix over all \{h_k(\cdot)\}:

\min_{\{h_k(\cdot)\}} \; \mathrm{Tr}(R_{\tilde{x}})
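As a quick numeric check of the identity E(\tilde{x}^H \tilde{x}) = \mathrm{Tr}(R_{\tilde{x}}), the sketch below draws correlated error samples and compares the sample mean squared norm with the trace of the sample error covariance; the error distribution used here is an illustrative assumption.

```python
import numpy as np

rng = np.random.default_rng(0)
p, n = 3, 100_000
L = rng.standard_normal((p, p))
e = rng.standard_normal((n, p)) @ L.T      # correlated error samples, rows are x~

Rerr = (e.conj().T @ e) / n                # sample error covariance R_x~
mean_sq_norm = np.mean(np.sum(np.abs(e) ** 2, axis=1))  # sample E(x~^H x~)

# The two quantities agree: the trace of R_x~ sums the per-component MSEs.
```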
Linear estimator
- Suppose

\bar{x} = E(x) = 0; \quad \bar{y} = E(y) = 0

and

R_x = E(xx^H); \quad R_y = E(yy^H); \quad R_{xy} = E(xy^H)
- We restrict attention to the subclass of estimators of the form

h(y) = Ky + b, \quad K : p \times q, \quad b : p \times 1
- Such linear estimators depend only on the first- and second-order moments of x and y; full knowledge of the conditional pdf is not required.
Linear estimator
We find K and b such that the
- estimator is unbiased
- trace of the error covariance matrix is minimized

- For unbiasedness, the following must be satisfied:

E(\hat{x}) = E(Ky + b) = K E(y) + b = b

Since E(x) = 0, this means we must have b = 0.
- Explicitly,

\hat{x} = Ky = \begin{bmatrix} \hat{x}_0 \\ \hat{x}_1 \\ \vdots \\ \hat{x}_{p-1} \end{bmatrix} = \begin{bmatrix} k_0^H y \\ k_1^H y \\ \vdots \\ k_{p-1}^H y \end{bmatrix}

where k_i^H is the i-th row of K.
Linear estimator
To find K we solve

\min_{K} \; \mathrm{Tr}(R_{\tilde{x}})

or, equivalently, for each i,

\min_{k_i} \; E(|\tilde{x}_i|^2) = E(|x_i - k_i^H y|^2)
- We denote the cost function

J(k_i) = E(|x_i|^2) - E(x_i y^H) k_i - k_i^H E(y x_i^H) + k_i^H E(yy^H) k_i
       = \sigma_{x,i}^2 - R_{xy,i} k_i - k_i^H R_{yx,i} + k_i^H R_y k_i
- Setting the gradient of J(k_i) with respect to k_i to zero, we get

k_i^H R_y = R_{xy,i}, \quad i = 0, 1, \ldots, p-1,

or, collecting the rows, the solution matrix should satisfy
Normal equations
K R_y = R_{xy}
- For a unique solution we require R_y > 0 (positive definite), so that

K = R_{xy} R_y^{-1}
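A minimal NumPy sketch of solving the normal equations K R_y = R_{xy} from sample moments; the mixing matrix A, the noise level, and the sample size are illustrative assumptions, not part of the lecture. With noise independent of y, the recovered K should be close to A.

```python
import numpy as np

rng = np.random.default_rng(1)
p, q, n = 2, 3, 200_000
A = rng.standard_normal((p, q))                    # assumed true linear relation
y = rng.standard_normal((n, q))                    # zero-mean observations, rows are y^T
x = y @ A.T + 0.1 * rng.standard_normal((n, p))    # x = A y + independent noise

Ry  = (y.conj().T @ y) / n                         # sample R_y  = E(y y^H)
Rxy = (x.conj().T @ y) / n                         # sample R_xy = E(x y^H)

# K R_y = R_xy  <=>  R_y^T K^T = R_xy^T; solve instead of forming R_y^{-1}
K = np.linalg.solve(Ry.T, Rxy.T).T
```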
- The solution satisfies the orthogonality criterion:

k_i^H R_y = R_{xy,i} \;\Rightarrow\; k_i^H E(yy^H) = E(x_i y^H) \;\Rightarrow\; E[(x_i - k_i^H y) y^H] = 0
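The orthogonality criterion can be checked numerically for one scalar component x_i; the weights and noise level below are illustrative assumptions (real-valued data, so Hermitian transposes reduce to transposes). When k_i is computed from the sample moments, the sample cross-correlation between the error and the data vanishes up to floating-point error.

```python
import numpy as np

rng = np.random.default_rng(2)
q, n = 3, 100_000
y = rng.standard_normal((n, q))                    # zero-mean observations
w = np.array([0.5, -1.0, 2.0])                     # assumed true weights
x = y @ w + 0.1 * rng.standard_normal(n)           # one scalar component x_i

Ry  = (y.T @ y) / n                                # sample R_y
ryx = (y.T @ x) / n                                # sample R_yx,i = E(y x_i^H)
k = np.linalg.solve(Ry, ryx)                       # k_i from k_i^H R_y = R_xy,i

e = x - y @ k                                      # error x_i - k_i^H y
cross = (y.T @ e) / n                              # sample E[(x_i - k_i^H y) y^H]
```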
- For the non-zero-mean case, the solution is obtained by replacing x and y with the centered variables x - \bar{x} and y - \bar{y}:

\hat{x} = \bar{x} + K(y - \bar{y})
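The non-zero-mean recipe above can be sketched the same way: center the data, solve the normal equations on the centered moments, then add the mean back. The matrix B, the offsets, and the noise level are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(3)
p, q, n = 2, 2, 50_000
B = np.array([[1.0, 0.5], [-0.3, 2.0]])            # assumed true linear relation
y = rng.standard_normal((n, q)) + np.array([1.0, -2.0])          # ybar != 0
x = y @ B.T + np.array([0.5, 1.5]) + 0.1 * rng.standard_normal((n, p))

xbar, ybar = x.mean(axis=0), y.mean(axis=0)        # sample means
xc, yc = x - xbar, y - ybar                        # centered variables

Ry  = (yc.T @ yc) / n                              # centered sample R_y
Rxy = (xc.T @ yc) / n                              # centered sample R_xy
K = np.linalg.solve(Ry.T, Rxy.T).T                 # K = R_xy R_y^{-1}

xhat = xbar + (y - ybar) @ K.T                     # xhat = xbar + K (y - ybar)
```

Note that the estimator is unbiased by construction: averaging \hat{x} over the samples returns \bar{x} exactly, since the centered term averages to zero.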