• Tidak ada hasil yang ditemukan

QM−1 i=0 σ2v

i=ÄR

0 detSxx(e)M1 äM

.The MSE of the direct quantization is

εdirect = c2−b 1 M

M−1

X

k=0

Z 0

[Sxx(e)]kk

dω 2π

= c2−b Z

0

Tr(Sxx(e)) M

dω 2π.

Substituting these in (6.6), the maximized coding gain can thus be calculated as

GC = R

0 1

MTr(Sxx(e))dω R

0

Mp

detSxx(e)dω

. (6.18)

Eq. (6.18) gives a nice closed-form expression for the optimal performance that orthonormal GTD FBs can have. For the traditional filter bank optimization problem discussed in the literature [106], it is well known that the coding performance can be further improved by relaxing the or- thonormal constraint. It is thus natural to ask how much performance improvement we can get if we relax the orthonormal precoder constraint in the GTD FBs. This will be addressed in the next section.

LDU decomposition ofE(e)Sxx(e)E(e)isLe(e)D(e)Le(e),where D(e)is the diagonal

matrix for all frequencies. ♦

Substituting the result of Corollary 6.4.2 in (6.5), we can establish a lower bound onφ:

φ ≥

M−1

Y

i=0

Z 0

[L−1e ESxxEL−†e ]ii Z

0

[RR]ii

2π (6.19)

Z 0

M−1

Y

i=0

[L−1e ESxxEL−†e ]ii[RR]ii2M1 dω 2π

!2M

(6.20)

ÇZ

0

detL−1e ESxxEL−†e detRR2M1 dω 2π

å2M

(6.21)

=

ÇZ

0

(detSxx)2M1 dω 2π

å2M

, (6.22)

where in (6.20) we have used H ¨older’s inequality for integrals (6.9 in [29]), in (6.21) we have used Hadamard inequality for positive definite matrices (7.8.1 in [31]), and in (6.22) we have used the fact thatdetLe= 1anddetRE= detI= 1.

Now the question is, whether the bound (6.22) is achievable. The theorem below answers this question in the affirmative, and in particular, this bound can be achieved by a restricted case (where Ecan be decomposed as a paraunitary matrixU and a set of scalar filters) of biorthogonal GTD coder which is shown in Fig. 6.2.

Theorem 6.4.3 (The Cascade of Optimal Orthonormal GTD Filter Bank and a Set of Half-Whitening Filters is Optimal) The structure in Fig. 6.2, which is a restricted class of the biorthogonal GTD filter banks, achieves (6.22). In particular, (6.22) can be achieved by usingU(e)and{λi(e)}, whereU(e)is the precoder solution to the optimal orthonormal GTD filter banks in Section 6.3, and{λi(e)}is a set of half- whitening filters determined by the input psd. In particular, we can use the same filter for all subbands, i.e.,

λi(e) =λ(e). ♦

Proof: The proof is by construction. Consider the solution of the optimal orthonormal GTD filter banks in Sec. 6.3 which performs the frequency dependent GTD for the Cholesky factor of Sxx(e)(we reproduce Eq.(6.14) here without some modifications on the notations):

S†/2xx =QD12ΦLxP, (6.23)

where bothQandPare unitary matrices for all frequencies,Lxis a lower triangular matrix with 1on the diagonals. The middle diagonal matrixD(e)is such that[D(e)]ii= Mp

detSxx(e)ai, where the values in the set{ai}satisfyai>0andQM−1

i=0 ai= 1.

Therefore,Sxx=PLxDLxP. Note thatPis exactly the precoder of the optimal orthonormal GTD coders described in Sec. 6.3. Now, consider the system structure in Fig. 6.2. LetΛ(e) = diag(λ0(e),· · ·, λM−1(e))denote the frequency response of the scalar filters. If we takeU=P, the psd matrix ofy(n)can be expressed asSyy =ΛLxDLxΛ. It can be proved thatΛLx =LyΛ whereLyis a lower triangular matrix with1on the diagonals and[Ly]ij =λλj

i[Lx]ij. Therefore,Syy

can be expressed asSyy =LyΛDΛLy. We can take the PLT part to beL=L−1y , and the resulting psd matrix will beSvv =ΛDΛ. Substituting these quantities in (6.5), we have

φ =

M−1

Y

i=0

Z 0

[D]iii|2dω 2π

Z 0

i|−2

2π (6.24)

M−1

Y

i=0

ÇZ

0

([D]ii)1/2dω 2π

å2

(6.25)

=

ÇZ

0

(detSxx)2M1 dω 2π

å2M

, (6.26)

where (6.25) is from the Cauchy-Schwartz inequality, and (6.26) is from the fact that[D(e)]ii =

Mp

detSxx(e)aiandQM−1

i=0 ai= 1. The equality in (6.25) can be satisfied by choosing λi(e) = αi [D]ii(e)−1/4

= αi a1/4i

detSvv(e)4M1

, (6.27)

whereαi is any nonzero scalar multiplier. Thus,λi is thehalf-whitening filter in theith subband.

If we chooseαi =a1/4i , for all the subband we can use the same half-whitening filter that has the frequency response

λi(e) =λ(e) = detSxx(e)4M1 .

Therefore we just need to design one scalar filterλ(e)for all the subbands.

To summarize, we have constructed a biorthogonal GTD filter bank in the structure of Fig. 6.2 that achieves exactly (6.22), and is thus optimal for the problem (6.8).

)

0(

! ej !01(ej )

)

0(n y

)

1(

!1 ej

1

)

1(

! ej

x (n)

) (ej U

^

) (ej U

x (n)

)

1(n y

)

2(n y

)

1(

!2 ej

)

1(

!3 ej

)

2(

! ej

)

3(

! ej )

3(n y

Middle frequency ! dependent PLT Part

R(ej ) )

(ej E

Figure 6.2: A restricted case of the biorthogonal GTD subband coders forM = 4.

Theorem 6.4.3 proves that the bound (6.22) is achievable. The proof also suggests a design method for the optimal biorthogonal GTD filter banks. By working out the optimal bit loading formula (6.4) of this design scheme, we can find that the bit loading is exactly the same as (6.17).

That is,the optimal biorthogonal GTD FBs have the same bit loading scheme as the corresponding optimal orthonormal GTD FBs.This design method is summarized below.

Design of optimal biorthogonal GTD filter banks:

1. Perform the design procedure for optimal orthonormal GTD filter banks as described in Sec.

6.3 to obtained the precoderU(e)and the estimatorsLx(e).

2. Design the half-whitening filter according to (6.27). The precoder is chosen as E(e) = Λ(e)U(e). The estimators need to be recomputed according to Corollary 6.4.2.

3. Design the optimal bit loading scheme as (6.17).

For the case when the optimal orthonormal GTD filter banks are designed as GMD filter banks, the diagonal matrixD(e)can be written asD(e) = Mp

detSxx(e)I. Therefore, for alliwe have[D(e)]ii = Mp

detSxx(e). This makes the optimal bit loading formula (6.4) correspond to uniform bit loading, which does not have the granularity problem if the average bit budget is an integer!

Theorem 6.4.3 not only shows the lower bound (6.22) can be achieved but also provides more insight to this problem. The optimal solution to (6.8) must have φ equal to (6.22), and thus it must satisfy all the equalities from (6.19) to (6.22). These conditions are necessary conditions of the

optimal systems, and will be examined in the following. First, in (6.20) for the H ¨older’s inequality

M−1

Y

i=0

Z 0

[L−1e ESxxEL−1e ]iidω 2π ≥

Z 0

M−1

Y

i=0

[L−1e ESxxEL−1e ]iidω 2π

!M

to have equality, we require the spectrum equalizing property (6.13) to be satisfied. This gives us another necessary condition for the optimal biorthogonal GTD filter banks:

Corollary 6.4.4 Spectrum equalization of Subband Signals is Necessary for Optimal Biorthogonal GTD Filter Banks: The subband signals of the optimal biorthogonal GTD filter banks have the spectrum equalizing

property (6.13). ♦

Second, in (6.21) for the Hadamard inequality

M−1

Y

i=0

[RR]ii ≥det(RR)

to have equality,RRneeds to be diagonal. This shows that the matrixR(e)must be decompos- able as a paraunitary matrix multiplying with a diagonal matrix. Therefore, the optimal solution to (6.7) will definitely have the structure as in Fig. 6.2. This is stated as the following corollary:

Corollary 6.4.5 (Optimal Biorthogonal GTD FBs Necessarily Has a Decomposable Structure) The optimal biorthogonal GTD filter banks have the structure as shown in Fig. 6.2. ♦ To summarize from Theorem 6.4.1, Corollary 6.4.4, and Corollary 6.4.5, the optimal biorthogonal GTD filter bank also hastotal decorrelationandspectrum equalizationas the two necessary conditions.

Furthermore, the optimal systems can be decomposed as the structure in Fig. 6.2. Surprisingly, all of these results have a parallel fashion to the theory of traditional biorthogonal filter banks devel- oped in [109] and [70]: Theorem 2.3 in [70] suggesting that total decorrelation is necessary, Lemma 2.6 in [70] suggesting that spectral majorization is necessary, and Theorem 2.7 in [70] suggesting that the optimal biorthogonal filter banks have the decomposable structure as shown in Fig. 3 of [70].