
the size of the tree may be as important as the overall number of multiplication and addition operations. Subplot 3 of Figure 2.7 compares the total number of points kept in the tree by the SD and SDsdp algorithms. As expected, the SDsdp algorithm keeps significantly fewer points in the tree than the SD algorithm.

Finally, subplot 4 of Figure 2.7 compares the bit error rate (BER) performance of the exact ML detector (the SDsdp algorithm) and the approximate MMSE nulling-and-cancelling heuristic with optimal ordering. Over the range of SNRs considered here, the ML detector significantly outperforms the MMSE detector, thereby justifying our efforts in constructing more efficient ML algorithms.

Remark: Recall that the lower bound introduced in this section is valid only if the original problem is binary, i.e., $D=\{-\frac{1}{2},\frac{1}{2}\}^{k-1}$. A generalization to the case $D=\{-\frac{3}{2},-\frac{1}{2},\frac{1}{2},\frac{3}{2}\}^{k-1}$ can be found in [106]. It is not difficult to generalize it to any $D=\{-\frac{L-1}{2},-\frac{L-3}{2},\ldots,\frac{L-3}{2},\frac{L-1}{2}\}^{k-1}$ by noting that any $(k-1)$-dimensional vector whose elements are numbers from $\{-\frac{L-1}{2},-\frac{L-3}{2},\ldots,\frac{L-3}{2},\frac{L-1}{2}\}$ can be represented as a linear transformation of a $(k-1)(L-1)$-dimensional vector from $D=\{-\frac{1}{2},\frac{1}{2}\}^{(k-1)(L-1)}$. (The interested reader can find more on this in [69].) However, this significantly increases the dimension of the SDP problem in (2.14), which may cause our algorithm to be inefficient. Motivated by this, in the following section we consider a different framework, based on $H^\infty$ estimation theory, which will (as we will see in Section 2.8) produce as a special case a general lower bound applicable for any $D$.
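To make the linear-transformation trick concrete, the following is a minimal numeric sketch (ours, not taken from the references) for $L = 4$: it checks that every 4-PAM vector is the image, under a fixed summing matrix, of some binary vector with entries in $\{-\frac{1}{2},\frac{1}{2}\}$. The matrix `M` and the dimensions are purely illustrative.

```python
# Minimal sketch of the linear-transformation trick for L = 4: each element of
# {-3/2, -1/2, 1/2, 3/2} is a sum of L - 1 = 3 values from {-1/2, 1/2}, so a
# (k-1)-dimensional 4-PAM vector equals M @ x for some binary x of dim 3(k-1).
import itertools
import numpy as np

L, k = 4, 3                                 # constellation size; k - 1 = 2 symbols
n = (k - 1) * (L - 1)                       # dimension of the equivalent binary vector
M = np.kron(np.eye(k - 1), np.ones(L - 1))  # sums each group of L - 1 binary entries

reachable = {tuple(M @ np.array(x))
             for x in itertools.product([-0.5, 0.5], repeat=n)}
pam = [-1.5, -0.5, 0.5, 1.5]
assert reachable == {(u, v) for u in pam for v in pam}
print(f"all {len(reachable)} 4-PAM pairs reached from dim-{n} binary vectors")
```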

[Figure 2.7 appears here. It has four subplots, all for $m = 50$ and $D=\{-\frac{1}{2},\frac{1}{2}\}^{50}$: (1) flop count versus SNR [dB] for the SDsdp, SD, and SDsdp-sdp algorithms; (2) the distribution of points per level of the tree at SNR = 4 dB for SDsdp and SD; (3) the total number of points versus SNR [dB] for SD and SDsdp; and (4) BER performance versus SNR [dB] for the ML and nulling-and-cancelling MMSE detectors.]

Figure 2.7: Computational complexity of the SD and SDsdp algorithms, $m = 50$, $D=\{-\frac{1}{2},\frac{1}{2}\}^{50}$

To simplify the notation, we rewrite (2.11) as

$$\min_{a\in D\subset \mathbb{Z}^{k-1}} \|b-La\|^2, \qquad (2.19)$$

where we introduced $a = s_{1:k-1}$, $b = z_{1:k-1}$, and $L = R_{1:k-1,1:k-1}$.
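For reference, (2.19) can be evaluated by brute force (at exponential cost in $k$); a sketch like the following, with illustrative names, is useful for sanity-checking the lower bounds derived below on small problems.

```python
# Brute-force reference for (2.19) over D = alphabet^(k-1); exponential cost,
# intended only for validating lower bounds on small instances.
import itertools
import numpy as np

def ils_min(L, b, alphabet=(-0.5, 0.5)):
    """Return min over a in alphabet^n of ||b - L a||^2 by exhaustive search."""
    n = L.shape[1]
    return min(np.sum((b - L @ np.array(a)) ** 2)
               for a in itertools.product(alphabet, repeat=n))
```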

Consider an estimation problem where $a$ and $b - La$ are unknown vectors, $b$ is the observation, and the quantities we want to estimate are $a$ and $b$. In the $H^\infty$ framework, the goal is to construct estimators $\hat a = f_1(b)$ and $\hat b = f_2(b)$, such that for some given $\gamma$, some $\beta \ge 0$, and some diagonal matrix $D > 0$, we have

$$\frac{\beta\|a-\hat a\|^2 + \|b-\hat b\|^2}{a^*Da + \|b-La\|^2} \le \gamma^2 \qquad (2.20)$$

for all $a$ and $b$ (see, e.g., [48]).

Obtaining a desired lower bound from (2.20) is now straightforward. Note that for all $a$ and $b$ we can write

$$\|b-La\|^2 \ge \gamma^{-2}\left(\beta\|a-\hat a\|^2 + \|b-\hat b\|^2\right) - a^*Da, \qquad (2.21)$$

and, in particular,

$$\min_{a\in D}\|b-La\|^2 \ge \min_{a\in D}\left(\gamma^{-2}\beta\|a-\hat a\|^2 - a^*Da\right) + \gamma^{-2}\|b-\hat b\|^2. \qquad (2.22)$$

Note that the minimization on the right-hand side (RHS) of (2.22) is straightforward since it can be done componentwise (which is why we chose $D > 0$ diagonal). Thus, for any $H^\infty$ estimators $\hat a = f_1(b)$ and $\hat b = f_2(b)$, (2.22) provides a readily computable lower bound. The issue, of course, is how to obtain the best $\hat a$ and $\hat b$ (and $D$ and $\gamma$). To this end, let us assume that the estimators are linear, i.e., $\hat a = K_1 b$ and $\hat b = K_2 b$ for some matrices $K_1$ and $K_2$ (see Figure 2.8).
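To illustrate why the diagonality of $D$ matters, here is a short sketch (with hypothetical function and variable names, not the text's code) of evaluating the RHS of (2.22) for the binary alphabet: both $\|a-\hat a\|^2$ and $a^*Da$ decouple over coordinates, so each entry of $a$ is optimized independently.

```python
# Componentwise evaluation of the RHS of (2.22) for D = {-1/2, 1/2}^n, assuming
# a diagonal weight matrix with diagonal `d`; a_hat and b_hat are any estimates.
import numpy as np

def rhs_bound(a_hat, b_hat, b, d, beta, gamma, alphabet=(-0.5, 0.5)):
    """min_a (gamma^-2 beta ||a - a_hat||^2 - a*Da) + gamma^-2 ||b - b_hat||^2."""
    per_coord = sum(min(beta * (s - ah) ** 2 / gamma**2 - di * s**2
                        for s in alphabet)
                    for ah, di in zip(a_hat, d))
    return per_coord + np.sum((b - b_hat) ** 2) / gamma**2
```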

[Figure 2.8 appears here: a block diagram in which $a$ passes through $L$, the disturbance $b-La$ is added to form the observation $b$, and $b$ is fed to the estimators $K_1$ and $K_2$, producing $\hat a$ and $\hat b$.]

Figure 2.8: An $H^\infty$ estimation analogy used in deriving a lower bound on the integer least-squares problem.

Introducing $c = \begin{bmatrix} D^{1/2}a \\ b-La \end{bmatrix}$ and noting that

$$T = \begin{bmatrix} \sqrt{\beta}\,D^{-1/2} & 0 \\ LD^{-1/2} & I \end{bmatrix} - \begin{bmatrix} \sqrt{\beta}\,K_1 \\ K_2 \end{bmatrix}\begin{bmatrix} LD^{-1/2} & I \end{bmatrix} = \begin{bmatrix} \sqrt{\beta}\,(I-K_1L)D^{-1/2} & -\sqrt{\beta}\,K_1 \\ (I-K_2)LD^{-1/2} & I-K_2 \end{bmatrix}$$

maps $c$ to $\begin{bmatrix} \sqrt{\beta}\,(a-\hat a) \\ b-\hat b \end{bmatrix}$, from (2.21) we see that for all $c$ it must hold that

$$c^*T^*Tc \le \gamma^2 c^*c$$

(see [48]). Since $T$ is square, this implies either of the equivalent inequalities

$$T^*T \le \gamma^2 I \quad \text{or} \quad TT^* \le \gamma^2 I. \qquad (2.23)$$

The tighter the bound in (2.23), the tighter the bound in (2.22). In other words, the closer $\gamma^{-1}T$ is to a unitary matrix, the tighter (2.22) becomes. Hence we attempt to choose $K_1$ and $K_2$ to make $\gamma^{-2}TT^*$ as close to the identity as possible.

To this end, post-multiply $T$ by the unitary matrix

$$\Phi = \begin{bmatrix} \nabla^{-1} & D^{-1/2}L^*\Delta^{-*} \\ -LD^{-1/2}\nabla^{-1} & \Delta^{-*} \end{bmatrix},$$

where $\nabla$ and $\Delta$ are found via the factorizations

$$D^{-1/2}L^*LD^{-1/2} + I = \nabla\nabla^* \quad \text{and} \quad LD^{-1}L^* + I = \Delta\Delta^*, \qquad (2.24)$$

to obtain

$$T\Phi = \begin{bmatrix} A & B \\ 0 & C \end{bmatrix}, \qquad (2.25)$$

where

$$A = \sqrt{\beta}\,D^{-1/2}\nabla^{-1}, \quad B = \sqrt{\beta}\,\left(D^{-1}L^*\Delta^{-*} - K_1\Delta\right), \quad \text{and} \quad C = (I-K_2)\Delta. \qquad (2.26)$$

Thus $TT^* \le \gamma^2 I$ implies

$$\begin{bmatrix} AA^* + BB^* & BC^* \\ CB^* & CC^* \end{bmatrix} \le \gamma^2 I. \qquad (2.27)$$

Note that we have many degrees of freedom when choosing $K_1$ and $K_2$, and wish to make judicious choices. So, to simplify things, let us choose $K_2$ such that $CC^* = \gamma_1^2 I$ for some $0 \le \gamma_1 \le \gamma$. (Clearly, this can always be done, since from (2.24) we have that $\Delta$ is invertible, and the simple choice $K_2 = I - \gamma_1\Delta^{-1}$ will do the job.) To make half the eigenvalues of $\gamma^{-2}TT^*$ unity, we set the Schur complement of the (2,2) entry of (2.27) to zero, i.e.,

$$AA^* + BB^* - \gamma^2 I - BC^*\left(CC^* - \gamma^2 I\right)^{-1}CB^* = 0. \qquad (2.28)$$

Using $C^*C = CC^* = \gamma_1^2 I$, it easily follows that

$$BB^* = \left(1 - \frac{\gamma_1^2}{\gamma^2}\right)\left(\gamma^2 I - AA^*\right). \qquad (2.29)$$

Using the definitions of $A$ and $B$ from (2.26), we obtain

$$\sqrt{\beta}\,K_1 = \sqrt{\beta}\,D^{-1}L^*\left(LD^{-1}L^* + I\right)^{-1} - B\Delta^{-1}. \qquad (2.30)$$

From the (1,1) entry of (2.27) it follows that

$$\gamma^2 I - \left(AA^* + BB^*\right) \ge 0,$$

which is the only constraint on $\gamma$. Combining this constraint with the definition of $A$ from (2.26), the definition of $\nabla$ from (2.24), and the expression for $BB^*$ from (2.29), we obtain

$$\gamma^2 \ge \frac{\beta}{\lambda_{\min}(D + L^*L)}.$$
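The construction above can be checked numerically. The following is our sketch (not the text's code) for the real-valued case with $\beta = 1$ and $\gamma_1 = \gamma$ (so $B = 0$), with $\gamma^2$ at its smallest admissible value: it builds $K_1$ via (2.30) and $K_2 = I - \gamma_1\Delta^{-1}$, then verifies the $H^\infty$ inequality (2.20) on random inputs.

```python
# Numeric sanity check of the construction: real data, beta = 1, gamma_1 = gamma
# (hence B = 0), gamma^2 = beta / lambda_min(D + L^T L). K1 follows (2.30) via
# the push-through identity; K2 = I - gamma_1 * Delta^{-1}; (2.20) is verified.
import numpy as np

rng = np.random.default_rng(0)
n = 6
L = rng.standard_normal((n, n))
d = rng.uniform(0.5, 2.0, size=n)                      # diagonal of D > 0
D = np.diag(d)

gamma2 = 1.0 / np.linalg.eigvalsh(D + L.T @ L).min()   # smallest admissible gamma^2
Delta = np.linalg.cholesky(L @ np.diag(1 / d) @ L.T + np.eye(n))
K1 = np.linalg.solve(D + L.T @ L, L.T)                 # (2.30) with B = 0
K2 = np.eye(n) - np.sqrt(gamma2) * np.linalg.inv(Delta)

for _ in range(1000):
    a, b = rng.standard_normal(n), rng.standard_normal(n)
    lhs = np.sum((a - K1 @ b) ** 2) + np.sum((b - K2 @ b) ** 2)
    rhs = gamma2 * (a @ D @ a + np.sum((b - L @ a) ** 2))
    assert lhs <= rhs * (1 + 1e-9)
print("(2.20) holds on all random samples")
```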

We summarize the results of this section in the following theorem:

Theorem 2.1. Consider the integer least-squares problem (2.19). Then for any $\gamma^2 \ge \frac{\beta}{\lambda_{\min}(D + L^*L)}$, any $0 \le \gamma_1 \le \gamma$, and any matrices $D \ge 0$, $B$, and $\Delta$ satisfying $\Delta\Delta^* = I + LD^{-1}L^*$ and $BB^* = \left(1 - \frac{\gamma_1^2}{\gamma^2}\right)\left(\gamma^2 I - \beta(D + L^*L)^{-1}\right)$,

$$\min_{a\in D}\|b-La\|^2 \ge \min_{a\in D}\left(\gamma^{-2}\left\|\sqrt{\beta}\,a - \sqrt{\beta}\,D^{-1}L^*\left(LD^{-1}L^*+I\right)^{-1}b + B\Delta^{-1}b\right\|^2 - a^*Da\right) + \frac{\gamma_1^2}{\gamma^2}\left\|\Delta^{-1}b\right\|^2.$$

Proof. Follows from the previous discussion, noting that

$$\|b-\hat b\|^2 = \|(I-K_2)b\|^2 = \|C\Delta^{-1}b\|^2 = \gamma_1^2\|\Delta^{-1}b\|^2$$

and

$$AA^* = \beta\left(D + L^*L\right)^{-1}.$$

The next corollary directly follows from Theorem 2.1.

Corollary 2.1. Consider the setting of Theorem 2.1 and let $\beta = 1$. Then

$$\min_{a\in D}\|b-La\|^2 \ge \min_{a\in D}\left(\gamma^{-2}\left\|a - D^{-1}L^*\left(LD^{-1}L^*+I\right)^{-1}b + B\phi\right\|^2 - a^*Da\right) + \frac{\gamma_1^2}{\gamma^2}\|\phi\|^2, \qquad (2.31)$$

where $B$ is the unique symmetric square root of $\left(1 - \frac{\gamma_1^2}{\gamma^2}\right)\left(\gamma^2 I - (D + L^*L)^{-1}\right)$ and $\phi$ is any vector of squared length $b^*\left(I + LD^{-1}L^*\right)^{-1}b$.

It should be noted that we have several degrees of freedom in choosing the parameters $(\gamma_1, \gamma, D, \phi)$, and we can exploit them to tighten the bound in (2.31) as much as possible. Optimizing simultaneously over all these parameters appears to be rather difficult. However, we can simplify the problem and let $\gamma_1 \to \gamma$. This has two benefits: it maximizes the third term in (2.31), and it sets $B = 0$ so that we need not worry about the vector $\phi$. Finally, to maximize the first term, we take $\gamma$ at its smallest possible value, i.e., we set

$$\gamma^2 = \frac{1}{\lambda_{\min}(D + L^*L)}.$$

This leads to the following result:

Corollary 2.2. Consider the setting of Theorem 2.1 and let $\beta = 1$. Then

$$\min_{a\in D}\|b-La\|^2 \ge \min_{a\in D}\left(\lambda_{\min}(L^*L+D)\left\|a - (L^*L+D)^{-1}L^*b\right\|^2 - a^*Da\right) + b^*\left(I - L(L^*L+D)^{-1}L^*\right)b. \qquad (2.32)$$

Remark: We would like to note that the bound given in the previous corollary could also have been obtained in a faster way. Below we show a possible derivation, provided to us by an anonymous reviewer.

Let $D$ be a diagonal matrix such that $D \ge 0$. Then we have

$$\begin{aligned} \|b-La\|^2 &= a^*L^*La - 2b^*La + b^*b = a^*(L^*L+D)a - 2b^*La + b^*b - a^*Da \\ &= \left(a - (L^*L+D)^{-1}L^*b\right)^*(L^*L+D)\left(a - (L^*L+D)^{-1}L^*b\right) - b^*L(L^*L+D)^{-1}L^*b + b^*b - a^*Da \\ &\ge \lambda_{\min}(L^*L+D)\left\|a - (L^*L+D)^{-1}L^*b\right\|^2 - a^*Da + b^*\left(I - L(L^*L+D)^{-1}L^*\right)b. \end{aligned}$$

It is not difficult to see that this is precisely the bound given in Corollary 2.2. The interested reader can find more on this type of bound in [96] and [79].
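As a quick numeric illustration (a sketch under the same real-valued assumptions, with illustrative dimensions), the bound (2.32) can be evaluated componentwise and compared against the exhaustive minimum on a small instance:

```python
# Evaluate the eigenvalue bound (2.32) for D = {-1/2, 1/2}^n with a random L
# and a diagonal D >= 0, and compare it with the brute-force minimum of (2.19).
import itertools
import numpy as np

rng = np.random.default_rng(1)
n = 8
L = rng.standard_normal((n, n))
d = rng.uniform(0.0, 1.0, size=n)          # diagonal of D >= 0
b = rng.standard_normal(n)

G = L.T @ L + np.diag(d)
lam = np.linalg.eigvalsh(G).min()
a_hat = np.linalg.solve(G, L.T @ b)        # (L*L + D)^{-1} L* b
const = b @ b - b @ (L @ a_hat)            # b*(I - L (L*L + D)^{-1} L*) b
# The min over a separates per coordinate since D is diagonal.
bound = const + sum(min(lam * (s - ah) ** 2 - di * s ** 2 for s in (-0.5, 0.5))
                    for ah, di in zip(a_hat, d))

exact = min(np.sum((b - L @ np.array(a)) ** 2)
            for a in itertools.product([-0.5, 0.5], repeat=n))
assert bound <= exact + 1e-9
print(f"lower bound {bound:.4f} <= exact minimum {exact:.4f}")
```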

In the following sections we show how various choices of the free parameters in the general lower bound from Theorem 2.1 yield several interesting special cases of lower bounds. In particular, in Section 2.5 we show that the lower bound obtained by solving a related convex optimization problem, where the search space is relaxed from integers to a sphere, can be deduced as a special case of the lower bound from Theorem 2.1. Then, in Section 2.6, we show that the lower bound obtained by solving another convex optimization problem, where the search space is now relaxed from integers to a polytope, can also be deduced as a special case of the lower bound from Theorem 2.1. Finally, in Section 2.8, we use (2.32) to deduce the lower bound based on the minimum eigenvalue of $L^*L$.