On improved Choi–Goldfarb solution-containing ellipsoids
in linear programming
Igor S. Litvinchev
¹ Computing Center, Russian Academy of Sciences, Vavilov 40, 117967 Moscow, Russia
Received 1 May 1998; received in revised form 1 February 2000
Abstract
Ellipsoids that contain all optimal primal solutions, those that contain all optimal dual slack solutions, and primal–dual ellipsoids are derived. They are independent of the algorithm used and have a smaller size than the Choi–Goldfarb ellipsoids [J. Optim. Theory Appl. 80 (1994) 161–173]. © 2000 Elsevier Science B.V. All rights reserved.
Keywords: Linear programming; Interior point methods; Containing ellipsoids
1. Introduction
Unlike the simplex method for linear programming, which moves from one basic feasible solution to another, interior point algorithms create a sequence of interior points converging to an optimal solution. To identify optimal basic and non-basic variables during the course of the algorithm, several criteria have been proposed, associated with the particular interior point techniques used. These criteria were obtained by constructing ellipsoids which contain all optimal primal solutions and/or ellipsoids which contain all optimal dual slack solutions, and are all algorithm dependent (see e.g. [2,3,5,7–9]). Choi and Goldfarb [1] further developed this approach, deriving solution-containing ellipsoids which are independent of the interior point algorithm used, but larger than the algorithm-dependent ellipsoids. Concerning solution-containing ellipsoids we also note that the first two-sided ellipsoidal approximation of polyhedra based on primal barrier functions is due to Sonnevend [4].
In this paper we propose an approach to diminish the size of the solution-containing ellipsoids derived in [1]. To obtain a tighter estimate of the ellipsoid size, a certain localization of the optimal solution set is used. It is shown that a proper choice of the localization results in a smaller containing ellipsoid. A similar approach is used to derive the primal–dual containing ellipsoids.

E-mail address: [email protected] (I.S. Litvinchev).
¹ The research was supported in part by grants from FAPESP, Brazil (No. 1997/7153-2) and RFBR, Russia (No. 99-01-01071).
The paper is organized as follows. In Section 2, some basic notation and results associated with the Choi–Goldfarb ellipsoids are stated. In Section 3, the improved primal ellipsoids are constructed. In Section 4, the primal–dual ellipsoids are considered, and a new criterion for identifying the optimal basis is stated and compared with previous results.
2. Definitions and notation
Consider the primal–dual pair of linear programs:
min{cᵗx | Ax = b, x ≥ 0}, (1)

max{bᵗy | Aᵗy + z = c, z ≥ 0}, (2)
where A ∈ Rᵐˣⁿ and b, c, x, y and z are suitably dimensioned. Let x* and (z*, y*) be optimal solutions to (1) and (2), respectively. It is assumed that the optimal solution sets for both (1) and (2) are bounded or, equivalently, that strictly feasible primal and dual solutions exist, and that rank(A) = m. The upper case letters X and Z are used to denote diagonal matrices whose elements are the elements of the vectors x and z, and ‖·‖ denotes the Euclidean norm.
It was established in [1] that if x̄ and (z̄, ȳ) are primal and dual feasible solutions, then the ellipsoid

Ep = {x ∈ Rⁿ | ‖Z̄(x − x̄)‖ ≤ γ},

where

γ² = ‖Z̄x̄‖² + μ², μ = x̄ᵗz̄,

contains all optimal primal solutions x*. Here the “ellipsoid” is understood in a general sense, allowing also unbounded sets.
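As a quick numerical sanity check (a sketch of ours, not part of the paper: the 3-variable LP, the feasible pair (x̄, z̄, ȳ) and all variable names below are illustrative assumptions), one can compute γ and verify that an optimal solution indeed lies in Ep:

```python
import numpy as np

# Toy LP (illustrative data only): min c^t x  s.t.  Ax = b, x >= 0.
# Its optimal solution is x* = (1, 0, 0).
A = np.array([[1.0, 1.0, 1.0]])
b = np.array([1.0])
c = np.array([1.0, 2.0, 3.0])

x_bar = np.array([1/3, 1/3, 1/3])      # strictly feasible primal point
z_bar = c - A.T @ np.array([0.5])      # dual slack for y_bar = 0.5; z_bar > 0

mu = x_bar @ z_bar                     # duality gap mu = x_bar^t z_bar
gamma2 = np.linalg.norm(z_bar * x_bar) ** 2 + mu ** 2   # gamma^2 = ||Z x||^2 + mu^2

x_opt = np.array([1.0, 0.0, 0.0])
dist2 = np.linalg.norm(z_bar * (x_opt - x_bar)) ** 2    # ||Z(x* - x_bar)||^2
print(dist2 <= gamma2)                 # x* is contained in E_p
```

Since Z̄ is diagonal, the products `z_bar * x_bar` compute Z̄x̄ componentwise.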
Using this ellipsoid, the lower bound x⁻ᵢ(γ) for x*ᵢ was derived by solving the problem

min{xᵢ | ‖Z̄(x − x̄)‖ ≤ γ, Ax = b, cᵗx ≤ cᵗx̄}.

If z̄ > 0 (which will be assumed valid throughout the paper), then the optimal solution x⁻ᵢ(γ) of this problem takes the following form. Let eᵢ denote the ith unit vector and

R(Z̄) = Z̄⁻¹[I − Z̄⁻¹Aᵗ(AZ̄⁻²Aᵗ)⁻¹AZ̄⁻¹]Z̄⁻¹.

Then

x⁻ᵢ(γ) = x̄ᵢ − γ√(eᵢᵗR(Z̄)eᵢ − (cᵗR(Z̄)eᵢ)²/cᵗR(Z̄)c) for cᵗR(Z̄)eᵢ < 0

and

x⁻ᵢ(γ) = x̄ᵢ − γ√(eᵢᵗR(Z̄)eᵢ) for cᵗR(Z̄)eᵢ ≥ 0.
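The closed-form bound can be sketched in code (a toy LP of our own choosing, not data from the paper). The snippet builds R(Z̄) and evaluates x⁻ᵢ(γ) for each i, checking that it really bounds each optimal variable from below:

```python
import numpy as np

# Illustrative LP: min c^t x, Ax = b, x >= 0, with optimal x* = (1, 0, 0)
A = np.array([[1.0, 1.0, 1.0]])
c = np.array([1.0, 2.0, 3.0])
x_bar = np.array([1/3, 1/3, 1/3])
z_bar = c - A.T @ np.array([0.5])
mu = x_bar @ z_bar
gamma = np.sqrt(np.linalg.norm(z_bar * x_bar) ** 2 + mu ** 2)

# R(Z) = Z^-1 [I - Z^-1 A^t (A Z^-2 A^t)^-1 A Z^-1] Z^-1
Zi = np.diag(1.0 / z_bar)
M = np.linalg.inv(A @ Zi @ Zi @ A.T)
R = Zi @ (np.eye(3) - Zi @ A.T @ M @ A @ Zi) @ Zi

def x_lower(i):
    """Lower bound x_i^-(gamma) on the i-th optimal primal variable."""
    e_i = np.eye(3)[i]
    cRe = c @ R @ e_i
    rad = e_i @ R @ e_i
    if cRe < 0:                         # sharper case of the closed form
        rad -= cRe ** 2 / (c @ R @ c)
    return x_bar[i] - gamma * np.sqrt(rad)

x_opt = np.array([1.0, 0.0, 0.0])
bounds = np.array([x_lower(i) for i in range(3)])
print(np.all(bounds <= x_opt + 1e-9))  # valid lower bounds
```

The radicand stays nonnegative because R(Z̄) is positive semidefinite, so the Cauchy–Schwarz inequality bounds (cᵗReᵢ)² by (eᵢᵗReᵢ)(cᵗRc).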
Respectively, the following criterion for identifying optimal basic variables was obtained: if x⁻ᵢ(γ) > 0, then the ith primal variable is an optimal basic variable, i.e., x*ᵢ > 0 and z*ᵢ = 0.

We see that the smaller the size of the ellipsoid Ep, the larger the value of x⁻ᵢ(γ). Below we consider how this size can be reduced.
3. Improved primal solution-containing ellipsoid
Denote by P* the optimal solution set of (1) and let W be a closed bounded subset of Rⁿ such that W ⊇ P*. We will refer to such a W as a primal localization. Let S ∈ Rⁿˣⁿ be a diagonal matrix having positive diagonal entries sᵢ. For any x ∈ P* we have
‖SZ̄(x − x̄)‖² = ‖SZ̄x‖² + ‖SZ̄x̄‖² − 2x̄ᵗ(SZ̄)²x
  ≤ ‖SZ̄x̄‖² + max_{w∈W} {‖SZ̄w‖² − 2x̄ᵗ(SZ̄)²w} ≡ γ²(S, W), (3)

where the inequality follows from the condition x ∈ P* ⊆ W.
Then the ellipsoid
Ep(S, W) = {x ∈ Rⁿ | ‖SZ̄(x − x̄)‖ ≤ γ(S, W)}
contains all optimal primal solutions.
Using the ellipsoid Ep(S, W), the lower bound x⁻ᵢ can be derived similarly as in Section 2. It is not hard to verify that we need only substitute

R(SZ̄) = (SZ̄)⁻¹[I − (SZ̄)⁻¹Aᵗ(A(SZ̄)⁻²Aᵗ)⁻¹A(SZ̄)⁻¹](SZ̄)⁻¹

for R(Z̄) and use γ(S, W) instead of γ.
The reason for using the scaled ellipsoid is as follows. Suppose, for example, that the pair (x̄, z̄) was produced by a primal–dual method [6]. Let S = (X̄Z̄)⁻¹ᐟ². For this case R(SZ̄) takes the form

R(D⁻¹) = D[I − DAᵗ(AD²Aᵗ)⁻¹AD]D,

where D = X̄¹ᐟ²Z̄⁻¹ᐟ². Note that the matrix AD²Aᵗ is computed and decomposed into its Cholesky factors in most of the primal–dual methods.
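A small consistency check (illustrative data and names of our own): with S = (X̄Z̄)⁻¹ᐟ², the general R(SZ̄) formula and the specialised form R(D⁻¹) = D[I − DAᵗ(AD²Aᵗ)⁻¹AD]D agree, and AD²Aᵗ can be handled through its Cholesky factor just as primal–dual codes do:

```python
import numpy as np

A = np.array([[1.0, 1.0, 1.0]])
c = np.array([1.0, 2.0, 3.0])
x_bar = np.array([1/3, 1/3, 1/3])
z_bar = c - A.T @ np.array([0.5])

d = np.sqrt(x_bar / z_bar)                     # diagonal of D = X^(1/2) Z^(-1/2)

# General formula with (SZ)^-1 = D, since S = (XZ)^(-1/2)
SZi = np.diag(d)
M = np.linalg.inv(A @ SZi @ SZi @ A.T)
R_gen = SZi @ (np.eye(3) - SZi @ A.T @ M @ A @ SZi) @ SZi

# Specialised form, solving with the Cholesky factor of A D^2 A^t
AD = A * d                                     # A D for diagonal D
L = np.linalg.cholesky(AD @ AD.T)
W = np.linalg.solve(L.T, np.linalg.solve(L, AD))   # (A D^2 A^t)^-1 A D
R_chol = np.diag(d) @ (np.eye(3) - AD.T @ W) @ np.diag(d)

print(np.allclose(R_gen, R_chol))
```

Reusing the already-factored AD²Aᵗ is the practical point: the bound computation then costs little beyond the interior-point iteration itself.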
To define a localization W we can use some constraints of the primal problem or a linear combination of them. Below we consider a localization which results in a closed-form expression for γ(S, W).
Proposition 1. Let the primal localization be defined by W₁ = {w ∈ Rⁿ | z̄ᵗw ≤ μ, w ≥ 0}. Then

γ²(S, W₁) = ‖SZ̄x̄‖² + maxᵢ {sᵢ²(μ² − 2μz̄ᵢx̄ᵢ)}.
Proof. For any x ∈ P* we have

z̄ᵗx = (c − Aᵗȳ)ᵗx = cᵗx − ȳᵗb ≤ cᵗx̄ − ȳᵗb = z̄ᵗx̄ = μ

and hence P* ⊆ W₁. Moreover, since z̄ > 0, the set W₁ is bounded, i.e., W₁ is a localization. To estimate γ(S, W₁) in (3) we should solve the problem

f* = max_{w∈W₁} {‖SZ̄w‖² − 2x̄ᵗ(SZ̄)²w}.

The objective function of this problem is strictly convex and hence the optimal solution is attained at a vertex of the non-degenerate simplex W₁. The non-zero component of the ith vertex is μ/z̄ᵢ and thus

f* = maxᵢ {sᵢ²(μ² − 2μz̄ᵢx̄ᵢ)}.
Note that if S = I and x̄ > 0, then

γ²(I, W₁) = ‖Z̄x̄‖² + μ² − 2μ minᵢ z̄ᵢx̄ᵢ < γ²

and hence Ep(I, W₁) ⊂ Ep.
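Continuing the toy example (our own illustrative data, not the paper's), the following checks both claims at once: γ²(I, W₁) is strictly smaller than the Choi–Goldfarb γ², and the shrunken ellipsoid still contains the optimal solution:

```python
import numpy as np

A = np.array([[1.0, 1.0, 1.0]])
c = np.array([1.0, 2.0, 3.0])
x_bar = np.array([1/3, 1/3, 1/3])
z_bar = c - A.T @ np.array([0.5])
mu = x_bar @ z_bar
prods = z_bar * x_bar                     # components z_i x_i

gamma2 = np.linalg.norm(prods) ** 2 + mu ** 2                  # original radius^2
gamma2_W1 = np.linalg.norm(prods) ** 2 + mu ** 2 - 2 * mu * np.min(prods)

x_opt = np.array([1.0, 0.0, 0.0])
dist2 = np.linalg.norm(z_bar * (x_opt - x_bar)) ** 2

print(gamma2_W1 < gamma2 and dist2 <= gamma2_W1)
```
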
The other choice of the scaling matrix S, associated with the primal–dual algorithms, is S = (X̄Z̄)⁻¹ᐟ². For this case, we have

γ²((X̄Z̄)⁻¹ᐟ², W₁) = μ²/minᵢ z̄ᵢx̄ᵢ − μ.
A similar approach can be used to derive a dual ellipsoid that contains all optimal dual slack solutions, i.e.

Ed(S, V) = {z ∈ Rⁿ | ‖SX̄(z − z̄)‖ ≤ γ(S, V)},

where

γ²(S, V) = ‖SX̄z̄‖² + max_{ω∈V} {‖SX̄ω‖² − 2z̄ᵗ(SX̄)²ω}.

A dual localization V is a closed bounded subset of Rⁿ such that z* ∈ V for any optimal dual slack solution z*. It is not hard to verify that if x̄ > 0, then V₁ = {ω ∈ Rⁿ | x̄ᵗω ≤ μ, ω ≥ 0} is a dual localization and γ(S, V₁) = γ(S, W₁). The ellipsoid Ed(I, V₁) is smaller than the dual containing ellipsoid Ed = {z ∈ Rⁿ | ‖X̄(z − z̄)‖ ≤ γ} considered in [1] and it can be used similarly to identify optimal non-basic primal variables.
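The dual side can be checked the same way (again on our own toy data): the optimal slack z* of the example lies in Ed(I, V₁), whose squared radius equals γ²(I, W₁) because the two formulas are symmetric in the products x̄ᵢz̄ᵢ:

```python
import numpy as np

A = np.array([[1.0, 1.0, 1.0]])
c = np.array([1.0, 2.0, 3.0])
x_bar = np.array([1/3, 1/3, 1/3])
z_bar = c - A.T @ np.array([0.5])
mu = x_bar @ z_bar
prods = x_bar * z_bar

# gamma^2(I, V1) = gamma^2(I, W1) by the x_i z_i symmetry noted in the text
gamma2_V1 = np.linalg.norm(prods) ** 2 + mu ** 2 - 2 * mu * np.min(prods)

z_opt = c - A.T @ np.array([1.0])       # optimal dual slack (y* = 1) of the toy LP
dist2 = np.linalg.norm(x_bar * (z_opt - z_bar)) ** 2
print(dist2 <= gamma2_V1)               # z* lies in E_d(I, V1)
```
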
4. Primal–dual solution-containing ellipsoid
Denote by F* ⊆ R²ⁿ the set of all optimal primal–dual pairs (x*, z*) and let Ω be a closed bounded subset of R²ⁿ such that Ω ⊇ F*. We will refer to such an Ω as a primal–dual localization. In this section we assume (x̄, z̄) > 0.

For any (x, z) ∈ F* we have, similar to (3),

‖SZ̄(x − x̄)‖² + ‖SX̄(z − z̄)‖² ≤ 2‖SZ̄x̄‖² + max_{(w,ω)∈Ω} {‖SZ̄w‖² + ‖SX̄ω‖² − 2x̄ᵗ(SZ̄)²w − 2z̄ᵗ(SX̄)²ω} ≡ γ²(S, Ω)

and hence the ellipsoid

Epd(S, Ω) = {(x, z) ∈ R²ⁿ | ‖SZ̄(x − x̄)‖² + ‖SX̄(z − z̄)‖² ≤ γ²(S, Ω)}

contains all optimal primal–dual pairs (x*, z*).
Proposition 2. Let the primal–dual localization be defined by Ω₁ = {(w, ω) ∈ R²ⁿ | z̄ᵗw + x̄ᵗω = μ, (w, ω) ≥ 0}. Then

γ²(S, Ω₁) = 2‖SZ̄x̄‖² + maxᵢ {sᵢ²(μ² − 2μz̄ᵢx̄ᵢ)}.

Proof. For any (x, z) ∈ F* we have z̄ᵗx + x̄ᵗz = μ, since (x − x̄)ᵗ(z − z̄) = 0 and xᵗz = 0 by the complementarity condition. Hence F* ⊆ Ω₁ and Ω₁ is a localization. Then the required expression for γ(S, Ω₁) is obtained similarly to the proof of Proposition 1.
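Both the identity z̄ᵗx* + x̄ᵗz* = μ used in the proof and the resulting containment can be verified on the toy example (illustrative data of our own, not from the paper):

```python
import numpy as np

A = np.array([[1.0, 1.0, 1.0]])
c = np.array([1.0, 2.0, 3.0])
x_bar = np.array([1/3, 1/3, 1/3])
z_bar = c - A.T @ np.array([0.5])
mu = x_bar @ z_bar
prods = x_bar * z_bar

# Proposition 2, S = I: gamma^2(I, Omega1) = 2||Z x||^2 + mu^2 - 2 mu min_i z_i x_i
gamma2_pd = 2 * np.linalg.norm(prods) ** 2 + mu ** 2 - 2 * mu * np.min(prods)

x_opt = np.array([1.0, 0.0, 0.0])
z_opt = c - A.T @ np.array([1.0])       # optimal primal-dual pair of the toy LP

gap = z_bar @ x_opt + x_bar @ z_opt     # should equal mu on the localization
dist2 = (np.linalg.norm(z_bar * (x_opt - x_bar)) ** 2
         + np.linalg.norm(x_bar * (z_opt - z_bar)) ** 2)
print(abs(gap - mu) < 1e-12 and dist2 <= gamma2_pd)
```
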
Now we use Epd to identify, if possible, the optimal basis. Consider the problem

Ax = b, Aᵗy + z = c, (5)

Here q is such that the objective of this problem coincides with xᵢ up to a positive scale factor. In [9] a problem similar to (4)–(6) was discussed for q = Deᵢ.
Proposition 3. The optimal solution x⁻ of problem (4)–(9) is such that

(a) if eᵗqᵥ > 0 and ‖qᵥ‖ < δ(eᵗqᵥ), then …
Proof. The KKT optimality conditions for (4)–(9) give

q + Aᵗu + 2λ(SZ̄)²(x − x̄) − (β − α)z̄ = 0, (10)

v + 2λ(SX̄)²(z − z̄) − (β − α)x̄ = 0, (11)

Av = 0, λ ≥ 0, α ≥ 0, β ≥ 0,

where λ, β and α are the Lagrange multipliers for the restrictions (7)–(9). Multiplying (10) by A(SZ̄)⁻² and utilizing the condition A(x − x̄) = 0 we get

u = [A(SZ̄)⁻²Aᵗ]⁻¹A(SZ̄)⁻¹((β − α)e − q).

Multiplying (10) by (SZ̄)⁻² and substituting the expression for u we obtain

2λ SZ̄(x − x̄) = (β − α)eᵥ − qᵥ

and hence

z − z̄ = −Aᵗ(y − ȳ) = −((β − α)/(2λ))(SX̄)⁻¹(e − eᵤ).
By (7) we have x̄ᵗ(z − z̄) + z̄ᵗ(x − x̄) = −μ. Substituting x − x̄ and z − z̄ in this expression we obtain the equation for λ which gives

2λμ = eᵗqᵥ − (β − α)‖eᵥ‖² + (β − α)(‖e‖² − ‖eᵤ‖²).
If β > 0, then by the complementarity condition for (8) we have x̄ᵗz = μ. Hence by (7) it follows that 0 = z̄ᵗx < μ and α = 0 by the complementarity condition for (9). Similarly, if α > 0, then β = 0. We can write both these conditions in the form αβ = 0.

Remembering that μ = x̄ᵗz̄, the complementarity conditions for (8) and (9) take the form βx̄ᵗ(z − z̄) = 0 and αz̄ᵗ(x − x̄) = 0. Substituting the expressions for x − x̄ and z − z̄ we obtain

β(β − α)(‖e‖² − ‖eᵤ‖²) = 0,

α[(β − α)‖eᵥ‖² − eᵗqᵥ] = 0.
Utilizing the expression for λ and the condition αβ = 0 we get

βd²(eᵗqᵥ − β‖eᵥ‖² − 2λμ) = 0, (12)

α[−eᵗqᵥ(‖e‖² − ‖eᵤ‖²)/‖eᵥ‖² − d²(‖e‖² − ‖eᵤ‖²) − 2d²μ] = 0. (13)

If eᵗqᵥ ≤ 0, then the term in the brackets in (12) is strictly negative and hence β = 0. By (13) we have that if eᵗqᵥ > 0 then α = 0.
Since the quadratic constraint (6) must be active, then

δ² = (1/(2λ))²‖(β − α)eᵥ − qᵥ‖² + ((β − α)/(2λ))²‖e − eᵤ‖².

Substituting the expression for λ we obtain after the algebraic manipulations:

δ² = (β − α)²d² + (1/(2λ))²(‖qᵥ‖² − d²(eᵗqᵥ)²) − (1/(2λ))²d²{(eᵗqᵥ)(β − α)(‖e‖² − ‖eᵤ‖²) + (β − α)²[‖eᵥ‖² + (‖e‖² − ‖eᵤ‖²)]}. (14)
Now we are in a position to derive the main results of the proposition.

Suppose that eᵗqᵥ > 0 and ‖qᵥ‖ < δ(eᵗqᵥ). By (13) it follows that α = 0. Let β > 0. Then by (12) we have

β = (eᵗqᵥ − 2λμ)/‖eᵥ‖². (15)

Substituting this in (14) with α = 0 we obtain

(1/(2λ))² = (δ²‖eᵥ‖² − μ²)/(‖qᵥ‖²‖eᵥ‖² − (eᵗqᵥ)²).

Substituting λ and β in the expression for x − x̄, we obtain the bound which finally proves part (a) of the proposition.
Note that if eᵗqᵥ > 0 and ‖qᵥ‖ ≥ δ(eᵗqᵥ), then the case β > 0 is impossible and we have α = β = 0. Substituting this in (14) with α = β = 0 we obtain

λ² = (‖qᵥ‖² − d²(eᵗqᵥ)²)/(4δ²). (16)

Note that under our assumption the numerator of this expression is strictly positive. Moreover, if we substitute this in (16), then λ > 0, which proves part (b) of the proposition. Note that if eᵗqᵥ ≤ 0 and δ² > …, then α = β = 0, as required in part (c) of the proposition.
Since (x*, z*, y*) is feasible to (4)–(9) for δ = γ(S, Ω₁), we have the following criterion for identifying the optimal basis: if qᵗx⁻ > 0, then x*ᵢ > 0 and z*ᵢ = 0.

Note that for part (c) of Proposition 3 we have α = β = 0 and hence constraints (8) and (9) can be relaxed without changing the optimal value. For parts (a) and (b), either (8) or (9) is active and hence the bound qᵗx⁻ improves on the bound obtained without these constraints.
In [9] the primal–dual containing ellipsoid centered at the origin was considered for S = (X̄Z̄)⁻¹ᐟ² and (x̄, z̄, ȳ) ∈ N₂(θ), where θ ∈ [0, 1),

N₂(θ) = {(x, z, y) ∈ F⁰ | ‖XZe − (xᵗz/n)e‖ ≤ θxᵗz/n}

and F⁰ is the set of all strictly feasible primal–dual solutions.

For (x̄, z̄, ȳ) ∈ N₂(θ) we have z̄ᵢx̄ᵢ ≥ (1 − θ)μ/n, i = 1, …, n, and by Proposition 2

γ²((X̄Z̄)⁻¹ᐟ², Ω₁) = μ²/minᵢ z̄ᵢx̄ᵢ ≤ nμ/(1 − θ).
The lower bound qᵗx⁻_YT obtained in [9, Theorem 3] by the solution of a problem similar to (4)–(6) for q = D⁻¹eᵢ is as follows:

qᵗx⁻_YT = eᵢᵗPe − √((δ² − μ)(1 − pᵢᵢ)),

where P = DAᵗ(AD²Aᵗ)⁻¹AD and pᵢᵢ is the ith diagonal entry of P.
Since eᵤ = eᵥ = (I − P)e and d² = μ⁻¹ for S = (X̄Z̄)⁻¹ᐟ², part (c) of Proposition 3 gives

qᵗx⁻_c = eᵢᵗPe − √((δ² − μ)(1 − pᵢᵢ − (1/μ)[eᵢᵗ(I − P)e]²)) ≥ qᵗx⁻_YT.
Hence Proposition 3 results in a stronger criterion for identifying the optimal basis. It is interesting to compare the lower bound obtained in Proposition 3 with the bound x⁻ᵢ(γ) in Section 2. We have

γ²(I, Ω₁) = 2‖Z̄x̄‖² + μ² − 2μ minᵢ z̄ᵢx̄ᵢ = γ² + ‖Z̄x̄‖² − 2μ minᵢ z̄ᵢx̄ᵢ.

If (x̄, z̄, ȳ) ∈ N₂(θ), then (1 − θ)μ/n ≤ z̄ᵢx̄ᵢ ≤ (1 + θ)μ/n and

‖Z̄x̄‖² − 2μ minᵢ z̄ᵢx̄ᵢ ≤ μ²(θ² + 4θ − 1)/n.
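The two scalar facts behind this comparison are easy to check numerically (a throwaway sketch; n, μ and the grid are arbitrary choices of ours): θ² + 4θ − 1 stays negative on [0, √5 − 2), and on a perfectly centered point the correction term equals −μ²/n:

```python
import numpy as np

# theta^2 + 4*theta - 1 < 0 exactly on [0, sqrt(5) - 2)
thetas = np.linspace(0.0, np.sqrt(5.0) - 2.0 - 1e-9, 1000)
neg = np.all(thetas ** 2 + 4 * thetas - 1 < 0)

# On a centered point all z_i x_i = mu/n, so
# ||Z x||^2 - 2 mu min_i z_i x_i = mu^2/n - 2 mu^2/n = -mu^2/n < 0
n, mu = 4, 2.0
prods = np.full(n, mu / n)
corr = np.linalg.norm(prods) ** 2 - 2 * mu * np.min(prods)

print(neg and corr < 0)
```
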
For θ ∈ [0, √5 − 2) the right-hand side of the latter inequality is strictly negative and γ(I, Ω₁) < γ. Since R(Z̄)a = Z̄⁻¹(Z̄⁻¹a)ᵥ for any a ∈ Rⁿ and (Z̄⁻¹c)ᵥ = eᵥ by Aᵗȳ + z̄ = c, it is not hard to verify that x⁻ᵢ(γ) coincides with the optimal solution of the following problem:

x⁻ᵢ(γ) = min{xᵢ | ‖Z̄(x − x̄)‖ ≤ γ, Ax = b, z̄ᵗx ≤ μ}. (17)

Comparing (17) with the problem (4)–(9) for q = eᵢ, S = I, (x̄, z̄, ȳ) ∈ N₂(θ), θ ∈ [0, √5 − 2), δ = γ(I, Ω₁) < γ, we obtain x⁻ᵢ(γ) < eᵢᵗx⁻, which results in an improved criterion for identifying the optimal basis.
The ellipsoids considered in this paper are algorithm-independent, that is, they do not depend on how the primal–dual pair (x̄, z̄) was obtained. Meanwhile, the size of the ellipsoid depends on the proximity of (x̄, z̄) to the central path: the closer to the central path, the smaller the size. This is one more reason for maintaining (x̄, z̄) in a neighborhood of the central path during the course of an interior point algorithm.
Acknowledgements
The author gratefully acknowledges two anonymous referees for their helpful comments and remarks.
Appendix
Since our expression for x⁻ᵢ(γ) is slightly different from the one used in [1, Criterion (C4), p. 170], we give here the complete derivation for x⁻ᵢ(γ).
Proposition A1. The optimal solution x⁻ᵢ(γ) of the problem

min{eᵢᵗx | (x − x̄)ᵗZ̄²(x − x̄) ≤ γ², Ax = b, cᵗx ≤ cᵗx̄} (A.1)

is as follows:

x⁻ᵢ(γ) = x̄ᵢ − γ√(eᵢᵗR(Z̄)eᵢ − (cᵗR(Z̄)eᵢ)²/cᵗR(Z̄)c) for cᵗR(Z̄)eᵢ < 0

and

x⁻ᵢ(γ) = x̄ᵢ − γ√(eᵢᵗR(Z̄)eᵢ) for cᵗR(Z̄)eᵢ ≥ 0.
Proof. The Lagrangian associated with (A.1) is

L = eᵢᵗx + λ[(x − x̄)ᵗZ̄²(x − x̄) − γ²] + uᵗ(Ax − b) + σ(cᵗx − cᵗx̄).

The KKT condition ∂L/∂x = 0 gives

eᵢ + 2λZ̄²(x − x̄) + σc + Aᵗu = 0. (A.2)

Multiplying (A.2) by (x − x̄)ᵗ and utilizing the conditions λ[(x − x̄)ᵗZ̄²(x − x̄) − γ²] = 0, σ(cᵗx − cᵗx̄) = 0 and A(x − x̄) = 0, we obtain

λ = (x̄ᵢ − xᵢ)/(2γ²).

Multiplying (A.2) by AZ̄⁻² we get

u = −(AZ̄⁻²Aᵗ)⁻¹AZ̄⁻²(eᵢ + σc).

From (A.2) we have

2λ(x̄ − x) = Z̄⁻²(Aᵗu + eᵢ + σc) = R(Z̄)(eᵢ + σc). (A.3)

Multiplying (A.3) by cᵗ we obtain

0 ≤ 2λcᵗ(x̄ − x) = cᵗR(Z̄)eᵢ + σcᵗR(Z̄)c. (A.4)

If cᵗR(Z̄)eᵢ < 0, then it follows from (A.4) that σ > 0 and hence, by the complementarity condition, cᵗ(x̄ − x) = 0. From (A.4) we have σ = −cᵗR(Z̄)eᵢ/cᵗR(Z̄)c. Substituting σ and λ in (A.3) and multiplying by eᵢᵗ we obtain the first part of Proposition A1.
Let cᵗR(Z̄)eᵢ ≥ 0. Suppose that we have relaxed the restriction cᵗx ≤ cᵗx̄ in (A.1) or, just the same, put σ = 0 in the Lagrangian. Then, similar to (A.3), we obtain for the solution of the relaxed problem

2λ(x̄ − x) = R(Z̄)eᵢ

and since λ ≥ 0 and cᵗR(Z̄)eᵢ ≥ 0 we have cᵗx ≤ cᵗx̄. Since the relaxed solution is feasible to (A.1), it is optimal to (A.1). This immediately gives the required expression for x⁻ᵢ(γ).
Note that in [1, Criterion (C4), p. 170] there is a little flaw. The first expression for x⁻ᵢ(γ) in Proposition A1 was associated there with the case cᵗR(Z̄)eᵢ ≥ 0.
References

[1] I.C. Choi, D. Goldfarb, On solution-containing ellipsoids in linear programming, J. Optim. Theory Appl. 80 (1994) 161–173.
[2] J.L. Goffin, J.P. Vial, On the computation of weighted analytic centers and dual ellipsoids with the projective algorithm, Math. Programming 60 (1993) 81–92.
[3] M. Kojima, Determining basic variables of optimal solutions in Karmarkar's new LP algorithm, Algorithmica 1 (1986) 499–515.
[4] G. Sonnevend, An 'analytical centre' for polyhedrons and new classes of global algorithms for linear (smooth, convex) programming, in: A. Prekopa, J. Szelezsan, B. Strazicky (Eds.), Lecture Notes in Control and Information Sciences, Vol. 84, Springer, Berlin, 1986, pp. 866–875.
[5] M.J. Todd, Improved bounds and containing ellipsoids in Karmarkar's linear programming algorithm, Math. Oper. Res. 13 (1988) 650–659.
[6] S.J. Wright, Primal–Dual Interior-Point Methods, SIAM Publications, Philadelphia, PA, 1997.
[7] Y. Ye, Recovering the optimal basis in Karmarkar's polynomial algorithm for linear programming, Math. Oper. Res. 15 (1990) 564–572.
[8] Y. Ye, A "build-down" scheme for linear programming, Math. Programming 46 (1990) 61–72.
[9] Y. Ye, M.J. Todd, Containing and shrinking ellipsoids in the path-following algorithm, Math. Programming 47 (1990) 1–9.