Three or more players - Finding sparse -equilibria in low-rank games

3.6 Finding sparse -equilibria in low-rank games

3.6.2 Three or more players

2 we obtain

p⁰C2qˆ^T ≥ max

r (˜p⁰C2)r−2

≥ max

r ((¯pC1−δ1k)C2)_r−2 from (3.15)

≥ max

r (ˆpC1C2)_r−3

≥ pCqˆ ⁰^T −3 for any q⁰ (3.22)

Combining (3.21) and (3.22), ˆ

pCqˆ^T ≥pCqˆ ^0T −4

Similarly, ˆpRqˆ^T ≥p⁰Rˆq^T −4 for any p⁰.

We analyze the run-time for the algorithm above in terms of the support parameter kand number of strategiesn, andB. The first step of the algorithm where we “guess”

p⁰ requires exhaustively going through all possible choices for each component of

p⁰ in the interval [−B, B] in steps of δ, and similarly for guessing ˜q⁰. This takes time (2B/δ)^2k = (4B²k/)^2k. For each choice of (˜p⁰,q˜⁰), we will need to solve the linear program above which takes poly(|G|) time. The applications of Carath´eodory’s Theorem also take poly(|G|) and so the running time is as claimed and this completes the proof of the theorem.

it can be expressed as: T =Pp

i=1t⁽ⁱ⁾₁ ⊗t⁽ⁱ⁾₂ ⊗. . .⊗t⁽ⁱ⁾_l where each t⁽ⁱ⁾_j is an n×1vector (i.e. it is the sum of p pure tensors).

It will be convenient to aggregate the information in a p-decomposition of such a tensor T into a `-tuple (C_i1, C_i2, . . . , C_i`) where each C_ij is an n×p matrix whose columns are given by

t⁽¹⁾_ij , . . . , t^(p)_ij

. We will refer to ther-th row of a matrix C_ij as C_ij(r). We will also define to be the component-wise product of vectors:

Definition 3.6.4 For two vectors v, w∈Rⁿ, define (vw) to be the vector (v₁w₁, . . . , v_nw_n)∈Rⁿ.

The following inequality will be useful in obtaining a result for the extension to three or more players.

Lemma 3.6.5 Letx₁, x₂, . . . , x_`,xe₁,xe₂, . . . ,xe_` ∈R^k be vectors satisfying|x_i−xe_i| ≤δ and furthermore, let |xe_i| ≤B for all i= 1, . . . , `. Then,

hx₁,(xe₂. . .xe_`)i −δ`k(B +δ)^`−1 ≤ hx₁,(x₂x₃. . .x_`)i ≤ hx₁,(xe₂. . .xe_`)i+

δ`k(B+δ)^`−1

Proof.

hx₁,(x₂x₃. . .x_`)i ≥ hx₁,(xe₂−δ1_k)x₃. . .x_`i

≥ hx₁,xe₂x₃. . .x_`i −δ(B +δ)^`−2hx₁,1_ki

≥ hx₁,xe₂x₃. . .x_`i −δk(B+δ)^`−1 ...

≥ hx₁,xe₂. . .xe_`i −(`−1)δk(B+δ)^`−1

≥ hx₁,xe₂. . .xe_`i −δ`k(B +δ)^`−1

The other side of the inequality can be similarly shown.

We begin by generalizing Lipton et al. [LMM03] to the multiplayer case. The following theorem shows that low rank games in the multiplayer case have sparse Nash equilibria; in the next section we give nontrivial algorithms to find approximate equilibria with this sparsity.

Corollary 3.6.6 Let G = (T1, . . . , T`, n) be an `-player game, and suppose Ti has rank k_i. Then there exists a Nash equilibrium (p^∗₁, . . . , p^∗_`) with |supp(p^∗_i)| ≤ 1 + P`

j=1k_j for all i.

Proof. Let (q₁, . . . , q_`) be a Nash equilibrium for G and let (C_i,1, . . . , C_i,`) be the tensor decomposition for Ti. Givenq1, . . . , q`, we can define vectors in Rⁿ:

w₁ = (C_1,1( q₂C_1,2 q₃C_1,3 · · · q_`C_1,` )^T) w₂ = (C_2,2(q₁C_2,1 q₃C_2,3 · · · q_`C_2,` )^T) w₃ = (C_3,3(q₁C_3,1 q₂C_3,2 · · · q_`C_3,` )^T)

...

w_` = (C_`,`(q₁C_`,1 q₂C_`,2 · · · q`−1C`,`−1 )^T)

(3.23)

By the definition of a Nash equilibrium, the following conditions are satisfied:

supp(q₁) ⊆ {v : (w₁)_v = max

u (w₁)_u} supp(q₂) ⊆ {v : (w₂)_v = max

u (w₂)_u} ...

supp(q_`) ⊆ {v : (w_`)_v = max

u (w_`)_u}.

And indeed any (qb₁, . . . ,qb_`) satisfying these conditions (when the w_i are defined relative the theqb_i) is a Nash equilibrium.

Set s = P`

j=1k_j. Now, by Carath´eodory’s Theorem there exists qb₁ ∈ Rⁿ with supp(qb₁)⊆supp(q₁) and|supp(qb₁)| ≤s+ 1 for which:

qb₁(C_2,1|C_3,1|C_4,1| · · · |C_`,1) =q₁(C_2,1|C_3,1| · · · |C_`,1),

since the right-hand-side is a convex combination of vectors inR^s

0 fors⁰ ≤s. Similarly, there exists qb₂ ∈Rⁿ with supp(qb₂)⊆supp(q₂) and |supp(qb₂)| ≤s+ 1 for which:

qb₂(C_1,2|C_3,2|C_4,2| · · · |C_`,2) = q₂(C_1,2|C_3,2|C_4,2| · · · |C_`,2), since the right-hand-side is a convex combination of vectors in R^s

0 for s⁰ ≤s.

A symmetric argument gives us a sparse qb_i from each q_i. Moreover, these qb_i produce precisely the same vectors w1, w2, . . . , w` via (3.23). And, since supp(qbi) ⊆ supp(q_i) for all i, the strategies qb_i satisfy the above conditions for being a Nash equilibrium, assuming the original qi did.

We now turn to algorithms for finding sparse approximate equilibria, with three or more players. We first consider the case when the tensor decomposition is known.

This is not an unnatural assumption: in§3.6.3, we describe a class of games for which the tensor decomposition is naturally given by the description of the payoff functions.

Theorem 3.6.7 Let G = (T₁, . . . , T_`, n) be an `-player game, and suppose we are given a k-decomposition of T_i = (C_i1, . . . , C_i`) where each of the C_ij is an n ×k matrix with integer values in [−B, B] for i, j = 1, . . . , `. Then for every >0, there is a deterministic procedure P running in time

(2B)^`k`/^k(`−1)`

poly(|G|)

that returns a 4-Nash equilibrium (p₁, p₂, . . . , p_`) with |supp(p_i)| ≤1 +`k for all i.

Proof. As in the two player case, our first step is to “guess” a collection of vectors to within δ = _k`(2B) `−1 precision. We describe the “correct” guess relative to an (arbitrary) Nash equilibrium (p₁, . . . , p_`) forG.

1. Let (p⁰₁₁, . . . , p⁰_1`) = (p₁C₁₁, p₂C₁₂, . . . , p_`C_1`) be an`-tuple of 1×kvectors. Note that since all entries in (C11, C12, . . . , C1`) lie in [−B, B], entries in (p⁰₁₁, . . . , p⁰_1`) also lie in [−B, B].

2. Guess an (`−1)-tuple (pf⁰₁₂, . . . ,pf⁰_1`) such that

|(p⁰_1i)_j −(p]⁰_1i)_j| ≤δ i= 2, . . . , `;j = 1. . . , k

3. For C₁₁(r), ther-th row inC₁₁ let αr =

C11

pf⁰₁₂pf⁰₁₃. . .pf⁰_1`

; r= 1, . . . , n S₁ = n

r|max

t α_t−2≤α_r ≤max

t α_to

4. By repeating steps 1 through 3 on T₂, . . . , T_` we can similarly obtainS₂, . . . ,S_`. 5. Find (¯p₁, . . . ,p¯_`) satisfying the following linear program

|(¯pjCmj)r−(gpmj)r| ≤δ r= 1, . . . , k;j = 1. . . , `;

m= 1, . . . , j−1, j+ 1, . . . , ` (3.24) (¯pj)i≥0; i= 1, . . . , n;j = 1, . . . , ` (3.25)

i=1

(¯p_j)_i = 1; j = 1, . . . , ` (3.26) (¯p_j)_i = 0; i /∈ S_j;j = 1, . . . , ` (3.27)

6. Let D_i = (C_1,i|C_2,i |. . .|Ci−1,i |C_i+1,i |. . .|C_`,i) for i = 1, . . . , `. v = ¯p_iD_i is a convex combination of column vectors in D_i that are in an`k-dimensional subspace. Apply Carath´eodory’s Theorem (Theorem 3.5.1) to obtain pb_i such that v =pb_iD_i for i= 1, . . . , ` and supp(pb_i)≤`k+ 1.

7. Return (pb₁, . . . ,pb_`).

As before, in order to prove the correctness of the above algorithm we postulate the following claims:

Claim 3.6.8 A feasible solution to the linear program formulated in step 5 of the algorithm exists.

Proof. (Of Claim 3.6.8) We will show that (¯p₁ = p^∗₁,p¯₂, . . . ,p¯_` = p^∗_`), the Nash equilibrium strategies for T₁, . . . , T_` satisfy the linear program. Since p^∗_i⁰ = p^∗_iC₁₁ satisfies |p^∗_i⁰[j]−pf⁰_mi[j]| ≤δ for j = 1, . . . , k and for m = [`]− {i}, p^∗_i satisfies (3.24).

(3.25) and (3.26) follow from ¯p_i being a probability distribution. So, it suffices to show that p^∗₁, . . . , p^∗_` satisfy (3.27), i.e., for any r ∈ supp(p^∗_i), r ∈ supp(S_i). We will show this to be true for i = 1; it symmetrically follows for i = 2, . . . , `. Note that

|fp⁰_1i| ≤B and |p^∗_iC_1i−pf⁰_1iC_1i| ≤δ. Then, applying Lemma 3.6.5 gives us

C11(p^∗₂C12. . .p^∗_`C1`)^T

r− ≤

C11(pf⁰₁₂. . .pf⁰_1`)^T

≤ C11(p^∗₂C12. . .p^∗_`C1`)^T

r+ and likewise,

maxt

C₁₁(pf⁰₁₂. . .pf⁰_1`)^T

− ≤ C₁₁(p^∗₂C₁₂. . .p^∗_`C_1`)^T

≤maxt

C11(pf⁰₁₂. . .pf⁰_1`)^T

t+ Combining these two equations,

maxt

C₁₁(pf⁰₁₂. . .pf⁰_1`)^T

−2 ≤

C₁₁(pf⁰₁₂. . .pf⁰_1`)^T

≤ max

C11(pf⁰₁₂. . .pf⁰_1`)^T

Hence r ∈ S₁.

Claim 3.6.9 (pb₁, . . . ,pb_`) as returned by the algorithm is a 4-equilibrium.

Proof. (Of Claim 3.6.9) For the strategy tuple (p₁, . . . , p_`) and given a decomposition of T_i = (C_i1, . . . , C_i`), the payoff to player i given by T_i(p₁, . . . , p_`) may be expressed as

Ti(p1, . . . , p`) =piCii p1Ci1p2Ci2. . .pi−1Ci(i−1)pi+1C_i(i+1). . . p`Ci`

Put this way, we need to show that for any (p⁰₁, . . . , p⁰_`), pb₁, . . . ,pb_l returned by the algorithm satisfy

pb_iC_ii(pb₁C_i1 . . .pdi−1Ci,i−1pd_i+1C_i,i+1. . .pb_`C_i`)^T ≥ p⁰_iC_ii(pb₁C_i1. . .pb_`C_i`)^T − for i = 1, . . . , `. We will prove this for i = 1, the proof follows symmetrically for i = 2, . . . , `. First, we note that from the application of Carath´eodory’s theorem in step 6,

pb₁C₁₁(pb₂C₁₂. . .pb_`C_1`)^T = pb₁C₁₁(¯p₂C₁₂. . .p¯_`C_1`)^T (3.28) Then, applying Lemma 3.6.5

pb₁C₁₁(¯p₂C₁₂. . .p¯_`C_1`)^T ≥ pb₁C₁₁

pf⁰₁₂. . .pf⁰_1`T

− (3.29)

By the same argument as before, since we picked only those r in supp(pb₁)⊆ S₁ such that

C₁₁

pf⁰₁₂. . .pf⁰_1`T

≥max

C₁₁

pf⁰₁₂. . .pf⁰_1`T

−2 we get

pb1C11

pf⁰₁₂. . .pf⁰_1`

≥ max

r C11(r)

pf⁰₁₂. . .pf⁰_1`

−2 (3.30) Applying Lemma 3.6.5 and from Carath´eodory’s Theorem,

maxr

C₁₁

pf⁰₁₂. . .pf⁰_1`T

≥ max

C₁₁(¯p₂C₁₂. . .p¯_`C_1`)^T

−

= max

C₁₁(pb₂C₁₂. . .pb_`C_1`)^T

r−

≥ p⁰₁C₁₁(bp₂C₁₂. . .pb_`C_1`)^T −(3.31) for any p⁰₁

Combining (3.28)–(3.31),

pb1C11(pb2C12. . .pb`C1`)^T ≥p⁰₁C11(pb2C12. . .pb`C1`)^T −4 for any p⁰₁

The total number of guesses made in step 2. of the procedure is k(` −1)` (k guesses for each component of the (`−1) different vectors in step 2, repeated` times in step 4.). Each component is within [−B, B] and is guessed to δ accuracy, so the guessing takes time

(2B/δ)^k(`−1)` = (2B(k`(2B)^`−1)/)^k(`−1)` = (2B)^`k`/^k(`−1)`

For each guess, we solve a linear program, taking time poly(|G|), for a total running time as claimed. This completes the proof of the theorem.

3.6.3 An example of games with known low-rank tensor de-

Dalam dokumen Limited Randomness in Games, and Computational Perspectives in Revealed Preference (Halaman 72-79)