Summary - Introduction to Operations Research

84 CHAPTER 10. NETWORK PROBLEMS

Minimum-cost ( s, t ) -flow problem

Primal Dual

min

∑

(i,j)∈E

c_ijx_ij

∑

j∈V (i,j)∈E

x_ij −

∑

j∈V (j,i)∈E

x_ji =







f i=s

−^f ⁱ=t 0 else

for alli∈^V

0 ≤ ^xij ≤ ^uij for all(i,j)∈^E

maxf ys−^{f y}t−

∑

(i,j)∈E

u_ijv_ij

y_i − ^yj − ^vij ≤ ^cij for all(i,j)∈Ê vij ≥ ⁰ ^{for all}(i,j)∈Ê y_iunrestricted for alli∈^V Obstruction (to feasibility): setS⊆^V^with^s∈^Sând^t∈^S^{such that} ^f >

∑

(i,j)∈E(S,S)

u_ij

Transshipment problem

Primal Dual

min

∑

(i,j)∈E

cijxij

∑

j∈V (i,j)∈E

x_ij −

∑

j∈V (j,i)∈E

x_ji = b_i

|{z}

net supply

for alli∈^V

x_ij ≥ ⁰ ^{for all}(i,j)∈^E

max

∑

i∈V

b_iy_i

yi − ^yj ≤ ^cij for all(i,j)∈^E y_iunrestricted for alli∈^V

Obstruction (to feasibility): setS⊆^V^{such that}

∑

i∈S

bi>₀_and_E(S,S) =^∅

Minimum-cost network flow problem

Primal Dual

min

∑

(i,j)∈E

cijxij

∑

j∈V (i,j)∈E

x_ij −

∑

j∈V (j,i)∈E

x_ji = b_i for alli∈^V

0 ≤ ^xij ≤ ^uij for all(i,j)∈^E

max

∑

i∈V

biyi−

∑

(i,j)∈E

uijvij

yi − ^yj − ^vij ≤ ^cij for all(i,j)∈^E vij ≥ ⁰ ^{for all}(i,j)∈^E yiunrestricted for alli∈^V Obstruction (to feasibility): setS⊆^V^{such that}

∑

i∈S

b_i >

∑

(i,j)∈E(S,S)

u_ij

11

Game Theory

During the 8PM to 9PM time slot, two TV networks compete for an audience of 100 million viewers. The networks announce their schedule ahead of time and do not know of each other’s decision until the show time. Based on that a certain number of people will tune to Network 1 while the rest will watch Network 2. The market research revealed the following expected number of viewers of Network 1.

Network 2

Network 1 Western Soap Opera Comedy

Western 35 15 60

Soap Opera 45 58 50

Comedy 38 14 70

For instance, if Network 1 shows a Western while Network 2 shows a Comedy, then

60million viewers will watch Network 1, and 100−⁶⁰=40million watch Network 2.

Question: What strategy should the two networks use to maximize their viewership?

Terminology:

• Network 1 is a row player.

• Network 2 is a column player.

• The above matrix is called a payoff matrix.

• This is a constant-sum game (the outcome for both players always sums up to a constant 100 million).

How to solve this game? Let us look at the structure of outcomes. For instance, if Network 1 chooses to show Western, then it can get as many as 60 million viewers (if Network 2 chooses to show a Comedy) but also as little as 15 million (if Network 2 shows a Soap Opera). Thus this choice cannot guarantee more than 15 milion viewers for Network 1 (in the worst case). If the network instead chooses to show a Comedy, the situation is even worse, since then we can guarantee only 14 million viewers (the minimum in the 3rd row). The best therefore for Network 1 is to choose to show a Soap Opera in which case 45 million or more viewers will tune to Network 1 regardless of what Network 2 does. Note that in this strategy Network 1 (being the row player) simply calculates the row minimum of each row and then chooes the row with largest row minimum.

By the same argument, Network 2 (being the column player) can maximize its vieweship by calculating each column maximum and choosing column with the smallest column maximum.

It is easy to see that the two outcomes will satisfy the following inequality all rowsmax (row minimum) ≤ ^min

all columns (column maximum)

In this example, Network 1 chooses Soap Opera and Network 2 chooses Western whereby 45 million viewers will watch Network 1 and 55 million will watch Network 2. Note that this choice is simultaneously best for both Network

86 CHAPTER 11. GAME THEORY 1 and Network 2 (we have equalitymax(row minimum) = min(col maximum)). This is called a saddle point and the common value of both sides of the equation is called the value of the game.

• An equilibrium point of the game: choice of strategies for both players such that neither player can improve their outcome by changing his strategy.

• A saddle point of a game is an example of an equilibrium.

11.1 Pure and Mixed strategies

Note the in the above example each player’s strategy was deterministic; they each examined possible outcomes and made a specific single choice to follow. This is called a pure strategy.

Unfortunately, there are games with no saddle points and following a pure strategy may not always give the players the best outcome. (Consider for instance Poker – see next section.) We need to generalize our definition of strategy.

Instead of choosing one fixed move, the player considers all moves and chooses randomly according to a distribution.

The outcome of the game will depend on the chosen distribution.

• a mixed (randomized) strategy(x₁,x₂, . . . ,xn)is a probability distribution overnpossible moves of the player (i.e. satisfiesx₁+x₂+. . .+xn=1)

• optimal strategy is a strategy maximizing the expected gain for the player

Note that since we are now using chance we can sometimes gain more, sometimes less; thus we will settle for maximizing the average outcome (if, say, we play the game repeatedly for many rounds).

Betting Game

Player 1 draws a card from a card deck (and hides it from the other player). Then he decides to either pass in which case he discards the card and pays$1to Player 2, or he will bet in which case it is 2nd player’s turn. Player 2 can either fold in which case she pays$1to Player 1, or she will call and the card is revealed.

If the revealed card is high (10, Jack, Queen, King, Ace), then Player 2 pays$2to Player 1. Otherwise, the card is low (2 through 9) and Player 1 pays$2to Player 2.

Let us analyze the possible strategies for Player 1. Given the card, the card can be either high or low. Based on that the player can either pass or bet. So there are 4 possible strategies: pass on both high and low (PP), pass on high and bet on low (PB), bet on high and pass on low (BP), and bet on both high and low (BB). Player 2 can either Call or Fold.

The possible expected outcomes of the game are then as follows.

Player 2

Player 1 Call Fold Row

Minimum

PP −¹ −¹ −¹

PB −¹₁₃⁸ ₁₃³ −¹₁₃⁸

BP ₁₃² −₁₃³ −₁₃³

BB −₁₃⁶ ¹ −₁₃⁶

Column Maximum

132 1

Thus, for instance, suppose that Player 1 plays strategy BP (bet on high, pass on low) and Player 2 calls. Then either the card is high, Player 1 bets and wins$2, or the card is low and Player 1 folds and loses$1. The chance of getting a high card is5/13and thus getting a low card has probability of8/13. So on average Player 1 gains

13$2+ ⁸

13(−^$1) = ²

13 ≈^$0.15

On the other hand, if Player 1 playsBPbut Player 2 folds, then the expected gain of Player 1 is only−₁₃³ ≈ −^$0.23. So clearly, the intuitive strategy BP (bet on high, pass on low) is not necessarily best against a worst-case opponent.

Note that this is a zero-sum game since either Player 1 pays Player 2 or vice-versa (the sum of the players’ gains is zero). Looking at the table above, we see that largest row minimum is−₁₃³ while smalest column maximum is ₁₃². So this game does not have a saddle point (unlike the first game we discussed).

11.1. PURE AND MIXED STRATEGIES 87 Notice that some strategies are better then others in all possible cases. For instance,

playing BP instead of PP always gives better outcome for Player 1. We say that the strategy PP is dominated by another strategy (in this case BP). Clearly, if a strategy is dominated, the player can always do better by playing another strategy. Thus we can safely remove any dominated strategy from consideration without changing the problem (its optimal solution). In our case, this eliminates the strategyPPas well as the strategyPBwhich is dominated by the strategyBB. This leaves us with only two strategies and the pay-off matrix shown on the right.

Player 2 Player 1 Call Fold

BP ₁₃² −₁₃³

BB −₁₃⁶ ¹

Now we are ready to determine the best mixed strategy for Player 1. The player chooses to playBPwith probabilityx₁or playsBBwith probabilityx₂, wherex₁+x₂=1. We denote this mixed strategy as(x₁,x₂). An expected outcome of this strategy is₁₃²x₁−₁₃⁶^x2if Player 2 calls and is−₁₃³^x1+x₂if Player 2 folds. So the worst-case outcome is simply the minimum of the two

(xmin₁,x2){₁₃²^x1−₁₃⁶^x2,−₁₃³^x1+x₂} Sincex₁+x₂=1, we can simplify this to

outcome=min

x₁ {₁₃⁸^x1−₁₃⁶^{, 1}− ¹⁶₁₃^x1}

1@ 16

A B C1

AB C1^@ 6

A B

0 1

eC D eEF eG

g H IJKL

PMHNe O 1

PMHNe OQL KMGR

PMHNe OQEHMMR

minimum

We can plot the possible outcomes (based on the choices ofx₁) as shown on the right. From this we determine that the best mixed strategy for Player 1 is the pointEcorresponding to strategy(x₁,x₂)wherex₁=19/24andx₂=5/24.

This guarantees the player expected gain of1/39≈^$0.025.

Answer: The best strategy for Player 1 is to randomly choose between betting on high, passing on low with probability 79¹₆%or always bet with probability20⁵₆%. This gives him expected gain of¢2.5.

Similarly, we can determine the best mixed strategy for Player 2. Let(y₁,y₂)^be the probabilities of Player 2 calling (y₁) or folding (y₂) wherey₁+y₂ = 1. If Player 1 playsBP, then the expected outcome (loss) for Player 2 is ₁₃²y₁−₁₃³^y2. If Player 1 playBB, then the outcome is−₁₃⁶^y1+y₂. Thus the worst-case loss for Player 2 is the maximum of the two.

(ymax₁,y2){₁₃²^y1−₁₃³^y2,−₁₃⁶^y1+y₂} Using the fact thaty₁+y₂=1, this simplifies to

maxy₁ {₁₃⁵^y1−₁₃³^{, 1}−¹⁹₁₃^y1}

We can plot the possible outcomes as shown on the right. The best strategy for Player 2 is the pointFcorresponding to strategy(y₁,y₂)^wherey₁= ²₃ ^andy₂=

13with expected loss of ₃₉¹ ≈$0.025. Thus the best strategy for Player 2 is to call with66²₃%probability and to fold with33¹₃%probability.

1T 1U VW

VW X1^T ^W

0 1

Z [\Z] ^Z _

l `aa`d

flh XZj 2

flh XZjV\lh Xakk

flh XZj 1

\lh Xakf

maximum

Linear programing

We can now see what are the best strategies for the two players. Player 1 tries to choose probabilitiesx₁,x₂so as to maximize themin{₁₃²^x1−₁₃⁶^x2,−₁₃³^x1+x₂}where probabilitiesx₁,x₂sum up to 1. Similarly, Player 2 chooses his probabilitiesy₁,y₂so as to minimizemax{₁₃²^y1−₁₃³^y2,−₁₃⁶^y1+y₂}^where^y1+y₂= 1. Using the tricks we learnt, we see that both these problem can be transformed into linear programs as follows.

88 CHAPTER 11. GAME THEORY maxz

subject to ₁₃²x₁ − ₁₃⁶^x2 ≥ ^z

−₁₃³^x1 + x₂ ≥ ^z x₁ + x₂ = 1 x₁,x₂ ≥ ⁰

minw

subject to ₁₃²y₁ − ₁₃³^y2 ≤ ^w

−₁₃⁶^y1 + y₂ ≤ ^w y₁ + y₂ = 1 y₁,y₂ ≥ ⁰ which we can rewrite as follows:

max z

subject to −₁₃²^x1 + ₁₃⁶x₂ + z ≤ ⁰

133x₁ − ^x2 + z ≤ ⁰

x₁ + x₂ = 1

x₁,x₂ ≥ ⁰ zunrestricted

min w

subject to −₁₃²^y1 + ₁₃³y₂ + w ≥ ⁰

136y₁ − ^y2 + w ≥ ⁰

y₁ + y₂ = 1

y₁,y₂ ≥ ⁰ wunrestricted

Notice that the two programs are duals of each other. This is not a coincidence. In fact, it is always the case in zero-sum (constant-sum) games. This tells us that the optimal solutions to the two programs have the same value by Strong Duality, which defines the value of the game. Moreover, the optimal strategies for the two players satisfy Complementary Slackness (verify this for the solutions we found above). In other words, the solutions to the two problems form an equilibrium point (neither play can do better by changing his/hers strategy). In the literature, this is often called Nash equilibrium.

Dalam dokumen Introduction to Operations Research (Halaman 87-92)