• Tidak ada hasil yang ditemukan

Decision Theory Game Theory

N/A
N/A
Protected

Academic year: 2018

Membagikan "Decision Theory Game Theory"

Copied!
43
0
0

Teks penuh

(1)

Decision Theory &

Game Theory

(2)

Elements of a game

 Players: intelligent opponents (competitors

or enemies)

 Strategies: choices of what to do to defeat

the opponents

 Outcomes = Payoffs: functions of the

different strategies for each player

 Payoff Matrix: Tableau (of the gain of the

Row Player)

 Rule: how to allocate the payoffs to the

(3)

The Game: Example

 Two Players: Player A (Row) and Player B

(Column)

 Tossing a balanced coin

 Possible outcomes: Head (H) and Tail (T)  Rule:

 If the outcome matches (player A selects H and the outcome is also H or player A select T and the

outcomes is also T), Player A gets $1 from Player B;

(4)

The Game: Payoff Matrix

Player A

(Row Player)

Player B

1

– 1

T

– 1

1 H

T H

Strategies of each player : H or T

(5)

Optimum Solution

 Optimum Solution can be achieved if

players choose to apply:

Pure strategy (e.g. select H or T)

(6)

Two-Person Zero-Sum Game

 A game with two players

 A gain of one player equals a loss to the other  The focused player = the row player (Player A)  Player A maximize his minimum gain (why?)  Player B minimize her maximum loss (why?)

 Optimum Solution is gained by Minimax-Maximin

criterion

 Optimum Solution reflects that the game is

(7)
(8)

Two-Person Zero-Sum Game with

Saddle Point

 Player A (Row): Maximin Value = Lower

Value of the game

 Player B (Column): Minimax Value =

Upper Value of the game

 Maximin value = Minimax value  Saddle

(9)

Two-Person Zero-Sum Game with

Saddle Point

 Saddle point leads to Optimum Solution  Saddle point indicates a stable game

(10)

In general

 To maintain the “Optimality” of a game:

maximin value  value of the game  minimax value OR

(11)

Mixed Strategies

 Used for solving a game that does not

have a saddle point

 Used for solving a game that does not

have a saddle point

 Optimum Solution is gained using:

(12)

Unstable Game without Saddle Point 15 9 7 8 Column Max –1 3 –1 4 7 4 4 15 4 5 8 3 2 2 8 7 6 2 –10 0 9 –10 5 1 Player A Min 4 3 2 1 Row Player B Minimax Value Maximin Value

(13)

2

N game

 2  N game:

 Player A has 2 strategies

 Player B has N ( 2) strategies

a2n

a22 a21

x2 = 1 – x1

a1n

a12 a11

x1 A

yn

y2 y1

(14)

2

N game

(a12 – a22)x1 + a22 2

(a11 – a21)x1 + a21 1

… …

(a1n – a2n)x1 + a2n n

(15)

2

N game: Example

B

6 2

3 4

x2

–1 3

2 2

x1 A

y4 y3

(16)

2

N game

– x1 + 3 2

– 2x1 + 4 1

x1 + 2 3

–7 x1 + 6 4

A’s Expected Payoff B’s Pure Strategy

(17)

Graphical Solution x1 = 0 and x1 = 1 = x2

x1 = 0 x1 = 1

1

5

2 3

4 6

-1

4 2

x*1 =1/2

3 1

(18)

Optimum Solution for Player A

 Intercept between lines (2), (3) and (4)

(x1* = ½, x2*= ½)

(2) – x1 + 3 = – ½ + 3 = 5/2

v* (3) x1 + 2 = ½ + 2 = 5/2

(4) –7 x1 + 6 = – 7/2 + 6 = 5/2

(19)

Optimum Solution for Player B

 Combination (2), (3) and (4):

(20)

Optimum Solution for Player B

y2 + 2 3

– y2 + 3 2

B’s Expected Payoff A’s Pure

Strategy

(2,3)  y1 and y4 = 0, y3 = y2 –1 (y2* = y3*)

– y2 + 3 = y2 + 2 – 2 y2 = – 1

y2* = 1/2 dan y

3* = 1/2

(21)

Optimum Solution for Player B

– 7y2 + 6 4

– y2 + 3 2

B’s Expected Payoff A’s Pure

Strategy

(2,4)  y1 and y3 = 0, y4 = y2 –1 (y2* = y4*)

– y2 + 3 = –7y2 + 6 6 y2 = 3

y2* = 1/2 dan y

4* = 1/2

(22)

Optimum Solution for Player B

– 7y3 + 6 4

y3 + 2 3

B’s Expected Payoff A’s Pure

Strategy

(3,4)  y1 and y2 = 0, y4 = y3 –1 (y3* = y4*)

y3 + 2 = –7y3 + 6 8 y3 = 4

y3* = 1/2 dan y

4* = 1/2

(23)

M

2 game

 M 2 game:

Player A has M (2) strategiesPlayer B has 2 strategies

… …

a12 a11

x1 A

y2= 1 – y1 y1

am2 am1

xm

a22 a21

x2

(24)

M

2 game

(a21 – a22)y1 + a22 2

(a11 – a12)y1 + a12 1

… …

(am1 – am2)y1 + am2 m

(25)

M

2 game: Example

B

A x2 3 2

4 2

x1

6 – 2

x3

(26)

M

2 game

y1 + 2 2

– 2 y1 + 4 1

– 8 y1 + 6 3

B’s Expected Payoff A’s Pure Strategy

(27)

Graphical Solution y1 = 0 and y1 = 1 = y2

y1 = 1

1

5

2 3 4 6

-1

2

1 3

-2

(28)

Optimum Solution for Player B

 Intercept between lines (1) and (3)

(y1* = 1/3, y3*= 1/3)

(1) – 2y1 + 4 = – 2/3 + 4 = 10/3

(3) – 8y1 + 6 = – 8/3 + 6 = 10/3

 Player B can mix all the 2 strategies  Player B’s loss = 10/3

(29)

Optimum Solution for Player A

 Combination (1) and (3):

(30)

Optimum Solution for Player A

– 8x1 +6 3

– 2x1 + 4 1

A’s Expected Payoff B’s Pure Strategy

–2x1 + 4 = – 8x1 +6 6 x1 = 2

x1* = 1/3 dan x

3* = 1/3

A’s gain = 10/3

(31)

M

N Games: Simplex

 Focus on Row (Player A)  Duality Problem

 Objective Function: maximize

(32)

M

N Games: Simplex

 Subject to (Constraints):

a11 Y1 + a12 Y2 + . . . + a1nYn  1 a21 Y1 + a22 Y2 + . . . + a2nYn  1

… … …

am1 Y1 + am2 Y2 + . . . + amnYn  1 Y1, Y2, . . . , Yn  0

 w = 1/v  v* = 1/w

(33)

M

N Games: Simplex

 Ensure the tableau does not contain any

zero and negative value

 Use K (a constant value) to make sure that

the tableau does not contain any zero and negative value

 K > negative of the maximin value

(34)

M

N Games: Simplex

 If K is used in the tableau,

v* = 1/w – K

 z = w

(35)

M

N Games: Example

3 3 3 Max Column –4 3 –3 –4 3 –3 –1 3 –3 2 –3 –3 –1 3 1 Min 3 2 1 Row B A
(36)

8 8 8 Max Column 1 8 2 1 3 2 4 8 2 2 2 2 4 8 1 Min 3 2 1 Row B A

(37)

8 8 8 Max Column 1 8 2 1 3 2 4 8 2 2 2 2 4 8 1 Min 3 2 1 Row B A

8Y1 + 4Y2 + 2Y3  1 Subject to :

2Y1 + 8Y2 + 4Y3  1 1Y1 + 2Y2 + 8Y3  1

(38)

8Y1 + 4Y2 + 2Y3  1  8Y1 + 4Y2 + 2Y3 + S1 = 1 Subject to :

2Y1 + 8Y2 + 4Y3  1  2Y1 + 8Y2 + 4Y3 + S2 = 1

1Y1 + 2Y2 + 8Y3  1 1Y1 + 2Y2 + 8Y3 + S3 = 1

Y1, Y2,Y3  0

(39)
(40)

5/49 1/7 -3/98 -1/98 1 0 0 Y3 11/96 -1/14 31/196 -3/98 0 1 0 Y2 1/14 0 -1/14 1/7 0 0 1 Y1 45/196 1/14 11/196 5/49 0 0 0 w Solution S3 S2 S1 Y3 Y2 Y1 Basic

(41)

Solution for B

 w = 45/196

 v* = 1/w – K = 196/45 – 225/45 = –29/45  y1* = Y1/w = (1/14)/(45/196) = 14/45

(42)

Solution for A

 z = w = 45/196  X1 = 5/49

 X2 = 11/196  X3 = 1/14

 x1* = X1/z = (5/49)/(45/196) = 20/45

(43)

The End

Referensi

Dokumen terkait

Dengan hormat kami beritahukan bahwa Komisi Nasional Indonesia untuk UNESCO, Kemdikbud dan PT L’Oreal Indonesia bekerjasama dengan Direktorat Riset dan Pengabdian

The rest parts of the paper are organized as follows: section 2 analyzes characteristics of indoor spatial environment; section 3 analyzes features of indoor spatial

Pengadaan Barang/Jasa Pemerintah yang selanjutnya disebut dengan Pengadaan Barang/Jasa adalah kegiatan untuk memperoleh Barang/Jasa oleh Kementerian/Lembaga/Satuan

After extracting geometric and semantic information of building components for input data, the next step is to fine the most suitable path between two given location

Surat keterangan ini sebagai lampiran ijazah/STTB yang ada.

Our proposed methodology consisted of the viewpoint decision for point-based rendering, the point- based rendering, the normal vector clustering for surface

mengeksekusi setiap alur dalam himpunan basis Data harus dipilih sehingga kondisi pada predicate nodes dapat di uji dengan tepat... UJI KASUS Path 2: nilai (1)

Pembersih wajah yang mengandung minyak biji bunga matahari akan membantu mengurangi pigmentasi pada kulit sehingga kulit akan tampak sehat dan bercahaya, dan