PROOFS FOR CHAPTER 2
A.1 Omitted Proofs of General Properties Proof of Lemma 1Proof of Lemma 1
105 A p p e n d i x A
any available action in every information setI๐ = (๐๐, โ๐ก)with probability at least๐ where๐ < ร๐ 1
๐=1|๐ด๐|. Letฮฃ๐ =ร๐
๐=1ฮฃ๐๐ be set of feasible behavioral strategy profiles for players in the perturbed game ฮ๐. For any behavioral strategy profile ๐ โ ฮฃ๐, let ๐๐(ยท) โก (๐
๐ ๐ (ยท))๐
๐=1be the belief system induced by ๐via ๐-cursed Bayesโ rule.
That is, for each player๐ โ ๐, information setI๐ =(๐๐, โ๐ก)whereโ๐ก =(โ๐กโ1, ๐๐ก)and type profile๐โ๐ โฮโ๐,
๐
๐
๐ (๐โ๐|โ๐ก, ๐๐) = ๐ ๐
๐
๐ (๐โ๐|โ๐กโ1, ๐๐) + (1โ ๐)
"
๐
๐
๐ (๐โ๐|โ๐กโ1, ๐๐)๐โ๐(๐๐ก
โ๐|โ๐กโ1, ๐โ๐) ร
๐โฒ
โ๐โฮโ๐ ๐
๐ ๐ (๐โฒ
โ๐|โ๐กโ1, ๐๐)๐โ๐(๐๐ก
โ๐|โ๐กโ1, ๐โฒ
โ๐)
# . Notice that the ๐-cursed Bayesโ rule is only defined on the family of multi-stage
games with observed actions. As ๐ is fully mixed, the belief system is uniquely pinned down.
Finally, let ๐ต๐ : ฮฃ๐ โ ฮฃ๐ be the cursed best response correspondence which maps any behavioral strategy profile๐ โฮฃ๐ to the set of๐-constrained behavioral strategy profiles ห๐ โฮฃ๐ that are best replies given the belief system๐๐(ยท).
Step 2: Next, fix any 0 < ๐ < ร๐ 1
๐=1|๐ด๐| and show that ๐ต๐ has a fixed point by Kakutaniโs fixed point theorem. We check the conditions of the theorem:
1. It is straightforward thatฮฃ๐ is compact and convex.
2. For any๐ โฮฃ๐, as๐๐(ยท)is uniquely pinned down by ๐-cursed Bayesโ rule, it is straightforward that๐ต๐(๐) is non-empty and convex.
3. To verify that ๐ต๐ has a closed graph, take any sequence of ๐-constrained behavioral strategy profiles{๐๐}โ
๐=1 โ ฮฃ๐such that๐๐ โ ๐ โฮฃ๐as๐ โ โ, and any sequence {๐ห๐}โ
๐=1 such that ห๐๐ โ ๐ต๐(๐๐) for any ๐ and ห๐๐ โ ๐ห. We want to prove that ห๐ โ ๐ต๐(๐).
Fix any player๐ โ ๐and information setI๐ = (๐๐, โ๐ก). For any๐ โฮฃ๐, recall that๐
๐
โ๐(ยท)is player๐โs๐-cursed perceived behavioral strategies of other play- ers induced by ๐. Specifically, for any type profile ๐ โ ฮ, non-terminal history โ๐กโ1 and action profile ๐๐ก
โ๐ โ ๐ดโ๐(โ๐กโ1), ๐
๐
โ๐(๐๐ก
โ๐|โ๐กโ1, ๐โ๐, ๐๐) = ๐๐ยฏโ๐(๐๐ก
โ๐|โ๐กโ1, ๐๐) + (1โ ๐)๐โ๐(๐๐ก
โ๐|โ๐กโ1, ๐โ๐). Additionally, recall that ๐
๐
๐ (ยท) is player ๐โs belief about the terminal nodes (conditional on the history and type profile), which is also induced by๐. Since ๐๐(ยท) is continuous in ๐ we have that ๐๐
โ๐(ยท) and ๐๐
๐ (ยท) are also continuous
in ๐. We further defineS๐
I๐
โก ๐โฒ
๐ โฮฃ๐๐ :๐โฒ
๐( ยท |I๐) =๐ห๐
๐ ( ยท |I๐) andSI
๐ โก
๐โฒ
๐ โฮฃ๐๐ :๐โฒ
๐( ยท |I๐) =๐ห๐( ยท |I๐) . Since ห๐๐ โ ๐ต๐(๐๐), for any๐โฒ
๐ โ ฮฃ๐๐, we can obtain that
max
๐โฒโฒ
๐ โS๐
I๐
(
โ๏ธ
๐โ๐โฮโ๐
โ๏ธ
โ๐โH๐
๐
๐
๐ [๐๐] (๐โ๐|โ๐ก, ๐๐)๐
๐
๐ (โ๐|โ๐ก, ๐ , ๐
๐
โ๐[๐๐], ๐โฒโฒ
๐ )๐ข๐(โ๐, ๐๐, ๐โ๐) )
โฅ โ๏ธ
๐โ๐โฮโ๐
โ๏ธ
โ๐โH๐
๐
๐
๐ [๐๐] (๐โ๐|โ๐ก, ๐๐)๐
๐
๐ (โ๐|โ๐ก, ๐ , ๐
๐
โ๐[๐๐], ๐โฒ
๐)๐ข๐(โ๐, ๐๐, ๐โ๐). By continuity, as we take limits on both sides, we can find ห๐ โ ๐ต๐(๐)because
max
๐โฒโฒ
๐ โSI๐
(
โ๏ธ
๐โ๐โฮโ๐
โ๏ธ
โ๐โH๐
๐
๐
๐ [๐] (๐โ๐|โ๐ก, ๐๐)๐
๐
๐ (โ๐|โ๐ก, ๐ , ๐
๐
โ๐[๐], ๐โฒโฒ
๐ )๐ข๐(โ๐, ๐๐, ๐โ๐) )
โฅ โ๏ธ
๐โ๐โฮโ๐
โ๏ธ
โ๐โH๐
๐
๐
๐ [๐] (๐โ๐|โ๐ก, ๐๐)๐
๐
๐ (โ๐|โ๐ก, ๐ , ๐
๐
โ๐[๐], ๐โฒ
๐)๐ข๐(โ๐, ๐๐, ๐โ๐). By Kakutaniโs fixed point theorem, ๐ต๐ has a fixed point.
Step 3: For any ๐, let ๐๐ be a fixed point of ๐ต๐ and ๐๐ be the belief system induced by๐๐ via๐-cursed Bayesโ rule. We combine these two components and let (๐๐, ๐๐) be the induced assessment. We now consider a sequence of๐ โ0,where {(๐๐, ๐๐)} is the corresponding sequence of assessments. By compactness and the finiteness of ฮ, the Bolzano-Weierstrass theorem guarantees the existence of a convergent subsequence of the assessments. As๐ โ 0, let(๐๐, ๐๐) โ (๐โ, ๐โ). By construction, the limit assessment (๐โ, ๐โ) satisfies ๐-consistency and sequential rationality. Hence,(๐โ, ๐โ) is a ๐-CSE.โ
Proof of Proposition 2
To prove ฮฆ(๐) is upper hemi-continuous in ๐, consider any sequence of {๐๐}โ
๐=1
such that ๐๐ โ ๐โ โ [0,1], and any sequence of CSE, {(๐๐, ๐๐)}, such that (๐๐, ๐๐) โ ฮฆ(๐๐) for all ๐. Let (๐โ, ๐โ) be the limit assessment, i.e., (๐๐, ๐๐) โ (๐โ, ๐โ). We need to show that (๐โ, ๐โ) โฮฆ(๐โ).
For simplicity, for any player๐ โ ๐, any information setI๐ = (โ๐ก, ๐๐), any๐โฒ
๐ โ ฮฃ๐, and any ๐ โ ฮฃ, the expected payoff under the belief system ๐๐(ยท) induced by๐ is denoted as:
E๐๐[๐]
๐ข๐(๐โฒ
๐, ๐โ๐|โ๐ก, ๐๐)
โก โ๏ธ
๐โ๐โฮโ๐
โ๏ธ
โ๐โH๐
๐
๐
๐ (๐โ๐|โ๐ก, ๐๐)๐
๐
๐ (โ๐|โ๐ก, ๐ , ๐
๐
โ๐, ๐โฒ
๐)๐ข๐(โ๐, ๐๐, ๐โ๐).
Suppose(๐โ, ๐โ) โ ฮฆ(๐โ). Then there exists some player๐ โ ๐, some information setI๐ =(โ๐ก, ๐๐), some๐โฒ
๐ โฮฃ๐, and some๐ > 0 such that E๐๐โ[๐โ]
๐ข๐(๐โฒ
๐, ๐โ๐โ |โ๐ก, ๐๐)
โE๐๐โ[๐โ]
๐ข๐(๐โ
๐, ๐โ๐โ |โ๐ก, ๐๐)
> ๐ . (A) Since๐๐(ยท)is continuous in๐, it follows that for any strategy profile๐,๐
๐
โ๐(ยท)and ๐
๐
๐ (ยท)are both continuous in ๐. Thus, there exists a sufficiently large ๐1 such that for every๐ โฅ ๐1,
E๐๐
๐[๐๐]
๐ข๐(๐๐
๐ , ๐๐
โ๐|โ๐ก, ๐๐)
โE๐๐โ[๐โ]
๐ข๐(๐โ
๐, ๐โ
โ๐|โ๐ก, ๐๐)
<
๐
3. (B) Similarly, there exists a sufficiently large๐2such that for every๐ โฅ ๐2,
E๐๐
๐[๐๐]
๐ข๐(๐โฒ
๐, ๐โ๐๐ |โ๐ก, ๐๐)
โE๐๐โ[๐โ]
๐ข๐(๐โฒ
๐, ๐โ๐โ |โ๐ก, ๐๐)
<
๐
3. (C) Therefore, for any๐ โฅ max{๐1, ๐2}, inequalities (A), (B) and (C) imply:
E๐๐
๐[๐๐]
๐ข๐(๐โฒ
๐, ๐โ๐๐|โ๐ก, ๐๐)
โE๐๐
๐[๐๐]
๐ข๐(๐๐
๐ , ๐โ๐๐ |โ๐ก, ๐๐)
>
๐ 3, implying that๐โฒ
๐ is a profitable deviation for player๐at information setI๐ = (โ๐ก, ๐๐), which contradicts(๐๐, ๐๐) โฮฆ(๐๐). Therefore, (๐โ, ๐โ) โฮฆ(๐โ), as desired. โ Proof of Proposition 3
Fix any ๐ โ [0,1] and let (๐, ๐) be a ๐-consistent assessment. We prove the result by contradiction. Suppose (๐, ๐) does not satisfy ๐-dampened updating property. Then there exists ๐ โ ๐, ห๐ โ ฮ and a non-terminal history โ๐ก such that ๐๐(๐โ๐|โ๐ก,๐ห๐) < ๐ ๐๐(๐โ๐|โ๐กโ1,๐ห๐).
Since (๐, ๐) is ๐-consistent, there exists a sequence {(๐๐, ๐๐)} โ ฮจ๐ such that (๐๐, ๐๐) โ (๐, ๐) as ๐ โ โ. By Lemma 1, we know for this ๐,๐ห and โ๐ก, ๐๐
๐(๐หโ๐|โ๐ก,๐ห๐)equals to ๐ ๐๐
๐ (๐หโ๐|โ๐กโ1,๐ห๐) + (1โ ๐)
"
๐๐
๐ (๐หโ๐|โ๐กโ1,๐ห๐)๐๐
โ๐(๐๐ก
โ๐|โ๐กโ1,๐หโ๐) ร
๐โฒ
โ๐
๐๐
๐ (๐โฒ
โ๐|โ๐กโ1,๐ห๐)๐๐
โ๐(๐๐ก
โ๐|โ๐กโ1, ๐โฒ
โ๐)
#
โฅ ๐ ๐๐
๐(๐หโ๐|โ๐กโ1,๐ห๐). There will be a contradiction as we take the limit๐ โ โon both sides:
๐๐(๐หโ๐|โ๐ก,๐ห๐) = lim
๐โโ
๐๐
๐(๐หโ๐|โ๐ก,๐ห๐) โฅ lim
๐โโ
๐ ๐๐
๐ (๐หโ๐|โ๐กโ1,๐ห๐) = ๐ ๐๐(๐หโ๐|โ๐กโ1,๐ห๐). โ
Proof of Corollary 2
We prove the statement by induction on๐ก. For๐ก =1, by Proposition 3,๐๐(๐โ๐|โ1, ๐๐) โฅ ๐ ๐๐(๐โ๐|โโ , ๐๐) = ๐F (๐โ๐|๐๐).Next, suppose there is๐กโฒsuch that the statement holds for all 1โค ๐ก โค ๐กโฒโ1. At stage๐กโฒ, by Proposition 3 and the induction hypothesis, we can find that
๐๐(๐โ๐|โ๐ก
โฒ
, ๐๐) โฅ ๐ ๐๐(๐โ๐|โ๐ก
โฒโ1
, ๐๐) โฅ ๐ h
๐๐ก
โฒโ1F (๐โ๐|๐๐)i
= ๐๐ก
โฒF (๐โ๐|๐๐). โ
Proof of Proposition 5
Let the assessment(๐, ๐)be a pooling๐-CSE. We want to show that for any๐โฒ โค ๐, the assessment(๐, ๐)is also a๐โฒ-CSE. Consider any non-terminal historyโ๐กโ1, any player๐, any๐๐ก
๐ โ ๐ด๐(โ๐กโ1) and any๐ โฮ. We can first observe that
ยฏ
๐โ๐(๐๐กโ๐|โ๐กโ1, ๐๐) = โ๏ธ
๐โฒ
โ๐
๐๐(๐โฒโ๐|โ๐กโ1, ๐๐)๐โ๐(๐๐กโ๐|โ๐กโ1, ๐โฒโ๐)
=๐โ๐(๐๐ก
โ๐|โ๐กโ1, ๐โ๐)
๏ฃฎ
๏ฃฏ
๏ฃฏ
๏ฃฏ
๏ฃฏ
๏ฃฐ
โ๏ธ
๐โฒ
โ๐
๐๐(๐โฒ
โ๐|โ๐กโ1, ๐๐)
๏ฃน
๏ฃบ
๏ฃบ
๏ฃบ
๏ฃบ
๏ฃป
= ๐โ๐(๐๐ก
โ๐|โ๐กโ1, ๐โ๐) where the second equality holds because๐ is a pooling behavioral strategy profile, so๐โ๐is independent of other playersโ types. For this pooling ๐-CSE, let๐บ๐be the set of on-path histories and ห๐บ๐ be the set of off-path histories. We can first show that for everyโ โ๐บ๐,๐ โ ๐and๐ โฮ, ๐๐(๐โ๐|โ, ๐๐) =F (๐โ๐|๐๐).
This can be shown by induction on๐ก. For๐ก =1, any โ1 = (โโ , ๐1)and any ๐ โ ฮ, by Lemma 1, we can obtain that
๐๐(๐โ๐|โ1, ๐๐) = ๐ ๐๐(๐โ๐|โโ , ๐๐) + (1โ ๐)
"
๐๐(๐โ๐|โโ , ๐๐)๐โ๐(๐1
โ๐|โโ , ๐โ๐)
ยฏ ๐โ๐(๐1
โ๐|โโ , ๐๐)
#
= ๐F (๐โ๐|๐๐) + (1โ ๐)F (๐โ๐|๐๐)
"
๐โ๐(๐1
โ๐|โโ , ๐โ๐)
ยฏ ๐โ๐(๐1
โ๐|โโ , ๐๐)
#
| {z }
=1
= F (๐โ๐|๐๐).
Now, suppose there is๐กโฒsuch that the statement holds for 1 โค ๐ก โค ๐กโฒโ1. At stage ๐กโฒand โ๐ก
โฒ = (โ๐ก
โฒโ1, ๐๐ก
โฒ) โ ๐บ๐, by Lemma 1 and the induction hypothesis, we can
again obtain that the posterior belief is the prior belief ๐๐(๐โ๐|โ๐ก
โฒ
, ๐๐) = ๐ ๐๐(๐โ๐|โ๐ก
โฒโ1
, ๐๐) + (1โ ๐)
"
๐๐(๐โ๐|โ๐ก
โฒโ1, ๐๐)๐โ๐(๐๐ก
โฒ
โ๐|โ๐ก
โฒโ1, ๐โ๐)
ยฏ ๐โ๐(๐๐ก
โฒ
โ๐|โ๐กโฒโ1, ๐๐)
#
= ๐F (๐โ๐|๐๐) + (1โ ๐)F (๐โ๐|๐๐)
"
๐โ๐(๐๐ก
โฒ
โ๐|โ๐ก
โฒโ1, ๐โ๐)
ยฏ ๐โ๐(๐๐ก
โฒ
โ๐|โ๐กโฒโ1, ๐๐)
#
| {z }
=1
=F (๐โ๐|๐๐).
Therefore, we have shown that players will not update their beliefs at every on-path information set, so the belief system is independent of ๐. Finally, for any off-path history โ๐ก โ ๐บห๐, by Proposition 3, we can find that the belief system satisfies for any๐ โฮ,
๐๐(๐โ๐|โ๐ก, ๐๐) โฅ ๐ ๐๐(๐โ๐|โ๐กโ1, ๐๐) โฅ ๐โฒ๐๐(๐โ๐|โ๐กโ1, ๐๐),
implying that when ๐โฒ โค ๐, ๐ will still satisfy the dampened updating property.
Therefore, (๐, ๐) remains a ๐โฒ-CSE.โ A.2 Omitted Proofs of Section 2.4 Pooling Equilibria in Signaling Games
Proof of Claim 1 First observe that after player 1 chooses ๐ต, it is strictly optimal for player 2 to choose๐ for all beliefs ๐2(๐1|๐ต), and after player 1 chooses ๐ด, it is optimal for player 2 to choose๐ฟif and only if
2๐2(๐1|๐ด) + [1โ๐2(๐1|๐ด)] โฅ 4๐2(๐1|๐ด) โโ ๐2(๐1|๐ด) โค1/3. Equilibrium 1.
If both types of player 1 choose ๐ด, then ๐2(๐1|๐ด) =1/4, so it is optimal for player 2 to choose ๐ฟ. Given๐(๐ด) =๐ฟand๐(๐ต) =๐ , it is optimal for both types of player 1 to choose ๐ดas 2 > 1. Hence ๐(๐1) =๐(๐2) = ๐ด, ๐(๐ด) = ๐ฟ and๐(๐ต) = ๐ is a pooling ๐-CSE for any ๐ โ [0,1].
Equilibrium 2.
In order to support๐(๐1) =๐(๐2)= ๐ตto be an equilibrium, player 2 has to choose ๐ at the off-path information set๐ด,which is optimal if and only if๐2(๐1|๐ด) โฅ1/3.
In addition, by Proposition 3, we know in a ๐-CSE, the belief system satisfies ๐2(๐2|๐ด) โฅ 3
4๐ โโ ๐2(๐1|๐ด) โค1โ 3 4๐ .
Therefore, the belief system has to satisfy that ๐2(๐1|๐ด) โ 1
3,1โ 3
4๐
, which requires๐ โค 8/9.
Finally, it is straightforward to verify that for any ๐ โ 1
3,1โ 3
4๐
, ๐2(๐1|๐ด) = ๐ satisfies ๐-consistency. Suppose type ๐1 player 1 chooses ๐ด with probability ๐ and type๐2 player 1 chooses ๐ด with probability๐ where ๐, ๐ โ (0,1). Given this behavioral strategy profile for player 1, by Lemma 1, we have:
๐2(๐1|๐ด) = 1
4๐+ (1โ ๐) ๐
๐+3๐
. In other words, as long as(๐, ๐) satisfies
๐ =
4โ4๐โ3๐ 12โ3๐
๐,
we can find that๐2(๐1|๐ด)= ๐. Therefore, if{(๐๐, ๐๐)} โ (0,0)such that ๐๐ =
4โ4๐โ3๐ 12โ3๐
๐๐, then ๐๐
2(๐1|๐ด) = ๐ for all ๐. Hence, lim๐โโ๐๐
2(๐1|๐ด) = ๐, suggesting that ๐2(๐1|๐ด)= ๐is indeed ๐-consistent. This completes the proof. โ
Proof of Proposition 6 Here we provide a characterization of ๐-CSE of Game 1 and Game 2. For the analysis of both games, we denote ๐๐ผ โก ๐2(๐1|๐ = ๐ผ) and ๐๐ โก ๐2(๐1|๐ =๐).
Analysis of Game BH 3.
At information set๐, given๐๐, the expected payoffs of๐ถ, ๐ท,๐ธare 90๐๐, 30โ15๐๐ and 15, respectively. Therefore, for any ๐๐, ๐ธ is never a best response. Moreover, ๐ถ is the best response if and only if 90๐๐ โฅ 30โ15๐๐ or ๐๐ โฅ 2/7. Similarly, at information set ๐ผ, given๐๐ผ, the expected payoffs of๐ถ, ๐ท, ๐ธ are 30, 45โ45๐๐ผ and 15, respectively. Therefore, ๐ธ is strictly dominated, and๐ถ is the best response if and only if 30โฅ 45โ45๐๐ผ or ๐๐ผ โฅ 1/3. Now we consider four cases.
Case 1 [๐(๐1) = ๐ผ , ๐(๐2) =๐]:
By Lemma 1, ๐๐ผ =1โ ๐/2 and ๐๐ = ๐/2. Moreover, since ๐๐ผ = 1โ ๐/2 โฅ 1/2 for any ๐, player 2 will choose๐ถat information set๐ผ. To support this equilibrium, player 2 has to choose๐ถ at information set ๐. In other words, [(๐ผ , ๐);(๐ถ , ๐ถ)] is separating ๐-CSE if and only if๐๐ โฅ 2/7 or ๐ โฅ 4/7.
Case 2 [๐(๐1) =๐, ๐(๐2) =๐ผ]:
By Lemma 1,๐๐ผ = ๐/2 and๐๐ =1โ๐/2. Because๐๐ โฅ 1โ๐/2โฅ 1/2, it is optimal for player 2 to choose ๐ถ at information set ๐. To support this as an equilibrium, player 2 has to choose๐ทat information set๐ผ. Yet, in this case, type๐2player 1 will deviate to๐. Therefore, this profile cannot be supported as an equilibrium.
Case 3 [๐(๐1) = ๐ผ , ๐(๐2) =๐ผ]:
Since player 1 follows a pooling strategy, player 2 will not update his belief at information set ๐ผ, i.e., ๐๐ผ = 1/2. ๐-dampened updating property implies ๐/2 โค ๐๐ โค 1โ ๐/2. Since ๐๐ผ > 1/3, player 2 will choose ๐ถ at information set ๐ผ. To support this profile to be an equilibrium, player 2 has to choose๐ทat information set ๐, and hence, it must be the case that๐๐ โค 2/7. Coupled with the requirement from ๐-dampened updating, the off-path belief has to satisfy ๐/2 โค ๐๐ โค 2/7. That is, [(๐ผ , ๐ผ);(๐ถ , ๐ท)] is pooling ๐-CSE if and only if ๐/2 โค 2/7 or ๐ โค 4/7.
Case 4 [๐(๐1) =๐, ๐(๐2) =๐]:
Similar to the previous case, since player 1 follows a pooling strategy, player 2 will not update his belief at information set ๐, i.e., ๐๐ = 1/2. Also, the ๐-dampened updating property suggests ๐/2 โค ๐๐ผ โค 1โ ๐/2. Because ๐๐ > 2/7, it is optimal for player 2 to choose ๐ถ at information set ๐. To support this as an equilibrium, player 2 has to choose ๐ท at information set๐ผ. Therefore, it must be that ๐๐ผ โค 1/3.
Combined with the requirement of ๐-dampened updating, the off-path belief has to satisfy ๐/2 โค ๐๐ผ โค 1/3. As a result, [(๐, ๐);(๐ท , ๐ถ)] is a pooling ๐-CSE if and only if ๐ โค 2/3.
Analysis of Game BH 4.
At information set ๐ผ, given ๐๐ผ, the expected payoffs of๐ถ, ๐ท, ๐ธ are 30, 45โ45๐๐ผ and 35๐๐ผ. Hence,๐ทis the best response if and only if๐๐ผ โค 1/3 while๐ธ is the best response if ๐๐ผ โฅ 6/7. For 1/3 โค ๐๐ผ โค 6/7, ๐ถ is the best response. On the other hand, since player 2โs payoffs at information set๐are the same as in Game 1, player 2 will adopt the same decision ruleโplayer 2 will choose๐ถif and only if๐๐ โฅ 2/7, and choose๐ท if and only if๐๐ โค 2/7. Now, we consider the following four cases.
Case 1 [๐(๐1) = ๐ผ , ๐(๐2) =๐]:
In this case, by Lemma 1, ๐๐ผ = 1โ ๐/2 and ๐๐ = ๐/2. To support this profile to be an equilibrium, player 2 has to choose ๐ธ and๐ถ at information set ๐ผ and ๐, respectively. To make it profitable for player 2 to choose ๐ธ at information set๐ผ, it must be that:
๐๐ผ =1โ ๐/2โฅ 6/7 โโ ๐ โค 2/7.
On the other hand, player 2 will choose๐ถat information set๐if and only if๐/2 โฅ 2/7 or ๐ โฅ 4/7, which is not compatible with the previous inequality. Therefore, this profile cannot be supported as an equilibrium.
Case 2 [๐(๐1) =๐, ๐(๐2) =๐ผ]:
In this case, by Lemma 1, ๐๐ผ = ๐/2 and ๐๐ = 1โ ๐/2. To support this as an equilibrium, player 2 has to choose๐ทat both information sets. Yet,๐๐ =1โ๐/2>
2/7, implying that it is not a best reply for player 2 to choose ๐ท at information set ๐. Hence this profile also cannot be supported as an equilibrium.
Case 3 [๐(๐1) = ๐ผ , ๐(๐2) =๐ผ]:
Since player 1 follows a pooling strategy, player 2 will not update his belief at information set ๐ผ, i.e., ๐๐ผ = 1/2. The ๐-dampened updating property implies ๐/2 โค ๐๐ โค 1โ ๐/2. Because 1/3 < ๐๐ผ = 1/2 < 6/7, player 2 will choose ๐ถ at information set ๐ผ. To support this profile as an equilibrium, player 2 has to choose ๐ท at information set ๐, and hence, it must be the case that ๐๐ โค 2/7.
Coupled with the requirement of ๐-dampened updating, the off-path belief has to satisfy ๐/2 โค ๐๐ โค 2/7. That is, [(๐ผ , ๐ผ);(๐ถ , ๐ท)] is pooling ๐-CSE if and only if ๐/2 โค 2/7 or ๐ โค 4/7.
Case 4 [๐(๐1) =๐, ๐(๐2) =๐]:
Similar to the previous case, since player 1 follows a pooling strategy, player 2 will not update his belief at information set ๐, i.e., ๐๐ = 1/2. Also, the ๐-dampened updating property implies๐/2โค ๐๐ผ โค 1โ๐/2. Because๐๐ > 2/7, it is optimal for player 2 to choose๐ถ at information set๐. To support this as an equilibrium, player 2 can choose either๐ถor๐ท at information set๐ผ.
Case 4.1: To make it a best reply for player 2 to choose ๐ท at information set ๐ผ, it must be that๐๐ผ โค 1/3. Combined with the requirement from๐-dampened updating, the off-path belief has to satisfy ๐/2 โค ๐๐ผ โค 1/3. As a result,[(๐, ๐);(๐ท , ๐ถ)]is a pooling ๐-CSE if and only if ๐ โค 2/3.
Case 4.2: To make it a best reply for player 2 to choose๐ถ at information set ๐ผ, it must be that 1/3 โค ๐๐ผ โค 6/7. Combined with the requirement from ๐-dampened updating, the off-path belief has to satisfy
max 1
2๐, 1 3
โค ๐๐ผ โค min 6
7,1โ 1 2๐
.
For any ๐ โ [0,1], one can find ๐๐ผ that satisfies both inequalities. Hence [(๐, ๐);(๐ถ , ๐ถ)] is a pooling๐-CSE for any ๐.
This completes the analysis of Game BH 3 and Game BH 4. โ A Public Goods Game with Communication
Proof of Proposition 7 To prove this set of cost cutoffs form a ๐-CSE, we need to show that there is no profitable deviation for any type at any subgame. First, at the second stage where there are exactly 0 โค ๐ โค ๐ โ1 players sending 1 in the first stage, since no players will contribute, setting๐ถ๐
๐ =0 is indeed a best response.
At the subgame where all ๐ players send 1 in the first stage, we use ๐
๐
๐ (๐โ๐|๐) to denote player๐โs cursed belief density. By Lemma 1, the cursed belief about all other players having a cost lower than๐is simply:
๐น๐(๐) โก
โซ
{๐๐โค๐,โ๐โ ๐}
๐
๐
๐ (๐โฒโ๐|๐)๐๐โฒโ๐
=
๏ฃฑ๏ฃด
๏ฃด
๏ฃฒ
๏ฃด๏ฃด
๏ฃณ
๐(๐/๐พ)๐โ1+ (1โ ๐) ๐/๐ถ
๐ ๐
๐โ1
if๐ โค ๐ถ
๐ ๐
1โ ๐+ ๐ ๐/๐ถ
๐ ๐
๐โ1
if๐ > ๐ถ
๐ ๐, and๐ถ
๐
๐ is the solution of the fixed point problem of๐ถ
๐
๐ =๐น๐(๐ถ
๐ ๐).
Moreover, in equilibrium,๐ถ
๐
๐ type of players would be indifferent between sending 1 and 0 in the communication stage. Thus, given ๐ถ
๐ ๐, ๐ถ
๐
๐ is the solution of the following equation
0= ๐ถ
๐ ๐
๐พ ๐โ1
โ๐ถ
๐
๐ +๐น๐(๐ถ
๐ ๐)
. As a result, we obtain that in equilibrium,๐ถ
๐
๐ =๐ถ
๐
๐ =๐น๐(๐ถ
๐
๐) โค 1 and denote this cost cutoff by๐ถโ(๐ , ๐พ , ๐). Substituting it into๐น๐(๐), gives:
๐ถโ(๐ , ๐พ , ๐) โ ๐
๐ถโ(๐ , ๐พ , ๐) ๐พ
๐โ1
=1โ ๐ .
In the following, we show that for any ๐ โฅ 2 and ๐, the cutoff ๐ถโ(๐ , ๐พ , ๐) is unique.
Case 1: When ๐ = 2, the cutoff ๐ถโ(2, ๐พ , ๐) is the unique solution of the linear equation
๐ถโ(2, ๐พ , ๐) โ๐
๐ถโ(2, ๐พ , ๐) ๐พ
=1โ ๐ โโ ๐ถโ(2, ๐พ , ๐) = ๐พ โ๐พ ๐ ๐พ โ๐
.
Case 2: For๐ โฅ 3, we define the function โ(๐ฆ) : [0,1] โRwhere โ(๐ฆ)= ๐ฆโ ๐
๐ฆ ๐พ
๐โ1
โ (1โ ๐).
It suffices to show that โ(๐ฆ)has a unique root in [0,1]. When ๐=0, โ(๐ฆ) = ๐ฆโ1 which has a unique root at ๐ฆ = 1. In the following, we will focus on the case where ๐ > 0. Since โ(๐ฆ) is continuous, โ(0) = โ(1 โ ๐) < 0 and โ(1) = ๐
1โ (1/๐พ)๐โ1
> 0, there exists a root ๐ฆโ โ (0,1) by the intermediate value theorem. Moreover, as we take the second derivative, we can find that for any ๐ฆ โ (0,1),
โโฒโฒ(๐ฆ) =โ ๐ ๐พ๐โ1
(๐โ1) (๐โ2)๐ฆ๐โ3 < 0,
implying thatโ(๐ฆ)is strictly concave in[0,1]. Furthermore,โ(0) < 0 andโ(1) > 0, so the root is unique, as illustrated in the left panel of Figure 3. This completes the proof. โ
Proof of Corollary 3 By Proposition 7, we know the cutoff๐ถโ(๐ , ๐พ , ๐) โค 1 and it satisfies
๐ถโ(๐ , ๐พ , ๐) โ ๐
๐ถโ(๐ , ๐พ , ๐) ๐พ
๐โ1
=1โ ๐ .
Therefore, when ๐ =0, the condition becomes๐ถโ(๐ , ๐พ ,0) =1. In addition, when ๐ =1, the condition becomes
๐ถโ(๐ , ๐พ ,1) โ
๐ถโ(๐ , ๐พ ,1) ๐พ
๐โ1
=0, implying๐ถโ(๐ , ๐พ ,1) =0.
For๐ โ (0,1), to prove๐ถโ(๐ , ๐พ , ๐)is strictly decreasing in๐,๐พand๐, we consider a function๐(๐ฆ;๐ , ๐พ , ๐) : (0,1) โ Rwhere๐(๐ฆ;๐ , ๐พ , ๐) = ๐ฆโ ๐[๐ฆ/๐พ]๐โ1.For any ๐ฆ โ (0,1)and fix any๐พ and ๐, we can observe that when ๐ โฅ2,
๐(๐ฆ;๐ +1, ๐พ) โ๐(๐ฆ;๐ , ๐พ) =โ๐ h๐ฆ
๐พ i๐
+๐ h๐ฆ
๐พ i๐โ1
> 0,
so ๐(ยท;๐ , ๐พ , ๐) is strictly increasing in ๐. Therefore, the cutoff ๐ถโ(๐ , ๐พ , ๐) is strictly decreasing in ๐. Similarly, for any๐ฆ โ (0,1) and fix any๐ and ๐, observe that when๐พ >1,
๐ ๐
๐ ๐พ
= ๐(๐โ1) ๐ฆ๐โ1
๐พ๐
> 0,
which implies that cutoff ๐ถโ(๐ , ๐พ , ๐) is also strictly decreasing in ๐พ. For the comparative statics of ๐, we can rearrange the equilibrium condition where
1โ๐ถโ(๐ , ๐พ , ๐) ๐
=1โ
๐ถโ(๐ , ๐พ , ๐) ๐พ
๐โ1
.
Since LHS is strictly decreasing in๐, the equilibrium cutoff is also strictly decreasing in ๐. Finally, taking the limit on both sides of the equilibrium condition, we obtain:
lim
๐โโ
๐ถโ(๐ , ๐พ , ๐) = lim
๐พโโ
๐ถโ(๐ , ๐พ , ๐) =1โ ๐ . This completes the proof. โ
The Centipede Game with Altruistic Types
Proof of Claim 2 By backward induction, we know selfish player two will choose ๐4 for sure. Given that player two will choose๐4 at stage four, it is optimal for selfish player one to choose ๐3. Now, suppose selfish player one will choose ๐1 with probability๐1and player two will choose ๐2 with probability ๐2. Given this behavioral strategy profile, player twoโs belief about the other player being altruistic at stage two is:
๐= ๐ผ
๐ผ+ (1โ๐ผ)๐1 .
In this case, it is optimal for selfish player two to pass if and only if 32๐+4(1โ ๐) โฅ8 โโ ๐ โฅ 1
7.
At the equilibrium, selfish player two is indifferent between๐2 and๐2. If not, say 32๐+4(1โ๐) > 8, player two will choose๐2. Given that player two will choose๐2, it is optimal for selfish player one to choose ๐1, which makes ๐ =๐ผand๐ผ > 1/7.
However, we know ๐ผ โค 1/7 which yields a contradiction. On the other hand, if 32๐+4(1โ๐) < 8, then it is optimal for player two to choose๐2 at stage two. As a result, selfish player one would choose ๐1 at stage one, causing ๐ = 1. In this case, player two would deviate to choose๐2, which again yields a contradiction. To summarize, in equilibrium, player two has to be indifferent between๐2 and๐2, i.e., ๐=1/7. As we rearrange the equality, we can obtain that
๐ผ ๐ผ+ (1โ๐ผ)๐โ
1
= 1
7 โโ ๐โ
1= 6๐ผ 1โ๐ผ
.
Finally, since the equilibrium requires selfish player one to mix at stage one, selfish player one has to be indifferent between๐1 and๐1. Therefore,
4=16๐โ
2+2(1โ๐โ
2) โโ ๐โ
2 = 1 7. This completes the proof. โ
Proof of Proposition 8 By backward induction, we know selfish player two will choose๐4 for sure. Given this, it is optimal for selfish player one to choose๐3. Now, suppose selfish player one will choose ๐1 with probability ๐1 and player two will choose๐2 with probability๐2. Given this behavioral strategy profile, by Lemma 1, player twoโscursedbelief about the other player being altruistic at stage 2 is:
๐๐ = ๐๐ผ+ (1โ ๐)
๐ผ
๐ผ+ (1โ๐ผ)๐1
. In this case, it is optimal for player two to pass if and only if 32๐๐+4(1โ๐๐) โฅ 8 โโ ๐๐ โฅ 1
7.
We can first show that in equilibrium, it must be that ๐๐ โค 1/7. If not, then it is strictly optimal for player two to choose ๐2. Therefore, it is optimal for selfish player one to choose๐1 and hence ๐๐ =๐ผโค 1/7, which yields a contradiction. In the following, we separate the discussion into two cases.
Case 1: ๐ โค 6
7(1โ๐ผ)
In this case, we argue that player two is indifferent between๐2 and๐2. If not, then 32๐๐ +4(1โ ๐๐) < 8 and it is strictly optimal for player two to choose๐2. This would cause selfish player one to choose๐1 and hence ๐๐ = 1โ (1โ๐ผ)๐. This yields a contradiction because
๐๐ =1โ (1โ๐ผ)๐ < 1
7 โโ ๐ > 6 7(1โ๐ผ).
Therefore, in this case, player two is indifferent between๐2 and๐2 and thus, ๐๐ = 1
7 โโ ๐๐ผ+ (1โ ๐)
"
๐ผ ๐ผ+ (1โ๐ผ)๐
๐ 1
#
= 1 7
โโ ๐+ 1โ ๐
๐ผ+ (1โ๐ผ)๐
๐ 1
= 1 7๐ผ
โโ ๐ผ+ (1โ๐ผ)๐
๐
1 =(1โ ๐) 1 7๐ผ
โ ๐
โโ ๐
๐ 1 =
7๐ผโ7๐ผ ๐ 1โ7๐ผ ๐
โ๐ผ (1โ๐ผ).
Since the equilibrium requires selfish player one to mix at stage 1, selfish player one has to be indifferent between๐1 and๐1. Therefore,
4=16๐
๐
2 +2(1โ๐
๐
2) โโ ๐
๐ 2 = 1
7. Case 2: ๐ > 6
7(1โ๐ผ)
In this case, we know for any๐
๐
1 โ [0,1], ๐๐ = ๐๐ผ+ (1โ ๐)
"
๐ผ ๐ผ+ (1โ๐ผ)๐
๐ 1
#
โค 1โ (1โ๐ผ)๐ < 1 7,
implying that it is strictly optimal for player two to choose๐2, and hence it is strictly optimal for selfish player one to choose๐1 at stage 1. This completes the proof. โ Sequential Voting over Binary Agendas
Proof of Proposition 9 If ๐1(๐1) = ๐ and all other types of voters as well as type๐1at stage 2 vote sincerely, voter๐โs ๐-cursed belief in the second stage upon observing๐1
โ๐= (๐, ๐) is
๐
๐
๐ (๐โ๐|๐1
โ๐ =(๐, ๐))=
๏ฃฑ๏ฃด
๏ฃด๏ฃด
๏ฃด๏ฃด
๏ฃฒ
๏ฃด๏ฃด
๏ฃด๏ฃด
๏ฃด
๏ฃณ
๐1๐3๐+ ๐1
๐1+๐2(1โ ๐) if๐โ๐ =(๐3, ๐1) ๐2๐3๐+ ๐2
๐1+๐2(1โ ๐) if๐โ๐ =(๐3, ๐2)
๐๐๐๐๐ otherwise.
As mentioned in Section 4.4, a voter would act as if he perceives the other votersโ
(behavioral) strategies correctly in the last stage. However, misunderstanding the link between the other votersโ types and actions would distort a voterโs belief updating process. In other words, a voter would perceive the strategies correctly but form beliefs incorrectly. As a result, the continuation value of the ๐ vs ๐ subgame to a type ๐1voter is simply the voterโs ๐-cursed belief, conditional on being pivotal, about there being at least one type ๐1 voter among his opponents. Similarly, the continuation value of the ๐ vs ๐ subgame is equal to the voterโs conditional ๐- cursed belief about there being at least one type๐1or๐2voter among his opponents multiplied by ๐ฃ. Therefore, the continuation values to a type ๐1 voter in the two possible subgames of the second stage are (let ห๐2โก ๐1
๐1+๐2):
๐vs๐ : ๐
1โ (1โ ๐1)2
+ (1โ ๐)๐ห2 ๐vs๐ :
1โ๐2
3๐
๐ฃ
It is thus optimal for a type๐1voter to vote for๐in the first stage if ๐
1โ (1โ ๐1)2
+ (1โ๐)๐ห2 โค 1โ๐2
3๐
๐ฃ
โโ [2๐1โ๐2
1โ ๐ห2+ ๐2
3๐ฃ]๐ โค ๐ฃโ๐ห2 (A.1)
Notice that the statement would automatically hold when ๐ =0. In the following, we want to show that given ๐ฃ and ๐, if condition (A.1) holds for some ๐ โ (0,1], then it will hold for all ๐โฒ โค ๐. As๐ > 0, we can rewrite condition (A.1) as
2๐1โ ๐2
1โ ๐ห2+๐2
3๐ฃ โค ๐ฃโ ๐ห2 ๐
. (2โ)
Case 1: ๐ฃโ๐ห2 <0.
In this case, we want to show that voting๐in the first stage is never optimal for type ๐1voter. That is, we want to show condition (2โ) never holds for๐ฃ < ๐ห2. To see this, we can first observe that the RHS is strictly increasing in๐. Therefore, it suffices to show
2๐1โ ๐2
1โ ๐ห2+๐2
3๐ฃ > ๐ฃโ ๐ห2. This is true because
2๐1โ ๐2
1โ ๐ห2+๐2
3๐ฃโ (๐ฃโ ๐ห2) =2๐1โ ๐2
1โ (1โ ๐2
3)๐ฃ
> 2๐1โ ๐2
1โ (1+๐3)๐1= ๐1๐2 โฅ 0 where the second inequality holds as๐ฃ <
๐1 ๐1+๐2. Case 2: ๐ฃโ๐ห2 โฅ 0.
Since the RHS of condition (2โ) is greater or equal to 0, it will weakly increase as ๐ decreases. Thus, if condition (2โ) holds for some ๐ โ (0,1], it will also hold for all ๐โฒโค ๐. This completes the proof. โ
Proof of Proposition 10 Assuming that all voters vote sincerely in both stages, voter๐โs ๐-cursed belief in the second stage upon observing๐1
โ๐ =(๐, ๐) is
๐
๐
๐ (๐โ๐|๐1โ๐ =(๐, ๐))=
๏ฃฑ๏ฃด
๏ฃด๏ฃด
๏ฃด๏ฃด
๏ฃฒ
๏ฃด๏ฃด
๏ฃด๏ฃด
๏ฃด
๏ฃณ
๐1๐2๐+ ๐1
๐1+๐3(1โ ๐) if๐โ๐ =(๐1, ๐2) ๐2๐3๐+ ๐3
๐1+๐3(1โ ๐) if๐โ๐ =(๐3, ๐2)
๐๐๐๐๐ otherwise.
Similar to the proof of Proposition 9, the continuation values to a type๐1 voter in the two possible subgames of the second stage are (let ห๐3โก ๐1
๐1+๐3):
๐vs๐ : ๐
1โ (1โ ๐1)2
+ (1โ ๐)๐ห3 ๐vs๐ :
1โ๐2
3๐
๐ฃ
Thus, it is optimal for a type๐1voter to vote for๐in the first stage if ๐
1โ (1โ ๐1)2
+ (1โ๐)๐ห3 โฅ 1โ๐2
3๐
๐ฃ
โโ ๐
2๐1โ ๐2
1โ ๐ห3+๐2
3๐ฃ
โฅ ๐ฃโ ๐ห3. (A.2)
Case 1: ๐ฃโ๐ห3 >0.
In this case, we want to show that given ๐ and๐ฃ, there exists ห๐such that condition (A.2) holds if and only if ๐ โฅ ๐ห. Let ๐ โก 2๐1โ ๐2
1โ ๐ห3+ ๐2
3๐ฃ. If ๐ > 0, then condition (A.2) holds if and only if ๐ โฅ ๐ห โก ๐ฃโ๐ห3
๐ . On the other hand, if ๐ โค 0, condition (A.2) will not hold for all ๐ โ [0,1]and hence we can set ห๐=2.
Case 2: ๐ฃโ๐ห3 โค 0.
In this case, we want to show that given ๐ and๐ฃ, there exists ห๐such that condition (A.2) holds if and only if ๐ โค ๐ห. If๐ <0, then condition (A.2) holds if and only if ๐ โค ๐ฃโ๐ห3
๐ where the RHS is greater or equal to 0. On the other hand, if๐ โฅ 0, then condition (A.2) will hold for any ๐ โ [0,1]and hence we can again set ห๐ =2. This completes the proof. โ
The Dirty Faces Game
Proof of Proposition 11 When observing a clean face, a player will know that he has a dirty face immediately. Therefore, choosing 1 (i.e., choosing ๐ท at stage 1) when observing a clean face is a strictly dominant strategy. In other words, for any ๐ โ [0,1], ห๐๐(๐) =1.
The analysis of the case where the player observes a dirty face is separated into two cases.
Case 1: ๐ > ๐ผยฏ
In this case, we show that ห๐๐(๐) =๐+1 is the only๐-CE. If not, suppose ห๐๐(๐)=๐ก where๐ก โค ๐can be supported as a๐-CE. We can first notice that ห๐๐(๐) =1 cannot