GAMES PLAYED BY TEAMS OF PLAYERS
2.5 Stochastic Rationality and Team Response Functions
a scoring rule but satisfies the Nash convergence property. Finally, we conjecture that the class is much broader than scoring rules, including many other anonymous and neutral collective choice rules. Scoring rules operate only on the individual ordinalrankings of estimated expected payoffs. One imagines that there are many collective choice rules that operate on the cardinal values of the estimates and also have the Nash convergence property, such as weighted average rules.
response functions to satisfy these two properties. We first show that payoff mono- tonicity of team response functions requires only two weak assumptions on the collective choice rule, unanimity and positive responsiveness. On the other hand, rank dependence holds only for a more restricted class of neutral collective choice rules. Many non-neutral collective choice rules, such as those that give a status quo advantage to an action, will fail to satisfy rank dependence, as the last example in section 3.1 demonstrates. Second, we show that rank dependence is satisfied for πΎπ‘ =2 with any collective choice rule satisfying unanimity, positive responsiveness, and neutrality, and forπΎπ‘ > 2 with plurality rule or weighted average rules.
Payoff Monotonicity For any team game, Ξ, ππΆ
π‘ depends on the strategy profile πΌ, the distribution of memberβs estimation errors, πΉπ‘, and the team collective choice rule,πΆπ‘. In this section we identify conditions onπΆπ‘that are sufficient forππΆ
π‘ to be payoff monotone for all admissibleπΉπ‘. The formal definition of payoff monotonicity is given below.
Definition 4. A team collective choice ruleπΆπ‘ satisfiesPayoff Monotonicityif, for allππ‘
π, πΌ, πΌβ²: ππ‘
π(πΌ) > ππ‘
π(πΌβ²)andππ‘
π(πΌ) =ππ‘
π(πΌβ²) βπ β π β ππΆ
π‘
π (ππ‘(πΌ)) > ππΆ
π‘
π (ππ‘(πΌβ²)).
Specifically, we require team collective choice rules to satisfy two axioms: unanimity and positive responsiveness. The first condition, unanimity, simply states that if all members of the team estimate that ππ‘
π has the highest expected utility, then it is uniquely chosen byπΆπ‘.17
Definition 5. A team collective choice ruleπΆπ‘ satisfiesUnanimityif:
πbπ‘
π π >πbπ‘
ππ for allπ βπ‘and for allπ β π βπΆπ‘(πbπ‘) ={ππ‘
π}.
In addition to using this axiom to prove payoff monotonicity, it also guarantees that team response functions are interior, in the sense that every action is chosen with positive probability. The second axiom, positive responsiveness, requires that the team choice responds positively to all members of a team increasing their estimated expected payoff of an action, keeping all other estimated expected payoffs the same.
The following definition is used in the statement of the axiom.
Definition 6. A profile πΛπ‘ of member estimated expected utilities isa monotonic transformation ofπΛπ‘ with respect to action ππ‘
π if, for all membersπ β π‘, we have
Λ ππ‘
π π β₯πΛπ‘
π π andπΛπ‘
ππ =πΛπ‘
ππ for allπ β π.
17For the "standard" case of games played by one-person teams, unanimity implies that ifπ=1 then every team equilibrium is equivalent to a quantal response equilibrium of the strategic form game,[π , π΄, π’].
Definition 7. A team collective choice rule πΆπ‘ satisfies Positive Responsiveness if ππ‘
π β πΆπ‘(πΛπ‘) β ππ‘
π β πΆπ‘(πΛπ‘) β πΆπ‘(πΛπ‘), for all ππ‘
π, πΛπ‘ and all monotonic transformationsπΛπ‘ofπΛπ‘ with respect to actionππ‘
π.
This definition of Positive Responsiveness is essentially a cardinal version of the usual definition of positive responsiveness from the social choice literature. It says that if an action ππ‘
π is chosen at some profile of estimated expected utilities, and all team membersβ estimates of the expected utility of that action weakly increase, ceteris paribus, thenππ‘
π must still be chosen, and no new actions can be added to the choice set.
Many collective choice rules satisfy positive responsiveness. For example, any weighted average rule, where the team choice corresponds to the action with the highest weighted average of individual members estimates, is positively responsive.
Plurality rule also clearly satisfies this condition. In this section we consider a class of collective choice rules, calledgeneralized scoring rules, and show that positive responsiveness is satisfied for any such collective choice rule. A generalized scoring rule is substantially more general than the standard definition of a scoring rule in the social choice literature, which was defined in the previous section as ananonymous scoring rules (i.e., all the individual scoring functions are the same). Generalized scoring rules relax the anonymity requirement that all individual scoring functions are the same. It includes a wide range of non-anonymous collective choice rules, including dictatorial rules.
Definition 8. A team collective choice ruleπΆπ‘is aGeneralized Scoring Ruleif there exists a profile of individual scoring functions,(ππ‘
1, ...ππ‘
ππ‘), such that for allππ‘
π β π΄π‘ and for allπΛπ‘ β βπΎπ‘ππ‘, ππ‘
π βπΆπ‘(πΛπ‘) if and only ifΓππ‘ π=1ππ‘
π π(πΛπ‘
π) β₯ Γππ‘ π=1ππ‘
ππ(πΛπ‘
π) for allπ β π.
Proposition 1. All generalized scoring rules satisfy positive responsiveness.18 Proof. Positive responsiveness follows from the fact that the value of the team score function evaluated at any alternative is weakly increasing in that alternativeβs estimated expected utility for each team member, and weakly decreasing in every
other alternativeβs estimated expected utility. β‘
We can now state the main result of this subsection.
18Ifπ π‘
π1> π π‘
π2for allπβπ‘, then the scoring rule also satisfies unanimity.
Theorem 3. IfπΉπ‘ is admissible andπΆπ‘ satisfies unanimity and positive responsive- ness thenππΆ
π‘
satisfies payoff monotonicity.
Proof. Let πΆπ‘ satisfy positive responsiveness and unanimity and πΉπ‘ admissible.
Suppose thatππ‘
πβπ
β²π‘
π =πΏ >0, andππ‘
π =π
β²π‘
π ,βπ β π. Then for all realizations of the estimation errorsππ‘, we have thatππ‘+ππ‘is a monotonic transformation ofπ
β²π‘
+ππ‘with respect toππ‘
π. So by positive responsiveness ofπΆπ‘ we have that ifππ‘
π βπΆπ‘(π
β²π‘
+ππ‘) then ππ‘
π β πΆπ‘(ππ‘ + ππ‘), and if ππ‘
π β πΆπ‘(ππ‘ + ππ‘) then ππ‘
π β πΆπ‘(π
β²π‘
+ ππ‘). So ππΆ
π‘
π (ππ‘+ππ‘) β₯ ππΆ
π‘
π (π
β²π‘+ππ‘)for allππ‘, and thereforeππΆ
π‘
π (ππ‘) β₯ ππΆ
π‘
π (π
β²π‘). To show the strict inequality,ππΆ
π‘
π (ππ‘) > ππΆ
π‘
π (π
β²π‘), we show that there exists a regionπ½ β βπΎπ‘Γππ‘ with positive measure such that if ππ‘ β π½, then ππΆ
π‘
π (ππ‘ +ππ‘) > ππΆ
π‘
π (π
β²π‘ +ππ‘). In particular, unanimity ofπΆπ‘is used as follows to constructπ½such that ifππ‘ β π½, then ππΆ
π‘
π (ππ‘ +ππ‘) = 1 > ππΆ
π‘
π (π
β²π‘
+ππ‘) = 0. That is, such that ππ‘
π is uniquely chosen underππ‘+ππ‘, and not chosen underπ
β²π‘
+ππ‘. Let Λππ‘be an estimated expected utility profile such that all team members strictly prefer some action ππ‘
π to action ππ‘
π, all members prefer ππ‘
π to all other actionsππ‘
π (i.e., all members rank ππ‘
π second), and for all members we have Λππ‘
π βπΛπ‘
π = πΏ2. Define:
π½ ={πΛπ‘βπ
β²π‘
+π :ππ β (0, πΏ
4), ππ β (βπΏ
4,0), ππ < 0} Then ifππ‘ β π½, by unanimity we haveπΆπ‘(π
β²π‘
+ππ‘) = {ππ‘
π} andπΆπ‘(ππ‘ +ππ‘) ={ππ‘
π}, soππΆ
π‘
π (ππ‘+ππ‘) = 1 > ππΆ
π‘
π (π
β²π‘
+ππ‘) = 0. π½ is an open set and hence has positive measure since the distribution ofππ‘has full support. ThereforeππΆ
π‘
π (ππ‘) > ππΆ
π‘
π (π
β²π‘
),
as desired. β‘
Rank Dependence
In this section we show that ππΆ
π‘ satisfies rank dependence for πΎπ‘ = 2 with any collective choice rule satisfying unanimity, positive responsiveness and neutrality, and forπΎπ‘ > 2 with plurality rule and weighted average rules. The formal definition of rank dependence is:
Definition 9. A team collective choice ruleπΆπ‘satisfiesRank Dependenceif, for all ππ‘
π
, ππ‘
π
, πΌ, ππ‘
π(πΌ) > ππ‘
π(πΌ) β ππΆ
π‘
π (ππ‘(πΌ)) > ππΆ
π‘
π (ππ‘(πΌ))
Neutrality is an essential property for proving that team response functions satisfy rank dependence. Informally a neutral team collective choice rule is one that is not biased against or in favor of any particular action. This is analogous to the neutrality axiom from the social choice literature.
Let π : π΄π‘ β π΄π‘ be any permutation of team actions. Denote by ππ‘ ,π = (ππ‘
π(1), ..., ππ‘
π(πΎπ‘))the permuted profile of expected utilities and by Λππ‘ ,π β‘ (ππ‘
π(1)+ ππ‘
π(1), ..., ππ‘
π(πΎπ‘) +ππ‘
π(πΎπ‘)) the permuted profile of estimated expected utilities. We can then define neutrality formally.
Definition 10. A team collective choice ruleπΆπ‘satisfiesNeutralityif, for allππ‘
π,πΛπ‘, for all permutationsπ,ππ‘
π βπΆπ‘(πΛπ‘) βππ‘
π(π) βπΆπ‘(πΛπ‘ ,π)
Neutrality, along with admissibility ofπΉπ‘, imply that when the expected payoffs of team actions are permuted, the team choice probabilities are permuted.
Lemma 1. IfπΉπ‘is admissible andπΆπ‘satisfies neutrality, thenππΆ
π‘
π (ππ‘) =ππΆ
π‘
π(π)(ππ‘ ,π) for all expected utility profiles,ππ‘, actions,ππ‘
π, and permutationsπ.
Proof. By neutrality ofπΆπ‘, for any expected utility profile, ππ‘, action, ππ‘
π, belief error profile,ππ‘, and permutation π, ππΆ
π‘
π (πΛπ‘) = ππΆ
π‘
π(π)(πΛπ‘ ,π), that is the probability that ππ‘
π is chosen at Λππ‘ is equal to the probability that ππ‘
π(π) is chosen at Λππ‘ ,π. Therefore ππΆ
π‘
π (ππ‘) = β«
ππ‘
ππΆ
π‘
π (πΛπ‘)ππΉπ‘(ππ‘) = β«
ππ‘
ππΆ
π‘
π(π)(πΛπ‘ ,π)ππΉπ‘(ππ‘). Finally, since the estimation errors are i.i.d, β«
ππ‘
ππΆ
π‘
π(π)(πΛπ‘ ,π)ππΉπ‘(ππ‘) = β«
ππ‘
ππΆ
π‘
π(π)(πΛπ‘ ,π)ππΉπ‘(ππ‘ ,π) = ππΆ
π‘
π(π)(ππ‘ ,π). β‘
A corollary to the lemma is that when two actions have equal expected payoffs, the team must play these actions with equal probability. It is easy to see that non-neutral collective choice rules can lead to violations of rank dependence. For example, collective choice rules that favor one action (e.g. a status quo action) over another will generally lead to violations, as in the last example of Section 3.1 with a 2/3 voting rule. ConsiderπΎπ‘ =2 and a choice rule that selects actionππ‘
1if and only if all team members estimate its expected utility to be greater than that of actionππ‘
2, and selects actionππ‘
2 otherwise. For any admissible πΉπ‘, if the size of the team is large enough, a team using this choice rule will select action ππ‘
2more often than action ππ‘
1even whenππ‘
1(πΌ) > ππ‘
2(πΌ).
Next, for the case of πΎπ‘ = 2, we prove that neutrality, together with unanimity and positive responsiveness is sufficient to guarantee that a team response function satisfies rank dependence for all admissibleπΉπ‘. This is proved below.
Theorem 4. IfπΎπ‘ =2,πΉπ‘ is admissible andπΆπ‘satisfies unanimity, positive respon- siveness and neutrality, thenππ‘ satisfies rank dependence.
Proof. Pick anyππ‘ such that ππ‘
1 > ππ‘
2 and let πΏ = ππ‘
1β ππ‘
2. Let π
β²π‘
= (ππ‘
1 β πΏ, ππ‘
2), then by lemma 1, ππΆ
π‘
1 (π
β²π‘) = ππΆ
π‘
2 (π
β²π‘) = 12. Since πΆπ‘ satisfies positive responsiveness and unanimity, theorem 3, together with the fact that ππΆ
π‘
1 (ππ‘) + ππΆ
π‘
2 (ππ‘) =1, implies thatππΆ
π‘
1 (ππ‘) > ππΆ
π‘
1 (π
β²π‘
) =ππΆ
π‘
2 (π
β²π‘
) > ππΆ
π‘
2 (ππ‘). β‘
IfπΎπ‘ > 2 the next two propositions prove rank dependence with additional restric- tions on the team collective choice rule.
Theorem 5. If πΉπ‘ is admissible and πΆπ‘ is plurality rule, then ππΆ
π‘
satisfies rank dependence.
Proof. Consider any profile of expected payoffsππ‘. By neutrality and admissibility we can without loss of generality label the actions such thatππ‘
1 β₯ ππ‘
2 β₯ ... β₯ ππ‘
πΎπ‘. By lemma 1, ifππ‘
π =ππ‘
π, thenππΆ
π‘
π =ππΆ
π‘
π . Supposeππ‘
π
> ππ‘
π. The probability that any team memberπranks actionπ highest is ππ =ππ π π(ππ‘
π+ππ‘
π πβmaxπβ π{ππ‘
π +ππ‘
π π} β₯0). Letπ :{1, ..., πΎπ‘} β {1, ..., πΎπ‘}be the pairwise permutation ofπ andπ, that is the permutation that mapsπ toπandπto πand all else to itself. By exchangeability of the error terms,( (ππ‘
1+ππ‘
1, ..., ππ‘
πΎπ‘+ππ‘
πΎπ‘) has the same joint distribution as (ππ‘
1+ππ‘
π(1), ..., ππ‘
πΎπ‘ +ππ‘
π(πΎπ‘)), and soππ‘
π +ππ‘
π π β maxπβ π{ππ‘
π +ππ‘
π π} has the same distribution asππ‘
π +ππ‘
ππ(π) βmaxπβ π{ππ‘
π +ππ‘
ππ(π)}.
Sinceππ‘
π > ππ‘
π, we have for allππ‘
π,ππ‘
π(π)+ππ‘
ππ(π) < ππ‘
π+ππ‘
ππ(π), and maxπβ π{ππ‘
π(π)+ ππ‘
ππ(π)} β₯ maxπβ π{ππ‘
π + ππ‘
ππ(π)}. By the full support assumption, it follows that ππ π π(ππ‘
π+ππ‘
ππ(π)βmaxπβ π{ππ‘
π+ππ‘
ππ(π)} β₯0) > ππ π π(ππ‘
π(π)+ππ‘
ππ(π)βmaxπβ π{ππ‘
π+ ππ‘
ππ(π)} β₯0) = ππ, soππ > ππ. Now, still supposing thatππ‘
π
> ππ‘
π, and therefore thatππ > ππ, denote by(π1, ..., ππΎ) the tuple of number of team members that rank each action first for a given Λππ‘. For any choice setπ΅ β π΄π‘, letππ΅ ={(π1, ..., ππΎ) |βππ βπ΅,βππ, ππ β₯ ππ and ΓπΎ
π=1ππ = π} be the set of feasible βvoteβ totals that result in π΅ being chosen. Then, since estimated expected utilities are independent across individuals conditional onππ‘, we can write the probability of this subset being chosen as
ππ π π(πΆπ‘(πΛπ‘) =π΅)=βοΈ
ππ΅
π!
π1!π2!...ππΎ!Ξ πΎπ=1π
ππ
π
Let π : {1, ..., πΎπ‘} β {1, ..., πΎπ‘} be the pairwise permutation between π andπ as defined earlier. Pick any π΅ that contains ππ and not ππ. Then (π1, ..., ππΎ) β ππ΅
if and only if (ππ(1), , ..., ππ(πΎ)) β π(π΅β{ππ})βͺ{ππ}, the set of vote totals that results in the choice set being π΅, minus ππ and adding ππ. Then, since ππ > ππ, we have that πππ
π
π
ππ π
> π
ππ π
πππ
π , so every term of the sum in ππ π π(πΆπ‘(πΛπ‘) = π΅) is greater than the corresponding term in ππ π π(πΆπ‘(πΛπ‘) = (π΅β {ππ}) βͺ {ππ}). So we have for all π΅ containing ππ and not ππ, that ππ π π(πΆπ‘(πΛπ‘) = π΅) > ππ π π(πΆπ‘(πΛπ‘) = (π΅β {ππ}) βͺ {ππ}). Finally, define π΅0 to be the subsets of π΄π‘ that contain neither ππ norππ, π΅π the subsets containing onlyππ, π΅π the subsets containing onlyππ and notππ, andπ΅π π the set containing both. Then
ππΆ
π‘
π (ππ‘) =0Γ βοΈ
π΅βπ΅0
ππ π π(πΆπ‘(πΛπ‘) =π΅) + βοΈ
π΅βπ΅π
1
|π΅|ππ π π(πΆπ‘(πΛπ‘) = π΅) +0Γ βοΈ
π΅βπ΅π
ππ π π(πΆπ‘(πΛπ‘) = π΅) + βοΈ
π΅βπ΅π π
1
|π΅|ππ π π(πΆπ‘(πΛπ‘) =π΅) ππΆ
π‘
π (ππ‘) =0Γ βοΈ
π΅βπ΅0
ππ π π(πΆπ‘(πΛπ‘) =π΅) +0Γ βοΈ
π΅βπ΅π
ππ π π(πΆπ‘(πΛπ‘) =π΅) + βοΈ
π΅βπ΅π
1
|π΅|ππ π π(πΆπ‘(πΛπ‘) =π΅) + βοΈ
π΅βπ΅π π
1
|π΅|ππ π π(πΆπ‘(πΛπ‘) =π΅) ππΆ
π‘
π andππΆ
π‘
π share all terms of the fourth sum, so ππΆ
π‘
π [ππ‘] βππΆ
π‘
π [ππ‘] = βοΈ
π΅βπ΅π
1
|π΅|ππ π π[πΆπ‘(πΛπ‘)= π΅] β βοΈ
π΅βπ΅π
1
|π΅|ππ π π[πΆπ‘(πΛπ‘) =π΅] ππΆ
π‘
π [ππ‘] βππΆ
π‘
π [ππ‘] = βοΈ
π΅βπ΅π
1
|π΅|[ππ π π[πΆπ‘(πΛπ‘) =π΅] βππ π π[πΆπ‘(πΛπ‘) = (π΅β {ππ}) βͺ {ππ}]] > 0 Therefore, wheneverππ‘
π
> ππ‘
π, we haveππ > ππ, which impliesππΆ
π‘
π (ππ‘) > ππΆ
π‘
π (ππ‘).
β‘
Define a weighted average rule as follows:
Definition 11. A team collective choice rule πΆπ‘ is a Weighted Average Rule if there exists a profile of non-negative individual voting weights, (π€π‘
1, ..., π€π‘
ππ‘) with Γππ‘
π=1π€π‘
π = 1such that for allππ‘
π β π΄π‘ and for allπΛπ‘ β βπΎπ‘ππ‘, ππ‘
π β πΆπ‘(πΛπ‘) if and only ifΓππ‘
π=1π€π‘
ππΛπ‘
π π β₯ Γππ‘ π=1π€π‘
ππΛπ‘
ππ for allπ β π .
Theorem 6. IfπΉπ‘is admissible andπΆπ‘is a weighted average rule, thenππΆ
π‘ satisfies rank dependence.
Proof. Consider any profile of expected payoffs ππ‘, and suppose ππ‘
π
> ππ‘
π. We have ππΆ
π‘
π (ππ‘) = β«
1{Γππ‘
π=1π€π‘
ππΛπ‘
π π β₯ max{Γππ‘
π=1π€ππΛπ‘
π π}πΎπ‘
π=1}ππΉπ‘. Note that the prob- ability that any of these weighted averages are exactly equal is 0. Now, since
πΉπ‘(π¦βππ‘
π) < πΉπ‘(π¦ βππ‘
π) for all π¦ β β, we have Λππ‘
π π
>π π‘ πΛπ‘
ππ, where >π π‘ denotes thestrictfirst stochastic order, for all membersπ. This order is closed under convo- lutions, soΓππ‘
π=1π€π‘
ππΛπ‘
π π >π π‘ Γππ‘
π=1π€π‘
ππΛπ‘
ππ. Since1{π§ > 0} is increasing, non-constant and bounded, we therefore have
β« 1{
π
βοΈ
π=1
π€ππΛπ‘
π π β₯ max{
π
βοΈ
π=1
π€ππΛπ‘
π π}πΎπ=1}}ππΉ >
β« 1{
π
βοΈ
π=1
π€ππΛπ‘
ππ β₯ max{
π
βοΈ
π=1
π€ππΛπ‘
π π}πΎπ=1}}ππΉ ππΆ
π‘
π (ππ‘) > ππΆ
π‘
π (ππ‘)
β‘