Behavioral Implications of Causal Misperceptions Part I

(1)


Ran Spiegler (TAU & UCL)

ES Winter School, Delhi

December 2019

(2)

Equilibrium without Rational Expectations

• Standard equilibrium analysis in economics:

1. Steady-state approach (even in dynamic models)

2. Agents best-reply to their beliefs.

3. Agents’ beliefs reflect perfect understanding of the steady-state empirical regularities.

• Research program: Keep 1+2, relax 3

(3)

Examples from the Literature

• Sampling: Players evaluate action-consequence mapping via finite samples (Osborne-Rubinstein 1998)

• Coarseness: Beliefs are measurable w.r.t a partition of all contingencies (Piccione-Rubinstein 2003, Jehiel 2005)

• “Cursedness”: Players cannot perceive dependence of endogenous variables on factors other than their information (Eyster-Rabin 2005)

• Naïve extrapolation from selective samples (Esponda 2008)

(4)

Common Feature

• Agents interpret statistical regularities through the prism of a (wrong) subjective model.

– The model involves errors of causal attribution.

– The concepts differ in the kind of causal misattribution they assume and the data agents use to “estimate” their model.

• In this lecture series: A formalism of equilibrium with non-rational expectations that takes this description as a starting point

(5)

In this Lecture Series…

• Decision makers are endowed with subjective causal models, formalized as directed acyclic graphs (DAGs).

– Relying heavily on a rich Statistics/AI literature on probabilistic graphical models (“Bayesian networks”)

• Capturing agents who “mistake correlation for causation”

• Partial unification of earlier approaches

(6)

In this Lecture Series…

• Bayesian networks offer tools for representing causal misperceptions and analyzing their behavioral implications.

• Applications: Health/lifestyle/occupational decisions, demand for education, monetary policy, narratives and political beliefs, contracting with agents who misperceive their production function

• Opening the door for the study of causal reasoning by people other than Joshua Angrist...

(7)

Sources

• Forthcoming article in Annual Rev. of Econ.

• Three specific papers:

1. “Bayesian Networks and Boundedly Rational Expectations” (QJE 2016)

2. “Can Agents with Causal Misperceptions be Systematically Fooled?” (JEEA, forthcoming)

3. “A Model of Competing Narratives” (joint with Kfir Eliaz)

• Bayesian networks in Statistics and AI (Lauritzen 1996, Cowell et al. 1999, Pearl 2009, Koller-Friedman 2009, Pearl-Mackenzie 2018)

• Psychology of causal reasoning (Sloman 2005, Lagnado-Sloman 2015)

(8)

Lecture Plan

• Lecture 1: Individual Behavior

– Using DAGs to represent causal misperceptions

– Individual decision making as “personal equilibrium”

• Lecture 2: Interaction

– Leader-follower model

– A “monetary policy” application

• Lecture 3: Endogenous Causal Models

– A model of competing political narratives

(9)

• An agent chooses whether to consume a dietary supplement.

• Three variables take values in $\{0,1\}$:

– $a$ represents the agent’s action (1 means consuming)

– $h$ represents state of health (1 means good health)

– $c$ represents blood chemical level (1 means abnormal)

• The agent’s payoff is $h - ka$, where $k > 0$ is constant.

Example: The Dieter’s Dilemma

(10)

• $p$ is a long-run (steady-state) distribution over $a, h, c$.

• $p(h = 1) = 0.5$, independently of $a$.

– The agent’s rational choice would be $a = 0$.

• $p(c = 1 \mid a, h) = (1 - a)(1 - h)$

– Chemical level is normal if the agent is healthy or if he takes the supplement.

The Dieter’s Dilemma

(11)

• The agent has a subjective causal model, represented by a directed acyclic graph (DAG) $R$ over the three variables:

$a \to c \to h$

– A causal chain from action to health via chemical level

• The agent fits his causal model to the long-run distribution:

$p_R(a, h, c) = p(a)\, p(c \mid a)\, p(h \mid c)$

The Dieter’s Dilemma

(12)

• The agent relies on $p_R(a, h, c) = p(a)\, p(c \mid a)\, p(h \mid c)$ to compute

$p_R(h \mid a) = \sum_c p(c \mid a)\, p(h \mid c)$

• Why doesn’t the agent directly estimate $p(h \mid a)$?

– The benefit of using models

– Differential availability of data about various correlations

(13)

• $R: a \to c \to h$ assumes $h \perp a \mid c$. This assumption is false:

– Given normal chemical level, if we learn that the agent didn’t take the supplement, we infer that he must be healthy.

• $p(a, h, c) = p(a)\, p(h)\, p(c \mid a, h)$

– This is consistent with a “true” causal model $a \to c \leftarrow h$.

– $R$ exhibits reverse causality w.r.t. the true model.
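
The failure of conditional independence can be checked numerically. The sketch below builds the objective distribution from the primitives of the example and compares the probability of good health given a normal chemical level, with and without the supplement. The value 0.4 for the frequency of taking the supplement is an arbitrary illustrative assumption, and the function names are hypothetical.

```python
from itertools import product

def joint(alpha):
    """Objective steady-state distribution p(a, h, c) of the Dieter's Dilemma.

    alpha = p(a = 1) is the long-run frequency of taking the supplement.
    """
    p = {}
    for a, h, c in product((0, 1), repeat=3):
        p_a = alpha if a == 1 else 1 - alpha
        p_h = 0.5                          # health is independent of the action
        p_c1 = (1 - a) * (1 - h)           # p(c = 1 | a, h)
        p_c = p_c1 if c == 1 else 1 - p_c1
        p[(a, h, c)] = p_a * p_h * p_c
    return p

def p_h1_given(p, a, c):
    """p(h = 1 | a, c), computed directly from the joint distribution."""
    return p[(a, 1, c)] / (p[(a, 0, c)] + p[(a, 1, c)])

p = joint(alpha=0.4)                 # 0.4 is an illustrative assumption
print(p_h1_given(p, a=0, c=0))       # 1.0: no supplement + normal level => healthy
print(p_h1_given(p, a=1, c=0))       # 0.5: the supplement makes c uninformative
```

Since the two conditional probabilities differ, $h$ is not independent of $a$ given $c$ under the objective distribution.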

The Dieter’s Dilemma

(14)

• The agent’s subjective expected utility from $a$:

$\sum_h p_R(h \mid a)(h - ka) = p_R(h = 1 \mid a) - ka$

• Does $p_R(h \mid a)$ correctly measure the causal effect of $a$ on $h$ given the agent’s subjective model?

• Mistaking correlation for causation

(15)

$p_R(h \mid a) = \sum_c p(c \mid a)\, p(h \mid c)$

• The agent computes the effect of $a$ on $h$ as if $h \perp a \mid c$.

• But we saw that $p$ does not satisfy this property.

– It follows that $p(h \mid c)$ is not invariant to $p(a)$.

– Calls for an equilibrium notion of subjective maximization!

The Dieter’s Dilemma

(16)

• “Personal equilibrium”: If $p(a) > 0$, then $a$ maximizes subjective expected utility w.r.t. $p_R(h \mid a)$.

• Need to introduce trembles for the definition to be precise.

(17)

• Look for a personal equilibrium with full support:

$\mathrm{WTP} = p_R(h = 1 \mid a = 1) - p_R(h = 1 \mid a = 0) = k$

$p_R(h = 1 \mid a) = \sum_c p(c \mid a)\, p(h = 1 \mid c)$

• Just calculate the relevant conditional probabilities!

• $\alpha = p(a = 1)$ denotes the probability of taking the supplement.

The Dieter’s Dilemma

(18)

• Recall: $h = 1$ is good health, $c = 0$ is normal chemical level.

• $p(c = 0 \mid a = 1) = 1$, $p(c = 0 \mid a = 0) = 0.5$

• $p(h = 1 \mid c = 1) = 0$, $p(h = 1 \mid c = 0) = \frac{0.5}{0.5 + 0.5\alpha}$

• Condition for interior equilibrium:

$\mathrm{WTP} = [\,p(c = 0 \mid a = 1) - p(c = 0 \mid a = 0)\,] \cdot p(h = 1 \mid c = 0)$

$= 0.5 \cdot \frac{0.5}{0.5 + 0.5\alpha} = k$

(19)

• $\alpha = (1 - 2k)/(2k)$

• An interior (unique) equilibrium exists for $k \in (0.25, 0.5)$.

• Summary:

– The agent misreads the $c$-$h$ correlation as a causal effect of $c$ on $h$.

– This results in equilibrium “mixing” (sub-optimal consumption occurs with positive long-run frequency).
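
The equilibrium computation can be reproduced in a few lines. This is a minimal sketch under the example's assumptions: it evaluates the subjective belief $p_R(h = 1 \mid a)$ as a function of the long-run frequency $\alpha$ of taking the supplement and checks that the perceived gain from the supplement equals the cost $k$ at $\alpha = (1 - 2k)/(2k)$. The function names and the value $k = 0.3$ are illustrative assumptions.

```python
def p_R_h1_given_a(a, alpha):
    """Subjective p_R(h=1|a) = sum_c p(c|a) p(h=1|c) under the chain a -> c -> h."""
    p_c0_given_a = 1.0 if a == 1 else 0.5         # p(c = 0 | a)
    p_h1_given_c0 = 0.5 / (0.5 + 0.5 * alpha)     # depends on the agent's own strategy
    p_h1_given_c1 = 0.0                           # abnormal level implies bad health
    return p_c0_given_a * p_h1_given_c0 + (1 - p_c0_given_a) * p_h1_given_c1

def perceived_gain(alpha):
    """Perceived increase in the probability of good health from taking the supplement."""
    return p_R_h1_given_a(1, alpha) - p_R_h1_given_a(0, alpha)

def interior_equilibrium(k):
    """Mixing probability alpha that makes the agent indifferent, or None if none exists."""
    if not 0.25 < k < 0.5:
        return None
    return (1 - 2 * k) / (2 * k)

k = 0.3                                  # illustrative cost, inside (0.25, 0.5)
alpha = interior_equilibrium(k)
print(alpha, perceived_gain(alpha))      # the perceived gain equals k at the equilibrium
```

Note that the perceived gain falls as $\alpha$ rises: the more often the agent takes the supplement, the weaker the observed link between a normal chemical level and good health.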

The Dieter’s Dilemma

(20)

Bayesian Networks

• $x_0, x_1, \ldots, x_n$ is a collection of variables, $x_i \in X_i$.

• $X = X_0 \times \cdots \times X_n$

• $p \in \Delta(X)$ is an objective long-run probability distribution.

• Standard chain rule (arbitrary enumeration of variables):

$p(x) = p(x_0)\, p(x_1 \mid x_0) \cdots p(x_n \mid x_0, \ldots, x_{n-1})$

(21)

Bayesian Networks

• A causal model is a directed acyclic graph (DAG) $(N, R)$:

– $N$ is a set of nodes that represent variables.

– $R$ is a set of directed links (use $iRj$ or $i \to j$ interchangeably) that represent perceived causal relations.

– $R(i) = \{j \in N \mid jRi\}$ is the set of “immediate causes of $i$”.

– I’ll often suppress $N$ and identify causal models with $R$.

(22)

Bayesian Networks

• Factorize $p$ according to $R$:

$p_R(x) = \prod_{i \in N} p(x_i \mid x_{R(i)})$

• $1 \to 0 \to 3 \leftarrow 2 \;\Rightarrow\; p_R(x) = p(x_1)\, p(x_2)\, p(x_0 \mid x_1)\, p(x_3 \mid x_0, x_2)$

• Fully connected DAG $\Rightarrow$ Standard chain rule

• Empty DAG $\Rightarrow$ $p_R(x) = p(x_0) \cdots p(x_n)$
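
The factorization formula can be sketched as a short function. The code below is an illustration, not a reference implementation: it assumes the joint distribution is given as a dictionary defined on the full product grid (zero entries included), and that each node's immediate causes are listed by coordinate index. The names `marginal` and `factorize` are hypothetical. It is applied to the Dieter's Dilemma distribution with coordinates (a, h, c) and an illustrative frequency 0.4 of taking the supplement.

```python
from itertools import product

def marginal(p, idx):
    """Marginal of the joint p (dict: value tuple -> probability) on coordinates idx."""
    out = {}
    for x, prob in p.items():
        key = tuple(x[i] for i in idx)
        out[key] = out.get(key, 0.0) + prob
    return out

def factorize(p, parents):
    """Return p_R(x) = prod_i p(x_i | x_{R(i)}); parents[i] is the tuple R(i)."""
    # For each node i, precompute the marginals on (R(i), i) and on R(i).
    tables = []
    for i, pa in enumerate(parents):
        pa = tuple(pa)
        tables.append((pa, marginal(p, pa + (i,)), marginal(p, pa)))
    p_R = {}
    for x in p:
        value = 1.0
        for i, (pa, joint_tab, parent_tab) in enumerate(tables):
            x_pa = tuple(x[j] for j in pa)
            den = parent_tab[x_pa]
            # Conditionals on zero-probability events are set to 0 in this sketch.
            value *= joint_tab[x_pa + (x[i],)] / den if den > 0 else 0.0
        p_R[x] = value
    return p_R

# Dieter's Dilemma, coordinates (a, h, c) = (0, 1, 2); alpha = 0.4 is illustrative.
alpha = 0.4
p = {(a, h, c): (alpha if a else 1 - alpha) * 0.5
     * ((1 - a) * (1 - h) if c else 1 - (1 - a) * (1 - h))
     for a, h, c in product((0, 1), repeat=3)}

chain = [(), (2,), (0,)]     # a -> c -> h: R(a) empty, R(h) = {c}, R(c) = {a}
full = [(), (0,), (0, 1)]    # fully connected DAG: standard chain rule

p_R = factorize(p, chain)
p_R_h1_a1 = sum(v for (a, h, c), v in p_R.items() if a == 1 and h == 1) / alpha
print(round(p_R_h1_a1, 4))   # 0.7143, whereas the objective p(h=1 | a=1) is 0.5
```

Applying `factorize` with the fully connected DAG returns the original distribution, which is the chain-rule case noted above.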

(23)

Bayesian Networks

• The set of distributions that are consistent with $R$ constitutes a Bayesian network.

– Representing conditional-independence assumptions

– Platform for algorithmic probabilistic inference (Pearl 1988, Cowell et al. 1999, Koller-Friedman 2009)

– Platform for algorithmic causal inference (Pearl 2009)

(24)

What Does $R$ Mean in the Present Context?

• $R$ encodes a systematic distortion of objective distributions into subjective beliefs.

• Unlike the subjective-priors approach, here the primitive is not a belief but a belief distortion function.

• In larger models, $R$ will be part of an agent’s “type”.

(25)

• Suppose $n = 2$ and $p(x) \equiv p(x_0)\, p(x_1)\, p(x_2 \mid x_0, x_1)$.

– Consistent with a “true DAG”: $0 \to 2 \leftarrow 1$

• Subjective DAGs that exhibit specific errors:

– Coarse reasoning: $0 \to 2 \quad 1$ (omitting a link)

– Reverse causality: $0 \to 2 \to 1$ (inverting a link)

Capturing Belief Errors

(26)

• Suppose $n = 2$ and $p(x) \equiv p(x_0)\, p(x_1 \mid x_0)\, p(x_2 \mid x_0)$.

– Consistent with a “true DAG”: $2 \leftarrow 0 \to 1$

– Interpret 0 as a state of Nature and 2 as an opponent’s move

• A subjective DAG that exhibits an attribution error (reorienting a link): $0 \to 1 \to 2$

– “Illusion of control”: 1 represents a decision maker’s action

– Analogical reasoning: 1 is an “analogy class”

Capturing Belief Errors

(27)

• Suppose $n = 3$ and $p(x) \equiv p(x_0)\, p(x_1)\, p(x_2 \mid x_0, x_1)\, p(x_3 \mid x_1)$.

– Consistent with a “true DAG”: $0 \to 2 \leftarrow 1 \to 3$

• Subjective DAG that exhibits confounder neglect: $0 \to 2 \to 3$

– Omitting a node and its links

– Causal interpretation of confounding-based correlation

Capturing Belief Errors

(28)

• Suppose $n = 3$ and $p(x) \equiv p(x_0)\, p(x_1 \mid x_0)\, p(x_2 \mid x_0)\, p(x_3 \mid x_1, x_2)$.

– Consistent with a “true DAG” with two channels from 0 to 3: $0 \to 1 \to 3$ and $0 \to 2 \to 3$

• Subjective DAG that neglects a causal channel: $0 \to 2 \to 3$

– Unawareness of some causal channels

– Neglecting indirect/equilibrium effects of economic policies

Capturing Belief Errors

(29)

Observationally Equivalent DAGs

Definition: $R$ and $Q$ are equivalent if $p_R \equiv p_Q$ for every $p$.

• $1 \to 2$ and $2 \to 1$ are equivalent since

$p(x_1)\, p(x_2 \mid x_1) = p(x_2)\, p(x_1 \mid x_2)$

• Observational equivalence ≠ Causal equivalence

(30)

• Proposition (Frydenberg 1990, Verma-Pearl 1991): Two DAGs are equivalent if and only if they have the same undirected version and the same set of “immoralities” ($i \to k \leftarrow j$ without a direct link between $i$ and $j$).

– $0 \to 2 \to 1$ and $0 \leftarrow 2 \leftarrow 1$ are equivalent.

– $0 \to 2 \to 1$ and $0 \to 2 \leftarrow 1$ are not equivalent.
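
The equivalence criterion lends itself to a direct check. The following sketch assumes both DAGs are defined over the same node set, that nodes carry integer labels, and that each DAG is given as a list of directed links (i, j) meaning i → j. All function names are hypothetical.

```python
def skeleton(edges):
    """Undirected version of the DAG: the set of linked pairs, ignoring direction."""
    return {frozenset(e) for e in edges}

def immoralities(edges):
    """All triples (i, k, j) with i -> k <- j and no direct link between i and j."""
    skel = skeleton(edges)
    parents = {}
    for i, j in edges:
        parents.setdefault(j, set()).add(i)
    found = set()
    for k, pa in parents.items():
        for i in pa:
            for j in pa:
                if i < j and frozenset((i, j)) not in skel:
                    found.add((i, k, j))
    return found

def equivalent(edges_R, edges_Q):
    """Same undirected version and same immoralities."""
    return (skeleton(edges_R) == skeleton(edges_Q)
            and immoralities(edges_R) == immoralities(edges_Q))

print(equivalent([(0, 2), (2, 1)], [(2, 0), (1, 2)]))   # True:  0->2->1 vs 0<-2<-1
print(equivalent([(0, 2), (2, 1)], [(0, 2), (1, 2)]))   # False: 0->2->1 vs 0->2<-1
```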

Observationally Equivalent DAGs

(31)

Decision Model

• For simplicity, let us focus on an uninformed DM.

– $x_0$ is the DM’s action.

– 0 is an ancestral node in true and subjective DAGs.

• $(p(x_{-0} \mid x_0))_{x_0}$ is exogenous and fixed.

• $(p(x_0))_{x_0}$ is the DM’s (endogenous) strategy.

• $u: X \to \mathbb{R}$ is the DM’s vNM utility function.

(32)

• Subjective EU maximization: If $p(x_0) > 0$, $x_0$ should maximize

$\sum_{x_{-0}} p_R(x_{-0} \mid x_0)\, u(x_0, x_{-0})$

• The DM treats $p_R(x_{-0} \mid x_0)$ as the causal effect of $x_0$ on $x_{-0}$.

• “Mistaking correlation for causation”?

• The assumption that 0 is ancestral in $R$ ensures the DM does not err in this regard (Pearl’s “do-calculus”, 2009).

Decision Model

(33)

• Do-calculus is left outside the scope of this lecture series.

• The DM has a wrong causal model, but he draws correct causal inferences from observational data given the model.

Decision Model

(34)

• As we saw in the “Dieter’s Dilemma”, $p_R(x_{-0} \mid x_0)$ may be ill-defined without knowing $p(x_0)$ (the DM’s strategy).

• Need for an equilibrium model of individual choice

Decision Model

(35)

Definition: A strategy $(p(x_0))_{x_0}$ with full support is a personal $\varepsilon$-equilibrium if whenever $p(x_0) > \varepsilon$,

$x_0 \in \arg\max_{x_0'} \sum_{x_{-0}} p_R(x_{-0} \mid x_0')\, u(x_0', x_{-0})$

• A personal equilibrium is a limit of personal $\varepsilon$-equilibria as $\varepsilon \to 0$.

– Because 0 is an ancestral node, the exact perturbation doesn’t matter (it would in an informed-DM model).

Personal Equilibrium

(36)

• Consider a three-variable environment:

– $a$ is the DM’s action.

– $\theta$ is an exogenous state of nature.

– $z$ is a consequence.

• $p(a, \theta, z) \equiv p(a)\, p(\theta)\, p(z \mid a, \theta)$

• $p$ is consistent with a “true DAG” $a \to z \leftarrow \theta$.

When is Personal Equilibrium Needed?

(37)

• True DAG: $a \to z \leftarrow \theta$

– $p(\theta, z \mid a) = p(\theta)\, p(z \mid a, \theta)$

• Subjective DAG: $a \to z \quad \theta$

– $p_R(\theta, z \mid a) = p(\theta)\, p(z \mid a) = p(\theta) \sum_{\theta'} p(\theta')\, p(z \mid a, \theta')$

– Invariant to $(p(a))_a$; personal equilibrium is reducible to maximization w.r.t. a wrong belief.

Coarse Reasoning

(38)

• True DAG: $a \to z \leftarrow \theta$

– $p(\theta, z \mid a) = p(\theta)\, p(z \mid a, \theta)$

• Subjective DAG: $a \to z \to \theta$

• $p_R(\theta, z \mid a) = p(z \mid a)\, p(\theta \mid z) = p(z \mid a) \sum_{a'} p(a' \mid z)\, p(\theta \mid a', z)$

• $p_R(\theta, z \mid a)$ is sensitive to $(p(a))_a$ through the term $p(a' \mid z)$; personal equilibrium is not reducible to maximization (as in the “Dieter’s Dilemma”).
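
The contrast between coarse reasoning and reverse causality can be illustrated numerically. The sketch below is built on assumptions that are not part of the general model: binary variables, $p(\theta = 1) = 0.5$, and $p(z = 1 \mid a, \theta) = (1 - a)(1 - \theta)$, borrowed from the Dieter's Dilemma. It evaluates both subjective beliefs at $(\theta, z) = (1, 0)$ given $a = 0$ for two different strategies. Function names are hypothetical.

```python
from itertools import product

def joint(alpha, p_theta1=0.5):
    """Objective p(a, theta, z) = p(a) p(theta) p(z | a, theta), with alpha = p(a = 1)."""
    p = {}
    for a, theta, z in product((0, 1), repeat=3):
        p_a = alpha if a else 1 - alpha
        p_t = p_theta1 if theta else 1 - p_theta1
        p_z1 = (1 - a) * (1 - theta)       # illustrative assumption for p(z=1 | a, theta)
        p[(a, theta, z)] = p_a * p_t * (p_z1 if z else 1 - p_z1)
    return p

def prob(p, a=None, theta=None, z=None):
    """Probability of the event that fixes the non-None coordinates."""
    return sum(v for (a_, t_, z_), v in p.items()
               if (a is None or a_ == a)
               and (theta is None or t_ == theta)
               and (z is None or z_ == z))

def coarse(p, a, theta, z):
    """Subjective DAG with a -> z and theta isolated: p(theta) p(z | a)."""
    return prob(p, theta=theta) * prob(p, a=a, z=z) / prob(p, a=a)

def reverse(p, a, theta, z):
    """Subjective DAG a -> z -> theta: p(z | a) p(theta | z)."""
    return (prob(p, a=a, z=z) / prob(p, a=a)
            * prob(p, theta=theta, z=z) / prob(p, z=z))

for alpha in (0.2, 0.8):
    p = joint(alpha)
    print(alpha, coarse(p, 0, 1, 0), reverse(p, 0, 1, 0))
```

Under these assumptions the coarse belief is the same for both strategies, while the reverse-causality belief changes with the strategy, which is why the latter calls for an equilibrium notion.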

Reverse Causality

(39)

• When is personal equilibrium reducible to maximization?

– $p_R(\cdot \mid x_0)$ shouldn’t change with the marginal of $p$ over $x_0$.

Definition: $R$ is c-rational w.r.t. a “true DAG” $R^*$ if for every $p, q$ that are consistent with $R^*$, if $p(\cdot \mid x_0) = q(\cdot \mid x_0)$ for every $x_0$, then $p_R(\cdot \mid x_0) = q_R(\cdot \mid x_0)$ for every $x_0$.

C-Rationality

(40)

• $x_i \perp_R x_j \mid x_A$ denotes a conditional-independence property that holds for all distributions that are consistent with $R$.

• This property has a computationally simple graphical characterization known as d-separation.

– Patterns of “path blocking”

– Basic material in any introduction to Bayesian networks

C-Rationality

(41)

Proposition: Let $R^*(0) = R(0) = \emptyset$. Then, $R$ is c-rational w.r.t. $R^*$ iff $0 \notin R(i)$ implies $x_i \perp_{R^*} x_0 \mid x_{R(i)}$.

• Illustrating the result for $R^*: 0 \to 2 \leftarrow 1$

– Coarse reasoning: $0 \to 2 \quad 1$

– $R(1) = \emptyset$; and indeed, $x_1 \perp_{R^*} x_0$.

C-Rationality

(42)

Proposition: Let $R^*(0) = R(0) = \emptyset$. Then, $R$ is c-rational w.r.t. $R^*$ iff $0 \notin R(i)$ implies $x_i \perp_{R^*} x_0 \mid x_{R(i)}$.

• Illustrating the result for $R^*: 0 \to 2 \leftarrow 1$

– Reverse causality: $0 \to 2 \to 1$

– $R(1) = \{2\}$; but not $x_1 \perp_{R^*} x_0 \mid x_2$.
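
The condition in the proposition can be checked mechanically. The sketch below tests d-separation with the standard moral-graph criterion (restrict to the ancestral set, marry co-parents, drop directions, delete the conditioning set, test connectivity), which is one of several equivalent ways to do it and is not necessarily the method used in the lecture. DAGs are given as dictionaries mapping every node to the tuple of its immediate causes; all names are hypothetical.

```python
def ancestral_set(parents, nodes):
    """The given nodes together with all their ancestors in the DAG."""
    seen, stack = set(), list(nodes)
    while stack:
        v = stack.pop()
        if v not in seen:
            seen.add(v)
            stack.extend(parents[v])
    return seen

def d_separated(parents, x, y, given):
    """True if x and y are d-separated by `given` in the DAG (moral-graph test)."""
    keep = ancestral_set(parents, {x, y} | set(given))
    adj = {v: set() for v in keep}
    for v in keep:
        pa = list(parents[v])
        for u in pa:                       # undirected parent-child links
            adj[u].add(v)
            adj[v].add(u)
        for u in pa:                       # marry parents that share a child
            for w in pa:
                if u != w:
                    adj[u].add(w)
    blocked = set(given)                   # delete the conditioning set
    seen, stack = set(), [x]
    while stack:
        v = stack.pop()
        if v == y:
            return False                   # x still reaches y: not separated
        if v in seen or v in blocked:
            continue
        seen.add(v)
        stack.extend(adj[v] - blocked)
    return True

def c_rational(parents_R, parents_true, action=0):
    """Check: action not in R(i) implies x_i is d-separated from x_action given x_R(i)."""
    for i, pa in parents_R.items():
        if i == action or action in pa:
            continue
        if not d_separated(parents_true, i, action, pa):
            return False
    return True

true_dag = {0: (), 1: (), 2: (0, 1)}                           # 0 -> 2 <- 1
print(c_rational({0: (), 1: (), 2: (0,)}, true_dag))           # coarse reasoning: True
print(c_rational({0: (), 2: (0,), 1: (2,)}, true_dag))         # reverse causality: False
```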

C-Rationality

(43)

• True DAG $R^*$: $a \to s \leftarrow \theta \to w$

• The DM is a parent.

– $a \in [0,1]$ is the parent’s investment in his child’s education.

– $\theta \in \{0,1\}$ is the child’s “innate ability”.

– $s \in \{0,1\}$ is the child’s school performance.

– $w \in \{0,1\}$ is the child’s labor-market outcome.

Example: Spurious Demand for Education

(44)

• $u$ is purely a function of $a$ and $w$.

• The parent’s subjective DAG is $R: a \to s \to w$.

• Interpretation: $\theta$ is an unobservable confounder; the parent’s error is that he ignores it.

Demand for Education

(45)

• The parent’s DAG $R: a \to s \to w$ violates c-rationality:

– $R(w) = \{s\}$, but not $w \perp_{R^*} a \mid s$

– Reason: $p(w \mid s) = \sum_\theta p(\theta \mid s)\, p(w \mid \theta)$, and $p(\theta \mid s)$ is sensitive to parental investment.

– Easy to check graphically using d-separation

Demand for Education

(46)

• Put more structure on the exogenous components of $p$.

• $p(s = 1 \mid a, \theta) = a\theta$; $p(w = 1 \mid \theta) = \beta\theta$

– $s = 1$ represents success at school.

– $w = 1$ represents success in the labor market.

– High ability ($\theta = 1$) is necessary for both.

– School success as such is irrelevant for labor-market success.

(47)

• $p(\theta = 1) = \delta$; $p(s = 1 \mid a, \theta) = a\theta$; $p(w = 1 \mid \theta) = \beta\theta$

• $u(a, w) = w - 0.5a^2$

• Under rational expectations, the parent chooses $a^* = 0$.

– Parental investment doesn’t affect the child’s ability, which is the sole determinant of his labor-market outcome.

Demand for Education

(48)

Result: Under $R: a \to s \to w$, there is a unique personal equilibrium. The parent plays $a^{**} > 0$ given by

$a^{**} = \frac{\beta\delta(1 - \delta)}{1 - \delta + \delta(1 - a^{**})}$

– Marginal cost = Perceived marginal benefit

– Long-run behavior affects perceived marginal benefit.

– $a^{**}$ increases with $\beta$ and as $\delta$ moves away from 0 or 1.

(49)

• $p(s = 1 \mid a) = \delta a$. Perceived benefit from investment $a$ is:

$p(s = 1 \mid a)\, p(w = 1 \mid s = 1) + p(s = 0 \mid a)\, p(w = 1 \mid s = 0)$

$= p(w = 1 \mid s = 0) + p(s = 1 \mid a)\, [\,p(w = 1 \mid s = 1) - p(w = 1 \mid s = 0)\,]$

• $p(w = 1 \mid s = 1) = \beta$ because high ability is necessary for success at school.

• In contrast, $s = 0$ can result from $\theta = 0$ or low $a$.

Sketch of Proof

(50)

$p(w = 1 \mid s = 0) = \frac{p(\theta = 1) \sum_{a'} p(a')(1 - a')\, \beta}{p(\theta = 0) + p(\theta = 1) \sum_{a'} p(a')(1 - a')} = \gamma\beta$

– The parent’s chosen value of $a$ does not feature in this term!

• Perceived benefit of investment $a$ is $\gamma\beta + \delta a(\beta - \gamma\beta)$.

• Marginal perceived benefit from $a$ is $\delta(\beta - \gamma\beta)$.

– Overestimation gets worse when $p(a')$ shifts to the right. This complementarity could lead to multiple equilibria for other cost functions $c$.
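
The fixed-point condition for the parent's investment can be solved in closed form, which gives a quick numerical check of the proof sketch. The code below is a sketch under stated assumptions: the long-run strategy is degenerate at a single investment level, `beta` denotes the slope parameter in $p(w = 1 \mid \theta)$, $0 < \delta < 1$, and the marginal cost of investment is $a$ (from the quadratic cost $0.5a^2$). Multiplying the fixed-point condition through by its denominator $1 - \delta a$ gives the quadratic $\delta a^2 - a + \beta\delta(1 - \delta) = 0$, whose smaller root lies in $[0, 1]$. Function names and parameter values are illustrative.

```python
import math

def gamma(a_bar, delta):
    """p(theta=1 | s=0) when the long-run investment is a_bar with probability one."""
    fail_high_ability = delta * (1 - a_bar)          # theta = 1 but s = 0
    return fail_high_ability / (1 - delta + fail_high_ability)

def perceived_marginal_benefit(a_bar, beta, delta):
    """delta * (beta - gamma * beta), evaluated at the long-run investment a_bar."""
    return delta * (beta - gamma(a_bar, delta) * beta)

def equilibrium_investment(beta, delta):
    """Smaller root of delta*a^2 - a + beta*delta*(1-delta) = 0 (the root in [0, 1])."""
    disc = 1 - 4 * beta * delta ** 2 * (1 - delta)
    return (1 - math.sqrt(disc)) / (2 * delta)

beta, delta = 0.8, 0.5                                # illustrative parameter values
a_star = equilibrium_investment(beta, delta)
# At the equilibrium, marginal cost a equals the perceived marginal benefit.
print(a_star, perceived_marginal_benefit(a_star, beta, delta))
```

With these values the equilibrium investment is roughly 0.225, even though the rational-expectations choice is zero.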

Sketch of Proof

(51)

• DAGs represent subjective causal models.

• The factorization formula $p_R$ represents systematic belief distortion due to fitting a wrong causal model to long-run data.

• Personal equilibrium: Subjective EU maximization w.r.t. $p_R$

• Bayesian-network tools (d-separation) help us understand when to expect equilibrium effects.
