PORTFOLIO SELECTION WITH MONOTONE MEAN V

(1)

Portfolio Selection with

Monotone Mean-Variance Preferences

∗

Fabio Maccheroni

Istituto di Metodi Quantitativi and IGIER, Universit`a Bocconi

Massimo Marinacci

Dipartimento di Statistica e Matematica Applicata and ICER, Universit`a di Torino

Aldo Rustichini

Department of Economics, University of Minnesota

Marco Taboga

†

Research Department, Banca d’Italia and Universit`a di Torino

June 2005

∗_{We thank Erio Castagnoli, Rose-Anne Dana, Paolo Guasoni, Giovanna Nicodano,}

Giacomo Scandolo, Elena Vigna, participants to the VI Quantitative Finance Workshop (Milan, January 2005), SMAI 2005 (Evian, May 2005), RUD 2005 (Heidelberg, June 2005) and the seminar audiences at Banca d’Italia, Boston University, Università di Salerno, Uni-versità Bocconi, Università del Piemonte Orientale, Università di Torino, and Università di Udine for helpful discussion. Maccheroni and Marinacci gratefully acknowledge the fi-nancial support of the Ministero dell’Istruzione, dell’Università e della Ricerca; Rustichini gratefully acknowledges the financial support of the National Science Foundation (grant # 0136556).

†_{The views expressed in the article are those of the author and do not involve the}

(2)

Abstract

(3)

1 Introduction

Since Markowitz’s [Ma] and Tobin’s [To] seminal contributions, mean-variance preferences have been extensively used to model the behavior of economic agents choosing among uncertain prospects (see, e.g., Bodie, Kane, and Mar-cus [BKM]) and have been one of the workhorses of portfolio selection theory (see, e.g., Britten-Jones [Br], Gibbons, Ross, and Shanken [GRS], Kandel and Stambaugh [KS], and MacKinlay and Richardson [MR]). These preferences, denoted bymv, assign to an uncertain prospectf the following utility score:

Uθ(f) = EP [f]−

θ

2Var

P

[f],

where P is a given probability measure and θ is an index of the agent’s aversion to variance.

The success of this specification of preferences is due to its analytical tractability and clear intuitive meaning. Mean-variance preferences have, however, a major theoretical drawback: they may fail to be monotone. It may happen that an agent with mean-variance preferences strictly prefers less to more, thus violating one of the most compelling principles of economic rationality. This is a well known problem (see, e.g., Cerny [Ce] and Jarrow and Madan [JM]) and not a minor one, since it has been proved (Bigelow [Bi]) that mean-variance preferences are rational only under very restrictive assumptions about the statistical distribution of asset returns.

Non-monotonicity of mean-variance preferences can be illustrated with a simple example. Consider a mean-variance agent with θ = 2. Suppose she has to choose between the two following prospects f and g:

States of Nature s1 s2 s3 s4

Probabilities 0.25 0.25 0.25 0.25

Payoff of f 1 2 3 4

Payoff of g 1 2 3 5

Prospect g yields a higher payoff than f in every state. Any rational agent should prefer g to f. However, it turns out that our mean-variance agent strictly prefers f tog. In fact:

U2(f) = 1.25>0.5625 =U2(g).

The reason why monotonicity fails here is fairly intuitive. By choosing g

(4)

unit increases the mean payoff, but it also makes the distribution of payoffs more spread out, thus increasing the variance. The increase in the mean is more than compensated by the increase in the variance, and this makes our mean-variance agent worse off.

In this paper we propose an adjusted version of mean-variance preferences that satisfies monotonicity, based on Maccheroni, Marinacci, and Rustichini [MMR]. As it will be detailed in the next section, they axiomatize a class of preferences, called variational preferences, that includes as a special case the preferences mmv _{represented by the choice functional}

Vθ(f) = min Q

EQ_[_f_{] +} 1

2θC(Q||P)

∀f ∈ L2(P),

where Q ranges over all probability measures with square-integrable density with respect to P, andC(Q||P) is the relative Gini concentration index (or

χ2_{-distance), a concentration index that enjoys properties similar to those}

of the relative entropy (see Liese and Vajda [LV] for details). Unlike mean-variance preferences, the preferences mmv _{are monotone.}

Consider the setGθ ⊂ L2(P) where mean-variance preferences are monotone

(i.e., where∇Uθ is positive). This is the set where mean-variance preferences

are economically meaningful. We show that the two choice functionals Uθ

and Vθ coincide onGθ, that is,

EP[f]−θ

2Var

P

[f] = min

Q

EQ[f] + 1

2θC(Q||P)

∀f ∈ Gθ.

Moreover, we show that: (i) Vθ is the minimal monotone choice functional

that extends the mean-variance functionalUθoutside the domain of

monotonic-ity Gθ, and (ii) Vθ pointwise dominatesUθ.

The preferencesmmv have, therefore, the following key properties:

• They agree with mean-variance preferences where mean-variance pref-erences are economically meaningful.

• Their choice functional Vθ is the minimal, and so the most cautious,

(5)

• The functionalVθ is also the best possible monotone approximation of

Uθ: if Vθ′ is any other monotone extension of Uθ outside Gθ, then

|Vθ(f)−Uθ(f)| ≤ |Vθ′(f)−Uθ(f)|

for each prospect f.

These properties make the preferences mmv a natural adjusted version

of mean-variance preferences that satisfies monotonicity. For this reason we call them monotone mean-variance preferences.

In view of all this, it is natural to wonder what happens, in a portfolio problem `a la Markowitz, if we use monotone mean-variance preferences in place of standard mean-variance preferences. This is the main subject matter of this paper. Markowitz’s well-known optimal allocation rule under mean-variance preferences is:

α_mv∗ = 1

θVar

P _[_X_]−1

EPh_X₋_~₁_Ri_,

where α∗

mv is the optimal portfolio of risky assets, X is the vector of gross

returns on the risky assets, R is the gross return on the risk-free asset, and

~_{1 is a vector of 1s. We show that with monotone mean-variance preferences}

the optimal allocation rule becomes:

α∗mmv =

1

θP(W ≤κ)Var

P_[_X_|_W _≤_κ_]−1

EPh_X₋_~₁_R_|_W _≤_κi_,

where W is future wealth and κ is a constant determined along with α∗

mmv

by solving a suitable system of equations.

Except for a scaling factor, the difference between Markowitz’s optimal portfolio α∗

mv and the above portfolio α∗mmv is that in the latter conditional

moments of asset returns EP_[_{· |}_W _≤_κ_{] and Var}P _[_{· |}_W _≤_κ_{] are used instead}

of unconditional moments, so that the allocationα∗

mmvignores the part of the

(6)

of monotone mean-variance preferences. We illustrate this point further in Section 6 by showing how this functional avoids some pathological situations in which the more the payoff to an asset is increased in some states, the more a mean-variance agent reduces the quantity of it in her portfolio, until in the limit she ends up holding none.

A fundamental property enjoyed by our model is that optimal portfolios satisfy the so called two fund separation principle (see, e.g., Sharpe [Sh-1] and [Sh-2] ): portfolios of risky assets optimally held by agents with different degrees of uncertainty aversion are all proportional to each other and at an optimum the only difference between two agents is the proportion of wealth invested in the risk-free asset. This has significant theoretical implications because it allows to identify the equilibrium market portfolio with the optimal portfolio of risky assets held by any agent, as in [Sh-1], [Sh-2] and [To], and it allows to derive CAPM-like relations, with potential empirical content.

The paper is organized as follows. Section 2 illustrates in detail monotone mean-variance preferences. Sections 3 and 4 derive the optimal allocation rule under the proposed specification of preferences. Section 6 contains the CAPM analysis. Section 5 presents some examples that illustrate the dif-ference between the optimal allocation rule proposed here and Markowitz’s. Section 7 concludes. All proofs are collected in the Appendix, along with some general results on monotone extensions of concave functionals.

2 Monotone Mean-Variance Preferences

We consider a measurable space (S,Σ) of states of nature. An uncertain prospect is a Σ-measurable real valued function f : S → R_{, that is, a}

sto-chastic monetary payoff.

The agent’s preferences are described by a binary relation on a set of uncertain prospects. [MMR] provides a set of simple behavioral conditions that guarantee the existence of an increasing utility functionu:R_→R _and

a convex uncertainty index c : ∆ → [0,∞] on the set ∆ of all probability measures, such that

f g ⇔ inf

Q∈∆

EQ[u(f)] +c(Q) ≥ inf

Q∈∆

EQ[u(g)] +c(Q) (1)

(7)

Preferences having such a representation are called variational, and two important special cases of variational preferences are the multiple priors pref-erences of Gilboa and Schmeidler [GS], obtained whenconly takes on values 0 and ∞, and the multiplier preferences of Hansen and Sargent [HS], ob-tained when c(Q) is proportional to the relative entropy of Q with respect to a fixed probability measure P.1 Variational preferences satisfy the basic tenets of economic rationality. In particular, they are monotone, that is, given any two prospects f and g, we have f g whenever f(s) ≥ g(s) for each s∈S.

For concreteness, given a probability measure P on (S,Σ), we consider the set L2₍_P_{) of all square integrable uncertain prospects. A mean-variance}

preference relation mv on L2(P) is represented by the choice functional

Uθ :L2(P)→R given by

Uθ(f) = EP[f]−

θ

2Var

P _[_f_] _∀_f _{∈ L}2₍_P₎_,

with θ >0. Let Gθ be the subset of L2(P) where the Gateaux differential of

Uθ is positive (as a linear functional). Some algebra shows that

Gθ =

f ∈ L2(P) :f−EP [f]6 1

θ

.

Clearly, mv is monotone on Gθ. The set Gθ is convex, closed, and called

domain of monotonicity of Uθ. As we discussed in the Introduction, Gθ

is where the mean-variance preference mv is economically meaningful (see

also Appendix A). We show that the restriction ofmv toGθ is a variational

preference, and that

Uθ(f) = min Q∈∆2₍_P₎

EQ_[_f_{] +} 1

2θC(Q||P)

∀f ∈ Gθ,

where ∆2₍_P_{) is the set of all probability measures with square-integrable}

density with respect to P, and C(Q||P) is the relative Gini concentration index given by

C(Q||P) =

(

EPh dQ dP

2i

−1 if Q≪P,

∞ otherwise.

1_{The relative entropy of} _Q_given_P _{is E}PhdQ dPln

dQ dP i

(8)

Along with the Shannon entropy, the Gini index is the most classic concentra-tion index. For discrete distribuconcentra-tions it is given byPn

i=1Q2i−1, andC(Q||P)

is its continuous and relative version. We refer to [LV] for a comprehensive study of concentration indices.

Now, say that a preference is amonotone mean-variance preference, writ-ten mmv, if it is represented by the choice functionalVθ :L2(P)→Rgiven

by

Vθ(f) = min Q∈∆2₍_P₎

EQ_[_f_{] +} 1

2θC(Q||P)

∀f ∈ L2(P), (2)

for some θ >0. Our first result is the following:

Theorem 1 The functional Vθ : L2(P) → R given by (2) is the minimal

monotone functional on L2₍_P₎ _{such that}_V

θ(g) =Uθ(g) for all g ∈ Gθ; that

is,

Vθ(f) = sup{Uθ(g) :g ∈ Gθ and g 6f} ∀f ∈ L2(P). (3)

Moreover, Vθ(f)≥Uθ(f) for each f ∈ L2(P).

The functional Vθ is concave, continuous, and in view of this theorem it

has the following fundamental properties:

(i) Vθ coincides with the mean-variance choice functional Uθon its domain

of monotonicity Gθ;

(ii) Vθ is the minimal monotone extension of Uθ outside the domain of

monotonicity Gθ, and so it is the most cautious monotone adjustment

of the mean-variance choice functional.

(iii) Vθ is the best possible monotone approximation ofUθ: ifVθ′ is any other

monotone extension of Uθ outside the domain of monotonicityGθ, then

V′

θ(f)≥Vθ(f)≥Uθ(f) and so

|Vθ(f)−Uθ(f)| ≤ |Vθ′(f)−Uθ(f)| ∀f ∈ L2(P).

In view of (i)-(iii), the monotone choice functional Vθ provides a natural

adjustment of the mean-variance choice functional. It also has the remarkable feature of involving, like multiplier preferences ([HS]), a classic concentration index. This ensures to Vθ a good analytical tractability, as it will be seen in

the next section.

(9)

Theorem 2 Let f ∈ L2₍_P₎_{. Then:}

Vθ(f) =

Uθ(f) if f ∈ Gθ,

Uθ(f ∧κf) else,

where

κf = max{t ∈R:f ∧t∈ Gθ}. (4)

A monotone mean-variance agent can thus be regarded as still using the mean-variance functionalUθ even in evaluating prospects outside the domain

of monotonicity Gθ. In this case, however, the agent no longer considers the

original prospects, but rather their truncations at κf, the largest constant t

such that f∧t belongs toGθ.

Observe that, besides depending on the given prospect f, the constant

κf also depends on the parameter θ. Corollary 12 in Appendix provides an

explicit formula forκf, which shows thatκf becomes lower whenθ increases.

We conclude this section with the observation that, while Uθ does not

preserve second order stochastic dominance (SSD), Vθ does.2 This means

that, also from a risk aversion perspective, monotone mean-variance prefer-ences are on a sounder economic ground than mean-variance ones. We refer to Dana [Da], to which the proof of the next result is inspired, for references on second order stochastic dominance.

Theorem 3 Let f, g∈ L2₍_P₎_{. If} _f

SSD g, then Vθ(f)≥Vθ(g).

3 The Portfolio Selection Problem

We consider the one-period allocation problem of an agent who has to decide how to invest a unit of wealth at time 0, dividing it amongn+ 1 assets. The first n assets are risky, while the (n+ 1)-th is risk-free. The gross return on the i-th asset after one period is denoted by Xi. The (n×1) vector of

the returns on the first n assets is denoted by X and the (n×1) vector of portfolio weights, indicating the fraction of wealth invested in each of the risky assets, is denoted by α. The return on the (n+ 1)-th asset is risk-free and equal to a constant R.

2_{Recall that} _f

SSDg if and only if EP h

(f −t)−i≤EPh₍_g₋_t₎−i

(10)

The end-of-period wealth Wα induced by a choice ofα is given by:

Wα =R+α·

X−~1R.

We assume that there are no frictions of any kind: securities are perfectly divisible; there are no transaction costs or taxes; agents are price-takers, in that they believe that their choices do not affect the distribution of asset returns; there are no institutional restrictions, so that agents are allowed to buy, sell or short sell any desired amount of any security.3 _{As a result,} _α _can

be chosen in Rn_.

We adopt mmv as a specification of preferences, and so portfolios are

ranked according to the preference functional:

Vθ(Wα) = min Q∈∆2₍_P₎

EQ[Wα] +

1

2θC(Q||P)

,

where P is the reference probability measure. Hence, the portfolio problem can be written as:

max

α∈Rn_Q_∈min_∆2₍_P₎

EQ[Wα] +

1

2θC(Q||P)

.

4 The Optimal Portfolio

In this section we give a solution to the portfolio selection problem outlined in the previous section. The characterization of the optimal portfolio is given by the following theorem.4

Theorem 4 The vector α∗ _{is a solution of the portfolio selection problem}

if and only if there exists κ∗ _∈ _R _{such that} ₍_α∗_{, κ}∗₎ _{satisfies the system of}

equations:

(

θP (Wα ≤κ) VarP [X|Wα ≤κ]α= EP

h

X−~1R|Wα≤κ

i ,

EP

(Wα−κ)−

= 1/θ.

3_{This assumption can be weakened, by simply requiring that at an optimum} institu-tional restrictions are not binding.

4_EP_[_{· |W}

α≤κ] and VarP[· |Wα≤κ] are the expectation and variance conditional on

(11)

The optimal portfolio α∗ _{is thus determined along with the threshold} _κ∗

by solving a system of n+ 1 equations in n+ 1 unknowns. Although it is not generally possible to find explicitly a solution of the above system of equations, numerical calculation with a standard equation solver is straight-forward: given an initial guess (α, κ), one is able to calculate the first two moments of the conditional distribution of wealth; if the moments thus cal-culated, together with the initial guess (α, κ), satisfy the system of equations, then (α, κ) = (α∗_{, κ}∗_{) and numerical search stops; otherwise, the search}

pro-cedure continues with a new initial guess for the parameters.5 _{In the next}

Section we will solve in this way few simple portfolio problems in order to illustrate some features of the model.

The optimal allocation rule of Theorem 4 is easily compared to the rule that would obtain in a classic Markowitz’s setting. In the traditional mean-variance model we would have:

α∗ = 1

θ

VarP [X]−1

EPhX−~1Ri. (5)

The monotone mean-variance model yields:

α∗ = 1

θP (Wα∗ ≤κ∗)Var

P

[X|Wα∗ ≤κ∗]−1EP h

X−~1R|Wα∗ ≤κ∗ i

.

Relative to Markowitz’s optimal allocation (5), here the unconditional mean and variance of the vector of returns X are replaced by a conditional mean and a conditional variance, both calculated by conditioning on the event

{Wα∗ ≤κ∗}. Furthermore a scaling factor is introduced, which is inversely

proportional to the probability of exceeding the threshold κ∗_.

Roughly speaking, when computing the optimal portfolio we ignore that part of the distribution where wealth is higher than κ∗_{. To see why it is}

optimal to ignore the part of the distribution where one obtains the highest returns, recall the example of non-monotonicity of mean-variance illustrated in the Introduction. In that example, high payoffs were increasing the vari-ance more than the mean, thus leading the mean-varivari-ance agent to prefer a strictly smaller prospect, contrary to what would commonly be reputed eco-nomically reasonable. With monotone mean-variance preferences, this kind

(12)

of behavior is avoided by artificially setting the probability of some high payoff states equal to zero. In our portfolio selection problem we set the probability of the event {Wα∗ > κ∗} equal to zero.

When there is only one risky asset, the optimal quantity of it to be held according to our model and according to the mean-variance model can be compared by means of the following result. Here α∗

mmv denotes the

opti-mal portfolio according to the monotone mean-variance model and α∗

mv the

optimal portfolio according to the mean-variance model.

Proposition 5 Suppose that S is finite, with P(s) > 0 for all s ∈ S, and that there is only one risky asset. Then, either

α∗mmv ≥α∗mv ≥0

or

α∗mmv ≤α∗mv ≤0.

If, in addition, P Wα∗

mmv > κ

∗

>0, then:

α∗

mmv> α∗mv if α∗mmv >0,

α∗mmv< α∗mv if α∗mmv <0.

That is, an investor with monotone mean-variance preferences always holds a portfolio which is more leveraged than the portfolio held by a mean-variance investor. If she buys a positive quantity of the risky asset, this is greater than the quantity that would be bought by a mean-variance in-vestor; on the contrary, if she sells the risky asset short, she sells more than a mean-variance investor would do. This kind of behavior will be thor-oughly illustrated by the examples in the next section: the intuition behind it is that in some cases a favorable investment opportunity is discarded by a mean-variance investor because of non-monotonicity of her preferences, while a monotone mean-variance investor exploits the opportunity, thus taking a more leveraged position.

5 Monotone CAPM

(13)

The first step is to prove a two-fund separation theorem, which shows that an agent’s optimal portfolio is always an affine combination of a given portfolio of risky assets with the risk-free asset. As a result, the optimal investment choice can be done in two separated stages: first the agent decides which proportion of her wealth to invest in the risk-free asset; then, she decides how to allocate the remaining wealth to the risky assets. The outcome of this second decision is the same for every agent, regardless of her aversion to uncertainty and of her initial wealth. The following proposition proves separability in two funds:

Proposition 6 Let θ, γ > 0. If αθ_{, κ}θ

solves the portfolio selection prob-lem for an investor with uncertainty aversion θ, then θ

γα

θ_{, R}₊ θ γ κ

θ₋_R

solves it for an investor with uncertainty aversion γ. Moreover,

∇Vθ(Wαθ) = ∇V_γ(W_αγ).

Now, choose anyθ > 0 and, providedαθ_·~₁_>_{0, define:}

αm ₌ α θ

αθ_·~₁

Proposition 6 guarantees that the definition ofαm _{is unambiguous, as}_αm _is

the same regardless of the θ we choose. Since αm_·_~_{1 = 1,} _αm _{is the portfolio}

held by an investor who does not invest any of her wealth in the risk-free asset. Following the majority of the literature we call it market portfolio.

In an economy consisting of monotone-mean variance agents, all investors hold a portfolio of risky assets that is proportional to the market portfolio. This has strong positive implications, since at an equilibrium the composition of the market portfolio can be inferred by observing the proportions of risky assets existing in the economy. One of the implications that is usually tested in mean-variance settings concerns the relation between the expected excess return on any one asset and that on the market portfolio. In the standard setting, their ratio (the so-called “beta”) is proportional to the covariance between the return to the specific asset and that to the market portfolio, with the same coefficient of proportionality for all assets. A similar property holds in our setting. Denote by Xm ₌_αm_·_X _{the gross return to the market}

(14)

Proposition 7 (Monotone CAPM) If αθ_{, κ}θ

solves the portfolio selec-tion problem for an investor with uncertainty aversion θ >0, and αθ_·_~₁_>₀_,

then

E [Xi]−R=βi(E[Xm]−R), ∀i= 1, ..., n, (6)

where

βi =

Cov [∇V, Xi]

Cov [∇V, Xm_]

and ∇V =∇Vθ(Wαθ).

This gives a formulation of a Monotone CAPM with a potential for empir-ical testing. Proposition 7 also suggests that, by regressing the excess returns to the single assets on the excess return to the market portfolio, the betas of the single assets should be estimated by instrumental variables, using ∇V

as an instrument, rather than by ordinary least squares. To see why this is the case, define:

εi =Xi−R−βi(Xm−R) ,

so that:

Xi =R+βi(Xm−R) +εi

The usual requirement for the consistency of an ordinary least squares esti-mator is that the error εi and the regressor Xm−R be orthogonal to each

other. However, in general Xm₋_R _{is not orthogonal to} _ε

i because

E [εi(Xm−R)] = E [(Xi−R) (Xm−R)]−βiE

(Xm₋_R₎2

= E [(Xi−R) (Xm−R)]−

Cov [∇V, Xi]

Cov [∇V, Xm_]E

(Xm₋_R₎2

Instrumental variables estimation with ∇V as an instrument can be instead carried out because ∇V is orthogonal to εi, as shown at the end of the

Appendix.

6 Some Examples

In this section we present three simple examples to illustrate the optimal portfolio rule we derived above. In every example there are five possible states of Nature. Each of them obtains with a probabilityP (si) that remains

(15)

Example 1 is a case in which our model and the traditional mean-variance model deliver the same optimal composition of the portfolio. This is not interesting per se, but it serves as a benchmark and it helps to introduce Example 2, where the two optimal portfolios differ. In Example 1 there is only one risky asset, whose gross return is denoted by X1 and is reported in

the next table, and a risk-free asset, whose gross returnRis equal to 1 across all states. In this example, the optimal portfolio α∗

mmv calculated according

to our rule is equal to the mean-variance optimal portfolio α∗

mv. Wmv and

Wmmvrepresent the overall return to the two optimal portfolios for each state

of the world. The table also displays the value of the constantκ∗ _{at which it}

is optimal to truncate the distribution of the return to the portfolio of risky assets.

P(si) P (si|Wmmv≤κ∗) R X1 Wmv Wmmv

s1 0.1 0.1 1 0.97 0.9375 0.9375

s2 0.2 0.2 1 0.99 0.9791 0.9791

s3 0.4 0.4 1 1.01 1.0208 1.0208

s4 0.2 0.2 1 1.03 1.0620 1.0620

s5 0.1 0.1 1 1.05 1.1041 1.1041

α∗

mv = 2.083

α∗

mmv = 2.083 κ∗ = 1.1211

Example 2 is a slight modification of Example 1. We increase the payoff to the risky asset in state s5 from 1.05 to 1.10, leaving everything else

un-changed. The effect of this change is an increase in both the mean and the variance ofX1, the payoff to the risky asset. The optimal behavior according

to the mean-variance model is to reduce the fraction of wealth invested in the risky asset from 2.083 to 1.3574. According to our model it is also optimal to decrease the position in the risky asset, but less, from 2.083 to 1.8382.

P(si) P (si|Wmmv≤κ∗) R X1 Wmv Wmmv

s1 0.1 0.1111 1 0.97 0.9592 0.9448

s2 0.2 0.2222 1 0.99 0.9864 0.9816

s3 0.4 0.4444 1 1.01 1.0135 1.0183

s4 0.2 0.2222 1 1.03 1.0407 1.0551

s5 0.1 0 1 1.10 1.1357 1.1838

α∗

mv = 1.3574

α∗

(16)

In both cases the optimal behavior might seem puzzling at a first sight: when the payoff of an asset increases in one state, it is optimal to hold less of that asset. This behavior can be understood by looking at the distributions of the overall return in the two tables. By reducing the fraction of wealth invested in the risky asset, the overall return increases in the states where the risky asset pays less than the risk-free asset. On the contrary, the overall return decreases in the states where the risky asset pays more than the risk-free asset. In state s5, however, the effect of this decrease is compensated

by the fact that we have raised the payoff to the risky asset from 1.05 to 1.10. Hence, by reducing the amount of wealth invested in the risky asset, the investor gives up some of the extra payoff received in state s5 in order

to guarantee himself a higher overall return in the states where the risky asset has a low payoff. The problem with this kind of behavior is that it can become pathological with mean-variance preferences. The next table shows what happens if we further increase the payoff in state s5.

X1(s5) 1.05 1.10 1.15 1.20 1.50 2 3

α∗

mv 2.0830 1.3574 0.9174 0.6747 0.2465 0.1175 0.0572

α∗

mmv 2.0830 1.8382 1.8382 1.8382 1.8382 1.8382 1.8382

The more we increase the payoff in states5, the more the mean-variance

optimal fraction α∗

mv of wealth invested in the risky asset decreases, until it

goes to zero when the payoff in states5becomes very large. In our model this

does not happen. At first α∗

mmv decreases, but it then stops decreasing and

it remains fixed at the same value, though the payoff in state s5 is further

increased. The reason why this happens is that, once probabilities have been optimally reassigned to states and a zero probability has been assigned to state s5, any further increases of the payoff in s5 are disregarded and have

no influence on the formation of the optimal portfolio.

Example 3 is slightly more complicated. Everything is as in Example 2, but a second risky asset is added. The payoff to this new asset, denoted by

(17)

P (si) P(si|Wmmv ≤κ∗) R X1 X2 Wmv Wmmv

s1 0.1 0.1111 1 0.97 1.05 1.002 1.0231

s2 0.2 0.2222 1 0.99 1.00 0.9833 0.9570

s3 0.4 0.4444 1 1.01 0.99 1.0061 1.0125

s4 0.2 0.2222 1 1.03 0.99 1.0393 1.0985

s5 0.1 0 1 1.10 0.99 1.1556 1.3994

α∗

mv = (1.6613,1.0495)

α∗

mmv = (4.2989,3.0423) κ∗ = 1.1316

Also in this case the optimal portfolios suggested by our model and by the traditional model are different. To get an intuitive idea of what is happening, note that, although the market is still arbitrage-free, asset 2 allows to hedge away almost completely the risks taken by investing in asset 1. Consider for example a portfolio formed by 0.5 units of asset 1 and 0.5 units of asset 2. Its payoffs in the five states are collected in the following vector:

(1.01,0.995,1,1.01,1.045)

A qualitative inspection of this payoff vector reveals that in state s2 this

portfolio pays off slightly less than the risk-free asset, while in all other states it pays off more and in some states considerably more. Roughly speaking, if it was not for the slightly low payoff in state s2, there would be an arbitrage

opportunity because the portfolio would pay off more than the risk-free asset in every state. As a consequence, we would expect an optimal portfolio rule to exploit this favorable configuration of payoffs by prescribing to take a highly levered position. As reported in the last table, according to our model it is optimal to take a highly levered position in the risky assets in order to exploit this opportunity, at the cost of facing a low payoff in state

s2. In contrast, with the mean-variance model the optimal portfolio is much

less aggressive, and the investor is overly concerned with the unique state in which the payoff is lower than the payoff to the risk-free asset.

7 Conclusions

(18)

(19)

A

Monotone Fenchel Duality

In this section we recall some important definitions of Convex Analysis, and we prove some properties of monotone extensions of concave functionals that pave the way for the proof of Theorem 1.

Let (E,k·k,>) be an ordered Banach space, i.e., an ordered vector space endowed with a Banach norm such that the positive cone E+ is closed and

generates E. Denote by E′ _{the norm dual of} _E_{, and by} _E′

+ the cone of all

positive, linear, and continuous functionals on E. (See [Ch].)

Letψ : E → R _{be a concave and continuous functional. The} _directional

derivative of ψ atx is

d+ψ(x) (v) = lim

t→0+

ψ(x+tv)−ψ(x)

t ∀v ∈E.

If the above limit exists fort →0 and allv ∈E,ψisGateaux differentiable at

x, and the functional∇ψ(x) :v 7→d+_ψ₍_x_{) (}_v_{) is called} _{Gateaux differential}_.

(See [Ph]). The superdifferential of ψ atx is the subset of E′ _{defined by}

∂ψ(x) = {x′ ∈E′ :ψ(y)−ψ(x)≤ hy−x, x′i ∀y∈E};

while its Fenchel conjugate ψ∗ _:_E′ _→_[_−∞_,_∞_{) is given by}

ψ∗(x′) = inf

x∈E{hx, x

′_{i −}_ψ₍_x₎_} _∀_x′ _∈_E′_.

Finally, thedomain of monotonicity of ψ is the set Gψ ⊆E given by

Gψ =

x∈E :∂ψ(x)∩E₊′ 6=∅ _.

Next proposition confirms the intuition that the set Gψ is where the

func-tional ψ is monotone.

Proposition 8 Let E be an ordered Banach space and ψ : E → R _{be a}

concave and continuous functional with Gψ 6=∅.

(i) The functional ψ˜ :E →R_{, given by}

˜

ψ(x) = min

x′_∈_E′

+

{hx, x′i −ψ∗(x′)} ∀x∈E, (7)

is the minimal monotone functional that dominates ψ; that is,

˜

(20)

(ii) Let x∈E, x∈Gψ ⇔d+ψ(x) (v)≤0∀v 60⇔ψ˜(x) =ψ(x).

(iii) Let x′ _∈_E′_,

˜

ψ∗(x′) =

ψ∗₍_x′₎ _if _x′ _∈_E′

+,

−∞ otherwise. (9)

(iv) Ifψ∗ _{is strictly concave on the intersection of its domain with} _E′

+, then

˜

ψ is Gateaux differentiable and∇ψ˜(x) = arg minx′_∈_E′

+{hx, x

′_{i −}_ψ∗₍_x′₎_}

for all x∈E.

Moreover, if Gψ is convex, for all x ∈ E there exists y ∈ Gψ such that

y6x, and there exists a linear subspace F ⊆Gψ such that ψ|F is linear and

ψ∗₍_x′₎ _{is attained for all} _x′ _∈ _E′

+ with x′|F = ψ|F; then, ψ˜ is the minimal

monotone functional that extends ψ|Gψ from Gψ to E. In this case,

˜

ψ(x) = sup{ψ(y) :y∈Gψ and y6x}= min x′_∈_E′

+:x′|F=ψ|F

{hx, x′i −ψ∗(x′)}

(10)

for all x∈E.

By (i) and (ii), ˜ψ is the minimal monotone extension of ψ|Gψ from Gψ

to E that dominates ψ on E; in particular, ˜ψ(x) ≥ ψ(x) for each x ∈ E

and ˜ψ(x) = ψ(x) for all x ∈ Gψ. Moreover, (iii) shows that the Fenchel

conjugates of ˜ψandψcoincide on the coneE′

+. (iv) is a useful differentiability

property. The final part, shows that, under additional assumptions (satisfied, e.g., by the mean-variance functional), the extension ˜ψ has even stronger minimality properties.6

Proof. By the Fenchel-Moreau Theorem,ψ(x) = infx′_∈_E′{hx, x′i −ψ∗(x′)}

for all x∈E. Set

˜

ψ(x) = inf

x′_∈_E′

+

{hx, x′i −ψ∗(x′)} ∀x∈E. (11)

Let x0 ∈Gψ (6=∅) and x′0 ∈∂ψ(x0)∩E+′ (6=∅), then

ψ(x)≤ψ(x0) +hx, x′0i − hx0, x′0i ∀x∈E.

(21)

Therefore, the functional x′

0+ (ψ(x0)− hx0, x′0i) is affine, monotone, and it

dominates ψ. Notice that

hx0, x′0i −ψ(x0)≤ hx, x′0i −ψ(x) ∀x∈E and ψ∗(x′0) = hx0, x′0i −ψ(x0).

(12) In particular, x′

0 ∈E+′ and ψ∗(x′0)∈R, hence, for all x∈E, −∞< ψ(x)≤ inf

x′_∈_E′

+

{hx, x′i −ψ∗(x′)}<∞.

Then ˜ψ dominatesψ and it takes finite values (i.e. it is a functional). Obvi-ously, ˜ψ is concave and monotone.

(i) Letϕ :E →R_{be a concave and monotone functional such that}_ϕ_≥_ψ_.

Thenϕ is continuous,7 _and_ϕ₍_x_{) = inf}

x′_∈_E′

+{hx, x

′_{i −}_ϕ∗₍_x′₎_}_{for all}_x_∈_E_.8

In addition, since ϕ ≥ψ, then ϕ∗ _≤_ψ∗ _and

ϕ(x) = inf

x′_∈_E′

+

{hx, x′i −ϕ∗(x′)} ≥ inf

x′_∈_E′

+

{hx, x′i −ψ∗(x′)}= ˜ψ(x) ∀x∈E.

This shows that ˜ψ is the minimal concave and monotone functional that dominates ψ. It is easy to check that the function defined by Eq. (8) is the minimal monotone functional that dominates ψ, and it is concave, hence it coincides with ˜ψ.

(iii) Follows from the definition of ˜ψ (Eq. 11) and the observation that the function defined by Eq. (9) is proper, concave, and weak* upper semi-continuous.

Since ˜ψ is concave and continuous, for all x0 ∈E,9 (iii) delivers

x′₀ ∈∂ψ˜(x0) ⇔ x′0 ∈arg min_x′_∈_E′ n

hx0, x′i −ψ˜

∗

(x′)o

⇔ x′₀ ∈arg min

x′_∈_E′

+

{hx0, x′i −ψ∗(x′)}.

7

If ϕ : Ω→ R _{is concave and monotone on an open subset Ω of an ordered Banach}

space, then it is continuous.

8_If_ϕ_:_E_→_[_−∞,_∞_{] is monotone and not identically}_−∞_{, then}_ϕ∗₍_x′_{) =}_−∞_{for all}

x′_∈_/_E′

+.

9_If_ϕ_:_E_→_R_{is concave and continuous, then, for all}_x 0∈E,

∂ϕ(x0) = arg min

x′∈E′{hx0, x

(22)

This shows that the infimum in Eq. (11) is attained, and that (iv) holds. (ii) Ifx∈Gψ, then there is x′₀ ∈∂ψ(x)∩E₊′ , and for all v 60

d+ψ(x) (v) = min

x′_∈_∂ψ₍_x₎hv, x

′_{i ≤}D_{v, x}′

0

E ≤0.

Conversely, assume that d+_ψ₍_x_{) (}_v₎_≤_{0 for all} _v ₆_{0 and, by contradiction,}

thatx /∈Gψ, that is∂ψ(x)∩E+′ =∅. SinceE+′ is a weak* closed and convex

cone and∂ψ(x) is weak* compact, convex, and nonempty, there existsv ∈E

such that

hv, y′i ≤0<hv, x′i ∀y′ ∈E₊′ , x′ ∈∂ψ(x). (13) Since E+ is closed, then E+ =

x∈E :hx, y′_{i ≥}₀_∀_y′ _∈_E′

+ . The left

hand side of Eq. (13) amounts to say that v 6 0, the right hand side that minx′_∈_∂ψ₍_x₎hv, x′i>0, which is absurd.

Ifx∈Gψ, then there is x′0 ∈∂ψ(x)∩E+′ that is

x′₀ ∈ arg min

x′_∈_E′{hx, x

′_{i −}_ψ∗₍_x′₎_} _and _x′

0 ∈E+′ , then

x′₀ ∈ arg min

x′_∈_E′

+

{hx, x′i −ψ∗(x′)}.

The first line implies ψ(x) = hx, x′

0i − ψ∗(x′0), the second that ˜ψ(x) = hx, x′

0i − ψ∗(x′0). Conversely, if ψ(x) = minx′_∈_E′

+{hx, x

′_{i −}_ψ∗₍_x′₎_}_{, then}

there exists x′

0 ∈E+′ such that

hx, x′₀i −ψ∗(x′₀) = ψ(x) = min

x′_∈_E′{hx, x

′_{i −}_ψ∗₍_x′₎_}_{, then}

x′₀ ∈ arg min

x′_∈_E′{hx, x

′_{i −}_ψ∗₍_x′₎_} _and _x′

0 ∈E+′ ,

therefore, x′

0 ∈∂ψ(x)∩E+′ and x∈Gψ.

Finally, assumeGψ is convex, for all x∈E there exists y∈Gψ such that

y6x, and there exists a linear subspace F ⊆Gψ such thatψ|F is linear and

ψ∗₍_x′_{) is attained for all} _x′ _∈_E′

+ with x′|F =ψ|F. Set

ˆ

ψ(x) = sup{ψ(y) :y ∈Gψ and y6x} ∀x∈E.

It is easy to check that ˆψ : E → R _{is the minimal monotone functional}

that extends ψ|Gψ from Gψ to E, and that convexity of Gψ implies that it

(23)

monotone, and linear on F (where it coincides with ψ). Hence, for all x ∈

these properties there exists x0 ∈E such that

hx0, x′i −ψ(x0)≤ hx, x′i −ψ(x) ∀x∈E.

where the last inequality descends from Eq. (14). We conclude that

ˆ

−∞Ff(z)dz its integral distribution function.

Next two lemmas regroup some useful properties of integrated distribu-tion funcdistribu-tions. We report the proofs for the sake of completeness.

10_If_ϕ_:_E_→_[_−∞,_∞_{] is linear on a subspace}_F_{, then}_ϕ∗₍_x′_{) =}_−∞_if_x′

(24)

(25)

Observe thatz−(f∧z)≥0, and so

Z

z−(f ∧z)dP =

Z ∞

0

P (z−(f∧z)≥u)du.

On the other hand, {z−(f ∧z)≥u}={f ≤z−u} for all u >0. In fact,

z−(f ∧z)≥u⇒(f ∧z)≤z−u < z ⇒(f ∧z) =f ⇒f ≤z−u

and

f ≤z−u⇒f∧z ≤(z−u)∧z ⇒f ∧z ≤z−u⇒z−(f ∧z)≥u.

Hence,

Z

z−(f ∧z)dP =

Z ∞

0

P (z−(f∧z)≥u)du

=

Z ∞

0

P (f ≤z−u)du=

Z z

−∞

P (f ≤t)dt =gf(z),

thus the first four equalities hold.

Finally, notice that P (f < t) = limu→t−P (f ≤u) 6= P (f ≤t) for at

most a countably many ts.

Lemma 10 The function gf :R→[0,∞) is continuous and

Ff(z) = lim ε→0+

gf(z+ε)−gf(z)

ε

∀z ∈R_.

That is, Ff is the right derivative of gf, and Ff(z) is the derivative of gf at

every point z at which Ff is continuous. Moreover, setting ζ = essinf (f), gf

is strictly increasing on (ζ,∞), gf ≡ 0 on (−∞, ζ],11 limz→ζ+g_f(z) = 0+,

and limz→∞gf(z) =∞.

Proof. The Fundamental Theorem of Calculus guarantees the continuity

and derivability properties of gf. Recall that

essinf (f) = sup{α∈R_:_P₍_{f < α}_{) = 0}_}_.

(26)

If z ∈ R _and _z _≤ _{essinf (}_f_{), for all} _{t < z} _{there exists} _{α > t} _{such that}

P (f < α) = 0. Then,

0≤P (f < t)≤P (f < α) = 0.

This implies gf (z) = 0 for all z ∈(−∞, ζ].

On the other hand, if ζ < z < z′_{, then}

gf(z′)−gf(z) =

Z z′

z

P (f < t)dt≥P (f < z) (z′−z).

But P (f < z) = 0 would imply z ≤ ζ, a contradiction. Therefore, gf(z′)−

gf(z)>0. That is, gf is strictly increasing on (ζ,∞).

Notice that limt→∞Ff(t) = 1. Then, for all n > 1 there exists k ≥ 1

(n, k ∈N_{) such that} _F_f₍_t₎_>₁₋ 1

n for all t≥k. Therefore,

gf(k+n) =

Z k+n

−∞

Ff(t)dt≥

Z k+n

k

Ff(t)dt

≥ n

1− 1 n

=n−1.

Since gf is increasing on R, then limz→∞gf(z) =∞.

Ifζ >−∞,gf(ζ) = 0, continuity and nonnegativity imply limz→ζ+g_f(z) =

0+_{. Let} _ζ ₌_−∞_{, for all} _{n >}_1,

gf(−n) =

Z

(−n−(f ∧(−n)))dP =

Z

(((−f)∨n)−n)dP

=

Z

((−f)−((−f)∧n))dP.

The Monotone Convergence Theorem guarantees that limn→∞gf(−n) = 0.

Monotonicity and nonnegativity imply limz→ζ+g_f (z) = 0+.

For the rest of the Appendix we indifferently write EP _{or just E. Denoting}

by > the relation ≥ P-a.s., (L2₍_P₎_,_k·k

2,>) is an ordered Banach space,

and its norm dual can be identified with L2₍_P_{), with the duality relation} hf, Yi= E [f Y].

Lemma 11 Let f ∈ L2₍_P₎_{− G}

θ and t ∈R. Then

f ∧t∈ Gθ ⇔gf(t)≤

1

(27)

Proof. Notice that

For the converse implication, notice that we are assumingf /∈ Gθ. Then,

being f ∧t ∈ Gθ, it cannot be f∧t =f P-a.s.. Hence, essup (f∧t) =t. It

Lemmas 10 and 11 immediately yield the following:

(28)

(iv) {Y ∈ L2₍_P_{) : E [}_Y_{] = 1}_} ₌ n_Y _{∈ L}2₍_P_{) :}_h·_{, Y}_i

|T =Uθ|T

o

and U∗

θ is

attained and strictly concave on this set.

Proof. (i) and (ii) are trivial.

For allt ∈R_,_U_θ₍_t₁_S_{) =} _t_{, then for all} _Y _{∈ L}2₍_P₎ is well defined, convex, and Gateaux differentiable. Its Gateaux differential is

This concludes the proof of Eq. (17). Strict concavity of U∗

θ in its domain is

a straightforward application of the the Cauchy-Schwartz inequality.

Proof of Theorem 1. As observed, Uθ : L2(P) → R is a concave and

continuous functional on an ordered Banach space, Gθ = GUθ, Lemma 13

and Corollary 12, guarantee that all the hypotheses of Proposition 8 are satisfied. For all f ∈ L2₍_P_),

(29)

Remark 1 Proposition 8.(iv) and Lemma 13.(iv) guarantee thatVθ is Gateaux

. Moreover, the Gateaux differential of Vθ at f is

∇Vθ(f) = θ(κ−f) 1{f≤κ}.

Remark 1 guarantees that the solution of such problem exists, is unique, and it coincides with the Gateaux derivative of Vθ at f. Notice that Y is

a solution of problem (19) if and only if it is a solution of the constrained optimization problem:

+(P), λ ∈ R. The Karush-Kuhn-Tucker optimality conditions

(30)

Since µ, Y >0, they are equivalent to:

f +1

θY −µ−λ= 0 (P-a.s.) µY = 0 (P-a.s.)

Y >0, µ>0 E (Y) = 1

that is,

f+ 1

θY −λ≥0 P-a.s. (21)

f +1

θY −λ

Y = 0 P-a.s. (22)

Y ≥0 P-a.s. (23)

E [Y] = 1 (24)

It is sufficient to find (Y∗_{, λ}∗_{) that satisfy (21) - (24) everywhere (not}

only P-a.s.).

Ifs∈ {Y∗ _>₀_}_{, then by (22)} _f₍_s_{) +} 1

θY

∗₍_s₎₋_λ∗ _{= 0 and}

Y∗₍_s_{) =} _θ₍_λ∗₋_f₍_s₎₎_. ₍₂₅₎

In particular, λ∗₋_f₍_s₎_>_{0, and}_s _{∈ {}_{f < λ}∗_}_{. Conversely, if} _s_{∈ {}_{f < λ}∗_}_,

then by (21) Y∗₍_s₎_≥_θ₍_λ∗ ₋_f₍_s₎₎_>_{0 and} _s_{∈ {}_Y∗ _>₀_}_{. In sum,}

{Y∗ _>₀_}₌_{_{f < λ}∗_} _and

Y∗ =θ(λ∗−f) 1{f <λ∗_}.

By (24),

1 = E [Y∗] = E

θ(λ∗−f) 1{f <λ∗_}

= θ λ∗P(f < λ∗)−E

f1{f <λ∗_},

that is,

gf(λ∗) = λ∗P (f < λ∗)−E

f1{f <λ∗_} = 1 θ.

In other words,

λ∗ =g_f−1

1

θ

(31)

and λ∗ _{is unique. A fortiori,}_Y∗ _{is unique and}

Y∗ =θ(κ−f) 1{f <κ}. (27)

By construction, the pair (Y∗_{, λ}∗_{) defined by (26) and (27) is a solution of}

(21) - (24). Since the solution of (19) exists and it is unique, we conclude that Y∗ _{defined as in Eq. (27) is the unique solution of (19).}

(32)

and

Proof of Theorem 2. It is now enough to combine Corollary 12 and

Theorem 14.

Remark 2 Inspection of the proof of Theorem 14 shows that:

(33)

• Moreover, 1

Proof of Theorem 3. By Theorem 13.8 of Chong and Rice [CR], for all

(34)

hence

Proof of Theorem 4. The maximization problem is

sup a vector of 1s). From Theorem 14 we know that Vθ is Gateaux differentiable

and

the first order conditions for an optimum are:

(35)

set A={Wα ≤κα} to obtain

These are the first n equations. The (n+ 1)-th is the equation which deter-mines κα, that is (28) or (see Lemma 9)

Concavity of Vθ guarantees the sufficiency of first order conditions.

Proof of Proposition 5. Set α∗ ₌ _α∗

mmv. The maximization problem it

solves is

2_{]. Moreover, notice that}

G:R_×Y_→R _{is continuous, it is affine in} _α _{(for each fixed}_Y_{) and strictly}

convex in Y (for each fixed α). Set v = maxα∈RminY∈YG(α, Y), by (a version of) the Min-Max Theorem (e.g. [Au, p. 134]) there exists ¯Y ∈ Y

(36)

therefore

E [Y∗(X−R)] = 0. (31)

Y∗ _{is the solution of problem (20) in the proof of Theorem 14 with} _f ₌

R + α∗₍_X₋_R_{) =} _W

α∗. Therefore, there exist λ∗ ∈ R and µ ∈ L2(P)

(=RS_{) such that} _Y∗ _{satisfies the following conditions:}

R+α∗(X−R) + 1

θY

∗₋_µ₋_λ∗ _{= 0}_, ₍₃₂₎

E [Y∗] = 1, (33)

Y∗ >0, µ >0, µY∗ = 0. (34)

Taking the expectation of both sides of (32) we obtain:

(1−α∗)R+α∗E [X] + 1

θE [Y

∗_]₋_{E [}_µ_]₋_λ∗ _{= 0} ₍₃₅₎

and, subtracting (35) from (32):

α∗(X−E [X]) + 1

θ(Y

∗₋_{E [}_Y∗_])₋₍_µ₋_{E [}_µ_{]) = 0}_.

Rearranging and using (33):

Y∗ _{= 1}₋_θα∗₍_X₋_{E [}_X_{]) +}_θ₍_µ₋_{E [}_µ_])_. ₍₃₆₎

Multiply both sides by µ, take expectations and use (34) to get:

E [µ]−θα∗Cov [µ, X] +θVar [µ] = 0

and, rearranging terms:

θα∗Cov [µ, X] = E [µ] +θVar [µ]. (37)

Since µ>0, then E [µ]≥0 thus:

α∗ = 0 ⇒µ= 0 ⇒Cov [µ, X] = 0, (38)

α∗ > 0⇒Cov [µ, X]≥0, (39)

α∗ < 0⇒Cov [µ, X]≤0. (40)

Now, plugging (36) into (31) we obtain:

(37)

or:

E [X]−θα∗Var [X] +θCov [µ, X] =R

which becomes:

α∗ = 1

θ

E [X−R] Var [X] +

Cov [µ, X] Var [X] Recalling that:

α∗mv =

1

θ

E [X−R] Var [X] we obtain:

α∗ =α∗_mv+ Cov [µ, X]

Var [X] (41)

Using (38) - (40), it is now obvious that:

α∗ = 0⇒α∗ =α∗mv = 0, (42)

α∗ > 0⇒α∗ ≥αmv∗ , (43)

α∗ < 0⇒α∗ ≤αmv∗ . (44)

¿From the proof of Theorem 14 – Eq. (26) – we know that λ∗ ₌_g−1

Wα∗

1

θ

=

κ∗_{. Furthermore, if} _P ₍_W

α∗ > κ∗) > 0, since S is finite, there exists s such

that

R+α∗(X(s)−R) =Wα∗(s)> κ∗ =λ∗,

that is R+α∗₍_X₍_s₎₋_R₎₋_λ∗ _>_{0. Since} _Y∗₍_s₎_≥_{0, by (32), we have}

µ(s) =R+α∗(X(s)−R)−λ∗+1

θY

∗ _>₀_,

and E [µ] > 0. Thus, in this case (37) implies α∗_{Cov [}_{µ, X}_] _> _{0 and the}

inequalities in (43) and (44) become strict. Finally, we want to show that α∗_α∗

mv ≥ 0. By contradiction, suppose

α∗_α∗

mv < 0. Then, either α∗ > 0 and α∗mv < 0 or α∗ < 0 and α∗mv > 0.

Suppose α∗ _>_{0 and} _α∗

mv <0, since

α∗mv =

1

θ

E [X−R] Var [X] ,

it must be E [X−R]<0 and

(38)

Clearly, if α∗ _<_{0 and} _α∗

where the last inequality follows from (45). But,

min

Proof of Proposition 6. Assume

(39)

that is

(

γP (Wαγ ≤κγ) VarP [X|W_αγ ≤κγ]αγ = EP h

X−~1R|Wαγ ≤κγ i

,

EP

(Wαγ−κγ)−

= 1

γ.

so that (αγ_{, κ}γ_{) solves the portfolio selection problem for an investor with}

uncertainty aversion γ. Moreover,

∇Vθ(Wαθ) = θ κθ−W_αθ

1_{_W

αθ≤κθ}

= θγ

θκ

γ₋ γ

θWαγ

1{Wαγ≤κγ}

= γ(κγ₋_W

αγ) 1_{_W

αγ≤κγ} =∇Vγ(Wαγ).

Proof of Proposition 7. Remember (see Eq. 29) that the first order

conditions for an optimum are E [X∇Vθ(Wαθ)] = ~1R. In particular, for i= 1, ..., n

E [Xi∇V] =R and αm·E [X∇V] =αm·~1R, then

E [Xi∇V] =R and E [Xm∇V] =R (47)

therefore

E [Xi∇V]−E [Xi] E [∇V] = R−E [Xi] E [∇V] and

E [Xm∇V]−E [Xm] E [∇V] = R−E [Xm] E [∇V] but E [∇V] = 1 (see Eq. 19), then

Cov [Xi,∇V] =−E [Xi −R] and Cov [Xm,∇V] =−E [Xm−R]

we conclude

βi =

−E [Xi−R]

−E [Xm₋_R_] =

Cov [Xi,∇V]

Cov [Xm_,_∇_V_].

Finally, to prove ∇V is orthogonal to εi notice that

E [εi∇V] = E [(Xi−R)∇V]−βiE [(Xm−R)∇V]

and Eq. (47) and E [∇V] = 1 guarantee that E [(Xi−R)∇V] = 0 and

(40)

References

[Au] Aubin, J-P., Optima and equilibria, Springer-Verlag, Berlin, 1998.

[Bi] Bigelow, J. P., Consistency of mean-variance analysis and expected utility analysis: a complete characterisation, Economic Letters, 43, 187-192, 1993.

[Br] Britten-Jones, M., The sampling error in estimates of mean-variance efficient portfolio weights,Journal of Finance, 45, 655-671, 1999.

[BKM] Bodie, Z., A. Kane, and A. J. Marcus, Investments, McGraw Hill, Boston, 2002.

[Ce] Cerny, A., Mathematical techniques in finance, Princeton University Press, Princeton, 2004.

[CR] Chong, K.M. and N.M. Rice, Equimeasurable rearrangements of func-tions, Queens Papers in Pure and Applied Mathematics, 28, 1971.

[Cn] Chong, K.M., Doubly stochastic operators and rearrangement theo-rems, Journal of Mathematical Analysis and Applications, 56, 309– 316, 1976.

[Ch] Choquet, G., Lectures on Analysis (vol. 2), Benjamin, New York, 1969.

[Da] Dana, R-A., A representation result for concave Schur concave func-tions, Mathematical Finance, to appear.

[GS] Gilboa, I. and D. Schmeidler, Maxmin expected utility with non-unique prior, Journal of Mathematical Economics, 18, 141-153, 1989.

[GRS] Gibbons, M. R., S. A. Ross and J. Shanken, A test of the efficiency of a given portfolio, Econometrica, 57, 1121-1152, 1989.

[HS] Hansen, L. and T. Sargent, Robust control and model uncertainty,

American Economic Review, 91, 60-66, 2001.

(41)

[KS] Kandel, S. and R. Stambaugh, A mean-variance framework for tests of asset pricing models,Review of Financial Studies, 2, 125-156, 1987.

[LV] Liese, F. and I. Vajda,Convex Statistical Distances, Teubner, Leipzig, 1987.

[MMR] Maccheroni, F., M. Marinacci and A. Rustichini, Ambiguity aver-sion, robustness, and the variational representation of preferences, mimeo, 2004.

[MR] MacKinlay, A. C. and M. P. Richardson, Using generalized method of moments to test mean-variance efficiency, Journal of Finance, 46, 511-527, 1991.

[Ph] Phelps, R. R., Convex functions, monotone operators and differentia-bility, Springer-Verlag, New York, 1992.

[Ma] Markowitz, H. M., Portfolio selection, Journal of Finance, 7, 77-91, 1952.

[Sh-1] Sharpe, W. F., Capital asset prices: a theory of market equilibrium under conditions of risk,Journal of Finance, 19, 425-442, 1964.

[Sh-2] Sharpe, W. F., Capital asset prices with and without negative hold-ings, Journal of Finance, 46, 489-509,1991.