Random Utility Theory
3.3 Some Random Utility Models
3.3.2 The Single-Level Hierarchical Logit Model
case described: namely, that the decision-maker perceives the alternatives as com- pletely distinct, and therefore that their random residuals are independent. A more realistic choice model can be obtained by introducing a covariance between the random residuals of alternativesB andC, as shown in the following sections. In general, as shown below, a multinomial logit model has the property that any vari- ation in the choice probability of one alternative (resulting from a change in its attributes) leads to proportional variations in the choice probabilities of all other alternatives.
In applications, the multinomial logit model should be used with choice alterna- tives that are sufficiently distinct for the assumption of independent random residu- als to be plausible.
3.3 Some Random Utility Models 101 Fig. 3.5 Choice tree of a
single-level hierarchical logit model
These assumptions imply that the decision-maker perceives alternatives belonging to the same group as similar; the similarity is captured by the covariance among the overall random residuals of these alternatives. In a mode choice situation, for example, the available modes can be divided into two groups: public modes (bus and train) and private modes (car and motorbike). Assumption (3.3.8) implies that the decision-maker perceives the modes belonging to the same group to be similar inasmuch as they share a number of attributes (flexibility, privacy, etc.).
The utility structure and the choice mechanism corresponding to a single-level hierarchical logit model can be represented by a choice tree, as shown in Fig.3.5.
In the choice tree, “elementary” choice alternatives (e.g., transport modes) corre- spond to nodes with no exit links (“leaves” of the tree), whereas the root nodeo has no entering links. The intermediate nodesk, one for each group, represent com- pound alternatives: groups of elementary alternatives. The random residualsηk and τj/ k are associated with the branches that correspond to groups and to elementary alternatives, respectively.
The choice tree can be viewed as the representation of a hypothetical choice process. Starting from the root node, the decision-maker first chooses groupkfrom the available groups (represented by nodes linked to the root); she then chooses elementary alternativejfrom those belonging to groupk(represented by the leaves connected to the nodek). The expression for the overall choice probability of an alternativej,p[j], is obtained as the product of the conditional probabilityp[j/ k]
of choosing elementary alternativej within groupk(lower level), multiplied by the probabilityp[k]of choosing groupk(upper level):
p[j] =p[j/ k] ·p[k] (3.3.9) The name of the model is derived, in fact, from this probability structure.
To specify the probabilities in (3.3.9), further assumptions on the distribution of random residuals must be introduced. For the single-level hierarchical logit model, it is assumed that the random residuals relative to the alternatives available at each de- cision node (the root and the intermediate nodes) are identically and independently distributed Gumbel random variables.
Considering first the lower-level nodes (elementary alternatives), the residuals τj/ k are assumed to be i.i.d. Gumbel variables with zero mean and the same pa- rameterθ for all groupskand all alternatives j. In the choice among alternatives belonging to groupk, the perceived utility associated with alternativej, Uj/ k, can
be expressed as
Uj/ k=Vj+τj/ k ∀j∈Ik, ∀k E[τj/ k] =0 ∀j∈Ik, ∀k
Var[τj/ k] =π2θ2/6 ∀j ∈Ik, ∀k
(3.3.10)
Under these assumptions, the conditional choice probability of the elementary alternativej can be expressed as
p[j/ k] =Pr[Uj/ k> Ui/ k] =Pr[Vj−Vi> τi/ k−τj/ k] ∀i∈Ik, i=j (3.3.11) and, given the assumptions on the distribution of the residuals τj/ k, probability (3.3.11) can be expressed as a multinomial logit model:
p[j/ k] = exp(Vj/θ )
i∈Ikexp(Vi/θ ) (3.3.12)
At the upper level, the choice is made among groups of alternatives, with each groupkbeing considered as a compound alternative. Groupkwill be chosen if any one of the elementary alternatives belonging to it is chosen. Because the perceived utilities of elementary alternatives are random, the probabilityp[k] that group k is chosen is the same as the probability that one of its elementary alternatives has the maximum perceived utility among all elementary alternatives in the choice set.
Equivalently, probabilityp[k]can be obtained by assigning to groupkan inclusive perceived utilityUk∗equal to the utility of its most attractive alternative, that is, the maximum utility of all the elementary alternatives belonging to the group
Uk∗=max
j∈Ik
{Uj} =max
j∈Ik
{Vj+τj/ k} +ηk (3.3.13) which is again a random variable. The probability that groupk is chosen is then the probability that its inclusive perceived utilityUk∗is greatest among the different groups.
The perceived utilities Uj =Vj +τj/ k of the various alternatives j in group kare, by assumption (3.3.8), independently distributed Gumbel variables with the same scale parameterθ. As stated earlier, the maximum of a set of such random variables is also distributed as a Gumbel variable with parameterθand with mean equal to:
Vk∗=E[Uk∗] =E maxj∈Ik
{Vj+τj/ k}
=θln
j∈Ik
exp(Vj/θ )=θ Yk (3.3.14)
whereVk∗ is the Expected Maximum Perceived Utility (EMPU) or inclusive sys- tematic utility andYk is the corresponding logsum variable. In the expression for the inclusive perceived utility (3.3.13), the r.v. max(Vj+τj/ k)can be replaced by
3.3 Some Random Utility Models 103 its expected value plus a deviationτk∗12from this value, which is another zero-mean Gumbel variable with parameterθ. Then:
Uk∗=θ Yk+τk∗+ηk=θ Yk+εk∗ (3.3.15) Thus, the perceived utility of groupkhas a mean valueθ Yk and a deviationεk∗, which is the sum of the two zero-mean random variablesτk∗andηk. The basic as- sumption of the hierarchical logit model is that at each choice level the random residuals of the available alternatives are i.i.d. Gumbel variables; that is, it is as- sumed that theεk∗are i.i.d. Gumbel variables with zero mean and parameterθo, with ηk distributed in a way that makes this so:
E εk∗
=0 ∀k Var
εk∗
=π2θo2/6 ∀k (3.3.16)
In accordance with this assumption, the choice probability of group k is ex- pressed by a multinomial logit model. In fact:
p[k] =Pr
Uk∗> Uh∗
=Pr
θ Yk−θ Yh> εh∗−ε∗k
∀h=k and, given the results of the previous section:
p[k] = exp(θ Yk/θo)
hexp(θ Yh/θo)= exp(δYk)
hexp(δYh) (3.3.17) whereδis the ratio of parametersθandθoassociated with the two choice levels:
δ=θ/θo (3.3.18)
Replacing expressions (3.3.12) and (3.3.17) in (3.3.9), the choice probability of the generic elementary alternativej is obtained:
p[j] =p[j/ k] ·p[k] = exp(Vj/θ )
i∈Ikexp(Vi/θ )· exp(δYk)
hexp(δYh) (3.3.19) Variances and covariances of the random residualsεj of the elementary alter- natives’ overall perceived utility (3.3.8) can also be derived. The variance of εj coincides with that of the random residualεk∗because the two variables are the sum of the same variable(ηk)and another independent Gumbel variable (τk∗ andτj/ k, respectively) with zero mean and the same parameterθ. Therefore:
Var[εj] =Var εk∗
=π2θo2/6 ∀j (3.3.20)
12From the Gumbel variable’s property of stability with respect to maximization, the r.v.τk∗ is distributed like the variableτj/ kassociated with each alternativejbelonging to groupk, that is, as a Gumbel variable with zero mean and parameterθ.
The variance of the random residualεjis identical for all elementary alternatives.
There is also a positive covariance between the random residuals of any pair of alternativesiandj belonging to the same group. In fact:
Cov[εi, εj] =E
(ηk+τi/ k)·(ηk+τj/ k)
=E η2k
+E[ηkτj/ k] +E[ηkτi/ k] +E[τi/ kτj/ k] ∀i, j∈Ik
Because all the variablesηk, τi/ k, and τj/ k have zero mean and are mutually independent, the first term is equal to the variance ofηk and the others are zero, because they are the covariances of independent random variables:
Cov(εi, εj)Var(ηk) ∀i, j∈Ik (3.3.21) However, if two elementary alternativesiandj belong to different groups, all the terms are zero and so also is the covariance betweenεi andεj.
The variance ofηk can be expressed as a function of the two parametersθ and θo:
Var[ηk] =Var[εj] −Var[τj/ k] =π2(θo2−θ2)
6 ∀k (3.3.22)
From the previous results, the structure of the random residual variance–
covariance matrix can be determined. The elements of the main diagonal are all equal to the variance of the residualsεj, expressed by (3.3.20). The covariance be- tween each pair of alternatives belonging to the same group is the same and equal to the value given by (3.3.21) and (3.3.22), whereas the covariance between alter- natives belonging to different groups is zero. Therefore, if the alternatives of each group are ordered sequentially, the resulting variance–covariance matrix has a block diagonal structure. Figure3.6shows a choice tree and the corresponding variance–
covariance matrix.
It is also possible to express the coefficient of correlation between the perceived utilities of two alternativesiandjas a function of the basic model parameters:
ρij=
⎧
⎨
⎩
Cov[εiεj]
Var[εi]1/2Var[εj]1/2 =θo2−θ2
θo2 =1−δ2 ifi, j∈Ik
0 otherwise (3.3.23)
The parametersθ, θo, andδplay a major role in the structure of the hierarchical logit model and in determining the choice probabilities.
First, parameterδdefined by (3.3.18) must take on values in the interval[0,1]. It is defined by the ratio between two nonnegative quantities and, because the variance ofεj(π2θo2/6)cannot be less than that of one of its componentsτj/ k(π2θ2/6), the following must hold.
θo≥θ → 0≤δ≤1
As the variance ofτj/ ktends to that ofεj (i.e., asθtends toθo), parameterδtends to one. In this case, the variance ofηk(3.3.22) and the covariance between two alter-
3.3 Some Random Utility Models 105
car motorcycle
walking bus metro
π2 6
⎡
⎢
⎢
⎢
⎣
car motorcycle walking bus metro
θo2 θo2−θ2 0 0 0
θo2−θ2 θo2 0 0 0
0 0 θo2 0 0
0 0 0 θo2 θo2−θ2
0 0 0 θo2−θ2 θo2
⎤
⎥
⎥
⎥
⎦
Fig. 3.6 Choice tree and variance–covariance matrix of a single-level hierarchical logit model
natives belonging to the same group (3.3.21) both tend to zero, and the hierarchical logit model (3.3.19) reduces to the multinomial logit model.
This can be seen by substitutingδ=1 in (3.3.19), yielding:
p[j] = exp(Vj/θ )
i∈Ikexp(Vi/θ )· exp[ln
i∈Ikexp(Vi/θ )]
hexp[ln
i∈Ihexp(Vi/θ )]
= exp(Vj/θ )
h
i∈Ikexp(Vi/θ ) (3.3.24)
which is a multinomial logit model with a different expression for the summation in the denominator.
If the variance ofτj/ k tends to zero (i.e.,θtends to zero), parameterδ will also tend to zero. In this case, the two probabilities in the model (3.3.19) will be modified as follows.
– The conditional choice of an elementary alternative within a group degenerates into a deterministic choice of the alternative with maximum systematic utility:
θ→0limp[j/ k] =lim
θ→0
exp(Vj/θ )
i∈Ikexp(Vi/θ )=
1 ifVj=maxi∈Ik(Vi)
0 otherwise (3.3.25)
– The systematic utilities of alternative groups, equal toθ Yk, assume the value of the maximum systematic utility among the elementary alternatives in each group:
θ→0limθ Yk=lim
θ→0θln
i∈Ik
exp(Vi/θ )=max
i∈Ik
(Vi)
The choice probability of the group therefore becomes p[k] = exp[maxi∈Ik{Vi}/θo]
hexp[maxi∈Ih{Vi}/θo] (3.3.26) Thus, if parameterδis zero, the random residuals associated with the conditional utilities of elementary alternatives within a group are zero(Var[τj/ k] =0). In this case, the choice between groups is modeled by comparing, using a probabilistic logit model, the alternatives having maximum systematic utility within each group:
a random residual at the group level still exists, and the maximum utility alternative is deterministically chosen within each group.
Some special cases of the model presented can be analyzed. If a groupkconsists of a single alternativej, then p[j/ k] =1 and the general expression (3.3.19) for this alternative becomes
p[j] = exp(Vj/θo) exp(Vj/θo)+
h=kexp(δYh) (3.3.27) In some applications of the single-level hierarchical logit model, and in particular for systems of partial share models covered in the next chapter, the systematic utility Vjof alternativej in groupkis decomposed into two parts: one part,Vk, associated with groupk itself; and a second part,Vj/ k, associated with the alternative within the group:
Vj=Vk+Vj/ k (3.3.28)
This decomposition leads to an alternative formulation of the choice probabilities p[j/ k]andp[k]. By replacing (3.3.28) in (3.3.12) and (3.3.17), respectively, it fol- lows that
p[j/ k] = exp(Vj/θ )
i∈Ikexp(Vi/θ )= exp[(Vk+Vj/ k)/θ] exp(Vk/θ )·
i∈Ikexp(Vi/ k/θ )
= exp(Vj/ k/θ )
i∈Ikexp(Vi/ k/θ ) (3.3.29)
and
p[k] = exp(Vk/θo+δYk′)
hexp(Vh/θo+δYh′) (3.3.30) because
δYk=δln
j∈Ik
exp(Vj/θ )=δln
j∈Ik
exp
(Vk+Vj/ k)/θ
=δln
exp(Vk/θ )·
j∈Ik
exp(Vj/ k/θ )
=δVk/θ+δln
j∈Ik
exp(Vj/ k/θ )
=Vk/θo+δYk′
3.3 Some Random Utility Models 107 whereYk′ is the logsum variable of groupkobtained with the alternative specific systematic utilitiesVj/ k.