Use of Mixed Revealed-Preference and Stated

Use of Mixed Revealed-Preference
and Stated-Preference Models
with Nonlinear Effects in Forecasting
Elisabetta Cherchi and Juan de Dios Ortúzar
dent. However, use of SP data in forecasting is not recommended
because it is hypothetical and may not account for certain types of real
market constraints (4). Another important problem is that apparently
good models can be obtained with almost any SP survey, but if the
technique is not applied appropriately (for example, use of a noncustomized design in a general context instead of focusing on specific
behavior), serious problems may remain undetected until forecasts are
compared with actual outcomes (5).
Thus the recommended approach involves using both data sources
jointly (1, 2, 4). The mixed RP-SP approach has now been used in
many applications, both in research and in practical work, even with
very complex model structures. However, as is often the case, most
of the attention has been given to estimation, leaving the correct use
of these much-improved models in forecasting in need of better
understanding.
An in-depth review of the literature shows that many papers consider the joint RP-SP estimation problem, but few discuss the use of
these models in forecasting. Making predictions with a mixed RP-SP
model does not imply major problems when models are estimated
with all attributes being generic between RP and SP. However, this is
not always the case. Often, some attributes can be measured only in
one data set or, although available, can be estimated properly only
in one data set. Moreover, because of differences in the nature of
the attributes, different RP and SP parameters can be estimated,
both highly significant. In fact, it is likely that complete preference
homogeneity does not hold for multiple data (6). Finally, differences
between RP and SP parameters might be implicit in the need for SP
data, or even what is looked for when SP data are used, especially when
what is wanted is to test in prediction structural change (i.e., depart
from the current real market) and when utilities are not linear in the
attributes. It is interesting that often nonlinearities are included
only in the SP data, so the problems of nonlinear utility specifications
and mixed RP-SP estimation are strictly connected. Moreover,
although some papers (7–9) agree that often the typical linear utility
assumption does not allow one to correctly reproduce individual
behavior, no evidence exists on the effects of not accounting for the
nonlinear part of utility in forecasting.
This paper investigates the problem of using mixed RP-SP models
in forecasting and, in particular, with the effect of nonlinearities estimated in a mixed RP-SP model but specific only for the SP data. The
focus is on two types of nonlinearity that can be estimated in a mixed
RP-SP model and that often imply partial preference homogeneity.
The first case is the specification of interaction terms that because of
correlation problems in the RP data are normally included only in the
SP data. The second case occurs when different parameters are estimated for the RP and SP environments. Since in prediction only the
Estimation of mixed revealed-preference (RP) and stated-preference (SP)
data has become almost common practice. However, most attention has
gone to estimation, with the correct use of these much-improved models
in forecasting left in need of better understanding. In particular, their
potentiality in accounting for nonlinear effect has not been fully explored,
either in estimation or in prediction. The problem of using an RP-SP
mixed model for accounting for nonlinear effects in the attributes in forecasting, especially when the nonlinearities have been specifically estimated only for the SP data, is addressed. By using a mixed RP-SP data set
gathered especially for a modal choice context in Cagliari, Italy, several
nested logit models with nonlinear systematic utility functions were estimated and used to provide empirical evidence on the effect of incorrect
use of RP-SP specifications in forecasting under different policies. It was
concluded that the partial data enrichment approach is well suited to treat
nonlinearities and allows for consistent improvements in model estimation results. More important, nonlinearities might impose severe limitations in the applicability of these models in forecasting because of more
restrictive microeconomic conditions. Failure to treat these problems correctly can lead to serious forecasting errors, even in the case of relatively
mild policies.
The joint revealed-preference (RP) and stated-preference (SP) estimation method was first proposed in the early 1990s (1); since then it has
become recommended practice because it allows one to exploit the
advantages of each type of data and to overcome their limitations (2).
It is well known that RP data, as based on observations of actual
choices, need a large number of observations, may include only
existing alternatives, and require defining the choice set and calculating detailed and costly level-of-service information for all available options. Moreover, variables such as cost and time are often
correlated, and if data are not measured with a high level of precision, appropriate model structures and functional forms may not be
selected, which may lead to unknown bias in forecasting (3).
Conversely, SP experiments allow eliciting of individual preferences for alternatives that do not exist in reality or that individuals
have never experienced. They also allow researchers to have goodquality information (design under the analyst’s control) at a relatively
small cost, since many observations can be obtained for each responE. Cherchi, Centro Ricerche Modelli di Mobilità, Dipartimento di Ingegneria del
Territorio, University of Cagliari, Piazza d’Armi, 16, Cagliari, CA, 09123, Italy.
J. de D. Ortúzar, Department of Transport Engineering, Pontificia Universidad
Católica de Chile, Casilla 306, Cod. 105, Santiago 22, Chile.
Transportation Research Record: Journal of the Transportation Research Board,
No. 1977, Transportation Research Board of the National Academies, Washington,
D.C., 2006, pp. 27–34.
27
28
Transportation Research Record 1977
RP environment is used, when mixed RP-SP models are estimated
under the assumption of partial preference homogeneity, there is a
problem with how to introduce the specific SP attributes in the RP utility. The effect of not accounting for nonlinearities and for SP specific
attributes in forecasting is also explored, providing further empirical
evidence.
Given two sources of data, say, one coming from an RP survey and
the other from an SP survey, omitting individual q for simplicity, let
the random utility functions associated to alternative j be specified as
follows:
U
= β ′X
RP
j
+ α′Y
RP
j
+
RP
j
SP
SP
SP
U SP
j = β ′X j + γ ′ Z j + j
RP
j
≈ ( o,σ
)
)
2
SP
j ≈ ( o, σ SP
(1)
RP ≠ SP because of the different unknown scale parameters
where involved. As noted by Hensher, using mixed RP-SP data to estimate
choice models does not mean to “simply join the data” (10); the
scale factor in the indirect utility function must be considered. As
the scale factor depends on the standard deviation of the error terms
in the sample [for example, in the multinomial logit (MNL) it is
λ = π / 6 σ], two identical models estimated with different data
are very likely to give different estimated parameters, even if the
individual choice process is the same.
An efficient and correct way to combine two different data sources
(1) is to scale one data set to achieve the same variance in both. It does
not matter what utility is scaled; however, commonly the SP utility is
scaled,
(3)
where to comply with the joint estimation requirement, φ must be such
that φ σSP = σRP; or since in the MNL one has
6 λ SP
and
σ RP =
π
6 λ RP
2
φ SP
j ≈ ( 0 , σ RP )
(5)
and the joint likelihood function is
L=∏
e
V jRP
∑e
V jRP
i
j ∈I RP
Finally, RP and SP are random terms associated to the RP and SP util2
2
ities, respectively, the variances of which, σRP
and σSP
, will be generally different. (The variance is associated to the data used to estimate
the utility.)
First, if two different models using only RP or only SP data were
estimated, one would get the following estimates for the parameters:
α RP = λ RP α ; β RP = λ RPβ
(22)
γ SP = λ SP γ ; β SP = λ SPβ
π
(4)
Therefore, the new (scaled) utility function for the SP data set becomes
RP
, , = vectors of true parameters to be estimated,
XRP, XSP = vectors of attributes common to both data sets, and
YRP, ZSP = vectors of attributes specific to each type of data
[including alternative-specific constants (ASC)].
σ SP =
λ SP
λ RP
2
RP
where
= φU SP
U RP
j
j
φ=
SP
SP
SP
SP
U SP
j = φU j = φβ ′ X j + φ′ Z j + φ j
SP
V jSP
j
JOINT ESTIMATION WITH RP-SP DATA
RP
j
the scale factor should be
∏
SP
e
V jSP
∑e
(6)
V jSP
j ∈I SP
where the two choice sets in the RP (I RP) and SP (I SP) data sets might
be different.
The actual effect of scaling the SP utility by φ is that of forcing the
SP utility to have the same scale parameter of the RP utility. Therefore,
when the RP-SP data are used to estimate a model jointly, the overall
log-likelihood function is scaled only by the unknown (inestimable)
RP factor scale of the Gumbel distribution:
L=∏
RP
e
λ RPV jRP
∑e
j ∈I RP
λ
RP
V jRP
i∏
SP
e
(
λ RP φ V jSP
∑e
λ
RP
)
(φ V )
SP
j
(7)
j ∈I SP
In fact, whatever method is used to estimate the parameters, the scale
parameter φ will also be estimated in the process. In particular, in the
simultaneous estimation (2), Equation 7 is solved as a special nested
logit structure, where φ represents the common scale parameter
associated to the SP nests, yielding the following estimates:
αˆ = λ RP α ; βˆ = λ RPβ; γˆ = λ RP γ
(8)
ˆ , ˆ ) their
where (, , ) are the true population parameters and (ˆ , estimates. Note that since φ always multiplies some parameters, its estimated value is the inverse of the SP scale parameter deflected by the
RP scale parameter (φ̂ = λSP/λRP ). Also, the RP scale parameter of
the joint estimation will be a function of both the RP and the SP
environment, that is, λRP = f(, , ; XRP, XSP, YRP, ZSP), and thus,
it generally will be different from the RP scale parameter associated
with use of only the RP data set; in this latter case one would obtain
λRP = f RP (, ; XRP, YRP ).
Rescaling is based on the data enrichment paradigm; this has the
implicit assumptions that SP data are pooled only to enrich RP
information that may be deficient and that the true parameters are
invariant whatever method is used to elicit the individual preferences. Notwithstanding, there are various reasons why, after controlling for scale differences, the common parameters may still show
some differences.
Differences in the common parameters for both data sets do not
prevent use of the mixed RP-SP approach. As long as there is at least
one parameter common to both data sets, a partial data enrichment
approach can be applied. This is an important point, as one of the
major powers of the joint RP-SP estimation is that it allows use of
each source of data to capture those aspects of the choice process
for which it is superior. In particular, this is crucial to account for
Cherchi and Ortúzar
29
nonlinearities because measurement problems, high correlations,
and limited trade-offs often make RP data unsuitable for testing
nonlinear behavior.
FORECASTING WITH MIXED RP-SP MODEL
WITH NONLINEARITIES
To use a model estimated with mixed RP-SP data for prediction purposes, only the RP environment should be considered, since it represents real behavior. Thus, even if a joint RP-SP model is built to get
better estimates, when the joint model is used in forecasting, all information must refer to the RP environment. When all attributes are
generic for both sets, making predictions with a mixed RP-SP
model does not imply additional problems, because the parameters
are constrained to be the same in the RP and SP environments and
are estimated by using both data sets.
However, in the partial preference homogeneity approach, when
parameters are estimated specific for SP data, one must address the
problems of whether to use in forecasting the RP or the SP parameters and whether the SP parameters should be scaled. The common
rule that all parameters moved from the SP to the RP environment
should be scaled refers to the case of the ASC and has been explicitly discussed in the literature (11–13). The same consideration cannot be extended to parameters associated to real variables, as these
are of a different nature to the ASC. In fact, the ASC are not associated to data; rather, they tend to reproduce the market shares in the
sample (12).
Scaling an SP parameter by φ corresponds to measuring the individual preferences in the SP environment. Since any parameter estimated in a mixed RP-SP model is deflected by the unknown RP scale
parameter (see Equation 8), when specific SP parameters are included
in the RP utility function they should not be scaled. To see this, consider the simple case of an MNL and two utility functions specified
as in Equation 1. When RP and SP data are estimated jointly, the SP
utility is scaled as in Equation 7 through the maximization of the log
likelihood function:
However, as stated, the key point is that scaling is required because
the different data show different market equilibriums. In fact, what
is scaled in Equation 5 and following is the whole random utility, that
is, both the systematic utility (parameters and attributes together) and
the error term. At the same time, beyond a problem of scale, differences between RP and SP parameters associated to the same attribute
are often generated by differences in the attribute trade-offs implied
in each data set.
Therefore, the problem of using parameters estimated specific only
for SP in the RP probability (10) should be regarded as a problem of
having ˆ ZPj consistent with the new RP environment and with the
trade-offs implied in the SP data. In particular, since ˆ = λRP , this
guarantees consistency with the scale of the logit model used for prediction. The consistency with the trade-offs, instead, affects the measure of the Z attributes used in the RP environment. In fact, the
attribute values should belong to the range of variation of the SP
attribute used to estimate the parameter in both cases, that is, even if
Z can be observed in the RP environment or if only an engineering
estimate is available. In this way, the attribute trade-offs implied in
Equation 10 will be consistent with the ratio between the parameters
associated to them.
When nonlinear utilities are specified, a third set of conditions must
be verified: the microeconomic conditions on the marginal utilities. In
nonlinear modal utilities, the effect of a unit change in an attribute is
not simply equivalent to its associated parameter, as the marginal utility is also a function of the other parameters involved in the nonlinearity, including the individual characteristics. This limits the range
of variation of Z in forecasting. It is also important to note that the
microeconomic conditions need to be verified for both the estimated
model and the model to be used in forecasting; in the case of partial
preference homogeneity, these two models are not the same.
Suppose, for clarity, that the variable Z is indeed the product of X
and Y. The conditions for the estimated model use the utility functions
in Equation 9 and imply two different tests: for the RP data only (Conditions A) and for the SP data only (Conditions B), because the two
data sets are disjointed:
ˆ
L=∏
ˆ
RP
exp ( λ RP) X RP
+
λ
) Y RP
(
j
j
[
∑%
RP
A:
]
∂V jqRP
∂X RP
jq
j
ˆ
i
∏
ˆ
RP
SP
RP
exp φ ( λ ) X j + φ ( λ ) Z SPj
[
∑%
SP
B:
]
(9)
Multiplying the whole SP utility by a scale factor (as in Equation 5)
makes the estimated parameters, in both environments, scaled by the
same RP scale parameter. Therefore, when a specific SP parameter
is used in the RP environment, the model probability is
⎛
⎞
⎜⎝ λ RP ⎟⎠
RP
PjPR =
exp
[ (βˆ ) X
RP
j
]
∂V SP
jq
ˆ + ˆ X SPjq
= ˆ + ˆ Y SP
=
;
jq
∂Y SP
jq
(11)
Microeconomic conditions for the model to be used in forecasting
apply instead to the specification used in Equation 10 and require that
the following conditions be satisfied only by the RP data:
∂V jqRP
∂X RP
jq
⎛
⎞
⎜⎝ λ RP ⎟⎠
+ ( αˆ ) Y Rj P + ( ˆ ) Z Pj
∑%
∂V SP
jq
∂X SPjq
j
(λ)
∂V jqRP
ˆ
= ˆ ;
=
∂Y RP
jq
∂V jqRP
ˆ + ˆ X RP
= ˆ + ˆ Y RP
=
;
jq
jq
∂Y RP
jq
(12)
(10)
j
where ZP denotes an engineering set of attributes that define the policy
to be forecasted.
DATA SET
In the wake of changes all over Europe, the local rail company in
Sardinia, Italy, decided to transform the rail system for an urban
corridor in Cagliari (with 200,000 inhabitants and approximately
30
19,000 trips per day) into a metropolitan-like service by introducing
major changes in speed, frequency, comfort, ticketing, number of
stations, and so forth. The corridor was served by a major highway
and two other transport modes: a railway line and an urban bus, with
quite different levels of service, to the extent that the private car was
by far the dominant mode, having more than 80% of trips versus
fewer than 15% by bus and barely 3% by train. In 1998 a survey was
set up to build a data bank for a modal choice context. The method used
for building the data bank incorporated three kinds of surveys (7):
a qualitative survey using focus groups to gain a good understanding
of the phenomenon, an RP survey for describing current travel, and
an SP survey for evaluating the introduction of a revamped train
commuter option.
To collect data on current trips, a 24-h travel diary survey filled in
personally by each respondent was administered to 900 individuals,
and several socioeconomic characteristics relative to each individual
and family were collected as well. A maximum of 10 trips described
in considerable detail was recorded. Moreover, specific questions
were included to clarify whether the person who made the trip was
the person who paid for it and chose the mode of transport. Also, a
list of nonavailable alternatives was required, stating the reasons why
these were not available. This information allowed distinguishing
between objective and subjective availability, and only the first kind
was considered.
The SP survey, conducted on a selected subsample (300 individuals) of people who answered the RP questionnaire, had the objective of expanding the RP data set and checking commuter responses
to the major changes in the train alternative (i.e., the current train
service with far superior characteristics). A binary choice experiment
between the proposed new train service and the current transport
mode was used. The SP experiment did not consider the current train
users because the new train service proposed in the SP design was
superior to the current system in all characteristics (including the fare).
No other significant categories of user were present in the study corridor. The design included four variables at three levels (trip time,
cost, frequency, and comfort) for bus users and three variables at three
levels (trip time, cost, and frequency) and one variable (comfort) at
two levels for car users. Several other variables were analyzed and
tested in pilot surveys. For example, walking time was not included
in the final design because the railway line was fixed, and thus walking time could not freely be modified without departing from the real
context. However, because of its importance, the walking time for
each individual in the new situation was calculated and included in the
front page of the SP form as general information. Walking time was
computed directly from the distance walked, on the basis of known
origin and destination points. Also, depending on each user’s final
destination, parking availability was checked and average parking
time calculated. The experimental design allowed estimation of twoterm interactions to account for nonadditive effects of cost, frequency,
and travel time.
After a careful check of the data, a final sample (i.e., mixed RP/SP
data set) of 1,396 observations, composed of 338 RP individuals and
1,058 SP pseudoindividuals, was used for the model estimation.
Details on the data set are available elsewhere (7 ).
MODELS ESTIMATED
AND DEMAND FORECASTS
Several nested logit (NL) models with linear and nonlinear utility
functions, including allowance for correlation among RP options,
were estimated. As the SP design was a binary choice experiment
Transportation Research Record 1977
between car and rail for current car users and between bus and rail
for current bus users, correlation was tested only for the RP alternatives. Notwithstanding, a mixed RP-SP model is always a special
type of NL model, as the NL artificial structure comes from the
parameter used to scale one data set with respect to the other. The
models are summarized in Table 1. All the attributes have the usual
meaning, but “comfort” and “early/late” deserve remark. Comfort
was coded as two dummies in the public transport alternatives:
comf1 equals 1 if the level of comfort was poor, and comf2 equals 1
if comfort was sufficient, living the “good” level as reference. The
early/late variable, instead, measures by how many minutes a user
must anticipate or postpone her or his departure time to adjust to the
scheduled time of the train service. It was included for the train alternative alone to try to capture the differences between a scheduled
time system (train), a frequency (bus), and a continuous departure
time (car). Finally, all models in Table 1 were estimated by using an
expenditure rate specification (14), where the cost variable is divided
by the expenditure rate (g), which is equal to the ratio between
income and free time. As the data bank had only a small percentage
of self-employed, it was found (8) that this yielded superior results
to the more typical specification, where the cost variable is divided
by the wage rate (15).
When RP and SP data are used jointly to estimate models, it is
good practice to first estimate separate models for each data set to
detect which attributes are candidates to be generic in both sets. This
analysis suggested that all the linear and nonlinear variables (12, 16)
were good candidates for such a generic estimation, except walking
time. Table 1 reports the results of the joint RP-SP estimation. In particular, Models NL1 and NL2 clearly show that the specification
improves significantly [likelihood ratio (LR) = 99.99%] when walking time is specific, rather than generic, in RP and SP. However, the
marginal utility in the RP data is seven times larger than that in the SP
data. Although from a pure modeling-exercise point of view this is
quite a good model, using it to test policies implying a change in
walking time could be quite risky, because depending on whether the
RP or the SP parameter is used, very different predictions could be
obtained.
As for interaction terms, results in Table 1 show that they improved
significantly model results. In fact, Model NL2 is clearly superior to
Model NL3 (LR = 99.99%). Moreover, specifying the interactions
only in the SP alternatives clearly gave the best results. As illustrated
in Model NL4, interactions in the RP data do not enrich the data;
when specified as RP-SP generic, the interactions are not significant,
and the log likelihood also worsens for Model NL2.
Although Model NL2 appears to be very good from a statistical
point of view, from a microeconomic point of view it is quite poor.
Checking the microeconomic conditions on the marginal utility
shows that the number of individuals who do not satisfy them is quite
large. The first three columns in Table 2 show the number of observations that do not satisfy the microeconomic conditions for the estimated models. In all cases except Model NL4, only SP data do not
satisfy the microeconomic conditions, because the RP utility is linear
in the attributes. Model NL3 has not been included because the utility is linear in the attributes in both the RP and the SP environment.
The values in the first three columns of Table 2 have been calculated
by using the conditions presented in Equation 11.
To improve the model microeconomically, the reason these individuals had a counterintuitive marginal utility was carefully checked.
It was found that the problem arose mainly for individuals with a combination of high travel time and fare and low expenditure capability.
Model NL5 shows the results obtained by excluding those individuals. As can be seen, this is a very good model; although not superior
Cherchi and Ortúzar
31
TABLE 1
Model Estimation Results
Attribute
NL1
NL2
Walking time (RP)
−0.0439
(−1.8)
−0.1963
(−3.2)
—
Walking time (SP)a
—
NL3
NL4
NL5
−0.04575
(−2.1)
−0.09
(−1.9)
−0.2221
(−4.0)
−0.02621
(−1.7)
−0.04476
(−1.5)
−0.1191
(−2.0)
−0.2237
(−3.9)
−0.0227
(−1.6)
—
−0.06187
(−2.1)
−0.1699
(−2.9)
−0.2165
(−3.4)
−0.03998
(−2.0)
−0.03593
(−2.7)
0.4719
(3.3)
−3.106
(−3.6)
−1.541
(−3.3)
−1.148
(−2.4)
−0.1920
(−2.3)
12.19
(3.2)
0.00127
(2.9)
−0.00692
(−2.1)
—
−0.006689
(−0.8)
0.2334
(1.9)
−2.311
(−2.2)
−1.134
(−2.1)
−0.743
(−1.6)
−0.2944
(−3.3)
8.838
(2.5)
—
−0.0186
(−1.4)
0.2594
(1.9)
−2.028
(−2.2)
−0.9987
(−2.0)
−0.7488
(−1.7)
−0.2867
(−3.3)
9.444
(2.5)
—
−0.04833
(−3.0)
0.4745
(3.2)
−3.172
(−3.6)
−1.453
(−3.1)
—
—
Traveltime*fare
−0.0660
(−2.6)
−0.03278
(−2.9)
0.5944
(3.8)
−3.261
(−4.0)
−1.608
(−3.6)
−1.262
(−2.5)
−0.2203
(−2.7)
11.38
(3.2)
0.00119
(2.9)
−0.01026
(−3.0)
—
−0.0619
(−2.2)
−0.2043
(−3.2)
−0.2123
(−3.5)
−0.0369
(−2.0)
—
—
Traveltime*freq
—
—
—
K_car (RP+SP)
1.624
(1.8)
−0.9868
(−2.9)
0.4091
[3.93]
0.6179
(3.8)
[2.01]
−744.3223
1.677
(1.8)
−1.044
(−2.8)
0.2988
[6.74]
0.6704
(3.5)
[1.71]
−735.5511
0.1255
(0.2)
−0.7164
(−2.0)
0.2962
[1.99]
0.8827
(2.1)
[0.279]
−747.6844
0.000518
(1.7)
−0.002257
(−0.8)
0.7863
(1.0)
−0.6820
(−2.0)
0.2813
[6.53]
1.031
(2.1)
[0.06]
−744.8081
1.611
(1.7)
−0.9548
(−2.6)
0.3170
[6.15]
0.6355
(3.4)
[1.97]
−672.7029
0.1365
1,396
0.1467
1,396
0.1326
1,396
0.1360
1,396
0.2688
1,289
Travel time PT
Travel time car
Walking time
Cost/g
Frequency
Comfort 1
Comfort 2
Transfer
Early/late (RP)
Car/licenses (RP)
Traveltime*fare (SP)
Traveltime*freq (SP)
K_train (RP+SP)
f1 (EMU)b
f2 (SP scale factor)b
Log likelihood
ρ2(C)
Sample size
−0.9008
(−1.9)
−0.1816
(−2.2)
11.29
(3.2)
0.00097
(2.0)
−0.00597
(−1.8)
—
—
NOTE: Values in parentheses are t-statistics.
EMU = expected marginal utility.
a
Car walking time introduced only in the RP alternatives.
b
Values in brackets are t-statistics with respect to 1.
to Model NL2 for the t-test of some parameters, it is globally better
(the ρ2 value is higher). Moreover, the number of individuals who do
not satisfy the microeconomic conditions is significantly reduced.
The microeconomic conditions need to be verified also for the
model used in forecasting, and, as discussed previously, this puts a
limit to the policies that can be tested. Table 3 shows the limits that
are implicit in the estimated models, that is, the maximum value (or
better combination of values) that each attribute can assume to satisfy
the microeconomic conditions implicit in each specific model. These
values also define the range of policies that can be tested with each
model. As can be seen, Model NL2 also has the narrowest range of
policies that can be tested in forecasting.
Finally, the last three columns in Table 2 report the number of individuals in the RP environment whose marginal utility has an incorrect
sign when the models are used to forecast, that is, by using the conditions illustrated in Equation 12. It clearly appears that Model NL5 is
superior to all the other models. Moreover, it is important to note that
in nine of the 13 cases in which the marginal utility of the bus fare is
positive, the microeconomic condition is at the limit (i.e., the travel
time is equal to 50 min instead of 49.87, as in Table 3).
32
Transportation Research Record 1977
TABLE 2
Number of Observations Not Satisfying Microeconomic Conditions
RP+SP Observations
RP Observations
Conditions on
Estimated Models
Conditions on Models
Used in Forecasting
Number of Individuals with
NL2
NL3
NL4
NL2
NL3
NL4
MU(travel time by train) > 0
MU(travel time by car) > 0
MU(travel time by bus) > 0
MU(cost by train) >0
MU(cost by car) > 0
MU(cost by bus) > 0
MU(frequency by train) < 0
MU(frequency by bus) < 0
234
110
25
175
0
322
0
0
184
13
12
84
0
216
0
0
38
17
0
6
0
20
0
0
48
1
19
53
0
160
1
0
15
0
3
19
0
82
0
0
19
1
2
3
0
13
1
0
MU = marginal utility.
To test the effect of misusing RP-SP specifications in prediction,
the variation in aggregate market shares for various policy measures
was computed. The response to a change in the prediction was calculated as the percentage change in the aggregate share of mode j over
the initial situation (do-nothing):
ΔPj =
Pj − P j0
(13)
P j0
where Pj0, Pj are the aggregate probabilities of choosing mode j before
(do-nothing) and after introducing the measure, calculated by sample
enumeration.
By using the estimated models, two types of analyses aimed at
evaluating the effect of misusing RP-SP specifications on market
share predictions are described here. First, the effect of using the RP
or the SP parameter (when different RP and SP parameters are estimated) is investigated. Second, the effect of using only RP information in forecasting is tested, and, in particular, the effect of not
including the interactions in prediction and the effect of not satisfying the microeconomic conditions are tested. In all cases, the policy
tested implied a change in only one attribute (walking time or cost)
at a time.
Table 4 shows the variation in market shares for some changes in
walking time. In particular, in Case NL2(a) the RP walking time
parameter was used; in Case NL2(b) the SP walking time parameter
was used. Following the theoretical discussion in earlier sections,
the SP attributes were not scaled. The RP walking time parameter is
roughly six times larger than the SP walking time parameter. This
reflects on the market share variation, which is systematically bigger in Model NL2(a) than in Model NL2(b). As expected, Model
NL1 produces results that stay halfway between the two versions of
Model NL2.
Table 5 shows the variation in market shares for changes in the
travel costs by train and by car, and it allows one to appreciate the
effect of not accounting for interaction terms in predictions. By comparing Model NL3 with Models NL2 and NL5, it is clear that not
including interactions systematically reduces the effect of any policy.
Note that the interaction between travel time and cost reduces the
effect of the linear variables (because they have opposite signs; see
Table 1). However, in this case a comparison is made with a model
estimated without interactions (Model NL3), where the estimated linear parameter of the cost (which is a measure of the marginal utility
of the cost for an individual with an average travel time) is about three
times bigger than the value estimated when interactions are included
(Model NL5).
Regarding the effect of the microeconomic conditions on the marginal utility, results reported in Table 6 are interesting. Table 6 shows
for each policy tested the number of individuals who do not satisfy the
microeconomic conditions. Model NL5 confirms its superiority;
Model NL2 does not perform well (because the number of individuals
who have a counterintuitive behavior is high), even when the policy
implies a strong reduction in the travel cost.
TABLE 3 Maximum Values That Can Assume Attributes to Satisfy Microeconomic
Conditions on Marginal Utility
Attribute
MU(travel time PT) < 0
Cost/g <
MU(travel time car) < 0
MU(cost) < 0
MU(frequency) > 0
Cost/g <
Travel time <
Travel time <
Frequency
1
6
12
NL2
NL3
NL4
54.18
81.43
114.13
160.87
28.29
68.19
90.66
112.42
138.53
229.66
35.86
114.93
69.94
100.71
137.63
175.15
49.82
79.48
Cherchi and Ortúzar
TABLE 4
33
Effects in Prediction of Using Different Walking Time Specifications
% Variation in Demand Predictions by Mode
Model NL1
Attribute
% Change
Walking time
by train
Walking time
by bus
TABLE 5
−5
−10
−25
−50
−5
−10
−25
−50
Model NL2(a)
Model NL2(b)
Bus
Train
Car
Bus
Train
Car
Bus
Train
Car
−1.4
−2.8
−7.1
−14.1
1.1
2.2
5.6
11.1
3.1
6.3
15.8
31.8
−2.0
−4.0
−10.0
−19.7
0.0
−0.1
−0.2
−0.5
−0.1
−0.1
−0.3
−0.7
−2.9
−6.0
−15.8
−32.5
2.6
5.1
12.1
21,7
10.5
21.6
57.5
120.7
−7.6
−14.8
−34.3
−58.4
−0.1
−0.1
−0.4
−1.2
−0.2
−0.4
−1.1
−2.4
−0.9
−1.8
−4.6
−9.1
0.7
1.4
3.4
6.9
1.8
3.5
8.9
17.9
−1.1
−2.2
−5.4
−10.8
0.0
−0.1
−0.1
−0.3
0.0
−0.1
−0.2
−0.4
Effects in Prediction of Including SP Specific Interaction Terms
% Variation in Demand Predictions by Mode
Model NL2
Attribute
Fare by train
Cost by car
Model NL3
Model NL5
% Change
Bus
Train
Car
Bus
Train
Car
Bus
Train
Car
−5
−10
−25
−50
−5
−10
−25
−50
−0.1
−0.2
−0.5
−1.0
0.0
0.0
0.1
0.2
0.3
0.7
1.7
3.8
0.0
−0.1
−0.1
−0.3
0.0
0.0
0.0
0.0
0.0
0.0
0.0
−0.1
−0.1
−0.3
−0.6
−1.3
0.1
0.1
0.4
0.7
0.8
1.7
4.2
8.6
0.1
0.2
0.5
1.0
0.0
0.0
0.0
−0.1
−0.1
−0.1
−0.3
−0.6
−0.5
−1.1
−2.8
−5.7
0.2
0.4
1.0
2.0
2.3
4.6
11.9
24.4
0.2
0.4
1.1
2.4
0.0
0.0
−0.1
−0.2
−0.1
−0.3
−0.7
−1.4
have been reported, and some important issues (about moving
from estimation to prediction) do appear not to have been reported
before.
This report considered the problem of using mixed RP-SP models
in forecasting and analyzed the effect in predictions of using nonlinearities estimated in a mixed RP-SP model as specific variables of
the SP data. The potentiality of the partial data enrichment approach
in accounting for nonlinearities in the attributes was demonstrated; it
CONCLUSIONS
The mixed RP-SP approach has received a great deal of attention
over the years, and many major advances have been experienced
both in theory and in practice. Joint RP-SP estimation has also
been used in many applications, including cases with complex
utility functions and a large number of options. However, not
many applications of mixed RP-SP models as prediction tools
TABLE 6 Number of Observations Not Satisfying Microeconomic Conditions
for Different Policies
% Variation in Demand Predictions by Mode
Model NL2
Attribute
Fare by train
Cost by car
Model NL5
% Change
Bus
Train
Car
Bus
Train
Car
−5
−10
−25
−50
−5
−10
−25
−50
19
19
19
19
19
19
19
19
42
38
25
5
48
48
48
48
1
1
1
1
3
6
8
18
2
2
2
2
2
2
2
2
18
16
5
1
1
1
1
1
1
4
8
20
20
20
20
34
allows use of each data source to capture those aspects of the choice
process for which it is superior.
However, the partial preference homogeneity approach poses
some theoretical problems of whether to use in forecasting the RP
or the SP parameters and whether the SP parameters should be
scaled. This problem was analyzed in depth. It was concluded that
scaling an SP parameter corresponds to measuring the individual
preferences in the SP environment. Since any parameter estimated
in a mixed RP-SP model is deflected by the unknown RP scale
parameter, specific SP parameters should not be scaled when used
in forecasting. Moreover, as scaling is required because the different data show different market equilibriums, the problem of using
parameters estimated specific only for SP in the RP probability
should be regarded as a problem of having consistency between the
new RP environment and the trade-offs implied in the SP data.
Also discussed in depth was the problem of microeconomic conditions on the marginal utilities, specifically in the case of nonlinear
utilities and partial preference homogeneity. Nonlinear utilities
imply that the effect of a unit change in an attribute is a function of
the parameters involved in the nonlinearity. Thus the microeconomic
conditions should be verified for each individual and for each policy
tested, as they limit the range of the scenarios that can be tested.
Moreover, the partial homogeneity approach requires that the microeconomic conditions be verified for both the estimated model and the
model to be used in forecasting, because in the case of partial preference homogeneity these two models are not the same.
The empirical results demonstrate that the partial data enrichment
approach is particularly suitable to account for nonlinearities and
allows improving consistently the estimation results. In particular,
the estimation of interaction terms specific only for the SP data provided a significant improvement in model fit. However, the microeconomic conditions play a crucial role in evaluating the goodness
of a specification, as it was found that the model with the best fit was
quite poor from a microeconomic point view, with a large number
of individuals having a counterintuitive behavior.
It was found that the potential errors in predicting demand for reasonably sensible policies can be quite high. Although the magnitude
of these effects certainly depends on the specific application context,
this raises an alarm about a problem that merits further examination.
ACKNOWLEDGMENTS
The authors thank David Hensher and Joffre Swait for their comments. The authors also thank three anonymous referees for their
constructive comments. The second author acknowledges the support of the Chilean Fund for Scientific and Technological Research,
FONDECYT.
Transportation Research Record 1977
REFERENCES
1. Ben-Akiva, M. E., and T. Morikawa. Estimation of Travel Demand
Models from Multiple Data Sources. Presented at 11th International
Symposium on Transportation and Traffic Theory, Yokohama, Japan,
1990.
2. Bradley, M. A., and A. J. Daly. Estimation of Logit Models Using Mixed
Stated Preference and Revealed Preference Information. In Understanding Travel Behaviour in an Era of Change (P. Stopher and M. LeeGosselin, eds.), Pergamon Press, Oxford, United Kingdom, 1997.
3. Daly, A. J., and J. de D. Ortúzar. Forecasting and Data Aggregation:
Theory and Practice. Traffic Engineering and Control, Vol. 31, 1990,
pp. 632–643.
4. Louviere, J. J., D. A. Hensher, and J. D. Swait. Stated Choice Methods:
Analysis and Application. Cambridge University Press, Cambridge,
United Kingdom, 2000.
5. Ortúzar, J. de D., and L. G. Willumsen. Modelling Transport, 3rd ed.
John Wiley & Sons, Chichester, United Kingdom, 2001.
6. Swait, J., and A. Bernardino. Seeking Taste Homogeneity in Choice
Processes: Distinguishing Taste Variation from Error Structure in Discrete Choice Data. Intelligent Marketing System, Edmonton, Alberta,
Canada, 1997.
7. Cherchi, E., and J. de D. Ortúzar. Mixed RP-SP Models Incorporating
Interaction Effects: Modelling New Suburban Train Services in
Cagliari. Transportation, Vol. 29, 2002, pp. 371–395.
8. Galilea, P., and J. de D. Ortúzar. Valuing Noise Level Reductions in a
Residential Location Context. Transportation Research Part D: Transport and Environment, Vol. 10, 2005, pp. 305–322.
9. Hensher, D. A. Extending Valuation to Controlled Value Functions and
Nonuniform Scaling with Generalised Unobserved Variances. In Theoretical Foundations of Travel Choice Modelling (T. Gärling, T. Laitila,
and K. Westin, eds.), Pergamon Press, Oxford, United Kingdom, 1998.
10. Hensher, D. A. Stated Preference Analysis of Travel Choice: The State
of Practice. Transportation, Vol. 21, 1994, pp. 107–133.
11. Bhat, C., and R. Sardesai. On Examining the Impact of Stop-Making and
Travel Time Reliability on Commute Mode Choice: An Application to
Predict Commuter Rail Transit Mode for Austin, TX. Department of Civil
Engineering, University of Texas, Austin, 2004.
12. Cherchi, E., and J. de D. Ortúzar. On Fitting Mode Specific Constants
in the Presence of New Options in RP-SP Models. Transportation
Research Part A: Policy and Practice, Vol. 40, No. 1, 2006, pp. 1–18.
13. Daly, A. J., and C. Rohr. Forecasting Demand for New Travel Alternatives. In Theoretical Foundations for Travel Choice Modelling (T. Gärling,
T. Laitila, and K. Westin, eds.), Pergamon Press, Amsterdam, Netherlands,
1998.
14. Jara-Díaz, S. R., and M. Farah. Transport Demand and User’s Benefits
with Fixed Income: The Goods/Leisure Trade-off Revisited. Transportation Research Part B: Methodological, Vol. 21, 1987, pp. 165–170.
15. Train, K. E., and D. McFadden. The Goods/Leisure Tradeoff and Disaggregate Work Trip Mode Choice Models. Transportation Research,
Vol. 12, 1978, pp. 349–353.
16. Cherchi, E., and J. de D. Ortúzar. Income, Time Effect and Direct Preferences in a Multimodal Choice Context: Application of Mixed RP-SP Models with Non-linear Utilities. Network and Spatial Economics, Vol. 6, 2006,
pp. 7–23.
The Transportation Demand Forecasting Committee sponsored publication of
this paper.