Socially Optimal Subsidy for Entry: The Case of Medicare Payments

Socially Optimal Subsidy for Entry: The Case of
Medicare Payments to HMOs
Shiko Maruyama1
University of New South Wales
October 19, 2008
Abstract
Should governments subsidize entry to promote competition? The US Medicare has
increased its spending on private Medicare plans, based on the idea that they provide
consumers more alternatives and enhance the program’s e¢ ciency. In this paper, I develop an empirical framework with endogenous entry to evaluate welfare consequences
of such policy interventions with the data from 2003 and 2004. Counterfactual simulation reveals the following: (1) in Medicare, subsidizing entry can be justi…ed to enhance
national welfare (no excessive entry); (2) the level of payments in 2008, however, is far
beyond the optimal level; and (3) the geographic inconsistency between the payment
rate and the Medicare fee-for-service (FFS) cost is another source of ine¢ ciency. Aligning payments with FFS costs raises the national welfare gain of contracting with private
plans by an estimated 3 billion dollars, or 33 percent, compared to the simulated 2008
1
I am thankful to Leemore Dafny, David Dranove, Michael Mazzeo, Aviv Nevo, and Robert Porter for
guidance. Helpful comments from two anonymous referees, Raquel Bernal, Rosa Matzkin, and Hodaka
Morita are gratefully acknowledged. I thank the Centers for Medicare and Medicaid Services for kindly
providing the data …les and answering my inquiries. E-mail: [email protected].
1
benchmark, with lower Medicare spending on private plans. This signi…cant gain stems
from the restructuring of entry, not from lower payments.
1
Introduction
Should governments subsidize entry to promote competition? Advocates of market competition argue for more deregulation and liberalization of markets. On the other hand, few argue
that competition is so welfare-enhancing that governments should intervene further to bring
about more entry and intensify competition. The theoretical literature has con…rmed that,
from the social welfare point of view, whether additional entry has a positive or negative
impact on welfare is ambiguous, due to the trade-o¤ between the variety of products and
ine¢ cient replication of …xed costs.
The US Medicare program has struggled with this trade-o¤ for two decades. US Medicare
is the federal entitlement program that provides comprehensive health insurance coverage to
individuals aged 65 and older and certain disabled people (hereafter "Medicare eligibles"). In
addition to a government-administered fee-for-service plan (hereafter "traditional Medicare
plan"), Medicare features private health insurance plans, which are currently called Medicare
Advantage plans (formerly Medicare+Choice plans). The underlying idea is that private
plans give Medicare eligibles more alternatives and make the system more e¢ cient through
private plans’incentives and ‡exibility to take advantage of managed care techniques such
as utilization review and price negotiation with providers.
Private plans make entry-exit and premium decisions in each county every year, taking
into consideration the responses of both the demand side and competing …rms. Medicare
eligibles make their choice between the government plan and the private plan(s) available
in their county, taking into account each plan’s monthly premium, if any, and the insurance
coverage, e.g. whether it covers prescription drugs. The key government instrument here
2
is the rate of per-enrollee government payments to private plans, since payments can be
regarded as a price subsidy. Payments are the private plans’main source of revenue. Payment
rates vary across counties, but the same rate applies to every plan in a county. Thus,
the payment rate directly a¤ects the government spending and indirectly a¤ects consumer
surplus by altering private plans’coverage, premiums, and entry decision. Consequently, the
appropriateness of the payment rate has been an intense policy debate.
In this paper, I propose an empirical framework to evaluate the welfare consequences
of a policy intervention that a¤ects …rms’entry decisions, and to explore socially desirable
payment rates in Medicare, using national data of the Medicare market for the years 2003
and 2004. By allowing the …rms’entry-exit decision to be endogenous, the framework enables
evaluation of the welfare impacts of such policy interventions. To make policy simulations
even more relevant, the model incorporates product di¤erentiation and …rm heterogeneity.
This study advances the empirical literature of entry and provides unique, relevant policy
implications for the Medicare payment debate.
The literature on industrial organization theory …nds that free entry does not guarantee the socially optimal number of entrants. As discussed in Spence (1976) and Mankiw
and Whinston (1986), in the homogeneous product setting, free entry can lead to socially
excessive entry. Additional entry would expand consumer surplus with lower prices but, at
the same time, would reduce the market shares of other …rms ("business stealing") and lead
to an ine¢ cient replication of …xed costs. In di¤erentiated product markets, however, the
direction of bias is unclear, since entry brings more variety and consumers’appreciation of
variety causes a trade-o¤. In Medicare, for example, higher payments attract more entrants
and promote competition. New entrants enhance consumer surplus through lower prices,
improved product quality, and more variety. On the other hand, new entry lowers social welfare through duplicative set-up costs and "business stealing" from incumbents. In addition,
price subsidies cause market distortions. Therefore, whether the government can achieve
3
higher social welfare by encouraging entry is an empirical question.
Despite its policy importance, this empirical question has not been well explored. In an
attempt to answer this question, my empirical framework adopts three key features: endogenous entry, social welfare analysis, and fully incorporated product di¤erentiation and …rm
heterogeneity. As discussed in the next section, no previous work has combined these three
characteristics at once, mainly due to the computational complexity. A technical novelty
in my empirical framework is adoption of a recursive conditioning simulator in a maximum
simulated likelihood (MSL) estimator, which facilitates estimation of a very general class of
discrete choice sequential-move games. This estimator outperforms the methods standard in
this literature, such as the method of simulated moments, in several ways. The estimator’s
performance and ‡exibility help (1) obtain more e¢ cient estimates, (2) explicitly incorporate …rm heterogeneity into the framework, (3) estimate structural payo¤ functions that are
much more complicated than those used in the previous literature, and (4) analyze markets
with signi…cantly di¤erent structures in the same framework. As a result, I can exploit
detailed …rm/product level data and draw relevant policy implications by simulations with
endogenous entry, which is beyond the scope of previous studies.
The Medicare market is one of the most fruitful applications, because the products are
highly di¤erentiated and the payment policy is directly related to entry. The changes in
payment policy over the years re‡ect the di¢ culty of determining the socially optimal level
of subsidy. Before 1997, the base payment rate in each area was set at 95 percent of the
average fee-for-service (FFS) costs of the government plan. As a natural consequence, highcost markets, which are typically large, urban markets, attracted more entrants. After 1997,
Medicare started shifting payment rates toward uniform national rates, by increasing rates
in low-cost counties and decreasing them in high-cost counties in order to equalize potential
pro…ts of entrants and the level of entry across counties. This led to a signi…cant reduction
in the average payment and massive exodus of private plans around 2000. Since then,
4
Medicare has gradually implemented several adjustments to boost payments, such as "‡oor"
rates. MedPAC (2008) reports that average payments to private plans are 13% higher than
the FFS costs in 2008 and that payments amounted to $75 billion in 2006, representing 20%
of Medicare spending. To remove the inconsistency between the payment rate and the FFS
cost and to contain the growing expenditure, the government currently plans to set the base
payment rates equal to the FFS costs by 2012 (…nancially neutral payment rates).2
I use data from 2003 and 2004. Based on the estimated parameters, I perform counterfactual simulations to draw policy implications for the 2008 payment rates. Since this
approach does not utilize the information from recent years and incorporates neither the recent institutional changes nor the heathcare cost increases, the results need to be interpreted
with caution. On the other hand, this approach has several advantages. First, the system
was institutionally simpler in 2003 and 2004. The legislation in 2006 introduced Medicare
Part D and changed the way payment rates were determined.3 In addition, various new
types of private plans became popular, such as private fee-for-service plans. By avoiding
these complications, my approach helps develop an appropriate framework relatively easily
and concentrate on the essential questions on entry and payments. In addition, my sample
years are relatively stable because they occur long after the huge exodus of private plans
and su¢ ciently before the implementation of the 2006 legislation. Since I use a static entry
game framework, the stable environment is desirable for providing reliable estimates. Along
the same lines, I concentrate this research on Health Maintenance Organizations (HMOs),
since in 2003 and 2004, the vast majority of the private plans were HMOs.
My empirical strategy is as follows. All estimations are at the plan-county-year level.
I …rst estimate a discrete choice demand model of the Medicare health insurance market.
2
MedPAC (2008) provides detailed information and discussion about the payment rate.
The legislation is the Medicare Prescription Drug Improvement and Modernization Act. Currently "plan
bids" partially determine the payment rates, though the previous payment rates are used as "benchmarks"
and still largely determine current payment rates.
3
5
The consumer surplus is calculated using these results. Since private plans are allowed
to charge monthly premiums, I use premiums as a price. Second, by using the estimated
demand parameters and assumptions on …rms’price-setting behavior, I recover the health
plans’ marginal costs. Third, the pool of potentially entering HMOs is created. This is
necessary because the entry game is played not only by the observed …rms but also by …rms
that choose not to enter and so do not appear in the data. I then estimate HMO pro…t
functions by using a sequential-move entry game, which guarantees a unique pure-strategy
equilibrium. The observed market structure and inclusion of the demand model into the
framework allow me to identify the level of …xed and variable pro…ts, despite the lack of
cost information. Finally, I perform counterfactual simulations, where all the consumers
and HMOs re-optimize their decisions on plan choice, price, and entry-exit in response to
an exogenous change, which generates a new market equilibrium. I follow the approach
that is customary in the empirical entry literature and employ no scope of dynamics, since
my dataset has only a few time points and fully incorporating …rm heterogeneity remains a
considerable challenge in dynamic modeling.
The results are as follows. First, the national welfare gain of having private health
insurance plans in Medicare was approximately 10 billion dollars per year in 2003 and 2004,
which is consistent with the study by Town and Liu (2003). Second, the Medicare HMO
market exhibits no evidence of socially excessive entry. Consumers appreciate increase in
variety of products so much that entry subsidy beyond the …nancially neutral level can be
a justi…able option for greater national welfare. Third, the positive e¤ect of such subsidy,
however, is small, and the 2008 level of payments is far beyond the optimal level. Fourth, the
geographic inconsistency between the payment rate and the FFS cost is another source of
ine¢ ciency. Aligning the payment with the FFS cost raises national welfare by an estimated
3 billion dollars, or 33 percent more than the simulated 2008 benchmark case. This gain
occurs together with a healthier government budget and stems from the restructuring of
6
entry, not from the adjustment of the average of payments. These results illustrates how
extensively the current payment distorts the incentives for the Medicare private plans and
harms social welfare.
2
Empirical Entry Literature
The study I follow most closely is Berry and Waldfogel (1999), one of the few to apply the
empirical entry literature to welfare analysis. They model the broadcasting industry as a
homogeneous product market4 and quantify social ine¢ ciency from free entry. Dutta (2005)
extends their work to the case of di¤erentiated products. Her study of the pharmaceutical
industry in India performs counterfactual simulations of entry and …nds no evidence of
excessive entry. Her empirical framework is similar to mine, except that her estimation
relies heavily on the assumption of certain …rm homogeneity. I extend these studies by
incorporating full …rm heterogeneity.
Firm heterogeneity has not been fully incorporated into the empirical entry literature
for several reasons. The di¢ culty of dealing with multiple equilibria is one of the largest issues. To guarantee certain equilibrium properties, researchers have speci…ed the econometric
setting in a very restrictive way, such as homogeneous product markets, symmetric games,
simple reduced-form pro…t functions, and an in…nite pool of homogeneous potential entrants.
The computational burden is another di¢ culty, since the introduction of …rm heterogeneity
requires the estimation of an asymmetric game.
Various approaches have been proposed to address these di¢ culties. Berry (1992) avoids
this problem by introducing …rm heterogeneity into the …xed cost component, thereby keeping his game symmetric. Berry and Waldfogel (1999) and Town and Liu (2003) follow the
4
In their framework, while listeners’ demand for broadcasting is modeled as a discrete choice demand,
the industry’s primary output –advertisement –is treated as a homogeneous product.
7
same approach. Dutta (2005) incorporates …rm heterogeneity by categorizing …rms into three
types and, within each type, …rms are symmetric. Mazzeo (2002) models …rm heterogeneity
through endogenous choices of location and product types. Seim (2001, 2004) proposes the
use of an incomplete information setting to alleviate the computational burden. Ciliberto
and Tamer (2004) pursue a more ‡exible model of …rm heterogeneity and identities, by not
making point identifying assumptions on equilibrium selection. All of these studies, however, still use a reduced-form pro…t function and/or certain symmetry assumptions in entry
games, which is a restriction, especially when a researcher’s goal is a welfare simulation.
Berry (1992) and Mazzeo (2002) employ the idea that the use of a sequential-move game can
avoid the multiple equilibrium issue by the uniqueness of the subgame perfect equilibrium.
Schmidt-Dengler (2006) uses this approach to analyze the timing of MRI adoption of US
hospitals. His framework incorporates …rm heterogeneity, but its tractability relies on the
simplicity in his game setting, such as the irreversibility of adoption and the focus on only
small oligopolistic markets. I introduce a new maximum simulated likelihood estimator,
which allows me to fully incorporate …rm and market heterogeneity in a more general entry
game setting and to perform nationwide welfare simulations.
Another key feature of my model is the structural combination of the demand and supply
sides, which o¤ers a signi…cant advantage: counterfactual simulations by extrapolation. It
enables analysis of how a policy change a¤ects entry and exit, market competition and, in
turn, social welfare. I am not the …rst to embed demand into an entry model. Reiss and
Spiller (1989) estimate the demand and supply of airline services simultaneously, but only
for carefully selected, small homogeneous product markets. Berry and Waldfogel (1999)
explicitly model pro…t functions with price and quantity, which allow …rms to have di¤erent
…xed costs, but their focus is on homogeneous product markets. Another study along these
lines is Town and Liu (2003). They quantify the welfare impact of Medicare HMOs by
combining structural models of demand and entry. Though they model the market as a
8
di¤erentiated product market, their entry game assumes in…nitely many plans with the
same …xed costs and does not fully incorporate …rm heterogeneity. By adding more supply
side structure to their framework, my model can evaluate each plan’s …xed costs, estimate
the potential social ine¢ ciency due to excessive or insu¢ cient entry and perform various
counterfactual simulations.
3
Data
Medicare is administered by the Centers for Medicare and Medicaid Services (hereafter
CMS). The main body of the dataset comes from the CMS. I use data from 2003 and 2004.5
The three main data sources are the following: (1) the Enrollment data at the county-plan
level, (2) the Monthly Report data, and (3) the Plan Bene…t Package data (PBP data).6
3.1
The Unit of Observation
The unit of observation is a plan-county-year. In the CMS data …les, there is a clear distinction between organization, plan (or contract, as the CMS sometimes calls it), and product (or
plan, as the CMS sometimes calls it). The organization is the governing body: for example
Paci…care, Aetna, and Humana. In the Medicare program, each organization may enter into
one or more plans (or contracts). Each of these plans is assigned a unique contract number by the CMS. In addition, each plan can o¤er more than one product, i.e. a particular
bene…t package with unique bene…ts, copayments, premiums, and counties covered. I focus
this study on the plan level analysis, because many important variables — such as county
enrollment — are reported at the plan level.7 I aggregate product level information to the
5
Information from 2002 is also used to identify incumbents in the entry estimation.
The following description of institutional backgrounds is based on the Medicare program in 2003 and
2004. For the detailed description of the data, see Maruyama (2007).
7
The plan level analysis is also e¤ective at reconciling the demand and supply side models. While HMOs
are likely to maximize their pro…ts at the plan or organization level, for the demand side, the product level
6
9
Table 1: The Number of HMO Plans and their Market Share in 2004
Number of plans Enrollment Share
HMO (Health Maintenance Organizations) 145
4,684,304
85%
PPO (Preferred Provider Organizations)
41
119,110
2%
Other Medicare+Choice types
70
266,247
5%
Other prepaid plan types
44
428,833
8%
Total prepaid plans
300
5,498,494 100%
Entire Medicare eligibles
42,992,077
Source: Medicare Managed Care Contract Report and the Enrollment data
plan level in each county by choosing the product with the largest enrollment as the representative product. For ambiguous cases, I average the variables across products, but such
cases are rare.
3.2
The Sample Population
Under the Medicare program, several types of private health plans are eligible to participate,
but HMOs are dominant players. An HMO forms its network of health providers to enhance
its purchasing power and to restrict and manage the ‡ow of its patients. A Medicare HMO
receives a monthly payment for each enrollee from the CMS and, in return, is responsible
for providing all covered services and takes full …nancial responsibility for the actual costs
generated. Due to these patient management devices and …nancial incentives, Medicare
HMOs are assumed to be more e¢ cient than the government plan.
Table 1 shows that HMOs’ enrollment share in private health plans was 85% in 2004.
While I use most plan types in the demand estimation, I focus my supply side model on
HMOs because the shares of non-HMO plans are small and because their pro…t structure may
di¤er from that of HMOs. I estimate pro…ts only for HMOs and the presence of non-HMO
types is treated as exogenous throughout this research.
analysis may be more reasonable because consumers are more likely to choose a product than a plan or an
organization.
10
I choose the observations used in this research as follows.
Omitted Plan Types With various types of private Medicare plans, I exclude the plan
types regarded as not open to regular Medicare bene…ciaries, such as plans in the Program of
All-Inclusive Care for the Elderly (PACE) and long-term care Demonstration plans. Ignoring
these plans excludes only 3% of the entire private plan enrollment.
Choosing Operating Plans
Next, at the plan-county-year level, I select operating plans.
In the CMS Enrollment data …les, individual members are assigned to counties according
to the reported residence. This implies that many counties contain an unrealistically high
number of health plans with very low enrollment. I include a plan-county observation in the
dataset as a valid observation if it satis…es both of the following: (1) it is in a plan’s o¢ cial
service counties under its contract to the government, and (2) it has at least 50 members.8
A very small number of observations with missing values are also excluded. Then, I recalculate the market size by subtracting the enrollment in the excluded observations from the
original number of Medicare eligibles. This completes the dataset for the demand estimation.
Creating Potential Entrants In my supply side model, all the potential operating plans
play a one-shot entry game each year. The pool of potential entrants consists of the operating
HMOs observed in the data and hypothetical potential entrants that I add to the dataset.
I limit my entry model to entrants that are operating in other areas in the same state
or the same metropolitan statistical area (MSA).9 This means that I exclude: (a) entrants
from outside the state or MSA and (b) brand-new entrants from the commercial sector.
8
I set this threshold higher than the previous studies to make the data stabler. Town and Liu (2003),
for example, de…ne the HMOs operating in a county as the HMOs with at least 12 members if the county
has fewer than 5,000 eligibles, and at least 35 members otherwise. Lowering my threshold to 35 does not
signi…cantly a¤ect the estimates, although it makes the standard errors slightly larger and the estimation
procedure less stable and more time-consuming.
9
Multi-state MSAs exist. Entry from outside the state but in the same MSA is considered in the analysis.
11
The potential bias due to this exclusion, however, seems to be minimal to a …rst order
approximation. First, no entry from outside the state or MSA is observed in my sample
periods. Second, although each Medicare HMO plan has its main entity in the commercial
sector, only a few HMOs newly started Medicare business during my sample periods. In
2004, for example, only …ve plans were new in Medicare.
The pool of potential entrants consists of the following six types: (1) plans that actually
operate in the county in the year, (2) plans that operate in the same MSA in the year, (3)
plans that operate in the same state in the year, (4) plans that operate in the county in
the previous year, (5) plans that operate in the same MSA in the previous year, and (6)
plans that operate in the same state in the previous year. Thus, in the supply estimation,
observations that satisfy one of (2) to (6) but not (1) are created and included as hypothetical
entrants.10 As a result, the supply dataset has many counties with one or more hypothetical
entrants but no observed entrants. These counties are retained in the sample in order to
avoid unwanted selection bias and make use of the information value from the fact that no
entry occurs. The plan level characteristics of the newly created potential entrants are the
same as the original plan. For the product level characteristics, I pick the product with the
largest enrollment under the plan as the representative product.11
3.3
Variables, Sample Size, and Summary Statistics
The PBP dataset contains a signi…cant amount of detailed information about the additional
bene…ts that a product o¤ers. In Maruyama (2007) I used factor analysis to summarize this
10
The concept of hypothetical potential entrants di¤ers from that of incumbency. A hypothetical potential
entrant that operates in the previous year but not the current year plays the entry game as an incumbent.
11
Another approach is to construct predicted characteristics of hypothetical entrants by running a reducedform regression. This approach can take into account the possibility that a plan adjusts its characteristics
to suit the markets it enters, although it ignores the possibility that product type choice depends on the
decision of other …rms. I test this approach as a robustness check by estimating a simple probit model that
predicts the drug bene…t availability based on market and plan types. This generates slight di¤erences,
especially in the estimates of producer surplus, but all in all the results are not signi…cantly a¤ected.
12
Table 2: The Sample Size
Number
Number
Number
Number
Number
of
of
of
of
of
Number of
Number of
Number of
2003
Demand dataset
plans
196
plans (HMO)
133
counties
829
plan-counties
1,478
plan-counties (HMO)
997
Entry estimation dataset
HMO plans
133
counties
2,574
plan-counties
9,617
2004
Total
207
138
938
1,811
1,094
207
138
948
3,289
2,091
138
138
2,501 2,630
9,927 19,544
information. Based on the results, I aggregate relevant bene…t variables into six composite
bene…t variables: drug, education, physicals, peripheral1, peripheral2, and screenings. The
payment rate data in the original CMS datasets is the standardized, payment base in each
county. The actual payment for a particular enrollee is determined by certain formulae,
which take demographic and risk factors into consideration. In this study, I use the average
of actual payments to private plans in the county calculated by using demographic and risk
factor information, rather than using the payment base.
Table 2 shows the sample size for the demand and supply estimation. Table 3 shows descriptive statistics of selected variables. The minimum of payment rates is negative because,
since 2003, the premium can be "negative" in the form of a premium rebate. A comparison
of the average and standard deviation of monthly premiums and payment rates indicates
that payments from the government are the primary source of revenue, though private plans
adopt various strategies in setting their premiums.
Table 4 shows the distribution of observations and the average market size, payment
rates, and monthly premiums by the number of observed entrants in a county. The majority
of observations are from concentrated markets. The table suggests that counties with larger
demands and/or higher payment rates attract more private plans. In addition, the more
13
Table 3: Descriptive Statistics of Selected Variables
Average
Plan enrollees
Plan market share
Monthly premium
Payment rate
bene…t: drug
3,060
0.063
51.3
549.4
0.61
(2003) (2004)
3,339 2,833
0.066 0.061
58.9
45.0
530.7 564.7
0.55
0.65
Std.dev.
Min
Max
7,715
0.070
47.5
91.1
0.48
50
0.0002
62.6
296.7
0
142,050
0.447
189.0
1067.5
1
The number of observations: 3289 plan-county-years. Monthly premium and payment rate are in dollars.
competitive a county is, the lower premiums they charge. Payment rates positively correlate
with FFS costs, but with smaller variance, due to the government payment policy described
above.
Table 4: Sample Distribution, Average Market Size, Average Premiums, and Average Payment Rates by the Number of Plans in Each County: 2004
Number
Number of
Mean:
County mean:
plan-counties:
premium eligibles Std FFS per base payof plans in
a county demand data supply data
capita cost ment rates
0
5,386
1
497
1,860
51.6
15,215
543.7
585.3
2
456
1,167
45.8
35,606
561.6
600.7
3
339
635
44.0
50,835
615.6
644.2
4
184
367
54.1
59,641
629.3
649.2
5
130
180
51.6
65,552
611.5
645.9
6
84
146
30.0 130,593
732.5
694.9
7
21
40
14.6 234,969
806.0
785.5
8
24
41
3.2 157,267
686.7
828.0
9
36
56
4.8 261,881
903.0
812.5
10
40
49
8.1 513,123
926.2
780.1
Total
1,811
9,927
45.0
40,535
570.7
605.8
Non-HMO plans included. Mean values are simple averages with no weights.
Table 5 explains the variables used in this study. The names and de…nitions of the
variables are given in the …rst two columns, and the next three columns indicate whether a
given variable is used to estimate demand, supply, or marginal cost. Summary statistics are
14
provided in the following columns.
4
Econometric Speci…cations
4.1
The Demand
The demand model follows the approach of Berry (1994) and Town and Liu (2003); it uses
market share data with the nested Logit model. A Medicare bene…ciary chooses a Medicare
plan every year by comparing the utility from each plan available in the bene…ciary’s county
and picking the plan with the greatest utility. Utility is derived from health plan characteristics such as premiums and bene…ts. The notation is as follows:
M : Year-markets (year-counties), m = 1; :::; M
J : Private Medicare plans, j = 1; :::; Jm
If j = 0, it means the traditional Medicare plan (the outside option).
Im : Medicare bene…ciaries, i = 1; : : : ; Im :
Bene…ciary i’s utility from plan j in year-market m is denoted as
0
uijm = wjm
(1)
jm
pjm +
+
jm
j
+
M SA
+
jm
+ "ijm
+ "ijm ;
where wjm is a vector of observed plan characteristics, pjm is the premium, "ijm is a nested
Logit error, and
j
+
M SA
+
jm
is a scalar contribution of the plan characteristics and
demand shocks unobserved by the econometrician.
jm
is the mean utility. Without loss of
generality, the error term is normalized as E["ijm ] = 0. Thus a bene…ciary’s expected utility,
E[uim ], increases with better bene…ts, lower premiums, and a greater number of choices. To
alleviate the endogeneity problem discussed below, I apply …xed e¤ects to plans and selected
15
Table 5: Definitions of Variables, Estimations Used (Demand, Supply, and Marginal Cost), and Summary Statistics
Variable Name
Definition
D S MC Obs
Mean Std. dev. Min
Max
(i) Plan or plan-year level variables (the numbers of plan and plan-year observations: 207 and 403)
no experience
experience year
nonprofit
national chain
group model
staff model
IPA model
entire enrollees
DEMO plan
cost plan
cost missing
PPO plan
PFFS plan
PSO plan
POS
=1 if a plan has no experience in Medicare before
Years in business since the first Medicare enrollee
Dummy variable: not-for-profit organizations
Dummy variable: national chains
=1 if the plan is a group model HMO (The omitted category is ”network,
mixed, and other models”.)
=1 if the plan is a staff model HMO
=1 if the plan is an IPA model HMO
Total enrollees of the plan over its entire service areas (in 1,000)
Dummy variable: DEMO plans
Dummy variable: cost contract HMOs
Dummy variable: cost contract HMOs with missing data
Dummy variable: PPO plans
Dummy variable: PFFS plans
Dummy variable: PSO plans
Dummy variable: HMOPOS
X X X
X X X
X X
X X
X X
403
403
207
207
207
0.11
8.18
0.34
0.29
0.47
0.31
6.44
0.47
0.46
0.50
0
0
0
0
0
1
27
1
1
1
X X
X X
X
X
X
X
X
X
X
X
207
207
403
207
207
207
207
207
207
207
0.07
0.43
25,313
0.15
0.13
0.05
0.17
0.02
0.01
0.03
0.25
0.50
57,033
0.36
0.33
0.22
0.38
0.15
0.12
0.17
0
0
52
0
0
0
0
0
0
0
1
1
648,281
1
1
1
1
1
1
1
(ii) Plan-county level variables (the number of plan-county-year observations: 3,289)
s jm
ln s j | private, m
monthly premium
# product in plan
benefit: drug
benefit: education
benefit: physicals
benefit: peripheral1
benefit: peripheral2
benefit: screenings
avg # competitor
competitor: npo
competitor: chain
competitor: IPA
competitor: group
competitor: staff
copayments
Plan j ’s market share in market m
”Within-group share” in log
Monthly premium in $ (Medicare premium not included)
The number of products the plan offers in the market
Dummy variable: supplemental benefits in outpatient prescription drugs
Composite score: supplemental benefits in health education/welness
Composite score: supplemental benefits in routine physicals
Composite score: supplemental benefits in preventive and comprehensive
dental, chiropractic, and acupuncture
Composite score: supplemental benefits in eye exams, eye wear, hearing
exams, and hearing aids
Composite score: supplemental benefits in screenings (mammography,
colorectal, prostate cancer, pap smears, and pelvic exams)
Average number of competitors in the other markets served by the plan
The number of competing NPO plans in the market
The number of competing national-chain plans in the market
The number of competing IPA model plans
The number of competing group model plans
The number of competing staff model plans
Out-of-pocket costs (averaged over demographic groups, net of premiums)
X
X
X
X
X
X
X
X
3,289
3,289
3,289
3,289
3,289
3,289
3,289
3,289
0.06
-1.21
51.3
2.16
0.61
0.36
0.51
0.10
0.07
1.44
47.5
1.28
0.48
0.27
0.21
0.18
0.00
-7.55
-62.6
1
0
0
0
0
0.45
0.00
189.0
10
1
0.9
1
1
X
3,289
0.48
0.29
0
1
X
3,289
0.20
0.26
0
1
IV
IV
IV
IV
IV
IV
3,289
3,289
3,289
3,289
3,289
3,289
X 3,289
1.75
0.70
0.71
0.97
0.74
0.11
335.4
1.69
1.13
1.08
1.30
0.99
0.34
85.9
0
0
0
0
0
0
0
9
5
5
6
4
2
589.9
0.88
527.5
519.7
552.8
0.27
3.18
0.33
0.22
32,939
0.16
0.55
0.19
0.28
0.25
0.14
0.04
0.53
0.95
0.11
80.5
118.8
131.0
0.36
4.58
0.19
0.17
64,364
0.36
0.50
0.39
0.45
0.44
0.35
0.19
0.50
0.04
X 1,767
X 1,767
0.93
0.86
0.09
0.12
0.73
0.52
1.66
1.53
X 1,767
X 1,767
137.4
1.06
38.4
2.06
70.1
0.00
231.8
51.97
X 1,767
67,939 157,012
(iii) County or county-year level variables (the numbers of county and county-year observations: 948 and 1,767)
s 0m
payment rate
FFS per capita cost
Std FFS per capita cost
per capita # hospital
per capita # hospital bed
per capita # medical doctor
HMO penetration rate 98
eligibles
county: no hospital
MSA
county: small
county: medium
county: large
county: extra large
county: huge
year: 2004
demo factor
demo factor HMO
risk factor HMO
medigap premium
hospital expenditure
inpatient days
The market share of the traditional Medicare plan
CMS monthly payment rate per enrollee (case-mix adjusted for each plan)
Average fee-for-service monthly cost per enrollee
FFS per capita cost standardized by demographic factor
The number of hospitals per 10,000 residents in 2001
The number of hospital beds per 1,000 residents in 2001
The number of general practice M.D.s per 1,000 residents in 2001
Estimated (HMO enrollment / total population) in 1998
Medicare eligibles (in 1,000)
Dummy variable for counties with no hospital
Dummy variable for counties in an MSA
County dummy: 3,000 < number of eligibles < 6,000
County dummy: 6,000 < number of eligibles < 15,000
County dummy: 15,000 < number of eligibles < 50,000
County dummy: 50,000 < number of eligibles < 150,000
County dummy: 150,000 < number of eligibles
Year dummy variable for 2004
Demographic factors (e.g. age, sex, institutional status, and Medicaid
status) to adjust the payment rate (the higher, the more costly)
Demographic factors for HMO enrollees
Risk factors for HMO enrollee (e.g. disease group, community factors, and
institutional factors) to adjust the payment rate
Monthly Medigap premium in $
Total reported facility expenditures (short term general hospitals) (not
only Medicare) per 1,000 residents in $1,000 in the county
Total Medicare inpatient days (short term hospitals) in 1,000 days
X
X
X X
IV X X
IV X X
IV X X
X
X
X
IV X X
X
X
X
X
X
X
X X X
X
1,767
1,767
1,767
1,767
948
948
948
948
1,767
1,767
948
1,767
1,767
1,767
1,767
1,767
1,767
1,767
0.50
1.00
296.7 1,067.5
214.9 1,217.3
203.6 1,262.2
0.00
5.12
0.00
87.50
0.00
1.54
0.00
0.96
370 1,046,829
0
1
0
1
0
1
0
1
0
1
0
1
0
1
0
1
0.82
1.11
0 2,351,152
”IV” means the variable used in the estimation as an instrument. For the summary statistics, Non-HMO observations are included. The ”hypothetical potenti
entrants” used in the supply estimation are not included. The six benefit variables from ”benefit: drug” to ”benefit: screenings” are the quality composite
variables made by aggregating the detailed characteristics information in the PBP data set, based on the results of the factor analysis (Maruyama (2007)).
15
MSAs, which are represented by
j
+
12
M SA .
Thus,
jm
is the …rst-di¤erenced demand
shock. I normalize the utility from the traditional Medicare plan as: ui0m = "i0m . This
normalization does not a¤ect the welfare calculation, because consumer surplus in my study
measures the utility gain from enrolling in private Medicare plans relative to traditional
Medicare.
Following previous studies, I assume a nested Logit error. The group structure is the
private plan group against the traditional Medicare plan. Thus, my assumption allows
substitution among private plans to di¤er from substitution between private plans and the
traditional Medicare plan. The market share of plan j is derived as:
sjm = sjjprivate;m sprivatejm
exp 1 1 jm
h
i;
=
(1 )
Dm 1 + Dm
(2)
where
represents the similarity within the private plan group, and
Dm
X
exp
kjprivate;m
1
1
km
The correlation within a group is possible through . As
correlation becomes one and, when
:
goes to one, the within-group
= 0, the model is reduced to the multinomial Logit
model. When the market share data is used, this model can be estimated using the following
linear form (see Berry 1994):
(3)
ln sjm
0
ln s0m = wjm
pjm +
j
12
+
M SA
+ ln sjjprivate;m +
jm
County …xed e¤ect dummies are not used because many counties appear in the dataset with only one
observation. By the same token, I apply MSA …xed e¤ects only to MSAs for which at least 10 observations
exist in the dataset.
17
for j = 1; :::; Jm : For the estimation, I use the instrumental variable (IV) method to deal
with the potentially endogenous variables in expression (3): the monthly premium, pjm , and
the within-group share, ln(sjjprivate;m ). These two variables may be correlated with the plan
demand shock,
jm .
Since I use the …xed-e¤ect approach, the identi…cation of parameters
comes from within-plan and within-MSA changes in the instruments.
I construct instruments using three strategies. The …rst set of instruments comprises
a county dummy variable for counties with no hospital and the per-resident numbers of
hospitals, hospital beds, and general practice medical doctors. These variables are valid
instruments because they a¤ect the plans’ relative bargaining power with providers, their
cost structure, and the number of competitors. Second, I use the characteristics of competing
plans in a county. This approach is traditional in the literature on product-di¤erentiated
market demand (e.g. Bresnahan 1987 and Berry 1994).13 This is valid if competitors’entryexit decisions and changes in product characteristics are uncorrelated with changes in
jm .
Speci…cally, I choose not-for-pro…t ownership, chain a¢ liation, and HMO network types
(IPA, Group, and Sta¤ models in Table 5), which are expected to be relatively …xed and
predetermined. I use the sums of these indicator variables of rival plans as instruments.
Third, I use the average number of competitors in the other markets in which the plan
operates. This use of the panel structure of data is a strategy similar to that of Hausman
(1997) and Nevo (2001). Private Medicare plans typically set premiums for each product
rather than for individual counties. Thus, the plan premium in a county is likely to correlate
with the competitive environment in the plan’s other service counties.
jm
is speci…c to
the market, so it is likely to be uncorrelated with this instrument.14 In the end, I have 10
13
Dafny and Dranove (2005) also use this approach in the demand estimation of Medicare HMOs.
The argument here does not hold if each plan chooses its premium at the county level, as I assume below.
However, due to set-up costs of products, the reality is in between. Without set-up costs, a plan could set
up a product in each county so that it could charge an optimal price in each county. In reality, however, a
product usually covers several markets. A plan carefully chooses its products’service area boundaries, which
allows a plan to charge roughly optimal prices with lower set-up costs. On this basis my instrument is valid,
14
18
instruments for two endogenous variables. The …rst stage regression is reported in Appendix
A. For both pjm and ln(sjjprivate;m ), the F -statistics reject the joint hypothesis that the
coe¢ cients on the instruments are all zero.
4.2
4.2.1
HMO’s Behavioral Model
The Game Speci…cation
pot
potentially operating HMO plans in each market. Each county can accomThere are Jm
modate zero, one, or more than one HMO plan. At the beginning of each year, all the
potential entrants in each market play the following game. The game consists of two parts, a
sequential-move entry game and a simultaneous-move price-setting game. Only HMOs that
choose "enter" play the price setting game. If a plan chooses "not enter", it receives zero
pro…t from the county. The plan characteristics are exogenously …xed in this study.
I employ a complete information sequential-move entry game. This guarantees the existence of a unique subgame perfect Nash equilibrium, even for large asymmetric games.15
In the sequential game, I assume the following decision order. Among all the potential entrants in a market, the incumbents move …rst. Incumbents make decisions in the order of
the following incumbency classes: (1) presence in the county during the previous year, (2)
presence in the same MSA during the previous year, and (3) presence in the same state
during the previous year. Within the same incumbency class, HMOs make decisions based
on the size of their entire enrollment. This assumption is more likely if larger incumbents
must announce their changes in service area earlier. I also perform a robustness test of this
while the county-level pro…t-maximization assumption still holds as an approximation. Exclusion of this IV
does not change the results signi…cantly.
15
There is no guarantee that this assumption will be realistic, but a simultaneous game is no less a priori
than a sequential game. The order of decisions is studied by Berry (1992), Mazzeo (2002), Einav (2003), and
Schmidt-Dengler (2006). In general, these papers suggest that di¤erences resulting from di¤erent decision
orders or di¤erent game settings are small.
19
order assumption by randomizing the order. As reported in the result section, this does not
signi…cantly a¤ect the results.
The game is assumed to be a complete information game, because private (or incomplete)
information models with publicly observable heterogeneity are relatively di¢ cult to estimate.
The number of potential entrants, plan characteristics, random shocks in pro…ts, and other
information are common knowledge for all players. As the only exception, I assume that
no player observes the random shock in the demand,
jm ,
when players make decisions.
This technical assumption is required to keep the estimation consistent and well-behaved.
Since the value of
jm
can only be calculated as the residual term in the demand esti-
mation, it cannot be obtained for hypothetical entrants whose price and quantity are not
observed in the dataset. However, to calculate more precise national estimates of welfare
components, this residual term should be included in the calculation. Thus I need the assumption that allows me to incorporate
game. By de…nition,
jm
jm
in the welfare calculation but not in the entry
is mean-zero and independent of the decision of plans, so the
assumption that this is unknown ex-ante does not a¤ect the desirable equilibrium properties
of the sequential-move game as long as plans are risk-neutral. At the same time, I believe
that this simplifying assumption is reasonably realistic. First, though there certainly exists
information on quality and characteristics that is known to plans but not to econometricians,
most of such information is likely to be captured by the plan and MSA …xed e¤ects. Second,
the residual from the demand estimation consists of not only variation in characteristics
but also demand shocks at the plan-market level, which are often di¢ cult even for plans to
perfectly predict when they make entry and price decisions. As shown later, my demand
model explains approximately 80% of market share variation. If the accuracy of the plan’s
demand predictions is somewhat similar, this assumption of
jm
is more or less realistic.
I use the concept of the subgame perfect pure strategy Nash equilibrium (SPNE). An
SPNE in a county is obtained when: (1) all entering …rms are pro…table with their optimized
20
prices, and (2) all …rms that do not enter expect non-positive pro…ts from entry. Each HMO’s
entry-exit strategy in county m is represented by a binary variable, yjm , which takes the value
"0" if plan j does not enter and "1" if it enters. A market con…guration is denoted as ym ,
which is a vector that stacks (y1m ; :::; yJ m m ). The equilibrium solution, ym , can be calculated
by the backward induction algorithm.
4.2.2
The Pro…t Function
I assume that each …rm maximizes pro…ts independently at the plan-county level, by assuming the pro…t to be additively separable across counties, and across plans if an organization
has multiple plans. Relaxing this assumption to incorporate the economy of scale or strategic
cooperation within a county is beyond the scope of this paper, since it requires substantial
modi…cation of the estimation algorithm. I assume the following local pro…t function for
plan j in county m:
(4)
jm (x m ; zjm ; "jm ; ym ;
) = [pjm (x m ; ym ) + Payment Rate m
M Cjm ]
qjm (p m (x m ; ym ); x m ; ym ) + zjm + "jm
V Pjm (x m ; ym ) + zjm + "jm ,
where the expression in square brackets is marginal pro…ts and qjm () is the demand. p
is (p1m ; :::; pJ m m )0 , x
m
m
is (x1m ; :::; xJ m m )0 , xjm is the set of predetermined observables and
…xed e¤ects in the demand function, namely (wjm ;
j ; M SA ),
and Payment Rate means
the expected government payment rate per enrollee. zjm is the vector of the independent
variables that explain the …xed part of the pro…ts,
is a parameter vector to be estimated,
and "jm is an idiosyncratic shock to plan j in market m that is observed by all the …rms
ex-ante, but unobserved by the econometrician. V Pjm () denotes variable pro…ts. HMOs are
assumed to incur …xed costs at the county level, which is the sum of the last two terms. For
21
the independent variables, zjm , I choose market-speci…c variables and relatively exogenous or
predetermined plan-speci…c variables, such as non-pro…t status, chain a¢ liation, and years
of experience. Since I regard this …xed cost part as a reduced form, I exclude variables that
are likely to correlate with "jm , such as bene…t coverage.
Before estimating the entry model, I estimate marginal costs. The estimation of marginal
costs for the …rms that actually enter relies on the …rst order condition of the price-setting
game. The …rst order condition in market m can be written as
(5)
sjm (p m ; x m ; ym ) + (pjm + Payment Rate m
M Cjm )
@sjm
= 0;
@pjm
enter
enter
for j = 1; :::; Jm
, where Jm
is the number of actual entrants in county m. This equation
can be solved for M Cjm by writing16
@sjm
@pjm
(6) pjm +Payment Rate m M Cjm =
1
sjm =
1
1
sjjprivate;m
(1
)sjm
:
To calculate M Cjm using (6), I use …tted shares for sjm and sjjprivate;m , instead of observed
shares, under the assumption that
jm
is unknown to plans when they make decisions.17
For hypothetical entrants, the lack of observed pjm and sjm requires M Cjm to be extrapolated. I obtain M Cjm by a reduced-form linear regression, in which I choose independent
variables that are likely to be exogenous or predetermined, such as non-pro…t status, chain
a¢ liation, and market characteristics variables, as listed in Table 5. The result is reported
in Appendix A.
16
While plans have been allowed to charge "negative" premiums in the form of Medicare premium rebates
since 2003, only 0.5% of the plan-county observations charge negative premiums in my data and 30.0% charge
zero premiums. The mass point at zero premiums may re‡ect the administrative costs of charging non-zero
premiums. As long as this censoring occurs symmetrically around zero, the bias is considered to be modest,
if any.
17
Including
jm changes the mean of M Cjm by only 0:2%, so the potential bias of this behavioral
assumption is negligible.
22
4.3
The Estimation Algorithm
I assume the components unobserved to the econometrician to correlate across plans within
each market and to exhibit heteroskedasticity based on the market size, Im . Speci…cally,
(7)
"jm = Im
where !; ; and
!
jm
are parameters to be estimated.
+
jm
m
and
;
m
are distributed i.i.d. standard
normal across plans and markets and assumed to be independent of z m . The correlation of
the unobservable "jm across plans in a given market is then Im
.18 I use Im to control the
variance of the error, since the market size varies signi…cantly.
I employ maximum likelihood estimation. I denote the observed market con…guration as
o
, and the maximum likelihood problem can be written as
ym
bM L = arg max
(8)
where
(
)
M
1 X
o
ln Pr [ym
= ym (x m ; z m ; )] ,
M m=1
is the vector of model parameters, ( ; !; ; ), and ym is the predicted market
con…guration given (x m ; z m ; ; "jm ), which can be solved uniquely by backward induction.
The probability in the likelihood, however, does not have an analytical solution due to
the multi-dimensional integrals, so I use the maximum simulated likelihood method.19 To
make the estimation of the large game computationally feasible, I propose a modi…cation of
the Geweke-Hajivassiliou-Keane (GHK) simulator, one of the smooth recursive conditioning
simulators.20 This simulator provides a di¤erentiable, unbiased estimator of likelihood with
18
Unlike usual discrete choice entry models, normalization for the variance of the error is not required.
The variance is identi…ed because variable pro…ts are unparameterized.
19
In the literature, method of simulated moments (MSM) estimators have been used (Berry 1992, SchmidtDengler 2006). However, this approach is less e¢ cient, especially when the number of playes in a game varies
across markets.
20
For the GHK simulator and related methods, see Geweke and Keane (2001).
23
smaller variance than the crude frequency simulator. The original GHK simulator allows
interactions across j through the disturbance structure, but not strategic interactions across
j. In Appendix B, I modify the simulator to …t my model, by claiming that the simulator can
be dovetailed with sequential-move games by exploiting its recursive conditioning structure.
To solve the game and obtain ym in each likelihood evaluation, the (partial) equilibrium
prices and quantities, p
m
and q m , need to be available for any market con…guration, ym ,
in order to determine the payo¤s. First I calculate p
m
by solving the system of the …rst
order conditions in the price-setting game, (5). Given the estimated demand parameters,
enter
equations
( ; ; ), and the values of M C m ; x m , Payment Rate m , and ym , there are Jm
enter
and Jm
unknowns. Due to the numerical property of the discrete choice model, however,
the price function, pjm (x m ; ym ), does not have a closed form solution. I solve the system
of equations for pjm using the following numerical algorithm. The …rst order condition, (5),
can be written as:
pjm =
sjm (p m ; x m ) (
@sjm (p m ; x m )
)
@pjm
1
Payment Rate m + M Cjm ;
enter
for j = 1; :::; Jm
: The following numerical iteration gives the numerical solution for pjm .
(9)
pt+1
jm =
sjm pt m ; x
m
(
@sjm (pt m ; x m )
)
@pjm
1
Payment Rate m + M Cjm ;
enter
for j = 1; :::; Jm
and t = 0; 1; 2; :::. The observed values of pjm are used for the initial
values of ptjm . It turns out that this numerical iteration converges in an acceptable amount
of time.21 After p
m
is calculated, I can use the estimated demand equation to calculate q m .
21
The demand shock,
jm , is consistently excluded from these calculations. If I use M Cjm calculated
with
,
the
performance
of this numerical iteration is sometimes poor.
jm
24
4.4
Welfare Measures and Simulations
The net welfare gain of having private plans in Medicare is given by
(10)
W = (CSw=PrivatePlans
(Gw=PrivatePlans
CSw=oPrivatePlans )
Gw=oPrivatePlans ) + private plan pro…ts;
where CSx denotes the aggregated consumer surplus attributable to program x, and Gx
denotes government expenditures. I use HMO pro…ts instead of the pro…ts of all private
plans, as discussed above. Following McFadden (1981), annual expected consumer surplus
is derived as:
(11)
CSm
(CSw=PrivatePlans
= Im
12
(1
b
CSw=oPrivatePlans )
2 ent
Jm
bjm +
X
b) ln 4
exp
1 b
j=0
jm
!3
5:
The wealth e¤ect is assumed to be zero. In the dataset, all premiums and payment rates are
de…ned on a monthly basis, so the calculated surplus is multiplied by 12.
Counterfactual simulations use the same framework as above. The only di¤erence is that
I exclude
jm
from all calculations in the simulations, because
jm
can be calculated only
for observed entrants. Due to the convexity of the expression in square brackets in (11),
consumer surplus is understated without
jm ,
even though E
jm
= 0. By the same
token, the predicted number of HMO enrollees, government gain, and HMO pro…ts are also
understated. In the simulations, readers should focus on changes in numbers, not on the
level.
Not only bene…ciaries and suppliers but also the government may enjoy the welfare gain
from the program. This gain comes from per-enrollee cost di¤erence between per-enrollee
25
payments to private plans and expected per capita costs in the traditional Medicare plan that
the government would have to pay without private Medicare plans. Thus, if the government
payment rate is lower than the expected fee-for-service cost in a county, the government expects more gain when more members transfer from traditional Medicare to Medicare HMOs.
I calculate Gw=oPrivatePlans by using demographic data and the average fee-for-service cost of
traditional Medicare. I discount the average cost data by 8.0% to take favorable selection into
account; favorable selection refers to the widely acknowledged phenomenon that Medicare
bene…ciaries who choose to join an HMO are systematically healthier than bene…ciaries who
choose to remain in the fee-for-service sector even after adjusting for the demographic characteristics included in Medicare’s HMO payment methodology (e.g. Mello et al. 2003). GAO
(2000) concludes that Medicare HMO enrollees are an average of 11.7% less costly than other
Medicare eligibles. My choice of 8.0% is fairly arbitrary, but I choose 8.0% instead of 11.7%
for the following reasons: (1) the Medicare HMO market had been shrinking since the years
analyzed by GAO (2000), while the main source of the cost di¤erence is new HMO joiners;
(2) the cost di¤erence for new enrollees has been shrinking over the years (GAO 2000); and
(3) after 1998, the government gradually introduced a payment methodology with more detailed risk adjustments. If 8.0% is misspeci…ed by 1%, for example, the net social welfare
gain is a¤ected by only 2 to 5%. This implies that the following welfare simulation results
and policy implications are robust overall with respect to favorable selection.22
26
Table 6: Nested Logit Demand with Fixed E¤ects
Dependent variable:
ln(sjm ) ln(s0m )
ln sjjprivate;m
monthly premium
bene…t: drug
bene…t: education
bene…t: physicals
bene…t: peripheral1
bene…t: peripheral2
bene…t: screenings
# products in plan
no experience
ln experience year
HMO penet rate 98
year: 2004
constant
Adjusted R2
Nested
without IV
0.639
(.014)
0.0022 (.0008)
0.203
(.062)
0.139 (.150)
0.457
(.149)
0.351
(.195)
0.069
(.113)
0.370
(.196)
0.112
(.021)
0.364 (.077)
0.720
(.195)
1.802
(.137)
0.118
(.040)
4.277 (.288)
0.831
Logit
with IV
0.348
(.055)
0.0105 (.0057)
0.338
(.068)
0.296
(.182)
0.419
(.167)
0.551
(.236)
0.096
(.124)
0.517
(.232)
0.163
(.027)
0.389
(.107)
0.794
(.236)
1.255
(.194)
0.253
(.075)
4.955
(.411)
0.797
The number of observations: 3,289. Standard errors are in parentheses.
5
5.1
Results
Parameter Estimation
Table 6 shows the results of the demand estimation. The signi…cant coe¢ cient on ln sjjprivate;m
implies that the imposed grouping structure is relevant. The results indicate that consumers
are attracted by plans with lower premiums, more bene…ts (except for education bene…ts), more options ("# products in plan"), and longer experience of business in Medicare.
Medicare HMOs have more popularity in the counties where HMOs are more common in the
commercial sector. Table 7 reports price elasticities, marginal costs, and per-capita consumer
surplus. The conventional price elasticity is not de…ned, because charging a non-positive pre22
Another possible concern is that the degree of favorable selection may depend on market share. However,
in the counterfactual simulations below, the simulated changes in market share are generally local. The
aggregate market share of private plans ranges from 8% to 20%, with most shares around 13%, so assuming
a constant discount factor is a reasonable approximation.
27
mium is a common practice. Table 7 shows two alternative measures of price sensitivity:
the semi-elasticity,
jm
point of view,
(pjm + Payment Ratem ).23 The estimated semi-elasticity is somewhat
jm
(@sjm =@pjm ) (1=sjm ) and the price elasticity from the producers’
higher than the estimate of Town and Liu (2004)’s,
0.009. A possible explanation of this
di¤erence may be the huge decline in HMO enrollment between their sample periods and
mine. In this market, those who choose private plans tend to be with less risk and more
price sensitive. It may be the case that those who are more price sensitive remained in the
private plans in my sample years. The latter elasticity is well-de…ned because premiums are
much smaller than payment rates. With values less than
1:0, the estimated elasticity does
not contradict the assumption that …rms maximize pro…t. The table also shows that elderly
people in counties with at least one Medicare HMO enjoy an average consumer surplus of $50
per month, regardless of whether the person joins a private plan or the traditional Medicare
plan. As discussed in the model section, I calculate these values in two ways, depending on
whether I include
jm
or not, i.e. depending on whether I use observed market share or
…tted market share. When
jm
is in the calculation, the values obtained are more accu-
rate, but the values calculated without
jm
are necessary for counterfactual simulations.
The table shows that this does not make a di¤erence in the estimated elasticities, but does
produce some di¤erence in the consumer surplus.
Table 8 shows the results of the entry estimation, in which the dependent variable is
each plan’s entry decision and the estimated equation is the latent pro…t function, (4). Since
variable pro…ts are all computed numerically in the entry game, the estimated coe¢ cients
represent each variable’s contribution to …xed pro…ts. The results suggest that, while a
newcomer in Medicare incurs signi…cant setup costs in each county, the length of business in
Medicare is irrelevant. The results that non-pro…t and local plans face lower …xed costs can be
explained by the view that those plans tend to stay in the market for lower pro…t thresholds,
23
If the semi-elasticity is
0.0124, a $1 increase in the premium is expected to reduce enrollment by 1.24%.
28
Table 7: Price Semi-Elasticity, Marginal Costs, and Consumer Surplus
# obs
Mean
Std. dev.
Own-price semi-elasticity (in 100%)
with
3,289
0.0124
0.0026
jm
w/o
3,289
0.0125
0.0024
jm
Own-price elasticity for private plans (in %)
with
3,289
7.529
2.246
jm
w/o
3,289
7.575
2.167
jm
Marginal costs (monthly in $)
w/o
3,289
517.6
103.0
jm
Consumer surplus (per capita, monthly in $)
with
1,767
53.2
80.3
jm
w/o
1,767
41.1
64.4
jm
Min
Max
0.0161
0.0161
0.0058
0.0075
17.452
16.583
2.841
2.952
196.2
1146.3
0.02
0.09
512.8
396.3
because their optimization behavior involves not only pro…ts but also other factors, such
as quality and quantity of services. This view is consistent with the arguments in Gaynor
and Vogt (2003) that nonpro…t …rms act like for-pro…t …rms with di¤erent cost functions.
The negative coe¢ cient on standardized per capita fee-for-service costs indicates that a
plan incurs additional …xed costs in a county where utilization of medical care is relatively
high. Variables of per capita medical care infrastructure are included to control the costs
of organizing and maintaining the provider network. The results indicate that a plan incurs
the highest cost when there is only one hospital in the county — the case where the plan’s
bargaining power is weakest. While the marginal cost regression suggests that marginal costs
are lower in larger counties, a plan incurs higher …xed costs in larger cities. On the other hand,
being in an MSA is favorable to HMO plans, probably because health providers’bargaining
power is weaker compared to isolated cities, or maybe because the coe¢ cient captures some
economy of scale, which is beyond the scope of my supply side model. The estimated
parameters, ! and , indicate that plan-county-speci…c shocks,
the variance of …xed cost errors and county-speci…c shocks,
estimated value of
m,
jm ,
for 40%. The signi…cantly
rejects the null hypothesis of homoskedasticity.
29
account for 60% of
To con…rm how the order assumption may bias the results, I also estimate the model
using randomized order sequences. The di¤erences in estimated values are nearly as small as
the changes caused by di¤erent simulation draws, while the …t of the model always becomes
worse.
Table 8: Entry Estimation
E¤ects on county level …xed pro…ts in $1,000,000
no experience
1.252
(.241)
experience year
0.006
(.005)
nonpro…t
0.159
(.045)
national chain
0.505
(.064)
group model
0.568
(.196)
IPA model
0.166
(.191)
sta¤ model
0.069
(.209)
Std FFS per capita cost
0.002
(.0002)
per capita # hospital
0.223
(.035)
per capita # hospital bed
0.001
(.004)
per capita # medical doctor
0.114
(.062)
county: no hospital
0.937
(.077)
MSA
0.321
(.078)
county: small
0.554
(.069)
county: medium
1.462
(.095)
county: large
4.430
(.216)
county: extra large
7.334
(.771)
county: huge
12.124
(4.85)
year: 2004
0.091
(.030)
constant
1.158
(.229)
! (plan-county-speci…c error)
0.267
(.017)
(county-speci…c error)
0.216
(.014)
(heteroskedasticity)
0.859
(.014)
The number of markets: 5,075. The number of potential entrants: 19,544.
Standard errors are in parentheses. The number of simulation draws: 40.
30
5.2
Welfare Estimates
Table 9 summarizes the calculated net social welfare gain of the Medicare HMO program.
Adding the three welfare components above results in net welfare gains of 9.04 billion dollars
in 2003 and 10.27 billion in 2004. The majority of the gain comes from producer surplus.
Consumer surplus is much smaller than producer surplus and the government is losing money.
The table also shows the numbers calculated without
jm .
Whether I use …tted market
share or predicted market share a¤ects not only consumer surplus but also HMO pro…ts and
government gain through the change in enrollment.
Table 9: Welfare Results
2003
2004
Consumer surplus
2000 (Town&Liu)
2,806
2,862
(1,362)
(1,366)
HMO pro…ts
7,751
8,027
(210)
(243)
Net government gain
CMS payment to private plans
1,119
371
N/A
34,147 37,825
Expected FFS payment without private plans
33,028 37,454
41,726
Net social welfare gain
9,044
10,267
Consumer surplus w/o
jm
HMO pro…ts w/o
jm
Net government gain w/o
Net social welfare gain w/o
2,262
7,357
1,133
8,486
2,476
7,777
305
9,948
jm
jm
4,061
8,757
N/A
N/A
Annual, in millions of dollars. Standard errors shown in parentheses are
calculated by Monte Carlo simulation over simulation draws and parameters.
My estimates are all smaller than the …gures from 2000 estimated by Town and Liu (2003),
shown in the last column. A drastic decrease in the number of participating HMOs and
enrollees between these years may explain the di¤erences. The decrease in producer surplus
31
is smaller than the decreases in the other two components, possibly because average actual
payments have gradually increased since 1998 (GAO 2000). Overstatement of producer
surplus may be another explanation. Whereas …rms may make their entry decision based
on long-run payo¤, my entry model, as well as Town and Liu (2003), assumes that …rms
consider only the current year’s payo¤s. Under a certain stationarity assumption of the
market, this simpli…cation should be reasonable. However, after the exodus of HMOs around
2001, it is likely that remaining …rms are systematically optimistic about their future pro…ts,
which leads to an upward bias in current-year producer surplus. The assumption of additive
separability across counties may be another source of bias.
5.3
Counterfactual Simulations
The structural estimation strategy with endogenous entry-exit decisions of …rms enables me
to evaluate the impact of payment rate changes on the national social welfare. At the same
time, readers should note the simpli…ed setting used in this study, especially in the discussion
of policy implications in relation to the current payment rate policy. Non-HMO private plans
and Medicare Part D are not included in the simulation framework. I endogenize the entryexit decision but not the decisions on plan characteristics, which implies that …rms make all
the pass-through of additional payments simply through premium changes. The results for
2003 are not shown but are generally consistent with the 2004 results.
Payment Rate Simulations First I perform welfare simulations with four di¤erent payment rates. In the simulations, a change in the payment rate …rst a¤ects the cost structure
of HMOs. They then re-optimize their entry-exit and price decisions by taking the demand
and the rivals’response into consideration, which leads to a new equilibrium. Based on the
recalculated market shares, the welfare components are recalculated.
Table 10 shows the results with four di¤erent payment rates in 2004. A uniform rise of 25
32
Table 10: Payment Rate Simulation: 2004
Payment rate:
$25
$0
Consumer surplus
HMO pro…ts
Net government gain
CMS payments
Expected FFS payment
1,671
7,067
950
27,064
28,013
2,476
7,777
307
35,401
35,094
3,589
8,689
2,237
45,817
43,581
5,121
9,786
5,022
58,388
53,366
8,163
11,683
10,747
73,344
68,994
9,688
9,947
10,041
9,885
9,100
3,813,610
3,270,073
1,712
995
60.6
591.4
4,795,493
4,309,146
1,811
1,094
36.5
615.2
5,952,749
5,516,147
1,895
1,178
14.6
641.4
7,304,955
6,918,159
1,990
1,273
7.0
666.1
9,497,744
9,178,289
2,154
1,437
37.1
699.7
Net social welfare gain
Enrollment (total)
Enrollment (HMO)
# of plan-mkt (total)
# of plan-mkt (HMO)
Avg premium (HMO)
Avg payment rate
+$25
All welfare measures are in millions of dollars, calculated without
+$50
+$85:6
jm :
or 50 dollars in the payment rate leads to a decrease in the average premium and increases
in the enrollment and entrants. This, in turn, increases consumer surplus, producer surplus,
and the government de…cit. Among the …ve di¤erent payment rates, the maximum social
welfare gain is achieved with +$25.
I include the last column as an approximation of the 2008 payment rate. MedPAC
(2008) projects that the 2008 payment to Medicare HMOs is, on average, 112 percent of
the FFS costs. Taking into consideration the fact that MedPAC’s FFS cost estimates are
approximately 2 percentage points lower than CMS’s estimates24 and comparing the average
base payment and standardized FFS cost in my 2004 data (684.8 and 700.4 dollars per
month, respectively), I can roughly approximate the 2008 payment level by increasing the
payment by $85.6.25 According to the CMS data, enrollment in all Medicare private plans
24
This is because MedPAC factors out certain spending related to graduate medical education in teaching
hospitals (Merlis 2007).
25
What I simulate here is only payment rates. The level of health care costs, the number of Medicare
33
was 4.7 million in 2004 and 9.4 million in 2008, which is quite close to the …gures in the table
and indicates that the simulated numbers are consistent with reality. The results show that,
although consumers and …rms bene…t from the 2008 level of subsidy, this level is nevertheless
not justi…able from the social welfare perspective.
Decomposing Welfare Changes Further counterfactual simulations give additional insight into how a change in payment rate a¤ects social welfare. I decompose net social welfare
changes into three components: entry e¤ect, market power e¤ect, and subsidy e¤ect. Entry
e¤ect, the welfare e¤ect due to HMOs’entry-exit response, can be con…rmed by turning the
entry-exit response on and o¤ in a payment simulation. HMOs’ market power may be a
source of social ine¢ ciency. To clarify this point, the changes in entry occurrence and average premium when the payment rate increases by 50 dollars are shown according to type of
county in Table 11. Of the last three columns, the …rst and second show the results with and
without simulated entry, respectively. Each plan sets its premium based on the …rst order
condition, (6), i.e. based on its market power. The table shows that, while monopolists
"bank" approximately 10% of the change in payment rate through increased price-cost margins, when there are nine entrants, most of the payment rate increase is passed to consumers
in the form of premium reductions.26 The last column of the table shows the average payment rate changes made by incumbents accommodating an entrant. The more concentrated
the market is, the more premiums fall as a result of entry.
Subsidizing too much is another possible source of dead weight loss. The per-enrollee
payment from the government to HMOs can be seen as a price subsidy. After the welfare
impact of entry is quanti…ed, I decompose the residual into a market power e¤ect and a
subsidy e¤ect using the following experimental simulation: (1) there is no additional entry,
and (2) each …rm passes all the incremental payment along to its enrollees through premium
eligibles, and other institutional details remain unchanged from 2004.
26
Concerning pass-through behavior of a …rm with a cost advantage, see Besanko et al. (2001).
34
Table 11: Counties Where New Entry Occurs and Average Incumbent Premium Change
When Payments Increase by $50: 2004
Initial number
of plans in
a county
0
1
2
3
4
5 to 7
8 to 9
Total
Number
of
markets
1,918
362
181
49
27
8
8
2,553
Number
of entry
occurrence
110.2
34.9
19.1
7.5
3.0
2.4
2.0
178.9
Entry
occurrence
ratio
5.7%
9.6%
10.5%
15.2%
10.9%
29.6%
24.6%
7.0%
Average change in
premium over all plans
w/ entry w/o entry
46.4
47.6
48.2
48.4
49.4
49.6
44.1
46.8
47.5
48.2
48.8
49.2
Average change
over plans facing
new entry
69.9
55.5
52.4
51.1
51.8
51.1
Average over simulation draws. Non-HMO plans are not included.
reduction.2728
The results of the decomposition are shown in Table 12. The second column shows the net
welfare changes as a result of the payment rate increase. The next three columns show the
three e¤ects, which add up to the total e¤ect. For both +$25 and +$50 cases, subsidy and
entry e¤ects signi…cantly increase consumer surplus by lowering premiums and providing
more choice. The market power e¤ect, in contrast, is relatively small.29 The net welfare
impacts of these three e¤ects di¤er between the two cases. The subsidy e¤ect is positive in
both cases, but larger in the $25 case. On the contrary, the entry e¤ect is negative in both
cases, but larger in the $50 case. As a result, increasing payment rates by 25 dollars has a
positive net welfare impact because of the large subsidy e¤ect, while increasing payment rates
27
I do not assume HMO’s price cost margins to be zero; I assume there is no room for them to exploit
their market power for the $50 payment rate increase.
28
In this decomposition, cross-term e¤ects are included in the entry e¤ect. In other words, the entry e¤ect
here includes not only the pure entry e¤ect but also market power and subsidy e¤ects for new entrants. I
have con…rmed the fact that the e¤ect of this treatment on the results is negligibly small.
29
The reason the market power e¤ect on government gain is positive is that, at this level of payment rates,
it is bene…cial for the government to have fewer HMO enrollees even if it is due to HMOs’price-cost margins.
For the same reason, entry e¤ect increases government de…cits. The entry e¤ect on premiums is positive
because some entry occurs in counties without incumbents, resulting in high, monopoly premiums.
35
Table 12: Welfare Change Decomposition: 2004
Total
e¤ect
2004
+$25
2004
+$50
Total welfare
Cons surplus
Prod surplus
Gov savings
Enrollment
Premium
Total welfare
Cons surplus
Prod surplus
Gov savings
Enrollment
Premium
Subsidy
e¤ect
+94:1
+132:7
+1; 112:4
+949:9
+954:0
+911:8
1; 771:3
1; 930:1
+1; 207; 001
+942; 134
25:0
21:9
61:6
+77:5
+2; 223:6
+2; 644:5
+2; 009:1
+2; 031:6
4; 177:8
4; 715:2
+2; 609; 013 +2; 004; 772
50:0
43:5
Market
power
Entry
e¤ect
13:0
25:5
85:6
+248:1
+27:0
69:2
+45:5
204:4
81; 704 +346; 571
+1:7
+1:3
+8:6
147:6
233:5
+654:4
+78:2
100:8
+163:8
701:2
191; 360 +795; 601
+3:6
+2:8
All welfare …gures are in millions of dollars.
by 50 dollars has a negative net welfare impact because the negative entry e¤ect dominates.
Beyond this level of payment rates, per-enrollee government loss is huge and entry of HMOs
harms social welfare.
Budget Neutral Payment Rate As discussed earlier, the payment rate is intended to
slowly converge toward the FFS cost. I evaluate this policy by setting the base payment
equal to the standardized FFS per capita cost in each county. The county-level correlation
coe¢ cient between the original and the new rates is 0.75. Overall, many urban counties
experience a payment increase, while many rural counties experience a reduction. If I calculate the average of the new payment rate based on the same 2004 enrollment and market
con…guration, it is 2.3 percent higher than the average of the original rate.
Table 13 reports the simulation results. As shown in the …rst and second columns, the
new rate leads to a signi…cant increase in social welfare from 9.9 billion dollars to 12.1
billion, a 21 percent increase compared to the 2004 benchmark case.30 Although the new
30
Conceptually, the FFS rate is a budget-neutral payment rate and should make the net government gain
36
payment rate is only 2.3 percent higher if calculated under the 2004 enrollment and market
con…guration, the simulation results show a remarkable increase in average payment rate
from 615.2 to 684.6 dollars, an increase by more than 10 percent. This is due to the changes
in enrollment and entry decisions of …rms. As a result, consumer surplus and enrollment
increase, which signi…cantly increases the net social welfare gain.
Table 13: Payment Rate Simulation with Fee-For-Service Payment Rate: 2004
Payment rate:
$0
Consumer surplus
HMO pro…ts
Net government gain
CMS payments
Expected FFS payment
Net social welfare gain
Enrollment (HMO)
# of plan-mkt (HMO)
Avg premium (HMO)
Avg Payment Rate
FFS rate
FFS+$20
2,476
7,777
307
35,401
35,094
4,725
8,495
1,153
46,401
45,248
5,886
9,198
2,944
54,862
51,917
8,163
11,683
10,747
73,344
68,994
9,947
12,067
12,140
9,100
4,309,146
1,094
36.5
615.2
5,167,844
1,018
45.5
684.6
6,094,323
1,082
27.59
699.3
9,178,289
1,437
37.1
699.7
All welfare measures are in millions of dollars, calculated without
+$85:6
jm :
This welfare gain is not due to adjustments in the level of payments but rather to restructuring of the payment rate distribution. To clarify this point, I perform payment rate
simulations in which I increase or decrease the FFS rate uniformly across counties. I …nd
that social welfare can be maximized by uniformly adding approximately 20 dollars to the
FFS rate, the results of which are reported in the third column of Table 13. Coincidentally,
the average payment rate in this case is commensurate with the case of the 2008 rate simulation, which is discussed above and shown again in the last column of the table. Despite
zero. However, the results still exhibit some di¤erences between the actual payment rate and the FFS cost
due to technical reasons, in particular the fact that in 2004 the CMS was still undergoing a lengthy transition
to full risk adjustment and favorable selection by HMOs was substantial.
37
Table 14: Counties Where New Entry Occurs When FFS Payment Rate Is Employed: 2004
Initial number of HMO
plans in a county
0
1
2
3
4
5 to 7
8 to 9
Total
Number of
markets
1,918
362
181
49
27
8
8
2,553
Number of
entry occurrence
26.1
7.4
3.9
3.0
0.8
1.3
3.5
45.9
Number of
exit occurrence
71.0
32.2
9.2
9.6
0.3
0.1
122.4
Entry
ratio
1.4%
2.0%
2.2%
6.1%
2.8%
16.4%
44.0%
1.8%
Exit
ratio
Premium
change
19.6%
17.8%
18.8%
35.4%
4.1%
0.6%
4.8%
25.1
22.0
0.3
14.4
27.8
110.1
Average over simulation draws. Non-HMO plans are not included.
the same level of the average payment rates, the optimal case generates signi…cantly greater
social welfare gain with much lower Medicare spending. This …nding illustrates how extensively the current subsidy distorts the incentives for Medicare private plans and harms social
welfare.
How does this payment adjustment bring about such a considerable welfare improvement?
In comparing the FFS rate case with the 2004 benchmark case, the critical di¤erence from
the previous payment rate simulations is the signi…cant decrease in the number of entrants
— from 1,094 to 1,018 — despite the signi…cantly higher average payment. Table 14 details
this point by county type. In large markets, the increase in payments attracts entrants,
while many HMOs exit small markets. In this way, Medicare can exploit the potential
welfare-enhancing ability of HMOs in large markets and avoid unnecessary …xed costs in
small markets. At the same time, as long as an entrant can make positive pro…ts under the
FFS rate, entry occurs even in small markets, which generates additional social welfare.
Naturally this may prompt an equity concern: most of the increase in welfare is generated
in large markets at the expense of small markets. To address this point, I calculate per capita
welfare change by county type. The results, which are shown in Table 15, do not exhibit
such inequality. While the larger counties bene…t greater welfare gain, the change in payment
38
Table 15: Per Capita Welfare Components by Market Type: 2004
Number
of plans in
a county
1
2
3
4
5 to 7
8 to 9
Total
Welfare
85:4
216:9
342:5
475:0
891:9
1; 377:9
2004 rate
Cons
Prod
Surplus Surplus
26:7
77:4
109:8
168:8
232:3
178:6
108:0
218:3
288:7
429:7
633:9
781:2
Total
Welfare
FFS payment rate
Cons
Prod
Surplus Surplus
Gov
Savings
49:2
122:2
78:8
239:0
55:9
368:7
118:5
509:7
25:7
942:7
418:1 1; 853:3
35:5
108:2
71:9
212:8
120:2
298:8
128:8
422:8
334:5
684:1
911:5 1; 007:5
21:5
45:7
50:3
41:8
75:9
65:6
Gov
Savings
Number of plans is based on numbers before simulation.
rates enhances per capita consumer surplus in small markets as well.
Simulating Entry Is the entry level excessive? To answer this question, I perform entry
simulations using the FFS rate case as a benchmark. Changing the level of entry is not
straightforward because the simulation results may depend on which plan is included. I
determine several di¤erent lists of plans using the results of the payment rate simulations
with di¤erent payment rates uniformly changed from the FFS rate. The results, presented in
Table 16, suggest that there is no evidence of excessive entry in this market. The benchmark
FFS entry level lies near, or marginally below, the optimum.
Table 16: Entry Simulation: 2004
HMO entry:
55
Consumer surplus
HMO pro…ts
Net government gain
CMS payments
Expected FFS payment
4,549
8,559
1,091
44,649
43,557
Net social welfare gain
12,017
+64
+132
+207
4,725
8,495
1,153
46,401
45,248
4,889
8,378
1,216
48,036
46,820
5,042
8,212
1,274
49,609
48,334
5,162
8,023
1,321
50,843
49,521
12,067
12,051
11,980
11,864
FFS Benchmark
All welfare measures are in millions of dollars, calculated without
39
jm :
6
Conclusion
In this paper, I develop an econometric model to examine welfare consequences of policy
changes through entry, and I take as a case study the US Medicare HMO market. The
innovation of this paper is the explicit treatment of …rm heterogeneity in the welfare analysis
with endogenous entry. It helps exploit detailed …rm level data and allows relevant policy
simulations, which is beyond the scope of previous studies but important in the analysis of
the Medicare HMO market. I …nd no evidence of excessive entry in terms of social welfare
and I show that aligning the payment rate with the FFS cost may lead to a signi…cant gain
in national welfare.
Appendix
A.1 Additional Tables
Table 17 shows the results of the …rst stage regression in the demand estimation. In general,
the instruments have reasonable coe¢ cients and signi…cances. Table 18 shows the results
of the linear regression for marginal costs. The estimated coe¢ cients should be interpreted
with caution, since the regression is a reduced form and serves only for extrapolation.
A.2 Applying the GHK Simulator
In this appendix, I follow the customary notation and change the subscript for a market from
m to i = 1; :::; N , since each market is the unit for which the individual likelihood is de…ned.
For a vector of indices (1; :::; J), the notation "< j" denotes the subvector (1; :::; j
1), "
j"
denotes the subvector (1; :::; j), and " j" denotes the subvector that excludes component j.
In the sequential-move game, the order of subscripts for …rms (1; 2; :::; Ji ) comprises the
reverse of the decision order in market i — …rm Ji makes a decision …rst, …rm 1 makes a
40
Table 17: The First Stage IV Regression
ln sjjprivate;m
Dependent variable:
avg # competitor
competitor: npo
competitor: chain
competitor: IPA
competitor: group
competitor: sta¤
per capita # hospital
per capita # hospital bed
county: no hospital
per capita # medical doctor
R2
F -test (distributed F (10; 2; 987))
0.063
(.068)
0.207
(.056)
0.022
(.057)
0.547
(.058)
0.269
(.033)
0.596
(.118)
2.067
(.617)
0.003
(.004)
0.012
(.042)
0.156
(.094)
0.770
43.98
***
***
***
***
***
*
***
premium
0.446
(1.189)
5.404 ***
(1.03)
6.423 ***
(1.18)
0.989
(1.03)
0.110
(.601)
3.127
(1.98)
2.783
(10.56)
0.007
(.072)
0.305
(.841)
0.825
(1.50)
0.921
7.68
***
The number of observations: 3,289. *, **, and *** denote p<.1, p<.05, and
p<.01, respectively. Heteroskedasticity consistent standard errors are used.
Other independent variables and …xed e¤ects are used but not reported.
decision last, and so on. Firm j’s strategy in market i is represented by yj;i ; a binary variable
that takes "1" if the …rm enters and "0" otherwise. To simplify the notation, I omit the
di¤erence between the independent variables in the demand and the …xed pro…ts, xjm and
zjm , and let Xji denote the set of independent variables. The pro…ts of entering …rm j in
market i are:
(12)
ji (X i ; "ji ; y j;i ;
)
V P (yi ; X i ) + Xji + "ji :
41
Table 18: Marginal Cost Regression
Dependent variable:
copayments
entire enrollees
no experience
experience year
nonpro…t
national chain
group model
sta¤ model
IPA model
DEMO plan
cost plan
cost missing
PPO plan
PFFS plan
PSO plan
POS
demo factor HMO
risk factor HMO
Std FFS per capita cost
medigap premium
ln per capita # hospital
ln per capita # hospital bed
ln hospital expenditure
impatient days
per capita # medical doctor
county: no hospital
HMO penetration rate 98
eligibles
year: 2004
constant
R2
M C jm
0.108
0.082
18.30
0.343
18.06
18.51
39.57
29.92
31.52
31.62
35.39
147.8
11.69
24.29
33.40
31.35
346.0
149.0
0.332
0.396
2.228
4.588
6.928
0.033
19.60
22.21
68.82
0.090
33.05
165.3
(.023)
(.011)
(3.71)
(.226)
(2.41)
(2.29)
(4.19)
(5.69)
(4.65)
(6.72)
(4.36)
(8.82)
(7.43)
(4.61)
(12.6)
(4.36)
(21.3)
(9.97)
(.014)
(.046)
(1.84)
(2.16)
(2.24)
(.012)
(6.25)
(5.47)
(8.67)
(.031)
(2.12)
(27.4)
0.830
The number of observations: 3,289. Standard errors are in the
parentheses. Heteroskedasticity consistent standard errors
are used. MSA …xed e¤ects are included in the estimation.
42
A subgame perfect Nash equilibrium (SPNE) is obtained when (1) all entering …rms are
pro…table with their optimal prices, and (2) all …rms that do not enter expect non-positive
pro…ts from entry. Formally, an SPNE strategy in market i, (yie ); is any strategy that satis…es:
(13)
ji
(14)
ji
X i ; "ji ; y e j;i
0; if …rm j enters,
e
e
X i ; "ji ; y>j;i
; y<j;i (y>j;i
; yj;i = 1)
0; if …rm j does not enter,
for all j = 1; :::; Ji ; where y<j (y j ) is the solution to the downstream subgame, i.e. the best
responses of the downstream players given X; "<k ; and the upstream players’strategies. The
unique equilibrium solution always exists. Denote this solution, after dropping index i, as
y (X; "; )
fy is the unique solution in the sequential-move game with X; "; g:
The component unobserved to the econometrician is speci…ed as
(15)
ji
and
"ji = !
i
ji
+
i.
are assumed to be independent of X i , and distributed i.i.d. standard normal across
…rms and markets.31 The heteroskedasticity adjustment in (7) is dropped for simplicity of
explanation. This simpli…cation does not a¤ect the argument in this appendix.
The log likelihood function can be written as
(16)
31
bM L = arg max
(
)
N
1 X
ln Pr [yio = yi (X i ; )] ,
N i
Unlike the previous studies, no normalization such as ! 2 + 2 = 1 is necessary because the level of pro…ts
is identi…ed. Moreover, this speci…cation of the error components is not crucial for the argument below.
43
where yio is the observed market con…guration and
is the vector of parameters, ( ; !; ).
The probability expression in the likelihood does not have an analytical solution because of
the multi-dimensional integrals involved. Thus, unless the dimension of the unobservables is
very small, the numerical approximation is not feasible. Hence, I use the maximum simulated
likelihood (MSL).
The Modi…ed GHK Simulator The most straightforward simulator for MSL is the
crude frequency simulator. However, such simple discontinuous simulators require many
random draws and are not feasible in practice. My dataset has at most 16 players in some
markets, and the use of the backward induction technique makes each likelihood evaluation
computationally expensive.
I propose the use of the GHK simulator. The GHK simulator is a smooth recursive
conditioning simulator and is often useful when the log-likelihood function involves highdimensional integrals with a multivariate normal distribution. The GHK algorithm draws
recursively from truncated univariate normals. It relies on the decomposition,
f (v1 ; :::; vJ ) = f (v1 )f (v2 jv1 ):::f (vJ
1 jvJ 2 ; :::; v1 )f (vJ jvJ 1 ; :::; v1 );
along with the fact that the conditional normal density can be written as a univariate normal.
The GHK simulator produces probability estimates that are bounded away from 0 and 1.
The estimates are continuous and di¤erentiable with respect to parameters, because each
contribution is continuous and di¤erentiable. It is also an unbiased estimator of individual
likelihood, l( ; !; ; yio ; X i ). It has a smaller variance than the crude frequency simulator,
because each element is bounded away from 0 and 1.
The use of the GHK simulator in a game-theoretic situation, however, is not straightforward, because the original GHK simulator can deal with interactions across j through the
44
disturbance structure, but not strategic interactions across j. The use of the GHK simulator in this study relies on the sequential-game assumption. In a sequential-move game,
the prior players’decisions are known to a player. This fact harmonizes the estimation of a
sequential-move game with recursive conditioning simulators, as shown below.
The GHK simulator exploits the Cholesky triangularization to decompose the multivariate normal into a set of univariate normal distributions. The multivariate normal disturbance
vector, "i , can be rewritten as: "i =
standard normal variates,
i
where
N (0; IJi +1 ), and
i
Thus, "i can be rewritten as: "i
i i,
is a (Ji + 1)
i
i
1 vector of independent
(Ji + 1) parametric array,32
is a Ji
2
3
0 7
6!
6
7
...
6
7
6
7
=6
:
.. 7
...
6
7
.7
6
4
5
0
!
N (0;
i ),
where
i
is a positive de…nite matrix,
i
=
0
i i.
It follows that "i can be written by using the Cholesky decomposition as:
(17)
"i = L( i ) vi ;
where L( ) is the lower-triangular Cholesky factor of
multivariate standard normal vector, vi
, or LL0 =
N (0; IJi ).
The individual likelihood can be written, after dropping index i, as
l( ; y o ; X) = Pr [y o = y (X; ; !; )]
Z
=
n("; )d":
y o =y (X;"; )
32
More ‡exible models can be dealt with by changing
45
i
and the size of
i.
, and vi is another
This expression involves multiple integrals, which are hard to compute in a straightforward
way. The general objective here is to obtain random draws from the distribution "i subject
to y o = y (X; ; !; ). The probability expression that explicitly expresses the rectangle in
which the event, y o = y (X; ; !; ), occurs can be rewritten as:
8
3
>
< > 0 if y o = 1
j
7
6
o0
o0
; 1)0 ); )
Pr [y o = y (X; ; !; )] = Pr 4for 8j, j (Xj ; "j ; (y>j
; y<j (y>j
5:
>
: 0 if yjo = 0
2
By de…ning
8
>
< a = 0; b = 1 if y o = 1
j
j
j
>
: a = 1; b = 0 if y o = 0
j
j
j
the probability can be written as:
9
>
=
>
;
;
(18)
Pr [y o = y (X; ; !; )] = Pr[for 8j, aj (yjo )
o0
o0
0
j (Xj ; "j ; (y>j ; y<j (y>j ; 1) );
)
bj (yjo )]:
Using the form of the pro…t function, (12), and de…ning
aj
aj
Xj
o
o
V P ((y>j
; y<j (y>j
; 1)); X)
bj
bj
Xj
o
o
V P ((y>j
; y<j (y>j
; 1)); X);
(18) can be rewritten as
Pr [y o = y (X; ; !; )] = Pr[for 8j, aj (y o ; X; "<j ; )
"j
bj (y o ; X; "<j ; )]:
This expression shows us the rectangle in which the event, y o = y (X; ; !; ), occurs. To
obtain the interval of "j , we only need "<j . This is because the upstream …rms’decisions are
46
given for …rm j. The Cholesky decomposition (17) makes this expression:
(19)
Pr [y o = y (X; ; !; )] = Pr[for 8j, aj (y o ; X; v<j ; ; )
L( ) v
bj (y o ; X; v<j ; ; )]
or
Z
o
Pr [y = y (X; ; !; )] =
for 8j, aj (y o ;X;v<j ; ; ) L( ) v bj (y o ;X;v<j ; ; )]
"
#
J
Y
(vj ) dv;
j=1
where () is the probability density function of standard normal.
Now we are ready to apply the GHK simulator. For each time of simulation draws,
a vector of independent uniform (0; 1) random variates, (u1 ; :::; uJi ), is drawn. De…ne the
following function:
(20) q(u; a; b)
1
( (a) (1
u) + (b) u) , where 0 < u < 1 and
1
a<b
1:
This function, q( ), is a mapping that takes a uniform (0; 1) random variate into a truncated
standard normal random variate on the interval [a; b].
For given y o ; X; u; ; L, it is possible to recursively de…ne for j = 1; :::; J:
ve1
ve2
veJ
a1 b1
;
L11 L11
a2 (e
v1 ) L2;1 ve1 b2 (e
v1 ) L2;1 ve1
q u2 ;
;
L22
L22
q u1 ;
:::
q uJ ;
aJ (e
v<J
1)
LJ;1 ve1 :::
LJJ
LJ;J
eJ 1
1v
47
;
bJ (e
v<J
1)
LJ;1 ve1 :::
LJJ
LJ;J
eJ 1
1v
and
Q1
Pr
Q2
Pr
:::
QJ
Pr
a1
b1
v1
L11
L11
a2 (e
v1 ) L2;1 ve1
L22
aJ (e
v<J
1)
v2
LJ;1 ve1 :::
LJJ
b2 (e
v1 ) L2;1 ve1
L22
LJ;J
eJ 1
1v
vJ
bJ (e
v<J
LJ;1 ve1 :::
LJJ
1)
LJ;J
eJ 1
1v
Given all the a; b; L; and ve, every Qj is truncated univariate standard normal, so Qj can be
calculated by, for example,
a1
L11
b1
L11
Q1 =
:
Based on repeating this simulation R times, the likelihood contribution simulator is de…ned
as
1 XY
Qj (e
v1r ; :::; veJ
R r=1 j=1
R
(21)
e
l( ; ; y o ; X; R; u)
J
1;r ):
The model is estimated by solving the following maximum simulated likelihood problem:
(22)
bM SL = arg max
= arg max
(
(
)
N
1 X e
ln l( ; ; yio ; X i ; R; ui )
N i
N
R
J
1 X 1 XY
ln
Qj (e
v1r ; :::; veJ
N i
R r=1 j=1
)
1;r )
.
In the computation, I use the Quasi-Newton method with BFGS updating algorithm for
the maximization routine. In making the random draws for the simulator, I use antithetics
to reduce simulation variance and bias. For more details, see Maruyama (2007).
48
:
References
[1] Berry, S.T., "Estimation of a Model of Entry in the Airline Industry," Econometrica
60(4) (1992), 889-917.
[2] Berry, S.T., "Estimating Discrete-Choice Models of Product Di¤erentiation," RAND
Journal of Economics 25(2) (1994), 242-262.
[3] Berry, S.T. and J. Waldfogel, "Free Entry and Social Ine¢ ciency in Radio Broadcasting," RAND Journal of Economics 30(3) (1999), 397-420.
[4] Besanko, D., D. Dranove and M. Shanley, "Exploiting a Cost Advantage and Coping
with a Cost Disadvantage," Management Science 47(2) (2001), 221-235.
[5] Bresnahan, T.F. and P.C. Reiss, "Do Entry Conditions Vary across Markets," Brookings
Papers on Economic Activity 3 (1987), 833-881.
[6] Ciliberto, F. and E. Tamer, "Market Structure and Multiple Equilibria in Airline Markets," mimeo, Princeton University, 2004.
[7] Dafny, L.S. and D. Dranove, "Do Report Cards Tell Consumers Anything They Don’t
Already Know? The Case of Medicare HMOs," NBER Working Paper 11420 (2005).
[8] Dutta, A., "Free Entry in the Market for Drugs in India: Implications for Social Welfare," mimeo, MIT, 2005.
[9] Einav, L., "Not All Rivals Look Alike: Estimating an Equilibrium Model of the Release
Date Timing Game," mimeo, Stanford University, 2003.
[10] Gaynor, M. and W. Vogt, "Competition among Hospitals," RAND Journal of Economics 34(4) (2003), 764-85.
[11] General Accounting O¢ ce (GAO), "Medicare+Choice: Payments Exceed Cost of Feefor-Service Bene…ts, Adding Billions to Spending," Report to Congressional Re-
49
questers, Publication number: GAO/HEHS-00-161 (Washington, D.C., 2000).
[12] Geweke, J. and M. Keane, "Computational Intensive Methods for Integration in Econometrics," Chp. 56 in J.J. Heckman and E. Leamer, eds., Handbook of Econometrics,
vol 5, Elsevier, 2001.
[13] Hausman, J.A., "Valuation of New Goods under Perfect and Imperfect Competition,"
in T.F. Bresnahan and R.J. Gordon, eds., The Economics of New Goods, NBER
Studies in Income and Wealth, vol.58 (Chicago; University of Chicago Press, 1997).
[14] Mankiw, N.G. and M.D. Whinston, "Free Entry and Social Ine¢ ciency," RAND Journal
of Economics 17(1) (1986), 48-58.
[15] Maruyama, S., "Welfare Analysis Incorporating a Structural Entry-Exit Model: A Case
Study of Medicare HMOs," Ph.D. Dissertation, Northwestern University, 2007.
[16] Mazzeo, M.J., "Product Choice and Oligopoly Market Structure," RAND Journal of
Economics, 33(2) (2002), 221-242.
[17] McFadden, D., "Econometric Models of Probabilistic Choice," in C.F. Manski and D.
McFadden, eds., Structural Analysis of Discrete Data with Econometric Applications
(Cambridge, Mass.: MIT Press, 1981).
[18] MedPAC, "Update on the Medicare Advantage Program," in Report to the Congress:
Medicare Payment Policy, March 2008.
[19] Mello, M.M., S.C. Stearns, E.C. Norton and T.C. Ricketts, "Understanding Biased
Selection in Medicare HMOs," Health Services Research 38(3) (2003), 961-992.
[20] Merlis, M., "Medicare Advantage Payment Policy," Background Paper, National Health
Policy Forum, The George Washington University, 2007.
[21] Nevo, A., "Measuring Market Power in the Ready-To-Eat Cereal Industry," Econometrica 69(2) (2001), 307-342.
50
[22] Seim, K., "Spatial Di¤erentiation and Market Structure: The Video Retail Industry,"
Ph.D. Dissertation, Yale University, 2001.
[23] Seim, K., "An Empirical Model of Firm Entry with Endogenous Product-Type Choices,"
mimeo, Yale University, 2004.
[24] Schmidt-Dengler, P., "The Timing of New Technology Adoption: The Case of MRI,"
mimeo, 2006.
[25] Spence, A.M., "Product Selection, Fixed Costs, and Monopolistic Competition," Review
of Economic Studies 43 (1976), 217-236.
[26] Town, R. and S. Liu, "The Welfare Impact of Medicare HMOs," RAND Journal of
Economics 34(4) (2003), 719-736.
51