The Effectiveness of Price Discrimination in the Airline Industry Leonardo Rezende (PUC-Rio) Fábio Roitman (PUC-Rio and BNDES) This version: July 20, 2014 Abstract We study price discrimination in the Brazilian airline market using a dataset that contains information on consumer characteristics that are expected to correlate with willingness to pay but are not directly observable to airlines, such as the passenger’s income. This allows us to investigate the effectiveness of price discrimination practices by airlines, decomposing the effect of an observable characteristic on price into a direct and an informational effect. For instance, we find that when purchasing a ticket one week later, consumers in our sample are willing to pay on average US$ 17 more, but airlines charge US$ 33 more, since late purchases correlate with higher income and traveling for business. We also perform a tentative analysis of the welfare effects of price discrimination. Our estimates suggest that in our setting price discrimination on average benefits consumers. Keywords: Price discrimination; Air transportation. Field: Applied Microeconomics. JEL codes: L11; L93. Preliminary. Please do not quote. 1. Introduction Economy class tickets for flight 3909 from Rio de Janeiro to São Paulo in August 24, 2012 had prices ranging from R$ 129 to R$ 989. This is an example of the extreme dispersion in airline pricing (Borenstein and Rose, 1994). Since airline tickets cannot be resold, this dispersion may reflect price discrimination strategies by the airline companies. In this paper, we seek to investigate the effectiveness of price discrimination by exploring a data set that allows us to observe consumer attributes that are not directly observable by airline companies. If there are other attributes that are visible to the airline, such as the timing of the ticket purchase, that are correlated with unobserved attributes, the airline may price discriminate in a way that seeks to capture both the direct effect of the attribute and the implicit information it contains. Our hope is to devise a method that decomposes the two effects. We provide a stylized model of a profit-maximizing monopolist that faces a linear demand and uncertainty about consumer willingness to pay. We show that if the expectation of the willingness to pay is linear, then it is possible to estimate two sets of parameters: those that describe how attributes affect the expected willingness to pay from the point of view of the airline, and those that affect the true willingness to pay based on the data. Comparing these two sets allows us to better understand how price discrimination works. As we employ a very stylized (but fully structural) model for estimation, it is possible to perform some counterfactual exercises. We report some preliminary calculation of what would be counterfactual pricing decisions if price discrimination was restricted. Since this is a very preliminary work, all our findings, and certainly the counterfactual calculations, have to be taken with a grain of salt. For example, to simplify our analysis, we assume each airline operates as a monopolist, such as in Lazarev (2012). Contrary to him, however, in most of the routes in our data there are several airlines operating, and presumably competing to some extent with each other. This mismatch may be a problem if price dispersion increases (Borenstein and Rose, 1994) or decreases (Gerardi and Shapiro, 2009) with competition. To account for that, we restrict attention to a sample of markets with similar degrees of concentration. Another simplifying assumption is that we do not account for the possibility of scarcity pricing (Dana, 1999; Gale and Holmes, 1993), either explicitly or implicitly. In our framework, the marginal cost of a ticket in an airplane is constant, and the pricing decision is static. Our work at this point is silent about whether price dispersion is due to price discrimination or scarcity pricing (Puller, Sengupta and Wiggins, 2009). Our task is to investigate to what extent the observed price discrimination, that is, how prices correlate with visible attributes of the trade, is effective in capturing variation in consumer characteristics not directly visible by the airline. That connection is consistent with a model of price discrimination, but may be consistent with other theories (Escobari and Gan, 2007; Puller, Sengupta and Wiggins, 2009). The paper is organized as follows. The next section describes the dataset we use and provides some descriptive statistics. Section 3 describes the model we propose and section 4 discusses the estimation strategy. The main findings are presented in section 5. Section 6 presents a welfare evaluation of counterfactual restrictions on price discrimination. We conclude in section 7 with a discussion of some extensions of the analysis we plan to do in the future. 2. Data 2.1. Data Sources The sample used in this paper is a survey of airline passengers in Brazil flying in July and August, 2009. The survey has been conducted by the Fundação Instituto de Pesquisas Econômicas (FIPE) and the data has been used previously in McKinsey & Co. (2010) to describe airport utilization in Brazil. To our knowledge, we are the first to use the sample to study airline ticket pricing. The dataset contains information on flight characteristics, the price paid for the ticket, when the ticket was purchased, and also individual characteristics of the passenger that may affect his or her willingness to pay that are not directly observable by the airline company. This is the unique feature of the data that we seek to exploit in this paper. The survey interviewed passengers in 32 Brazilian airports. The number of interviewees in a route was proportional to the universe of air travel in the route. Interviews were uniformly distributed over time and days of the week available for each desired route. The sample used is formed by passengers flying non-stop between two Brazilian airports. For reasons explained below, we restrict attention to passengers in 43 large routes. Here and elsewhere in the paper, we define a route as a pair of airports, and do not distinguish between directions. We complement this database with information obtained from Agência Nacional de Aviação Civil (ANAC, the regulatory agency) and Instituto Tecnológico de Aeronáutica (ITA). From ANAC we collected flight occupancy ratios, route distances and fuel consumption. From ITA we obtained data on airport utilization. 2.2. Descriptive Statistics Our sample contains data on 9850 passengers flying through 43 routes. We have more observations on routes with more passenger traffic. For instance, there are 1057 observations on Brazil's busiest route, between São Paulo (Congonhas) and Rio de Janeiro (Santos Dumont). Since interviews were distributed over time within a period of one month, we have data on passengers flying on 4523 distinct flights. In 51% of these flights, we observe a single passenger in our sample. Table 1 below presents descriptive statistics of the variables we use in our analysis. On average, tickets are bought one week in advance and price is R$ 316, or US$ 166 at the exchange rate at that time. Average stated monthly income is R$ 4731, or US$ 2490. Figure 1 illustrates the dispersion of prices practiced by the same airline company in the same route. We picked the airline-route pair over which we had most observations, flights from Gol Linhas Aéreas between São Paulo (Congonhas) and Rio de Janeiro (Santos Dumont). We also computed the Gini coefficient for price dispersion within each airlineroute pair. Considering the 31 pairs with more than 100 observations in our sample, we obtain a median Gini coefficient of 0.23, slightly higher but still similar to estimates for the US market. Borenstein and Rose (1994) and Gerardi and Shapiro (2009) report 0.18 and 0.22, respectively. In order to describe statistical relationships between variables in our data, table 2 presents regressions that show how (log) prices depend on consumer characteristics that are not observable by the airline: (log) income and a dummy of whether the ticket was Table 1: Descriptive statistics Variable Definition price income employer time of purchase distance Ticket price, in R$ Monthly per capita income, in R$ Dummy indicating the ticket is paid by the employer Number of weeks between the purchase and the flight Distance between the airports, in kilometres Dummy indicating Guarulhos is one of the airports in the Guarulhos route Galeão Dummy indicating Galeão is one of the airports in the route Dummy indicating the flight occurs on a Tuesday, peak day Wednesday or Thursday Dummy indicating the flight departs between 7am and 10am peak time or 5pm and 8 pm fuel Average fuel consumption in the route, in 1000 litres saturation Sum of 4 dummy variables that indicate airport saturation Note: 9850 observations. Mean Standard deviation 316 4731 0.53 1.17 832 163 5821 0.50 1.43 487 0.24 0.43 0.20 0.40 0.44 0.50 0.42 0.49 4.30 1.83 1.87 1.08 Figure 1: Histogram of Gol prices in Congonhas –Santos Dumont route .3 Fraction .2 .1 0 0 200 Note: 584 observations. 400 600 price (R$) 800 1000 paid by the employer (employer). Both coefficients are positive and statistically significant, and robust to controlling for various alternative sources of variation. Our aim when we control for flight occupancy rate is to indicate that this relationship is not due (or at least not only due) to scarcity pricing1. Table 2: Descriptive regressions Dependent variable: log(price) (1) 0.036 (0.006) 0.324 (0.013) log(income) employer (2) 0.036 (0.006) 0.319 (0.013) time of purchase occupancy rate (3) 0.036 (0.006) 0.204 (0.013) -0.126 (0.004) (4) 0.035 (0.006) 0.206 (0.013) -0.102 (0.007) 0.209 (0.045) Yes Yes Yes (5) 0.029 (0.007) 0.199 (0.016) -0.091 (0.010) 0.276 (0.069) Day of the week dummies (6) Yes Yes Time of the day dummies (11) Yes Yes Route-airline fixed effects (126) Yes Yes Yes Flight fixed effects (2059) Yes Constant Yes Yes Yes Yes Yes Observations 6970 6970 6970 6970 6970 Adjusted R² 0.297 0.297 0.383 0.385 0.407 Notes: (1) Heteroscedasticity-robust standard deviation in parentheses. (2) There are at least two observations in each flight. (3) Each time of the day dummy indicates an interval of two hours. 3. Model We describe demand for an airline ticket by the distribution of an unobserved random variable , the willingness to pay of a randomly chosen customer for that ticket. We assume that a customer makes the binary decision between buying the ticket or not depending on whether ≥ , where is the price of the ticket and is the customer-specific willingness to pay. The willingness to pay variables and associated with flight, route, time of purchase and/or consumer characteristics that correlate with that determine 1 is not directly observed, but we observe two sets of . The variables in are the observable variables ; the objective of our analysis is to estimate | . Not all Ideally, one wishes to control for occupancy rate at the time of the ticket purchase. We do not have this information. Our occupancy rate variable is a proxy combining the time of purchase and the occupancy rate at the time of the departure, assuming seats fill up according to the empirical distribution of purchase times we observe within each route. components of are observable by the airline at the moment of pricing. is the set of characteristics that are observed and used by the airline company when determining price. We assume that are the determinants of that is, in the presence of , | , = ∗ | ; . However, as the airline may be useful to predict be the marginal cost of the flight ticket. Based on pricing decision is to charge price where | would be redundant to predict does not directly observe , variables in Let in the sense that ∈ is the distribution of − 1− | conditional on and | ∗ . and , the airline is the price that maximizes the expected profits for that particular seat. By considering the pricing decision for each seat in isolation, we abstract away from explicitly modeling scarcity pricing or competition across airlines in order to concentrate on price discrimination practices based on the informational relationship between , and . To proceed, we assume the conditional expectations of are linear: | and | with respect to and = = . Similarly, we assume the marginal cost is a linear function of a vector of covariates !: − = !". Our objective is to estimate , and ". We rely on a simple distributional assumption for | ~% − , , where | . We assume that is a dispersion parameter to be estimated. This is equivalent to assuming that the airline faces a linear demand, conditional on the information it has about consumer willingness to pay. Under these assumptions, the optimal choice of pricing is ∗ − '( + + !" ⁄2 '( − ≤ =& ./ ' 0 1230 + '( − !" ≥ 3 − !" ≤ 3 − !" ≤ − For a market with intermediate values of expected willingness to pay compared with marginal cost, the optimal price is simply the monopoly price, the average between the marginal cost !" and the choke price + . When the expected willingness to pay is too large compared with marginal cost, the airline prefers to charge a price that guarantees the ticket is sold. Conversely, if willingness to pay does not cover cost, the optimal price is higher than the choke price, to prevent the ticket from being bought. 4. Estimation We estimate our model using a two-step procedure. 4.1. First Step In the first step, we estimate a pricing equation using nonlinear least squares. Our linear demand monopoly pricing model predicts that prices of the tickets that are sold depend on parameters and attributes according to the following expression: , !, , ", ∗ − '( + + !" ⁄2'( − ≤ =4 − !" ≥ 3 − !" ≤ 3 (recall that the ticket is not sold if the airline decides to charge a price above We estimate , " and that minimize prediction squared error 5 − ∗ + ). 7 , !, , ", 6 . 4.2. Second Step In the second step, we obtain , the coefficients that describe the expected willingness to pay with respect to its determinants. We use the estimates of in the first step, | and to recover . Recall that we are assuming that conditional expectations are linear ( | and | = . From these relationships we obtain the following proposition: 8 Proposition: Proof: Let 9 = 9= 8 ) and that an exclusion restriction is valid: − = +: 8 | , 8 +0= 8 = − = , so 8 8 | 8 and : = 9| , +9 = − 8 | , = = | . By the exclusion restriction, 9 = 0 . We then have +0 □ This is a linear set of equations that must be valid in population. We employ it to estimate directly as follows: If the number of parameters in to-one linear mapping between linear equations for and and are the same, this is (generically) a one- . In this case we simply solve the system of , plugging in the first step estimates of 8 covariance estimates for and If the number of parameters in 8 and sample variance- . is larger (equivalently, if has more variables than ), this is a system of equations with more equations than unknowns. All equations are satisfied in population by the true value of . In this case our estimate of 8 8 8 16 5 > 8 8 16 <− > <− > '.= 5 > < 9 is where our plug-in estimates in the right-hand side are as before. 5. Empirical Results Since our model describes pricing decisions of a monopolist, ideally one would like to circumscribe the empirical analysis to (isolated) routes with a single operating airline, as in Lazarev (2012). Unfortunately this is not feasible, as the survey design did not focus on monopoly routes. As a feasible alternative to control for market concentration heterogeneity, we selected the 43 routes that meet the following criteria: (i) presence of the two largest airline companies (Tam and Gol); (ii) at least one more carrier. Our selection includes the 10 domestic routes with largest traffic in 2009. We need to specify three sets of regressors: , the attributes that determine willingness to pay; , the attributes the airline uses to predict willingness to pay; and !, the attributes that affect cost. Partition that 7 = 7 . The variables in ? and into = ? , 7 and = ? , 7 so are consumer attributes that affect willingness to pay ? but the airline does not directly observe. The variables in are used by the airline to determine pricing; these variables do not directly affect willingness to pay. In our sample we observe the reported income of the passenger and whether the trip was paid by the employer. These variables are expected to have a positive impact on willingness to pay, but are not observable by the airlines at the level of individual customers. These are the variables included in the vector The vector 7 , which is equal to 7 ? . , is formed by variables observed by the airline that directly affect both prices and willingness to pay. We included in 7 : route distance, dummies for two particular airports2, a dummy for flight on a peak day3, another dummy for flight on rush hour4 and time of purchase. The variables in ! are the ones that affect cost. There are two variables in !: the average fuel consumption in the route and airport saturation, which is defined as the sum of four dummy variables that indicate saturation of the airports in the route. We report results of two econometric specifications of our model that differ in the definition of ? . These variables, despite not directly affecting willingness to pay, are used by the airline when determining price. In our first specification, we use two variables in ? : the average passenger income in the route and the proportion of trips in the route paid by the employer. These variables are clearly correlated with the variables in ? , since the former are averages over routes of the latter. Therefore they would be valid in our framework, if we assume they are observable by the airlines. This is reasonable, since firms are likely to be well informed about the distribution of consumer characteristics in each route, while still uninformed about which consumer is which at the time of trade. In our second specification, we use city dummies as the components of ? . This is a somewhat conservative choice: these variables are certainly within the relevant information set of the airlines. On the other hand, the exclusion restriction that ? should not directly affect willingness to pay is likely to be invalid, and for that reason the first specification is our preferred one. Our main estimates are presented in Table 3. In our preferred specification (1), we find positive and statistically significant effects for both components of the ? vector on the expected willingness to pay from the point of view of the airline. We also find effects that are significant and with the expected sign for distance, undesirable airports, rush hour flights and time of purchase. The coefficients of cost drivers have the expected sign, but the effect of fuel expense is not significant. We suspect this is due to collinearity with distance. Our estimate for is R$113, around US$ 60. This means our best fit is a model where the consumer willingness to pay, conditional on the information of the airline company, is uniformly distributed in a range of US$ 120. 2 Guarulhos (São Paulo) and Galeão (Rio de Janeiro) are airports located far from city centers, and presumably are less desirable than alternative airports within each city. 3 A dummy for flight departing on Tuesday, Wednesday or Thursday. 4 A dummy for flight departing between 7am and 10am or 5pm and 8pm. Table 3: Estimated parameters - main results Specification 1 standard coefficient error Specification 2 standard coefficient error Panel A: first-stage estimates ? log(mean income by route) employer by route São Paulo Campinas Rio de Janeiro Belo Horizonte Curitiba Porto Alegre Brasília Salvador Recife 7 distance Guarulhos Galeão peak day peak time time of purchase constant ! fuel saturation 153.49 346.95 (26.65) (37.80) 0.214 -46.83 -23.23 6.14 9.64 -61.85 -1182.3 6.31 44.02 113.61 (0.018) (7.03) (7.96) (4.07) (4.11) (2.20) (201.9) (3.72) (4.09) (21.52) 97.09 -87.94 -24.77 -18.99 -18.64 -12.65 59.90 -1.32 29.37 0.120 -119.91 -16.55 4.76 12.08 -50.04 343.1 11.88 -5.54 113.16 (9.21) (17.22) (14.69) (10.05) (10.73) (11.90) (9.57) (10.66) (9.34) (0.015) (12.42) (20.61) (3.72) (4.07) (6.30) (21.4) (84.56) (71.01) (20.59) Panel B: second-stage estimates ? log(income) 202.99 (40.92) 243.22 (44.12) employer 274.99 (45.55) 248.19 (75.00) 7 distance 0.250 (0.016) 0.243 (0.016) Guarulhos -24.50 (8.87) 4.39 (15.21) Galeão -29.49 (9.15) -55.32 (10.15) peak day -27.65 (7.70) -24.65 (10.26) peak time -22.33 (6.40) -23.48 (9.78) time of purchase -32.10 (4.40) -20.87 (8.51) constant -1516.2 (304.9) -1814.8 (355.9) Notes: (1) 9850 observations. (2) Standard errors are bootstrap standard errors from 100 resamplings. Panel B presents our estimates for , the implied effect of the variables we observe (but the airline does not necessarily observe). All coefficients are statistically significant. Within this framework, comparing the magnitude of and for the same attribute allows us to identify two effects on willingness to pay: the direct effect of the attribute on willingness to pay, captured by , and the informational effect of the attribute: the effect caused by the fact that the airline uses the attribute to make an inference about unobserved characteristics that correlate with it. This effect corresponds to the difference between and . Our estimates for time of purchase show that if a consumer buys a ticket one week in advance, airlines infer that his or her expected willingness to pay is R$ 62 (approximately US$ 33) less than anticipated. Half of it (R$32 or US$ 17) is the direct effect: early buyers are willing to pay less for the ticket. The other half is the information effect: passengers that buy tickets on short notice tend to have high income and/or have the tickets paid by the employer, and airlines use this correlation to price discriminate across these unobserved attributes. Similar effects are also found in the case of flights using the Guarulhos airport. In the case of the rush hour flights, our estimates suggest that the direct and the information effect are both present, but operate in opposite directions: the informative effect is positive (R$ 32 or US$ 17), again reasonable because passengers during rush hour are more likely to be traveling for business reasons. The direct effect is negative: passengers themselves do not like to travel during rush hours, and are willing to pay R$ 22 or US$ 12 less in that case. Table 3 also reports our estimates for specification 2. While point estimates change, the findings we discuss above hold qualitatively, with the exception of those related to Guarulhos airport. The estimates related to cost are much less precise in this specification, probably because of the correlation with the city dummies in Surprisingly, our estimate for ? . is virtually identical in both specifications. In specification 1 in table 3, we assume airlines do not observe income and whether the ticket was paid by the employer at the level of individual buyers, but they do observe the average values of these variables in each route. If this is true, airlines would explore variation of these variables between routes to price discriminate. On the other hand, if the airlines knew how the distribution of consumer attributes vary over days of the week, then they would wish to price discriminate across days of the week as well. Therefore, by varying our assumption about what firms observe, we can investigate the types of price discrimination strategies they are using. In table 4 we rerun specification 1 using alternative assumptions about how informed airlines are about consumer attributes. We find that if we assume that the information about consumer attributes is finer, estimated coefficients are similar. Compared to the estimates in Table 3, the coefficients in consumer attributes are smaller, as well as those in employer. associated with average associated with income and Table 4: Estimated parameters under alternative assumptions about consumer attributes observability Attributes by routepeak day coeff. ? 7 ! standard error Attributes by routepeak time coeff. Attributes by routepeak day-peak time standard error coeff. standard error 81.21 245.78 0.218 -57.54 -23.7 -24.1 -3.69 -64.53 -505.0 0.37 54.58 134.39 (16.23) (33.55) (0.016) (7.30) (9.04) (7.29) (5.60) (2.11) (129.4) (2.93) (4.08) (19.56) Panel A: first-stage estimates log(mean income by route-day) employer by route-day log(mean income by route-time) employer by route-time log(mean income by route-day- time) employer by route-day-time distance Guarulhos Galeão peak day peak time time of purchase constant fuel saturation 101.99 325.02 (23.23) (40.00) 109.22 305.39 0.213 -55.23 -24.64 -40.02 11.6 -62.66 -715.1 3.77 49.27 120.26 (0.025) (7.68) (9.94) (9.13) (5.20) (2.30) (254.4) (4.24) (4.41) (183.32) 0.209 -48.3 -24.69 7.02 -13.76 -62.65 -772.8 3.56 49.74 119.31 (22.99) (38.00) (0.016) (6.93) (8.91) (4.73) (5.46) (2.13) (177.1) (3.14) (4.43) (18.99) Panel B: second-stage estimates ? log(income) 145.36 (34.56) 139.16 (33.79) 110.56 employer 266.69 (38.62) 237.2 (45.28) 202.14 7 distance 0.24 (0.023) 0.232 (0.015) 0.238 Guarulhos -34.92 (9.24) -37.21 (7.58) -45.25 Galeão -30.90 (10.52) -31.7 (9.82) -29.11 peak day -20.20 (8.22) -21.02 (7.63) -10.04 peak time -17.05 (6.94) -14.23 (6.11) -7.04 time of purchase -34.56 (3.81) -37.53 (4.40) -43.37 constant -1042.7 (316.8) -967.9 (249.0) -731.2 Notes: (1) 9850 observations. (2) Standard errors are bootstrap standard errors from 100 resamplings. (25.22) (35.94) (0.016) (8.09) (9.51) (7.32) (6.25) (3.89) (190.2) 6. Counterfactuals This section presents some counterfactual calculations using our estimates. We begin by discussing welfare assessments within the confines of our model. 6.1. Measuring welfare in our model In our setting, once a ticket is sold at price willingness to pay , profits are @ = − and at cost to a buyer with and consumer surplus is AB = − . However, as we only directly observe tickets that are sold, our calculations must account for selection bias as follows. The expected consumer surplus, conditional on attributes , is AB| = − | ; ≥ .D ≥ | since when a ticket is not sold, the consumer surplus is 0. Similarly, expected profits are @| = − !" . D ≥ | We cannot empirically estimate D ≥ | , the probability of the ticket being sold conditional on attributes, since our data contains only information on consumers that bought the ticket. Within our model, we imposed a distributional assumption on , so this probability can be calculated directly from our estimated coefficients, as well as and . We wish to be able to compute consumer surplus and profit taking the route as our market definition. For this, we partition E into , attributes of the route, and attributes of the particular flight and time of purchase. We are interested in and AB| @| @| E E : =H J∈K =H E J∈K AB| @| FG FG where B is the support of = I; = I; FG E E .D .D FG FG . = I| FG The conditional distribution of = I| | E AB| FG E , E E is outside of our model and not directly observable in the data due to selection bias. However, we can empirically estimate D FG D FG = I| E = I| E , 1L/ , the distribution of FG E conditional on and the event that the ticket was bought. This a conditional distribution between variables observed in our sample. According to Bayes Law, for any I ∈ B: , 1L/ ∝ D 1L/| Note that D 1L/| FG , E FG = I, =D E .D ≥ | Thus, we are able to recover the distribution of AB| E and @| E . Figure 2 shows our estimates for D 1L/| FG = I| E can be computed from the model. FG E | E and, hence, we can calculate using this estimation strategy for the 5 routes with most passengers in 2009. These are demand curves within each route. Unlike D 1L/| , which are linear by assumption in our model, these curves are non- linear, as they depend on the distribution of FG | E . Figure 2: Demand curves for the 5 routes with most passengers in 2009 800 CGH-SDU BSB-CGH GRU-SSA GRU-REC GRU-POA 700 price (R$) 600 500 400 300 200 100 0 0 0.1 0.2 0.3 0.4 0.5 Prob. 0.6 0.7 0.8 0.9 1 Notes: (1) We employ the estimates obtained in specification 1. (2) In the horizontal axis, Prob. means D 1L/| E . 6.2. Counterfactual exercises In this section, we employ our estimates to compute the impact on (relative) consumer surplus and profits of hypothetical policies that restrict price discrimination. In the first exercise, the airline is restricted to charge a single (profit maximizing) price for all flights and passengers in the route. In the second exercise, prices are allowed to differ across flights within each route, but are not allowed to depend on the time of purchase. We use the distribution of each counterfactual. FG | E to obtain the profit maximizing prices in Table 5 presents predicted price ranges for some routes in our sample in the baseline and under the two counterfactuals, using as estimates the ones obtained in specification 1. Our calculations predict that if restricted to charge a single price, a monopolist facing the type of demand we estimate would elect to charge a high price, near the top of the baseline distribution. This is true for almost all routes in our sample, but we restrict Table 5 to the 10 routes with most passengers in 2009 to save space. Table 6 presents the relative impact of the counterfactual policies in consumer surplus, profits and social welfare, using our estimates in specification 1. Profits fall in all cases, as they should for a monopolist. More interestingly, consumer surplus in most routes also falls, as airlines react to the policy by hiking up prices. Thus, for this set of estimates, social welfare would be reduced by around 10% if price discrimination were restricted5. Table 5: Predicted price ranges in counterfactual scenarios, in R$ Prices depend on Route route, flight and time of purchase route and flight route (model) (counterfactual) (counterfactual) min max min max CGH-SDU 240 371 352 368 364 BSB-CGH 313 459 443 459 452 GRU-SSA 243 388 310 388 378 GRU-REC 299 532 454 470 455 GRU-POA 163 295 275 291 281 CGH-CNF 233 394 378 394 382 BSB-GIG 215 348 277 348 338 GIG-SSA 208 370 292 370 299 CGH-CWB 213 359 343 359 353 CGH-POA 237 395 327 395 379 Notes: (1) We employ the estimates obtained in specification 1. (2) We report results for the 10 routes with most passengers in 2009. Table 6: Changes in welfare measures in the counterfactual scenarios, relative to the baseline Route Counterfactual Prices depend on route and flight Prices depend on route AB| E @| E AB| E + @| E AB| E CGH-SDU -9% -5% -6% -13% BSB-CGH -25% -9% -14% -25% GRU-SSA -8% -15% -13% -31% GRU-REC -8% -17% -15% -3% GRU-POA -22% -14% -17% -22% CGH-CNF -28% -10% -15% -26% BSB-GIG -4% -14% -11% -27% GIG-SSA -6% -15% -13% +1% CGH-CWB -21% -7% -12% -23% CGH-POA -24% -13% -17% -29% Notes: (1) We employ the estimates obtained in specification 1. (2) We routes with most passengers in 2009. 5 @| E AB| E + @| E -6% -10% -17% -17% -15% -10% -16% -16% -8% -14% report results -8% -15% -22% -14% -17% -15% -19% -11% -12% -19% for the 10 If the airline is restricted to charge a single price in each route, the mean reduction of the social welfare, considering all the routes in the sample, is 13%. 7. Concluding remarks While our empirical findings in this paper are, in our opinion, reasonable, much remains to be done. We explore the unique feature of the data set (namely that we observe some consumer characteristics firms do not) to investigate price discrimination practices. The strategy that we pursued was of postulating (and trusting) a stylized linear demand monopoly pricing model. To trust our results, we need to better understand to what extent they depend on the specificities of the model. More specifically, we would like to know to what extent our results depend on the linear demand/ uniform willingness to pay assumption. We would also like to extend the analysis to an oligopoly framework where consumers make multinomial decisions between flights and airlines within the same route. Another extension we are considering is to introduce the possibility that the airline may observe something about consumer willingness to pay that we do not observe in our data. This unobserved component would provide a structural interpretation for the residual of the pricing equation, and would allow us to deal with selection issues in a more explicit way. References BORENSTEIN, S.; ROSE, N. (1994): Competition and Price Dispersion in the U.S. Airline Industry. Journal of Political Economy, 102 (4), 653-683. DANA, J. (1999): Equilibrium Price Dispersion Under Demand Uncertainty: the Roles of Costly Capacity and Market Structure. RAND Journal of Economics, 30 (4), 632-660. ESCOBARI, D.; GAN, L. (2007): Price Dispersion Under Costly Capacity and Demand Uncertainty. Discussion paper, National Bureau of Economic Research. GALE, I.; HOLMES, T. (1993): Advance-Purchase Discounts and Monopoly Allocation of Capacity. The American Economic Review, 83 (1), 135-146. GERARDI, K.; SHAPIRO, A. (2009): Does Competition Reduce Price Dispersion? New Evidence from the Airline Industry. Journal of Political Economy, 117 (1), 1-37. LAZAREV, J. (2012): The Welfare Effects of Intertemporal Price Discrimination: an Empirical Analysis of Airline Pricing in U.S. Monopoly Markets. Discussion paper, New York University. MCKINSEY & CO. (2010): Estudo do Setor de Transporte Aéreo do Brasil: Relatório Consolidado. 1st edition, Rio de Janeiro. PULLER, S.; SENGUPTA, A.; WIGGINS, S. (2009): Testing Theories of Scarcity Pricing in the Airline Industry. Discussion paper, National Bureau of Economic Research.
© Copyright 2026 Paperzz