The Probability Distribution of Future Demand: The Case of Hydro Quebec Author(s): Jean-Thomas Bernard and Michael R. Veall Source: Journal of Business & Economic Statistics, Vol. 5, No. 3 (Jul., 1987), pp. 417-424 Published by: American Statistical Association Stable URL: http://www.jstor.org/stable/1391617 . Accessed: 24/08/2011 21:28 Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at . http://www.jstor.org/page/info/about/policies/terms.jsp JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. American Statistical Association is collaborating with JSTOR to digitize, preserve and extend access to Journal of Business & Economic Statistics. http://www.jstor.org Journalof Business&EconomicStatistics,July1987,Vol.5, No.3 ? 1987 American Statistical Association The Future Probability Distribution of Demand The Case of HydroQuebec Jean-Thomas Bernard of Economics,UniversiteLaval,Ste. Foy,QuebecG1K7P4, Canada Department Michael R. Veall of Economics,McMaster OntarioL8S4M4,Canada Hamilton, Department University, Thebootstrapping distribution of techniqueof Efron(1979)is usedto estimatethe probability futureelectricity demandforHydroQuebec.Theapplication followsthe regressionapproach of andPeters(1984a,b)butalso allowsforseriallycorrelated Freedman disturbances anduncervariableforecasts. taintyinthe independent KEYWORDS:Bootstrapping; intensivestatistics;Electricity demand;ForeComputationally cast uncertainty; Serialcorrelation. 1. INTRODUCTION time andlow fixed-cost,highvariable-costcapacitysuch as thermal generation and rather less high lead time and high fixed-cost, low variable-costcapacitysuch as nuclear (Ellis 1980; Veall 1983). Therefore, the planning process needs more than simple point projections of future demand; it needs an estimate of the entire probabilitydistributionof future peak demand, conditionalon currentinformation. In this articlea regressionmodel is employedin the estimationof the probabilitydistributionof futureelectricity demand for Hydro Quebec, one of the largest utilities in North America. The model has been kept very simple to ensure the tractabilityof the approach and to keep the emphasisof the articleon the second moment ratherthan the first moment of the forecast. The technique used is bootstrappingas proposed by Efron (1979, 1982) and describedin general terms by Efron and Gong (1983). The particularapplicationto forecastingfollows the Freedmanand Peters (1984a,b) one-equation approach, modified to accommodatea recursivesystem with autocorrelateddisturbancesand stochasticforecastsof the independentvariables.Veall (1987a)appliesa somewhatsimilarapproachto Ontario Hydrodata, althoughin thatcase the recursivestructure is simplerand there is no need to treat the serial correlation, which representsa significantcomplication. Computationallyintensive techniquessuch as bootstrappingare becomingincreasinglyimportantas tools of applied statisticsand econometrics.They appearto be particularlyvaluable in cases in which standarderrors or confidenceintervalshave either no known formulas or the analytic expressionsavailable are based on tenuous asymptoticand parametricassumptions. A common situationof this sort is the predictionof a futurevalue of a time series given some set of current information.If, for example,a regressionmodelis used, it is straightforwardto forecast the future value of a dependentvariableby takingthe innerproductof forecastsof the independentvariableswiththe least-squares coefficient estimates from the observed sample. But although there is a standardformula (e.g., Johnston 1984, p. 195) for confidenceintervalsof such forecasts, it requiresthat the dependentvariablebe normallydistributed(an assumptionthat cannotbe relaxedfor large samples) and that the predicted values of the independent variablesbe somehow known with certainty. This latter assumptionespeciallyis unlikelyto be even approximatelytrue in most practicalsituations. Confidence-interval-typeinformationis likely to be particularlyimportantin the predictionof future electricitydemand. First, most utilities are less concerned about expected future demandthan about the amount of capacityrequiredto ensuresome (presumablylarge) probabilitythat there will be no shortage. Second, the degree of forecastuncertaintyitself is importantto the utility'sdecisions. All other things being equal, as the degree of demand uncertaintyincreases, the optimal mix of capacity will include relatively more low lead 2. THE MODEL Many utilities forecast peak demand by forecasting averagedemandfor the relevantperiod and assuming that averagedemandis some given constantproportion (calleda "loadfactor")of peak demand.Our approach instead employs a regressionlinear in logarithms.To illustrate, we use annual data for Hydro Quebec for 417 418 Journalof Business & EconomicStatistics,July 1987 the period 1965-1983 and estimate the ordinaryleast squares(OLS) regression log(peakt) = -.652696 + 1.126434 log(AMW,), (.017234) (.153962) R2 = .9960, DW = 2.0403, (1) where peaktis the annualpeak demandin megawatts (Mw), AMW,is the averagedemandover the year also in Mw, standarderrors are in parentheses, and DW representsthe Durbin-Watsonstatistic. The constant load factor assumption would imply that the log(AMW,) coefficient is unity, a hypothesisthat can be easily rejected at the 5% level of significance. The next step is to estimate a forecastingequation for AMW,. Again emphasizingsimplicity,we estimate with OLS and the same sample period as in (1): log(AMW,) = -2.849834 + .066910 log(pricet) (2.307198) (.130148) + .983279 log(GPP,) (.308234) + .026412 time,, (.010444) R2 = .9942, DW = 1.3464, (2) where pricetis the real price of electricitycalculated somewhatcrudelyby dividingaverageHydro Quebec revenuesper Mw by the grossprovincialproduct(GPP) deflator for Quebec, GPP, is the real gross provincial product, time, is a linear time trend equal to the last two digits of the year, and standarderrors are in parentheses.Two things should be noted about these estimates. First, the log(price,)coefficientdoes not have a negative sign, as would be expected theoretically. Second, the value of the Durbin-Watson statistic is fairlylow, indicatingthe possibilityof seriallycorrelated disturbances.As the test statistic is in the indecisive region of the Durbin-Watsontables, the exact probability value of the null hypothesiswas calculatedusing the SHAZAM package'suse of the routinesof Imhof (1961). As the value was only .0109, we reject the null hypothesisof no serial correlation. We therefore estimate a new model for AMW, by takingthe residualsfrom (2) and estimatingtheir firstorderautoregressivecoefficientas .3030.Wethentransform Equation (2) using the noniteratedprocedureof Cochraneand Orcutt(1949)with the PraisandWinsten (1954) weighting of the first observation by (1 - p2)1/2. The result is log(AMW,) = 1.695002 - .15127 log(price,) (2.481269) (.141747) + .830044 log(GPP,) + .031422 time,, (.011210) (.331061) p = .303009, R2 (on transformed variables) = .9970, DW (on transformedvariables) = 1.8800, (3) wherestandarderrorsare in parentheses.The DurbinWatsonstatisticnow cannot reject the null hypothesis of no autocorrelationat the 5% level, and the price coefficienthas a negativesign, althoughits value is not significantlydifferentfrom0 at the 5% level. The coefficientsof log(GPP)andtime seem to be estimatedwith acceptable precision, although the trend coefficient's estimate of a 3.1% per annum growth in demand for givenpriceandincomemay seem somewhathigh. (This may be partlydue to increaseduse of electric heating in both the residentialand commercialsectors.) Equations(1) and (3) are then taken as the working model and subjected to a series of diagnostic tests. Monte Carloresearchby Whiteand MacDonald(1980) suggests that a suitable test for normalityin the disturbances is the variation on the Shapiro and Wilk (1965)test suggestedby ShapiroandFrancia(1972)and appliedto the OLS residuals.The test values are .9080 and .9821for Equations(1) and (3), respectively,where the 95% acceptance region is [.9010, 1.0000], so the null hypothesisof normalitycannot be rejected at the 5% level, although the value of the test on (1) does indicatemore than a 90% probabilitythat the disturbances are nonnormal.Similarconclusionsare yielded by the ordinaryShapiro-Wilktest statistic.The Breusch and Pagan (1979) test (TR2 version; see Judge, Griffiths, Hill, Liitkepohl, and Lee 1985, p. 447) cannot rejectthe null hypothesisof homoscedasticityfor either equation, with statisticsof .0169 and 1.5942 compared with respectivechi-squaredcriticalvaluesof 3.8415and 7.8147. The Engle (1982) test againstan alternativeof first-orderautoregressiveconditionalheteroscedasticity (ARCH) effects giveschi-squaredstatisticsof .3392and 2.1591 compared with a critical value of 3.8415 and hence cannot reject the null hypothesis. Finally, as a test againstthe potential correlationof log(AMW,) and the disturbancein (1), a specification test was conducted as describedby Hausman (1978). The test value was a t statisticof -.5876 and hence is unable to reject the null hypothesisof no correlation. 3. THE BOOTSTRAP Now that a simple model has been developed, it can be used to forecast and the probabilitydistributionof the forecast errorscan then be estimated. First, however, some discussionof the techniqueof bootstrapping is required.This will be brief because, as noted, there are both formaland informaldescriptionsof the bootstrap available elsewhere and in particularFreedman and Peters (1984a,c) discussedthe bootstrapin a forecastingcontext. Much of the intuition behind bootstrappingcan be given by a simple example, such as the estimationof the median of a sample of independentobservations drawn from the same, perhaps unknown, underlying distribution.The probabilitydistributionof the estimated median is typically complex, except for a few casessuchas one in whichthe observationsare normally 419 Bernardand Veall:FutureDemandof HydroQuebec distributed.The bootstrap,however, replacesthis calculationby creatinga largenumberof artificialsamples by drawingrandomlyand with replacementfrom the originalsample. The probabilitydistributionof the estimated median can then be approximatedby the histogramof the mediansof the artificialsamples. We shall now jump directly to the specifics of our context. Firstwrite the model of Equations(1) and (3) as Y,= [1 X]a + e,, , t= 1, . . ., T, (4) are mutually independent, Yt is log(peakt), Xt is log(AMWt), and Zt is a 1 x 4 row vector correspondingto the explanatoryvariables in Equation (3). The advantageof a two-equationmodel (ratherthan solvingfor Xt and makingYta functionof Z,) is that the recursive system imposes structure on the dynamicsas well as on the way in whichthe variables Zt affect Yt. These restrictions(whichare not rejected for this example by the Hausmanspecificationtest in Sec. 2) can improvethe forecastif true. As the tests in Section 2 indicate, this structurealso appearsto correspondto disturbancesthat are homoscedasticand serially uncorrelated.Moreover, althoughnot discussed in this articlefor purposesof brevity, analysisof the X, forecast is also an importantpotential output. The forecast process consideredhere consists of es- where fGLSis the original Prais-Winsten estimate. The sample can then be completed by using = &o+ &a1X Y t + t= t, 1,..., and forecast Xt for the target period T*: XT* = ZT fGLS (5) for a given set of values for ZT*(GLS representsgeneralized least squares). Note that T* is assumedto be far enoughin the futurethat the contributionof the last sample disturbance ATto the optimal forecast for T* is assumednegligible (which seems reasonablegiven the smallestimateof p andthe fact that the forecasthorizon is seven years). The forecastXT*is then used to predict YT*as YT* = [1 XT*]aOLS- (6) To bootstrap this process, take the Prais-Winsten estimators (p, fGLS) and construct B artificial samples. For each artificialsample i, this is done by drawingT times randomlywith replacementfrom the elements of I to create an artificial residual set 0'. This can be used to create an artificialfi by taking the first element of Af and dividingit by (1 - /2)1/2 to reversethe effect of the Prais-Winsten transformationon the first observation, thus yieldingAfi. The rest of the vector can then be constructedby the iterativeprocedure Ai = P;At-i + At, T, (9) where &Oand &aare the original OLS estimates and E are from artificialresidualssampled randomlyand same exogenous data Zt (t = 1, .. ., T), but each differingin the stochasticcomponent. The originalestimationprocesscan then be repeated on all of the artificialsamples. The forecastfor period T* can then be bootstrappedas the distributionsof XT* = ZT* flGLS, yi'. = [1 xT.]i, t = 2,..., T. (10) where /GLSand a' are the estimates on artificial sample i. But, because the distributionof the forecasterroris more interestingthan the distributionof the forecast, we also bootstrapwhat Freedmanand Peters (1984a,c) called "simulatedactuals."First, we randomlysample B more residuals independently from i = 1, ..., a E, call these 'iT- and Ei*, respectively, and each of and again transformthe former into fii- by dividing by (1 p2)1/2. Then the simulatedactualsare XiT = ZT* AGLS + iT*, Y'i. = [1 X'iT] + E'T*, (11) timating fJOLSand then using a first-order autoregression coefficient on the OLS residualsto estimate a f. This is then used in a Prais-Winstentransformto estimate iGLS (8) represent B artificialsamples, all consistent with the underlyingmodel assumedin (1) and (3), all havingthe At = P_t-1 + qt, qt iid, t = 1, . . ., T, rt Xt = ZtfGLS + pt, with replacement from e. The sets (Yi, Xi, Zt) then iid xt = z,t + P, where Et and The artificialsample is then developed as that is, the forecastvaluesplus a bootstrappedresidual. The simulatedforecasterrorsui andeiare the difference between (11) and the bootstrapprediction: UT* = XT* - ZT*1GLS, e.* Y'. - [1 XT.]&'; (12) therefore, the distributionof XT*and YT*,conditional on the forecast information,can be estimated as the empiricaldistributionof these forecasterrorscentered on the actual forecast (XT*, YT*). Before discussingthe results, it mightbe worthwhile to discuss the reasons for using bootstrappingrather than some analytic approach.The simplest answer is that there are no standardanalyticformulasavailable, even for large samples. For example, even if the disturbances in (4) are normal, the forecast error e,T will be a complexmixtureof a seriesof productsof normally distributedvariates. (This is complicatedfurtherwhen ZT*is allowed to be randomas in Sec. 4.) Recall also that XT* and YT*are logarithms, but what is of more interestis a forecast of these exponents, which are actual average and peak demand. If (XT*, YT*) are unbiased forecasts of (XT*, YT*), (exp(XT*), exp(YiT)) are not unbiased forecasts of (exp(XT.), exp(YTr)) [see Goldberger(1968) for a discussionof the parametric normalcase]. To deal with this, we alter the procedure 420 Joural of Business&Economic Statistics,July1987 1974-1983 trend is projected to 1990. Given that real pricesfell until 1973and then grewsteadilyafterwards, a similar1974-1983 trendwas projectedfor real price. More formally, the following OLS results were obtained for 1965-1983: slightlyby setting the forecast errorsas the difference in the exponentsof the simulatedactuals (11) and the exponentsof the bootstrappingforecasts(10) andcenter these on (exp(XTr), exp(YTr)). The result is, therefore, a bootstrapestimate of the probabilitydistributionof peak demandusing the linear as opposed to the logarithmicscale. A bootstrapestimate of the bias due to takingexponents is the mean of this probabilitydistribution estimate less the original forecast (exp(XT*), exp(YT.)), which is just the average of the forecast errorson the linear scale (see Efron 1982, p. 33). Although all of these considerationssuggest that there can be no realistic analytic alternative to the bootstrap,there is also some theoreticalsupportfor the effectivenessof the bootstrapin small samples(Beran 1982;Hall 1986). Moreover,limitedMonte Carloanalysis in related situations(Freedmanand Peters 1984b; Veall 1986, 1987b)suggeststhatthe performanceshould be acceptable. log(GPP,) = 6.785629 + 2.191371Dt + .043964T1l (.264790) (.190046) + .014966T2, + t,, (.002347) R2 = .9898, DW = 1.2259, log(pricet) = 1.012691 - 4.838032Dt - .015200Tt1 (.005647) (.389925) (.543280) + .047675T2 + t,, (.004816) R2 = .8806, DW = 1.8111, (14) where Dt is 0 for 1965-1973 and 1 thereafter, T1 = (1 - Dt) * time, and T2, = D * time,. 4.1 Forecasts of the Exogenous Variables To use the bootstraptechniquedescribedin Section 3, forecastsof the exogenousvariableswill be needed. One approachthat could be used is to take these forecasts from other sources. The referee correctlyemphasizes, however, the advantageof explicit models at all stages, and we follow this advice, althoughthe equations used are somewhatsimplistic. Therefore,we examinedthe time series of both GPP, and P,, as in Figures1 and 2. GPPtgrew fairlyrapidly up untilthe 1973oil shock, afterwhichgrowthwasmuch slower. GPP in 1982 was substantiallybelow even this slower trend with the 1983 value also below trend but by a smallermargin.A researcherexaminingthese data in 1984mightreasonablyhave adopteda modelin which the 1982-1983recessionwas transitorybut in whichthe lower growth since 1974 would continue. Hence the There is an indicationof serialcorrelationby the low Durbin-Watsonvalue for (13) that correspondsto an exact probabilityvalue of .003 for the null hypotheses of no serial correlationusing the Imhof procedure.It is, therefore, assumedthat /Vt= Pi t-1 + Ot with ct( iid. Using the same GLS procedure used to log(GPP,) = 6.711738 + 2.231869Dt (.229394) (.346317) + .045077Tt, + .015349T2t, (.003319) (.002884) p1 = .374714, R2 (on transformed variables) = .9992, DW (on transformedvariables) = 1.6770, (16) 4+ + + + + 4. + + 4+ + +** 0LL +- 0 o 4.* 9.8+ 4 * 4. U- - LII. 1 65 ? 70 (15) obtain (3), the resultsare 10.2 10.0 (13) and 4. RESULTS a. (.002752) 75 80 Figure 1. Plot of the Log of Gross Provincial Product, Quebec, 1965-1983. 85 Bernardand Veall:FutureDemandof HydroQuebec 421 0.1 u x -LJ I- + 0.0 - - 65 - + T t + + I 75 + I 85 80 + 0 -0.21 -i + -0.3 Figure2. Plot of the Real Average Price of Electricity,Quebec, 1965-1983. with almost the same coefficient estimates as (13). Equations(14) and (16) each pass at the 5% level the Breusch-Pagan(1979) and Engle (1982)/ARCH diagnostictests as used previouslyin (1) and (3). Of interest for Section 4.5, the null hypotheses of normalityalso cannot be rejected using the same tests as applied to (1) and (3). Using (16), the forecast for 1990 GPP (ignoringthe time serialcorrelationeffect fromthe 1983disturbance) is 30,484.9million1971dollars.Using (14), the forecast for real price is 1.59264 or about 60% above its 1971 level. [The regressionsand calculationswere all done on a VAX 11/780 in double precision.The readerwho checksthese forecastsusing (14) and (16) will note very small discrepanciesin the last digit.] 4.2 The Simple Boolstrap This will be the firstof three sets of resultspresented to illustratethe outputfromthis kindof approach.First, we try a simplerapproachthanthatdescribedin Section 3 and assume 1990 average demand will be with certainty 16,254 Mw, the value predictedby putting the preceding forecasts of real GPP and real price into Equation (1). (This is somewhathigher than the comparableforecastof 15,526Mw in Hydro Quebec 1985.) Under this assumptiononly Equation (1) needs to be bootstrapped;the resultingestimate of the probability distributionfor peak demand is graphedin Figure 3. The OLS forecast for peak demandis 28,834 Mw and the bootstrapestimate of the bias, as describedin Section 3, is only 15 Mw. (As the bias estimatesare always similarlysmall, they will not be reportedin the following.) The amount of capacityto be 95% sure that demand will be met is 29,954 Mw. In this simple case we can also easily calculate the 95% point analyticallyunder the assumptionof normalityusing the method of Salkever(1976). This value is 29,666 Mw, somewhat less than the nonparametric bootstrapvalue. 4.3 Recursive Boolstiap With Certain Exogenous Variables Here we proceedexactlyas in Section3, allowingfor uncertaintyin average demand by recursivelybootstrappingEquations(1) and (3) but assumingthat the 1990 real GPP and real price forecastsare knownwith certainty. This does not change forecast average demand or peak demand, but the added uncertaintyincreasesthe 95%point by over 1,500Mw to 31,556Mw, as in Figure 3. 4.4 Bootstiapping the Entire Model We now relaxthe unrealisticassumptionused in Section 3 and so far in Section4 that 1990priceand income are knownwith certainty.One methodwould be to use subjectiveprobabilitydistributionsfor these forecasts, but we instead bootstrap the entire model, including Equations (14) and (16). It should be noted that this extra layer of bootstrappingincreases the computational burdenconsiderably. The bootstrappingprocedureof Sections 3 and 4.3 is modified in four ways. First, bootstrapsamples are also created for log(Pi) and log(GPPi) in the usual way using (14) and (16), with the serial correlationfor the latter handled just as for log(AMW,) as described in (7) and (8). Second, in bootstrappinglog(AMW,) in (8), log(P,) and log(GPP,) in Z, are replaced by log(Pi) and log(GPPi), the correspondingbootstrap values. "Simulated actuals" for log(Pl99) and log(GPP1990)are then calculatedas the forecastsof those values plus bootstrapresidualsfrom these corresponding equations, and these values are put in ZiT along with a 1 (for the constant) and timeTr = timel990= 90. The bootstrap method then proceeds as before, with the thirdmodificationbeingthat for (11), the simulated actualsformulafor averagedemand,ZT. is replacedby ZIT..To emphasize,this means that each bootstraprun uses a differentvalue for the 1990exogenousvariables 422 Journalof Business&Economic Statistics,July1987 =A.&--------- 24000 26000 28000 30000 32000 I 34000 - nW n% II Figure3. Plots of BootstrapEstimatesof the ProbabilityDensityof 1990 Peak ElectricityDemand, Quebec. Theinnercurve is the estimate conditionalon an average demand of 16,254 Mw and employs Equation(1). The next curve is the recursivebootstrapestimate employing average demand Equation(3), as well as Equation(1), and is conditionalon estimates of GPP of 30,484.9 million1971 dollarsand a real price of 1.59264, where the 1971 real price is 1.0. The outermostcurve represents a recursive bootstrapdescribed in Section 4.4, where GPP and real price are not assumed exogenous but are also bootstrapped,this time fromtime trendregressions (14) and (16), which feed into (3), which in turnfeeds into (1). Densityestimates have been normalizedto approximatelythe same height with verticaldashed lines indicatingboundariesof 5% upper tails. to reflectthe uncertaintyof and log(GPP1990) log(P1990) these forecastsgiven currentinformation. There is one final modificationthat is not strictly necessary but that seems appropriate.As noted, the growthincreaseis much less smooth after 1973;this is reflectedby a sample standarddeviationof the (transformed) residualsof (16) of .0221 for 1974-1983 comparedwith only .0099for 1965-1973.Althoughbecause of the small sample this difference is not statistically significantusing a X2test, it still seems appropriateto constructartificialsamplesfor 1965-1973log(GPP) using only 1965-1973 residuals and for 1974-1983 log(GPP) using only 1974-1983 residuals.For the constructionof the simulatedactualsof log(GPP99o),only 1974-1983 residualsare used. All of these modificationsdo not change forecast peak, but they do increase the uncertainty,as shown in Figure3. The 95% point is 31,841 Mw. 4.5 Parametric and Smoothed Bootstraps For each of (1), (3), (14), and (16), tests on residuals have been unable to reject the null hypothesesof normality. Since these equationsare in logs, we might expect more skewness after taking exponents than is apparentin Figure 3, but this is not inconsistentgiven the relative tightnessof fit of all of the equationsand the lack of power of normalitytests in such small samples. But the small samples cause anotherproblemin that the tails of the probabilitydistributionsof disturbancesare likely to be poorlyestimatedby the frequency of residuals.In some situationsa researchermay feel thatit is worthwhileto sacrificethe nonparametriccharacter of the bootstrap because of this; therefore, we experimentwith a parametricbootstrapusing the normal distribution.This is exactlyas describedin the previous sections except for every instance in which a bootstrapresidual is drawn, instead a pseudorandom variateis generatedwith mean 0 and varianceequal to the sample variance of the correspondingresiduals. When this is applied to the "complete"bootstrapof Section 4.4 the tails do become more importantwith, for example, a 95% point of 32,139 Mw (not shown in Fig. 3). An intermediatepositionis the smoothedbootstrap (Efron 1982, p. 30), which instead uses a convolution of the empirical residual distributionand a constant s multipliedby the normal distributionwith the same variance,scaledby dividingby (1 + s2)1/2.We use s = .1 and find the results almost identicalto the nonsmoothedbootstrapwith, for example, a 95%point of 31,816 Mw. Bernardand Veall:FutureDemandof HydroQuebec TableA.1. ElectricityDemand and Its Determinants,Quebec, 1965-1983 Peak demand Average megawatts Real price GPP Time 6,450.0 7,050.0 7,800.0 8,000.0 8,650.0 9,300.0 9,450.0 10,250.0 11,450.0 11,950.0 13,350.0 14,800.0 15,800.0 17,050.0 17,600.0 19,400.0 19,700.0 18,400.0 19,800.0 4,224.0 4,589.0 4,954.0 5,214.0 5,639.0 6,027.0 6,141.6 6,626.0 7,123.0 7,900.0 8,105.0 9,073.0 9,452.0 10,217.0 10,434.0 11,282.0 11,256.0 11,119.0 11,416.0 1.01312 .94621 1.00383 1.00225 1.00774 .97959 1.00000 .89634 .84642 .77242 .79034 .76675 .83040 .90802 .97764 .99728 .99405 1.11771 1.14199 15,526.47794 16,301.18064 16,799.53669 17,421.83723 18,330.99493 18,869.11237 19,923.23555 21,023.36693 22,331.32845 23,593.38093 23,768.58888 24,738.42980 25,198.21432 25,756.75781 26,752.87497 26,942.16301 27,224.75647 25,873.76142 26,832.19945 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 NOTE: Peak demandand averagemegawattsare given in thousandsof megawattsand are fromHydroQuebec (1985). Priceis calculatedon average revenueper kilowatthour (source: HydroQuebec, "DemandReports,"1965-1983) dividedby the GPP deflator (source:ConferenceBoardof Canada)and normalizedso that the 1971 value = 1.00. GPP is gross provincialproductin millionsof 1971 dollars(source:ConferenceBoardof Canada). 5. CONCLUSIONS This article has illustrated the case of bootstrapping to estimate the probability distribution of future peak demand for Hydro Quebec, conditional on current information. The results illustrate that reasonable estimates of demand uncertainty are typically very large and hence such measures should likely play an important role in the planning process. If anything, these large uncertainty estimates from the nonparametric bootstrap are likely to be conservative because, as shown, parametric bootstrap estimates are even larger. Moreover, we have largely employed deterministic time trends as opposed to more volatile autoregressive integrated moving average processes, and it has been assumed throughout that there will be no structural shifts before the target period of the forecast. ACKNOWLEDGMENTS Thanks are due to W. Cheng, J.-F. Dion, D. Fretz, and G. Green for research assistance and to B. Efron, R. Ellis, I. McLeod, R. Tibshirani, an associate editor, and an anonymous referee for valuable advice. Related work was presented by Veall at the World Congress of the Econometric Society, Cambridge, Massachusetts, in August 1985. The assistance of Formation des Chercheurs et Aide a la Recherche, the Department of Education, the Government of the Province of Quebec, the Centre for the Study of International Economic Relations at the University of Western Ontario, and the Social Sciences and Humanities Research Council of Canada is acknowledged. 423 APPENDIX: THE DATA When needed in the bootstrapping procedure, random numbers are generated using the routine of Wickman and Hill (1982). When a normal random number generator is needed, this routine is supplemented with that of Beasley and Springer (1977). The observed data are presented in Table A.1. [ReceivedOctober1985. RevisedJuly 1986.] REFERENCES Beasley, J. D., and Springer,S. G. (1977), "The PercentagePoints of the NormalDistribution,"AppliedStatistics,26, 118-121. Beran,R. (1982), "EstimatedSamplingDistributions:The Bootstrap and Competitors,"TheAnnals of Statistics,10, 212-225. Breusch,T. S., and Pagan, A. R. (1979), "A SimpleTest for HeteroskedasticityandRandomCoefficientVariation,"Econometrica, 47, 1287-1294. Cochrane, D., and Orcutt, G. H. (1949), "Applicationof Least Squares to Relationships Containing Autocorrelated Error Terms,"Journalof the AmericanStatisticalAssociation,44, 3261. Efron, B. (1979), "BootstrappingMethods:Another Look at the Jackknife,"TheAnnals of Statistics,7, 1-26. (1982), TheJackknife,The Bootstrapand OtherResampling Plans, Philadelphia:Society for Industrialand Applied Mathematics. Efron,B., andGong, G. (1983),"A LeisurelyLookat the Bootstrap, the Jackknife,and Cross-Validation,"The AmericanStatistican, 37, 36-48. Ellis, R. (1980), "OptimalPricingandInvestmentDecisionsby ElectricUtilitiesUnderDemandUncertainty,"WorkingPaper80-048, MassachusettsInstituteof Technology,EnergyLaboratory. Engle, R. F. (1982), "AutoregressiveConditionalHeteroskedasticity With Estimatesof the Varianceof United KingdomInflations," Econometrica,50, 987-1008. Freedman,D. A., and Peters, S. C. (1984a), "Usingthe Bootstrap to Evaluate ForecastingEquations,"TechnicalReport 20, Universityof California,Berkeley, Dept. of Statistics. (1984b), "Bootstrappinga RegressionEquation:Some Empirical Results," Journalof the AmericanStatisticalAssociation, 79, 97-106. (1984c), "Bootstrappingan EconometricModel: Some EmpiricalResults," Journalof Businessand EconomicStatistics,2, 150-158. Goldberger,A. S. (1968), "The Interpretationand Estimationof Cobb-DouglasFunctions,"Econometrica,36, 464. Hall, P. (1986), "On the Bootstrapand ConfidenceIntervals,"The Annalsof Statistics,14, 1431-1452. Hausman,J. A. (1978),"SpecificationTestsin Econometrics,"Econometrica,46, 1251-1271. HydroQuebec (1985), Demandede l'Electricittau Quebec,Horizon 2001, Montreal:Author. Imhof,J. P. (1961), "Computingthe Distributionof QuadraticForms in NormalVariables,"Biometrika,48, 419-426. Johnston, J. (1984), EconometricMethods(2nd ed.), New York: McGraw-Hill. Judge,G. G., Griffiths,W. E., Hill, R. C., Liitkepohl,H., and Lee, T.-C. (1985), The Theoryand Practiceof Econometrics(2nd ed.), New York:John Wiley. Prais,G. J., andWinsten,C. B. (1954),"TrendEstimatorsandSerial Correlation,"DiscussionPaper383, CowlesCommission,Chicago. Salkever,D. S. (1976), "The Use of DummyVariablesto Compute Predictions,PredictionErrorsand ConfidenceIntervals,"Journal of Econometrics,4, 393-397. 424 Journalof Business & EconomicStatistics,July 1987 Shapiro,S. S., andFrancia,R. S. (1972),"An ApproximateAnalysis of VarianceTestfor Normality,"Journalof theAmericanStatistical Association,67, 215-216. Shapiro,S. S., and Wilk, M. B. (1965), "An Analysisof Variance TestforNormality(CompleteSamples),"Biometrika,52,591-611. Veall, M. R. (1983), "IndustrialElectricityDemand and the HopkinsonRate: An Applicationof the ExtremeValueDistribution," Bell Journalof Economics,14, 427-440. (1986),"Bootstrapping RegressionModelsUnderFirst-Order SerialCorrelation,"EconomicsLetters,21, 41-44. (1987a), "Bootstrappingthe ProbabilityDistributionof Peak EconomicReview,28,203-212. ElectricityDemand,"International (1987b), "BootstrappingForecast Uncertainty:A Monte Carlo Analysis," in Time Series in EconometricModelling,eds. J. B. MacNeiland G. J. Umphrey,Dordrecht,the Netherlands: D. Reidel, pp. 373-384. White, H., and MacDonald,G. M. (1980), "Some Large-Sample Tests for Nonnormalityin the KeynesianModel,"Journalof the AmericanStatisticalAssociation,75, 16-28. Wickman,B. A., and Hill, I. D. (1982), "An Efficientand Portable Pseudo-RandomNumberGenerator,"AppliedStatistics,31, 188190.
© Copyright 2026 Paperzz