DETERMINANTS OF DIFFUSION MODELS` FORECAST

DETERMINANTS OF DIFFUSION MODELS’ FORECAST ACCURACY
This paper investigates factors that affect the accuracy of forecasts generated by diffusion
models. The analysis is performed using long-enough simulated series based on 926 real new
product diffusion datasets, together with a collection of diffusion models with good forecasting
performance. The magnitude of the forecast error is modelled as function of early diffusion
summary data, structural characteristics of the diffusion models, and estimation performance
measures. Information on these factors is readily available to the forecaster prior to the generation
of the forecasts. The method is capable to explain part of the variation in forecast error, and can
be used as an aid to diffusion model selection for new product forecasting.
Keywords: Diffusion models, Technology forecasting, Forecast evaluation
Track: Modelling and Forecasting
1. Introduction
Diffusion models are used to model the temporal evolution of new product first-purchases (e.g.
Mahajan, Muller, and Wind, 2000) and to forecast the development of the diffusion process (e.g.
Parker, 1994). A large number of alternative models are available to the marketing researcher.
However little is known about their forecasting performance and the factors that affect the
magnitude of their forecast errors. This lack of knowledge makes model selection difficult when
facing a diffusion-forecasting problem. Fildes and Kumar (2002) and Armstrong (2005) are some
of the authors who point to the limited research effort in assessing and improving forecast
accuracy of diffusion models. Important exceptions are the studies of Meade and Islam (1995a,
1998) and of Islam, Fiebig, and Meade (2002). These authors have analyzed the performance of
diffusion models in telecom diffusion datasets and discussed model selection and forecast
combination. However they utilized few actual diffusion series (typically 50 or less) and did not
systematically investigate the possible sources of forecast error and the magnitude of their effect.
Of major interest to the diffusion modeler is the question of how to assess the likely forecast error
from a diffusion model utilizing the information contained in a single diffusion data series of
usually limited length. One obvious way to answer this question is by providing forecast
confidence intervals to accompany the generated point forecasts. However there are problems
associated with estimating prediction intervals for non-linear models (see for example Chatfield
1993, Meade and Islam 1995b). Most diffusion models are non-linear. One serious problem is
that prediction intervals are correct only when the diffusion model is more or less correctly
identified for the particular data series. The data generating process (DGP) for a diffusion series
is not known (and even may change over time), hence there is no guarantee about the quality of
the generated intervals. Selecting an appropriate model as an approximation to the real DGP is a
very important step in generating reliable forecasts and associated prediction intervals.
To facilitate model selection, this paper analyses measurable factors that affect the long-range
forecast error from diffusion models. The analysis is performed using long-enough simulated
series based on 926 real new product diffusion datasets, together with a collection of diffusion
models with good forecasting performance. The magnitude of the expected forecast error is
modelled as function of early diffusion summary data, structural characteristics of the employed
diffusion models, and estimation performance measures. The forecaster can use the expected
forecast error and the associated prediction interval as an indicator of model appropriateness.
Thus, the method can be used as an aid to diffusion model selection for new product forecasting.
In section 2 we present the framework for the analysis and the data. Section 3 describes the
regression model for the forecast error and the explanatory variables. In section 4 we discuss
model estimation results and the predictive validity of the model in a test dataset. Finally, we
conclude with a summary of the results and directions for future research.
2. The Analysis Framework - Data and Models
The available data cover a wide range of diffusion realizations. They are 926 annual
multinational time series (for a total of about 50 countries) for the penetration Ft ∈ (0,1] of
household appliances, home electronics, and telecommunication equipment introduced after year
1950 with 7 or more observations. 496 series have length of 8 or more and only 73 with 20 or
more observations. Given that a number of observations are reserved for model estimation, it is
clear that more series are required for detailed medium to long-term forecast error computations
and comparisons. For this reason simulated data are generated.
The simulation strategy is the following: (a) For each diffusion model included in the analysis
parameters and associated error variances are estimated in all 926 series using all available
observations. Estimation is via NLS. Arguments for using NLS estimation can be found in Meade
and Islam (1998). (b) 250 pseudo-random draws per model were obtained from the empirical
multivariate distribution of estimates and the observed minimum penetration to generate (i)
model parameters, (ii) error variance, and (iii) minimum observed penetration. (c) Finally, the
draws were used to generate artificial series of 100 observations.
This process was repeated for the 20 diffusion models listed in Table 1 at the Appendix. In such a
way about 5000 artificial diffusion series were generated. For all series every model was fitted in
estimation windows including the first 7,8, … to 25 observations respectively. Then forecasts for
up to 10 periods ahead were made for the corresponding 19 forecast origins and several forecast
accuracy measures were recorded. Forecast performance comparison is performed across all
origins and data series. The details are lengthy and are presented elsewhere. Based on these
comparisons, and in order to keep the forecast error analysis task to a manageable size, only the 5
best performing models were selected for further analysis. These are the Extended Riccati, the
Extended Logistic - which gives the cumulative penetration of the Bass (1969) model as a
function of time-, Flexible Logistic, Autoregressive Gompertz, and the Mean Field model. All
models are 4-parametric with the 4th parameter being the saturation level.
3. Modeling Forecast Error
In the diffusion forecasting literature several factors related to forecast accuracy have attracted
some - but not systematic - attention (e.g. Meade 1984; Young and Ord 1989; Young 1993;
Meade and Islam 1995a, 1998). These factors pertain to the ability of a model and the estimation
procedure to properly capture the characteristics of a diffusion process and can be classified into
three broad categories: factors related to (a) model misspecification, (b) model fit, and (c) quality
of estimates. We model forecast error as a function of these factors. In addition, we explore
possible relationships between the descriptive statistical characteristics of an observed series and
the forecasting accuracy of a diffusion model. The forecast error measure we adopt is the 10steps ahead mean absolute percentage error (10-MAPE). MAPE is the most commonly used
measure in the relevant literature. It is scale-free and allows for forecast comparisons in different
series with different diffusion levels. The forecast origin is fixed to the 10th period for all
simulated series. These choices are made because: (a) in most real-life situations long-range
forecasts are required, and (b) ten observations allow for parameter estimation and measurement
of the explanatory variables that are described in detail below.
Kolmogorov-Smirnov tests show that the natural logarithmic transformation of 10-MAPE is
approximately normal for all five models. Hence, it can be modeled as a linear in the parameters
(m )
function with a model specific normally distributed residual ei
y i( m ) = ∑ ak( m ) f k ( xi(,mk ) ) + ei
( m)
,
ei
(m)
(
)
~ N 0, σ m ,
2
(1)
k
where yi(m ) is the logarithm of 10-MAPE for diffusion model m for forecast-error observation i,
(m )
and x i is a K × 1 vector of characteristics related to model m and the data series. f k is generally
a smooth non-linear transformation function that can be determined via a linearization procedure,
for example a Box-Cox transformation or the alternating conditional expectation (ACE)
transformation method of Breiman and Friedman (1985). The explanatory variables and their
measurement are discussed below.
Model misspecification and fit
Factors related to model misspecification that may influence forecast error include
heteroscedasticity, serial correlation, and temporal model stability. The presence of
heteroscedastic and serially correlated residuals is tested using the White (1980) and Durbin
(1970) tests respectively. Results are recorded using the indicator variables HET and SER; they
assume the value 1 when the corresponding test rejects the null hypothesis. The 5% significance
level is the decision boundary in all tests we perform. Model instability over time has been
observed in many diffusion modeling occasions. We adopt the approach of Meade and Islam
(1998) who employ a test suggested by Harvey and Collier (1977). The test examines one-step
ahead forecast errors in recursive model fits within the estimation window with respect to their
estimated variance. If the null hypothesis of correct model specification is rejected, the indicator
variable STAB takes the value 1. Of particular interest is the temporal consistency of the
estimated saturation level that is the upper bound for the model forecasts. We examine the
presence of a linear trend in the recursively estimated parameter and whenever the absence of a
trend cannot be rejected the dummy SAT takes value 1. Model fit, measured by R2, is an indicator
of model adequacy in describing the observed diffusion data. A poor model fit is expected to lead
to poor forecasts. We use the logarithmic transformation of R2, ln( R2 ), coded as LRSQ.
Estimation quality
Estimation quality is judged by the statistical significance of estimated parameters and the
theoretical plausibility of their estimated values. Insignificant estimates render the model a poor
descriptor of the series and the generated forecasts are unreliable and useless (associated forecast
intervals would be very wide). At each model estimation instance we record the number of
significant parameters (SIGPAR) and the values of the corresponding t-statistics (denoted as T1,
T2, T3, and T4). Theoretically implausible estimates (e.g. wrong signs) indicate that the model’s
theoretical foundation is not adequate for the process realized in a particular series (Meade and
Islam, 1995a). In our optimization approach we constrain parameters to take values within their
theoretical bounds in order to achieve meaningful forecasts and thus increase the power of
forecast comparisons. However, we count the total number of model parameters held at a bound
at an estimation instance with the variable NBOUND. A parameter estimate at a bound may
indicate redundancy, a possible reversion to a nested model, and/or inadequacy of the model to
properly describe the process.
Data series characteristics
If relationships between descriptive characteristics of a series and forecasting accuracy of a
model exist, they can be attributed to model identification: high accuracy of forecasts when a
series exhibits certain statistical properties implies that the model is more likely to be correctly
identified. Since little is known about the relationship of a series characteristics and model
identification, we adopt a “data mining” approach by trying out a number of measures such as:
the value of the 1st observation in a series (MINPEN), the average observed growth rate (i.e. the
average of the differenced series - AVDIFF), the average of the twice differenced series
(AVDIFF2), higher moments of the data such as variance, skewness and kurtosis (STDEV,
SKEW, KURT). To capture more detailed higher order characteristics of the series and to
identify natural groupings that could be related to forecast error, we classify them to three sets of
clusters using (a) the original series (the first 10 observations), (b) the differenced series, and (c)
the twice-differenced series. The clustering algorithm led to the identification of 2, 3, and 2
corresponding clusters. The indicator variables CLUS1, CLUS2, and CLUS3 respectively record
the membership of a series to the developed clusters.
4. Results
The analysis focuses on forecasts with origin the 10th observation. A total of 4304 simulated data
series for which the estimation algorithm converged and produced proper forecasts and
parameters’ standard errors for each of the five models were employed. Of those, 3292 (76%)
were used to estimate model (1), while 1012 observations (24%) were reserved for validating
predictions. Table 2 summarizes 10-MAPE from each of the 5 diffusion models. Initially we
performed least squares (LS) estimation of regression (1) for each diffusion model. The variance
minimizing alternating conditional expectation (ACE) transformation method of Breiman and
Friedman (1985) improved greatly the LS fits (achieved R2 ranging from 0.48 to 0.54) and
reduced the residual standard errors considerably. The residual standard error of the regression is
important as it determines the width of prediction intervals for the forecast error. Due to the
presence of influential observations that bias LS parameter estimates and affect the estimate of
the residual standard error, we estimated the untransformed versions of (1) with robust MMtechniques (e.g. Yohai, Stahel and Zamar 1991, Pena and Yohai 1999). The method de-biases
parameter estimates and in our case reduced further the standard error of the regressions. Thus,
predictions from (1) are less biased and more accurate. Robust estimation results, fit statistics and
prediction intervals for each of the 5 regressions are given in Table 3. Below we discuss the
estimated effects of the explanatory variables on forecast error and we present prediction results.
Model misspecification and fit
Other things being equal, the presence of heteroscedastic residuals (HET) has only a marginally
significant effect on the forecast error from the Extended Ricatti and the Extended Logistic
models. Serially correlated residuals (SER) and model stability (STAB) do not appear
informative. The presence of a linear trend in the saturation level estimate (SAT) has a significant
impact on the forecast error only for the Mean Field model; it is strongly associated with large
errors. As expected, better model fit is significantly (LRSQ) associated with smaller forecast
errors for all models.
Estimation quality
The number of significant parameters (SIGPAR) together with the value of the corresponding tstatistics (T1, T2, T3 and T4) has a significant effect on forecast error. Their net effect is
significantly negative for all models. However, due to correlations between them (not very high
though), their partial effect in the regression model may change sign and sometimes loose its
significance. A highly significant estimate of the saturation level (T4) is strongly related to higher
forecast accuracy for all five models. The number of parameters in bounds (NBOUND) is
positively associated with forecast error.
Data series characteristics
The forecast error increases significantly when the 1st observation of the penetration level
(MINPEN) in a series decreases. For 4 models (except of the Mean Field) the error is a linearly
decreasing function of the average diffusion rate (AVDIFF) in the observed series when
controlling for the effects of other variables. Combining this effect with the strong negative effect
of the standard deviation (STDEV) of the observed series we can say that the more spread-out the
observed data the better is the accuracy of the forecasts. The effect of the average rate of change
of the diffusion rate (AVDIFF2) is less important. The skewness (SKEW) has a significant and
strong effect: strong positive skew is related to high forecast errors. Kurtosis (KURT) appears
unrelated to error. Membership of a series to clusters developed using the first and the second
order differences (CLUS2 and CLUS3 respectively) has a significant impact on the forecast
accuracy of all diffusion models.
Prediction
For each diffusion model, the estimated regression (1) was applied to the test sample to provide
predictions of the forecast error. Prediction intervals were calculated using the fit statistics of
Table 3. The calculated 90% prediction intervals for the forecast errors in the test sample were
proven to be accurate, i.e. contain the actual error approximately in the 90% of the cases,
indicating the statistical validity of the prediction procedure. Figure 1 gives an example of the
predicted (a) log(10-MAPE) and (b) 10-MAPE in the test sample for the Extended Logistic
model. The average across models median width of the 90% prediction interval is quite wide
(20.32% in the MAPE scale). This is rather normal for a highly uncertain process such as
diffusion, particularly for long-range predictions made at relatively early stages (times) as in the
present analysis. Intervals are expected to have smaller width for shorter-range forecasts and
predictions made at later stages of the product life cycle.
5. Summary of the findings and concluding comments
The main findings from the robust regression fit and analysis of variance for the forecast error
from the 5 diffusion models are summarized as follows: When controlling for the effects for other
variables: (a) Model fit is the single most important factor with a negative impact on forecast
error. (b) Quality of parameter estimates is crucial for accurate forecasts. The significance of the
saturation level estimate is the second most important predictor of forecast error. The number of
parameter estimates at the optimization bounds is positively associated with forecast error. (c)
Data series characteristics are strongly related to forecast error. The penetration level at the 1st
observation has the third most important (negative) effect. Data skewness is the fourth most
important predictor, with a strong positive effect. The typical spread of the observed data has a
significant negative impact on forecast error. Higher order characteristics of the data are
informative about the accuracy of predictions. (d) Model misspecification appears to have little
impact on forecast error.
The exploratory approach adopted in this paper has identified determinants of the forecast
accuracy of diffusion models and quantified their effect. It provides an additional input to the
forecaster’s toolset for assessing the likely forecast error and for choosing among diffusion
models. The modeling effort has to be extended further to investigate more possible sources of
error in order to achieve more accurate predictions. It must also be extensively validated for more
forecast origins and lead times, not only on artificially generated data but also on a sufficient
number of long-enough real diffusion series. This requires updating the available series with the
latest released observations and collection of more datasets. Under current investigation are
complementary probabilistic modeling strategies for model selection and diffusion forecast
combination.
Appendix: Tables and Figures
Table 1: Diffusion models in forecast comparison
Model
Abbreviation
1. Bass – Skiadas
2. Cumulative log-normal
CloNo2
3. Extended Riccati
ExRic
4. Extended Logistic
ExloB
5. Floyd
Flo
6. Flexible Logistic
FLOG
7. Jeuland
Jeul
8. Gompertz
Gomp
9. Gompertz Autoregressive
GompAR
10. Harvey
Harv
11. Kumar & Kumar
KK3
12. Local Logistic
LoLog
13. Mean Field
MF
14. Mansfield
Man
15. NSRL
NSRL
16. NUI
Nui
17. Sharif – Kabir
Shk
18. Simple Logistic
Slog
19. Simple Logistic Autoregressive SlogAR
20. Weibull
Weib
(1)
S=Symmetric, NS=Non-Symmetric, F=Flexible
Reference
Skiadas (1986)
Bain (1963)
Kendall et al. (1983)
Meade (1998)
Floyd (1968), Mahajan et al (1993)
Bewley & Fiebig (1988)
Jeuland (1981)
Gompertz (1825), Stone (1980)
Meade and Islam (1995a)
Harvey (1984)
Kumar & Kumar (1992)
Meade (1988)
Emmanouilides (1997)
Mahajan et al. (1993)
Easingwood et al. (1981)
Easingwood et al. (1983)
Mahajan et al. (1993)
Verhulst (1838), Stone (1980)
Meade & Islam (1995a)
Sharif & Islam (1980)
(2)
Including the saturation level
Class(1)
F
F
F
F
NS
F
F
NS
NS
F
F
S
F
S
F
F
F
S
S
F
Number of
Parameters(2)
3
3
4
4
2
4
4
3
4
4
3
2
4
2
3
4
3
3
4
3
Table 2: 10-period lead time MAPE Statistics by Model (1) – Forecast origin is 10th observation
MAPE Statistics
Geometric
1st Quartile
Median
3rd Quartile
Mean
Model
Mean
Extended Riccati
3.65
7.69
16.67
7.78
13.55
Extended Logistic
3.44
7.12
16.13
7.21
12.58
Flexible Logistic
3.63
7.88
17.41
7.75
13.59
AR Gompertz
3.70
7.64
15.83
7.55
12.61
Mean Field
3.37
6.99
15.07
6.94
11.76
(1)
Pooled estimation and test samples of 4304 total observations per model
Standard
deviation
15.87
14.38
15.29
14.05
13.47
Table 3: Estimation results, fit statistics, and prediction intervals for the ln(10-MAPE) robust regressions(1)
Model
Extended Riccati Extended Logistic Flexible Logistic
AR Gompertz
Variable
Mean Field
(INTERCEPT)
0.46
-1.53 ****
1.99 ****
0.35
-1.85 **
Specification and fit
HET
-0.14
-0.14 **
0.13 *
-0.07
-0.05
SER
-0.04
-0.09
0.00
0.04
-0.06
STAB
-0.09
-0.09
0.62
0.27
SAT
0.08
-0.03
-0.11
-0.07
0.25 ****
LRSQ
-0.26 ****
-0.25 ****
-0.09 ****
-0.16 ****
-0.17 ****
Estimation quality
SIGPAR
-0.54 ****
-0.02
0.50 ****
-0.10
0.18 *
(T1)0.4
0.18
0.17 ***
0.43 ****
0.17 ****
0.12 ****
(T2)0.5
-0.18
0.39 ****
-0.56 ****
0.01
-0.05
T3
0.11 ***
-0.01
0.03 ***
0.00
0.01
0.4
(T4)
-0.17 ****
-0.35 ****
-0.37 ****
-0.25 ****
-0.31 ****
NBOUND
0.28 ***
0.33 ****
0.05 *
0.08 **
0.86 *
Data series characteristics
MINPEN
-3.43 ****
-4.58 ****
-2.01 ****
-5.55 ****
-6.05 ****
AVDIFF
-15.60 ***
-22.83 ****
-20.15 ***
-17.39 ****
-8.70
AVDIFF2
-3.40
-9.91 **
8.30 *
-3.70
-7.31 *
(STDEV)0.1
-3.29 ****
-1.13 **
-5.79 ****
-3.00 ****
-1.92 *
SKEW
0.46 ****
0.21 ****
0.53 ****
0.34 ****
0.29 ****
KURT
-0.01
0.01
-0.01
0.01
-0.01
CLUS1 (2)
0.03
0.10
0.04
0.09
0.04
CLUS2 (2)
0.26 ****
0.19 ***
0.14 **
0.24 ****
0.24 ****
CLUS2 (3)
0.18 ***
0.14 ***
0.21 ****
0.14 ***
0.13 **
CLUS3 (2)
0.14 **
0.10 **
0.25 ****
0.14 ***
0.14 ***
Fit statistics and prediction intervals
R2
0.38
0.41
0.41
0.44
0.39
Residual S.E.
0.70
0.62
0.67
0.70
0.71
Half width of 90% PI (log scale)
1.15
1.02
1.10
1.15
1.17
Upper 90% PI (original scale)
3.16 x fitted MAPE 2.77 x fitted MAPE 3.00 x fitted MAPE 3.16 x fitted MAPE 3.21 x fitted MAPE
Lower 90% PI (original scale)
0.32 x fitted MAPE 0.36 x fitted MAPE 0.33 x fitted MAPE 0.32 x fitted MAPE 0.31 x fitted MAPE
Median 90% PI for 10-step MAPE
(2.5, 24.2)
(2.6, 19.7)
(2.6, 23.6)
(2.5, 24.1)
(2.2, 22.4)
(1)
Estimation sample of 3292 observations per model. Significance levels for the two-sided t-test: (*) 10%, (**) 5%, (***) 1%, (****) 0.1%
(a)
(b)
Figure 1: Predicted 10 step-ahead (a) ln(MAPE) and (b) MAPE in the test sample for the Extended Logistic model. Predictions (black
line) are displayed in ascending order to facilitate exposition, together with the corresponding 90% prediction intervals (red and green
lines). Actual forecast errors are displayed as points.
References
Armstrong, J. S. (2005). Findings from evidence-based forecasting methods for reducing forecast
error. Available at http://www-marketing.wharton.upenn.edu/forecast/journals.html
Bain, A.D. (1963). Demand for new commodities. J. Royal Statistical Society, Series A, 126,
285-299.
Bass, F.Μ. (1969). Α new product growth model for consumer durables. Manαgement Science,
15, 215-227
Bewley, R., and Fiebig, D. (1988). Flexible logistic growth model with applications in
telecommunications. International Journal of Forecasting, 4, 177-192.
Breiman, L., and Friedman, J. H. (1985). Estimating optimal transformations for multiple
regression and correlation. (with discussion). Journal of the American Statistical Association, 80,
580-619.
Chatfield, C. (1993). Calculating interval forecasts. Journal of Business and Economic Statistics,
11, 121-135.
Durbin, J. (1970). Testing for serial correlation in least-squares regression when some of the
regressors are lagged dependent variables. Econometrica, 38, 410-421.
Easingwood, C., Mahajan, V., and Muller E. (1981). Α non-symmetric responding logistic model
for forecasting technologica1 substitution. Technological Forecasting and Social Change, 20,
199-213.
Emmanouilides, C.J. (1997). A Mean Field model with social interactions and network effects in
technology diffusion. Proceedings of the 26th EMAC Conference, Warwick Business School, 2023 May 1997, pp. 416-436.
Fildes, R., and Kumar, V. (2002). Telecommunications demand forecasting – a review.
International Journal of Forecasting, 18, 489-522..
Floyd, A. (1968). Trend forecasting: A methodology for figure of merit. In Technological
forecasting for industry and government: Methods and applications, Bright, J. (ed.), Prentice
Hall: Edgewood Cliffs, NJ.
Harvey, A. C., and Collier, P. (1977). Testing for functional misspecification in regression
analysis. Journal of Econometrics, 6, 103-119.
Harvey, Α. C. (1984). Time series forecasting based οn the logistic curve. J. Oper. Res. Society,
35, 641-646.
Islam, T., Fiebig D. G., and Meade, N. (2002). Modeling multinational telecommunications
demand with limited data. International Journal of Forecasting, 18, 605-624.
Jeuland, Α. Ρ. (1981). Parsimonious models οf diffusion οf innovations: part Α, derivatiοns and
comparisons. Working Paper, Graduate School οf Business, University οf Chicago, Chicago, IL.
Kendall, Μ. G., Stuart A., and Ord, J. Κ. (1983). The αdvanced theoτy of statistics, 3 (4th ed).
Macmillan: New York.
Kumar, U., and Kumar V. (1992). Technologica1 innovation diffusion: the proliferation οf
substitution models and easing the user dilemma. ΙΕΕΕ Trans. on Engineering Manαgement; 39,
158-168.
Mahajan, V., Muller, E., and Bass, F.M. (1993). New Product Diffusion Models. In Marketing,
Eliashberg, J., and Lilien, G.L. (eds), 349-407. Handbooks in Operations Research and
Manufacturing Science, Vol. 5, Elsevier: Amsterdam.
Mahajan,V., Muller, E., and Wind, Y. (2000). New-Product Diffusion Models. International
Series in Quantitative Marketing, Kluwer Academic Publishers: Massachusetts.
Meade, N. (1984). The use of growth curves in forecasting market development – a review and
appraisal. Journal of Forecasting, 3, 429-451.
Meade, N., and Islam, T. (1995a). Forecasting with growth curves: An empirical comparison.
International Journal of Forecasting, 11, 199-215.
Meade, N., and Islam, T. (1995b). Prediction intervals for growth curve forecasts. Journal of
Forecasting, 14, 413-430.
Meade, N., and Islam, T. (1998). Technological forecasting – model selection, model stability,
and combining models. Management Science, 44(8), 1115-1130.
Parker, P. M. (1994). Aggregate diffusion forecasting models in marketing: A critical review.
International Journal of Forecasting, 10, 353-380.
Pena, D., and Yohai, V. (1999). A fast procedure for outlier diagnostics in large regression
problems, Journal of the American Statistical Association, 94, 434-445.
Sharif, N., and Islam, M. (1980). The Weibull distribution as a genera1 model for forecasting
technological change. Technological Forecasting and Sociαl Chαnge, 18, 247-256.
Skiadas, C. (1986). Innovation diffusion models expressing asymmetry and/or positive1y or
negatively influencing forces. Technological Forecasting and Sociαl Chαnge, 30, 313-330.
Stone, R. (1978). Sigmoids. Biαs, 7, 59-119.
White, H. (1980). A heteroskedasticity-consistent covariance matrix estimator and a direct test of
heteroskedasticity. Econometrica, 48, 817-838.
Young, P. (1993). Technological growth curves: A competition of forecasting methods.
Technological Forecasting and Social Change, 44, 375-389.
Young, P., and Ord, J. K. (1989). Model selection and estimation for technological growth
curves. International Journal of Forecasting, 5, 501-513.
Yohai, V., Stahel, W. A., and Zamar, R. H. (1991). A Procedure for Robust Estimation and
Inference in Linear Regression. In Directions in Robust Statistics and Diagnostics, Part II, W. A.
Stahel & S. W. Weisberg (Eds.). New York: Springer-Verlag.