Stefan Seifert & Maria Nieswand
Operational Conditions
in Regulatory Benchmarking –
A Monte-Carlo Simulation
Workshop: Benchmarking of Public Utilities
November 13, 2015, Bremen
Agenda
1
2
3
4
5
6
2
Motivation and Literature
Methodologies
The DGP
Simulation Design and Performance Measures
Initial Results
Conclusion and Outlook
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
1
Motivation
Regulatory Approaches for Electricity DSOs
Source: Agrell & Bogetoft, 2013
3
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
1
Motivation
• Benchmarking widely used in regulation – sectors in which environmental factors play
an important role
• Accuracy of estimates influences revenue caps, industry performance, firm survival,
and ultimately customers via prices
• Methodological advances to account for environmental factors and heterogeneity
• Non-parametric approaches: z-variables in 1-stage DEA (Johnson and Kuosmanen,
2012), conditional DEA (Daraio & Simar, 2005 & 2007), …
• Parametric approaches: Latent Class (Greene, 2002; Orea & Kumbhakar, 2004),
Zero-inefficiency SF (Kumbhakar et al., 2013), …
• Semi-parametric approaches: StoNEzD (Johnson & Kuosmanen, 2011), …
• BUT: Regulatory models typically based on standard DEA or SFA
4
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
1
Motivation
Aim of this study
• Systematical performance evaluation of Latent Class, StoNEzD and conditional DEA in
the presence of environmental factors
• Generalization of results via Monte-Carlo-Simulation
Guidance for regulators to choose estimators given industry
structure and industry characteristics
Scope of this study
• Consideration of different model set-ups imitating real regulatory data
Cross section with variation in sample sizes, noise and inefficiency distributions
and in terms of the true underlying technology
Consideration of different cases of impact of environmental variables
5
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
1
Related Literature
Monte Carlo Simulation Studies
• Basic MC evidence in original research papers
• Andor & Hesse (2014): StoNED vs SFA vs DEA
• Henningsen, Henningsen & Jensen (2014): multi-output SFA
• Krüger (2012): order-m vs order-𝛼 vs DEA
• Badunenko, Henderson & Kumbhakar (2012): KSW bootstrapped DEA vs FLW
• Badunenko & Kumbhakar (forthcoming): persistent and transient ineff. SFA
Few studies focusing on environmental variables
• Cordero, Pedraja & Santin (2009) – z-variables in DEA
• Yu (1998) – z-variables in DEA and SFA
6
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
2
Methodologies
7
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
2
Methodology – Notation
• Production function 𝑦𝑖 = 𝑓 𝑥𝑖 ∗ exp(𝛿𝑧)
• 𝑁 observations, 𝑖 = 1, … , N
1
• Input 𝑥𝑖 , … , 𝑥𝑖
𝑚
∈ ℝ 𝑚 to produce output 𝑦𝑖 ∈ ℝ
• Deviation from the frontier 𝜀 = 𝑣 − 𝑢
• 𝑢~𝑁 + 0, 𝜎𝑢
2
, 𝑣~𝑁 0, 𝜎𝑣
2
• Expected inefficiency 𝜇
• Environmental factors
1
• Vector of environmental factors 𝑧𝑖 , … , 𝑧𝑖
𝛿1, … , 𝛿 𝑞 )
8
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
𝑞
∈ ℝ 𝑞 with impact (𝛿 =
2
Methodology – conditional DEA
• DEA with firm specific reference sets (Daraio & Simar, 2005, 2007)
depending on realization of 𝑧 s.t. 𝜃 𝐷𝐸𝐴 = sup 𝜃 𝐷𝐸𝐴 𝑥, 𝜃 𝐷𝐸𝐴 𝑦 ∈ Ψ 𝑧
• Estimation of the reference set: Kernel estimation
• Frontier reference point is 𝑓 = 𝜃 𝐷𝐸𝐴 𝑦 (output oriented for comparability)
9
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
2
Methodology – Latent Class
• LC SFA tries to account for unobserved factors and heterogeneity in technologies
(Greene, 2002; Orea & Kumbhakar, 2004)
• Consideration of J classes to estimate – class-specific shape of 𝑓(𝛽𝑗 , 𝑥𝑖 )
ln𝑦𝑖 = ln 𝑓(𝛽𝑗 , 𝑥𝑖 ) + 𝑣𝑖 |𝑗 − 𝑢𝑖 |𝑗
• Endogenous selection of class membership: multinomial logit model
𝑃𝑖𝑗 (𝜑𝑗 ) =
10
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
exp(𝜑𝑗 𝑧𝑖 )
𝑗 exp(𝜑𝑗 𝑧𝑖 )
with 𝜑𝐽 = 0
2
Methodology – Latent Class
Estimation: ML or MSL - Likelihood function (𝐿𝐹) as function of
• 𝜏 - parameters of the technology – pre-specified functional form
• 𝜑 - parameters describing class membership
𝑛
ln 𝐿𝐹(𝜏, 𝜑) =
𝑛
ln 𝐿𝐹𝑖 (𝜏, 𝜑) =
𝑖=1
𝐽
ln
𝑖=1
𝐿𝐹𝑖𝑗 𝜏𝑗 ∗ 𝑃𝑖𝑗 (𝜑𝑗 )
𝑗=1
• Posterior class membership probability can be calculated as
𝑃 𝑗𝑖 =
𝐿𝐹𝑖𝑗 𝜏𝑗 ∗ 𝑃𝑖𝑗 (𝜑𝑗 )
𝑗 𝐿𝐹𝑖𝑗
𝜏𝑗 ∗ 𝑃𝑖𝑗 (𝜑𝑗 )
• This class membership probability can then be used to either weight the efficiency
scores – or the frontier reference points
Weighted frontier reference point: 𝑓𝑖 =
11
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
𝑗𝑃
𝑗 𝑖 ∗ 𝑓𝑖 (𝛽𝑗 , 𝑥𝑖 )
2
Methodology – StoNEzD
StoNEzD for Normal-Half-Normal Noise / Ineff.
1. Stage QP: Estimation of average function 𝑔(𝑥)
• No functional form (but piece-wise linear)
• 𝛿 is common to all firms
2. Stage: Decomposing residuals of first stage
• MM estimator to derive 𝜎𝑢
E u = 𝜇 = 𝜎𝑢 2/𝜋
• Shift of 𝑔(𝑥) by expected value of inefficiency
to derive frontier estimate 𝑓(𝑥)
Frontier reference point: 𝑓𝑖 = 𝜙𝑖 ∗ exp 𝛿𝑧𝑖 ∗ exp(𝜇)
12
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
2
Methodology – Comparison of cDEA, LC SFA and StoNEzD for production function
cDEA
LC SFA
StoNEzD
Type
Non-parametric
Parametric
Semi-parametric
Error / Inefficiency
Deterministic
Stochastic
Stochastic
Shape
Constrained
Parametrically
constrained
Constrained
Scaling
assumption
Necessary
Possible
Possible
Convexity of T
Yes
No
Yes
Reference set
Observation specific
All observations,
weighted
All observations
Effect of z on
frontier
Observation specific
Grouped, but
observation specific
via weighting
General effect
13
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
3
The DGP
14
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
3
Data Generating Process
• DGPs are created to replicate real world regulatory data
General relationship
𝑦 = 𝑓 𝑥 ∗ exp 𝛿𝑧 ∗ exp(𝑣 − 𝑢)
Sample Size
𝑛𝑠𝑚𝑎𝑙𝑙 = {25, 50,100,150,250}
+ 4% observations twice as large in terms of inputs
𝑛 = {26,52,104,156,260}
Inputs
15
4 correlated Inputs
𝑥1 … 𝑥4 ~Uni(0,10) for small and 𝑥~Uni(10,20) for large firms
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
3
Data Generating Process
Functional form of 𝑓(𝑥)
• Translog 𝑝0 = 1, 𝑝𝑚 = 0.10, pml = 0.05, pmm = 0.02
Inefficiency and Noise 𝑢~|𝑁 0, 𝜎𝑢
2
|, 𝑣~𝑁 0, 𝜎𝑣
𝜎𝑢 = {0.05,0.1,0.15}
𝜎𝑣 = {0,0.05,0.1}
• Noise-to-Signal:
16
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
2
with
3
Data Generating Process
Environmental Factors 4 different distributions considered,
1 symmetric, 3 skewed, 1 correlated with inputs
17
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
4
Simulation Design and Performance
Measures
18
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
3
Simulation Design
Scenarios
• So far only two different scenarios: Baseline (BL) and High Impact (HI) scenarios
• Only one 𝑧-Variable considered each, variation in impact 𝛿
𝑦 = 𝑓 𝑥 ∗ exp 𝛿𝑧 ∗ exp(𝑣 − 𝑢)
• Each scenario estimated with variation in sample size, 𝜎𝑢 and 𝜎𝑣 , for each estimator
19
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
3
Simulation Design
Implementation
Replications: 𝑅 = 100 100*9*5 = 4500 data sets for 3 estimators
R samples for u and v for each scenario
x,y and z are constant over one scenario
Samples with strong deviations from the DGP are discarded
(correlations in 𝜌𝑥 = ±0.15, wrong skewness in 𝑣 − 𝑢)
StoNEzD
Implemented with Sweet Spot Approach (Lee et al. 2013)
MoM with 𝑀3 set to -0.0001 if wrong skewness occurs
Latent Class
CD estimation
Estimation with 2 - 4 classes, reported is max BIC
5 repetitions with „randomized“ starting values
cDEA
20
Least squares cross validation, Epanechnikov kernel
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
3
Performance measures
Performance Evaluation
Evaluated at frontier reference points corrected for 𝛿𝑧
Performance Measures
Equally weighted deviation in percentage points
Bias > 0 overestimation of the frontier and of inefficiency
Average squared deviation, higher impact of larger
deviation
21
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
5
Initial Results
22
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
5
Initial Results
Generally…
• LC most often outperforms cDEA and StoNEzD
• Distribution of z does not seem to matter concerning bias
• Correlation of z & x has only little effect (BL4 vs. the others)
• Also magnitude of environmental effect seems to play a minor role (HI vs BL)
23
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
5
Initial Results
• LC SFA
• Performs generally well, stable and efficient
• Frontier overestimation tendecies in higher noise cases
• cDEA
• High sensitivity against noise
• Underestimation of frontier in small samples, overestimation in larger samples
• StoNEzD
• General underestimation of the frontier favorable for firms
• Performs well with low inefficiency and small samples
• But problems with high inefficiency
• … but does not seem to be generally efficient
24
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
6
Outlook
25
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
5
Conclusion and Outlook
• Additional Scenarios
• Scenarios with multiple z variables
• Scenarios with heterogeneity in technologies induced by zs 𝛽 = 𝛽(𝑧)
• Misspecified scenarios?
• Estimation
• Optimization of optimization routines – still failed estimations although the
estimated model is the true underlying model
• Suggestions?
26
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
Vielen Dank für Ihre Aufmerksamkeit.
DIW Berlin — Deutsches Institut
für Wirtschaftsforschung e.V.
Mohrenstraße 58, 10117 Berlin
www.diw.de
Redaktion
5
28
Initial Results
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
5
29
Initial Results
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
5
30
Initial Results
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
5
31
Initial Results
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
5
32
Initial Results
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
5
33
Initial Results
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
5
34
Initial Results
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
5
35
Initial Results
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
0
References
• Agrell, P. and Bogetoft, P. (2013). Benchmarking and Regulation. CORE Discussion Papers 2013008.
• Andor, M. and Hesse, F. (2014). The stoned age: the departure into a new era of efficiency analysis? a monte carlo comparison of stoned and the
oldies (SFA and DEA). JPA, 41(1):85-109.
• Badunenko, O., Kumbhakar, S. (2015) When, Where and How to Estimate Persistent and Time-Varying Efficiency in Panel Data Models. WP.
• Cordero, J. M., Pedraja, F., and Santin, D. (2009). Alternative approaches to include exogenous variables in DEA measures: A comparison using
Monte carlo. Comput. Oper. Res., 36(10):2699-2706.
• Daraio, C. and Simar, L. (2005). Introducing Environmental Variables in Nonparametric Frontier Models: a Probabilistic Approach. JPA, 24(1):93-121.
• Daraio, C. and Simar, L. (2007). Conditional nonparametric frontier models for convex and nonconvex technologies: a unifying approach. JPA,
28(1):13-32.
• Greene, W. H. (2005). Reconsidering heterogeneity in panel data estimators of the stochastic frontier model. Journal of Econometrics, 126(2):269303.
• Haney, A. B. and Pollitt, M. G. (2009). Efficiency analysis of energy networks: An international survey of regulators. Energy Policy, 37(12):5814- 5830.
• Johnson, A. and Kuosmanen, T. (2011). One-stage estimation of the effects of operational conditions and practices on productive performance:
asymptotically normal and efficient, root-n consistent StoNEzD method. JPA, 36(2):219-230.
• Jondrow, J., Knox Lovell, C. A., Materov, I. S., and Schmidt, P. (1982). On the estimation of technical inefficiency in the stochastic frontier production
function model. Journal of Econometrics, 19(2-3):233-238.
• Krüger, J. J. (2012). A monte carlo study of old and new frontier methods for efficiency measurement. EJOR, 222:137-148.
• Kuosmanen, T. (2012). Stochastic semi-nonparametric frontier estimation of electricity distribution networks: Application of the stoned method in
the Finnish regulatory model. Energy Economics, 34(6):2189-2199.
• Lee, C.-Y., Johnson, A. L., Moreno-Centeno, E., and Kuosmanen, T. (2013). A more efficient algorithm for convex nonparametric least squares. EJOR,
227(2):391-400.
• Orea, L. and Kumbhakar, S. C. (2004). Efficiency measurement using a latent class stochastic frontier model. Empirical Economics, 29(1):169-183.
• Yu, C. (1998). The effects of exogenous variables in efficiency measurement - a monte carlo study. EJOR, 105(3):569-580.
36
Benchmarking of Public Utilities
Stefan Seifert & Maria Nieswand
© Copyright 2026 Paperzz