Modeling Labor Supply in the Netherlands: The effect of Tax

Modeling Labor Supply in the Netherlands:
The Effect of Tax Credits for Employees as a First Application
Henk-Wim de Boer
Ministry of Social Affairs and Employment
The Netherlands
Email: [email protected]
Tel: 0031-703335170
Paper presented at the 3rd General Conference of the International Microsimulation
Association, Stockholm, Sweden, June 8th to 10th, 2011
May 2011
*********** PRELIMINARY RESULTS: PLEASE DO NOT QUOTE ***************
Abstract
This paper presents a recently developed Dutch labor supply model, with labor tax credits for
employees as a first application. In the Netherlands, the tax benefit model MICROS is used almost
on a daily basis to support the decision making process of policy makers. Until now, the analysis
with MICROS has been static without behavioral responses of individuals. The current model is
extended by modelling labour supply effects in a discrete choice framework (Van Soest 1995). This
paper starts with a presentation of this model. The labour supply model estimates wage elasticities
for men and women of approximately 0.1 and 0.2% respectively. These elasticities are surprisingly
low compared to findings in other studies. Next, this paper focuses on the effect of tax credits for
employers in the Netherlands as a first application.
1. Introduction
In the Netherlands, the Ministry of Social Affairs and Employment uses the micro simulation model
MICROS almost on a daily basis to support the decision making process of policy makers. MICROS
is a tax benefit model which has been developed in the late eighties by the Ministry of Social Affairs
and Employment. The model is primarily used for simulating income effects of proposed policy
measures. In this way, policy makers get a better understanding of the effects of their (proposed)
policy measures for different types of households. Until now, the analysis with MICROS has been
static without behavioral responses of individuals. Many policy measures, however, have labor
supply effects as well. An increase in the labor tax credit, for instance, results in some employees
adjusting their labor supply. Obviously, policy makers are interested in these labor supply effects
as well 1. For that reason the current model MICROS is extended by modelling labour supply effects
in a discrete choice framework (Van Soest 1995).
The tax benefit model MICROS uses data from the Dutch survey WOON (In Dutch: Woononderzoek
Nederland) in 2005. WOON is a representative survey for the Dutch population with administrative
data from the Dutch Tax Administration. MICROS adjusts the population by static weighting on a
yearly basis. Static weighting means that decomposition of households, sources of income and
living situation of the individual cases in the survey remains unchanged. Only income of the
individuals is adjusted on a yearly basis. In this way, one can monitor the development of
purchasing power of households as a result of changes in regulations, wages, taxes, social security
premiums and prices. The structure of the population, such as the number of households,
unemployed and employees, is adjusted by changing the weight factors of individual cases.
The main goal of this study is to extend the simulation model MICROS with a labor supply module.
It is important to note that the labor supply model is restricted to direct labor supply effects of
individuals only. In reality, an increase in labor supply may contribute to wage restraints which in
turn result in a higher demand for labor and this effect is not taken into account. Moreover, the
demand side of the labor market is excluded as well. In order to take these effects into account, a
general equilibrium model is needed which is beyond the scope of this study. As a first application
of the recently developed labor supply model, the labor supply effects of tax credits for employees
are estimated. Employees in the Netherlands receive a labor tax credit AK (In Dutch:
Arbeidskorting) which depends on their income. In addition, secondary earners and sole parents,
who combine work with the care of children, receive an extra income dependent tax credit IACK (In
Dutch: Inkomensafhankelijke Combinatiekorting).
The structure of this paper is as follows. Section 2 provides background information on labour force
participation in the Netherlands and results from other scientific studies. Section 3 presents the
underlying theory of the labour supply model. In section 4, a description of the data is given and
section 5 contains the first preliminary results. The labour supply effects of tax credits for
employees are shown in section 6. Section 7 concludes.
1
The Netherlands Bureau for Economic Policy Analysis (CPB) does take behavioral responses into account in
their analysis. By using a general equilibrium model the effects of several policy measures can be simulated.
2
2. Background
Labour force participation in the Netherlands is high compared to other OECD countries. In 2005,
the labour force participation rate for men was 84.1% and for women 69.0%. The OECD averages
are 83.2 and 61.9 respectively 2. However, the majority (70%) of these women work part-time 3.
On average men worked 37.1 hours per week whereas women worked 24.2 hours per week in
2005. From a policy’s perspective, the latter group is particularly interesting. For these part-timers
there is still room to increase their labour supply and this group is relatively elastic with respect to
income.
Many studies on labor supply models use the discrete choice framework developed by Van Soest
(1995). He uses a discrete choice model to estimate a basic labor supply model, in which
individuals choose their working hours from a finite set of alternatives. Van Soest uses cross
section data for Dutch couples from 1987. This basic model produces median elasticities for an
average family of 0.15 and 1.03 for men and women respectively. Van Soest shows that the basic
labor supply model strongly overpredicts the number of part-time jobs. In order to correct for this,
alternative specific dummies for small part-time jobs are added. As a result, the median elasticities
drop to 0.10 and 0.52 for men and women respectively. A later study by Van Soest (2000) with
data from 1995 estimates a higher elasticity (0.7) for cohabiting/married women in the
Netherlands. Nelissen et al (2005) use panel data from 1990 to 2001 and estimate an elasticity of
0.30-0.40 for women (with a partner) in the Netherlands.
Evers et al (2008) apply a meta analysis in order to estimate labor supply elasticities in the
Netherlands. The analysis is based on 32 scientific studies, 6 of which refer to the Netherlands. The
authors conclude that labor supply elasticities are 0.10-0.20 for men and 0.50 for women. The
studies above show that women’s labor supply is more elastic and that they respond more heavily
to financial incentives than men. A possible explanation is the relatively low participation rate for
women.
As stated earlier, several recent studies on labor supply use the discrete choice framework by Van
Soest (1995). Some of these studies use cross section data meaning that the labor supply decision
is not intertemporal. In reality however, the time dimension of labor supply is important and we
can regard the labor supply decision as an intertemporal choice. Consider women, for instance,
who often reduce their labor supply on a temporary basis due to the birth of children and the care
afterwards. In addition, many studies assume that gross wage is exogenous and does not vary
between the labor supply alternatives. However, gross wage is endogenous for the following
reasons:
•
‘Simultaneity’: wages affect labor supply but labour supply influences wages as well (due
to progressivity of the tax system).
•
Unobserved heterogeneity: there are unobserved characteristics like talent and social skills
that influence labor supply.
2
3
OECD Employment Outlook
Statistics Netherlands (emancipatiemonitor 2009).
3
The methodology used by Blundell et al (1998) does take the time dimension of labour supply and
endogeneity of wages into account. The authors investigate how several tax reforms in the United
Kingdom affected labor supply of females in the 1980s. A tax reform results in exogenous variation
in income. In this way, we have a natural experiment which makes it highly suitable to study the
effect of net wages on labor supply. In particular, a tax reform is independent from individual
characteristics, or working hours, which makes it extremely suitable to study labor supply.
Van der Klaauw and Bosch (2009) follow Blundell’s approach in studying labor supply of females.
By using repeated cross section data for the period 1999-2003, the authors use a large, Dutch tax
reform in 2001 to study labor supply effects of cohabiting/married females. The tax reform in 2001
resulted in a sharp reduction in the maginal tax rates in the Netherlands. Furthermore, the general
allowance was replaced by a tax credit. With the old system prior to 2001, individuals only paid
taxes on income above the general allowance. Hence, individuals with a high marginal tax rate
benefited the most from this general allowance. With the old system it was possible for non
working individuals to transfer this allowance to their partner (with a higher marginal tax rate).
This was extremely beneficial in terms of reducing tax payments and in turn influenced non
working individuals’ decision to participate. For some non working individuals, with a relatively low
expected wage (conditional on participation) and hence a low marginal tax rate, it was profitable
not to participate on the labor market. Hence, the old tax system contained an opposing financial
incentive not to work. Van der Klaauw en Bosch estimated a negative wage elasticity which means
that the income effect dominates the substitution effect. The main reason for this is that the tax
reform primarily reduced high marginal tax rates. Especially employees with a relatively high
income profited from the drop in marginal tax rates and these employees were relatively inelastic.
Despite the negative income elasticity, the average number of working hours increased after the
tax reform from 17.92 to 18.28 hours per week. The authors showed that the increased
participation rate was responsible for this result. The participation rate increased because the
financial incentive to work was increased by replacing the general allowance by a tax credit. Hence,
the effect of the increased participation rate dominates the effect of the negative wage elasticity
and as a result, the tax reform increased labor supply.
A disadvantage of the studies by Van der Klaauw and Blundell is that marginal utility of income is
considered constant. In reality, marginal utility is decreasing in income. Many recent studies, based
on the discrete choice framework, allow for diminishing marginal utility of income. Another
drawback is that both studies assume that labor supply of males is fixed which is unreasonable.
Many households make a joint decision with respect to labor supply. The pooling hypothesis for
instance 4 holds that the source of income in the household is not relevant. Finally, the method
described above is not suitable to simulate proposed policy measures, such as expenditure cuts on
child care or the introduction of a flat tax in the future. By contrast, the discrete choice labor
supply models estimate the preferences of individuals for leisure and income. That is, these models
estimate the parameters of the (quadratic) utility function which can be used to simulate proposed
4
Lungerd, S, Pollak, R. and T.Wales (1997), Do Husbands and Wives Pool Their Resources? Evidence from the
United Kingdom Child Benefit, Journal of Human Resources, 32(3) pp. 463-479.
4
policy measures. This is the main reason for opting for a labor supply with discrete choices. For the
Ministry of Social Affairs and Employment, it is important that the labor supply model can simulate
future policy measures as well and develops into an essential instrument to support the decision
making process of policy makers in the Netherlands.
The choice for a discrete choice model instead of a continuous labor supply model is motivated as
follows. Firstly, the system of taxes and social security premiums creates a non-linear budget
constraint seriously complicates estimation. In practice, the calculations in continuous labor supply
models become too complex. Creedy and Duncan (2002) provide a nice technical survey of recent
developments in behavioural microsimulation. In their survey, they also concentrate on the pros
and cons of continuous labor supply models and discrete labor supply models. Secondly, the labor
supply decision of employees is not a continuous choice in practice. In reality, employees only have
a limited number of options (discrete choice).
3. Theory
Section 3.1 sets out the underlying theory of the labor supply model. In the end, the model should
not only focus on the decision of individuals to work more or less (intensive margin) but also on the
decision of individuals to participate or not (extensive margin). For non-working individuals it is
impossible to observe wage rates in the sample and we have to estimate these wages. Section 3.2
describes the theory of this wage estimation.
3.1 Labor supply model
Starting point of the analysis is an individual who maximizes utility with respect to leisure and
income, taking a budget constraint into account. The budget constraint is as follows 5:
Y = w * L + y non = w(80 − H ) + y non
(1)
Income (Y) equals the sum of the product of net wage (w) and labor supply (L) and other income
(ynon), such as wealth and family allowances. Here, the following time restriction holds: L+H=80 6,
where H represents hours of leisure per week. An individual chooses a combination of income (Y)
and leisure (H) so that his/her utility is maximized. For this we take the following quadratic utility
function:
U (Y , H ) = α yy Y 2 + α hh H 2 + α yhYH + β y Y + β h H
(2)
The advantage of a quadratic utility function is that this function is flexible and that the marginal
utility of income is not constant, as opposed to the linear utility function.
5
The gross wage is considered fixed throughout the analysis. In reality, one can argue that the gross wage is
endogenous.
6
Experimentation with other time endowments hardly affected the results.
5
We are not able to observe utility of the individuals. We only observe the actual working hours and
some relevant properties such as net salary, age, sex, education level and number and age of
children. Suppose we can observe all relevant properties (and we do not suffer from measurement
errors, etc.), then the model is deterministic. In that case, we can calculate utility for all individuals
in each of the alternatives and simply pick the alternative which yields the highest utility. In reality,
however, we have to deal with measurement error and unobserved heterogeneity. There are
properties that we can not observe, such as talent and social skills, that affect wages and hence
the utility of the individuals. Hence, utility consists of a deterministic component and an error term:
V (Y , H ) = U (Y , H ) + v
(3)
The error term ν here represents measurement errors and unobserved heterogeneity. An individual
i chooses labor supply option k if utility is higher than for the other alternatives:
pik = P(Vk > Vi ) = P(U k − U i > vi − v k )
(4)
If we assume that the error terms vi are independent and identically distributed over all
alternatives, then it follows that the difference in utility between the alternatives follows a logistic
distribution 7. The probability of alternative k,
pik , is a conditional probability. This probability is
conditional with respect to the error term. The unconditional probability is obtained by taking into
account the distribution of the error term. In discrete choice models of labor supply it is often
assumed that the error terms follow an extreme value distribution. McFadden (1973) showed that
in this case the probability for alternative k,
pik , for individual i equals:
n
pik = P(Vik > Vij ) = exp(Vik ) / ∑ exp(Vij )
(5)
j =1
In order to calculate the probabilities for the alternatives, we must first estimate the parameters of
the utility function. In other words, we must determine the preferences of individuals for leisure
and income. These parameters can be estimated by a conditional logistic regression (using
maximum likelihood) 8. The joint probability that all M individuals choose alternative k is thus equal
to the product of each individual’s probability for alternative k:
M
n
i =1
j =1
P(h1k ,...hmk ) = p1k p 2 k .... p mk = ∏ exp(Vik ) / ∑ exp(Vij )
(6)
7
See for instance Cameron and Trivedi (2005)
In a conditional logit the properties vary over the alternatives (alternative varying regressors). Characteristics
such as education and age only vary by individuals and not by alternatives. These properties interacted with
leisure ensure that we are dealing with properties that vary over the alternatives. The interpretation is that the
preference for leisure varies with these properties. This results in estimated parameters β that are the same
for all individuals and over the alternatives. By comparison, a multinomial regression with alternative invariant
regressors results in parameters that vary by alternative.
8
6
The equation above is the likelihood function that we want to maximize over the parameters β of
the utility function. We are looking for the parameters of the utility function with the highest
likelihood given the observed disribution of labor supply. The log likelihood is as follows:
M
n
i =1
j =1
log( L) = ∑ Vik − log(∑ exp(Vij )
(7)
It is expected that the preferences of individuals for income and leisure depend on characteristics
such as age, education, ethnicity and the presence of children. Individuals with children will
probably (on average) have a higher preference for leisure than someone without children. In
addition, unobserved heterogeneity (ε) plays an important role as well. Here we assume that ε is
normally distributed. The utility function then becomes 9:
U (Y , H ) = α yy ln 2 (Y ) + α hh ln 2 ( H ) + α yh ln(Y ) ln( H ) +
β y 0 ln(Y ) + {β h 0 + β h1 ln( Age) + β h 2 ln 2 ( Age) + β h 3 Educlow + β h 4 Educhigh +
(8)
β h 5 Child 04 + β h 6 Child 412 + β h 7 Child1218 + β h8 Re gion + β h 9 Etnic + ε } ln( H )
where:
•
Y = net income per week
•
H = leisure per week
•
Age = age (natural logarithm, ln)
•
Age2 = ln(age)*ln(age)
•
Educlow = dummy low education
•
Educhigh = dummy high education
•
Child04 = age youngest child 0-4 year
•
Child412 = age youngest child 4-12 year
•
Child1218 = age youngest child 12-18 year
•
Region = dummy for region West in the Netherlands
•
Etnic = dummy for non-Western immigrant
10
The quadratic utility function allows marginal utility of income and leisure not to be constant. For
age a quadratic term is included as well since it is expected that the relationship between age and
the preference for leisure is not constant. That is, for younger people the preference for leisure
probably declines with age whereas for older people this relationship is reversed. In addition, the
utility function contains dummy variables for education, children, region and non-Western
immigrants. One would expect that lower education, the presence of children and a non-Western
background raises the preference for leisure. Finally, a dummy for the region west is included as
9
For the variables income, leisure and age, the natural logarithm is taken
Medium education is the reference category
10
7
well since the Western part of the Netherlands is characterized by high economic activity and high
population density.
Unobserved heterogeneity is a property that, like age and education, is constant over the
alternatives and is therefore multiplied by the natural logarithm of leisure. The additional random
term ε, with a normal distribution, complicates the estimation. By adding a random error term ε,
the likelihood function must take the distribution of this random term into account (as an integral
in the likelihood function). This likelihood function has no closed form solution and therefore we
must use simulated maximum likelihood. For each draw of the random term ε we calculate the
likelihood and then take the average of the likelihood over R draws 11. The average maximum
likelihood is then maximized:
R
log( L) = (1 / R)∑
r =1
n




V
V
log(
exp(
)
−
∑
∑
ikr
ijr


i =1 
j =1

M
(9)
For the draw of the random terms, Halton sequences are used as they provide better coverage
than pseudo-random draws. Train (2003) provides a good description of this method. Haan and
Uhlendorf (2006) provide a good description of STATA routines for simulated maximum likelihood
and Halton draws.
Once the parameters of the utility function are estimated (including the random terms for
unobserved heterogeneity), the actual choice of all individuals must be calibrated. Creedy and Kalb
(2005) present a nice overview of the calibration process. Error terms are drawn from the extreme
value distribution and added to the calculated utility for each alternative. A draw is accepted when
the actual choice for an individual yields the highest utility. This exercise should be repeated
several times (eg 100 draws). Calibration results in a distribution of predicted labor supply that is
perfectly consistent with the observed distribution of labor supply in the sample. Next, we can
calculate labor market effects of policy measures by using the same (calibrated) error terms again.
This results in a new probability for each of the alternatives, given the characteristics (including
random terms for unobserved heterogeneity), calibrated error terms and the labor supply in the
baseline scenario.
3.2 Estimate expected wages non-working individuals
The labor supply model should also be able to model the participation decision of individuals, the
so-called extensive margin. For the inactive population in the sample the gross wage conditional on
participation is unknown and must be estimated. This section describes the theory of this wage
estimation.
For the wage estimation we use the observed wage distribution of workers in the sample. We can
estimate the determinants of gross wages for this group and use these results to estimate
11
As R approaches infinity, the maximum likelihood estimation is consistent.
8
expected gross wages for non-working individuals. Here, we have to consider a (possible) selection
effect. The population of workers will probably differ from the inactive population. By using the socalled Heckman selection model (1979), it is possible to correct for selection bias.
The Heckman selection model consists of a wage and participation equation. The wage equation for
gross wages is as follows 12:
ln(W ) = Xβ + ε
(12)
where X is a vector of explanatory variables (such as age and education) and ε is the unexplained
part, which is assumed to be normally distributed with mean 0 and variance σ2. We estimate the
vector β of the wage equation.
The participation equation estimates the probability of participation, given certain characteristics Z.
The participation equation is estimated using a Probit regression13:
Pr( Active = 1 | Z ) = Φ ( Zγ )
(13)
where Z is a vector of explanatory variables (age, education etc.). The unexplained portion is equal
to ν, assuming that ν is (standard) normally distributed. The vector γ contains the coefficients of
the participation equation while Φ is the cumulative distribution function. With respect to the error
terms of both equations, the following holds:
Corr[ ε ,υ ] = ρ
(14)
That is, the correlation between the two error terms of the wage equation and participation
equation equals rho. That is, rho is the correlation between unobserved determinants of the
probability of participation and unobserved determinants of wages. If rho is positive, it means that
unobserved characteristics are positively correlated. Take for instance the characteristic talent in
both equations. Talent has a positive effect on the gross wage and the probability of participation,
with a positive rho as a result. If rho is negative, then there are unobserved characteristics that
have a positive impact on participation, but a negative effect on the gross wage (or vice versa). If
rho is equal to 0, then the error terms are independent and selection bias is not present.
The conditional expected value of gross wage (that is, provided that the individual works) can be
approximated by:
E[ W * | Active = 1] = Xβ + E[ε | X , Active = 1] = Xβ + ρσM
12
13
(15)
The wage equation is estimated by OLS.
The participation regression can also be estimated by a logistic regression.
9
where
M = φ (Zγ ) / Φ(Zγ )
The second term on the right hand side of the equation is a variant of the Inverse Mill's Ratio. Here
φ
is the probability density function whereas
Φ represents the cumulative probability density
function. Both functions use the vector of explanatory variables Z as input for each individual. The
participation equation makes sure that the effect of the characteristics (Z) from the participation
equation on wages is taken into account. That is, a correction is made for selection bias. The
Heckman selection model is estimated for single women, single men and couples (again
differentiated between men and women). The explanatory variables are as follows:
Wage equation:
o
Dummies education
o
Age
o
Age squared
o
dummy non-Western immigrant
o
dummy Western region
Participation equation:
o
Dummies education
o
Age
o
Age squared
o
dummy non-Western immigrant
o
dummy Western region
o
dummy youngest child 0-4 year
o
dummy youngest child 4-12 year
o
dummy youngest child 12-18 year
It is expected that individuals with a higher education have a higher probability of participation and
receive higher wages on average. A similar story holds with respect to age. However, the positive
relationship between age and participation and wages is expected to weaken when individuals
become older (diminishing returns). Furthermore, dummies for Non-Western immigrans (negative
relationship) and Western region (positive relationship) are included as well. Finally, one expects
that the presence of children lowers the probability of participation.
10
4. Data
This section describes the dataset in more detail. As stated earlier, labor supply will be estimated
for singles and couples separately where a distinction is made between men and women. For
singles there are 6 alternatives to choose from: 0, 8, 16, 24, 32 and 40 hours 14. Couples are
offered 15 alternatives in the model where a distinction is made between men and women. For
men, there are 3 alternatives (0, 30 and 40 hours of work) whereas for women there are 5
alternatives (0, 10, 20, 30 and 40 hours of work), bringing the total number of alternatives to 15.
The gross hourly wage of workers can be calculated by by dividing the gross annual income in the
sample by 52 (number of weeks per yaer) times the working hours per week. For non-working
individuals the gross hourly wage is estimated by using the Heckman Selection model. Next, net
income has to be calculated for each of the alternatives. The simulation model MICROS offers a
comprehensive gross/net income module by which net income for households in each of the
alternatives can be calculated accurately.
The micro dataset contains the following relevant information for the labor supply model: working
hours, age, education, region, ethnicity and age and number of children. For this study, only
individuals aged between 23 and 60 years are included. Self-employed individuals and individuals
with multiple sources of income (for example wages and profits) are not included. This is because
the budget constraint becomes very complex if multiple income sources are present. Inactive
people are part of the population, but only if they receive social assistance or unemployment
benefits. For couples, non-working partners are included as well. Finally, disabled individuals are
also excluded. The reason for this is that some of these disabled individuals are permanently
disabled, with no prospect of recovery, and it is not possible to distinguish this group in the sample
in a proper way.
Table 1 shows the descriptive statistics of the singles population. For single men and women holds
that non-working individuals are relatively low educated. That is, 75.8% of non-working individuals
has a low education, while the share of individuals with a low education is only 31.5% in the active
population of single women. Furthermore, non-working individuals are older on average and often
have a non-Western background. An important difference between single men and women is the
the employment rate. Approximately 14% of single men are inactive while 24% of single women is
not working. One possible explanation is the presence of children. The share of single men with
children is very small (4%) while more than half of single women have children (54.5%).
Table 2 shows a similar table with descriptive statistics for couples, again differentiated between
men and women. As expected, most men work (94%) and average working hours is high (36.6
hours). Approximately one quarter of women are inactive. As with singles, we see that ethnicity
and low education are relatively common in the inactive population. Inactivity comes in two
different ways for couples: (1) both persons in the household do not work and the household
14
The results are quite robust with respect to the number or design of the alternatives. The fit of the model, in
terms of the predicted labor supply distribution, is highest for the current design. However, the differences are
small.
11
receives social assistance or unemployment benefits and (2) either one of the partners does not
work while the other partner does have a job.
Table 1: Descriptive statistics singles
Total
42,3
26,8
31,0
Low education
Medium education
High education
Singles men
(# 3.276)
Active
Inactive
37,4
71,9
28,1
18,6
34,5
9,5
Total
42,1
25,8
32,1
Singles women
(# 3.647)
Active
Inactive
31,5
75,8
27,6
20,1
40,9
4,1
No children
Youngest child 0-4 yr
Youngest child 4-12 yr
Youngest child 12-18 yr
96,1
0,2
1,7
2,0
96,1
0,2
1,8
1,9
95,9
0,4
1,5
2,2
70,0
5,2
13,6
11,2
77,8
2,8
9,5
9,9
45,5
12,9
26,5
15,2
Other
Non-Western immigrant
88,6
11,4
91,7
8,3
69,7
30,3
83,0
17,0
89,0
11,0
64,2
35,8
Region West
Other region
50,4
49,6
50,7
49,3
48,1
51,9
54,4
45,6
55,2
44,8
51,9
48,1
Age (average)
39,6
39,2
42,3
40,5
40,1
41,6
Active
Inactive
Working hours (average)
Working hours (average),
only active population
85,9
14,1
32,9
76,0
24,0
25,5
38,3
33,5
Table 2: Descriptive statistics couples
Low education
Medium education
High education
Total
34,4
30,0
35,6
Couples, men
(# 14.184)
Active
Inactive
33,3
51,5
30,5
21,6
36,1
26,9
Total
39,8
31,2
29,0
Couples, women
(# 14.184)
Active
Inactive
33,1
60,6
33,6
23,8
33,3
15,5
No children
Youngest child 0-4 yr
Youngest child 4-12 yr
Youngest child 12-18
yr
41,3
22,5
22,8
41,3
22,5
22,8
41,3
21,8
23,7
41,3
22,5
22,8
42,2
22,5
21,9
38,4
22,3
25,8
13,4
13,4
13,3
13,4
13,4
13,5
Other
Non-Western
immigrant
90,6
92,2
65,3
90,5
93,0
82,7
9,4
7,8
34,7
9,5
7,0
17,3
Region West
Other region
44,5
55,5
44,3
55,7
47,6
52,4
44,5
55,5
45,0
55,0
43,0
57,0
Active
Inactive
Age (average)
Working hours
(average)
Working hours
(average), only active
population
94,1
5,9
42,1
42,1
75,7
24,3
42,5
39,8
39,1
36,6
18,2
38,9
24,0
42,0
12
5. Estimation results
This section presents the first preliminary results of the recently developed labor supply model.
Section 5.1 shows the results for the wage estimation from the Heckman Selection Model. The
estimated parameters of the utility function are presented in section 5.2.
5.1. Results Heckman Selection Model
Table 3 shows the results for the wage and participation equation for single men and women. Most
of the coefficients of the wage equations for singles are highly significant (at 1% level) and have
the expected sign. The only exception is that the coefficient for ethnicity is not significant (and
slightly positive) for single women.
Table 3:
Estimation results Heckman Selection Model singles (by maximum
likelihood)
Men
Women
(# 10.464)
(# 10.216)
Wage equation
Dummy education 1+
- 0.459**
-0.558**
- 0.362**
-0.448**
Dummy education 2
Dummy education 3
- 0.296**
-0.313**
Dummy education 4
- 0.111**
-0.154**
Age (ln) ++
3.896**
2.523**
Age squared (ln) ++
- 0.484**
-0.307**
Ethnicity
- 0.061**
0.004
Region West
0.064**
0.058**
Constant
- 4.833**
-2.203**
Participation equation
Dummy education 1+
Dummy education 2
Dummy education 3
Dummy education 4
Age (ln) ++
Age squared (ln) ++
Ethnicity
Region West
Youngest child 0-4 years
Youngest child 4-12 years
Youngest child 12-18 years
Constant
rho (ρ)
sigma (σ)
lambda (λ= ρ*σ)
Log Likelihood
- 0.843**
- 0.274**
- 0.038
0.117*
7.728**
-1.187**
-0.512**
0.099**
-0.475*
0.021
0.015
-11.517**
-1.043**
-0.519**
-0.112
0.165**
13.667**
-2.023**
-0.444**
0.253**
-0.842**
-0.459**
-0.254**
-22.051**
-0.073
0.351
-0.026
-0.0977
0.369
-0.036
-8401.5
-8458.4
+ Category university is the reference category.
++ natural logarithm
** significant at 1%, * significant at 5%
Coefficients of the wage equation are directly interpretable. For instance, the average wage in the
Western region of the Netherland is 6.6% higher than in the rest of the Netherlands (exp (0064)).
Furthermore, we see that a higher education results in a higher predicted wage. The relationship
between wage and age is positive as well, but diminshing in age. Finally, there is a negative
relationship between wages and ethnicity.
13
In the participation equation, we see that most coefficients are significant as well. Only the
coefficient for the third education dummy (medium education) is not significant for single men and
women. We also see that people with higher vocational education (education dummy 4) have a
higher participation probability than those with a university degree (education dummy 5 which is
the reference category). One possible explanation is that some university graduates are searching
longer for a job compared to graduates with higher vocational education, since more studies are
offered that less fit the needs of the labor market. As a result, the probability of finding a job is
lower. The coefficients for age of the youngest child have the expected sign: as the youngest child
is older, the probability of participation increases. For men, we see that these coefficients are not
significant at the 5% level. This is probably due to the low number of observations of men with
young children in the sample. Furthermore, cultural factors may also play an important role. The
relationship between the probability of participation and age (positive, but declining), ethnicity
(negative) and region (positive) is as expected. It is important to note that the coefficients of the
participation equation are not directly interpretable. The participation equation is estimated with a
Probit model, where the marginal effects are not constant and depend on the level of explanatory
variables.
The coefficient rho is less than 0, but not significant at the 5% level. This means that we cannot
reject the null hypothesis that unobserved characteristics are not negatively correlated. The
coefficient for lambda is equal to the product of rho and sigma, and equal to -0.026 for men. The
final correction varies by individual as it is equal to the quotient of the probability density function
(PDF) and cumulative distribution function (CDF), multiplied by lambda. The model described
above results in an estimated wage rate for active and inactive individuals, while taking selection
bias into account.
14
Table 4 shows the results for the wage and participation equation for couples, again estimated
separately for men and women.
Table 4:
Estimation results Heckman Selection Model couples (by
maximum likelihood)
Men
Women
(# 17.669)
(# 19.777)
Wage equation
Dummy education 1+
- 0.711**
-0.745**
Dummy education 2
- 0.576**
-0.598**
Dummy education 3
- 0.444**
-0.374**
Dummy education 4
- 0.195**
-0.121**
Age (ln) ++
5.101**
9.283**
Age squared (ln) ++
- 0.622**
-1.282**
Ethnicity
- 0.189**
-0.130**
Region West
0.073**
0.065**
Constant
- 7.134**
-13.906**
Participation equation
Dummy education 1+
Dummy education 2
Dummy education 3
Dummy education 4
Age (ln) ++
Age squared (ln) ++
Ethnicity
Region West
Youngest child 0-4 years
Youngest child 4-12 years
Youngest child 12-18 years
Constant
- 0.710**
- 0.221**
- 0.040
0.069
10.229**
-1.375**
-0.706**
0.072**
-0.013*
-0.099**
-0.044
-17.202**
-0.950**
-0.508**
-0.100*
0.254**
18.974**
-2.737**
-0.487**
0.107**
-0.423**
-0.327**
-0.069**
-31.587**
rho (ρ)
sigma (σ)
lambda (λ= ρ*σ)
0.681**
0.366
0.249
0.784**
0.442
0.347
Log Likelihood
-9840.3
-16681.9
+ Category university is the reference category.
++ natural logarithm
** significant at 1%, * significant at 5%
Most of the coefficients for couples are significant and have the expected sign. In the wage
estimation for couples we see that rho significantly differs from 0. The coefficient for rho is positive
which means that unobservable characteristics have a positive effect on both wages as well as
participation.
5.2 Results parameters utility function
This section presents the estimation results of the parameters of the utility function. Table 5 shows
these results for singles. For single women, all coefficients are significant except for the quadratic
term for income. This does not apply for single men. The coefficients of income, leisure and higher
education are not significant at the 5% level. Furthermore, the interaction terms of children with
leisure are not significant. The latter result is not surprising given that single men with children are
rare in the sample.
15
The coefficients of the interaction terms of dummy variables with leisure are directly interpretable.
For example, single persons with low education have a greater preference for leisure. The same
holds for singles with a non-Western background. Singles in the Western region have a lower
preference for leisure. The coefficients for income, leisure and age are not directly interpretable.
For income and leisure, a quadratic term and an interaction term is included so that the marginal
utility of income and leisure is not constant and depends on the level of income and leisure.
However, it can be shown that leisure and income have a positive effect on the utility of individuals
and that marginal utility of income falls as income increases. This is intuitive. The model also
includes a quadratic term for age. The interpretation is that preferences for leisure decrease with
age (that is preference for income/work increases), but this relationship becomes positive for older
individuals (that is, a higher preference for leisure). As expected, single women with children have
a greater preference for leisure. This effect becomes smaller as children are older.
Table 5 shows that a dummy for participation is estimated as well. Initially, a model was estimated
without a dummy and this basic model strongly overpredicted the number of part-time jobs. This is
a well known issue by these types of labor supply models (see Van Soest for example). In order to
correct for this problem, researchers often include dummies which improve the predicted labor
supply distribution. The dummy for participation can be interpreted as follows:
•
Dummy represents fixed costs of participation (such as costs of childcare and travel costs)
•
Dummy corrects for the fact that small part-time jobs are rarely offered (correction for
supply side of labor)
•
Dummy displays search costs for workers and these workers have to search longer for
small part-time jobs
I have included a dummy for participation and the estimated coefficient is negative. In this case,
the interpretation is that the dummy represents fixed costs of participation.
Finally, table 5 shows that the variance of the random coefficients is significant for single men and
women. As previously stated, the random terms for leisure ensure that the coefficient of leisure
varies by individuals. Therefore, these models are often called random coefficients models. If the
variance is not significantly different from 0, this means that we have fixed parameters and
unobserved heterogeneity plays no role.
16
Table 5: estimation results parameters utility function, singles
Income (Y)
Leisure (H)
Income2
Leisure2
Income*Leisure
Men
(#3.276)
2.784
-3.258
-0.142
0.201
-0.251
Women
(#3.647)
10.234 **
46.322 **
0.426
-6.740 **
-1.905 **
H*Age
H*Age2
H*Educlow
H*Educhigh
H*Child0-4
H*Child4-12
H*Child12-18
H*ethnic
H*region
-2.000 *
0.500 **
1.317 **
-0.148
1.000
0.067
- 0.068
2.330 **
- 0.339 *
-9.676 **
2.382 **
4.179 **
-3.033 **
12.263 **
9.222 **
4.010 **
3.076 **
-1.660 **
Dummy participation
ln(σ)
- 2.032 **
0.994 *
-1.835 **
2.178 **
Log Likelihood
Pseudo R2
* p<0.10, **P<0.05
- 2.836.4
0.52
-4.607.0
0.29
An important consideration in the labor supply model above is that the quadratic utility function is
not automatically quasi-concave for all values of Y and H. In the discrete choice model, this is not a
problem as long the utility function is quasi-concave for the observed combinations of Y and H in
the sample. An important condition here is that the marginal utility of income is not negative. For
all observations, this condition is fulfilled.
Figure 1 and 2 show the actual and predicted distribution of working hours for single men and
women respectively. Both figures show that there is a good match between the actual distribution
and the predicted distribution.
Figure 1: Actual and predicted labor supply distribution, single men
80.0
72.4
72.4
70.0
60.0
50.0
40.0
30.0
20.0
14.3
14.4
8.0
10.0
0.6
0.0
0 hours
0.7
8 hours
1.8
1.6
2.8
7.8
3.2
16 hours
24 hours
Actual labor supply
Predicted labor supply
32 hours
40 hours
17
Figure 2: Actual and predicted labor supply distribution, single women
45
39.4
40
39.2
35
30
24.4
25
24.1
21.6
21.6
20
15
10.3
10
5
3.6
0.7
10.8
3.6
0.8
0
0 hours
8 hours
16 hours
Actual labor supply
24 hours
32 hours
40 hours
Predicted labor supply
In order to calculate elasticities, net wage is increased by 10% which results in an increase in
average working hours from 32.5 to 32.8. This amounts to an elasticity of 0.10 for single men. For
single women, the average working hours increase from 25.8 to 26.1 which leads to an elasticity of
0.15.
Table 6 shows the estimation results for couples. The structure of the model is as follows:
•
Alternatives labor supply men: 0, 30 and 40 hours
•
Alternatives labor supply women: 0, 10, 20, 30 and 40 hours
This combines to a total of 15 pairs of alternatives. The second column of table 6 shows the results
of the model in which couples jointly make a labor supply decision. Columns 3 and 4 include the
results of labor supply models where only one partner is flexible with respect to labor supply, while
the other partner keeps its labor supply fixed. Most coefficients are significant at 5% level. Only a
few coefficients for men are not significant. These are the dummies for high education, older
children (12-18 years), ethnicity and region. In addition, the signs for ethnicity (negative, so less
preference for leisure) and region West (positive, so higher preference for leisure) for both men
and women are not anticipated. I have no explanation for these results. For the remaining
variables all signs are significant and have the expected sign. The presence of children increases
the preference for leisure and this effect is stronger for women.
Initially, a basic model without dummies was estimated which again resulted in overprediction of
the number of small part-time jobs. Individuals with these small part-time jobs are mainly women.
Therefore only alternative specific dummies are added for women with small part-time jobs,
namely for 10 and 20 hours of work. The estimated coefficients for these dummies are significant.
Finally, the results show that the estimated (log) of the variance for the random coefficients for
leisure is significant for men and women.
18
Table 6: Estimation results parameters utility functions couples
Couples jointly
(# 14.184)
Couples, men
flexible
(# 14.184)
-7.308 **
-23.770 **
Income (Y)
Leisure men (Hm)
Leisure female (Hf)
Income2
Hm2
Hf2
Y*Hm
Y*Hf
Hm*Hf
-6.844 **
-5.177 **
13.061 **
1.176 **
1.555 **
-0.859 **
0.699 **
0.656 **
-1.834 **
Hm*Age
Hm*Age2
Hm*Educlow
Hm*Educhigh
Hm*child0-4
Hm*child4-12
Hm*child12-18
Hm*etnic
Hm*region
-2.760 **
0.666 **
0.121 *
-0.016
1.294 **
0.786 **
0.114
1.549 **
-0.108
Hf*Age
Hf*Age2
Hf*Educlow
Hf*Educhigh
Hf*child0-4
Hf*child4-12
Hf*child12-18
Hf*etnic
Hf*region
-6.172 **
1.546 **
0.926 **
-0.953 **
3.275 **
2.795 **
1.234 **
1.153 **
-0.317 **
Dummy 10 hours
Dummy 20 hours
-0.874 **
0.358 **
-1.996 **
-1.102 **
ln(σm)
ln(σf)
0.367 **
0.538 **
-1.897
Log Likelihood
Pseudo R2
* p<0.10, **P<0.05
-27534.5
0.29
-9150.6
0.64
1.061 **
3.524 **
Couples, women
flexible
(# 14.184)
-13.096 **
-21.533 **
1.032 **
2.656 **
1.451 **
2.703 **
-1.703 **
0.335 **
0.231 **
-0.020
0.060
-0.007
-0.145 **
1.154 **
-0.099 **
-6.735 **
1.771 **
1.731 **
-1.925 **
4.696 **
3.948 **
1.631 **
1.234 **
-0.832 **
-1.140 **
0.087 **
-5.280
-22127.1
0.13
Figure 3 shows the actual and predicted distribution of couples for the model where couples decide
cooperatively. Figure 3 shows that the model predicts the labor supply distribution well. To
determine the elasticities, net wages of men and women are increased by 10%. This results in an
elasticity of 0.20 for women, while the elasticity for men equals 0.07. Surprisingly, the elasticities
for women are very low compared to other studies. Van Soest (1995), for instance, found elasticity
for women (couples) of approximately 0.50. It is important to note that Van Soest uses data from
1987 and in this period the participation of women was low: only 42%. This may be an important
explanation for the relative high elasticity for women.
19
Figure 3: Actual and predicted labor supply distribution couples
30
27.2
26.5
25
22.8 22.8
20
15
11.5
12.5
11.4
11.5
10.3 9.8
10
5
2.6 2.1
0.3 0.4
3.1 3.4
2.3
1.11.5
1.1 1.1
1.5 1.5
1.8
0,20
hours
0,30
hours
0,40
hours
30,0
hours
0.7
1.1
2.7
1.7
1.9 1.7
0
0,00
hours
0,10
hours
30,10
hours
30,20
hours
Actual labor supply
30,30
hours
30,40
hours
40,00
hours
40,10
hours
40,20
hours
40,30
hours
40,40
hours
Predicted labor supply
The elasticities above are based on the expected value of labor supply in the baseline scenario en
the scenario in which wages are increased. Until now, the calibration method described by Creedy
and Kalb (2005) is not implemented yet. This means that the elasticities are based on uncalibrated
results. With uncalibrated probabilities we must restrict our attention to aggregated results only.
Calibration of the error terms (see section 3.1) however ensures that the actual choices of all
individuals obtain the highest utility. Hence, calibration guarantees that the labor supply model
perfectly predicts the labor supply distribution in the baseline scenario. Consequently, we are able
to focus on households into more detail, i.e. at the micro level. In this way, we can look beyond the
aggregated results and focus on several specific groups. The elasticity of 0.20 for women is an
average for the whole population. It is expected that part of the women that work full-time have a
negative wage elasticity, thus lowering average elasticity, while small part-timers are much more
elastic.
6. Policy simulation
The previous section showed that the labour supply model does a good job at predicting the labour
supply distribution in the Netherlands. However, the estimated elasiticities for women are rather
low compared to previous studies. The next step is to simulate some policy measures. In this
section the labour supply effects of tax credits for employees are presented as a first application of
the recently developed labor supply model. I focus on two important tax credits: a labour tax
credit for employees (in Dutch: arbeidskorting; AK) and a labour tax credit for employees, who are
second-earners or sole parents with young children (in Dutch: inkomensafhankelijke
combinatiekorting; IACK). Evers et al (2007) simulate some policy measures with a general
equilibrium model MIMIC, which is developed by the Netherlands Bureau for Economic Policy
20
Analysis (CPB) 15. They conclude for instance that the IACK stimulates labour supply of secondary
earners but at the expense of primary earners’ labor supply.
In the Netherlands, various tax credits exist that give tax payers a reduction on liability for taxes
and social security contributions. Employees for instance receive a general labor tax credit AK
which rises with income. Figure 4 shows the design of this tax credit AK. The maximum tax credit
is equal to € 1.431 and total budget amounts € 7.6 billion in 2005.
Figuur 4: Labor tax credit AK
1600
1400
1200
Tax Credit (AK)
1000
800
600
400
200
0
0
5000
10000
15000
20000
25000
30000
35000
40000
Income
For employees with (young) children, there is an extra tax credit: the IACK. Secondary earners, or
single parents, with a child younger than 12 years, are entitled to this tax credit. Figure 5 shows
the structure of the IACK in 2009. The IACK depends on the level of income: starting from an
individual income of € 4.368 (deflated to 2005 prices), the IACK increases with a rate of 3.8% until
the maximum of € 1.722 (deflated to 2005 prices) is reached. The fiscal loss due to the IACK is
approximately € 1.1 billion.
Figuur 5: Labor tax credit IACK
2000
1800
1600
Tax credit (IACK)
1400
1200
1000
800
600
400
200
0
0
5000
10000
15000
20000
25000
30000
35000
40000
Income
15
The CPB memorandum 143, Tax reform and the Dutch Labor Market: an applied general equilibrium
approach provides a nice description of this model.
21
Table 7 shows the first preliminary results of the recently developed labor supply model. Complete
abolishment of the tax credit for all employees (AK) would reduce labor supply of single men by
0.7% and single women by 1.1%. The drop in labor supply is higher for single women, which is
intuitive since the average elasticity is higher as well. For couples a similar story applies.
Surprisingly, the labor supply effect is smaller for women with a partner compared to single women
despite the fact that the elasticity is slightly higher. This is probably due to the fact that only 30%
of single women have children whereas 60% of cohabiting women have children. One would expect
that women with children are on average less elastic compared to women without children.
As a second application of the labor supply model, elimination of the tax credit IACK is simulated.
According to first preliminary results, the total labor supply effect (aggregated over the whole
population) of the IACK is negligible. The total sum of the tax credit IACK is modest (€ 1 billion)
compared to the tax credit AK (€ 7.6 billion) which explains why the labor supply effect of the tax
credit AK is much higher. Only single parents, and secondary earners, with young children receive
the tax credit IACK.
Table 6: Labor supply effects tax credits employees
Baseline scenario
Tax credit AK
Tax credit IACK
Single men
Labor supply
(average in hours)
Labor supply effect
(in %)
32.50
32.27
32.50
-0,7%
0.0%
Single women
Labor supply
(average in hours)
Labor supply effect
(in %)
25.78
25.51
25.77
-1.1%
0.0%
36.26
36.32
-0.2%
-0.0%
17.77
17.91
-0.8%
-0.1%
Couples, men
Labor supply
(average in hours)
Labor supply effect
(in %)
Couples, women
Labor supply
(average in hours)
Labor supply effect
(in %)
36.33
17.92
22
8. Conclusion
This paper presents a recently developed labor supply model for the Dutch tax benefit model
MICROS. The ultimate goal of this study is to develop a labor supply model that plays an important
role in the decision making process in the Netherlands. The labor supply model uses a discrete
choice framework and the estimates are based on cross section data from 2005. The model does a
good job at predicting the labor supply disttibution for single men, women and couples. First
preliminary results show, however, that elasticities are surprisingly low. That is, the labor supply
model estimates elasticities of 0.10 for single men and 0.15 for single women. For couples, the
elasticities are 0.07 for men and 0.20 for women. The low elasticities for men are in line with other
studies. However, most studies find higher elasticities for women. An explanation in favor of our
preliminary results is that other Dutch studies use older datasets. Over the years participation of
women increased and this in turn lowered their elasticities. On the other hand, the recently
developed labor supply model is still under construction and one needs to interpret these first
preliminary results with caution. In the near future, refinements of the labor supply model will take
place.
23
Literatuur
Bargain, O. (2005), On modelling Household Labor Supply with Taxation, IZA discussion paper
1455
Blundell, R., Duncan, A. and C. Meghir, Estimating Labor Supply Responses Using Tax
Reforms, Econometrica, 66(4), pp. 827-861
Bosch, N. And B. Van der Klaauw (2009), Analyzing Female Labor Supply: Evidence from a Dutch
Tax Reform, IZA discussion paper, 4238
Bovenberg, A.L., Graafland, J.J. and R.A. de Mooij (1998), Tax reform and the Dutch Labor Market:
an applied general equilibrium approach, CPB memorandum 143, The Hague.
Cameron, C.A. and P.K. Trivedi (2005), Microeconometrics Methods and Applications, Cambridge
University Press
Creedy, J. and G. Kalb (2005), Discrete Hours Labour Supply Modelling: Specification, Estimation
and Simulation, Journal of Economic Surveys, 19(5), pp.697-734
Evers, M., de Mooij R. And D. Van Vuren (2007), Arbeidsaanbodelasticiteit en beleid, Economisch
Statistische Berichten (ESB), pp. 171-173
Evers, M., de Mooij R. And D. Van Vuren (2008), The Wage Elasticity of Labour Supply: A Synthesis
of Empirical Estimates, De Economist, pp. 25-43
Haan, P. (2005), Much Ado About Nothing: Conditional Logit vs. Random Coefficient Models for
Estimating Labour Supply Elasticities.
Haan, P. en A. Uhlendorf (2006) Estimation of multinomial logit models with unobserved
heterogeneity using maximum simulated likelihood. The Stata Journal, 6(2): pp. 229-245.
Heckman, J. (1979), Sample Selection Bias as a Specification Error, Econometrica, 47(1), pp. 153161
Keane, M. and R. Moffitt (1998), A Structural Model of Multiple Welfare Program Participation and
Labor Supply, International Economic Review, 39(3), pp. 553-589
Lungerd, S, Pollak, R. and T.Wales (1997), Do Husbands and Wives Pool Their Resources? Evidence
from the United Kingdom Child Benefit, Journal of Human Resources, 32(3) pp. 463-479.
24
McFadden, D. (1973), Conditional Logit Analysis of Qualitative Choice Behaviour, In Frontiers in
Econometrics, Academic Press New York, pp.105-142
Nelissen, J.H.M., Fontein, P.F. and A. van Soest (2005), The Impact of Various Policy Measures on
Employment in the Netherlands, The Japanese Journal of Social Security Policy, 4(1), pp. 17-32
Train, K.E. (2003), Discrete Choice Methods with Simulation, Cambridge University Press
Van Soest, A. (1995), Structural Models of Family Labor Supply: A Discrete Choice Approach, The
Journal of Human Resources, 30(1), pp. 63-88
Van Soest, A. and M. Das (2000), Family Labor Supply and Proposed Tax Reforms in the
Netherlands, De Economist, pp. 191-218
Van Vuuren, D. en R. Euwals (2006), De structurele groei van het arbeidsaanbod op
middellange termijn, CPB Memorandum, 155, Den Haag: Centraal Planbureau
25