Effects of an Enforced Activation Policy on Firm Profits

Profiting from Presenteeism? Effects of an
Enforced Activation Policy on Firm Profits
Anna Godøy∗
December 11, 2014
Abstract
Activation requirements and graded benefits are a strategy for reducing social insurance costs in comprehensive welfare states. In Norway, a
policy of issuing graded rather than full time sickness absence certificates,
is viewed as a strategy not just to reduce direct costs of sick pay but also
to facilitate returns to work and reduce inflows to permanent disability.
This paper analyzes effects of partial sick leave on firm profits, on average and across different firm groups. An instrumental variable approach
exploiting variation in physicians’ certification practices is implemented
to identify causal effects of graded absence. On average, graded sickness
absence is found to increase firm profits at high skill firms; the effects are
strongest at the top of the productivity distribution. For low skill firms,
grading has no significant effects on profitability.
Keywords: absenteeism, productivity, activation, work loss
JEL Classification Numbers: I18, I38, J30, J48
1
Introduction
Activation policies have become a feature of different social insurance arrangements. In the case of sick pay, activation requirements in the form of graded
absence certificates have been found to reduce the time it takes to find work, as
well as reduce inflow to permanent disability benefits. There is limited evidence
on the effect these kinds of activation policies have on the firm side. This paper
investigates the impact of graded sickness absence on the profitability of firms.
The a priori effects of such activation policies on firm profits are ambiguous. On the one hand, absenteeism is generally perceived to be costly for firms,
so reducing absenteeism by having workers come in part time could have positive effects on profits. This is most clearly the case in regimes where the firm
∗ Institute for Social Research,
P. box 3233 Elisenberg, 0208 Oslo, Norway.
[email protected]. The research has received support from the Norwegian Research
Council (grant number 227117). Data made available by Statistics Norway have been essential.
1
continues to pay wages to absent workers on temporary disability. Moreover,
the cost of absence is typically thought to be higher than direct wage costs,
due to disruption of the firm production process. As a result, firms tend to be
concerned with absence rates even when firms do not cover sick pay for temporarily disabled workers, i.e. when sick pay is covered by the government or
in systems without sickness benefits. Issuing graded rather than part time sickness certificates could reduce the disruption of the production processes, which
would increase firm profits. In this way, activation policies reducing the degree
of absence also reduce the cost of absence, improving firm outcomes.
On the other hand, more use of graded sickness absence would mean that the
workers who actually show up to work on average have worse health, and thus
may be less productive, that is, there may be costs associated with increased
presenteeism. Full time absence may be preferable for firms if absent workers
are less productive per hour. Graded absence certificates are often implemented
by a proportional reduction in working hours - that is a worker who is assigned a
50% absence certificate will work 50% of their contracted hours. Put differently,
with graded absence certificates, the employer might end up paying full price
for workers who, due to a temporary disability, are less productive than usual.
Finally, fulfilling such activation requirements during sickness absence may have
direct implementation costs for firms, for instance some medical conditions may
require the employer to adjust the physical working environment.
The impact of graded sickness absence on firms’ outcomes should be of interest to policymakers for a number of reasons. The policy of issuing graded rather
than full time absence certificates has been found to lead to shorter absences
and higher re-employment probabilities (Høgelund et al. 2010, Viikari-Juntura
et al. 2012, Markussen et al. 2012) reducing the cost of sickness absence to the
public. To get a more complete view of the economic consequences of grading,
we should also look at costs incurred by the employers. By doing this, we can
find out whether grading reduces the costs of health related absences to all involved parties, as opposed to simply shifting the costs of sickness absence from
the taxpayers to the individual firms.
If grading impacts firm profitability, the incentives of firms to manage absence might also change. In Norway, as in several European countries, sick
pay for long term absence spells is covered by the government. In this kind of
institutional environment, economic theory suggests we should expect a moral
hazard problem. As firms are insured against the direct costs of long term
sickness absence, they will exert too little effort in keeping absence rates low
(OECD 2010). Empirical evidence suggests this is indeed the case: Fevang et al.
(2014), using data from a Norwegian reform removing firms’ copay for pregnant
women’s sickness absences, found significant increases in absence rates. If grading sickness certificates increases the costs of absence to firms, this could be
one channel explaining the beneficial effects of grading on patients’ labor market outcomes. That is, as graded benefits become more common, long term
absences become more costly for firms, giving firms stronger incentives to reduce absence rates. Conversely, if graded absences are less costly, moral hazard
problems may become more important.
2
At the same time, if grading has an impact on firm profits, this may also
affect firms’ hiring strategies. The willingness of firms to hire workers with
poor health status likely depends on the perceived cost of future absence spells.
If partial sick leave increases the cost of sickness absence relative to full time
absence, this could make firms more reluctant to hire workers with a history of
long sickness absences.
As a first step in investigating this issue, we could compare profits of firms
with mostly full time sickness absence to firms with higher shares of partial
sickness absence. However, this is problematic for several reason. First, absence
spells are more likely to be full time if they are expected to last a short time.
For this reason, firms with higher shares of partial sickness absence will also
typically have higher total absence rates.
Even controlling for firm total absence rates and diagnosis groups, this exercise risks giving biased estimates if some types of firms have an easier time
accomodating part-time sickness absence than others. The decision of whether
a sickness absence spell should be full time or part time is typically made following meetings between the patient, the certifying physician and the employer;
the preferences of the employer and the nature of work are both likely to influence the grading decision. For instance, some highly productive firms may
have a higher requirement for how healthy a worker needs to be in order to be
productive. Controlling for worker characteristics, these firms would have low
rates of graded absence (as the threshold for returning to work would be high)
and high profitability, making it seem as though there is a negative relationship
between grading and profits even in the absence of any causal effect.
To identify the effect, I will exploit variation the practice style of physicians.
There is significant variation in the practice style of physicians when it comes
to issuing graded rather than full time absence certificates. Linking employees
to physicians, we can estimate effects of the average practice style of the workers’ doctors on the profits of the firm. Some firms have workers whose doctors
regularly prescribe part-time rather than full time sickness certificates. Other
firms’ workers may have doctors who - on average - prefer to issue full time absence certificates. Comparing profits across firms with different types of doctors
should help uncover the causal effects of activation on firm profitability.
The identification strategy used in the present paper was first introduced by
Markussen et al. (2012), who used variation in physicians’ certification practices
to identify effects of activation requirements during long term certified sickness
spells on individual labor market outcomes. The authors find that receiving
a partial sickness absence, rather than being absent full time, leads to shorter
absence spells and higher employment rates. In particular, the positive effects of
grading on the probability of returning to employment suggests that activation
grading might have a positive effect on firm profits, as the firm will not have to
incur costs of hiring and training new workers.
The present paper is related to a literature on the cost of sickness absence
to firms. If absence is costly to the firm above and beyond direct wage costs,
we might expect part-time absences to reduce some of these costs. Pauly et al.
(2002) present a theoretical model of the costs of employee absence. Absence
3
is found to be more costly when there is team production and where substitute
workers are difficult to find. In these cases, the firm’s cost of absence will be
higher than the wage costs. Nicholson et al. (2006) use survey data on perceived
costs of absences and job characteristics to empirically evaluate the implications
of this model. The authors find that absence costs typically exceed wage costs,
with an average figure of 28%. When disaggregating results by profession, excess costs were found to be particularly high for high skill occupations (e.g.
engineers), and lower skill jobs working closely with high skill professionals (e.g.
paralegals). For low skill jobs (fast food cooks, waiters etc,) there was little excess cost of absence above the wage rate. From their findings, we should expect
positive effects of grading, in particular for high skill firms.
There is also a large related literature concerned with measuring the impact
of various health conditions on firm performance. These studies typically focus
on the impact of specific medical conditions, for example depression (Stewart
et al. 2003, Simon et al. 2001), back pain (van Tulder et al. 1995). Collins et al.
(2005) use survey data from a single US corporation to assess the cost of chronic
health conditions to the firm from treatment costs and output loss from absenteeism and on-the-job productivity loss (”presenteeism”). The authors conclude
that the cost of presenteeism are large and higher than the costs associated with
absences. These studies are focused on a somewhat different question than the
present paper - attempting to estimate the full impact of health conditions, taking into account treatment costs - whereas the present paper seeks to identify
effects on the firm of absence patterns for a given (initial) health status.
The contribution of this paper is twofold. First, it contributes to the literature on costs of sickness absence to firms, by analyzing detailed register data
on the full population of firms. Second, it contributes to the literature on activation strategies in social insurance arrangements by explicitly discussing the
role of the firm.
The rest of the paper is organized as follows. Section 2 gives a brief overview
of the relevant institutional context and presents the data used in the analysis.
The physician grading propensity instrument is described in section 3. Section
4 presents the empirical models and the estimated effects, and section 5 presents
some concluding remarks.
2
Institutions and data
In Norway, persons who are unable to work due to sickness can receive sickness
benefits at 100% replacement ratio for sickness spells lasting up to 12 months.
Workers who are absent up to 12 months benefit from additional job protection,
meaning they cannot be fired for reasons related to their absence. While very
short absences - typically up to three days - can be self certified, absences lasting
longer than this need to be certified by a doctor. The first 16 days of each spell
sickness benefits are paid by the employer, after that benefits are paid by the
government. In other words, firms incur the largest direct costs of sickness
absence for relatively short sickness absence, while the direct costs of long term
4
absences will be considerably lower. After 12 months, the worker is no longer
entitled to sick pay, but is eligible for other benefits with significantly lower
replacement ratios (typically around 66%). Job protection is also reduced after
12 months absence.
Employers are required by law to make necessary accommodations to facilitate return to work. For longer absence spells, the responsibilities of the
firm become more involved: Within four weeks, the employer should provide a
”follow-up”-plan. At 8 weeks, the certifying physician and the Labour and Welfare Administration (NAV, the agency responsible for paying sickness benefits)
meet to formally discuss possibilities of returning to work.
Absence certificates will state the degree of disability; the minimum absence
grade is 20%. A worker who is issued a graded absence certificate -that is, less
than 100% disability - can comply in two ways: The first, and most common way,
is by working the number of contracted hours multiplied by the assigned grade,
at full capacity. For example, a full time worker who is issued a 40% absence
certificate could fulfill this requirement by working two full days a week. The
second way a graded absence can be implemented is by working a larger number
of hours at a reduced work capacity, e.g. a full time worker can be issued a 40%
absence certificate, be present at work four full days a week and work at 50%
capacity.
In this paper, data on physician certified absences is merged to data on firm
performance to shed some light on how these activation -requirements affect
firm profits. The main sample is a firm-year dataset of annual profits, firm
characteristics and sickness absence rates. The starting point of the sample is
data from 2004 to 2010 on all private sector limited liability companies where
accounting data is available. Using this accounting data, the return on assets
(ROA) is defined as operating profits divided by total assets. This data is linked
to information on the firm’s location and industry classification.
The sample is then linked to demographic and educational data on the workers. This data includes information about gender, age, education and immigrant
status. Finally, data on doctor-certified sickness absence is added to the sample.
Absences are classified as full time if the disability rate is 100%, and part time
if it is less than 100%.
Firm-years with missing data are excluded from the sample. The final sample
contains 167,107 firm-year observations. Table 1 shows some summary statistics
of this firm level sample. A large majority, 91% of firms experience some sickness absence each year, with the average sickness absence rate being at 5.0%.
Meanwhile, about 50% of firms experience any part-time sickness absence.
These numbers mask substantial variation in absence patterns between firms.
Specifically, there appear to be differences in both the level of absence rates
between high and low skill firms. Figure 1 shows the evolution of the average
rate of sickness absence and the average share of absence days that are graded
during this period. In this figure, firms are classified according to the education
background of their employees. Firms are classified as ”high education” if more
than 33% of employees have at least a college degree, and low education if less
than 10% of workers have a college degree.
5
Table 1: Summary statistics, main sample
Return on assets
Employment
Fraction female
Fraction immigrant
Avg employee age
Share college degree
Sickness absence (all)
Sickness absence (partial)
Any sick absence
Any partial sickness absence
Observations
mean
0.0715
20.30
0.377
0.0778
37.77
0.221
0.0498
0.0133
0.908
0.498
167107
sd
0.142
67.16
0.322
0.151
7.447
0.242
0.0515
0.0256
0.290
0.500
Note: Figure shows summary statistics of the panel of firms (2004-2010).
Figure 1: Absence rates, 2004-2010
Share of absence days graded
.15
.035
.04
.2
.045
.05
.25
.055
.06
.3
Average absence rates
2004
2006
2008
2010
2004
Year
2006
2008
2010
Year
Low education
All
High education
Note: Figure shows certified absence rates, calculated as the fraction of worker-years with any
sickness absence, not adjusted for grading. The share of absence graded is calculated as the
graded absence rate relative to the total average absence rate. Low education firms are firms
where 10% or less have a college degree; high education firms are firms where at least 33% of
workers have a college degree.
6
Figure 2: Grading and absence rates
0
.1
Share graded
.2
.3
.4
.5
Avg. share of absence graded and avg. absence rate
0
.05
.1
.15
Absence rate
.2
.25
Note: Figure shows a binned scatter plot of the share of absence that is graded and the average
absence rate in firms.
The left panel shows the evolution in average absence rates. High skill firms
have on average lower rates of sickness absence than low skill firms; the difference
is around 2 percentage points and appears to be stable over time. There has
been an upward trend in the use of graded absence certificates. In 2004, 17.7% of
the average firm’s absence days were associated with spells with graded absence
certificates; in 2010, that fraction had grown to 22.9%. Looking at the panel
on the right, graded absence certificates are used more often in high skill firms
than in low skill firms.
Initial costs associated with accommodating the needs of employees returning to work on a graded absence certificate might make doctors less likely to
issue graded absence certificates Figure 2 shows the relationship between the
average annual absence rate (not adjusted for absence grade) and the fraction
of absence days that are graded. There appears to be a positive relationship
between absence rates and the fraction of the absence that is graded, which
flattens out at high absence rates.
Finally, to motivate the formal analysis to follow, figure 3 plots the relationship between the share of absence graded and firm profits. Figure 3 plots return
on assets (ROA) and the share of long-term absence - absence excluding the first
16 days of each spell - that is graded. As discussed above, short term absence
has a higher direct cost to firms, and short lasting spells are also less likely to
be graded. If short term spells were included when computing the average share
of graded spells, we would capture in part the relationship between short term
absence and profits, rather than the relationship between profits and grading.
Figure 3 appears to indicate a positive relationship between grading and
profits. On average, firms where a larger share of long term absence is graded
have higher return on assets, compared to firms where a higher share of absence
is full time. Of course, this relationship could reflect many things and should
7
Figure 3: Return on assets and share graded
.065
.07
ROA
.075
.08
ROA and avg. share of absence graded
0
.2
.4
.6
Share graded
.8
1
ROA and avg. share of absence graded
>16 days only
.065
.065
.07
ROA
ROA
.07
.075
.075
.08
Up to 16 days
0
.2
.4
.6
Share graded
.8
1
0
.2
.4
.6
Share graded
.8
1
Note: Figure shows binned scatter plots of firms’ return on assets (ROA) and the share of
absence that is graded.
8
not be given a causal interpretation. Some of this likely reflects the way grading
is used more often in high skill firms (figure 1), which may be more productive.
Any analysis of this relationship should then at a minimum control for employees’ observable characteristics. Moreover, a higher share of graded absence
could reflect favorable firm characteristics such as better health of the workers,
better quality of management, less workplace conflicts and so on, that are not
observed in the available register data. One way to investigate this further will
be through an instrumental variable approach. The next section presents the
instrumental variable strategy.
3
Physician instrument
As discussed in the previous sections, the decision to issue a graded rather
than a full time absence certificate is typically not random, but correlated with
observed and unobserved characteristics of patients and their work situations.
Data presented in the previous section showed that the use of graded sickness
absence varies across firms in a non-random way: High skill firms have a higher
relative incidence of partial sickness absence than low skill firms. To identify the
causal effect of graded absence on firm profits, I will implement an instrumental
variable strategy. The instrumental variable strategy used follows the work of
Markussen et al. (2012), where the authors use variation in the grading practices
of primary physicians to identify the impact of grading on individual absence
durations and employment outcomes.
The basic premise of the identification strategy is that doctors would differ in
their intrinsic grading propensity, i.e. how likely they are to issue a graded rather
than a full time absence certificate, even when faced with identical patients.
The grading decision is largely left up to the individual physician’s discretion,
as there are few formal guidelines for what diagnoses should be graded. If
doctors have different views on the usefulness of graded absence, or differ in
their willingness to negotiate with less cooperative patients (or their employers),
this could potentially give rise to exogenous variation in the likelihood that an
individual absence is graded. A major challenge to this identification strategy is
that the intrinsic grading propensity is unobserved, but will have to be estimated
using available data on actual grading decisions.
To estimate the grading propensity, I look at all absence spells lasting 8
weeks or longer, observing whether or not there is some incidence of graded
sickness absence in the first 8 weeks of the spell. To avoid confounding leniency
and grading propensity, a cutoff is imposed where only spells lasting longer
than the minimum duration are included in the estimation sample, ignoring
grading decisions that happen after this minimum duration. As discussed previously, doctors are generally more likely to issue full time absence certificates
when absences are expected to be of short duration. A lenient doctor, who is
willing to extend absences for longer, will then conceivably be observed with a
higher share of absence spells graded than a stricter doctor, even if the two have
identical intrinsic grading propensities. Setting the cutoff at exactly 8 weeks is
9
rather arbitrary, but allows for straightforward comparison with the findings of
Markussen et al. (2012).
It should be noted that this procedure does not rule out correlation between
leniency and grading propensity, rather, it merely avoids confounding the two
when they are uncorrelated. A positive correlation is problematic as this could
imply that doctors with high grading propensity on average have healthier patients at 8 weeks absence duration. Markussen et al. (2012) find no evidence
that the two are positively correlated, rather, the correlation is negative (but
small).
There is substantial variation in the observed grading practices of physicians, however this does not necessarily reflect differences in grading propensity.
Rather, this can be the result of differences in the characteristics of the patients
and their employment situations. In the empirical models, early grading will
be modeled as a flexible function of patients’ observable characteristics in order to remove these composition effects. The remaining variation in grading
probability will be interpreted as the doctors’ intrinsic grading propensity.
In order to estimate the doctor effects, an auxiliary dataset is constructed
consisting of long term absence spells merged with doctor identifiers and demographic and employment characteristics of the patients. The starting point of
the sample is all long term absence spells - defined as spells lasting at least 56
days - starting between 2004 and 2010. These absence data include information
on diagnosis and whether there was any grading prior to 8 weeks. Absence
spells are linked via unique person identifiers to the physician at the start of the
spell. Finally, data on demographics (age, gender, marital status, immigration
background, neighborhood), education (level and field) and job characteristics
(occupation and industry) are attached to the sample. The sample only includes
employed persons aged between 20 and 65 at the start of the absence spell.
Table 2 shows some descriptive statistics of the auxiliary sample. The sample
contains a total of 1,394,676 long term absence spells, of which 27% are graded
at some time during the first 56 days. Comparing columns 2 and 3 reveals
some differences in the characteristics of patients whose spell is graded early
and patients who are absent full time throughout the first 8 weeks. Patients
experiencing early grading are more likely to be female, less likely to be immigrants, have somewhat higher earnings, and are more likely to have a college
degree. The total duration of the absence spells are also shorter for persons
with some early grading.
Formally, the early grading decision is modeled as a simple linear probability
model:
partj = xj β 0 + εj
(1)
Where the dependent variable partj is an indicator variable part is constructed equal to 1 if the spell is graded within the first 8 weeks of the spell,
and 0 otherwise. xj is a number of observable characteristics that can be thought
to influence the grading decision, through health status or work situation. xj
contains age (dummies for five-year bins), education (78 dummies representing
10
Table 2: Summary statistics, auxiliary sample
Age
Female
Immigrant
Earnings (1000 NOK)
High school
College
Psychological
Musculoskeletal
Duration
Observations
(1)
All
mean
42.84
0.633
0.0769
338.6
0.443
0.247
0.198
0.379
178.8
1394676
(2)
Early graded
mean
42.11
0.752
0.0655
355.5
0.401
0.336
0.193
0.372
158.9
377337
(3)
No early graded
mean
43.12
0.589
0.0812
332.4
0.459
0.215
0.200
0.382
186.2
1017339
Note: Figure shows descriptive statistics of the auxiliary sample of long term absence spells
(duration >= 56 days). Column 2 represents spells where there was some grading during the
first 56 days of the spell, while column 3 represents spells where all first 56 days of the spell
were certified full time absences.
level and field of study), immigration background, industry and occupation, log
earnings, dummies for calendar month of entry and diagnosis group. xj also includes a set of 13,733 neighborhood indicators. As patients are likely to choose
doctors whose offices are close to where they live, differences in outcomes across
doctors could reflect socioeconomic sorting on unobservables if there is spatial
sorting within cities. Using residence data on a detailed level allows us to control
for spatial sorting in a more convincing way.
Equation (2) is estimated by OLS and the residual ε̂j is calculated for each
spell. ε̂j captures both the impact of unobserved factors related to spell j (severity of illness, precise nature of work), and the practice style of the physician.
These residuals are used to construct physician practice indicators for all patients registered with a primary care doctor, including persons with no sickness
absence spells. For each individual l registered with physician k, the physician
practice indicator is based on the average residual ε̂ of the doctor’s others patients. Letting K denote the set of long term spells certified by of doctor k,
L the set of long term spells belonging to patient l, the average residual for
individual l at doctor k is defined as
ε̂¯kl =
1
Nk − Nlk
X
ε̂j
j∈K,j ∈L
/
Nk is the total number of long term spells certified by doctor k and Nlk is
the number of spells certified by doctor k to patient k. For an individual with
no long term sickness absence, this is simply equal to the unadjusted mean of
11
0
.1
.2
.3
.4
Figure 4: Grading indicators
1
2
3
4
5
6
7
8
9 10 11 12 13 14 15 16 17 18 19 20
Doctors
Firms
Note: Figure plots the distribution of estimated firm-level physician grading indicators across
doctors (dark gray) or firm-years (light gray). The bars show binned averages, with each bar
representing 5% of doctors/firm-years.
the residuals of doctor k’s long term absence spells1 .
To facilitate interpretation, these residuals are added to the average predicted share of graded absences, given by x̄β̂ 0 , where x̄ is the average values of
observables x in the estimation sample, defining the physician grading indicator
of person l with physician k, ψkl :
ψkl = x̄β̂ 0 + ε̂¯kl
(2)
Finally, the individual workers grading instruments are aggregated to form
a firm level instrument ψ̂it . The physicians’ grading practice is less likely to
influence the absence behavior - and possibly firm profits - for workers who do
not have any sickness absence. On the other hand, current absence rates could
be influenced by previous years’ grading decisions. In the main specification, is
defined as the average values of all workers in firm i year t who entered some
certified sickness absence of any duration in the four years from t − 3 to t.
This aggregation is illustrated in figure 4, which plots the distribution of
physician grading indicators across individual physicians and across firm-years
in the sample. The dark gray bars show grouped averages of grading indicators
among doctors (ε̂¯k ), while the light gray bars show similarly constructed grouped
averages of (aggregated) grading indicators across firm-years (ψ̂it ). The columns
1 Excluding individual l’s own residuals from the calculation of physician averageε̂
¯kl in this
way removes a source of bias that would otherwise occur through the direct effect of l’s own
grading outcome on the profits of their employer. A similar bias could occur in a more indirect
way if co-workers tend to cluster around the same physician. In a robustness check to address
this concern, alternative physician averages are constructed where the residuals of all patients
who are working at individual l’s company are excluded from the calculation of ε̂¯kl ; estimated
effects remain virtually unchanged.
12
labeled 1 represent the average grading indicators in the group containing the
lowest 5% of doctors/firm-years, the columns labeled 2 represent the average
grading indicators in the group containing the next 5% of doctors/firm-years
etc.
As the firm-level grading indicators are essentially weighted averages of the
doctor level indicators, we would expect the firm level distribution to be less
dispersed. As indicated by figure 4, this is indeed the case, in particular at
the extremes of the distributions. The interquartile range of the firm level
instrument is 2.41 percentage points - a degree of dispersion that, while smaller
than the interquartile range of the distribution across doctors (6.33 percentage
points), is not insignificant. Meanwhile, a few firms have associated physicians
with predicted grading probabilities that are very low (including some below
zero) or very high. To avoid these few observations driving the results, firms
with the lowest and highest two percent of this distribution are excluded from
the estimation samples.
An objection to the use of this instrument is that patients are free to select
their doctors. If patients ”shop around” for doctors to find one that matches
their preference for full time versus graded absence, IV estimates may be contaminated by sorting effects. As noted by Markussen et al. (2012), sorting is
not necessarily problematic; as long as patients sort themselves to doctors according to observable characteristics that are included in the dataset, sorting
can be controlled for in the empirical models. However, if patients sort according to unobserved characteristics, such as ambition, health or willingness
to work, estimates will be biased. In the empirical section, this is addressed
through implementing robustness tests. In one robustness test, the grading
instrument is calculated using only workers who did not enter any certified sickness absence during the four years from t − 3 to t. If their physicians’ grading
practice are found to influence firm profits, this can be interpreted as evidence
of uncontrolled-for sorting.
As discussed above, grading propensity may be correlated with other aspects
of physician practice style, i.e. leniency. Failing to properly account for this
could lead to omitted variable bias of the IV estimates. For instance, if doctors
with a high grading propensity on average are less lenient, firms with high values
of the grading instrument ψ̂it would be expected to have lower absence rates
on average, all other things being equal, as their employees are less likely to
be granted absence certificates in the first place. This could in turn bias the
correlation between firm profitability and ψ̂it upward.
To address this, I follow Markussen et al. (2012) and estimate physician
indicators of physician leniency, estimating versions of (2) for two additional
outcomes: annual probabilities of receiving one or more sickness absence certificates (of any duration), and receiving one or more long term absence certificates
(absent for 8 weeks or longer). These equations are estimated using a personyear panel of all physicians’ registered patients aged 20-65 who are employed.
That is, the sample includes patient-years without sickness absence. Physicians are matched to their patients at the beginning of each calendar year. The
models include all the controls of equation (2), with the exception of diagnosis
13
categories (as these are not defined for persons with zero absence).
Having estimated these regressions, the predicted residuals are aggregated
by doctor and firm as outlined above. As the sample size is much larger (as it
includes all employed patients, not only on long term absence), physician averages are not corrected for own absences. However, physicians with less than 200
registered employed patients are excluded when computing firm averages. The
firm indicators are calculated using the average of all employees, without conditioning on absence history. These estimated leniency indicators are included
as controls in extended models.
4
Empirical models
To get a first impression of the data, I estimate a simple model of firm profits
yit = xit β x + θt + sickit β s + partit β p + εit
(3)
where yit is the profits of firm i year t (measured by return on assets). xit
is a vector of firm observable characteristics: the share of female workers, share
of immigrant workers, average age of workers and number of employees. θt are
year dummies.
The main variables of interest, sickit and partit represent the average incidence of sickness absence (total) and partial sickness absence, not adjusted for
certification degree. That is, β p captures the differential effect of partial sickness absence on profits: a positive estimate of β p indicates an that part-time
absence is less costly for the firm compared to full time absence. The upper
panel of table 3 contains estimated parameters from estimation of (3) on the
full sample.
A problem with estimating (3) by simple OLS is that both variables of interest sickit and partit are likely to be endogenous. If unobserved characteristics of
the firm influence both firm profits and sickness absence, as well as the proportion of sickness absence that is full time, the estimates will suffer from omitted
variable bias. To the extent that these unobserved characteristics are constant
over time, the problem can be mitigated by including firm fixed effects. Estimates from this specification are shown in column 2 of table 3. These models
find that sickness absence in general is costly for firms. However, for graded
absences, there is an additional positive effect. In the fixed effects specification,
this additional effect is slightly larger than the main effect of absence, indicating
that absence has no negative effects on firm profits as long as it is graded.
This interpretation is problematic however, as the specification does not take
into account the duration of absences. The lower panel of table 3 show estimates
from an alternative specification where I allow for differential effects of absence
that exceeds 16 days. For absences of this length, sick pay is covered by the
government and not by the firm, so the direct expenses will be smaller. In this
specification, long term absence is found to be less costly. The specification
without fixed effects finds significant differences in the costs of graded absence.
For short term absence, graded absence appears to be more expensive than full
14
Table 3: Effects of sickness absence rates
(a) Absence rates
Frac sick
Frac partial
Observations
(1)
OLS
-0.199∗∗∗
(-18.79)
(2)
Firm FE
-0.0881∗∗∗
(-8.77)
0.118∗∗∗
(5.91)
160244
0.0996∗∗∗
(5.28)
160244
t statistics in parentheses
∗
p < 0.10,
∗∗
p < 0.05,
∗∗∗
p < 0.01
(b) Long vs short term absence
(1)
OLS
-0.594∗∗∗
(-13.53)
(2)
Firm FE
-0.352∗∗∗
(-8.61)
Frac long time sick
0.463∗∗∗
(9.49)
0.303∗∗∗
(6.69)
Frac partial
-0.0999
(-0.77)
0.0643
(0.52)
Frac long time sick, partial
0.198
(1.42)
160244
0.0164
(0.13)
160244
Frac sick
Observations
t statistics in parentheses
∗
p < 0.10,
∗∗
p < 0.05,
∗∗∗
p < 0.01
Note: Models include controls for education background of the employees (share of employees
in each of 7 education levels), share of women, share of immigrants, average age, number of
employees, industry (68 categories), municipality and calendar year. Long term absence covers
all absence from day 17 of each spell, corresponding to the start of government refunded sick
pay. Standard error clustered at firm level.
15
time absence. For long term absences, the opposite appears to be true: the
additional effect of graded absence is positive. Both effects however are barely
significant. Moreover, when including firm fixed effects, estimated additional
effects of graded absence become smaller and statistically insignificant.
Including fixed effects does not account for unobserved heterogeneity that
changes over time. To account for this, I use an instrumental variable approach
exploiting residual variation in physicians’ grading propensity. The construction
of the physician practice style instrument is discussed in the previous section.
Consider the following model of firm profits:
yit = xit β + θt + sit γ + εit
(4)
where, as before, yit is return on assets and xit is a vector of firm observable
characteristics. sit is our main variable of interest and represents the share of
long term (> 16 days) sickness absence that was graded (any grade < 100%).
Table 4 shows selected estimates from model (4). The first panel contains
results from estimation of the model on the full sample of firms. From the OLS
estimates, the estimated effects of sit is positive and statistically significant.
Quantitatively, the effect is rather small. A one percentage increase in the share
of long term absence that is graded is associated with an increase of 0.0075
percent relative to the average ROA in the estimation sample (0.0696). The IV
estimates are presented in columns 2 and 3. The physician grading indicator
appears to be a strong instrument for the share of graded absence. Column
3 shows the estimated first stage: a one percentage point increase in the firm
average physician grading propensity increases the share of firm absences that
is graded by 0.617 percentage points. The instrumental variable approach does
not find any significant effects of the graded absence share on firm profitability.
The IV estimates are negative but not statistically significant.
The next set of models investigates whether the effect of graded absences
depends on the nature of work. As discussed in the previous sections, the use
of graded absence is higher in high education firms than in low education firms.
We would expect full time absence to be comparatively more costly when the
absent worker is difficult to replace, either by having the remaining workers
take over some of the responsibilities or by hiring a replacement worker on a
temporary contract. If work is more specialized at knowledge intensive firms, we
would expect the effect of activation to be larger for firms where more employees
have higher education degrees. The ”low education” category covers firms where
less than 10% of workers have a university degree, while the ”high education”
category are firms where at least 1/3 of workers have a university degree.
The estimated first stage is largely similar across the firm types, and the
instrument is strong with F of excluded instrument above 10 for both firm
categories. For low education firms, the IV estimates are slightly negative and
not significant. For high education firms, the IV estimates are positive and
significant. The IV estimates are also much larger in magnitude than the OLS
estimates: A one percent increase in the share of absence that is graded increases
return on assets by an average of 3.8 percent.
16
Table 4: IV estimates
(a) All firms
(1)
OLS
0.00523∗∗∗
(4.00)
Share partial (long)
(2)
IV
0.0429
(0.90)
Grading propensity
Observations
Kleibergen-Paap F-stat
118804
118804
85.59
(3)
First stage
0.617∗∗∗
(9.25)
118804
t statistics in parentheses
∗ p < 0.10, ∗∗ p < 0.05, ∗∗∗ p < 0.01
(b) Low education
(1)
OLS
0.00539∗∗∗
(2.64)
Share partial (long)
(2)
IV
-0.0173
(-0.28)
Grading propensity
Observations
Kleibergen-Paap F-stat
48851
48851
47.25
(3)
First stage
0.703∗∗∗
(6.87)
48851
t statistics in parentheses
∗ p < 0.10, ∗∗ p < 0.05, ∗∗∗ p < 0.01
(c) High education
Share partial (long)
(1)
OLS
0.00624∗∗
(2.41)
(2)
IV
0.303∗∗
(2.44)
Grading propensity
Observations
Kleibergen-Paap F-stat
27745
27745
17.95
(3)
First stage
0.582∗∗∗
(4.24)
27745
t statistics in parentheses
∗ p < 0.10, ∗∗ p < 0.05, ∗∗∗ p < 0.01
Note: Models include controls for education background of the employees (share of employees
in each of 7 education levels), share of women, share of immigrants, average age, number of
employees, industry (68 categories), municipality and calendar year. Low education firms are
firms where 10% or less have a college degree; high education firms are firms where at least
33% of workers have a college degree. Standard error clustered at firm level.
17
Grading appears to matter primarily for high skill firms. The models above
were estimated separately for small firms, at or below the sample median of 10
employees, and larger firms, with 11 or more employees. The IV estimates (not
reported) indicate no significant differential effects by firm size.
The IV estimate of the effect of graded absence on the profitability of high
skill firms seems large compared to the OLS estimates. There may be several
reasons for this: The first reason relates to some implicit sample selection issues: For firms with zero long term sickness absence, the dependent variable of
interest, share of absence graded, is not defined. As a result, these firms are
excluded from the regressions reported in table 4.
Second, the estimates above should be interpreted as local average treatment
effects (LATE). That is, the estimates measure the effects of grading on firms
whose graded absence share was manipulated by the grading practice of the
employees’ physicians. The effects on this group of compliers may be different
compared to effects of graded absence on “never-takers”, whether this is due to
the structure of production making it difficult to work when partially disabled
(e.g. physically and mentally demanding work, like firefighter or surgeon), or if
it is due to the underlying health condition causing the absences, i.e. diagnoses
that lead to complete loss of work function. Increasing the share of graded
absence in this group could conceivably have negative effects, as the disruption
to production would be unchanged while wage costs would increase.
For the instrumental variable strategy here to be valid, the following exclusion restriction should be satisfied: The physicians’ grading propensity should
influence firm profits only through the share of absence that is graded. Whether
or not this condition holds is in this case a matter of interpretation. For instance, Markussen et al, using a similar instrumental variable strategy, find that
graded absence certificates reduces the adjusted duration of the absence spell
and the likelihood of leaving the labor force, compared to full time absence.
This could also have consequences for the firm. In a way, this can be thought
of as one mechanism underlying the positive IV estimates of table 4: A higher
share of partial absence might improve profitability because partial absences
last a shorter time and fewer persons leave the firm for good, meaning there is
less hiring costs. The estimated effects should not be thought of as the effects
of simply changing the composition of absence, keeping all other things equal.
To supplement the IV estimates in table 4, table 5 reports estimates from
reduced form models that estimate the effects of physician practice style on
profits. As could be expected from the IV estimates, physician practice style
has no significant effects for the pooled sample of firms or for the low education
firms. For high education firms, a one percentage point increase in the average physician grading probability increases ROA by 0.000967. Relative to the
average annual ROA for high education firms in the sample (0.081), a one percentage point increase in the propensity of physicians to issue graded absence
certificates increases profitability by 1.2%.
A central assumption that should hold in order for the reported estimates to
have a causal interpretation is that any non-random sorting of firms to doctors
should be controlled for by the vector of observables xit . That is, the estimated
18
Table 5: Reduced Form, Firms
Grading propensity ×100
Observations
(1)
All
0.000224
(0.89)
152392
(2)
Low ed
-0.000133
(-0.34)
60595
(3)
High ed
0.000974∗∗
(2.02)
39427
t statistics in parentheses
∗ p < 0.1, ∗∗ p < 0.05, ∗∗∗ p < 0.01
Note: Models include controls for education background of the employees (share of employees
in each of 7 education levels), share of women, share of immigrants, average age, number of
employees, industry (68 categories), municipality and calendar year. Low education firms are
firms where 10% or less have a college degree; high education firms are firms where at least
33% of workers have a college degree. Standard error clustered at firm level.
effects of physician practice should reflect the effect of grading at the firm and
not uncontrolled for sorting. To check this, a robustness check is implemented
where the physician instrument at the firm level is calculated using only employees with no new sickness absence that started in the current year and the three
preceding years. For this group of workers, their absence patterns is unlikely
to have been influenced by their physician’s willingness to assign graded rather
than full time absence certificates. In the absence of sorting, there should then
be no effect of the instrument on firm profits.
Table 6: Placebo regression: never-absent workers
Grading propensity ×100
Observations
(1)
All
0.000142
(0.67)
140638
(2)
Low ed
-0.000133
(-0.43)
54612
(3)
High ed
0.000464
(1.05)
37208
t statistics in parentheses
∗ p < 0.1, ∗∗ p < 0.05, ∗∗∗ p < 0.01
Note: Placebo regression: Firm grading indicator constructed using only workers who did not
enter any sickness absence during the years t − 3 to t. Models include controls for education
background of the employees (share of employees in each of 7 education levels), share of women,
share of immigrants, average age, number of employees, industry (68 categories), municipality
and calendar year. Low education firms are firms where 10% or less have a college degree;
high education firms are firms where at least 33% of workers have a college degree. Standard
error clustered at firm level.
Results from this exercise are shown in table 6. The estimated effects of
grading propensity are not statistically significant for any firm category. While
the point estimates are have the same sign as the estimates in table 5, estimates
are closer to zero for all firms pooled together and for the high education firms
subsample. Overall, this exercise indicates that uncontrolled for sorting is not
19
driving the estimated effects of grading.
The estimates presented so far may be suffering from omitted variable bias if
physician grading practice is correlated with other aspects of physician practice
style. In particular, if doctors with high grading propensities are also ”stricter”
in the sense that they are less likely to issue absence certificates in the first place,
the estimated positive effects of grading propensity could be biased upward. Table 7 show estimates from extended models that include firm level indicators of
physician leniency. The leniency indicators included in these models reflect the
propensity of the employees’ physicians to issue absence certificates in general,
however, models where leniency indicators are based exclusively on issuance of
long term absence certificates yield similar results. As with the grading indicators, physician indicators are constructed in a way that controls for patient
characteristics including neighborhood of residence, demographics and work situation (see previous section for details).
Physician leniency is found to negatively impact firm profits (though the
estimates are not statistically significant). The estimated effects of grading
propensity are remarkably robust to the inclusion of these additional physician
characteristics.
Table 7: Extended models: physician leniency
Grading propensity ×100
Leniency indicator
Observations
(1)
All
0.000191
(0.75)
(2)
Low ed
-0.000171
(-0.44)
(3)
High ed
0.000953∗∗
(1.98)
-0.0825
(-1.54)
152391
-0.0755
(-0.93)
60594
-0.103
(-1.02)
39427
t statistics in parentheses
∗
p < 0.1,
∗∗
p < 0.05,
∗∗∗
p < 0.01
Note: Extended models. Models include controls for education background of the employees
(share of employees in each of 7 education levels), share of women, share of immigrants,
average age, number of employees, industry (68 categories), municipality and calendar year.
Low education firms are firms where 10% or less have a college degree; high education firms
are firms where at least 33% of workers have a college degree. Standard error clustered at
firm level.
So far, the estimated models have focused on impacts on the mean. The
impact of activation could be differ along the productivity distribution. If the
partial presence of a sick worker is most useful for firms that are close to failure, we would expect to see the largest impacts at the lower end of the firm
profitability distribution. However, it could also be that full time absence is the
most costly when times are good and there is a lot of work to do. In this case
we would expect the largest impacts on the top of the firm productivity distribution. To investigate this, I simultaneously estimate conditional quantiles at
20
the 25th, 50th and 75th percentile of the distribution of ROA.
Table 8: Quantile regressions
q25
Grading propensity ×100
q50
Grading propensity ×100
q75
Grading propensity ×100
Observations
(1)
All
(2)
Low ed
(3)
High ed
0.000209
(1.32)
-0.000257
(-1.03)
0.000674
(1.58)
0.000161
(0.71)
-0.000492∗
(-1.76)
0.000958∗∗∗
(2.84)
0.0000911
(0.32)
152393
-0.000398
(-0.89)
60595
0.00130∗∗∗
(3.11)
39427
t statistics in parentheses
∗ p < 0.10, ∗∗ p < 0.05, ∗∗∗ p < 0.01
Note: Models include controls for average age of workers, share of women and calendar year.
Low education firms are firms where 10% or less have a college degree; high education firms
are firms where at least 33% of workers have a college degree. Standard error clustered at
firm level.
Results are shown in table 8. For the full sample of firms, the estimated
effects of grading practice are positive across the distribution, however they are
(barely) significant only at the higher parts of the profit distribution. As before,
the impact of grading on low education firms is negative; for the median the
effect is significant at the ten percent level. For high education firms, the effects
of grading propensity on the 25th percentile are not statistically significant,
while effects are larger and significant for the 50th and 75th percentile. Overall,
the grading propensity of firms appears to be the most important for high
productivity, high education firms.
To summarize, the likelihood that a doctor issues a part time rather than a
full time absence certificate varies across individuals. Firms whose patients have
doctors with higher grading propensity have a higher share of partial absence.
Using this variation in graded absence rates, estimated models find that grading
appears to increase profits compared to full time absence. The effects are largest
for more knowledge intensive firms, defined as firms where a large share of
workers (> 33%) have a college education.
5
Conclusions
This paper asks the question of how firm profits are affected by activation requirements for workers on sickness absence. Descriptive statistics show a positive relationship between the share of long term absence that is graded and
21
the profits of the firm, as measured by the return on assets (ROA). This could
indicate that part time absence may be less costly for firms when the alternative is full time absence. Meanwhile, the use of graded rather than full time
absence certificates is likely to vary with observed and unobserved characteristics of firms. Data on use of graded sickness absence certificates indicate that
patients who are employed at high skill firms, defined as firms where a large
share of workers have a college education, are more likely to be issued a graded
absence certificate compared to people working at low skill firms.
An instrumental variable approach used to identify the causal effect of grading on firms finds that effects vary according to firm observable characteristics.
A strategy of issuing part time rather than full time absence certificates is found
to increase firm profits at high skill firms. The effects are largest and most significant at the top of the firm productivity distribution. For low skill firms, the
effects of grading are small and negative, though not statistically significant at
conventional significance levels.
Policymakers in comprehensive welfare states frequently face two potentially
contradictory policy goals: on the one hand, providing a generous welfare state
while on the other hand maintaining incentives for people to be economically
self-sufficient. The policy of encouraging physicians to issue graded rather than
full time absence certificates can be thought of as an attempt to reconcile these
goals (Markussen et al. 2012). Ideally, grading should reduce the costs to the
public of sickness absence, while still providing a high level of insurance to
workers with poor health.
This policy may be less desirable if grading simply shifts costs from taxpayers to firms. Grading could be costly to firms, through increased costs of
presenteeism (if workers are less productive than usual) or through direct costs
of accommodating partially disabled workers. Overall, the findings in this paper
indicate that such concerns are unwarranted. It would seem that the gains from
reduced interruptions of the production process generally outweigh the costs of
accommodating workers returning part time. This appears to be especially true
for high skill, highly productive firms, where grading has a net positive impact
on profitability.
References
Collins, J. J., Baase, C. M., Sharda, C. E., Ozminkowski, R. J., Nicholson, S.,
Billotti, G. M., Turpin, R. S., Olson, M. & Berger, M. L. (2005), ‘The assessment of chronic health conditions on work performance, absence, and total
economic impact for employers’, Journal of Occupational and Environmental
Medicine 47(6), 547–557.
Fevang, E., Markussen, S. & Røed, K. (2014), ‘The sick pay trap’, Journal of
Labor Economics 32(2), 305–336.
Høgelund, J., Holm, A. & McIntosh, J. (2010), ‘Does graded return-to-work
22
improve sick-listed workers’ chance of returning to regular working hours?’,
Journal of health economics 29(1), 158–169.
Markussen, S., Mykletun, A. & Røed, K. (2012), ‘The case for presenteeism—evidence from norway’s sickness insurance program’, Journal of Public Economics 96(11), 959–972.
Nicholson, S., Pauly, M. V., Polsky, D., Sharda, C., Szrek, H. & Berger, M. L.
(2006), ‘Measuring the effects of work loss on productivity with team production’, Health Economics 15(2), 111–123.
OECD (2010), Sickness, Disability and Work: Breaking the Barriers; A Synthesis of Findings Across OECD Countries, OECD.
Pauly, M. V., Nicholson, S., Xu, J., Polsky, D., Danzon, P. M., Murray, J. F.
& Berger, M. L. (2002), ‘A general model of the impact of absenteeism on
employers and employees’, Health Economics 11(3), 221–231.
Simon, G. E., Barber, C., Birnbaum, H. G., Frank, R. G., Greenberg, P. E.,
Rose, R. M., Wang, P. S. & Kessler, R. C. (2001), ‘Depression and work productivity: the comparative costs of treatment versus nontreatment’, Journal
of Occupational and Environmental Medicine 43(1), 2–9.
Stewart, W. F., Ricci, J. A., Chee, E., Hahn, S. R. & Morganstein, D. (2003),
‘Cost of lost productive work time among us workers with depression’, Jama
289(23), 3135–3144.
van Tulder, M. W., Koes, B. W. & Bouter, L. M. (1995), ‘A cost-of-illness study
of back pain in the netherlands’, Pain 62(2), 233–240.
Viikari-Juntura, E., Kausto, J., Shiri, R., Kaila-Kangas, L., Takala, E.-P., Karppinen, J., Miranda, H., Luukkonen, R. & Martimo, K.-P. (2012), ‘Return to
work after early part-time sick leave due to musculoskeletal disorders: a randomized controlled trial’, Scandinavian Journal of Work Environment and
Health 38(2), 134.
23