The effects of cognitive and noncognitive skills

Explaining social policy preferences: The effects
of cognitive and noncognitive skills
Daniel Stegmueller
Department of Government, University of Essex
[email protected]
Abstract
What explains individuals’ social policy preferences? This paper argues for the relevance of
basic cognitive and noncognitive skills, which so far have not received much attention in
empirical political economy models. It provides empirical evidence on how innate cognitive
abilities and noncognitive skills, such as self-confidence and motivation, affect individuals’
preferences. They do so by shaping labor market outcomes, such as educational choices,
permanent incomes, and experiences of unemployment spells. This mechanism is tested
explicitly by building on recent advances in the statistical analysis of causal mechanisms
using high-quality data from the German Socio-Economic Household Panel. In contrast to
recent claims made by psychologists, I find that higher cognitive and noncognitive abilities
lead to more conservative social policy preferences. My results also confirm that this effect
is due to the causal link between skills, labor market outcomes, and preferences.
1. Introduction
A recent wave of research in political economy moves beyond country-level analyses and
tries to understand the social policy preferences of individual citizens (e.g., Iversen and
Soskice 2001; Alesina and La Ferrara 2005; Cusack et al. 2006; Scheve and Stasavage
2006; Shayo 2009; Rehm 2011; Rehm et al. 2012; Margalit 2013). However, individual
traits like intelligence and self-confidence are ignored in this literature. In this paper
I show empirically that an individual’s basic set of skills – cognitive abilities as well as
noncognitive capacities, such as self-confidence and motivation – systematically shapes
social policy preferences. I provide direct evidence for the mechanism linking skills and
policy preferences. Building on recent advances in causal mediation analysis (Imai et al.
2010a, 2011), I show that an individual’s labor-market characteristics, i.e., education,
permanent income, and unemployment experiences, mediate the effect of both cognitive
and noncognitive skills on social policy preferences.
This extends the vast majority of the literature, which focuses on socio-economic factors,
such as social class (Svallfors 2006), current and future income (Benabou and Ok 2001),
welfare attitudes and beliefs (Alesina and Angeletos 2005), or on ‘cultural’ factors such as
religion (Scheve and Stasavage 2006). Recent research has stressed the insurance aspect of
social policy and focuses on risks due to holding specific skills or occupational unemployment
patterns (Iversen and Soskice 2001; Cusack et al. 2006; Rehm et al. 2012). What all these
approaches have in common is that they use variables which are not exogenously determined,
but the result of individual choices and behavior. For example, skill specificity is a function
of one’s choice of education; income is determined by one’s ability and motivation, which
also influence the probability of becoming unemployed. I argue that these factors are
endogenous to an individual’s cognitive and non-cognitive skills, which should be included
in models of social policy preferences and examined as their distal causes.
The papers’ main contribution is empirical and methodological. I develop a mixed outcome measurement model for cognitive and noncognitive skills, which explains observed
ability test scores, psychological test items, and observed labor market status variables.
Furthermore, I distinguish between true (latent) preferences and observed survey responses,
and use an ordinal ideal point model to measure individuals’ policy preferences. Jointly
estimating these components allows me to examine the effects of cognitive and noncognitive
skills on preferences, while controlling for measurement error in each component. Using a
2
causal mediation modeling strategy allows me to disentangle effects of skills on preferences
that operate via labor market variables from unspecified alternative explanations.
2. Cognitive skills, noncognitive skills, and social policy preferences
Cognitive skills.
My claim regarding the importance of skills is supported by a wave of
recent research, which demonstrates the importance of cognitive abilities for labor market
outcomes (e.g., Cameron and Heckman 1993; Cawley et al. 2001; Green and Riddell 2003;
Bronars and Oettinger 2006; Heckman et al. 2006; Strenze 2007; Anger and Heineck 2010).
Cognitive skills (or ‘intelligence’) represents the general ability of efficient “highly general
information processing” (Gottfredson 1997: 81), which determines individuals’ performance
in education and the labor market (Jensen 1998). Higher cognitive abilities are not just
relevant for academic pursuits. Personell psychologists generally agree on the fact that they
are relevant for a wide range of jobs, since they affect job performance and trainability.
Thus, if cognitive abilities determine individuals’ choices and failures or successes in the
labor market, they can be expected to exert systematic effects on policy preferences. More
precisely, since previous research shows that advantageous labor market positions tends to
lower demand for redistributive policies, I expect to find that individuals with high levels of
cognitive skills hold more conservative social policy preferences, and that this effect operates
predominantly through labor market variables.
In contrast, (political) psychologists, who have examined effects of intelligence on ideology
and preferences, generally claim that cognitive ability leads to more liberal social policy
preferences (e.g., Deary et al. 2008; Kanazawa 2009, 2010; Schoon et al. 2010). These
claims stand in direct contrast to findings in economics and the literature on the political
economy of redistribution. In this paper, I will re-examine these claims, using a population
survey instead of more limited (student) samples and explicitly defined measurement
models.1
Noncognitive skills. Focussing only on cognitive abilities is too one-sided. Recent research
in labor economics argues for, and provides evidence of, the importance of noncognitive
skills – such as motivation and self-confidence – for a range of economic and social outcomes
1
For a recent critique of this research see Morton et al. 2011. The dependent variable in the studies
mentioned above is often not clearly specified and different studies use differing attitudinal or general
left-right measures.
3
(Bowles et al. 2001; Carneiro et al. 2003; Cunha et al. 2005; Heckman et al. 2006; Lindqvist
and Vestman 2011). Similarly, political scientists have begun to study the role of personality
on preferences and ideologies (see Gerber et al. 2010 for an extensive overview).
An especially relevant noncognitive skill is self-confidence, which is related to the psychological concept of locus of control (Rotter 1966). Individuals with a strong internal locus
of control are characterized by the belief that outcomes are mainly shaped by their own
skills and choices. Thus having high confidence in one’s own abilities can compensate
for lower levels of cognitive ability in the labor market (Benabou and Tirole 2002; Drago
2011). Indeed, Groves (2005) shows that a higher locus of control is positively related to
female earnings in the US, while Cebi (2007) shows that higher locus of control of men and
women is rewarded positively in the US labor market. This finding is supported by Heckman
et al. (2006) who show that cognitive and noncognitive skills are determinants of economic
success. Consequently, I expect to find that noncognitive skills exert a negative effect on social
policy preferences, and that this effect operates mainly through labor market variables.
3. Explaining social policy preferences
In the following I outline my analytical strategy and describe a causal mediation framework
(e.g., Imai et al. 2011), which explicates how skills affect policy preferences via labor
market outcomes.2 Imagine a simplified scenario where individual i (i = 1, . . . , N ) possesses
a preferred level of social policy θi , which is a function of a set of skills Ω. Different
(counterfactual) skill bundles are denoted by ωi and ω0i , respectively. Realized values of
confounding factors Xi are denoted by xi . A straightforward estimate of the total effect of
skills on preferences is
T E = E(θi (ω0i |xi ) − θi (ωi |xi )),
(1)
i.e., the expected (counterfactual) difference in social policy preferences for, say, high and
low skill holding other confounding variables constant.
To understand how skills shape preferences, one needs to move beyond estimating total
effects. In testing the argument that labor market outcomes are the primary mechanism
linking skills to policy preferences, I separate the effect of skills mediated via labor market
outcomes from the vast number of alternative mechanisms that could link skills and prefer-
2
General identification results for this setup are discussed in Imai et al. (2010b).
4
ences. More precisely, denote by Mi (Ωi = ωi |Zi = zi ) an individual’s labor market position
as function of skills, Ωi , and other background factors, Zi . The (average) effect of skills on
social policy preferences that is due to labor market outcomes – the mediated effect – is given
by
M E = E(θi (ωi , Mi (ω0i |zi )|xi ) − θi (ωi , Mi (ωi |zi )|xi )),
(2)
i.e., the change in preferences caused by a change in labor market outcomes resulting from
a change in skill levels from ω to ω0 while holding other confounding factors zi constant.
The remaining (or: direct) effect is calculated by holding labor market outcomes at a constant
skill level
DE = E(θi (ω0i , M (ωi |zi )|xi ) − θi (ωi , M (ωi |zi )|xi ))
(3)
and represents all mechanisms other than labor market outcomes linking skills to preferences.
In section 5 I present the statistical model used to estimate these effects in detail. Before
doing so I describe the unique data set that enables my analyses.
4. Data
I use individual-level panel data from the German Socio-economic Panel (GSOEP), a longitudinal representative survey of German households conducted since 1984.3 It provides
high quality data on individuals’ labor market activities, such as income, work experience,
and unemployment spells. In addition to a core set of questions, additional modules related
to specific topics are fielded. Relevant to my question are modules on personality measures
fielded in 1999, a battery of welfare preference items contained in the 2002 wave, and, for
the first time, a test of cognitive abilities administered to a subset of individuals in 2006.4 In
the following I describe the data on cognitive and noncognitive skills and preferences. More
detailed information on covariates used in the analysis is given in [online] Appendix A.1.
The number of individuals who participated in all necessary waves and which are inter3
For details see /www.diw.de/en/soep. The original version of the data were assembled while I was
ECASS visitor at the Institute of Socio-Economic Research, University of Essex.
4
The obvious downside is that cognitive skills are measured after preferences. However, evidence suggests
that fluid intelligence (as used in this paper) represents essentially innate abilities and is stable after
childhood, independent of one’s environment (e.g., Hopkins and Bracht 1975; Schuerger and Witt 1989).
Furthermore, I also conducted a robustness test where cognitive skills are age standardized, which produced
identical results.
5
viewed using CAPI in 2006 (and thus eligible for participating in the cognitive test) is 1,816.
In order to create a homogeneous sample, I focus on males with completed education.5 This
yields a sample of 827 male adults. Of those 202 refused to participate in the cognitive
ability test. I treat missing responses as part of my measurement models, thus my final
sample size 827.6
Cognitive skills are of course hard to measure, especially in the context of a large household
survey. The GSOEP uses an adapted version of a standard psychological symbol-digits
correspondence test (Lang et al. 2007; Schupp et al. 2008). It measures the concept of ‘fluid
intelligence’, which represents general analytical performance, such as pattern recognition,
perceptual speed, and problem solving skills, which are ‘fluid’ in the sense that they can
be applied to almost any problem (Cattell 1987: 97). After pre-testing, this test was
administered in 2006 to the subset of individuals surveyed via CAPI. In the implemented
symbol-digit test (SDT), individuals have to match numbers to symbols. Nine symbols
corresponding to nine numbers are displayed permanently on the screen, and individuals
have to match numbers to a stream of displayed symbols as quickly as possible.7 The
frequency of correct matches is assessed in three 30 second intervals, yielding an “ultrashort” 90 second test (Lang et al. 2007; Schupp et al. 2008). While such a short test cannot
provide the same level of detail as longer test batteries (such as the widely used AFQT),
it is well suited for application in a general survey, where participation is voluntary and
question time limited. Pre-tests have shown that scores obtained from this ultra-short test
correlate highly with established long-form psychological ability tests (Lang et al. 2007;
von Rosenbladt and Stocker 2005). Its shortness will result in less precise measurements of
individual ability. In my latent variable setup, described below, this uncertainty is taken into
account in all parts of the model.8
Noncognitive skills have been represented by an individual’s self-confidence or locus of
control in a range of previous studies (e.g., Drago 2011; Flossman et al. 2007). Similarly,
I use a set of items measuring locus of control, originally introduced by Rotter (1966),
and fielded in the GSOEP in 1999. Individuals responded to a number of statements
5
A proper analysis including women would have to solve the selection problem of female labor market
participation, or present separate analyses for men and women (which would increase the paper’s length
and complexity). But note that simply including all women does not change my core results.
6
In models estimated as robustness checks using listwise deletion of missing values, sample size is 625.
Results are indistinguishable between both data sets.
7
Classical psychological tests usually require individuals to choose symbols. This procedure is reversed in the
GSOEP to enable data entry using standard CAPI tools.
8
See Jackman (2008: section 4) for a discussion of the biases when ignoring measurement uncertainty.
6
regarding self-efficacy, self-confidence and achievement of success using agree-disagree
scales. Initial exploratory analyses suggest that a subset of the full 10 item scale is sufficient
to represent a one-dimensional measure of noncognitive skills. Flossman et al. (2007) arrive
at a similar result using the same data; Heckman et al. (2006) discuss the preference for a
one dimensional representation of noncognitive skills using US data. Thus, I use five items
to measure the extent to which an individual thinks he is in control of his life and success
depends on his own actions. Using an ordinal item response theory model (described below),
I create a one-dimensional measure of noncognitive skills. Online appendix A.2 lists items
and provides more details of my initial analyses.
To capture Social policy preferences, I use the 2002 wave of the GSOEP, which provides
a set of items tapping if individuals prefer the private market or the state as provider of
financial security for people of old age, for families, for individuals needing care, in case
of illness, or in case unemployment.9 I argue that responses to these items are driven
by individuals’ underlying social policy preferences: individuals who prefer a more active
welfare state, which spends more on social services, will respond positively to those items.
This underlying policy preference should be captured using an appropriate measurement
model (e.g., Ansolabehere et al. 2008; Jackman 2008), as described in the following section.
5. Model
I estimate the effect of skills (cognitive and noncognitive) on policy preferences θi of
individual i via the following equation:
θi = γ0 xi + ζ0 mi + η1 ωci + η2 ωni + εi , εi ∼ N (0, σε2 )
(4)
where xi are control variables capturing heterogeneity between individuals; mi is a vector
of endogenous labor market outcomes, mi = ( yil , yie , yiu )0 , shaped by skills (detailed below);
and ωci and ωni are cognitive and noncognitive skills. Both preferences and skills are not
directly observable, but have to be captured by appropriate measurements described below.
9
Response categories range from ‘only the private market’ to ‘only the state’. For more details see appendix A.2.
These items have been used previously (e.g., Alesina and Fuchs-Schündeln 2007), however no measurement
model for them has been proposed.
7
5.1. Measuring social policy preferences
The majority of studies on individual policy preferences uses a single survey item. Often born
out of limitations of survey design or coverage, this widespread practice has some drawbacks.
Preferences are captured only imperfectly when using a single survey item as dependent
variable. Using multiple items is preferable since it leads to an improved measurement of
social policy preferences (cf. Ansolabehere et al. 2008; Jackman 2008). Furthermore, jointly
estimating measurement model equations together with substantive model equations takes
measurement error into account and yields more conservative standard errors (Skrondal and
Laake 2001). Following recent research (e.g., Treier and Jackman 2008; Stegmueller 2011),
I employ an ordinal probit item response theory model, which expresses preferences as an
unobserved (latent) variable, θ , which generates observed categorical survey responses to
policy questions (for an introduction to Bayesian IRT models, see Jackman 2009: ch.9). I
model response ysis of individual i to each question s (s = 1, . . . , S), which has Ts categories,
as a function of a threshold and a discrimination parameter:10
ysis = Φ(τsst − λss θi )
(5)
To ensure that the model reflects the ordinal nature of survey items, τss is a vector of
thresholds for item s with length Ts − 1 with a strictly monotonous ordering constraint, such
that τsa < τs b , ∀a < b, ∀s. The latent variable θ represents individuals’ social policy ideal
points. Its location and scale are identified by specifying its distribution as normal with
fixed variance, θ ∼ N (0, 1). This latent variable strategy is preferable over using simple
sum-scores as it includes measurement uncertainty in the model.
A further advantage of a latent variable model, is that response bias can be captured in
the model. The sample used in my analyses includes individuals who lived in the former
German Democratic Republic, and experienced a period of socialist socialization (Alesina and
Fuchs-Schündeln 2007; Svallfors 2010). It is possible that those individuals will instinctively
respond more positively to any question involving state activity. This response bias (often
10
To avoid a proliferation of variables and coefficients in the remainder of the paper, I opt for a slight abuse of
notation and use superscripts to denote the equation they belong to (e.g. ‘s’ for social policy preference
equations, ‘l’ for labor income, etc.)
8
termed differential item functioning in the psychometric literature) can be captured by
including a parameter in the measurement model which captures distinct response behavior.
yssj = Φ(τsst − λss θi − δ1 w i )
(6)
Here w i is an indicator variable equal to one if someone grew up under East German
socialism; and δ1 captures the extend to which responses are biased up- or downwards. For
more details on this strategy see Perez (2011) and Stegmueller (2011).
5.2. Measuring cognitive and non-cognitive skills
I model skills via a two-dimensional model with two orthogonal factors for cognitive and
noncognitive skills, following recent work in economics by Heckman and colleagues (e.g.,
Carneiro et al. 2003; Heckman et al. 2006; Cunha et al. 2010). Since my measures of
cognitive abilities are continuous while my noncognitive skill items are categorical, I employ
a mixed measurement model for joint continuous and discrete variables (Heckman 2001;
Quinn 2004).11
In the measurement equation for cognitive ability, I assume that measurements made using
the Symbol-Digit-Test are generated by latent continuous cognitive skills. In the equation for
locus of control items, I stipulate that categorical item responses are driven by an underlying
latent continuous variable representing noncognitive skills. More precisely, let ycic be the
number of correct number-symbols pairs in each 30 second block c (c = 1, . . . , C) achieved
by individual i; and let ynin be his response to non-cognitive test item n (n = 1, . . . , N ) with
Tn categories. Then, responses are modeled as a function of an individual’s latent cognitive
ωci and non-cognitive ωni abilities:
11
ycic = µc + λcc ωci + ψc ,
ωc ∼ N (0, 1)
(7)
ynin = Φ(τnnt − λnn ωni ),
ωn ∼ N (0, 1)
(8)
In a more complex measurement model setup Heckman et al. (2006) allow for reverse causality between
schooling and cognitive and non-cognitive skills, since many of their subjects were still at school when
completing the tests. In my case all individuals still in school are excluded from the analysis, and I therefore
do not extend my model in that direction. I do, however, allow for endogenous schooling, i.e. I model the
fact that schooling is a choice influenced by latent cognitive and non-cognitive ability.
9
with ωci ⊥ ωni .12 Here, λn and λc are discrimination or loading parameters indicating how
latent skills are transferred into observed responses. Equation (7) describes a classical
linear measurement equation with means µc and residual variances ψc . In the following
application I restrict µc = 0 since I use standardized values from the cognitive abilities test.
Equation (8) describes an IRT model for categorical noncognitive skill items with thresholds
or difficulties τnnt . Since responses are given on an ordinal scale, I enforce monotonically
ordered thresholds such that τna < τnb , ∀a < b, ∀n. Location and scale of both latent skill
variables are fixed by using a standard-normal prior distribution.13
5.3. Endogenous schooling, income, and unemployment experience
When analyzing social policy preferences, economic outcomes such as (permanent) income,
unemployment experiences and choice of education are not exogenous. Rather, they are
shaped by cognitive and noncognitive skills. Heckman et al. (2006) demonstrate how a
variety of economic and social outcomes are determined by cognitive as well as noncognitive
skills, using US panel data. Thus, I model economic outcomes relevant to policy preferences
as functions of cognitive and noncognitive skills.14
Education
I model an individual’s educational choice as the highest level of education
achieved. For each individual i, I record ysie , where s = 1, . . . , S denotes the school certificate,
such as vocational or high school education. Adopting the perspective that certificates
can be (roughly) ordered on a continuum, I model education choice via an ordered probit
specification:
ysie = Φ(τse + αe0 zei + β1e ωci + β2e ωni ).
(9)
Here τse are ordered thresholds distinguishing between different educational certificates, zei
is a vector of control variables such as immigration status and parental background with
12
The orthogonality restriction between cognitive and noncognitive skills allows for a parsimonious model
setup, where individual skills can vary independently on both dimensions. Nonetheless, the data provide
enough information to identify a model with correlated factors (Carneiro et al. 2003). This yields only a
negligible correlation of 0.056 ± 0.036.
13
Alternatively, one could fix one of the λs in each equation and freely estimate the variance of both skill
measures. I prefer this approach since it normalizes the distribution of both latent variables and makes
easier the visual comparisons in plots presented below. Furthermore, one might question the normality
assumption for both latent skill factors. In online appendix A.5 I show that a model that allows for
non-normal factors (using a finite mixture of normals) leads to quite similar latent skill estimates.
14
As an “added bonus” including behavioral outcomes in skill measurement equations (in addition to psychological survey items) improves measurements of cognitive and noncognitive skills (e.g. Heckman et al.
2006; Cunha et al. 2010), and makes the two factor model rotation invariant (cf. Carneiro et al. 2003).
10
coefficients αe . Finally, β1e and β2e capture effects of cognitive and noncognitive skills on
education choice.
Income As argued before both cognitive and noncognitive abilities determine and individual’s productivity and performance and are thus important parts of his earning function
(Griffin and Ganderton 1996). I concentrate on long-term or ‘permanent’ labor income,
abstracting from transitory income shocks in the current cross-section, since previous work
suggests that permanent income is the more important factor shaping policy (or redistribution) preferences (Idema and Rueda 2011; Ansell 2013; Stegmueller forthcoming). I
model an individual’s permanent labor income yil (calculated from longitudinal income
information for each individual) as function of standard variables such as work experience
and immigration status collected in zli and of cognitive (ωci ) and noncognitive (ωni ) skills:
yil = αl0 zli + β1l ωci + β2l ωni + εli , εl ∼ N (0, σε2l ).
Unemployment
(10)
Researchers modeling unemployment dynamics often rely on random
effects specifications to capture effects of unobserved factors such as ability and motivation.
Having measures of cognitive and noncognitive allows me to model these previously unobserved effects directly. Instead of focusing on which individuals are unemployed in the
current cross-section of data, I use information in the panel to construct a variable indicating
if a respondent has experienced spells of unemployment in its work history. I model the
propensity of experiencing unemployment yiu via a probit equation:
yiu = Φ(τu + αu0 zui + β1u ωci + β2u ωni ),
(11)
where τu is a threshold or intercept, zui is a vector of control variables, such as immigration
status and age, with associated coefficients αu . The effect of an individual’s cognitive skills,
ωci , and his noncognitive skills, ωni are captured by β1u and β2u , respectively.
This completes the specification of the skills measurement model. The model setup implies
that conditional on control variables in zi , the dependence across all measurements, choices,
and outcomes is due to ωc and ωn . Controlling for this dependence in equations (7) – (11)
means controlling for endogeneity in the model (Heckman et al. 2006: 424).
11
5.4. Priors and estimation
I specify and estimate this system of equations in a Bayesian framework (for introductions see
Gill 2008a or Jackman 2009). I assign appropriate (hyper-) priors to all model parameters.
Priors for coefficients of latent variables in measurement equations are elicited by choosing
parametrization of a half-normal distribution such that parameters are expected to have
mean 0.5 and values greater than 10 occur with a low probability of p = 0.1: λc , λn , λ r ∼
N+ (0.5, 7.413). This represents an a priori expectation of a non-zero relationship between
latent skills and their observable manifestations, and orients the latent variables (e.g.,
such that higher test scores are related to higher latent cognitive skills).15 For threshold
parameters in probit equations, I choose parameters for the normal distribution such
that values lie in the interval [−10, 10] with probability p = 0.9: τ r , τn , τu ∼ N (0, 6.08).
Residuals in equations with continuous left hand side variables are drawn from an inverse
Gamma distribution εl , ε r ∼ Γ−1 (1, 2). A zero-centered normal prior with large variance
ensures regression type estimates for all control variables in my policy preferences equation,
γ v ∼ N (0, 100). The same prior is used for controls in skill outcome measurement equations
αu , αl , αe ∼ N (0, 100). Finally, priors for coefficients for effects of cognitive and noncognitive
skills in economic outcomes and choice equations and in the policy preferences equation are
normally distributed with prior mean zero and a large variance β l , β e , β u , η ∼ N (0, 100).
Given the relatively large sample size of several hundred cases, the data dominate these
priors, and results are insensitive to prior perturbations.16
I estimate the model using MCMC sampling using a standard Gibbs sampler with Metropolis
steps for updating the probit threshold vectors. I run two chains for 200,000 iterations,
discarding the first half as burn-in. To reduce memory usage, I thin each chain by a factor of
20. Visual inspection as well as diagnostics of the resulting 2 × 10, 000 samples suggested by
Gelman and Rubin (1992) and Geweke (1992) show no signs for absence of convergence.
Furthermore, I conducted an “insurance run” (Gill 2008b) running the sampler for 500,000
iterations – yielding identical results.
15
Note that this restriction does in no way influence my results (since the latent factors are rotation invariant
through their relation to the outcome equations).
16
I conducted robustness tests using a different Gamma prior specifications for residuals (shape and scale =
0.001); priors for coefficients with variances 10 times larger; and lambda priors with mean 0. Results are
indistinguishable from the ones presented here.
12
Table 1: Ordinal IRT measurement equations for social policy preferences. Posterior means
with highest posterior density regions.
Discrimination
parameters λir
Thresholds τirt
Sick
λ1r
1.470
[1.292, 1.652]
Unemployed
λ2r
0.867
[0.772, 0.966]
Care
λ3r
1.384
[1.223, 1.546]
Old
λ4r
1.590
[1.394, 1.797]
Families
λ5r
0.543
[0.480, 0.616]
τ11
τ12
τ21
τ22
τ31
τ32
τ41
τ42
τ51
τ52
τ53
0.180
1.756
−0.504
0.736
0.097
1.789
0.329
1.990
−1.300
0.464
1.388
[0.072, 0.293]
[1.582, 1.936]
[−0.588, −0.419]
[0.654, 0.829]
[−0.009, 0.202]
[1.632, 1.963]
[0.210, 0.448]
[1.782, 2.197]
[−1.388, −1.208]
[0.389, 0.534]
[1.291, 1.481]
Note: Estimated East bias parameter δ1 =0.414 with 95% HPD interval from 0.218 to 0.606.
6. Results
In this section, I start by describing results from the measurement equations of my model
in the next two subsections. Readers only interested in core results are welcome to skip
ahead to subsection 6.3, which describes the total effect of skills on social policy preferences
followed by subsection 6.4, which details the mediating effects of labor market outcomes.
6.1. Social policy preferences measurement
Table 1 shows estimates for the ordinal IRT model of social policy preferences. I find that
latent preferences are most strongly related to statements regarding state responsibility for
financial security for the sick, the old, and those needing care. Their relationship to state
responsibility for financial security of families is somewhat weaker, but still unequivocally
non-zero.
Estimates of thresholds for each item are located at different points of θ , thus providing
information over a wide range of latent policy preferences. Besides providing an explicit
model of observed survey responses, the ordinal IRT model used here allows me to include
possible response bias due East German socialist socialization. The posterior mean of this
bias parameter, δ1 , is 0.41 with a highest posterior density interval that does not include
13
zero (ranging from 0.22 to 0.61). This suggests the existence of systematically different
response tendencies of respondents who grew up under socialism. Capturing this tendency
as part of the measurement model yields an unbiased individual preference estimate θi ,
irrespective of the side of the wall an individual grew up.
6.2. Cognitive and noncognitive skills measurement
Estimates for measurement equations for cognitive and noncognitive skills are given in
Table 2. Not surprisingly latent cognitive skills and observed test scores are highly correlated,
as shown in Panel (A). A one-dimensional latent skill factor explains the majority of the variance in observed scores. Similarly, latent noncognitive skills and observed categorical survey
responses are strongly related, as high discrimination parameters show. The relationship is
somewhat weaker for the last item, but the HPD region of its discrimination is still bound
n
n
away from zero. Furthermore, its threshold estimates τ51
and τ52
show that it provides
valuable information at the lower end of the latent noncognitive skill spectrum. Estimates
displayed in Panel (B), show a strong relationship between latent skills and individuals’
observed education choice and labor market outcomes (more details in subsection 6.4).
I plot the marginal distributions of cognitive and noncognitive skills in Figure 1. For three
eduction levels (those with less than high school, high school, and more than high school
education), I plot kernel density estimates of the posterior means of ωc in panel A and
ωn in panel B. Figure 1 shows considerable individual variation in both cognitive and
noncognitive skills. A substantial portion of the sample has low levels of cognitive skills of
two standard deviations below the (normalized) mean, especially among those with no high
school education. At the other end of the distribution are individuals with high cognitive
skills, found predominantly among the higher educated. A similar picture emerges with
regard to noncognitive skills, which exhibits similar spread with considerable proportions
of the sample more than one standard deviation below or above the mean. One should
note the existence of lower educated individuals with high skills and (especially) of highly
educated individuals with lower cognitive skills. This suggests that using education as proxy
for ability or productivity in models explaining policy preferences is bound to produce rather
unreliable results.
Panel (C) of Figure 1 shows the joint distribution of cognitive and non-cognitive skills. I plot
the posterior mean of [ωc , ωn ] for each individual. A contour plot of the two-dimensional
density estimate is superimposed to visualize the distribution of skills. It again emphasizes
14
Table 2: Measurement equations for cognitive and noncognitive skills. Posterior means and 95%
highest posterior density regions.
(A) Skill indicators
Thresholds τnit
Residual variances ψc
Loading/Discrimination
parameters λi
SDT 1−30 sec.
SDT 31−60 sec.
SDT 61−90 sec.
λ1c
λ2c
λ3c
0.876
0.948
0.865
[0.827, 0.921]
[0.904, 0.990]
[0.818, 0.910]
ψ1c
ψ2c
ψ3c
0.255
0.115
0.269
LoC 1
λ1n
0.916
[0.797, 1.030]
LoC 2
λ2n
0.714
[0.617, 0.808]
LoC 3
λ3n
1.030
[0.886, 1.157]
LoC 4
λ4n
0.768
[0.664, 0.869]
LoC 5
λ5n
0.429
[0.353, 0.501]
n
τ11
n
τ12
n
τ21
n
τ22
n
τ31
n
τ32
n
τ41
n
τ42
n
τ51
n
τ52
n
τ53
−1.072
0.432
−0.535
0.679
−1.323
0.338
−0.999
0.451
−1.333
−0.251
0.878
[0.229, 0.280]
[0.097, 0.134]
[0.244, 0.297]
[−1.177, −0.973]
[0.351, 0.519]
[−0.612, −0.455]
[0.597, 0.763]
[−1.444, −1.200]
[0.249, 0.423]
[−1.092, −0.909]
[0.376, 0.532]
[−1.422, −1.245]
[−0.319, −0.185]
[0.800, 0.953]
(B) Skill outcomes
Cognitive skills ωC
Education
Perm. Income
Unemployment
β1e
β1l
β1u
0.223
0.265
−0.251
[0.159, 0.291]
[0.201, 0.327]
[−0.331, −0.165]
Noncognitive skills ωN
β2e
β2l
β2u
0.220
0.293
−0.184
[0.148, 0.291]
[0.225, 0.352]
[−0.264, −0.106]
Note: Foreign bias in cognitive test (SDT items) δ2 = −0.416 with 95% HPD interval from −0.511 to −0.318. Estimates α of
controls in skill outcome equations not shown (see appendix A.3 for full table).
the wide variety of existing cognitive–non-cognitive skill combinations. It is not uncommon
for individuals with below average cognitive abilities to have higher noncognitive skills and
vice versa. Inasmuch as both cognitive and noncognitive skills are factors determining labor
market success (as argued above), this stresses that both latent skills need to be included in
a complete model of redistribution preferences.
6.3. Total Effect of skills on social policy preferences
Table 3 shows estimates (posterior means and standard deviations) of the total effect (eq. 1)
of cognitive and noncognitive skills. Besides latent skills I include a number of socio-
15
A
0.6
less than HS
high school
more than HS
0.5
0.4
C
2
●
0.02
●
●
●
●
●
● ●
● ●
●
●
●●
●
● ●
●
●
●
●
●●
●
●
●
●
● ● ●
● ●●
● ●
●
●
●
●
●
●● ● ●
●
●
● ● ●●●
● ●
●
●
●
● ●● ●
●●
●
●●
●
● ●
●
●● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
● ● ●
●
●
●
●
●● ●
●
●
●
● ● ●●
● ●
● ● ●● ●
● ● ●●
● ●
●
● ●●
●●
●
● ●
●
●
●
●●
●● ●
●
●
●
●
●●
●
●
● ●
●● ●
●
●
●
●
●
●
● ● ● ●
●
●
●●● ● ●
●
● ●
●
●
●●
●
●
● ● ● ●
● ● ● ●
●
● ●●●
●
●
●
●
●
● ●● ●●●
●
● ●
●
●
● ● ●●● ● ● ●
●
●
●● ●
●
● ●● ●●
●
●
●
●
●
●
●● ● ●
●
●
●
● ●●
●
●
●
● ● ● ●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
● ●
●
●●
●
●
●
●
● ●
● ●
●● ●
●
●
●
●
●●
●
●● ● ●
● ●
●
● ●●●●●
●● ●
●
● ●
● ●●
●
●
● ● ●●
●
●
● ●
●
● ● ●
●
●●●●● ●
●
● ●
●
●●
● ● ●
● ●● ●●● ● ● ● ●
●
●
●●
● ●●● ●●
●●●●● ● ●●
●●
● ●●
●
●
●●●●●
● ●
●
●
●●
●
●
●●
●
●
●● ●
●
●
●
●
●
●
●
●
● ●●
●● ●
●●
● ●
●
●●
●
●
●● ●
●
●●
●
● ●●●
●●
● ●● ●
●
●●● ● ●●
●
●
●
●
●
●
●
●
●
●
●
●● ● ● ●
● ●
● ●● ●
●●
●
●
●
●●●● ● ●
●●●
●
●
●
●
●●
● ●
●
● ● ●●●
● ●
●●● ●
●●
●
●
● ●
● ● ●●
● ●
●●
●
●
●●
●
●●
●
●●
●●
● ●
●●
●●●● ● ●
● ●●
●●●
●
●
● ●
●●
● ●
●
●
●●● ●
●
●
● ● ●
●●
●
●● ●
●
●
●● ●●
●
● ●
●●●●
●● ●
●
●
●
●●●●●● ●
●●
●●
●● ●
● ●●●● ●●●●
●
●
●
●
●
●
●
● ●
● ● ● ● ● ●●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●●● ● ●
●
●
●
●
● ●
●●●
● ●●
● ● ●
●●
●
● ● ●
●●
●●
● ● ● ●●
●
●●
●
●
●●
●● ● ● ●● ●
●
●
●
●
●●
●● ●
● ● ●● ●
●●
●
● ●
●
●
●
●●
●
●
● ●●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●●
●
● ●● ● ●
●
●
●
●
●
●●
●
●
● ● ●● ●
●
● ● ●● ●
●
●
●
●
●●
●
● ●
●
●
●
●
●
●●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
0.3
●
●
●
0.04
● ●
0.06
0.2
1
0.1
−2
−1
0
ωc
1
2
3
0.18
ωn
−3
0
●
●
0.
1
0.14
0.0
B
0.08
0.16
0.22
0.2
0.5
less than HS
high school
more than HS
0.4
−1
0.3
0.2
−2
0.12
0.1
●
−2
−1
0.0
−3
−2
−1
0
ωn
1
2
0
ωc
1
2
3
Figure 1: Distribution of latent cognitive (panel A) and non-cognitive (panel B) skill factors by
education level (kernel density estimates of posterior means, bandwidth 0.35 evaluated over 200
point grid). Panel C plots the position of each individual in two-dimensional cognitive–noncognitive
skill space (contour plot of 2-dimensional density estimate superimposed).
economic controls to capture heterogeneity between individuals and households. These
include an individual’s age, immigration status, household size, house ownership, whether
he is divorced, a union member, self-employed, or currently living in the eastern part of
Germany. Estimates of controls are available in online appendix A.4.
Specifications (1) and (2) show that both cognitive and noncognitive skills exert a strong
negative effect on social policy preferences. This conclusion remains virtually unchanged
when including both latent skills simultaneously in specification (3). When adding a range
of individual and household controls in (4), I find a slight increase in the posterior standard
deviation of the cognitive skill effect, as well as a slight reduction of the estimated magnitude
of noncognitive skills. However, the conclusion remains the same: the higher someones
cognitive and noncognitive skills the lower his preferences for extensive social policy.
To illustrate the substantive magnitude of these effects, I conduct a series of simulations
displayed in Figure 2, which shows expected values of latent social policy preferences for
deciles of cognitive and noncognitive skills. Panel (A) shows posterior means and HPD
intervals for the effect of cognitive skills on preferences, holding all other covariates as
well as noncognitive skills at their sample means. Panel (B) repeats the same calculation
16
Table 3: Estimated total effects of cognitive and noncognitive skills on social
policy preferences. Posterior means and standard deviations.
(1)
Skills
Cognitive [η1 ]
(2)
−0.153
(0.034)
Noncognitive [η2 ]
Controls [γ]
−0.198
(0.036)
no
no
(3)
−0.148
(0.034)
−0.198
(0.037)
(4)
−0.141
(0.037)
−0.192
(0.037)
no
yes
Note: Total effects calculated from preference equation (eq. 4) estimated jointly with system of measurement equations (eq. 6 to 11). Controls included in (4) are age, house ownership, household size,
immigration status, being divorced, union member, self-employed, currently living in East Germany.
For estimates see table A.4.
using noncognitive skills. Both figures show the substantive relevance of skills. For example,
moving four deciles around the mean of the cognitive or noncognitive skill distribution
reduces social policy preferences by 0.4 and 0.5 standard deviations, respectively.
In panel (C) I simulate changes in social policy preferences when simultaneously raising
or lowering cognitive and cognitive skills. Shown is again the posterior expectation of
social policy preferences for all possible combinations of latent skill deciles. This makes the
substantial effect of skills even more apparent. Holding all else equal and moving from a
moderately low combination of skills (such as third deciles) to a high combination (such as
sixth deciles) lowers preferences by almost one standard deviation. These conclusions are
robust under a wide variety of alternative specifications (cf. section 6.5).
6.4. Mediated skill effects via labor market outcomes
I now turn to the mediated effect of skills. In other words: is the effect of skills on social
policy preferences due to individuals’ labor market outcomes, as argued above, or is it simply
a result of a myriad of other (substantively irrelevant) factors? To answer this question, I
proceed in two steps.
First, I demonstrate how latent skills determine commonly used labor market variables by
conducting several simulations displayed in Figure 3. Panels (A) show effects of cognitive
(A1) and noncognitive (A2) skills on the probability of choosing higher education, for each
latent skill decile. The probability of obtaining a higher education certificate increases
17
A
0.6
E(θ)
0.4
0.2
0.0
−0.2
−0.4
−0.6
1
2
3
4
5
1
2
3
4
5
ωc
6
7
8
9
10
6
7
8
9
10
0.6
B
E(θ)
0.4
0.2
0.0
−0.2
−0.4
−0.6
ωn
C
0.6
E(θ)
0.4
0.6
0.0
−0.2
0.4
0.6
−0.4
0.2
0.4
−0.6
0.0
1
3 0.2
0.0
2
E(θ)
E(θ)
0.2
−0.2
4
5
ωc
6
7
8
9
10
−0.2
−0.4
−0.4
−0.6
1
2
3
4
5
ωc
6 −0.6 7
1
8
9
3
2
10
4
5
ωn
6
7
8
9
10
Figure 2: Simulated effect of cognitive and noncognitive skills on social policy preferences holding
all else equal. Panels A and B show separate effect of cognitive and noncognitive skills, panel C shows
the effect of cognitive and noncognitive skills varied simultaneously. Based on 10,000 simulated
values, evaluated over a 100 point grid.
monotonically with latent noncognitive and (especially) cognitive skills. Similarly, an
individual’s permanent income depends strongly on latent skills. Panels (B) show that an
increase in either cognitive or noncognitive skills by one decile increases permanent income
by roughly 200 Euros, holding all else equal. Finally, panels (C) show the relationship
between unemployment and latent skills. It shows a remarkably strong effect of latent
cognitive skills: an increase in cognitive skills of one decile lowers the probability of
18
A2
0.4
0.3
Pr(ys = 4)
Pr(ys = 4)
A1
0.2
0.1
0.0
0.4
0.3
0.2
0.1
0.0
1
2
3
4
5
c
6
7
8
9
10
1
2
3
4
5
1
2
3
4
5
1
2
3
4
5
ω
B1
2
3
4
5
ωc
6
7
8
9
C2
8
9
10
ωn
6
7
8
9
10
6
7
8
9
10
0.7
0.6
Pr(yu = 1)
Pr(yu = 1)
0.6
7
28
26
24
22
20
18
16
14
10
0.7
6
x100
E(yL)
E(yL)
1
C1
B2
x100
28
26
24
22
20
18
16
14
ωn
0.5
0.4
0.3
0.2
0.5
0.4
0.3
0.2
0.1
0.1
1
2
3
4
5
c
6
7
8
9
10
ω
ωn
Figure 3: Simulations showing how labor market outcomes are determined by deciles of cognitive
skills (1) and noncognitive skills (2) holding other covariates at means. Panels A shows probability
of choosing college education, panels B shows expected permanent income, and panels C shows
probability of experiencing spells of unemployment. Based on 10,000 simulated values, evaluated
over a 100 point grid. Simulation posterior means and 95% HPD intervals.
experiencing spells of unemployment by roughly 5 percentage points. Moving from the
second to the ninth cognitive decile reduces this probability by over 30 percentage points,
ceteris paribus. The effect of noncognitive skills is somewhat less marked, but still highly
relevant. These results clearly indicate that labor market outcomes are dependent on latent
cognitive and noncognitive skills.
Second, I calculate (i) the mediated effect of skills on social policy preferences (cf. eq. 2),
which represents the theoretically relevant link between skills, labor market outcomes,
and preferences, as well as (ii) the remaining direct effect (cf. eq. 3), which represent a
19
Table 4: Mediated effects of cognitive and noncognitive skills on social policy preferences.
Posterior means and standard deviations.
(1)
Cognitive
skills
Total effect
– mediated effect
– remaining direct effect
Controls
(2)
Noncognitive
skills
−0.148
(0.034)
−0.079
(0.015)
−0.069
(0.035)
−0.198
(0.037)
−0.075
(0.014)
−0.123
(0.039)
no
Cognitive
skills
Noncognitive
skills
−0.141
(0.037)
−0.062
(0.016)
−0.079
(0.040)
−0.192
(0.037)
−0.060
(0.015)
−0.132
(0.040)
yes
Note: Calculated from equation (4). Controls included in (2) are age, house ownership, household size, immigration status,
being divorced, union member, self-employed, currently living in East Germany. Table 3.
(possibly large) number of other channels of influence.17 Estimates displayed in Table 4
show the total effect of skills decomposed into the mediated effect and the remaining direct
effect for specifications excluding (1) and including (2) the set of individual controls xi . I
find clear mediating effects of labor market outcomes for cognitive skills. The estimated
mediation effect of labor market outcomes is −0.062 ± 0.016, while the remaining effect is
−0.079 ± 0.04. This indicated that a range of other, unspecified channels might link skills
to social policy preferences, but their effect is uncertain (or “not significant”) compared
to the clear-cut systematic effect of labor market outcomes. For noncognitive skills, I
find a relevant remaining effect, −0.132 ± 0.04, indicating the significant presence of
mechanisms other than labor market outcomes. But the mediating effect of labor market
outcomes for noncognitive skills is as clear-cut as for their cognitive counter-part, estimated
as −0.06 ± 0.015. In other words, an individual’s three basic labor market variables –
education, permanent income, unemployment experience, go a long way in explaining why
cognitive and noncognitive skills systematically shape his social policy preferences.
Furthermore, the highly relevant link between latent cognitive and noncognitive skills, labor
market outcomes, and preferences suggests that excluding basic skills from empirical models
17
One cannot rule out the possibility that an unobserved variable influences both mediator and outcome
conditional on controls and treatments (although I argue that many possible unobserved socio-economic
confounders will simply be functions of latent skills). This creates correlated residuals between mediation
and outcome equations which can not be estimated as part of the model. Thus Imai et al. (2010b) propose
to conduct sensitivity analyses by simulating over a range of possible correlations. My simulations show
stable substantive effects up to error correlations of ≈ 0.5.
20
Table 5: Difference in estimated labor market effects when ignoring latent skills
Education
Unemployment
Permanent income
Diff.
in %
0.037
0.023
0.042
46.1
47.0
34.1
of social policy preferences will overstate the effects of labor market variables. To illustrate
this point, I re-estimate my model given in equation (4) and constrain all latent skill effects
to zero. Table 5 shows the difference in estimates for three labor market variables on
social policy preferences. I find that all estimates are inflated. The estimate of income on
preferences is more than 30 percent larger; while the effect attributed to unemployment
experiences is increased by almost 50 percent. While these findings do not, of course,
replace results from a full-fledged Monte Carlo study, they do indicate that researchers
ignoring cognitive and noncognitive skills might to so at their own peril.
6.5. Robustness checks
Before moving on the concluding section, I conduct a number of robustness checks. Table 6
shows estimates of total cognitive and noncognitive skill effects on social policy preferences
under several alternative specifications. Since several scholars have emphasized distinct
preferences of the religious (e.g., Scheve and Stasavage 2006), specification (1) includes
an individual’s religious identification as Catholic, Protestant, or other. To capture possibly
distinct preferences of individuals outside the labor market, specification (2) adds a dummy
for retired individuals. Using years of schooling instead of educational certificates constitutes
specification (3). In (4) I allow for differences in preferences between individuals with
differing levels of skill specificity (e.g., Iversen and Soskice 2001). To control for differences
between industries and local labor market conditions, specifications (5) and (6) include 7
industry and 15 state fixed effects, respectively. Instead of imputing missing information
of (mainly) cognitive test items and other covariates, specification (7) is estimated on a
subsample of observed responses only, reducing the sample size by 23 percent. While
psychometric research has shown that cognitive skills are largely stable after age eight and
thus independent of age, noncognitive skills might change over the life-cycle (Borghans
et al. 2008: 976). Thus, specification (8) allows for linear and quadratic age effects in
21
Table 6: Robustness checks. Estimates of cognitive and
noncognitive skill effects under nine alternative specifications. Posterior means and standard deviations.
cognitive
−0.149
(0.037)
−0.147
(0.037)
−0.140
(0.037)
−0.151
(0.040)
−0.142
(0.038)
−0.169
(0.039)
−0.132
(0.039)
−0.142
(0.038)
−0.146
(0.046)
(1) Religion
(2) Retirement
(3) Years of schooling
(4) Skill specificity
(5) Industry fixed effects
(6) State fixed effects
(7) Listwise deletion
(8) ωn age effects
(9) 5 66% subsamples
noncognitive
−0.192
(0.036)
−0.191
(0.037)
−0.192
(0.037)
−0.198
(0.040)
−0.187
(0.037)
−0.224
(0.038)
−0.160
(0.042)
−0.189
(0.036)
−0.192
(0.045)
Note: Coefficients for cognitive and noncognitive skill effects in social policy
preferences equation; estimated jointly with all measurement equations.
Sample size is 625 in (7); 551 in (9).
noncognitive skills. Finally, in order to check robustness against the presence of unobserved
heterogeneity, specification (9) creates 5 datasets with one third of all observations deleted
at random and re-estimates the model 5 times on these random subsets. The final estimate
is the average of these 5 models with ‘standard errors’ penalized proportional to the variance
between each set of estimates (Rubin 1987). Under each and every specification I find
clear negative effects of cognitive and noncognitive skills, comparable in size to my main
specification.
7. Conclusion
In this paper I have set out to empirically demonstrate the relevance of cognitive and
noncognitive skills for individuals’ social policy preferences, and how their effect operates
22
via basic labor market outcomes. To do so rigorously, I have created measurement models for
social policy preferences and for cognitive as well as noncognitive skills. Similar to results of
economic research on skills, I establish that an individual’s set of skills can be summarized
by a two-dimensional vector of cognitive and noncognitive skills. These two skill factors
drive observed survey responses, performance in aptitude tests, and labor market choices
and outcomes. Thus, variables such as income, education, or unemployment risk, which are
commonly taken as exogenous in individual level models of preference formation, are all
driven by unobserved skill factors.
By measuring these previously unobserved skill factors, and by using an explicit mediation
model, I am able to show that they shape social policy preferences in a systematic fashion.
Individuals with higher cognitive skills are more likely to hold an advantageous position
in the labor market, and consequently prefer a less active government in matters of social
policy. The same holds for individuals with high noncognitive skills. In fact, those two
effects are almost fully linearly additive, which has two important consequences. First,
noncognitive skills can compensate for lack of cognitive ability in the labor market and vice
versa. This can explain why individuals with low ability – often proxied by low education
in previous research – might still prefer limited social policies. Second, individuals who
command both high cognitive and noncognitive skills are strongly opposed to social policy,
even when controlling for a range of socio-economic characteristics and events. I conducted
a wide variety of robustness checks, which confirmed these basic findings.
My results suggest that commonly unobserved characteristics, such as ability, motivation,
and self-confidence, should be taken seriously in models for individual preferences. They cast
doubt on popular claims of a relationship between ‘intelligence’ and left-wing preferences
based on simplistic models (e.g., Kanazawa 2010, 2009), and thus add a further voice to
existing criticisms (Morton et al. 2011). In line with what we would expect from a basic
political economy perspective, I find that individuals with high cognitive and/or noncognitive
skills succeed in the labor market: They complete higher levels of education, have higher
life-time incomes, and are far less likely to experience unemployment. Consequently, they
prefer a less – and not more – extensive welfare state.
23
References
Alesina, A. and Angeletos, G.-M. (2005). Fairness and Redistribution. American Economic Review, 95,
960–980.
Alesina, A. and Fuchs-Schündeln, N. (2007). Good-Bye Lenin (or Not?): The Effect of Communism
on People’s Preferences. American Economic Review, 97, 1507–1528.
Alesina, A. and La Ferrara, E. (2005). Preferences for redistribution in the land of opportunities.
Journal of Public Economics, 89, 897–931.
Anger, S. and Heineck, G. (2010). Cognitive abilities and earnings – first evidence for Germany.
Applied Economics Letters, 17, 699–702.
Ansell, B. (2013). The Political Economy of Ownership: Housing Markets and the Welfare State.
Working Paper, University of Minnesota.
Ansolabehere, S., Rodden, J. and Snyder, J. M. J. (2008). The Strength of Issues: Using Multiple
Measures to Gauge Preference Stability, Ideological Constraint, and Issue Voting. American Political
Science Review, 102, 215–232.
Benabou, R. and Ok, E. A. (2001). Social Mobility and the Demand for Redistribution: The Poum
Hypothesis. The Quarterly Journal of Economics, 116, 447–487.
Benabou, R. and Tirole, J. (2002). Self-confidence and personal motivation. Quarterly Journal of
Economics, 117, 871–915.
Borghans, L., Duckworth, A. L., Heckman, J. J. and ter Weel, B. (2008). The Economics and
Psychology of Personality Traits. Journal of Human Resources, 43, 972–1059.
Bowles, S., Gintis, H. and Osborne, M. (2001). The Determinants of Earnings: A Behavioral Approach.
Journal of Economic Literature, 39, 1137–1176.
Bronars, S. G. and Oettinger, G. S. (2006). Estimates of the return to schooling and ability: evidence
from sibling data. Labour Economics, 13, 19–34.
Cameron, S. V. and Heckman, J. J. (1993). The Nonequivalence of High School Equivalents. Journal
of Labor Economics, 11, 1–47.
Carneiro, P., Hansen, K. T. and Heckman, J. J. (2003). Estimating Distributions of Treatment Effects
with an Application to the Returns to Schooling and Measurement of the Effects of Uncertainty on
College Choice. International Economic Review, 44, 361–422.
Cattell, R. B. (1987). Intelligence: Its structure, growth, and action. New York: Elsevier.
Cawley, J., Heckman, J. J. and Vytlacil, E. (2001). Three observations on wages and measured
cognitive ability. Labour Economics, 8, 419–442.
Cebi, M. (2007). Locus of Control and Human Capital Investment Revisited. Journal of Human
Resources, 42, 919–932.
Cunha, F., Heckman, J. J. and Navarro, S. (2005). Separating uncertainty from heterogeneity in life
cycle earnings. Oxford Economic Papers, 57, 191–261.
Cunha, F., Heckman, J. J. and Schennach, S. M. (2010). Estimating the Technology of Cognitive and
Noncognitive Skill Formation. Econometrica, 78, 883–931.
Cusack, T., Iversen, T. and Rehm, P. (2006). Risks At Work: The Demand And Supply Sides Of
Government Redistribution. Oxford Review Of Economic Policy, 22, 365–389.
Deary, I. J., Batty, G. D. and Gale, C. R. (2008). Bright children become enlightened adults.
Psychological science, 19, 1–6.
Drago, F. (2011). Self-esteem and earnings. Journal of Economic Psychology, 32, 480–488.
24
Flossman, A. L., Piatek, R. and Wichert, L. (2007). Going Beyond Returns to Education: The Role of
Noncognitive Skills on Wages in Germany. Working paper.
Gelman, A. and Rubin, D. (1992). Inference from Iterative Simulation Using Multiple Sequences.
Statistical Science, 7, 457–511.
Gerber, A. S., Huber, G. a., Doherty, D., Dowling, C. M. and Ha, S. E. (2010). Personality and Political
Attitudes: Relationships across Issue Domains and Political Contexts. American Political Science
Review, 104, 111.
Geweke, J. (1992). Evaluating the accuracy of sampling-based approaches to calculating posterior
moments. In Bernardo, J. M., Berger, J. O., Dawid, A. and Smith, A. (Eds.), Bayesian Statistics 4,
Oxford: Clarendon Press.
Gill, J. (2008a). Bayesian Methods. A Social and Behavioral Sciences Approach. Boca Raton: Chapman
& Hall.
Gill, J. (2008b). Is Partial-Dimension Convergence a Problem for Inferences from MCMC Algorithms?
Political Analysis, 16, 153–178.
Gottfredson, L. S. (1997). Why g matters: The complexity of everyday life. Intelligence, 24, 79–132.
Green, D. A. and Riddell, C. W. (2003). Literacy and earnings: an investigation of the interaction of
cognitive and unobserved skills in earnings generation. Labour Economics, 10, 165–184.
Griffin, P. and Ganderton, P. T. (1996). Evidence on Omitted Variable Bias in Earnings Equations.
Economics of Education Review, 15, 139–148.
Groves, M. O. (2005). How important is your personality? Labor market returns to personality for
women in the US and UK. Journal of Economic Psychology, 26, 827–841.
Heckman, J. J. (2001). Micro Data, Heterogeneity, and the Evaluation of Public Policy: Nobel Lecture.
Journal of Political Economy, 109, 673–748.
Heckman, J. J., Stixrud, J. and Urzua, S. (2006). The Effects of Cognitive and Noncognitive Abilities
on Labor Market Outcomes and Social Behavior. Journal of Labor Economics, 24, 411–482.
Hopkins, K. D. and Bracht, G. H. (1975). Ten-Year Stability of Verbal and Nonverbal IQ Scores.
American Educational Research Journal, 12, 469–477.
Idema, T. and Rueda, D. (2011). Redistribution Preferences and Life-Cycle Income. APSA 2011
Annual Meeting Paper.
Imai, K., Keele, L. and Tingley, D. (2010a). A general approach to causal mediation analysis.
Psychological Methods, 15, 309–334.
Imai, K., Keele, L., Tingley, D. and Yamamoto, T. (2011). Unpacking the Black Box of Causality:
Learning about Causal Mechanisms from Experimental and Observational Studies. American
Political Science Review, 105, 765–789.
Imai, K., Keele, L. and Yamamoto, T. (2010b). Identification, Inference and Sensitivity Analysis for
Causal Mediation Effects. Statistical Science, 25, 51–71.
Iversen, T. and Soskice, D. (2001). An Asset Theory of Social Policy Preferences. American Political
Science Review, 95, 875–893.
Jackman, S. (2008). Measurement. In Box-Steffensmeier, J. M., Brady, H. E. and Collier, D. (Eds.),
Oxford Handbook of Political Methodology, Oxford: Oxford University Press. pp. 119–151.
Jackman, S. D. (2009). Bayesian Analysis for the Social Sciences. New York: Wiley.
Jensen, A. R. (1998). The g factor: The science of mental ability. Westport: Praeger.
Kanazawa, S. (2009). IQ and the values of nations. Journal of Biosocial Science, 41, 537–556.
25
Kanazawa, S. (2010). Why Liberals and Atheists Are More Intelligent. Social Psychology Quarterly,
73, 33–57.
Lang, F. R., Weiss, D., Stocker, A. and von Rosenbladt, B. (2007). Assessing Cognitive Capacities in
Computer-Assisted Survey Research: Two Ultra-Short Tests of Intellectual Ability in the German
Socio-Economic Panel (SOEP). Schmollers Jahrbuch, 127, 183–192.
Lindqvist, E. and Vestman, R. (2011). The Labor Market Returns to Cognitive and Noncognitive
Ability: Evidence from the Swedish Enlistment. American Economic Journal: Applied Economics, 3,
28.
Margalit, Y. (2013). Explaining Social Policy Preferences: Evidence from the Great Recession.
American Political Science Review, 107, 80–103.
Morton, R., Tyran, J.-R. and Wengstr o m, E. (2011). Income and Ideology: How Personality Traits,
Cognitive Abilities, and Education Shape Political Attitudes. Working paper.
Perez, E. O. (2011). The Origins and Implications of Language Effects in Multilingual Surveys: A
MIMIC Approach with Application to Latino Political Attitudes. Political Analysis, 19, 434–454.
Quinn, K. M. (2004). Bayesian Factor Analysis for Mixed Ordinal and Continuous Responses. Political
Analysis, 12, 338–353.
Rehm, P. (2011). Risk Inequality and the Polarized American Electorate. British Journal of Political
Science, 41, 363–387.
Rehm, P., Hacker, J. S. and Schlesinger, M. (2012). Insecure Alliances: Risk, Inequality, and Support
for the Welfare State. American Political Science Review, 106, 386–406.
von Rosenbladt, B. and Stocker, A. (2005). Testerhebung zum SOEP 2006. Persönlichkeit und Alltag,
Verhaltensexperimente, Retest-Studie [Pre-test for SOEP 2006. Personality and Everyday Life,
Behavioral Experiments, Re-test study]. TNS Infratest Sozialforschung, PB20 - 441.
Rotter, J. (1966). Generalized expectancies for internal versus external control of reinforcement.
Psychological monographs, 80, 1–28.
Rubin, D. (1987). Multiple Imputation for Nonresponse in Surveys. Hoboken: Wiley.
Scheve, K. and Stasavage, D. (2006). Religion and Preferences for Social Insurance. Quarterly
Journal of Political Science, 1, 255–286.
Schoon, I., Cheng, H., Gale, C. R., Batty, G. D. and Deary, I. J. (2010). Social status, cognitive ability,
and educational attainment as predictors of liberal social attitudes and political trust. Intelligence,
38, 144–150.
Schuerger, J. M. and Witt, A. C. (1989). The temporal stability of individually tested intelligence.
Journal of Clinical Psychology, 45, 294–302.
Schupp, J., Herrmann, S., Jaensch, P. and Lang, F. R. (2008). Erfassung kognitiver Leistungspotentiale
Erwachsener im Sozio-oekonomischen Panel (SOEP) [Assesssing cognitive abilities of adults in the
Socio-economic Panel]. SOEP Data Documentation, 32.
Shayo, M. (2009). A Model of Social Identity with an Application to Political Economy: Nation, Class,
and Redistribution. American Political Science Review, 103, 147–174.
Skrondal, A. and Laake, P. (2001). Regression Among Factor Scores. Psychometrika, 66, 563–757.
Stegmueller, D. (2011). Apples and Oranges? The problem of equivalence in comparative research.
Political Analysis, 19, 471–487.
Stegmueller, D. (forthcoming). Modeling Dynamic Preferences. A Robust Bayesian Latent Ordinal
Probit Model. Political Analysis.
26
Strenze, T. (2007). Intelligence and socioeconomic success: A meta-analytic review of longitudinal
research. Intelligence, 35, 401–426.
Svallfors, S. (2006). The moral economy of class. Stanford: Stanford University Press.
Svallfors, S. (2010). Policy feedback, generational replacement, and attitudes to state intervention:
Eastern and Western Germany, 1990–2006. European Political Science Review, 2, 119.
Treier, S. and Jackman, S. (2008). Democracy as a latent variable. American Journal of Political
Science, 52, 201–217.
27
A. [Online] Appendix
A.1. Sample descriptives, test participation
Table A.1 gives means (and standard errors) for covariates in my estimation sample, and for
individuals who did and did not participate in cognitive testing. The final column provides a
logit model for test refusal.
28
Table A.1: Means of full sample, test-participants and non-participants. Logit model for probability
of cognitive test refusal. Estimates and standard errors.
Estimation
sample
Age
Permanent Income
Education
Elementary
General vocational
Vocational + degree
Higher education
Work experience
Grown up East
Living East
HH size
House owner
Foreigner
Divorced
Unemployment exp.
Union member
Self-employed
N
a
b
44.8
(0.4)
1753
(33)
Test
participants
44.2
(0.5)
1766
(35)
Test
refusers
46.6
(1.1)
1705
(79)
0.17
(0.01)
0.15
(0.02)
0.25
(0.04)
0.57
(0.02)
0.13
(0.01)
0.13
(0.01)
21.5
(0.5)
0.13
(0.01)
0.12
(0.01)
2.83
(0.05)
0.45
(0.02)
0.14
(0.01)
0.08
(0.01)
0.44
(0.02)
0.28
(0.02)
0.09
(0.01)
0.59
(0.02)
0.13
(0.02)
0.13
(0.02)
21.3
(0.5)
0.14
(0.02)
0.13
(0.01)
2.86
(0.06)
0.44
(0.02)
0.12
(0.01)
0.07
(0.01)
0.44
(0.02)
0.28
(0.02)
0.08
(0.01)
0.52
(0.04)
0.11
(0.03)
0.11
(0.03)
22.3
(1.1)
0.09
(0.02)
0.09
(0.02)
2.71
(0.12)
0.46
(0.04)
0.21
(0.03)
0.11
(0.03)
0.44
(0.04)
0.26
(0.04)
0.13
(0.03)
827
625
Logit model for test refusal. Correctly classified cases: 77.9%.
Reference category
29
202
Refusal
Probabilitya
0.076
(0.022)
0.000
(0.000)
0.688
(0.258)
–b
−0.172
(0.324)
−0.406
(0.387)
−0.066
(0.022)
−0.733
(0.784)
0.453
(0.788)
−0.091
(0.084)
0.392
(0.227)
0.797
(0.281)
0.517
(0.356)
0.042
(0.222)
−0.043
(0.233)
0.746
(0.333)
827
3.0
●
2.5
●
●
Eigenvalue
●
Full item set
Reduced item set
2.0
●
1.5
●
1.0
●
●
●
0.5
●
●
●
0.0
1
2
3
4
5
Component
Figure A.1: Eigenvalues of noncognitive skill measurement items. Full item set (10 items) and
reduced set (5 items) used in analysis.
A.2. Exploratory analysis of measurement items
Noncognitive skills
Eigenvalues of the correlation matrix of the original 10 set of items
are given in Figure A.1. This suggest that three factors should be used. However, three and
two factor solutions yield non-parsimonious solutions characterized by high cross loadings
of items on different factors. A more straightforward model is achieved with a reduced set
of five items; Eigenvalues of the correlation matrix suggest that one factor suffices.
Panel (A) of Table A.2 lists items used in the noncognitive skill measurement model. Items
are originally recorded on five point agree–disagree scales, but since outer categories are
very sparsely populated I collapsed responses to three or four categories (this does not
influence the distribution of the latent noncognitive skill variable or my final results). As
can be seen from the last column, few individuals refused to respond to these items. I deal
with missing responses as part of my measurement model.
Redistribution
Exact wording and details of items used in the redistribution measurement
model are given in panel (C) of Table A.2. Items are originally recorded on five point agree–
disagree scales, but since outer categories are very sparsely populated I collapsed responses
to three or four categories (this does not influence my final results, but removes thresholds
to be estimated from the IRT model). Nonresponse is low for all items. Eigenvalues of
the correlation matrix of redistribution items are given in Figure A.2. They suggest that a
one-factor model provides a good summary of the data.
30
Table A.2: Items used in measurement models
Item
categories
Missing %
3
3
4
3
3
1.2
1.7
1.0
0.6
1.1
–
–
–
24.4
24.4
24.4
3
3
3
3
4
0.7
0.7
0.9
0.9
1.0
(A) Locus of control items
I doubt my abilities when problems arise
I haven’t achieved what I deserve
What you achieve depends on luck
Others make the crucial decisions in my life
I have little control over my life
(B) Symbol correspondence test
SCT correct 1-30 sec.
SCT correct 31-60 sec.
SCT correct 61-90 sec.
(C ) State responsibility for financial security
When Sick
In Old-Age
When unemployed
When Requiring Care
For Family
4
●
Eigenvalue
3
2
1
●
●
●
●
4
5
0
1
2
3
Component
Figure A.2: Eigenvalues of redistribution measurement items.
31
3.0
Eigenvalue
2.5
●
2.0
1.5
1.0
0.5
●
●
0.0
1
2
3
Component
Figure A.3: Eigenvalues of cognitive skill measurement items.
Cognitive skills
Figure A.3 shows (not surprisingly) that the three cognitive ability mea-
surements should be summarized by one factor. Eigenvalues are correlated based on the
correlation matrix of observed values. This excludes 24% of respondents who refused to
participate in the cognitive test (cf. panel (B) of Table A.2). I deal with missing responses as
part of my measurement model, and I conducted robustness tests of my final results using
both imputed and listwise-deleted data.
32
A.3. Estimates of controls used in skill outcome equations
Table A.3: Full table of skill outcome equations. Posterior means and 95% HPD regions.
Cognitive skills
Non-cognitive skills
Foreign
East
Age
Income
School choice
Unemployment
0.265
[0.201 0.327]
0.293
[0.225 0.352]
−0.306
[−0.445 −0.154]
−0.823
[−0.979 −0.680]
0.001
[−0.146 0.149]
0.223
[0.159 0.291]
0.22
[0.148 0.291]
−0.348
[−0.516 −0.174]
−0.251
[−0.333 −0.165]
−0.184
[−0.264 −0.106]
0.583
[0.393 0.770]
0.571
[0.387 0.760]
−0.3
[−0.373 −0.231]
Parental education
Work experience
0.322
[0.261 0.381]
0.219
[0.090 0.350]
Note: Based on 10,000 MCMC samples.
A.4. Full table of social policy preference equation including controls
33
Table A.4: Estimated effects of cognitive and noncognitive skills on redistribution preferences. Posterior means and standard deviations.
(1)
Skills
Cognitive
Noncognitive
(2)
−0.146
(0.033)
−0.227
(0.035)
Controls
Age
(3)
−0.139
(0.033)
−0.222
(0.036)
(4)
−0.149
(0.037)
−0.206
(0.036)
−0.061
(0.032)
−0.022
(0.033)
−0.174
(0.061)
0.204
(0.089)
0.288
(0.117)
0.079
(0.065)
−0.355
(0.119)
0.145
(0.114)
HH size
House owner
Foreigner
Divorced
Union member
Self-employed
Living east
Note: Shown are coefficients from redistribution equation estimated jointly with system of
measurement equations. Based on 10,000 MCMC samples.
34
A
0.6
less than HS
high school
more than HS
0.5
0.4
0.3
0.2
0.1
0.0
−3
B
−2
−1
0
ωc
1
0.5
2
3
less than HS
high school
more than HS
0.4
0.3
0.2
0.1
0.0
−3
y cic
n
y ni
ωc
ωn
µ�c
c
c c
2
y−2
+ ψ c y c =0n λ c ω c + ψ1c
ci = λ c ω i −1
c i
ci
n y c = λncc ω c +nψ cn ω
y ni =ciΦ(τ nt −i λ n ωyin) = Φ(τ n − λ n ω n )
nt
n i
ni
y n = Φ(τ n c− λ nncω ni )
cω c c∼niN K (µ c , nt
c
c
�
)
= λ c ω i + ψ c k k ω ∼ N K (µ k , � k )
ω cN∼K (µ
N n(µ cn, � c ) �, . . . ,nK n
n∼
ω n nt
= Φ(τ
− λ nn ωKkni,)�kk ),ωk nk ∼= N
K (µ k , � k ), k = �, . . . , K
n
n
n
c ωc ∼
n cN K (µ kc , � k ),
n k = �, . . . , K
c
n
µ
=
µ
=
�,
λ
=
λ
=
�
� )
�
∼ N K�(µ k , �
µc �� = µn � = �, λ�c = λ�n = �
c k n
µ = nµ� = �, λ� = λ� = �
∼ N KK(µ∼kn�Multinomial(π)
, � k ), k = �, K
. . .∼, Multinomial(π)
K
K
∼
Multinomial(π)
= µ�n = �, λ�c = λ�n = �
3
K = � ∶ π̂ = [�.��, �.��]K = � ∶ π̂ = []
K ∼ Multinomial(π)
K = � ∶ π̂ = []
Figure A.4: Distribution of non-normal latent skill factors
K = � ∶ π̂ = []
A.5. Non-normal latent skill factors
Figure A.4 shows the distribution of posterior means of cognitive and noncognitive latent
skills factors using a more flexible distributional specification. Specifically I use a finite
mixture of normals distribution (details are given at the bottom of Figure Figure A.4). The
resulting distribution is substantively similar to the one used in the main text of the paper.
35