A closer look at the effects of within-school ability grouping in secondary schools: how are different performers affected?1 An investigation in four OECD countries Miroslav Beblavy, Anna-Elisabeth Thum, Galina Potjagailo, Marten von Werder Abstract The relationship between ability grouping and equality of educational opportunities has been subject of much debate. We examine this relation for differently performing students in 4 OECD countries. Our paper shows that, depending on the student’s relative location on the PISA performance distribution, within-school ability can increase or decrease educational equality to a different extent. Quantile regression results show that in Belgium low performers are affected most by inequality effects, whereas in Austria high performers are most affected. In Finland we do not find a significant relationship. In Hungary, we find that high performers are most affected by equality effects. Keywords: ability grouping, quantile regression, PISA scores JEL classification: I24, J24, Z18 1 The working paper version was funded by the European Commission within the Seventh Framework Programme under the project NEUJOBS (www.neujobs.eu). The funding source played no further role than provision of funding and peer review. 1 1) Introduction Reducing or postponing ability grouping has been widely recognized as a policy measure that contributes to increasing the equality of educational opportunities (Causa and Chapuis, 2009; OECD, 2004, 2007, 2010; Schuetz, Ursprung and Woesmann, 2005; Hanushek and Woessmann, 2005; Woessmann, 2009). However, while effects of between-school ability grouping are widely studied, the effects of within-school ability grouping on equality of educational opportunities are less frequently researched in the economic literature as scholars focus more strongly on acrossschool ability grouping, also termed tracking. In this paper we examine how within-school ability grouping can be associated with equality of educational opportunity measured by the socio-economic gradient2 in four OECD countries. Using quantile regressions we take a closer look at this effect by disaggregating it across the performance distribution in the OECD’ Programme for International Student Assessment (PISA). Distributional effects can be of interest as they are not necessarily equal: students performing less well might benefit more than higher performers from non-grouped lessons due to peer effects, whereas higher performers might not gain as much in terms of educational performance3. To our knowledge this is the first paper that examines the relation between education policies and equality of educational opportunities across the ability distribution in an international comparison. Quantile regression results show that the four countries differ in terms of how withinschool ability grouping is associated with equality of educational opportunities across the performance distribution. In Belgium low performers are more strongly concerned by inequality effects of within-school ability grouping, whereas in Austria high performers are more concerned. In Hungary, we find that within-school ability grouping is positively correlated with equality and more so for the high performers. In Finland within-school ability grouping does not show to have a significant relation with equality. Our paper shows that, depending on the student’s relative location on the PISA performance distribution, within-school ability can 2 We use ‘socio-economic gradient’ interchangeably with ‘equality of educational opportunity’, ‘educational equality’ and ‘equality in education’. The socio-economic gradient is usually understood as the dependence of educational outcomes on family background (Causa and Chapuis 2009). Section 3 discusses this concept in more detail. 3 Hanushek and Woessmann (2006) speak of non-linear peer effects. 2 increase or decrease educational equality to a different extent. It further shows that a regression at the average of the PISA distribution may not show the entire picture. The remainder of the paper is organised as follows: Section 2 analyses the previous literature; Section 3 describes our dataset; Section 4 presents descriptive statistics and Section 5 explains the econometric model. In Section 6 we present the results and Section 7 concludes. 2) Previous literature In the economic literature equality of educational opportunities is usually based on Roemer (1998, 2008), who defines educational equality as the independence of educational outcome from social origins. This understanding is strongly linked to the notion of the socio-economic gradient, which is defined as the correlation between educational outcomes and family background. Studies examining Roemer’s concept of equality of educational opportunities and their determinants can be grouped into those focussing on (1) the effect of family background on test scores; (2) the effect of policies on the socio-economic gradient; (3) the effect of school characteristics on the socio-economic gradient or on test scores and (4) peer effects and equality. Woessmann (2004) examines the effect of family-background on student achievement in 18 countries using quantile regression through the impact of socio-economic variables on Third International Mathematics and Science Study (TIMSS) test results across the TIMSS performance distribution. He finds that the impact of family background on scores is increasing along quantiles for some countries and decreasing for a few others. Fertig (2003) uses quantile regressions to study the effect of family background on PISA test scores in Germany. Also using a quantile regression approach Hindricks, Verschelde, Rayp, Schoors (2010) find that the school environment has a more heterogeneous effect for pupils of lower performance in Belgium. Hanushek and Luque (2004) find that in the most OECD countries the effect of family background on test scores diminishes across students’ age. A set of cross-country studies examines the effect of education policies on educational inequality. Causa and Chapuis (2009) conduct an analysis for 28 OECD countries using PISA 2006 test scores in science. The authors find that within-school ability grouping increases inequality across countries. Schuetz, Ursprung and Woessmann (2005) use the TIMSS database and confirm that late tracking increases educational equality. Using a combination of the PISA 3 and PIRLS data, Hanushek and Woessmann (2004) apply a difference-in-difference methodology to eliminate cross-country differences and find that early tracking increases inequality in education and decreases the mean performance. Ammermueller (2005) follows Hanushek and Woessmann’s (2004) approach and finds that tracking benefits students from a better social background. Raitano and Vona (2011) find that the effects of different ability peer groups vary across different tracking schemes. Other studies analyse the effect of education policies on educational inequality on a national level. Figlio and Page (2001) examine the effects of tracking policies in the United States using the NELS database (National Educational Longitudinal Survey). They do not find a negative effect for tracking policy for low ability children. Betts and Shkolnik (2000) examine the effects of ability grouping on educational achievement in secondary education in the US and – contrary to the majority of the relevant literature – do not find a significant effect. Muehlenweg (2007) studies the effects of secondary school tracking in Germany on educational outcomes by comparing the outcomes of those who were tracked after the 4th grade and those who were tracked after the 6th grade (participants of the so-called orientation stage) in the state of Hessen. She finds that the effects of later tracking are more positive for the low ability groups than for the rest of the ability distribution. A set of studies focusses particularly on within-school ability grouping and its effects. The results, however, appear to be ambiguous: Based on a meta-analysis, Hattie (2009) does not find significant effects of ability grouping on pupils performance while even checking for potential effects for different ability groups in OECD countries. In contrast to that, a recent study by Collins and Gan (2013) found significant improvement in math and reading scores after sorting pupils according to their performance. The authors control for different selection procedures and hereby point out that the applied measures indeed may provide different conditions of ability grouping that could affect the result of streaming. Hallam (2003) collected data on ability grouping practices in the UK but does not relate them to pupil’s performance. However, to the best of our best knowledge, there is indeed a gap in the academic documentation providing a systematic typology of different grouping practices. A set of studies use quantile regression to assess the effect of school characteristics on educational equality. Basset, Tam and Knight (2002) study the econometric impact of high 4 school characteristics on student performance using quantile regression in the US and find that high school characteristics have differing effects on achievement at different points on the achievement distribution. Eide and Showalter (1998) also use US data and find that performance at the top of the conditional distribution of maths score changes is improved by a lengthened school year while performance at the bottom of the distribution appears not to benefit from the extra class time, other factors being considered equal. Collier and Millimet (2009) look at the effects of institutional characteristics on test scores in the TIMMS data base and show that it is beneficial to study the distributional effects of institutional arrangements on education outcomes, since the effects of institutional arrangements on test scores vary across quantiles. RangvidSchindler (2007) studies the effects of a socio-economic mix on educational outcomes in Denmark and finds that school composition effects vary across ability quantiles. As mentioned above, Peer group effects can have a beneficial effect on educational performance and are more likely to be present if tracking is postponed. A set of papers looks at the effect of characteristics of peer groups on educational outcomes across ability quantiles. Levin (2001) looks at the effects of class size and finds that a smaller class size emphasises peer effects. Averett and McLennan (2004) provide a comprehensive report on research and empirical findings of the effect of class size on education outcomes, drawing both from the educationalist and economic literature. Leuven and Ronning (2011) examine the effects of multi-age grouping within the same classroom and find that it has a positive effect on the performance of the whole group to mix different grades but depends on the proportion of lower and higher grades. Schneeweis and Winter-Ebmer (2007) look at peer effects in Austrian schools and find that positive peer effects are the largest for those with lower ability. The effect of peer effects in schools is further analysed in Toma and Zimmer (2000), Glewwe (1997) and Rivkin (2001), for example. As we can see, there is no consensus in the literature on effects of ability grouping, both generally and with regard to groups of different abilities. It appears that different countries and different groups at the ability scale react differently. Despite this ambiguity the OECD has been strongly pushing for a policy change towards later tracking, especially with its legitimacy buttressed by PISA. Therefore, analyzing the effects of ability grouping at a country level in a rigorous comparative framework should be useful. 5 3) Data We use the PISA (2009) database. The PISA programme tests 15-year old students in terms of their cognitive skills in mathematics, reading and science in OECD countries. In these fields PISA measures in particular students’ ability to apply their knowledge in different contexts (Ammermueller, 2005). The PISA sampling design is based on a two-step procedure: first a set of schools are selected and then students are drawn randomly from the resulting population (Ammermueller, 2005 and PISA, 2009). The test scores are computed using a mixed coefficients multinomial logit model by Adams, Wilson and Wang (1997) in order to produce an ‘ability scale’ measuring cognitive skills of the students in the sample (see PISA 2009, chapter 9 for more details). A generalised form of the Rasch model4 from item response theory is used in psychometrics to evaluate ability. In addition to test scores, the PISA database contains background information for both students and schools. The following table gives an overview of the variables used in our analysis. Table 1: Variables used in the empirical analysis Variable Description PISA score (mathematics, science and reading) ESCS score (economic, cultural and social score) Ability grouping The PISA test scores are computed as described above, scores are distributed around an international mean of 500 and a standard deviation of 100 test score points. The ESCS score comprises the students’ answers on home possessions, books in the home, parental occupation and parental education expressed as years of schooling. This variable is retrieved from the PISA school questionnaire. Ability grouping is measured through the school principals’ indications on whether ability grouping between classes exists in their school and if it is conducted for all pupils or only some. For this study we redefine the variable and only distinguish between schools which ability grouping takes place and schools in which ability grouping is not conducted. Table A1 shows the shares of grouped and non-grouped students per country. The mean of the ESCS scores over all students within each student’s school, which should account for school environment effects. Gender Migration background first generation Migration background second generation (indicated by the ESCS school-mean Control variables 4 For a comprehensive overview of the Rasch model and other psychometric models, see Rabe-Hesketh and Skondral (2004), Generalised latent variable modeling: multilevel, longitudinal and structural equation models, Boca-Raton, FL, Chapman and Hall/CRC. 6 father only) Foreign language spoken at home Mother at home Father at home 4) Descriptive statistics and country selection In order to understand how within-school ability grouping might affect the socioeconomic gradient for different PISA score performers, we assess conditional PISA performance distributions for 27 European OECD countries5 using the PISA maths score (see Figure A1). The conditional distributions depict the PISA performance by socio-economic background and ability grouping. We assign pupils to three ranked and equally sized groups according to the score on the PISA Economic, Social and Cultural Score6. We then compare the performance distribution of pupils belonging to the lowest and the highest ESCS score group. We then examine whether the differences in math scores between high- and low-ranking pupils differs between the subsamples of ability grouped pupils and non-grouped pupils. We do so by using (meanstandardized) QQ-Plots and kernel density estimates (see Figures 1-4 and A1). Based on how the conditional PISA performance distributions react to ability grouping, we select four countries which represent typical cases of the effect of within-school ability grouping on the socio-economic gradient. In these countries a higher social status as measured by the ESCS score entails ceteris paribus better results in the PISA tests but the effect of ability grouping in fact differs considerably across countries. We select Finland, Austria, Belgium and Hungary. In Finland (see Figures 1 and A1), the mean standardized kernel densities reveal almost identical shapes for the distributions of math scores across ability grouping and non-grouping even though there are level differences in scores according to the ESCS score. We therefore conclude that ability grouping in Finland affects pupils of high and low status in a very similar way and also affects pupils of different ability levels almost equally. In the QQ-Plot below this 5 These countries are Austria, Belgium, Bulgaria, Czech Republic, Denmark, Estonia, Finland, Germany, Greece, Hungary, Ireland, Italy, Latvia, Lithuania, Luxemburg, Netherlands, Norway, Poland, Portugal, Romania, Slovakia, Slovenia, Spain, Sweden, Switzerland, the United Kingdom and the United States. We exclude France as a variable for ability grouping is not available. 6 We restrict the observations for the following figures to that third of observations with the highest (“high status”) and the third of observations with the lowest (“low status”) ESCS score, i.e. we omit the third of the observations with a medium score in ESCS. 7 lack of effect is depicted by the fact that almost all dots lie on the diagonal. Some other countries show very similar results to the Finish case: Italy, Denmark, Sweden, Latvia, Lithuania, Estonia and Norway. It is apparent that these countries track pupils according to performance generally late (see Table A1) so that a potentially differentiating effect of tracking cannot be compounded by additional ability grouping. [Figure 1 here] The second selected country is Austria (see Figures 2 and A1), where ability grouping takes a differentiating effect: For pupils with a high social status that did not witness ability grouping, the performance distribution is much narrower than the distribution for grouped pupils and no ability grouping equalises performance at least for high status pupils. The same pattern, while being less pronounced, is visible for Germany and the Netherlands. A second observation is raised by non-mean-standardized QQ-Plots: The difference in test results between high and low status pupils are smaller in the non-grouping subsample. In particular high achieving low status pupils seem to benefit from the absence of ability grouping. Correspondingly, the nonmean standardized kernel densities show that the non-grouping almost fully compensates low status pupils for their status disadvantage: They score similarly well as high status grouped pupils. These differences would not be identified by simple OLS regression techniques, which justify the application of quantile regressions later in this paper. Both observations suggest that ability grouping enlarges performance differences. [Figure 2 here] In Belgium (see Figures 3 and A1) the kernel densities reveal that generally the standard deviation in the performance of low status pupils is higher than for high status pupils. But it is also visible, that for these low status pupils ability grouping seems to even enlarge the dispersion of the performance record. Ability grouping therefore differentiates the performance of low status pupils. The mean standardized QQ-Plots confirm this finding of slightly more pronounced tails in the ability grouped subsamples. The non-standardized QQ Plots then show that for low and medium achieving low status pupils the achievement deficit compared to high status pupils is much smaller in the non-grouped subsample. Hence, the Belgian case depicts somehow the pendant to the Austrian one: In Austria high status pupils witness a strong differentiation through 8 ability grouping, in Belgium this effect can be observed for low status pupils. The Polish case has some similarities with the Belgian. [Figure 3 here] The fourth selected country is Hungary (see Figures 4 and A1), where ability grouping show to equalise achievements in test scores for high status pupils. This is a unique finding and contradicts the expected effect of ability grouping. The mean standardized kernel density shows a much narrower form for high status pupils in ability grouping. The mean standardized QQ Plots correspondingly show fatter tails for low status pupils in ability grouping. The nonstandardized QQ Plots in turn reveal that particularly high achieving low status pupils seem to fall behind in the non-grouped subsample. [Figure 4 here] These four countries depict the main effects ability grouping has taken in the 27 OECD countries. In the following section we will shed further light on these cases by estimating quantile regressions. 5) Econometric Methodology We base our econometric analysis on Roemer’s definition of educational equality (see Section 2). The educational achievement of a child, in Roemer’s view, should be independent from social background. We set up an econometric model in which we test whether ability grouping reduces dependence between educational achievement and parental background. The model contains an interaction term, which measures the effect of ability grouping on the socioeconomic gradient. 5.1 Econometric model Our econometric analysis builds on Woessmann (2004), who applies a quantile regression approach to individual-level data in a set of countries to analyse the effect of social background on achievement across the PISA performance distribution. The paper is also related to work by Causa and Chapuis (2009) who conduct a cross-country panel data analysis of the role of education policies – including ability grouping - in equality of educational opportunities. In this paper, for each country the following econometric model is tested: 9 ∑ where is the individual PISA maths score; is the individual family background score; indicates whether within-school ability grouping was present at the school the individual attended and represents the set of control variables (immigration background, gender, language spoken at home and an indicator of whether the individual’s mother and father live at home). ∑ represents the mean ESCS score across individuals in school s of individual i. We include this term in order to account for school environment effects7. denotes an error term. We are principally interested in the estimation of , which represents the interaction effect between the ability grouping indicator and the family background. It is a measure of the effect of within-school ability grouping on the socio-economic gradient, which measures the effect of the family background on the PISA score. We are interested in how the effect of within-school ability grouping on the socioeconomic gradient varies over the conditional PISA performance distribution. We estimate the equation above for each country by quantile regression, which allows assessing how independent variables affect the shape of the conditional response distribution (Koenker, 2005). The distributional effects can provide evidence of whether within-school ability grouping has a heterogeneous effect across the conditional performance distribution. To compute the quantile regression coefficients the absolute sum of errors is minimized. To produce estimates at different ends of the error distribution the absolute sum is weighted: to compute coefficients for the lower quantiles, the lower end of the empirical conditional distribution receives more weight and for the higher quantiles the higher end of the conditional distribution receives more weight. 7 This measure is created by averaging the ESCS across individuals in the respective school of the respective individual. 10 5.2 Endogeneity issues As mentioned in Section 3, as indicator of within-school ability grouping we use a variable measuring the presence of ability grouping at the school level reported by the principal. The decision to perform within-school ability grouping is not prescribed by a nationwide policy in the four countries we examine (see Section 2) and the decision of a principal to conduct within-school ability grouping could therefore be correlated with school characteristics: an unfavourable school environment could lead to the introduction of within-school ability grouping and simultaneously affect PISA performance. We account for this possibility by including the average level of the social background variable (ESCS) over all students from their respective school. The school average of the ESCS variable should capture school environment effects by accounting for the socio-economic composition of the students’ school and for peer effects. Nevertheless, we cannot exclude the possibility that there are some school characteristics, other than the school environment effect stemming from socio-economic background, that drive both the presence of within-school ability grouping and the achievement of students. A further source of endogeneity might arise on the individual level: the probability of visiting a school with or without within-school ability grouping could be correlated with innate ability, which also affects PISA performance. The question therefore arises whether the principal’s decision to practice within-school ability grouping is related to the level innate ability. We argue firstly, that there are other and stronger factors at play in this decision such as socio-cultural factors of the community, elitism, pressure from different groups and preferences. Secondly, when believing in ability level as a determinant in the principal’s decision of whether or not to conduct within-school ability grouping, one can argue both ways: one could either argue that schools with mainly students of low levels of innate ability may decide to conduct ability grouping in order to help students in their specific needs or vice versa, that within-school ability grouping would be conducted in institutions with students of high innate ability to select the best and enhance their competences to a maximum. 6) Results The results are displayed in Table A2 and summarized in Table 2. 11 Table 2: Estimates of Interaction term from quantile Regressions, per country Estimates for Interaction term q20 q50 q80 1.567 11.96*** 15.26*** Austria (3.293) (2.783) (1.840) Belgium 9.333*** 6.167*** 5.215*** (2.218) (2.077) (1.938) -1.110 1.442 -2.977 Finland (4.284) (3.009) (2.754) -4.919** 7.028*** Hungary -2.198 (2.458) (2.197) (2.678) In Austria within-school ability grouping is positively correlated with the effect of the socio-economic background on the PISA score, which means that within-school ability grouping will reinforce the effect of the socio-economic background and decrease educational equality. This social stratification is reinforced by ability grouping most strongly for the better performing students and socio-economic background has a stronger effect for the more able than for the less able. In Belgium, on the other hand, ability grouping reinforces stratification more for the less able. Belgium can be seen as a ‘polarising’ country: ability grouping stratifies and does so mostly for those at the lower end of the performance distribution. In Hungary, ability grouping is associated with higher equality. The negative estimates indicate that the total effect of the social background on PISA performance is smaller for pupils at schools with ability grouping. While being insignificant for the less able, the equalising effect of ability grouping in Hungary exists for medium performing pupils and is particularly pronounced for high achieving pupils. However, the direction of the effect is most uncommon. In our sample of 27 countries, negative estimates for the interaction effect are generally rare and only Slovakia and Hungary show negative estimates for the interaction term over the whole performance distribution. The estimates for Finland do not indicate a certain effect of ability grouping. First, the estimates are insignificantly different from zero. Second, the changing signs don’t even allow recognizing a tendency for the entire performance distribution. Ability grouping does not seem to 12 matter for the performance in the PISA test and has no impact on the educational equality in Finland. It would be useful to link these empirical findings to a documentation of within-school ability grouping measures (as in e.g. Hallam et al., 2003 for the UK) in these four countries. However, very few sources examine the practice of within-school ability grouping in a systematic way. Only for Finland we find suitable sources: Kupiainen et al. (2012) and OECD (2012) document that within-school ability grouping as a mean to meet the needs of pupils with rather different performance records is most uncommon. Ability grouping only occurs during the first 6 grades by selective classes, i.e. in music or if certain languages are chosen. These findings in fact match our descriptive statistics8 and regression results which show that there is virtually no difference in math scores across schools with or without ability grouping. Even in the simplest specification, the estimates for ability grouping and the interaction term remain insignificant. 6) Conclusion In this paper we set out to investigate the relationship of within-school ability grouping with equality of educational opportunities across the PISA performance distribution. We examined the distributional effect of an interaction term between a family background indicator and an indicator for within-school ability grouping. The paper reveals that the effects of ability grouping remain highly controversial. This holds for the general effect of ability grouping just as for the effect over different ability groups. We observe that within-school ability grouping evokes different reactions across countries and different reactions across the ability scale. The OECD nevertheless has been a strong campaigner for a policy change towards less abilitygrouping and particularly invoked PISA results. Therefore, a more systematic and comparative analysis of the effects of ability grouping at the country level should be implemented. In particular, we find that the effect of within-school ability grouping on equality of educational opportunity is heterogeneous across the four countries we study and that the effect of within-school ability grouping on equality of educational opportunities is not the same across the ability distribution. Quantile regression results indicate that in Austria within-school ability 8 A Kolmogorov-Smirnov equlity of distribution test confirms the finding and attributes the slight differences to chance. 13 grouping is associated with stronger inequality, especially for more able children: talent might be wasted if the socio-economic background is low. In Belgium within-school ability grouping is associated with higher educational inequality, especially for less able children: stratification is reinforced for the less able. In Hungary within-school ability grouping is correlated with a higher equality especially for the more able: the policy has a beneficial effect especially for the more able. In Finland, we do not find within-school ability grouping to have a significant effect on educational equality. Policy-makers should take into account that an educational policy might have a different effect for different children. Our paper further shows that a regression at the average of the PISA distribution may not show the entire picture. Our findings imply that within-school ability grouping might have a different effect for different children and the effect is further mediated by institutional effects of a country educational system. Therefore, policy-makers should be careful in application of international one-size-fits-all recommendations. Understanding one’s own education system and how it interacts with earlier or later tracking is essential for policy-making, regardless of political objectives. 14 References Adams, R., Wilson, M., & Wang, W. (1997), The Multidimensional random coefficients multinomial logit model, Applied Psychological Measurement, 21(1), 1-23 Ammermüller, A. (2005), Educational Opportunities and the Role of Institutions, ZEW Discussion Papers 05-44, ZEW - Zentrum für Europäische Wirtschaftsforschung / Center for European Economic Research Averett S. L. and McLennan M.C. (2004), “Exploring the effects of class size on student achievement: what have we learned over the past two decades?” in Geraint J. and Johns J. (Eds), International handbook on the economics of education, Cheltenham: Edward Elgar, pp. 329367. Bassett, G.W., Tam, M.-Y. and Knight, K. (2002) “Quantile models and estimators for data analysis” International Journal for Theoretical and Applied Statistics, Volume 55, 2002, 17-26. Betts JR, Shkolnik JL. (2000) “The effects of ability grouping on student achievement and resource allocation in secondary schools” in Economics of Education Review 19(1):1-15. Causa, O. and Chapuis C. (2009): Equity in student achievement across OECD countries an investigation of the role of policies, OECD. Collier, T. and Millimet, D. (2009), “Institutional arrangements in educational systems and student achievement: A cross-national analysis”. Empirical Economics, 37(2), 329-381. Eide, E., Showalter M.H. (1998): “The effect of school quality on student performance: A quantile regression approach”. Economics Letters, Vol. 58, No. 3, pp. 345-350. Fertig, M. (2003), Who’s to blame? The determinants of German Students’ Achievement in the PISA 2000 Study. IZA Discussion Paper 739. Bonn: Institute for the Study of Labor. Figlio D. N. and Page M. E. (2000), School Choice and the Distributional Effects of Ability Tracking: Does Separation Increase Equality?, NBER Working Papers 8055, National Bureau of Economic Research, Inc. 15 Glewwe, P. (1997), "Estimating the impact of peer group effects on socioeconomic outcomes: Does the distribution of peer group characteristics matter?" Economics of Education Review, Elsevier, vol. 16(1), pages 39-43, February. Hanushek, E. A. and, J.A. Luque., 2003. "Efficiency and equity in schools around the world," Economics of Education Review, Elsevier, vol. 22(5), pages 481-502, October. Hanushek, E.A. and L. Woessmann (2005). "Does Educational Tracking Affect Performance and Inequality? Differences-in-Differences Evidence across Countries," NBER Working Papers, 11124, National Bureau of Economic Research, Inc. Hattie, J. (2009), Visible learning: A synthesis of over 800 meta-analyses relating to achievement, Routledge, United Kingdom. Hindriks J. G., Verschelde M., Rayp G., and Schoors K. (2010), School tracking, social segregation, and educational opportunity: evidence from Belgium, CORE DP n°2010-8. Koenker, R. (2005) "Quantile Regression," Cambridge Books, Cambridge University Press, number 9780521608275, 7. Levin J. (2001). "For whom the reductions count: A quantile regression analysis of class size and peer effects on scholastic achievement," Empirical Economics, Springer, vol.26(1), pages 221246. Leuven, E. and M. Ronning (2011): Classroom Grade Composition and Pupil Achievement, IZA Discussion Paper no. 5922, Institute for the Future of Labor, Bonn, Germany. Mühlenweg, A. (2007), Educational Effects of Early or Later Secondary School Tracking in Germany, ZEW Discussion Paper No. 07-079, Mannheim. OECD (2004), Learning for tomorrow’s world – first results from PISA 2003, OECD, Paris. OECD (2007), PISA 2006: Science competencies for tomorrow’s world, OECD, Paris. OECD (2010), PISA 2009 results: overcoming social background – equity in learning opportunities and outcomes (volume II), OECD, Paris. 16 OECD (2012), Equity and Quality in Education: Supporting Disadvantaged Students and Schools, OECD, Paris. PISA (2009), Technical Report, Paris: OECD Publishing. http://www.oecd.org/document/19/0,3746,en_2649_35845621_48577747_1_1_1_1,00.html Rabe-Hesketh, S. and Skondral. A (2004), Generalised latent variable modelling: multilevel, longitudinal and structural equation models, Boca-Raton, FL, Chapman and Hall/CRC. Raitano, M. and Vona, F. (2010): Peer Heterogeneity, Parental Background and Tracking: Evidence from PISA 2006. Documents de Travail de l'OFCE 2010-23, Observatoire Francais des Conjonctures Economiques (OFCE). Rangvid-Schindler B. (2007), “School composition effects in Denmark: quantile regression evidence from. PISA 2000”, Empirical Economics 32: 359-386. Rivkin, S. (2001), Tiebout Sorting, Aggregation and the Estimation of Peer Group Effects, Economics of Education-Review 20 (3), 201-09. Roemer, J.E. (1998), Equality of Opportunity, Cambridge, MA: Harvard University Press. Roemer, J.E. (2008): “Equality of opportunity.” The New Palgrave Dictionary of Economics Online. Palgrave Macmillan. Available from http://www.dictionaryofeconomics.com/article?id=pde2008_E000214 (Accessed 29.Nov. 2011) Schneeweis N., and Winter-Ebmer R. (2007), "Peer effects in Austrian schools," Empirical Economics, Springer, vol. 32(2), pages 387-409, May. Schütz G. and Ursprung H.and Woessmann L. (2005), Education Policy and Equality of Opportunity, Munich: CESifo Working Paper Series No. 1518. Toma, E. and Zimmer R (2000), “Peer Effects in Private and Public Schools Across Countries”, Journal of Policy Analysis and Management 19 (1), 75-92. Woessman, L. (2004), How equal are educational opportunities? Family background and student achievement in Europe and the US, Munich: CESifo Working Paper No.1162 17 Woessmann, L. (2008): Efficiency and equity of European education and training policies, International Tax and Public Finance 15(2), pp.199-230 Woessmann, L. (2009): International evidence on school tracking: a review, Munich: CESifo DICE Report 1/2009:26-34. 18 Appendix Table A1: Tracking age and share of grouped students by country Tracking age Share of non-gouped pupils 1 Austria 10 58.06 2 Belgium 12 53.2 3 Bulgaria 45.81 4 Czech Republic 11 28.24 5 Denmark 16 53.08 6 Estonia 15 41.27 7 Finland 16 39.78 8 Germany 10 49.12 9 Greece 15 86.08 10 Hungary 11 32.05 11 Ireland 15 3.27 12 Italy 14 43.71 13 Latvia 52.99 14 Lithuania 23.1 15 Luxembourg 13 33.13 16 Netherlands 12 22.04 17 Poland 16 52.1 18 Portugal 15 66.4 19 Romania 27.05 20 Slovakia 11 25.19 21 Slovenia 14 47.09 22 Spain 16 45.76 23 Sweden 16 25.9 24 UK 16 1.31 25 Norway 16 26.54 26 Switzerland 12 16.72 27 United States 16 10.19 Note: The dataset does not provide data for ability grouping in France; the tracking information are taken from: OECD (2012) 19 Table A2: Regression Estimates for Austria, Belgium, Finland and Hungary dependent: Math Score ESCS Quantile Regression I q20 q50 q80 30.25*** 33.15*** 33.50*** (1.490) (1.597) (1.383) m_ESCS ESCS* ABGROUP ABGROUP -54.28*** (2.535) Female -55.12*** (2.414) -43.81*** (3.380) 449.3*** 520.7*** (1.874) (1.499) Observations 6174 6174 R-squared Standard errors in parentheses 585.4*** (1.922) 6174 Mother at home Father at home Foreign born Foreign born father language at home Constant dependent: Math Score ESCS Quantile Regression I q20 q50 q80 55.83*** 49.22*** 40.50*** (2.065) (1.409) (1.954) 2.347 (3.383) 9.146*** (2.351) 15.17*** (2.600) Female Mother at home Father at home Foreign born Foreign born father Language at home Constant 432.2*** 513.0*** (1.803) (1.985) Observations 8258 8258 R-squared Standard errors in parentheses -37.92*** (4.438) -28.09*** (2.479) -32.03*** (8.099) -1.631 (3.441) -0.755 (5.840) -22.05*** (6.979) -15.51** (6.699) 503.3*** (10.65) 5460 -32.61*** (2.762) -31.79*** (2.189) -17.91* (9.915) -5.592 (3.604) 1.311 (5.745) -28.30*** (5.599) -10.29* (6.023) 556.0*** (9.534) 5460 -27.66*** (3.809) -31.25*** (3.111) -24.79*** (9.362) -0.447 (5.040) 4.625 (5.722) -22.26*** (6.249) -8.283 (5.797) 614.1*** (12.17) 5460 Quantile Regression III OLS q20 q50 q80 PV1MATH 7.630*** 1.067 -0.932 2.407 (2.072) (2.078) (2.124) (1.768) 79.27*** 96.28*** 94.19*** 89.80*** (3.823) (3.693) (3.922) (3.025) 1.567 11.96*** 15.26*** 8.913*** (3.293) (2.783) (1.840) (2.173) -38.57*** -33.96*** -30.98*** -30.99*** (2.966) (2.456) (2.419) (2.253) -28.24*** -32.88*** -31.96*** -32.10*** (2.277) (2.296) (2.908) (2.049) -31.67*** -20.45* -25.61*** -23.91*** (7.539) (12.13) (8.152) (7.787) -1.185 -6.901** -1.292 -3.431 (4.311) (3.359) (4.162) (3.005) -0.967 0.0767 4.201 1.356 (5.258) (5.501) (8.179) (5.232) -21.32*** -26.96*** -19.27*** -22.43*** (5.136) (4.700) (5.733) (4.037) -15.37** -9.902** -11.77** -13.26*** (6.107) (4.995) (5.659) (4.762) 502.4*** 561.8*** 618.6*** 557.6*** (8.579) (12.54) (10.94) (8.637) 5460 5460 5460 5460 0.321 *** p<0.01, ** p<0.05, * p<0.1 m_ESCS ESCS* ABGROUP ABGROUP Regression Estimates Austria Quantile Regression II q20 q50 q80 8.005*** 5.527** 6.722*** (2.114) (2.733) (2.588) 79.40*** 96.67*** 97.89*** (4.844) (4.552) (5.669) 587.3*** (1.754) 8258 Regression Estimates Belgium Quantile Regression II q20 q50 q80 15.10*** 13.32*** 12.16*** (1.966) (1.324) (1.612) 105.9*** 94.13*** 81.12*** (3.499) (3.468) (3.905) 22.54*** (2.995) -27.91*** (2.669) -34.16*** (9.043) -2.908 (5.091) -15.05** (6.321) -24.81*** (3.745) 2.210 (3.204) 486.7*** (13.00) 7102 20.68*** (2.299) -25.01*** (2.374) -28.59*** (6.738) -2.460 (3.378) -11.13** (4.368) -27.22*** (2.968) -1.014 (2.736) 549.7*** (9.651) 7102 *** p<0.01, ** p<0.05, * p<0.1 20 15.49*** (2.585) -28.89*** (2.544) -20.70* (11.93) -0.266 (3.802) -8.537* (4.484) -29.50*** (3.114) 0.898 (2.877) 609.8*** (14.97) 7102 Quantile Regression III OLS q20 q50 q80 PV1MATH 9.863*** 9.828*** 9.333*** 9.873*** (1.838) (2.109) (2.001) (1.475) 107.0*** 95.16*** 81.50*** 93.97*** (3.646) (3.047) (3.460) (2.229) 9.333*** 6.167*** 5.215*** 6.148*** (2.218) (2.077) (1.938) (1.348) 19.53*** 18.08*** 12.71*** 15.41*** (2.529) (2.628) (2.180) (1.897) -27.22*** -25.74*** -29.02*** -27.20*** (2.791) (1.976) (3.280) (1.808) -36.78*** -28.39*** -24.04*** -27.69*** (5.062) (5.782) (7.915) (5.889) -0.986 -3.642 0.923 -1.810 (4.091) (3.145) (3.761) (2.826) -15.33*** -13.05*** -11.84** -10.52*** (4.584) (3.109) (4.731) (3.502) -25.94*** -25.66*** -27.84*** -28.12*** (2.692) (2.224) (3.315) (2.645) 2.617 0.935 2.792 1.449 (2.301) (2.161) (2.904) (2.248) 487.9*** 551.0*** 612.6*** 548.9*** (8.617) (7.577) (10.56) (6.820) 7102 7102 7102 7102 0.372 dependent: Math Score ESCS Quantile Regression I q20 q50 q80 26.92*** 27.48*** 27.73*** (2.439) (2.244) (2.087) m_ESCS ESCS* ABGROUP ABGROUP 0.535 (2.132) -1.590 (1.972) -4.420* (2.517) Female Mother at home Father at home Foreign born Foreign born father Language at home Constant 460.4*** 530.5*** (2.177) (2.417) Observations 5774 5774 R-squared Standard errors in parentheses dependent: Math Score ESCS 12.35*** (2.036) 11.23*** (2.889) Female student Mother at home Father at home Foreign born Foreign born father Language at home Constant 431.7*** 497.3*** (1.802) (2.162) Observations 4582 4582 R-squared Standard errors in parentheses -1.321 (2.898) 5.036 (3.496) -10.13 (12.08) -13.42*** (4.506) -16.97 (16.42) -24.01*** (8.908) -18.92*** (7.055) 488.1*** (13.82) 5452 -0.504 (2.193) -4.201** (2.089) -11.29** (5.515) -14.81*** (4.963) -12.55 (9.467) -22.09*** (7.327) -16.89*** (5.477) 563.9*** (7.506) 5452 -2.229 (2.311) -9.891*** (2.924) -13.10** (6.578) -9.095*** (3.062) 2.132 (13.90) -13.48 (8.214) -18.81** (7.691) 623.4*** (8.913) 5452 Quantile Regression III OLS q20 q50 q80 PV1MATH 23.15*** 23.99*** 27.04*** 23.14*** (3.765) (2.747) (2.717) (2.220) 11.00 4.929** 10.78** 8.751** (7.104) (1.964) (5.344) (3.810) -1.110 1.442 -2.977 0.872 (4.284) (3.009) (2.754) (2.569) -0.880 -1.339 -1.487 -1.891 (4.347) (4.254) (3.280) (2.445) 5.540* -4.145 -9.211*** -4.645** (3.005) (2.750) (2.611) (2.109) -10.04 -10.12 -13.62* -10.44* (7.692) (6.791) (7.037) (5.489) -13.02*** -14.23*** -7.949*** -11.79*** (3.829) (3.532) (2.732) (3.062) -16.47 -11.08 -1.708 -13.28* (12.57) (13.44) (15.35) (7.360) -25.15*** -22.07*** -12.66 -15.55*** (7.866) (6.591) (8.561) (5.450) -19.31*** -16.69*** -18.32*** -18.84*** (5.708) (4.103) (5.290) (4.454) 486.8*** 562.4*** 622.1*** 556.7*** (10.75) (8.345) (9.341) (7.284) 5452 5452 5452 5452 0.076 *** p<0.01, ** p<0.05, * p<0.1 Quantile Regression I q20 q50 q80 46.16*** 48.32*** 45.52*** (1.314) (1.545) (2.252) m_ESCS ESCS* ABGROUP ABGROUP 596.4*** (2.579) 5774 Regression Estimates Finland Quantile Regression II q20 q50 q80 22.39*** 24.43*** 24.34*** (2.098) (2.055) (1.650) 10.57 5.168 11.32*** (6.535) (3.385) (3.702) Regression Estimates Hungary Quantile Regression II q20 q50 q80 5.015** 9.024*** 11.46*** (2.204) (1.083) (1.672) 90.89*** 88.43*** 88.11*** (3.767) (1.832) (2.732) Quantile Regression III OLS q20 q50 q80 PV1MATH 7.087* 12.43*** 15.95*** 12.16*** (3.907) (2.626) (2.768) (1.987) 89.50*** 88.15*** 87.42*** 87.98*** (2.901) (3.068) (3.566) (2.121) -2.198 -4.919** -7.028*** -4.400** (2.458) (2.197) (2.678) (2.001) 3.658 -3.148 -2.184 -1.452 -4.615* -3.949 -2.365 -4.111* (3.895) (3.267) (2.003) (2.310) (2.649) (3.291) (2.404) (2.148) -18.70*** -21.34*** -24.97*** -18.42*** -21.90*** -25.98*** -22.07*** (3.587) (2.945) (2.336) (3.375) (2.222) (2.940) (1.944) -24.03*** -17.07*** -18.04** -24.10*** -17.30*** -14.96** -16.23*** (7.187) (4.666) (9.133) (7.038) (5.742) (7.343) (5.050) -3.492 -4.093** 0.903 -4.169 -3.577 1.840 -1.661 (3.942) (2.074) (3.223) (3.901) (2.959) (3.322) (2.523) 5.023 -6.454 0.196 5.774 -6.440 2.107 -1.578 (15.18) (7.334) (14.24) (6.930) (6.755) (12.04) (8.018) -4.903 -4.404 -10.50 -4.903 -4.012 -8.880 -2.460 (10.35) (4.219) (10.82) (7.687) (4.716) (7.229) (6.006) -22.44** -52.76** -33.12** -22.23 -51.74** -27.38 -25.98** (10.99) (22.36) (15.77) (13.51) (23.43) (23.46) (11.51) 564.5*** 500.1*** 548.9*** 598.3*** 501.6*** 549.8*** 595.5*** 545.9*** (3.023) (8.564) (6.090) (10.92) (8.660) (5.310) (6.105) (6.053) 4582 4341 4341 4341 4341 4341 4341 4341 0.476 *** p<0.01, ** p<0.05, * p<0.1 Note: Quantile Regression I ist the simplest specification, II adds the controls, III the interaction term. 21 [Figure A1 here] [Figure A2 here] [Figure A3 here] [Figure A4 here] 22 Apppendix Figure 1: Mean standardized QQ-plots of PISA maths scores by social background and ability grouping in Finland Figure 2: Mean standardized QQ-plots of PISA maths scores by social background and ability grouping in Austria 23 Figure 3: Mean standardized QQ-plots of PISA maths scores by social background and ability grouping in Belgium Figure 4: Mean standardized QQ-plots of PISA maths scores by social background and ability grouping in Hungary 24 Figure A1: Mean standardized Kernel density estimates for PISA maths scores by socioeconomic status and ability grouping in Finland Figure A2: Mean standardized Kernel density estimates for PISA maths scores by socioeconomic status and ability grouping in Austria 25 Figure A3: Mean standardized Kernel density estimates for PISA maths scores by socioeconomic status and ability grouping in Belgium Figure A4: Mean standardized Kernel density estimates for PISA maths scores by socioeconomic status and ability grouping in Hungary 26 Figure A5: QQ-plots of PISA maths scores by social background and ability grouping in Finland Figure A6: QQ-plots of PISA maths scores by social background and ability grouping in Austria 27 Figure A7: QQ-plots of PISA maths scores by social background and ability grouping in Belgium Figure A8: QQ-plots of PISA maths scores by social background and ability grouping in Hungary 28 Figure A9: Plotted estimates from Quantile and OLS Regressions for Austria, Belgium, Finland and Hungary 29 Figure A10: Mean standardized Kernel density estimates for PISA maths scores by socioeconomic status and ability grouping in 27 OECD countries. 30 31
© Copyright 2026 Paperzz