The Pennsylvania State University The Graduate School College of Education TEACHER AUTONOMY AND RESPONSIBILITY VARIATION AND THEIR ASSOCIATION WITH STUDENT SCIENCE PERFORMANCE IN DIDAKTIK AND CURRICULUM TRADITIONS IN PISA 2009 A Dissertation in Educational Theory and Policy by Armend Tahirsylaj © 2015 Armend Tahirsylaj Submitted in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy May 2015 ii The dissertation of Armend Tahirsylaj was reviewed and approved* by the following: David Baker Professor of Education and Sociology Dissertation Co-Adviser Co-Chair of Committee Richard Duschl Kenneth B. Waterbury Chaired Professor in Secondary Education Dissertation Co-Adviser Co-Chair of Committee Liang Zhang Associate Professor of Education and Labor Studies Soo-yong Byun Assistant Professor of Education Gerald LeTendre Professor of Education Department Head, Education Policy Studies *Signatures are on file in the Graduate School iii ABSTRACT This study is a cross-national analysis of teacher autonomy and responsibility in 12 countries representing curriculum (Australia, Canada, United Kingdom, Ireland, New Zealand and United States) and didaktik (Austria, Germany, Denmark, Finland, Norway and Sweden) traditions. It addresses two main research questions. First, what is the variation of teacher autonomy and responsibility across schools in curriculum and didaktik countries? And second, what is the association of teacher autonomy and responsibility measures with students’ science performance? Nationally representative data from 2009 Programme for International Student Assessment (PISA) are used to address the two questions. Overall, teachers in didaktik countries enjoy more autonomy than teachers in curriculum countries as measured through three autonomy-specific variables pertaining to the use of standardized tests, teacher-developed tests and teacher judgments. On the other hand, more teachers, on average, enjoy responsibility in curriculum than in didaktik countries as measured through the proportion of teachers who were responsible for assessment policies, textbooks to be used, course content and courses offered at school level. However, the differences were statistically significant but substantively weak. Further, within-country associations of autonomy and responsibility measures with students’ science performance were found in a few countries. Most notably, the findings suggest that the use of standardized tests is negatively associated with science performance in some of the within-country models, while the use of teacher-developed tests was positively associated with science performance in some sample countries in PISA 2009. Responsibility measures had less predictive power of science performance, and relationships existed only in few countries as per the analytical models and covariates accounted for. Cross-sectional nature of the data limits the extent of causal arguments to be made. However, the findings suggest that global push towards iv more standardized testing at national level is harming student science performance in some of the nations in curriculum and didaktik countries, while teacher-developed tests tend to be positively associated with science performance. Therefore, policy-makers might reconsider the support for standardized testing, which limits teacher autonomy, in favor of more teacher-driven assessment practices that increase autonomy. Further research is recommended to address teacher responsibility and complexities that accompany it in current stakeholder-crowded school contexts. Also, further studies on how teacher autonomy and teacher responsibility are negotiated with school principals and other educational authorities could provide valuable insights to better understand how teacher autonomy and responsibility could be enhanced for the benefit of improving student learning and performance. v Table of Contents LIST OF TABLES.................................................................................................................................... vii LIST OF FIGURES .................................................................................................................................viii ACKNOWLEDGEMENTS....................................................................................................................... ix Chapter 1. INTRODUCTION ................................................................................................................ 1 Statement of Purpose ................................................................................................................................... 5 Chapter 2. LITERATURE REVIEW, CONCEPTUAL FRAMEWORK AND RESEARCH HYPOTHESES........................................................................................................................................... 9 Didaktik and Curriculum – Past to Present ................................................................................................... 9 Emergence of didaktik tradition.............................................................................................................. 11 Perspectives on curriculum tradition and how the field developed .......................................................... 20 Humanists/scholar academic .................................................................................................................. 23 Child study/learner centered group ........................................................................................................ 24 Social meliorists/social reconstruction group ......................................................................................... 26 Social Efficiency ....................................................................................................................................... 28 Internal influences on curriculum making............................................................................................... 30 Comparison of didaktik and curriculum orientations ................................................................................. 37 Overlap of didaktik and curriculum frameworks with sociology of education theories ............................ 45 Functional Theory.................................................................................................................................... 46 Conflict Theory ........................................................................................................................................ 49 Neo-institutional Theory ......................................................................................................................... 51 Cross-national Student Achievement and Teacher Characteristics Studies ............................................... 57 Teacher Autonomy and Teacher Responsibility ......................................................................................... 61 Conceptual Framework ............................................................................................................................... 63 Research Questions and Hypotheses.......................................................................................................... 65 Chapter 3. KEY COMPONENTS OF EDUCATION SYSTEMS IN SAMPLE COUNTRIES ....... 67 K-12 Education Structure ............................................................................................................................ 68 Teacher Education and Certification and Teacher Professional Development .......................................... 69 Teacher education and certification ....................................................................................................... 70 Teacher professional development ......................................................................................................... 71 Assessment Systems ................................................................................................................................... 72 Funding Mechanisms .................................................................................................................................. 74 Chapter 4. DATA AND METHODS ................................................................................................... 75 Data ............................................................................................................................................................. 75 Sampling and Sample .................................................................................................................................. 76 Rationale for selection of countries for the study ................................................................................... 77 Measures..................................................................................................................................................... 81 Descriptive Statistics ................................................................................................................................... 85 vi Methods ...................................................................................................................................................... 89 Chapter 5. RESULTS AND ANALYSES ............................................................................................. 93 Means of Teacher Autonomy Measures Across Sample Countries ............................................................ 93 Proportions of Teacher Responsibility Measures Across Sample Countries .............................................. 99 Association of Teacher Autonomy and Responsibility with Student Science Performance in PISA 2009 105 Chapter 6. CONCLUSIONS, DICUSSION, AND IMPLICATIONS ............................................. 114 Teacher Autonomy and Responsibility in Schools in Didaktik and Curriculum Countries ........................ 114 Association of Teacher Autonomy and Responsibility Measures with Student Science Performance .... 121 Implications ............................................................................................................................................... 123 Educational theory ................................................................................................................................ 124 Educational policy and practice ............................................................................................................ 126 Educational research............................................................................................................................. 130 Delimitations and Limitations ................................................................................................................... 135 Conclusions ............................................................................................................................................... 135 References .......................................................................................................................................... 140 Appendices ......................................................................................................................................... 169 Appendix A: Key Components of Education Systems in Didaktik Countries.......................................... 169 Appendix B: Key Components of Education Systems in Didaktik Countries .......................................... 175 Appendix C: Science Proficiency in Participating Countries in PISA 2009............................................. 181 Appendix D: Description for levels of proficiency on science scale ......................................................... 182 Appendix E: Percent of missing data in study variables ........................................................................... 183 Appendix F: Correlation tables for curriculum countries ......................................................................... 184 Appendix G: Correlation tables for didaktik countries ............................................................................. 190 Appendix H: Descriptive statistics for teacher autonomy items ............................................................... 196 Appendix I: Descriptive statistics for teacher autonomy scale by country............................................... 197 Appendix J: Descriptive statistics for teacher responsibility items .......................................................... 198 Appendix K: Descriptive statistics for teacher responsibility scale by country........................................ 200 Appendix L: Maximum likelihood estimates for science performance in curriculum countries in PISA 2009 .......................................................................................................................................................... 201 Appendix M: Maximum likelihood estimates for science performance in didaktik countries in PISA 2009 .................................................................................................................................................................. 207 Appendix N: Innovation in education, 2000-2011 .................................................................................... 213 vii LIST OF TABLES Table 1 Classification schemes of curriculum tradition .............................................................................. 21 Table 2 Comparison of Klafki’s and Tyler’s questions................................................................................. 38 Table 3 Logic Model of Didaktik and Curriculum traditions ....................................................................... 43 Table 4 PISA science processes and scientific competencies 2000-present............................................... 59 Table 5 School and student sample size for didaktik and curriculum countries ........................................ 76 Table 6 Didaktik and curriculum countries’ PISA 2009 performance in reading, mathematics and science ............................................................................................................................................................ 79 Table 7 Main study measures and their original scales .............................................................................. 81 Table 8 Descriptive statistics of main student-level variables .................................................................... 87 Table 9 Maximum likelihood estimates for science performance in curriculum countries ...................... 107 Table 10 Maximum likelihood estimates for science performance in didaktik countries......................... 111 Table 11 Maximum likelihood estimates for student science performance in Canada ............................. 202 Table 12 Maximum likelihood estimates for student science performance in United Kingdom .............. 203 Table 13 Maximum likelihood estimates for student science performance in Ireland ............................. 204 Table 14 Maximum likelihood estimates for student science performance in New Zealand ................... 205 Table 15 Maximum likelihood estimates for student science performance in United States ................... 206 Table 16 Maximum likelihood estimates for student science performance in Austria ............................. 207 Table 17 Maximum likelihood estimates for student science performance in Germany .......................... 208 Table 18 Maximum likelihood estimates for student science performance in Finland............................. 210 Table 19 Maximum likelihood estimates for student science performance in Norway ............................ 211 Table 20 Maximum likelihood estimates for student science performance in Sweden ............................ 212 viii LIST OF FIGURES Figure 1 Continuum of three sociological and two educational theories................................................... 55 Figure 2 A diagram of logic rationale .......................................................................................................... 63 Figure 3 Geographic distribution of countries in the sample ..................................................................... 78 Figure 4 Means of use of standardized tests by country ............................................................................. 94 Figure 5 Means of use of teacher-developed tests by country .................................................................... 95 Figure 6 Means of use of teacher judgments by country ............................................................................ 96 Figure 7 Means of teacher autonomy scale by country............................................................................... 98 Figure 8 Proportion of teachers deciding about assessment policies by country ........................................ 99 Figure 9 Proportion of teachers deciding about textbooks to be used by country .................................... 100 Figure 10 Proportion of teachers deciding about course content by country ............................................ 101 Figure 11 Proportion of teachers deciding about courses offered by country .......................................... 102 Figure 12 Means of teacher responsibility scale by country ..................................................................... 104 Figure 13 Revised continuum of three sociological and two educational theories .................................. 126 ix ACKNOWLEDGEMENTS I would like to acknowledge and thank Waterbury Chair Professor and my dissertation advisor Richard Duschl both for his academic advice and financial support through Waterbury Fund for my studies at Penn State. Dr. Duschl’s guidance allowed me to explore and develop ideas presented in this dissertation, and his valuable advice extends beyond the context of research and will stay with me for the rest of my life. I could not have asked for a more supportive adviser. I would also like to thank committee co-chair David Baker, and committee members Liang Zhang and Soo-yong Byun for making their expertise and advice available to guide my work. Next, I would like to acknowledge and thank a long list of professors, students and staff members at Educational Theory & Policy program and entire College of Education for providing a stimulating academic environment that made it possible to expand my knowledge and skills and further my education. I have benefited tremendously from this unique intellectually enriching prestigious institution. I also want to acknowledge my past advisors, professors and colleagues at the University of Prishtina, Kosovo, University of Calgary, Canada, University of Vienna, Austria, and University of Gothenburg, Sweden for their contribution and support during my academic journey. In addition, I would like to thank another long list of colleagues and friends I have met on and off-campus during past five years, first as part of the Humphrey Fellowship Program during 2010-2011, and then during the PhD program since 2011. In particular, Jack Matson was incredibly helpful in every single step along the way. I am thankful I have had the chance to x know him and learn from him and with him during these past few years, and his wisdom will continue to make a difference wherever the journey takes me next! Also, I want to thank family, friends and colleagues in Kosovo and across Europe, who have been supportive for many years prior to coming to Penn State, and have contributed to bring me closer to this achievement. Finally, I cannot end this list of acknowledgments without mentioning my wife Leonora, my daughter Arsea, my mother Shahe, my brothers Alkend, Alferd, and Argjent, including his wife Naida, son Anik and daughter Alea. Also, I am forever grateful to advice of my late father Selman. He continues to be the source of motivation and inspiration in every step. It is impossible to express how much you all mean to me. Thank you for your unconditional love and support – now and forever! 1 Chapter 1. INTRODUCTION Intensification of global cross-national cooperation in almost any field, not least in education, has been attractive and inspiring for educational theorists and practitioners to look ‘over’ the borders of their nations for exchange and transfer of ideas for educational improvement. But for exchange and transfer to occur, some level of understanding is required to enable the flow of ideas in a two- or multi-way direction. In efforts to reach that level of understanding, European and US scholars initiated an intensive dialogue to discuss and understand similarities and differences between German-based didaktik and US-based curriculum traditions during the 1990s. The spark for that intensified dialogue, after some earlier exchanges between two traditions starting from 1960s, came from the interest of US educational scholars to better understand what was this concept called ‘Didaktik’? (Hopmann & Riquarts, 2000). Of course, the underlying objectives and intentions were much broader and the dialogue tried to address questions, such as: How did the two traditions compare and contrast? Could, and to what extent, one tradition inform the other? Is there something that they can learn from one another? etc. Theoretical exchanges that emerged from this dialogue as well as extended theoretical contributions from many scholars to didaktik and curriculum orientations dominating K-12 education system provide the framework for this study. Brief background to Didaktik and Curriculum First, what is Didaktik theory? German Didaktik theory is central to curriculum, teaching and learning in Continental and Nordic Europe generally and German speaking world specifically, but is mostly unknown in the English-speaking world (Hopmann, 2007; Hudson, 2003; Westbury et al., 2000; Kansanen, 1999). The German term Didaktik is well established in 2 post-1990s English literature, and to avoid the use of more negatively loaded concept of didactics as teacher-centered and controlled instruction (Kansanen, 2002), I use the original spelling didaktik throughout the study. In its original conceptualization, “Didaktik is about how teaching can instigate learning, but learning as a content-based student activity not as swallowing a sermon or a monologue or otherwise one-sided distribution of knowledge by a teacher” (Hopmann, 2007, p. 113). The interpretation of didaktik across Europe led to a variety of modes but Hopmann points out they shared three common aspects, "(a) the concept of Bildung, (b) the embedded differential of matter and meaning, and (c) a concept of the necessary autonomy of teaching” to address “problems of order, sequence, and choice within their respective frames of reference" (Hopmann, 2007, p. 115). Bildung, will be discussed more in depth later, but in classical sense, it encompasses the contents of assisting individuals to achieve their selfdetermination by developing and using their reason without others’ guidance and acquiring the cultural objects of the world in which those individuals are born into and situated in (Klafki, 2000a; Hudson, 2003). Curriculum, on the other hand, is a widely used theory amongst many countries, primarily in the English-speaking world. While there are numerous definitions and constructs on curriculum, varying largely from one period of time to the other, what is referred to as curriculum here, is the prevailing curriculum model that has been in place in the U.S. since early 1900s, when the so-called social efficiency model of curriculum promoted by Franklin Bobbit and largely perpetuated by US industrial forces of the time, won the American education battle against humanistic-based models of curriculum, who were mainly led and supported by John Dewey (Kliebard, 2004). According to Kliebard (2004), ideas outside the realm of education, such as those of Frederick Taylor, an engineer from scientific management, which gave birth to 3 social efficiency as a social ideal and educational doctrine, and ideas of Edward Thorndike from psychology, who introduced educational or “intelligence” measurement into American education, led to social efficiency curriculum models within the U.S. education. Such models have shaped US education curriculum and educational thinking and practice to the present days. Social efficiency models suggested that students would learn in schools only what they needed to know in order to perform as an adult member of social order, that is, “To go beyond what someone had to know in order to perform that role successfully was simply wasteful” (Kliebard, 2004, p. 77). Further, it has been noted that the Tyler rationale has been a recurring theme in the American curriculum tradition serving as the technocratic model that has dominated American schools to this day (Luke, 2004). More recently the criticism against so-called ‘business-driven’ educational policies in the U.S. have mounted to such heights where it is argued that “the common school became first a factory, then a corporation, now a cram school, but always a business” (Pinar, 2012, p. 15). However, with advancement of the learning sciences in the last several decades, curriculum has seen a shift towards cognitive and constructive models of curriculum, but the driving assumptions embedded in social efficiency models about education goals have not been challenged. The study aims to contribute further to the debates between didaktik and curriculum that took place during the 20th century. During 1960s, there was an export of curriculum ideas from the U.S. curriculum to Germany (Gundem & Hopmann, 1998), during 1990s, there was a dialogue for the benefit of better understanding between traditions (Westbury, Hopmann, & Riquarts, 2000a), while more recently there is renewed interest in cross-cultural comparisons and scholarly exchanges between the US curriculum and German didaktik (Duschl, Maeng & Sezen, 2011). Duschl et al. (2011) concluded that didaktik framework provided an alternative to 4 development of learning progressions with its emphasis on order, sequence and choice about curriculum and instructional design decisions. This rekindled interest is an opportunity for both traditions to reflect upon their recent past and current developments and identify areas in which they could be strengthened if they become more open to reflective practices about educational changes and more willing to learn from one another. Indeed, the cross-national exchanges and learning between the German didaktik and Anglo-American curriculum have been in place for more than a century, as the curriculum in the U.S. during 19th century adopted heavily from the didaktik theory of teaching and learning in Germany, and most continental Europe, that was based on idealism of German philosophers such as Immanuel Kant, Georg Wilhelm Friedrich Hegel, Wilhelm Dilthey, and Wilhelm von Humboldt (Reid, 1997). However, similarities and differences between curriculum and didaktik have not been tested empirically until very recently (Tahirsylaj, Brezicha & Ikoma, In press), and the crossnational discussion around these two traditions was mainly at the theoretical level (Hudson, 2003). Using Trends in International Mathematics and Science Study (TIMSS) 2003, 2007 and 2011 data, Tahirsylaj et al. found clear empirical differences among a set of countries representing curriculum and didaktik systems pertaining to duration of professional development, years of teaching, completion of formal education, and to a lesser degree, teacher instructional activities. Still, no studies had been undertaken to specifically examine how educational systems under didaktik and curriculum systems perform in international student assessment programs, nor how individual variables affect student performance. This study aimed to fill that gap in the literature by exploring empirically similarities and differences between Didaktik and curriculum systems in a select group of 12 countries in terms of teacher and school characteristics on one hand, and examining the associations of teacher characteristics 5 with student science performance in The Programme for International Student Assessment (PISA) 2009. The analysis provides insights as to whether differences emphasized by literature as well as by recent research with TIMSS data are backed by empirical data collected through PISA 2009 assessment. Therefore, the study bridges the gap between theoretical and empirical understanding of didaktik and curriculum and how the two inform us in better understanding their respective frameworks, and indeed themselves as two dominant educational orientations worldwide. Statement of Purpose Purposes for this study were manifold. One purpose was to empirically test theoretical claims made about didaktik and curriculum in previous literature. For example, one of the core claims in prior didaktik/curriculum exchanges was that there is more teacher autonomy and professionalism in didaktik systems than in curriculum systems (Westbury, 2000; Hopmann, 2007). This study explored this argument in depth to clarify whether the claim was supported by empirical data. Further, the purpose was to understand and further extend cross-national and cross-cultural educational dialogues and exchanges between didaktik and curriculum theorists, educational practitioners, researchers, and policy makers. A more intensive and extensive educational exchange between nations’ scholars could potentially lead to 1) a deeper and better understanding of respective pedagogical traditions, and 2) better informed decision-makers and policy makers to help avoid challenges in policy development and program implementation. The ongoing international student assessments provide rich student performance and student and school background data that make possible the dialogue between and among countries all around the world pertaining to relevant and complex educational policy issues in K- 6 12 education systems. They also offer opportunities for researchers to examine different questions of interest that could have not been possible to be addressed otherwise due to limitations in data collection capacities. To that end, another purpose of the study was to track whether and how the didaktik and curriculum traditions in two sets of selected countries represented align within own group, and how cohesive the didaktik-curriculum division is. In addition, the purpose was to investigate how key measures based on data collected by PISA in 2009 are associated with student science achievement data for two sets of countries. Ultimately, the purpose of this cross-national study was not to simply compare and contrast countries but to discern the similarities and differences in national educational practices so that individual countries get to a deeper understanding of their own national educational orientations and practices. It has been argued that international comparative studies provide for a rich set of understandings for national scholars, policy-makers, and practitioners to help, […] define what is achievable […] observe and characterize consequences of different practices and policies for different groups under different circumstances […] bring to light concepts for understanding education that have been overlooked […] identify and question beliefs and assumptions that are taken for granted. (National Research Council [NRC], 2003, pp. 8-9, emphasis in the original). To a considerable extent, this study is at the intersection of all these goals of international comparative studies as put forth by NRC as it aimed to, first, theoretically explore similarities and differences between didaktik and curriculum orientations and key concepts and constructs that define respective fields, and, second, to empirically examine how different teacher and school practices are associated with student achievement under different national contexts. 7 Furthermore, reviewing the types of international comparative studies in education from the US perspective, NRC (2003) distinguished among Type I, Type II and Type III international studies as per their primary purposes. NRC (2003) defined Type I international comparative studies, primarily large-scale survey assessments, as ones that aim to compare educational outcomes cross-nationally and listed TIMSS and PISA as examples. Type II studies were designed to research specific educational policies and their implementation with the aims to inform educational policy in the US. These studies were based on mixed-methods approaches, including quantitative, qualitative, descriptive and interpretative studies and were primarily conducted at a lower scale, and studying high school tracking in a couple of countries was listed as an example. Lastly, NRC (2003) defined Type III studies as ones that aim to understand education more broadly, as well as to increase general understanding about education systems and processes. These studies could be either large- or small-scale and also mixed-methods oriented as Type II studies, and a study of culture and pedagogy in a set of five countries was listed as an example. Here too, the present study sat at the intersection of these three NRCdefined broad types of international comparative studies as it entailed getting a better understanding of educational issues, such as teacher autonomy and responsibility, in a set of countries representing didaktik and curriculum educational systems (Type III), utilizing data from PISA 2009 (itself a Type I study), to reach a better cross-national understanding and informing national policy makers in each national contexts of implications the studied constructs might have for their school settings (Type II). To this end, the present study aimed to make a small contribution to the ongoing and growing worldwide research efforts in the field of international comparative education. 8 Next, in Chapter 2, I elaborate on literature review, conceptual framework and research hypothesis, where I present an extended discussion on historical development of didaktik and curriculum as two educational traditions. In Chapter 3, I provide an analysis of education systems in 12 countries included with an emphasis on education structure, teacher education and certification, teacher professional development, assessment systems and funding mechanisms. In Chapter 4, I present data, sample, variables and methods employed to conduct the study. In Chapter 5, main results and findings are presented, and Chapter 6 concludes with discussion of the findings, conclusions reached and recommendations for further research. 9 Chapter 2. LITERATURE REVIEW, CONCEPTUAL FRAMEWORK AND RESEARCH HYPOTHESES This study drew heavily from the literature on didaktik and curriculum traditions. These two educational orientations constituted the core theoretical frameworks that provided the thinking for the entire design of the study. As mentioned, a more recent dialogue between the two educational traditions was initiated during 1990s and has been on and off ever since. However, exchanges between the two traditions have existed ever since the public education systems were introduced in Europe in 18th century - first in Prussia (part of present Germany) and then France and England. In efforts to bring the dialogue among scholars over the two traditions forefront, a review of the educational thought from the ancient to modern times as well as that of the didaktik-curriculum dialogue is presented here, with additional arguments that portray both didaktik and curriculum as two educational orientations in their own right. Didaktik and Curriculum – Past to Present The current didaktik and curriculum orientations are grounded in their historicity. What is known today about them is an extension of the educational thought that has been passed from generation to generation in the written historical accounts. To view both didaktik and curriculum traditions as ahistorical is to run the risk of inappropriately crediting scholars for later developments in these scholarly domains. To that end, crediting 20th century educational philosophers and scholars for developing modern educational theories without considering the educational thought that preceded them prior to 20th century is a disservice to the human endeavors that contributed to the betterment of world societies through education for many centuries. 10 A quick review of the history of education shows that questions of how to best help the young grow and mature were perennial. Considering the far antiquity and the early part of Egypt’s six-thousand-year history, we find that education was primarily vocational in the form of apprenticeship where sons were taught by their fathers. In ancient China, there were two dominant educational orientations, attributed to Lao-Tse (6th century B.C.) and Confucius (5th century B.C.) respectively (Schubert, 1986). Lao-Tse emphasized the role of education for spiritual enrichment, which was to be achieved through contemplation. Confucius emphasized the role of education for preserving society and institutions and put less emphasis on the individual (Schubert, 1986). In ancient Greece, leisure time was emphasized as an opportunity for individuals to develop educationally. The omnipresent word school comes from Greek scole meaning leisure (Adler, 1951). However, leisure did not mean spare time but “an opportunity to contemplate the significance, worth, meaning, and aesthetic value of ones life and contributions” (Schubert, 1986, p. 56). The three greatest ancient Greek contributors to the educational thought were Socrates, Plato and Aristotle who lived roughly between 5th and 3rd century B.C. Their contributions, especially of Plato in his Republic, and Aristotle in The Politics constitute the roots of a liberal education – their curriculum ideas included education of both boys and girls from age six to eighteen in a number of disciplines such as music, mathematics and gymnastics and the goals should be broad and not directed to vocations (Schubert, 1986). However, these three domains were broader, for example, music included literature and history, while gymnastics included the study of dance, rhythm, athletics and military arts. In ancient Rome, education was influenced by Greek ideas, and the curriculum consisted of philosophy, literature and rhetoric, but “as this was assimilated into the Roman life-style, it became more Latinoriented, eventually creating the Latin grammar school, which was the model for the Western 11 education” (Schubert, 1986, p. 58). During the early years of Christianity and up until the renaissance period, the curriculum was dominated by teachings of Jesus, and value was placed on knowledge since knowledge was instrumental in propagation of Christianity. During the renaissance period, toward the end of the 15th century, Western scholars returned to ideas developed in ancient Greece and Rome, which for education meant reviving Socratic questioning and humanistic pedagogy, and emphasizing liberal arts education around dialectic, rhetoric, grammar, astronomy, arithmetic, geometry and music as well as Greek and Latin languages, history and fine arts (Schubert, 1986). Emergence of didaktik tradition It was during the renaissance and enlightenment periods that the roots of modern didaktik were established. John Amos Comenius (1592-1670) with his Didactica Magna/The Great Didactic is considered to be the first truly European educator and the father of modern education (Hopmann, 2007; Schubert, 1986). Comenius emphasized the understanding of how students meet the knowledge of disciplines on their own beings more than what knowledge is valid within the structure, and he argued that there should be a sequence of knowledge from the microcosmos of students to the macro-cosmos of the world (Hopmann, 2007). During the enlightenment period, a number of key philosophers advanced their theories about educational thought, including French Jean Jacque Rousseau, Swiss Johann Heinrich Pestalozzi, and Germans Immanuel Kant, Georg Wilhelm Hegel, Johann Friedrich Herbart, Friedrich Froebel and Wilhelm von Humboldt. Rousseau laid the foundation for the child-centered education with his emphasis on education of children apart from adults and as much as possible in their natural settings in order to avoid the corrupted states and adult society. Pestalozzi’s work built on that of Rousseau by promoting a balanced development of children’s head, heart and hand, and 12 implementing his ideas into practice in his own experimental school in Switzerland. Kant called for the curriculum to be adapted to students’ needs and abilities, while Hegel was among the first to promote lifelong learning with his emphasis on continued education to increase the sophistication of reason. He maintained that it was state’s responsibility to provide public education. Herbart is known as the father of science of education and of modern psychology and he continued to influence educators into the twentieth century. He put forth that the goal of education should be development of cultured human beings who are guided by highest ethical values. Further, he argued from a psychological base that children’s development follows the evolution of human culture – from primitive to civilized – and noted that it was teachers’ responsibility to assist students in their path towards greater civilization. Froebel, in turn, is known for introducing kindergarten to the world of education in 1837, where the curriculum was truly child-centered, which paved the way for progressive education in the next century (Schubert, 1986). Didaktik scholars view Humboldt as the first educationist that initiated the development of the theory of Bildung or cultivation of humankind as the core of didaktik tradition (Gundem & Hopmann, 1998; Hopmann & Riquarts, 1995; Lüth, 1998). In von Humboldt’s own words, It is the ultimate task of our existence to achieve as much substance as possible for the concept of humanity in our person, both during the span of our life and beyond it, through the traces we leave by means of our vital activity. This can be fulfilled only by the linking of the self to the world to achieve the most general, most animated, and most unrestrained interplay. (Humboldt, 1793/2000, p. 58) Humboldt considered that individuals needed to be cultivated and elevated into higher beings so that they are able to transform themselves, as well as the world around them so that 13 they become significant to the world with the traces they leave behind. According to him, to fully achieve the goals of Bildung meant grasping a completely fresh view of the world as well as selfdetermination. Bildung, as explicated by Humboldt, meant ‘grasping as much world as possible’ and as ‘contributing to human kind’ by development of one’s own unique self. In the process of acquiring Bildung, he argued, two sources of resistance try to interrupt it – the changes that any intellectual activity acquires as it proceeds and the nation where one is situated and the occupation that one is involved within that context. To him, the geniuses, the ones who manage to overcome these two interruptions are viewed as disturbances in the society but are the ones who move their nations to new positive directions (Humboldt, 1793/2000). He also emphasized that one can understand the mind only through deep reflection and continued observation, which are cornerstones of what later become didaktik theory. However, Humboldt noted that education alone is not sufficient and discerned social relations and ones’ circumstance as other sources of enrichment of individuality, i.e. acquiring inner Bildung (Lüth, 1998). Still, Humboldt’s definition of Bildung was but one of the many developed during the classic theories of Bildung (Bildungstheorie). Klafki (2000a), referring to Immanuel Kant, describes Bildung as capacity for reasonable self-determination, […] which presupposes and includes emancipation from determination by others. It is a qualification for autonomy, for freedom, for individual thought, and for individual moral decisions. Precisely because of this, creative self-activity is the central form in which the process of Bildung is carried out. (Klafki, 2000a, p. 87) According to Klafki, this definition of Bildung derives directly from the ideal of enlightenment. Quoting Kant, Klafki (2000a) argues that the enlightenment itself meant man’s departure from an immaturity, while immaturity in turn is the incapacity to make use of one’s 14 own reason without guidance of another. Bildung was not limited only to this sense of subjectivism, Klafki argues, noting that a second group of core concepts attached to Bildung included “humanity, humankind and humanness, world, objectivity, the general” (Klafki, 2000a, p. 88, emphasis in original). Thus, the process of self-determination is to be achieved within ones’ national, cultural and moral context, i.e. as part of the humankind or the world that one is situated in. Klafki (2000a) emphasizes that according to classical theories of Bildung, the processes and outcomes of Bildung are not limited to any specific group in society or any specific class or intellectual elite – Bildung is Allgemeinbildung or Bildung for all. This conception of Bildung for all persists to present days where Bildung applies to both general and vocational education programs (Klafki, 2000a). However, it is the students themselves that have to develop their reasoning and transform their views of the world, while teachers can only assist them in their pursuits of acquiring Bildung (Hudson & Meyer, 2011). German concept Bildung is a noun meaning something like “being educated, educatedness.” It also carries the connotations of the word bilden “to form, to shape”. Other terms used to translate term Bildung include ‘formation’, ‘self-formation’, ‘cultivation’, ‘selfdevelopment’, and ‘cultural process’ (Siljander & Sutinen, 2012). Tracing historical roots of the term Bildung, Schwenk (1996), as referred to by Siljander and Sutinen (2012), distinguished two historical tradition of Bildung reflected into the modern Bildung, including cultura animi of ancient Hellenism and Christian doctrine of Imago Dei (Schwenk, 1996, p. 210), where cultura animi means spiritual cultivation, while Imago Dei literally means God’s image. Linguistically, German term Bildung follows the doctrine of Imago Dei, since the root of the word Bildung is Bild, which means image (Siljander & Sutinen, 2012, p. 3). Based on these two historical traditions, Bildung is defined, first, as a creative process where the person shapes and develops 15 himself or herself as well as his or her cultural environment, and second, “[…] in the processes of Bildung, a person seeks a more advanced form of life” (Siljander & Sutinen, 2012, p. 4). In Bildung, whatever is done or learned is done or learned to develop one’s own individuality, to unfold the capabilities of the I (Hopmann, 2007). The notion of Bildung, as developed in the Didaktik perspective, sees Bildung as an individual outcome, not as a program for education. This is in opposition to testing-driven education systems and international testing programs where the meaning of the subject matter content is fixed, and in most cases only one right solution is sought from everyone. Furthermore, competence-based curricula are problematic as it is not possible to match a certain competence with a certain content matter, and also gaining competence is but one of the many potential meanings which can be achieved by a given content matter (Hopmann, 2007). Bildung is seen as an ideal aspiring to be mastered by students with teacher’s support and something to hold on to and work towards throughout a personal life journey. In other words, it means preparing students for lifelong learning beyond formal education, for the sake of transforming themselves as human beings, and to the extent possible, extend that transformation to what the person does (occupation) and to society at large (context). Furthermore, Bildung is not only about preparing students to participate in society and economy but it is about developing students’ significance in the world (Pinar, 2011). In the broadest sense, didaktik encompasses the dimension of objectives and contents and the dimension of methods (Klafki, 2000b). In Anglo-American terms, these two dimensions could be referred to as Curriculum and Instruction, however the terms are not completely equivalent across two traditions. Didaktik was defined also “as a relation between teachers and learners (the who), subject matter (the what) and instructional methods (the how)” (Klette, 2007, p.147). In more simplistic definition and relying on didaktik triangle, Didaktik is about teachers, 16 students and subject matter, where teachers enable the access of students to the given subject matter (Künzli, 1998). Similarly, Didaktik “provides teachers with ways of considering the essential what, how and why questions around their teaching of their students in their classrooms” (Westbury, 2000, p. 17, emphasis in the original). Hopmann (2007) stated that Didaktik is a matter of order, sequence and choice. Within the frame of order, sequence, and choice, Hopmann noted that “Didaktik became the main tool for creating space for local teaching by providing interpretative tools for dealing with state guidelines on a local basis” (p. 113). Order here deals with the order of knowledge that is to be introduced to students, sequence pertains to how and when the given content is unveiled to students following a progression from simple to more complex knowledge, and choice relates to selection of contents for the school curriculum. The syllabi or Lehrplan developed at the state level in Germany provided the broad curriculum guidelines while teachers at the local level translated those guidelines into lesson plans, a process facilitated by didaktik and enabled by teachers ‘pedagogical freedom’ (Hopmann (2007). According to Hopmann (2007) didaktik is based on Socratic dialectic-based teaching methods where teaching is restrained, allowing the opportunity for students’ individual growth. Further, Hopmann identifies three commonplaces of Didaktik, namely Bildung, matter and meaning, and autonomy. Bildung is the outcome of the encounter of the student with the content facilitated by the teacher. Within didaktik, there is no separation between the matter and meaning as they cannot exist without one another (Hopmann, 2007), and teachers define ‘What knowledge is of most worth?’ as part of their professional autonomy (Pinar, 2011). Künzli (1998) explicates that the primary focus of a didactician is the object of learning, and what that object can and should signify to students and how students experience the significance, while, 17 All other questions and problems – other than the significance of the learning content – such as class management, individual and social learning, learning control, individual learning speed, appropriate representation, etc. – are subordinate to this central concern and gain significances only when the question of educative substance (Bildungsgehalt) is at issue. Educational psychology and instruction research tend to be peripheral phenomena. (p. 40) Lastly, the common place of autonomy means the autonomy of both teachers and learners in going through their encounters over certain subject matter, and it specifically pertains to flexibility in terms of outcome of Bildung, “[…] which is often not visible at all, at least not right away. It depends on what remains after the hurly burly of teaching is done, the battle of minds is lost or won, and the student comes to terms with his or her own world. (Hopmann, 2007, p. 117) This conceptualization of teaching and learning process and outcomes is in sharp contradiction with neo-liberal educational policies that rely on clear ‘objective’ definition of educational standards and measures to evaluate the progress towards achieving those standards. While both rooted on the theories of Bildung, two didaktik models developed during the 20th century, namely the human science-oriented pedagogy during the first part of the 20th century and critical-constructive didaktik from 1960s to the present days (Klafki, 1998; Künzli, 1998). Klafki (1998) attributed three characteristics to the human science pedagogy, including, first, the close relationship between the pedagogical theory and pedagogical practice, second, the relative autonomy of education from external political, social, and cultural influences, and third, conceptualizing and understanding human science pedagogy in a historical context. Klafki credits philosopher Wilhelm Dilthey and educationist Erich Weniger as contributors to and 18 promoters of human science pedagogy. Critical-constructive didaktik, in turn, was advanced by Klafki himself in an article in 1963. He positioned that the concept ‘critical’ pertains to the interest of knowledge “[…] insofar as this approach to Didaktik is oriented towards the goal of guiding all children and adolescents to greater capacity for self-determination, co-determination and solidarity” (Klafki, 1998, p. 311). The concept of ‘constructive’ is used to mean the interplay between the theory and practice and to allow for an ongoing reformed or reforming practice for a humane and democratic schooling. Klafki also argued, in almost sociology of education terms, that education-society connection was a two-way relationship, claiming that education has both the opportunity and responsibility not only to be reactive to societal needs but also to influence social developments for the benefit of entire society. Another characteristic of critical-constructive didaktik deals with elements that Bildung needs to promote, namely self-determination, co-determination, and solidarity, which in Klafki’s words are defined as follows: Self-determination: Each and every member of society is to be enabled to make independent responsible decisions about her or his individual relationships and interpretations of an interpersonal, vocational, ethical or religious nature. Co-determination: Each and every member of society has the right but also the responsibility to contribute together with others to the cultural, economic, social and political development of the community. Solidarity: […] means that the individual right to self-determination and opportunities for co-determination can only be represented and justified if it is associated not only with the recognition of equal rights but also with active help for those whose opportunities for self-determination and co-determination are limited or non-existent due to social 19 conditions, lack of privilege, political restrictions or oppression. (Klafki, 1998, p. 314, emphasis added) Indeed, these three elements of Bildung put forth by Klafki underline the key opportunities as well as challenges that educational processes offer and are faced with to the present days. At the institutional level, didaktik has seen a division into general didaktik (Allgemeine Didaktik) and subject matter didaktik (Fachdidaktik) where the first centers around broader issues of teaching and learning while the second deals with analysis, organization and preparation of subjects of teaching (Künzli, 1998). Further development of subject matter didaktiks as well as the relationship between the general didaktik and subject matter didaktik is considered to be the challenge facing the fields (Seel, 1999). Klafki’s didaktik analysis has been at the core of instructional planning as part of the critical constructive didaktik. It relies on five questions, including, 1. What wider or general sense or reality do these contents exemplify and open up for the learner? What basic phenomenon or fundamental principle, what law, criterion, problem, method, technique or attitude can be grasped by dealing with this content as an ‘example’? 2. What significance does the content in question or the experience, knowledge, ability or skill to be acquired through this topic already possess in the minds of the children in my class? What significance should it have from a pedagogical point of view? 3. What constitutes the topic’s significance for the children’s future? 4. How is the content structured [which has been placed into a specifically pedagogical perspective by questions 1, 2 and 3]? 5. What are the special cases, phenomena, situations, experiments, persons, elements of 20 aesthetic experience, and so forth, in terms of which the structure of the content in question can become interesting, stimulating, approachable, conceivable or vivid for children of the stage of development of this class? (Klafki, 2000b, pp. 151-155) These five questions enable the design of learning opportunities that set up conditions for teachers and students to work together towards meeting the three elements or abilities of Bildung in the form of self-determination, co-determination and solidarity. Despite being heavily situated in the German speaking world in the Continental Europe, Didaktik has been influential to other geographic locations, most prominently in the Nordic Europe, where didaktik orientations has been central to the development of education throughout the 20th century up to present (Kansanen, 1999; Uljens, 1997, 2009; Hudson & Meyer, 2011). However, Hudson & Meyer (2011) had observed that Nordic educational scholars were also open to the Anglo-American curriculum theory as evidenced in a significant number of references to Anglo-American scholars used in their published scholarly work, compared to a lower rate of such references among Continental Europe educators. Still, Kansanen (1999) emphasized that didaktik constituted the central theory for running educational systems and selecting curriculum and instructional methods in Nordic countries. Perspectives on curriculum tradition and how the field developed Curriculum scholars have long been at work to dissect curriculum tradition. The notions vary as to what the tradition can be categorized into – some scholars refer to sub-components of curriculum tradition as ideologies (Schubert, 1986; Schubert, 1996; Schiro, 2013), others refer to them as models (Ellis, 2004), traditions of practice (Zeichner, 1993) and orientations (Eisner & Vallance, 1974) among else. Schiro (2013, p. 8) defined ideology as a “[…] collection of ideas, a comprehensive vision, a way of looking at things, a worldview that embodies the way a person 21 or a group of people believes the world should be organized and function.” In line with numerous curriculum scholars who used varied notions of how to refer to seemingly same education processes and problems, the present study uses ideologies, theories, frameworks, orientations and models interchangeably to refer to curriculum sub-components as well as when referring to didaktik and curriculum broadly, recognizing that there are inherent differences in these theoretical concepts. Table 1 provides a comparison of varied notions that curriculum has been classified into by a number or curriculum scholars. Table 1 Classification schemes of curriculum tradition Eisner & Vallance (1974) academic rationalism technology &cognitive processes self actualization Schubert (1986) Schubert (1996) Zeichner (1993) Ellis (2004) Kliebard (2004) Schiro (2013) intellectual traditionalist social behaviorist intellectual traditionalist social behaviorist academic knowledge centered social efficiency scholar academic social efficiency child study learner centered socialmeliorist social reconstruction experientialist experientialist social reconstructionism critical reconstructionist social efficiency developmentalist social reconstructionist progressive & learner centered society centered humanist Adapted from Schiro (2013). Most curriculum theorists agree that there are primarily four distinct ideologies humanists/scholar academic, child study/learner centered, social meliorists/social reconstruction, and social efficiency – within the curriculum tradition, and even when different labels are used to describe them, the underlying positions and assumptions are the same. Schiro (2013) noted that when considering Learner Centered ideology in the US, for example, it has been labeled as “…child study (1890s), progressive education (1910-1950), open education (1965-1980), developmentalist (1970-1990), and constructivist (1990-present)” (p.10). To further elaborate 22 these four curriculum ideologies, I first provide an extended discussion on how they came to be established as such in the U.S. at the turn of the 20th century, and what influences played significant roles on proponents of each of them. Then, I review the developments in the curriculum field to present days and offer a condensed summary of key arguments put forth by major curriculum theorists in the respective periods of time. The curriculum thinking in progressive era, roughly 1890 to 1930, evolved into discussions around four key ideologies, such as scholar academic, social efficiency, learnercentered, and social reconstruction (Schiro, 2013), while Kliebard (2004), taking a historical approach, distinguished four main interest groups competing with their ideas over curriculum control during the initial struggle towards curriculum making. Putting these two classifications together, we get the following four competing groups pushing their visions forward to win the battle for American curriculum: 1. Humanists/scholar academic (for example, Charles W. Eliot, William Torey Harris) 2. Child study/learner centered (for example, G. Stanley Hall, William Heard Kilpatrick) 3. Social meliorists/social reconstruction (George Counts, Harold Rugg, Lester Frank Ward, etc.), and 4. Social efficiency (for example, John Franklin Bobbit, Charles Ellewood, Ross Finney, David Snedden). Progressive era cannot be discussed without considering John Dewey’s role in shaping curriculum thinking and developments of the time. Dewey’s work in designing school practices in his Laboratory School at University of Chicago as well as his theoretical insights over 23 educational issues in the US at the turn of the 20th century contributed largely to educational developments that took place in the US for nearly half of the 20th century. However, as a prominent figure, he did not fully side with any of the four groups, but shared more in his ideals and principles with child study/learner centered group and reconstructionists or social meliorists. Both local national forces and developments at the time (incoming emigrants, industrial revolution, etc.) and external forces (primarily European influences) were at play during the formation of these four competing groups. Humanists/scholar academic Humanists promoted a curriculum model that pushed for liberal education or a more general academic curriculum that focused on reading, writing, and arithmetic, but also required taking other courses such history, geography, arts, Greek and Latin classes in high school (Ravitch, 2000). Humanistic curriculum was dominant throughout 19th century, but by the end of it, around 1890s, proponents like Charles Eliot and William Torrey Harris, felt the mounting pressure of other developments in the country, and started to talk about ‘new education’. However, as Ravitch (2000) noted, what they meant by ‘new education’ was introduction of science into the academic curriculum. At the heart of humanistic approach was the belief that the role of the curriculum was to improve society by advancing academic achievement of individuals. Humanists were heavily influenced by European philosophical and psychological developments. They were strong supporters of the doctrine based on mental discipline that derived from the work of German psychologist Christian Wolf. Mental disciplinarians believed that “certain subjects of study had the power to strengthen faculties such as memory, reasoning, will, and imagination” (Kliebard, 2004, p. 4). In this regard, the curriculum in the U.S. during 24 19th century adopted heavily from the didaktik theory of teaching and learning in Germany, and most continental Europe, that was based on idealism of German philosophers such as Immanuel Kant, Georg Wilhelm Friedrich Hegel, Wilhelm Dilthey, and Wilhelm von Humboldt (Reid, 1997). Specifically, key proponents of humanistic education, Eliot and Harris, were considered to be America’s leading Hegelians, who contributed in educational developments during the 1890s as part of the Committee of Ten, amongst else, since Eliot was also president of Harvard University, while Harris was the U.S. Commissioner of Education (Kliebard, 2004). Both Eliot and Harris studied educational thought developed in Europe, where Eliot event spent years in visiting France and Germany studying their educational systems to detail, interviewing policymakers as well as school leaders and teachers in school settings. Nevertheless, the 19th century curriculum dominated by humanistic approaches started to be viewed as outdated and traditional, and three other ideologies lined up to challenge it. Child study/learner centered group Child study/learner centered group or developmentalists as they were also referred to came into American curriculum debate with the initiation of child study movement in 1880s by G. Stanley Hall (1844-1924), a movement that “encouraged educators to study children as they were – to watch them carefully, to listen to them intently, and to collect data about them so that instruction could be designed based on observations of children’s nature, needs and interests” (Schiro, 2008, p. 113). The movement gave birth to what is widely known to date as child- or learner-centered instruction, a concept strongly associated with the work and teachings of American philosopher and educator John Dewey. Kliebard (2004) discussed John Dewey as separate from the four interest groups, but other authors associate him primarily with two of the four groups, namely, developmentalists and social reconstructionists (Pinar, 1995; Ravitch, 2001; 25 Schiro, 2008). The movement promoted some aspects of Herbartian curriculum, which focused on subjects being taught, with emphasis on history and literature and later science, instruction methods, and steps that would make curriculum accessible to learners (Pinar, 1995). In addition to Dewey, G. Stanley Hall, Francis Parker, and William Heard Kilpatrick are referred to as developmentalists in their approaches to curriculum. The heaviest influence on developmentalists was German pedagogue Johann Friedrich Herbart (1776-1841), who himself was influenced by Kant, Hegel and Pestalozzi, (Pinar, Reynolds, Slattery and Taubman, 1995). In addition, many American educators and psychologists travelled to Europe to study Herbartians, Froebelians and Pestalozzians (Schubert, 1986). Again, as in humanists’ case, there is a presence of German philosophers and educators on American educational leaders and experimenters thinking, but drawing from Herbartianism, developmentalists tried to break away from ‘mental discipline’ into building educational experiences that have the learner at its core. Reportedly, Dewey, a student of Hall at Johns Hopkins University was advised to turn to German idealists and disregard British empiricism, which led to Dewey writing his dissertation on “Psychology of Kant” in 1884 (Schubert, 1986). In addition to Herbart, four more European philosophers and educators influenced developmentalists or learner-centered instruction movement in the U.S, including: Johan Amos Comenius (1592-1670) who viewed education as developmental, progressing from concrete experience to abstract thought; Jean-Jacques Rousseau (1712-1778), French philosopher, who viewed children as naturally good and society as corrupt and contended that the role of education should be to maintain children’s goodness free from corruptions of the society, and to create environments in which they live their childhood fully and involve in rich experiences that facilitate their growth; Johann Hein Pestalozzi (1746-1827), Swiss educator, credited with 26 putting Rousseau’s ideas into practice, supported the idea that children should be enabled to explore their interests in spontaneity and play-based learning. Dewey seems to have adopted his ideas for learning by doing approaches he promoted in the U.S. from Pestalozzi; and lastly, Friedrich Froebel (1782-1852), a German educator and student of Pestalozzi, who invented kindergarten and pushed for play-based learning, in which children can use toys and other materials to shape and manipulate them as they desire (Schiro, 2008). The educational ideas generated in Europe influenced the thinking of developmentalists such as Hall, Francis, Dewey, Kilpatrick and others. These influences were obvious in the work these educators engaged with in the U.S. in directly implementing those ideas into different contexts and projects. For example, developmentalists reiterated that children and not content should be the focus of teaching, teachers should create an experiential environment that encourages learners to create personal meaning, and school subjects should be integrated through project learning (Dewey & Dewey, 1915). Dewey together with his wife established Laboratory School at the University of Chicago, which was entirely based on child-centered model of education (Ravitch, 2000). In addition, Kilpatrick’s work on project method was an actual implementation of borrowed ideas from Europe. He concluded that children are naturally social and that they require a skillful teacher to guide their purpose, who in turn engages in ‘projects’ to stimulate and guide learners’ learning and character building (Kilpatrick, 1918). Social meliorists/social reconstruction group Social meliorists were another group of enthusiasts who challenged the traditional curriculum of 19th century by putting forward a vision of education that saw schools as principal force for social change and social justice (Kliebard, 2004). Pinar et al. (1995) viewed social reconstruction as the best reform movement during the progressive era, and associated, as the 27 majority of the other historians, Lester Frank Ward, John Dewey, Harold Rugg and George Counts, as most prominent proponents of the movement. The social meliorists view “human experience, education, truth, and knowledge as socially defined” (Schiro, 2008, p. 143), and they did not believe that there could be a good individual apart from a good society (Counts, 1932). All these educators considered education as an engine that will resolve all social ills and believed in the power of education to renew and transform society. “The continuity of any experience through renewing of the social group is a literal fact. Education, in its broadest sense, is the means of this social continuity of life” (Dewey, 1916, p. 3). In other words, Dewey considered that education can be a catalyst for ensuring continuity of life through renewing social structures. Commenting on Dewey’s vision for education with regard to social reconstructionist movement, Kliebard (2004) noted that implementing such a vision required sweeping changes not only in curriculum but in the way the schools were run. Dewey’s conceptualization of education asked for changing not only the elements of education, such as curriculum and instructional methods, but also the process under which those elements made their way into the classroom affecting learners’ experiences and lives. Influences over social meliorists were more diverse. Schiro (2008) argued that ideas of social reconstruction were as old as establishment of American nation when Americans started to reject British rule and worked on building their own economic, political and social system. These deep-rooted ideas were first voiced by Lester Frank Ward during 1880s, when he opposed social Darwinism, which asserted that “the best” of the race would rise to the top in line with Darwin’s theory of survival of the fittest (Pinar et al., 1995). Ward believed that “status quo of the human affairs was not ‘natural’ and thus changeable” (Pinar et al., 1995, p. 104). Ward’s ideas were further developed by Dewey, Rugg and Counts. Dewey and Counts were also heavily influenced 28 by European philosophers, primarily Marxist ideas, as a result of visits they took to Western Europe and Russia. Naively, both Dewey and Counts believed that Russian education system was the best in the world and saw it as a model to be adopted in the U.S. (Ravitch, 2000). Rugg, on the other hand, had a background in engineering and believed that a new social order would have to eliminate subject-centered schooling with more child-centered practices supported by technology integration (Ravitch, 2000). Social Efficiency Lastly, social efficiency group appeared in the US curriculum debate at the turn of the 20th century. Kliebard (2004) elaborated that ideas outside the realm of education gained much attention within educational community – in particular ideas of Frederick Taylor, an engineer from scientific management, that gave birth to social efficiency as a social ideal and educational doctrine, and ideas of Edward Thorndike from psychology, who introduced educational or “intelligence” measurement into American education. Drawing from the work of Edward Ross, Frederick Taylor, Franklin Bobbit, Edward Thorndike, and David Snedden, Kliebard (2004) interpreted the arguments that lead to social efficiency model within the U.S. education, a model that shaped to a large extent the U.S. education curriculum and educational thinking and practice to present days (Luke, 2004). Social efficiency model suggested that students would learn in schools only what they needed to know in order to perform as adult members of social order, and “To go beyond what someone had to know in order to perform that role successfully was simply wasteful” (Kliebard, 2004, p. 77). Kliebard further argued that social efficiency educational model came into being, on one hand as a response to the flourishing of industrialism, and as antithesis of Dewey’s ideas on the other. 29 To see the contrast with regard to the vision of education between social efficiency model and the other three competing models, one has to examine the work of Franklin Bobbit, who is credited for establishing the curriculum field with his publication The Curriculum in 1918 (Pinar et al., 1995). Arguing in favor of scientific management and against tradition model of curriculum, Bobbit wrote: The old education, except as it conferred the tools of knowledge, was mainly devoted to filling the memory with facts. The new age is more in need of facts than the old; and of more facts; and it must find more effective methods of teaching. (Bobbitt, 1918, pp. 10– 11) The quote shows that the goal of scientific management model was not to change the content of old education as much as to make such a model more efficient by putting more facts into learners’ minds in shorter period of time. Social efficiency model not only rejected other educational models but also brought to light the idea of curriculum differentiation, an idea attributed to sociologist David Snedden (Ravitch, 2000). Curricular differentiation marked the beginning of implementation of vocational education in the U.S. education system, which was to lead to tracking of students shortly after. Ideas from industrial revolution transformed into ideas for industrial education so that learners’ skills at the end of schooling met the needs of employers in factory assembly lines. To facilitate the line of argument for the influences that led to social efficiency to become the dominant curriculum model, two types of influences are presented here: internal and external. By internal influences I mean forces that shaped decision-making in a larger scale of how the country is run in general, and how education is organized in particular. These forces include 30 increasing number of incoming emigrants, mass education, and industrial revolution occurring at the time. The second group includes external influences that primarily relate to European philosophical, psychological and pedagogical influences that directly or indirectly contributed to emergence of social efficiency model in the U.S. curriculum. Internal influences on curriculum making Schiro (2008), paraphrasing Callahan (1962), emphasized that social efficiency model had its origin in four movements, namely, social reform, utilitarian education, behavioral psychology, and scientific methodology, which according to Schiro, are still active to date. Social reform was fired up by muckraking journalism during first two decades of 20th century, which helped create a reform-conscious population “that put social needs above all else” (Schiro, 2008, p. 72). The idea behind utilitarian education was to emphasize the relevance of making schools useful for the life of individuals and the nation, while behavioral psychology enabled social efficiency model to frame its endeavors into a psychological context. John Watson, Edward Thorndike, and later B. F. Skinner created that psychological base for the social efficiency (Schiro, 2008). Finally, scientific methodology with its techniques and methods for accurate measurement and eliminating waste gave social efficiency the tools to overcome the inefficiency its proponents had identified in old education (Schiro, 2008). Oakes (1985) listed millions of incoming immigrants, initiation of mass public schooling at high school that suddenly at the turn of 20th century had to educate far more children as before, as well as growing needs of industry for better skilled workforce as problems that needed to be addressed by the nation, and that enabled social efficiency model to emerge as best tool to fix the problems effectively and efficiently. In her words: 31 Social Darwinism had provided the “scientific” justification for the schools to treat the children of various groups differently. The Americanization movement provided much of the content of the schooling to be offered the children of the poor and immigrant. It was left to American industry to provide the form this new kind of education would take. (Oakes, 1985, p. 27, emphasis in the original) Katz (1987) criticized the educational policy reform from a historical perspective and concluded that systemic wide reforms that attempt to change the quality of schooling almost always fail. However, the success of social efficiency model and its institutionalization in a nation-wide scale seems to prove his conclusion invalid. Still, Katz built his argument around the notion that external societal order and developments are the true forces that shape educational experiences as those forces are more powerful than any educational planning or policy. It seems, in the case of social efficiency, efforts of policy makers to change educational system matched with the vision of external societal forces. Arguably, alignment of these visions constitutes the core explication why social efficiency model was successful. Social efficiency model was the most American-made curriculum idea, but it still was influenced by three major developments in Europe. First, as noted above, social Darwinism ideas justified curricular differentiation into general and vocational tracks. Second, the very idea of tracking/curriculum differentiation through vocational education was borrowed from Germany, which had already put in place such a model of education. Third, the idea of intelligence quotient (IQ) testing was borrowed from French psychologist Alfred Binet (Ravitch, 2000). Perhaps one of the ways to understand why social efficiency model of curriculum won the competition over American curriculum against three other competing models – humanists, 32 developmentalists, and social reconstructionists – is to ask the question: What problems did educators in the U.S. at the end of 19th and beginning of 20th century have to resolve? As noted above, education system was under a trifold pressure, including increased numbers of incoming immigrants, mass schooling, and industrial revolution. While humanists seemed to have been outsiders in the competition from the very beginning as all three other groups were challenging them at the core, it is still less clear why developmentalists and social reconstructionists, who shared some similar ideas and principles, and indeed, united through the figure of John Dewey, failed to occupy a more prominent place in American curriculum field. Kliebard (2004) provided some insight as to why social efficiency model won the day: And, insofar as the effect on actual school practice is concerned, the prominence and persistence of the basic ideas of the scientific curriculum-makers indicates that someone like relatively obscure Bobbit may have been far more in touch with the true temper of his times than the world-renowned Dewey. (p. 104). Hlebowitsh and Wraga (1995) concluded that Dewey was celebrated more as a theorist and philosopher of education but had less success in implementing his ideas into practice because he did not side with class and cultural struggle by ignoring the ‘strong’ and siding with vulnerable groups, i.e. children and marginalized communities. Social efficiency group, on the other hand, seems to have been at the nexus of converging interests at the early part of 20th century in the U.S. to win the American battle over curriculum. One of the most influential curriculum scholars to the present days is Ralph Tyler, whose four questions in his Basic Principles of Curriculum and Instruction pertaining to development 33 of curriculum and instruction program have reached almost a status of revealed doctrine (Kliebard, 1970). Tyler’s (1949, p. 51) four questions are the following: 1. What educational purposes should the school seek to attain? 2. How can learning experiences be selected which are likely to be useful in attaining these objectives? 3. How can learning experiences be organized for effective instruction? 4. How can the effectiveness of learning experiences be evaluated? In short, the questions focus on stating the educational objectives, selecting learning experiences, organizing learning experiences and finally evaluating them. From the four questions, the first one is the most important since the other three will follow based on what educational objectives will be pursued (Kliebard, 1970). According to Tyler, the educational objectives should be drawn from three sources, including studies of the learners, studies of society and suggestions from the subject-matter specialists, while filtering the sources through philosophical and psychological screening. Tyler’s rationale assumes that one can easily match objectives and outcomes through the evaluation process, and it is based on the idea of education system as an input-output model. The Tyler Rationale operates under the assumptions of the educational philosophy of social efficiency, and serves as a technocratic model that dominated American schools to present day (Deng and Luke, 2008). Despite subsequent curriculum theories and educational models, “key epistemological, psychological, pedagogical, cultural and political questions continue to be deferred under the assumption, pace Tyler, that neutral, consensually-driven statements of outcomes are the common building blocks of any programmatic approach to curriculum” (Deng 34 & Luke, 2008, p. 74). Indeed, the persistence of Tyler rationale situated in social efficiency curriculum ideology throughout the 20th century to present days has been evidenced by both curriculum scholars as well as historians of education (Kliebard, 2004; Ellis, 2004; Labaree, 2005; Pinar, 2011, 2012; Ravitch, 2001). During the second part of the 20th century, two educationists – Jerome Bruner and Joseph Schwab – occupied a prominent role in the curriculum field, with Bruner advocating for organizing the subject matter according to academic disciplines, thus promoting the doctrine of disciplinarity within the curriculum tradition, and Schwab emphasizing the understanding of learners as the starting point for translating academic discipline into subject matter (Deng & Luke, 2008). According to Bruner, academic knowledge should be derived from the structure of the respective academic discipline. Bruner initiated a curriculum reform movement in the US in the 1960s to respond to the post-Sputnik calls for advancing scientific and technological expertise. Bruner emphasized that intellectual activity is the same both in third-grade as well as the most advanced level, and highlighted that “Grasping the structure of a subject is understanding it in a way that permits many other things to be related to it meaningfully. To learn structure, in short, is to learn how things are related” (Bruner, 2009, p. 7). Bruner was an advocate of the knowledge-centered (or academic rationalism or intellectual traditionalist) ideology of curriculum and in his work put emphasis on the process and concepts of the academic disciplines rather than the products – in other words, the process of education should result in understanding rather than performance (Ellis, 2004). Further, efforts to reform curriculum, and science education specifically, during the post-Sputnik period focused on getting students to think like scientists in the academic disciplines so that they would be set on the path to engage in science careers (Duschl, 2008). 35 With the focus on the practical inquiry, Schwab (1969), after he declared the curriculum field moribund, announced that there would be a renaissance in the field only when the emphasis was shifted from the theoretic to the practical – practical in the sense of “[…] complex discipline, relatively unfamiliar to the academic and differing radically from the disciplines of the theoretic” (Schwab, 1969, p. 1). Schwab advocated for a new form of curriculum development that would assist in translating the academic knowledge into subject matter. The new curriculum making would rely on a process of collaborative deliberation that would include a number of what Schwab termed ‘commonplaces’ or ‘agents of translation’ – subject matter, learners, milieu, teachers – assisted by curriculum specialists (Schwab, 1973). According to him, subject matter commonplace refers to someone who knows the materials and academic discipline for the given subject; learners refers to someone who understands the learners as primary beneficiaries of curriculum, and in that aspect, that someone should have deep understanding of learners’ minds and hearts as well as probabilities for their future economic status and function; milieu refers to the classroom, school and community where learning will take place, and it would require someone who understands the values, attitudes towards education and aspirations of the stakeholders in the milieu that might affect children’s learning; teachers refers to someone that has an understanding of teachers and how ready they are to engage in new learning and new ways of teaching; and curriculum specialists, who are required to enable the curriculum making process taking into consideration the relevant experiences of the four commonplaces (Schwab, 1973). Schwab’s commonplaces expand the triad of commonplaces emphasized by Dewey at the turn of the 20th century pertaining to the learners, society and specialized knowledge, which if viewed from the curriculum ideologies’ perspectives, relate to learner-centered, social efficiency, academic rationalism and to a lesser extent to social reconstructionism (Deng & Luke, 2008). 36 From this perspective, it can be argued that Dewey and Schwab were the most prominent educators who advocated for tightly integrated curriculum approaches that tried to overcome ideology divide across the four common ideologies that span curriculum field to this day. Also, to some extent, the concerns raised and aspirations promoted by Dewey and Schwab resemble the more established didaktik theory in Europe, where the goals of education are embodied into the unified and integrated concept of Bildung. During the later part of the 20th century and early 21st century, social reconstructionist groups reemerged in the curriculum field calling for social justice and criticizing the curriculum for maintaining the social injustices in the society (Pinar et al., 1995; Apple, 2004; Pinar, 2011, 2012). During this period, curriculum field expanded to include diverse challenges of the society such as poverty, racism, sexism, marginalized groups and so forth. As such, curriculum was defined as a complicated conversation (Pinar et al., 1998) as well as an interdisciplinary study of educational experience (Pinar, 2012). Pinar, especially during the first part of the 20th century, became very critical of educational system in the US, arguing that curriculum theorists are suspicious of the policy talk around ‘standards’ and ‘accountability’ and ‘business thinking’ in curriculum making, from which perspective curriculum is offered as a product from producers to consumers (Pinar, 2012). Indeed, Pinar (2004) complained that due to anti-intellectual culture in the US of the predominantly white culture and history now politically hegemonic in the US the field of education has remained underdeveloped. According to social reconstructionists, the solution to the social injustices relies on educating social activists who will carefully act to right the wrongs inflicted on the ‘oppressed’. They view the continued emphasis on standards-based education and accountability models as everlasting social efficiency model of education, which only pressed for efficiency in education delivery. 37 Comparison of didaktik and curriculum orientations Comparison of didaktik and curriculum orientations has been a topic for discussion and source of inspiration for numerous books, volumes and articles (Autio, 2013; Gundem & Hopmann, 1998; Siljander, Kivelä, & Sutinen, 2012; Uljens, 1997; Westbury, Hopmann, & Riquarts, 2000). Scholars have argued that while both didaktik and curriculum orientations deal with the same issues and concerns in principle in terms of educating children, clear differences persist as to how they go about doing the school business. To show how striking the differences are between the two traditions, let’s turn again to the key questions (see Table 2) that guide preparing and selecting curriculum and instruction for the classroom practices as shown in the previous sections in the form of questions raised by Wolfgang Klafki in didaktik and Ralph Tyler in curriculum orientations. As shown earlier in this chapter, Klafki’s questions for didaktik analysis and Tyler’s questions as part of Tyler Rationale for selection of curriculum and instruction have been and still are influential in educational systems in respective traditions. An examination of these questions reveals the most striking difference between the two sets of questions, and indeed, between the two traditions, i.e. the complete lack of content focus on Tyler’s questions and complete lack of assessment/evaluation in Klafki’s questions. Second, Klafki’s questions are addressed to teachers and what content and examples and experiences they will select and teach to their students, thus making didaktik a teacher-oriented tradition that relies on teacher professionalism, responsibility and autonomy, while Tyler’s questions are addressed at least at the school-level, thus making curriculum an institutional- or system-oriented tradition where teachers implement what schools/system require them to. 38 Table 2 Comparison of Klafki’s and Tyler’s questions Didaktik Orientation Curriculum Orientation Klafki’s five question for didaktik analysis 1. What wider or general sense or reality do these contents exemplify and open up for the learner? What basic phenomenon or fundamental principle, what law, criterion, problem, method, technique or attitude can be grasped by dealing with this content as an ‘example’? 2. What significance does the content in question or the experience, knowledge, ability or skill to be acquired through this topic already possess in the minds of the children in my class? What significance should it have from a pedagogical point of view? Tyler’s four questions for curriculum and instruction 1. What educational purposes should the school seek to attain? 2. How can learning experiences be selected which are likely to be useful in attaining these objectives? 3. How can learning experiences be organized for effective instruction? 4. How can the effectiveness of learning experiences be evaluated? 3. What constitutes the topic’s significance for the children’s future? 4. How is the content structured [which has been placed into a specifically pedagogical perspective by questions 1, 2 and 3]? 5. What are the special cases, phenomena, situations, experiments, persons, elements of aesthetic experience, and so forth, in terms of which the structure of the content in question can become interesting, stimulating, approachable, conceivable or vivid for children of the stage of development of this class? Third, Tyler’s questions are about efficiency of the system, and they require an evaluation of learning experiences so that there is an empirical base as to how efficient the system is in terms of learning outcomes. Klafki’s questions emphasize idiosyncrasies of individual classrooms, teachers and students allowing for the learning outcomes to vary as well as to be determined by 39 specific encounters of students with content and teachers in specific classrooms, which ultimately will have a different significance educationally, emotionally and culturally. Lastly, Tyler’s questions in a more narrow sense, and the Anglo-American curriculum framework in a broader sense, operate focusing on questions 4 and 5 in Klafki’s didaktik analysis which are pedagogical in nature in the sense that they are concerned with how learning should take place, while questions 1, 2 and 3 that are curricular in nature in the sense that they deal with content have received far less attention in curriculum field (Deng & Luke, 2008). Didaktik and curriculum traditions have been referred to as two educational “superdiscourses” in the Western world (Autio, 2006), while in other cases, it has been stated that the two traditions are “[…] incommensurable as they operate on the basis of fundamentally different assumptions and ideas” (Biesta, 2011, p. 176). According to Biesta (2011), didaktik theory was established as an academic field of its own in the Continental Europe, while curriculum theory in Anglo-American world was not. Referring to British scholars (Tibble, 1966; Hirst, 1966), Biesta (2011) noted that education in the Anglo-American context was constructed as an interdisciplinary field with theoretical input from other foundational disciplines such as philosophy, history, psychology and sociology. According to Hirst (1966), curriculum theory cannot generate any unique understanding about education in addition to what is generated through ‘fundamental’ disciplines of philosophy, history, psychology and sociology. On the other hand, Biesta (2011) argued, educational sciences in Continental Europe developed into a discipline of its own because it made the distinction between the natural and social sciences. He attributed this distinction to German educationist Wilhelm Dilthey. Dilthey argued that there was a fundamental distinction between the study of natural phenomena and the study of social and historical phenomena. While the world of natural 40 phenomena is a world of cause and effect which for that reason is amenable to explanation, the socio-historical world is a world in which human beings pursue aims and plan actions in order to achieve these aims. The main objective of the study of the sociohistorical world should therefore be to clarify the aims people pursue. This is not a question of explanation but requires understanding. Moreover, such understanding cannot be generated through observation from the ‘outside’ but needs interpretation and an insider perspective. (Biesta, 2011, p. 186, emphasis in the original) Biesta noted that the insider perspective is developed only when there is a clear conception of education and what the object of study of such field is. In his view, there is nothing wrong asking philosophical, historical, psychological, and sociological questions of education, but what is further needed is asking educational questions about education, which according to him include questions about the processes and practices of education. In other words, according to Biesta (2011), educational questions about education are asked and addressed in Continental Europebased didaktik tradition but not in Anglo-American-based curriculum tradition. Still, the lacking of a clear independent educational theory in Anglo-American context does not constitute the end of comparison and dialogue between the two traditions. In the meantime, on the other side of the Atlantic, the US curriculum scholars have their dilemmas about the curriculum field. For example, Connelly and Xu (2008) posit that there were two types of progressive scholars during the progressive era of education at the turn of 20th century in the US, namely pedagogical progressives represented by John Dewey and administrative progressive represented by psychologist Edward Thorndike. The first had most impact on curriculum rhetoric, and the latter on curriculum structure and practice, a scenario often referred to as a battle between Dewey and Thorndike where Thorndike won and Dewey 41 lost (Connelly & Xu, 2008; Lagemann, 1989). Comparing didaktik and curriculum, Westbury (2000) argued that under curriculum theory, the social and cultural world is an objective structure and the task of the curriculum is to present the structure to students, and help them determine what place they will occupy in it. Within curriculum tradition the teacher role is the one of an invisible agent of the system, “seen as “animated” and directed by the system and not as a source of animation for the system” (Westbury, 2000, p. 21). On the other hand, under didaktik, the world that education should help create is presented as subjectified and the question is not how students learn or how to lead students towards a body of knowledge, but the question is about the object of learning in terms of Bildung, what it should signify to the student, and how students themselves experience this significance. Didaktik theory is a teacher- rather than system-centered system, where the role of the teacher is “forming” rather “instructing” his or her students and in so doing, celebrates the individuality of each teacher as an active and reflective curriculum and decision maker rather than seeing the teacher as implementing a workplace manual of best practices, that is, a curriculum or a curriculum package (Westbury, 2000). Under Didaktik, teachers enjoy autonomy because “[…] nothing is decided by the choice of the matter itself, but by how the teacher chooses to enact a given content for a given audience of students under given circumstances” (Hopmann, 2007, p. 117). Furthermore, Hopmann argued, Didaktik is the necessarily restrained effort to make certain substantive outcomes possible, while knowing that it can always turn out completely different from what was intended. Hopmann (2007) writes that: “The purpose of teaching and schooling is in this perspective . . . the use of knowledge as a transformative tool of unfolding the learner’s individuality and sociability, in short: the Bildung of the learners by teaching” (p. 115). 42 To a large extent, Didaktik relies heavily on the notions of teacher autonomy and teacher responsibility. As I put forth in the measures for the study, these constitute the core constructs that will be examined with the data available from PISA. In particular, teacher autonomy and teacher responsibility allow for Didaktik reflection to take place and be used by teachers in their respective classroom settings in Didaktik systems. “Through and by way of Didaktik reflection each and every teacher must determine, as an expert professional, what must be done in this setting, with this material, with these students, in the light of the values associated with Bildung” (Westbury, 2000, p. 29, emphasis in original). Subsequently, the lack of teacher autonomy and responsibility limits the opportunities to design instructional practices that best fit the needs, abilities and interests of specific students in specific classrooms. Utilizing an innovative approach based on logic models framework, Tahirsylaj, Niebert and Duschl (under review) compared and contrasted didaktik and curriculum traditions to elicit their key features as well as similarities and differences. Logic Models are tools primarily developed and used by program managers and evaluators to describe programs in terms of their assumptions, inputs, activities, outputs and outcomes (McLaughlin & Jordan, 1999). The result of that logic model-based comparative work is presented in Table 3. Table 3 offers a quick summary of how the two systems compare and contrast. As one examines the list of elements representing didaktik and curriculum under Assumptions, Inputs, Activities, Outputs and Outcomes, it quickly becomes clear that these are two quite different systems despite the fact that they are primarily in the same business – that of educating students. One way to easily spot the differences is to follow the dashes, or the missing category, in the respective system. For example, under Inputs, there is a category for Teacher autonomy in Didaktik, but it is missing under the curriculum theory, or there are categories of Curriculum 43 implementation and evaluation under curriculum under Activities, but they are missing under didaktik as those are perceived as sole responsibilities of individual teachers under their specific circumstances with specific students. Table 3 Logic Model of Didaktik and Curriculum traditions ASSUMPTIONS INPUTS ACTIVITIES Didaktik Curriculum-InstructionAssessment - Education creates a world as subjectified - Hermeneutics - Humanistic / Romantic - Education creates a world as objectified - National Bildungs-standards (Competencies + Bildung) - Core curricula on the federal state level - Empiricism - Positivist / Formalist / Cognitivist / Constructivist - National common core standards ---- - School curricula based on federal state curricula (developed and enacted by within school subject matter teachers) - District and/or school curricula based on national common core standards - Teachers as professionals/ curriculum makers - Teacher autonomy - Students (Students’ conceptions as relevant aspects of classrooms) - Subject matters/objects of learning in terms of Bildung/content of teaching ---- Reflective practice of teaching - Teaching as licensed - Didaktik reflection - Observation based on competencies defined in Bildungsstandards - External state assessments of Bildungsstandards after grade 10 and 12 (state assessment counts for approx. 30% of students’ final grade) - Teachers as system’s agents ---- Students - Subject matters/learning for deep understanding Accountability - Effective teaching - Teaching as certified/in-serviced - Adaptive instruction ---- Assessments aligned to common core standards - Peer-teacher evaluation / selfprofessionalism - Systemic teacher performance evaluation -School evaluation – consequences are recommendations -------Teacher education exceeds subject orientation (psychology, -School evaluation – consequences are high-stakes - Curriculum implementation - Curriculum evaluation -Teacher education exceeds subject 44 OUTPUTS Short-term O U T C O M E S Medium-term Long-term educational sciences, FachDidaktik), includes 7 years of education (3 years undergraduate+2 years master+2 years in-service training) - Mandated advanced training on the job - Initiation of learning to help students achieve competencies - Unfolding of individuality/creativity - Meaning-making - Reflection - Bildung orientation (psychology, educational sciences, subject matter-oriented methods) - Professional development relevant to content to be taught - Learning outcomes - Test scores/Student performances - Deeper understanding ---- Mastery of knowledge and skills/deeper learning for deeper understanding - Self-determination ---- - Co-determination ---- - Solidarity ----Application of competencies for an active participation in society ---STEM literacy - Increased productivity and consumerism in the society - Diversity - Conformity - Increased commitment to Bildung - Maintained competitive advantage over other nations Notes: The dashes in the table above indicate that no corresponding concept was found for that tradition. Words in bold indicate changes during 2000s in Didaktik and curriculum as departures from their traditional. Following the concepts in bold in Table 3, it is evident that both Didaktik and curriculum evolved during the post-2000 period. Tahirsylaj et al. (under review) argued that these changes were introduced in both systems due to overall global education trends in general, and as a result of both internal and external pressure deriving from results in international assessments such as PISA. Specific documents from both traditions were consulted to identify the changes highlighted in Table 3 (Baumert, Bos & Lehmann, 2000; Weinert, 2001; Köller, 2009; National Research Council, 2007; Carnegie Corporation of New York, & Institute for Advanced Study, 45 2009). Introduction of standards-based curricula and external standardized assessments in didaktik, and efforts to create curriculum, instruction and assessment and introduction of common core standards in mathematics, English and sciences were the major sea changes identified when examining how Didaktik and curriculum evolved during the first decade of the 21st century. While there are indications that the two traditions are moving towards one another’s territories and converging, the differences rooted in their national histories and culture still hold and make them distinctly unique. The presentation and portrayal of these two traditions here relies on how key scholars view their respective fields internally – in other words, the description here is an insider’s perspective of didaktik and curriculum traditions and how they affect education in their respective geographies. Next, I turn to how education is viewed from the sociology of education theories rooted in Western culture. Overlap of didaktik and curriculum frameworks with sociology of education theories Nowadays, from a sociological lens at least, education is considered to be one of the key public functions and domains that any society in the world invests in and cares about, making education ever more ubiquitous (Baker, 2009). However, the expansion of mass schooling, particularly in the Western world, to the current levels only started during 19th century and radically accelerated during the 20th century ‘world education revolution’ (Boli, Ramirez, & Meyer, 1985; Meyer, Ramirez, Rubinson, & Boli-Bennett, 1977; Meyer, Ramirez, & Soysal, 1992). The expansion of education, amongst else, created opportunities for sociologists and other scholars to ask new questions as to how it affects society from the sociological perspective. To this end, three main theoretical sociological outlooks, namely, functional theory, conflict theory, and neo-institutional theory were forwarded to provide theoretical and empirical explanations, 46 interpretations, and understandings on the interplay between education and society at a macro perspective. In that sense, theories of sociology of education might be viewed as having a broader scope than educational frameworks, whose theoretical reach is narrower and more limited, although there are clear overlaps among them. Also, the summary of the theories below are presented as viewed from their proponents, but it must be noted that arguments shared by one group are fervently contested by those representing a competing theory (see for example Carney, Rappleye, & Silova, 2012). Functional Theory One of the core distinctions between three prevailing sociological theories is whether they view education’s role as a primary or secondary institution in the society. Functional theory considers that education is a secondary institution that plays the role of the socializer, i.e. students going through the education system are socialized into the meritocratic society (Collins, 1971). From this perspective, one of the core functions of education is to provide students with technical skill requirements that are needed in the job market (Collins, 1971). While doing so, education provides individuals with the opportunity to achieve educational attainment and social mobility based on their own meritocratic values. For this model, functional theory relies on a number of assumptions. The core one is that the society is stable. There are no major fluctuations in the society. Institutions have their own functions, and they all are interdependent. In turn, there is generally a fixed set of positions in the society and education’s role is to prepare individuals to fill those positions. French Emile Durkheim is considered as the founder of the functional theory. Durkheim considered that “modern educational system came to replace church as the central integrative institution of society and a crucial aspect of the maintenance of the social order through its 47 socialization functions” (Morrow & Torres, 1995, p. 12). Furthermore, Durkheim noted that a major role of education in society was to create a unity by providing a common moral code necessary for social cohesion (Ballantine & Spade, 2007). Based on Durkheim’s work on sociological theory of education, sociologists see the transmission of moral and occupational education, discipline and values as necessary for the survival of the society. In this regard, schools play a very important role in carrying out the functions that education provides for the larger society. Durkheim was concerned primarily with the value transmission for the stability of society (Ballantine & Spade, 2007). Durkheim’s views seem to have been limited in the sense that he didn’t provide an explanation of how and why society changed as a result of the emerging industrial societies during the 20th century. One of the main early functionalists in the U.S. was Talcott Parsons, who contended that there are four main functions played by education in a society, namely, academic, distribute, economic and political socialization (Parsons, 1959). He saw education as performing certain important tasks or “functioning” for society, such as preparing young people for roles in a democratic society. In this regard, Parsons’ views do not differ much from Durkheim’s, as both envision a fixed number of positions in the society that needs to be filled by individuals, while society remains largely stable at the same time. In this line of arguments, other sociologists have considered the social organization of schools and how values taught in schools lead to greater societal consensus and preparation for one’s role in society (Dreeben, 1968). More recently, Ballantine and Spade (2007) have argued that there are five functions that are played by education and which need to be taken into consideration from the functionalist perspective, including 1. Socialization, meaning teaching children to be productive members of society; 2. School organization enhances personal and social development; 3. Education helps 48 selection and training of individuals for positions in society; 4. Education promotes innovation and change; and 5. There are latent functions of education. These arguments extend to some extent the notion of functional theory in the sense that education can bring change and innovation, which challenges Durkheim’s and Parsons’ views that society is stable and there are fixed positions in the society. Baker (2014) argued that from functional perspective formal education reproduces society at the hands of various external forces. According to him, schooling socializes students through teaching curricula and credentials them for adult positions, while the institutional unidirectional influence flows from society to schooling. Furthermore, scholars have argued that individuals invest on education themselves so that they become more employable by possessing skills that are needed and valued in the job market (Becker, 1962; Schultz, 1961). These propositions were developed and put forward as part of the human capital theory, which considers that students and people in general acquire knowledge and skills but those skills are a form of capital, namely, human capital, to differentiate it from other forms of capital. Schultz (1961) argued that investment in human capital was associated with national economic output. Human capital theory was developed by economists who were trying to find out why states differ in terms of the economic output. However, the focus on more skills and knowledge and primarily on technical skills that made individuals employable also brought the fears that society was going to experience over-education or credential inflation. According to this argument, the society and job market would not be able to absorb all the individuals with advanced skills, therefore leading to a potential crisis. Functional theory contributed to an understanding of how education systems work and what purpose education serves in societies (Ballantine & Spade, 2007). However, on the downside, functional theory assumes a passive role of schooling, where individuals leaving 49 education fill in a generally fixed set of positions in society (Collins, 1971). In this sense, functional analysis fails to state the conditions under which a particular pattern holds rather than others. In other words, it does not explain well how did education came to play specific functions functionalists ascribe to it and why this, rather than some other models, will continue to be the same model in which education functions will be fulfilled, i.e. education as a secondary institution will ensure reproduction of skills needed by the society. Conflict Theory Conflict theory shares one common feature with functional theory – viewing education as a secondary institution – but it is fundamentally different in all other aspects. The core assumption of the conflict theory is that society is unstable due to conflicts between the stratification or classes and there is everlasting struggle among status groups (Collins, 1971). Based on this perspective, the main activity of education is to teach particular status cultures, both in and outside the classroom. Furthermore, education is viewed as a field in which people compete against each other in order to win in class competition. This perspective assumes that districts, school leaders and teachers decide to teach students certain skills to maintain and reproduce existing social order, while students are assumed to have no agency as the system itself prepares the path for students of different backgrounds to move ahead to the extent that their existing social status allows. Also same as functional theory, conflict theory considers that one of the functions of the education is to serve as a mechanism of occupational placement. The founder of conflict theory German Karl Marx argued that economy determines education and put forth his concept of “economic determinant” (Marx & Engels, 1971). Marx was concerned with the growth of capitalism during 20th century as well as with the social conditions of the exploited workers in the class system. He argued that schools create and 50 maintain inequality by teaching students an ideology that serves the interest of the rich and instills students a sense of the false consciousness. Max Weber (1961) also contributed to conflict theory with his concept of power, but had a different view about the role of economy on society. He considered that conflict in society was not based solely in economic relations. According to him, inequalities and potential conflict were sustained in different distributions of status, power and class. Other theorists contributed to the conflict theory argument in different ways. For example, one idea that emerged was the correspondence theory, which assumes that education prepares individuals to fit within their corresponding social status, thus ensuring the reproduction of society (Bowles & Gintis, 1976). According to them, “The educational system, perhaps more than any other contemporary institution, has become the laboratory in which competing solutions to the problems of personal liberation and social equality are tested and the arena in which social struggles are fought” (Bowles & Gintis, 1976, p. 5). In line with Marxian thinking for education to be an arena for fighting social struggles, they are also similar to Marx in the belief that the corporate world, i.e. capitalism, determines what happens in education. Thus, they are completely in line with the “economic determinant” concept that was initially developed by Marx. French sociologist Pierre Bourdieu was the one to advance conflict theory beyond economic determinant argument. He argued that education system favored students who possessed a certain level of cultural and social capital (Bourdieu, 1977). Cultural capital refers to the cultural practices, including language patterns and experiences such as visits to museums, that provide knowledge of middle- and upper-class culture, which is also the dominant culture in schools. According to him, the schooling enables social reproduction based on the cultural and 51 social capital that each student possessed and brought in with them to the school settings. Those without the cultural capital that is dominant in the school contexts are disadvantaged and thus have thinner chances to succeed. Cultural capital allows students from middle and upper classes to convert home and school advantages into economic advantage (Lareau, 1989). In this regard, conflict theory changed the functional theory’s assumption that the schools are ideologically and politically neutral and that schools operate based on meritocracy. Conflict theorists argue that inequality is based on one’s position in the social system, not merit, and that schooling privileges some children and disadvantages others. The theory focuses on the tensions created by power and conflict that ultimately cause change. Furthermore, conflict theorists see education as a tool of capitalist society, controlling the entrance into higher levels of education through selection and allocation function and manipulating the public. It is argued that until society’s economic and political systems are changed, school reform providing equal access and opportunity to all children will be impossible (Bowles & Gintis, 1976). Conflict theory is more in accord with evidence than functional theory (Collins, 1971), but still, same as functional theory assumes a passive role of schooling and also considers that states are fixed. Scholars have argued that from conflict theory perspective, schooling reproduces and even validates existing social inequalities (Baker, 2014). Neo-institutional Theory Neo-institutional theory brought a fresh perspective in the field of sociology and how education is viewed from sociological perspectives. First of all, contrary to previous prevailing theories, in neo-institutional theory perspective education plays the role as a primary institution in the society. It operates on the assumption that education as a primary institution influences 52 society in multi-directional ways owing to the rise of the Western university (Baker, 2014). The theory emerged from the work of American John Meyer who put forth the legitimation theory, which treats education as constructing or altering roles in society and authoritatively allocating personnel to these roles (Meyer & Rowan, 1977). Meyer and Rowan argued that education consists of allocation rules and initiation of ceremonies designating which persons possess the authority and competence for various elite roles, thus providing legitimacy to individuals to occupy certain roles. One key argument of the neo-institutional theory is that education not only changes individuals, but it also transforms other institutions and modern society entirely (Baker & LeTendre, 2005; Baker, 2011; Meyer & Rowan, 1977). It is noted that educational revolution has strengthened transformative role of education, and that as a result individuals attend long period of formal schooling and adult status is mostly determined by academic outcome (Baker, 2014). In addition, Baker argued that in post-industrial society all institutions are increasingly influenced by ideas, values, and norms originating from education as a social institution. Given this context, individuals acknowledge the results of formal schooling, i.e. they understand that where they end-up in the social ladder is a result of their cognitive abilities and academic achievement during the schooling process. Education in the schooled society is omnipresent; furthermore, providing widespread educational opportunity is desirable and is even formally recognized as a basic human right (Baker, 2014). As an institution, formal education has come to be one of only a few institutions dominating modern society. Meyer and Rowan (1977) emphasized that schools are fundamentally normative, rule-conforming institutions whose work is only loosely coupled with the technical demands and efficiency requirements found in some other institutional fields. Neo- 53 institutional theorists challenged the notion that the rise and spread of modern schooling occurred as a functional response to demands of polity or economy (Meyer, Boli, Thomas, & Ramirez, 1997). Their contention is that the modern institution of schooling spreads because it becomes cognitively normative. Baker (2011) noted that the education revolution is a cultural phenomenon more than a material or political one, although it has major material and political consequences. Widespread education in a postindustrial society creates cultural ideas about new types of knowledge, new types of experts, new definitions of personal success and failure, a new workplace and conception of jobs and new definitions of intelligence and human talent. Educational achievement and degree attainment have come to dominate social stratification and social mobility, superseding and delegitimizing forms of status attainment left over from the past (Baker, 2014). Discussing the role of education as a loosely coupled system, Meyer and Rowan (1977, p. 340-1) contend that “institutionalized products, services, techniques, policies, and programs function as powerful myths and many organizations adopt them ceremonially. But conformity to institutionalized rules often conflicts sharply with efficiency criteria and, conversely, to coordinate and control activity in order to promote efficiency undermines and organization’s ceremonial formal structures form the uncertainties of technical activities by becoming loosely coupled, building gaps between their formal structures and actual work activities”. Furthermore, they argue that “Organizational structures are created and made more elaborate with the rise of institutionalized myths, and, in highly institutionalized contexts, organizational action must support these myths. But an organization must also attend to practical activity. The two requirements are at odds. A stable solution is to maintain the organization in a loosely coupled 54 state. No position is taken here on the overall social effectiveness of isomorphic and loosely coupled organization” (Meyer & Rowan, 1977, pp. 359-60). Scholars note that the schooled society followed the educational revolution that took place during the 20th century. According to them, education revolution has created a worldwide environment where institutions tend to become more isomorphic. In other words, the core values and social meanings behind education have become remarkably similar worldwide as a result of similar mode of formal education (Baker, 2014; Baker & LeTendre, 2005). Along with the diffusion of mass education, the normative standard of educational attainment has risen with each new generation of schooled parents (Baker, 2011). Education performance will be the singular dominant factor in social status attainment, and education will be one of the most transforming of social institutions (Baker, 2014). Furthermore, allocative and cognitive functions of schooling will reinforce one another and form a significant ideology about the centrality of formal education, which in turn will intensify each function’s legitimacy. One of the merits of education is that the effects of education on modern society open up a much wider appreciation of how education transforms everyday lives (Baker, 2014; Baker & LeTendre, 2005). Given this brief sketch of the three main sociological perspectives on the relationship between education and society, namely functional, conflict and neo-institutional theories, and considering their founders, key assumptions, and main arguments in contrast with key assumptions of didaktik and curriculum orientations as described above, a number of overlaps are observed. The continuum and overlap of three sociological and two educational theories is provided in Figure 1. 55 Figure 1 Continuum of three sociological and two educational theories In this representation, both sociological and educational theories fall within the same continuum as all of them occupy the same problem space – education. However, theories are both limiting and enabling at the same time since when one is situated within any single one of them, all others are automatically excluded. However, if we follow Dilthey’s argumentation that social sciences enhance our understanding of social phenomena then we might consider sociological and educational theories as lenses that contribute to our understanding and sense-making of education as a social endeavor and as a social institution. In turn, if we rely on key assumptions of curriculum tradition as dominated by the social efficiency ideology, and key assumptions of didaktik tradition as dominated by Klafki’s critical-constructive didaktik, then curriculum is placed between critical theory and functional theory and didaktik between functional and neoinstitutional theory in the continuum of the sociology of education theories. This does not necessarily exclude influences of critical theory on didaktik and of neo-institutional theory on curriculum completely but only emphasizes the dominant ideologies behind didaktik and curriculum traditions and how they fit within the assumptions of each of the three dominant theories of sociology of education. To some extent this representation might be misunderstood as inversed and even paradoxical given that American-developed neo-institutional theory shares more assumptions with Continental Europe-based didaktik than homegrown curriculum tradition. However, when the transformational power of education as highlighted by neo-institutional theory through the education revolution and didaktik theory through the tenets of Bildung are 56 considered, parallels between the two are not hard to be drawn. If didaktik and curriculum traditions, as revealed here based on this extended literature review, precisely capture what goes on in respective education systems, and sociological theories precisely capture the effects and roles of education on society, it could be argued that didaktik orientation produces the effects emphasized by the neo-institutional theory, while curriculum orientation produces those emphasized in functional and to a lesser extent critical theory. Conversely, if educational and sociological theories operate at two different levels of abstraction and foci, where educational theories’ units of analysis are individual players situated in and acting in confined spaces of classrooms within schools, and sociological theories’ units of analysis are institutional players situated in and acting on open spaces of nations and even cross-nationally and internationally, then any parallels between the two are far-fetched. Finally, if we maintain only a sociological outlook based on neo-institutional thinking at the role and effects of education on society then it can be argued that didaktik is only an exception to and variation of the world culture enhanced by the education revolution and the schooled society as usually from the neo-institutional perspective the tracking practice in a number of Continental Europe countries where didaktik is dominant, such as Austria, Switzerland, and Germany, is viewed as a relict of the past and also as a measure through which the society controls education and uses it for its own needs. On the other hand, neo-institutional theorists herald the rise of accountability measures in the form of standardized student testing, primarily in the US, as a tool to differentiate among those who are cognitively capable to move ahead in the social ladder, and as a result, upward social mobility relies on own individual abilities, thus making differentiations in social status attainment meritocratic-based. However, neither tracking in Continental Europe nor accountability measures in US and other curriculum 57 countries constitute the whole story, and they do not represent the whole enterprise of what goes on in education systems in respective countries and how what goes on in schools affect societies in the long run. The fact that all countries in both curriculum and didaktik systems belong to “Western” culture, defined more broadly, and that all of them operate in a capitalistic economic systems might provide clues to conclude that there are different routes to becoming an advanced developed society in terms of economic output and productivity and that salient differences in respective didaktik and curriculum educational systems do not inhibit them to perform well as developed societies – in other words, countries in respective education traditions seem to achieves similar advanced societal objectives via varied set of means and processes, educationally speaking. Cross-national Student Achievement and Teacher Characteristics Studies Research on cross-national comparisons as well as within individual countries with regard to predictors of student achievement is abundant. Many researchers have undertaken studies to investigate the factors that contribute to student achievement. Three main groups of factors are usually associated with student achievement in past research, namely, family SES, teacher and school characteristics, and national level investments into education system. Hanushek and Wößmann (2010) found in a cross-national analysis of student achievement that institutional structures and the quality of teaching forces accounted for a significant portion of international differences in the level and equity of student achievement. Other researchers have found that late tracking and long-pre-school cycle contributed to equality of opportunity with regard to educational performance (Schütz, Ursprung, & Wößmann, 2008). On the other hand, teacher related characteristics, such as teacher autonomy and responsibility are not often used as 58 predictors of student achievement as they seem to be more discussed within the context of teacher education relying more on qualitative research methods (Smith, 2001). One characteristic of international large-scale assessments such as TIMSS and PISA is that they are usually focused on assessing a very narrow scope of human learning – mathematics and sciences in TIMSS case and reading, mathematics and science in PISA’s case (Alexander, 2012). TIMSS has a longer history than PISA – TIMSS started to be developed by the International Association for the Evaluation of Educational Achievement (IEA) in 1960s and the first assessment took place in 1970/1971, Organization for Economic Cooperation and Development’s (OECD) PISA started to be developed in 1997 and first assessment took place in 2000 and rolls in a cycle of every three years ever since (Kind, 2013). Analyzing science assessment frameworks in place for large-scale assessments, Kind (2013) observed three successive trends, namely the emergence of cognitive psychology defining achievement as mental structures and processes (or science processes) as a counter-reaction to behaviorism; next, the second trend pertained to conceptual change, defining learning less as logical and abstract thinking and more about developing and understanding domain-specific knowledge, and lastly, a new view of learning based on socio-cultural perspective of science and learning inspired by Vygotsky (1978) and Bakhtin (1981). The most influential academics and science educators contributed to the development and revision processes of these assessment frameworks for the large-scale assessments being implemented – as a result often being labeled as “world class standards” (DeBoer, 2011). Focusing specifically on PISA assessment frameworks, Kind (2013) noted that the framework shifted from being focused on science processes in 2000 and 2003 and then in scientific competencies from 2006 onwards (see Table 4). Kind positioned that the shift towards 59 scientific competencies in 2006 was a result of the OECD’s DeSeCo project on definition of key competencies for the future (OECD, 2003a). Arguably, PISA has become a more relevant source for international comparisons among science educators for defining scientific literacy (DeBoer, 2011). This present study used PISA 2009 data and students’ science performance was set as outcome variable in the analytical models, where teacher related constructs are the main predicting variables. Table 4 PISA science processes and scientific competencies 2000-present PISA 2000 science processes PISA 2003 science processes PISA 2006-present scientific competencies 1. recognizing scientifically investigable questions 1. describing, explaining, and predicting scientific phenomena - Identify scientific issues 2. identifying evidence needed in a scientific investigation 2. understanding scientific investigation, and 3. drawing or evaluating conclusions 3. interpreting scientific evidence and conclusions. - Explain phenomena scientifically - Use scientific evidence 4. communicating valid conclusions, and 5. demonstrating understanding of scientific concepts. Source: OECD (1999), OECD (2003b), and OECD (2006) In cases when teacher related variables were examined, especially in econometric methods-based research, almost always they are examined in the context of teacher quality and its relationship with student achievement (Hanushek, 1971; Hanushek & Rivkin, 2006; Meghir & Rivkin, 2011; Rivkin, Hanushek, & Kain, 2005). A review of teacher quality related research revealed that the empirical research follows three distinct avenues, including, first, teacher salaries, since teachers constitute the largest budgetary line in schools; second, the extent to 60 which specific teacher characteristics account for student achievement; and third, the total effects of teachers on student achievement without specifying any measurable teacher characteristic (Hanushek & Rivkin, 2006). This present study falls into the second line of teacher related research as it examined, at the analytical level, the relationship among teachers’ characteristics – autonomy and responsibility – to student achievement. According to Hanushek and Rivkin (2006) these studies follow what is referred to as educational production function and its history may be traced back to “The Coleman Report” (Coleman et al., 1966). One challenge with these studies consists of the difficulty to separate teacher effects from other sources of influence such as parents, administrators and policymakers (Hanushek & Rivkin, 2006). As result, the interrelated choices of all these stakeholders make it difficult to establish causal relationships, in addition to the ongoing challenge of omitted variable bias, that may relate to own teacher choices, for example, pertaining to the tendency of experienced more effective teachers to move to districts where student achievement is higher (Hanushek, Kain, & Rivkin, 2004; Hanushek & Rivkin, 2006). Despite the emphasis on the importance of teacher quality among the research community, the evidence from studies that utilized econometric methods to identify causal relationships between teacher characteristics and student achievement are weak, and in cases where those relationships were established, larger teacher effect variance was found in lower than in higher SES schools (Nye, Konstantopoulos & Hedges, 2004). Overall, comparative education scholarship is divided over usability and validity of crossnational education research due to national cultural and historical differences. Indeed, scholars participating in didaktik-curriculum debate during 1990s noted that educational issues “[…] are always rooted in the particularities of national histories, of national habits, and of national aspirations” (Reid, 1998, p. 11-12). In addition, considering the complexities of the 21st century 61 advanced societies, even under heavy influence of internationalization and globalization processes, there always seem to be some incomparability within the comparable (or comparability within the incomparable) (Depaepe, 2002). However, constructive insights and deeper understandings may emerge while disentangling global and cross-national educational developments and practices and differences and similarities therein. It is argued that educational policy discourse has largely been standardized globally (Schriewer, 2000), still, cross-country educational differences in didaktik and curriculum systems may serve as relevant potential research-based understandings of themselves and one another. Teacher Autonomy and Teacher Responsibility In cases where literature addresses teacher autonomy and teacher responsibility, their definition seems to vary from author to author, especially regarding autonomy, while there are quite few sources that address responsibility. For example, after his review, Smith (2001) identified three main definitions for teacher autonomy in the related literature: 1) (Capacity for) self-directed professional action, where [Teachers may be] “autonomous in the sense of having a strong sense of personal responsibility for their teaching, exercising via continuous reflection and analysis . . . affective and cognitive control of the teaching process” (Little, 1995); 2) (Capacity for) self-directed professional development, where [The autonomous teacher is] “one who is aware of why, when, where and how pedagogical skills can be acquired in the self-conscious awareness of teaching practice itself” (Tort-Moloney, 1997); and 3) Freedom from control by others over professional action or development. Smith (2001) argues that it is the third definition that dominates the literature on teacher autonomy, and it is also adopted for the present study. In other words, teacher autonomy is defined here as teacher freedom from control by others over 62 professional action or development, in particular with regard to the type of assessments used in school settings. A number of studies have examined the rate at which teacher enjoy teacher autonomy. In the United States, for example, it was argued that teacher autonomy has been on the decline during the 1980s: This decline can be attributed in large part to three factors. First, uniform staff development programs based on research on effective teaching have become widespread. Second, classroom observations have become an integral part of imposed teacher evaluations. Third, school principals have been called on to assume the role of ‘instructional leader” (Anderson, 1987, p. 357). As the quote indicates, the decline of teacher autonomy in the US during 1980s was a result of intervening professional development and accountability practices as well as placing instructional leadership roles to school principals, thus making them more prone to interfere with instructional autonomy and responsibility of teachers. A comparative study in Sweden and Norway on teacher autonomy found that teachers enjoyed much individual autonomy at the school level, more so in Sweden than in Norway due to earlier introduction of decentralization reforms there, but teacher had less influence on national level policies (Helgøy & Homme, 2007). Regarding teacher responsibility, the study relies on the definition provided by OECD (2009) in its assessment framework, where teacher responsibility was taken to mean responsibility over decisions pertaining to school management, financial and instructional issues. In this study, items related to teacher’s responsibility over assessment policies, textbooks used, course content and courses offered are utilized. Corcoran (1995) also refers to teacher 63 responsibility as their capacity to make curriculum- and assessment-related decisions. In another case, teacher responsibility over curriculum and assessment matters is presented as part of the skill set of effective teachers (Stronge, 2007). Conceptual Framework The research on how student and school factors are associated with student achievement is not new. The impetus was created with the publication of the report on The Equality of Educational Opportunity or as often referred to Coleman report (Coleman et al., 1966) that found that outside-of school factors had a larger effect on student achievement than school factors. Ever since, more and more studies have tried to either confirm or disconfirm those findings under different circumstances and at different contexts (Baker, Goesling, & LeTendre, 2002, Heyneman & Loxley, 1982). More than 400 similar studies have been undertaken since then (Hanushek &Rivkin, 2006). Student Characteristics (SES) (Girl) Teacher and School Characteristics (Teacher autonomy) (Teacher responsibility) Figure 2 A diagram of logic rationale D I D A K T I K C U R R I C U L U M Student achievement in PISA 2009 (Science) 64 However, most of these studies have explored the association of school and out-of-school factors to student achievement under sociological and economic frameworks. This study took a different approach as it analyzed the student and teacher and school characteristics under educational frameworks, i.e. under the Didaktik and curriculum orientations. A rough sketch of this logic is given in Figure 2. The diagram of logic for rationale is based on the assumptions that first, both student and teacher and school characteristics contribute to student achievement and between themselves, and second, that those associations are mediated by the educational orientations in which they are embedded, in this case under didaktik and curriculum. There is no prior research that has used this framework to test student and school characteristics’ associations with student achievement, and in that regard, the study promised to make value-added contribution to the literature of the field. Based on this model and in relation to the hypotheses of this study, the student achievement in science in PISA 2009 was the dependent variable, while student and teacher and school characteristics were the independent variables. More specifically, the two measures listed under teacher and school characteristics, namely teacher autonomy and teacher responsibility items were the main independent variables of interest, both in testing hypothesis one and addressing question two given below, while variables listed under student characteristics, including SES and girl were control variables for addressing question two only. A more detailed table of dependent and independent variables is given in Chapter 4. As noted above, a number of prior studies have tried to explain student achievement by looking into external factors, such as socio-economic status and parent education, however, 65 recent research agendas are more focused into classroom practices and instruction/teaching as a proximal cause to student learning and achievement (Black & William, 2006; Cohen, Raudenbush & Ball, 2003; Raudenbush, 2008). Cohen, Raudenbush and Ball (2003) defined teaching as a collection of practices including pedagogy, learning, instructional design and managing organization. Raudenbush (2008) argued that conventional resources such as per pupil expenditures, teacher credentials, physical facilities or class size have small effects on student outcomes, and contended that student outcomes are affected by the resources only to the extent they are used by specific educators pursuing specific aims under a specific ‘instructional regime’. In this study, the instructional regime is used to represent two different educational orientations. In other words, in a narrower sense in the micro-level of classroom, didaktik and curriculum frameworks represent two different instructional regimes under which teaching and learning, as well as student achievement is mediated and affected by. Research Questions and Hypotheses Didaktik and curriculum similarities and differences are multifaceted and complex, but the data available in PISA allowed for testing only a few of the claims that are made about didaktik and curriculum respectively. The study addressed two overarching questions that pertained to two types of analysis, i.e. descriptive and analytical. At the descriptive level, the main question was: 1. How do schools in didaktik and curriculum countries compare across the core constructs representing teacher and school characteristics, namely teacher autonomy and teacher responsibility using PISA 2009 data? 66 a. Hypothesis: Schools in didaktik countries will show higher levels of teacher autonomy, and teacher responsibility than curriculum countries. The question was broken down to individual constructs used and respective dataset from PISA 2009. At the analytical level, and given the findings from the first question, the study examined the relationship between teacher and school characteristics and student science test scores in PISA 2009. The question was: 2. What is the association of teacher and school characteristics with student science performance in PISA 2009 in representing didaktik and curriculum countries? Since the didaktik-curriculum framework did not make any claim about the association of teacher and school characteristics with student achievement and since there was no other study that tested this relationship for didaktik and curriculum countries using didaktik and curriculum frameworks, there was no hypothesis developed for the second question. Therefore, the second research question was exploratory in nature and extended understanding about and the role of didaktik and curriculum frameworks on student performance. On the other hand, if research findings from other studies involving teacher related characteristics are considered, it was hypothesized that associations between teacher variables and student performance do exist, however relatively moderate if not weak. 67 Chapter 3. KEY COMPONENTS OF EDUCATION SYSTEMS IN SAMPLE COUNTRIES Chapter 3 offers an overview of the education for the countries included in this study. Austria, Germany, Denmark, Norway, Sweden and Finland represent didaktik countries, and Australia, New Zealand, United Kingdom, Ireland, Canada and United States represent curriculum countries. The discussion concentrates on an inconclusive list of five key components of education systems: K-12 Education Structure; Teacher Education and Certification; Teacher Professional Development; Assessment Systems; and Funding. A sketch of each country along these components is presented in table form in Appendix A for didaktik nations and Appendix B for curriculum nations. The main distinctions of the key components in respective countries are summarized below. One overarching feature of the national education systems in the study pertains to a type of paradoxical stance – all countries are both seemingly similar and different at the same time! Examined from a macro-perspective they are all similar since none of them deviates from the ‘norm’ that has already been established for many decades, in the sense of organizing education along age-groups over a certain period of time. However, the devil is in the details and when each national system is explored in-depth, there are striking variations. For example, even within the single state system of United Kingdom, which participates as one nation in PISA assessments, the four constituting countries – England, Northern Ireland, Scotland, and Wales – show elements of diversity with regard to how they structure the system, how they educate and train teachers and the assessment systems put into place. The information on education systems for European Union member states was obtained from Eurydice Network sources. For non-EU 68 countries the information was collected from respective national departments or ministries of education. K-12 Education Structure Structurally, all education systems follow more or less the same format: all countries are expanding their education downward towards forms of early-childhood education. Many start with 3 year olds; all have K-12 systems divided into three parts: (1) elementary or primary education, usually grades 1-4/5, (2) lower secondary or middle school – usually Grades 5/6-8/9, and (3) upper secondary or high school – usually 9/10-12/13. The exception here is Ireland, which divides its education into two phases only – primary schooling including ages 4-12 years, and post-primary schooling including 12-17/18 years. All have regulations for compulsory education in place, usually covering ages between 5/6/7 to 16. There seems to be a trend towards adding a pre-primary grade as part of the compulsory education. Content-wise, there are some clear differences between the didaktik and curriculum countries, most notably with regard to tracking – tracking understood as separate organization of education into academic and vocational types of education. While K-12 education systems in curriculum countries are almost always comprehensive with little or no tracking at all, those in didaktik countries vary from little tracking in Sweden and Denmark to ‘heavy’ tracking in Austria and Germany. All four Nordic countries included in the didaktik sample organize their high schools along academic and vocational tracks, and taking students interests wherever possible as a the basis for choosing one track over the other, while Austria, and especially Germany initiate tracking as early as grade 5. 69 Germany is an interesting case since past local beliefs of having one of the best educational systems in the world were shattered after PISA 2000 results were released and rocked the country with ‘PISA shock’. Subsequently, German states or Länder undertook a number of reforms to address system weaknesses such as the wide variation of student achievement across different tracks – Gymnasium, Realschule and Hauptschule - and socioeconomic backgrounds. Gymnasium is the academic track for students deemed of high-ability, Realschule mixes general and vocational education that allows for opportunities for further postsecondary and higher education programs leading to lower-civil servant jobs and technicians, while Hauptschule is a dead-end purely vocational track with its students destined to workingclass jobs or “[…] a road to nowhere” (OECD, 2011, p. 206). However, by 2009, most German states increased the number of gymnasium schools and are merging Realschule and Hauptschule thus reducing number of Hauptschule, and introduced comprehensive schools. The goal is that Realschule and Hauptschule will completely merge by 2020 (OECD, 2011). Even though the German-tripartite system still remains in place, Germany increased its performance in every PISA assessment since 2000, indicating that the reforms undertaken at the state and national level with regard to development of national education standards (Bildungstandards) and introduction of external standardized tests to track student achievement have translated into improved student performance in international assessments such as PISA (OECD, 2011). Teacher Education and Certification and Teacher Professional Development The countries were most diverse with regard to their systems of pre-service and inservice teacher education and professional development. The variation occurs both within and across didaktik and curriculum traditions. However, there are some overarching themes that are distinctive to each tradition. For, example didaktik countries recruit teachers from the highest 70 ability student groups, train them longer before they are placed as teachers, and have extended periods of induction training. Curriculum countries are generally less selective in the recruitment process, but do require teachers to obtain masters degrees to enter the profession. Similarities and differences with regard to teacher education and certification and professional development are discussed separately in the next sections. Teacher education and certification Austria aside, didaktik countries – Germany, Denmark, Norway, Sweden and Finland – require a masters’ degree for their teachers to teach in upper-secondary (high school) education. For elementary and lower secondary education, all require at least a bachelor’s degree, while there are less strict requirements for those teaching in early childhood education. In particular, Germany, Finland and Sweden require 5 or 6 years of education for becoming a subject teacher in upper secondary education. In some of the didaktik countries, teachers are on probation for 12-24 months during the initial teaching phase and have to pass additional state examinations before they are granted the license to teach. Curriculum countries also require a bachelor’s degree as a minimum to be a teacher and some require a master’s degree to teach at upper secondary education. Ireland is an exception again as it is more similar to didaktik countries regarding this component – it requires postprimary teachers to complete a one year post-graduate program while both primary and postprimary teachers are on probation for three years when they start teaching, during which period they need to fulfill the requirements for what is referred to as Post-Qualification Employment. 71 Teacher professional development Teacher professional development (TPD) is understood here as specific training for teachers to update their skills from teacher education program or programs that enhance teachers’ pedagogical skill set but not training that would lead to re-training for teaching new subjects or teaching at another level of education. Here two, there are differences between didaktik and curriculum systems. TPD is present in all countries, but while only recommended or not mandatory in didaktik countries TPD is required in curriculum countries, with exception of Ireland, where TPD is voluntary and up to teachers. Another difference between the two systems pertains to TPD providers – in didaktik countries it is usually university colleges for teacher education or state-run in-service teacher training institutes. In curriculum countries the providers include both university and non-university providers such as teacher-focused professional organizations. Individual national systems organize TPD in varied ways and there is more diversity among curriculum countries than didaktik countries. Among didaktik countries, TPD is strictly required only in Austria and Germany, while in the four Nordic countries there is no strict requirement and it is usually a responsibility of schools or local authorities. Among curriculum countries, New Zealand provides extensive TPD during the first two years of teaching and it is part of the induction program. In Ireland, when demand for TPD is requested by teachers voluntarily it is usually provided by teacher unions or subject-based teacher associations. Within United Kingdom, England is most strict with professional development as it is tied to accountability measures related to school development and evaluation process administered by OfSTED (The Office for Standards in Education, Children’s Services and Skills). In Scotland, Teaching Scotland’s Future is responsible for provision of TPD. In Australia, TPD is tied to 72 accountability and Australian Institute for Teaching and School Leadership (AITSL) develops professional standards for teachers and provides TPD opportunities for them to meet those standards. In Canada and the US, TPD is decentralized at province/state level and then further to school districts, which are responsible for TPD provision. Assessment Systems Assessment systems constitute another point of divergence between didaktik and curriculum systems, but there is also variation within systems as well. Overall, students in curriculum countries have to take far more external assessments during their K-12 education experience than in didaktik countries. In didaktik countries, only Austria has a high school matriculation exam (Matura) that tests German, mathematics and a foreign language. Matura is a two-part exam – in writing and oral – and it is organized and administered by teachers themselves in line with Bildungstandards. Germany has a high school exit exam (in Gymnasium it is called the Abitur) while external standardized tests have been developed after the 2000 PISA shock. The external tests are administered at the state/Länder level in grades 4 and 8 and they are diagnostic tests to track the progress of schools without any stakes to students and teachers. Denmark organizes different national tests at different frequency – Danish language is tested every year between grades 1 and 10, mathematics every two years, and English, geography, biology and physics once. These tests are computer-based and adaptive to students’ input to every question. In Finland and Sweden all assessments are done at the school level and there are no highstakes standardized tests in place - any national test is done for diagnostic purposes only to track the progress of education system. Norway among didaktik countries tests students the most: there 73 are national tests of students’ skills (reading in Norwegian, and English and Mathematics) for students in grade 5 in primary and grade 8 and 9 in lower secondary schools. Final examinations (Norwegian, Mathematics, English) for students occur in grade 10 in lower secondary schools. In addition, there are examinations (Norwegian, Mathematics, English and other major subjects) for students in upper secondary school. In curriculum countries, with the exception of New Zealand, where assessments are school-based and national tests are administered for diagnostic purposes, students are tested heavily throughout K-12 education. Ireland has only two key examinations, one leading to Junior Certificate (at age 15-16) and one leading to Leaving Certificate (at age 17-18). Australia administers bi-annual summative tests in Years 3, 5, 7 and 9 and two external examinations in Years 11 and 12. Canada has a number of provincial and national standardized test at grades 3, 4 5 and Grades 9 and 10 depending on the province. In United Kingdom, England administers tests at the end of key stages 1, 2 and 4 (ages 7, 11 and 16), but there is less standardized testing in Wales and there is no standardized testing in Scotland, where quality assurance and assessment decision are made at the school level. The US also tests its students heavily as part of the accountability measures – there are annual standardized tests at the state level, while ‘No Child Left Behind’ (NCLB) legislations requires tracking students through tests in grades 3 through 8 and then grade 10 with implications for both students and teachers. In addition, National Assessment of Education Progress (NAEP) is administered at least every two years in reading, mathematics and science in grades 4 and 8. 74 Funding Mechanisms Funding sources for public education systems in didaktik and curriculum countries follow the within-country governance structures and division of power among those structures. Overall, with some variations, national governments cover educational expenditures in didaktik countries, while in curriculum countries those costs are shared between states and municipalities. In terms of government expenditure on education as a percentage of Gross Domestic Product (GDP), with the exception of Canada where the percentage in 2011 was only 2.32%, the other didaktik and curriculum countries fall in the range from 5.10% in Germany (in 2010) to 8.73% in Denmark (in 2009) (UNESCO, 2014). Among didaktik countries, for Austria, Finland and Norway all the funding for education comes from the central government; for Germany funding is shared between states and federal government; for Denmark, municipalities pay for K-10 education and the national government pays for upper secondary education; and for Sweden, education costs are shared between state and municipalities. Among curriculum countries, for Ireland and New Zealand, the national governments provide funding for K-12 education. For Australia and Canada, public funding comes from states and provinces respectively; for United Kingdom, the UK government is the source of funding for England, Northern Ireland and Wales, while Scotland provides education funding on its own; and for the US, education funding is typically shared between states and school districts. 75 Chapter 4. DATA AND METHODS Data This study utilized the data from the 2009 PISA survey. The Organisation for Economic Cooperation and Development (OECD) initiated its work on developing PISA surveys around mid-1990s and administered the first PISA survey in 2000 (OECD, 2002). The survey aimed to investigate whether 15-year olds across nations, irrespective of their grade, are ready for posthigh school life in terms of further work and education opportunities. PISA tests students’ skills in three cognitive domains including mathematics, science and reading. It tests both knowledge and skills in any of these domains. PISA dataset fit best with the framework used for the study since the dialogue on Didaktik and curriculum was primarily focused on high school level. One of the research goals of PISA is to provide participating countries with information on how well their schools prepare young adults to meet challenges of the future. The survey is administered in a three-year cycle, focusing more on one of the tested domains in each cycle. The first survey was administered in 2000 and it focused on reading literacy, then the second round was administered in 2003 and it focused on mathematical literacy, the third in 2006 focusing on science literacy, the fourth in 2009 started over focusing on reading literacy. The fifth cycle was administered in 2012 and results became available in December 2013. However, since the key teacher constructs of interest of this study were not measured by collecting data on teachers alone, the decision was made to use PISA 2009 instead. The number of participating countries in PISA varies from year to year and includes both OECD and non-OECD member countries. In 2000, 28 OECD and four non-OECD countries participated. By 2009, the number of participating countries and economies grew to 65, including 34 OECD and 31 non-OECD 76 countries and economies. In 2012, the survey was administered in 67 countries – 33 OECD and 34 non-OECD countries and economies. Sampling and Sample A two-step sample selection process is followed in each country. In the first step, the participating educational systems are broken down by key characteristics such as regions (federal states, provinces, cantons, etc.) and school types. Within these subdivisions, schools are then sampled at random. In the second step of the sampling process, 15-year-old students are sampled at random within the sampled schools (OECD, 2002). In all OECD countries, 15-year-old students are nearing the end of compulsory education. Table 5 School and student sample size for didaktik and curriculum countries PISA 2009 School Overall Average sample student within- size sample school size sample size Didaktik Countries Denmark (DNK) Finland (FIN) Norway (NOR) Sweden (SWE) Austria (AUT) Germany (DEU) N=1381 285 203 197 189 281 226 n=32530 5924 5810 4660 4567 6590 4979 Curriculum Countries N=2285 n=63450 353 978 144 163 482 165 14251 23207 3937 4643 12179 5233 3858 95980 Australia (AUS) Canada (CAN) Ireland (IRL) New Zealand (NZL) United Kingdom (GBR) United States (USA) Total Source: OECD (2012). 20.8 28.6 23.7 24.2 23.4 22 40.4 23.7 27.3 28.5 25.3 31.7 77 In addition to testing students’ knowledge and skills, PISA also collects background data by administering questionnaires to students, parents, and schools. Most often the school questionnaires are completed by school principals or administrators. The total school sample in PISA 2009 cycle for both didaktik and curriculum countries is N=3858, and the total number of student sample is N=95980. In PISA 2009, both school and student samples are larger for curriculum countries than didaktik countries mainly due to large school and student samples in Canada, Australia and the United Kingdom. Table 5 gives a detailed breakdown of the school sample size, student sample size, and average within-school sample size per representing country in the study. Rationale for selection of countries for the study A representative sample of countries from didaktik and curriculum systems were examined in the study. The so-called Germanic and Nordic countries were included in the didaktik system and Anglo countries in the curriculum system. Table 5 above shows the selected countries for data analysis, that includes six countries representing didaktik and another six representing curriculum. The rationale for the selection of countries was based on a number of criteria that justified this particular grouping. First, there is the historical development of education systems. Didaktik emerged from the German-speaking world more broadly, and from 19th century Prussia (today’s Germany) and it quickly spread to most of Continental Europe, and particularly to the Nordic countries, as one of the regions in proximity to German-speaking areas (Westbury et al., 2000; Kansanen, 1999). On the other hand, curriculum developed among Anglo-Saxon countries, primarily in United 78 Kingdom and United States, and due to historical as well as linguistic ties permeates Englishspeaking countries around the world. Figure 3 Geographic distribution of countries in the sample Second, there is a cultural element. Prior studies on world cultures, and more specifically Global Leadership and Organizational Behavior Effectiveness Research Project (GLOBE) grouped world countries into ten cultural clusters based on data from the surveys that tried to understand organizational behavior in respective societies (House, Hanges, Javidan, Dorfman, & Gupta, 2004). Didaktik countries below fall into two cultural clusters. Denmark, Finland, Norway and Sweden form the Nordic Europe Cluster while Austria and Germany fall into Germanic Europe cluster that also includes other regions and countries that speak German or Dutch in Europe. Curriculum countries selected for this study belong to the cluster called Anglo Cultures. Third, there is empirical evidence from educational studies that the cultural clusters referred to above could be a potential way to differentiate these clusters in terms of withincountry school differentiation (Zhang, Khan & Tahirsylaj, forthcoming). Research findings from 79 that study used PISA data suggests that countries within Nordic, Germanic and Anglo cultures are similar to one another as to how much of the variance in student achievement can be explained by the differentiation among schools within respective countries. Table 6 Didaktik and curriculum countries’ PISA 2009 performance in reading, mathematics and science Statistically significantly above the OECD average Not statistically significantly different fro the OECD average Statistically significantly above the OECD average On the overall On the On the science reading scale mathematics scale scale Finland 536 541 554 Canada 524 527 529 New Zealand 521 519 532 Australia 515 514 527 Norway 503 498 500 United States 500 487 502 Sweden 497 494 495 Germany 497 513 520 Ireland 496 487 508 Denmark 495 503 499 United Kingdom 494 492 514 Austria 470 496 494 Source: OECD PISA 2009 database. Note: Didaktik countries are in red, Curriculum countries in blue The OECD average test score is 500 for all three tested subject areas. Lastly and maybe most importantly, there is a practical element that pertains to the didaktik-curriculum dialogue taking place during the 1990s. The main issue under discussion was how similar or different are the didaktik and curriculum educational traditions. Two groups of scholars were involved. On one hand, there were scholars and researchers representing didaktik that included both German and Nordic scholars, and on the other hand, there were 80 curriculum experts that included scholars mainly from UK and US (Gundem & Hopmann, 1998). Indeed, most of the meetings and symposia took place during either in United States, Germany or Norway. In terms of student achievement on PISA assessments (see Table 6), all twelve countries score from high to higher as almost all of them score above the OECD average or are not statistically significantly below the OECD average, with exceptions of Austria for the overall reading scale, Sweden and Ireland for the mathematics scale and Sweden and Austria for the science scale. Also, economically, all the countries in both groups are OECD members, meaning they are part of developed countries, constituting some of the most developed economies in the world. In terms of the level of achievement, PISA differentiates among six levels of proficiency for all three scales. Appendix C shows the descriptions for the six levels of proficiency on the science scale. Comparing Appendix C and Table 6 it is observed that all countries in the sample score at proficiency level 3 as all their overall sciences scores fall above 484 and below 559. Appendix D shows the distribution of student science scores in all PISA 2009 participating countries across six proficiency levels, and the countries in the sample of present study follow the overall trend in all participating countries where those that score higher, also have more students scoring in higher proficiency levels. For example, Finland and Canada are the highest scoring countries among the 12 in the study and they also have the smallest proportion of students scoring below level 2, which is set as a baseline level by PISA, where students start to demonstrate scientific competencies that enable them to fully participate in life (OECD, 2010). 81 Measures All measures and variables for the study came from PISA 2009 dataset. The following table and ensuing elaboration offer a detailed discussion on measures used here. Elaboration on study variables Dependent variable – for the analytical models includes student science test scores in PISA 2009. It is measured and scaled based on student achievement in PISA 2009 assessment taken by students in the 12 representing countries. The OECD average is 500 points in each of the three scales for three content areas, namely science, math and reading. Five plausible values are given for each content area. Table 7 Main study measures and their original scales Study measures Short description Full description (Original scale) Dependent variables Science Test Scores Five plausible values of science test scores in PISA 2009 Independent Measures Level-1 (student) measures Index of Economic, Social and Cultural Status created on the basis of the following variables: the International Socio-Economic Index of Occupational Status (ISEI); the highest level of education of the Socio-Economic Status student’s parents, converted into years of schooling; the PISA index of (SES) family wealth; the PISA index of home educational resources; and the PISA index of possessions related to “classical” culture in the family home Gender Are you female or male? (female=1, male=0) Age Age of the student Grade Grade compared to the modal grade in the country First generation immigrant (first generation immigrant=1, 0 otherwise) Second generation immigrant (second generation immigrant=1, 0 otherwise) 82 Home language other than the test language Level-2 (school) measures Teacher Autonomy (home language other than test language =1, home language same as test language=0) Generally, in your school, how often 15 year olds are assessed using: a. standardized tests b. teacher developed tests, and c. teacher judgmental ratings. (Scale: 1=Never, 2=1-2 times a year, 3=3-5 times a year, 4=Monthly, 5=More than once a month) Teacher Responsibility Regarding your school, who has considerable responsibility for the following tasks? a. Establishing student assessment policies b. Choosing which textbooks are used c. Determining course content, and d. Deciding which courses are offered (1=Teachers, 0=Otherwise (Principals, School governing board, Regional or local education authority, National education authority) your school, who has considerable responsibility for the following tasks? School type Is your school a public or private school? (public=1, private=0) Note: Items under the main questions for teacher autonomy and teacher responsibility are only a few selected from a larger number of items that are listed under those specific questions in the PISA school questionnaire. Independent variables include primarily teacher and school characteristics that are associated with the Didaktik/curriculum tradition framework, as well as school type, and a number of control variables representing student characteristics, namely socio-economic status (SES), gender, age, grade, immigration status and if home language is different from the test language. The main teacher and school characteristics measures of teacher autonomy and teacher responsibility utilized as main independent variables include specific variables that serve as proxies for the respective constructs, such as: Teacher autonomy / Assessment – measured through three variables answering the question: Generally, in your school, how often are 15 year olds assessed using: a. standardized tests, b. 83 teacher developed tests, and c. teacher judgmental ratings. The answers range from Never, to 1-2 times a year, 3-5 times a year, Monthly, and More than once a month. In the present study, these were recorded into 0=Never, 1=1-2 times a year, 2=3-5 times a year, 3=Monthly, 4=More than once a month. In the case of standardized tests, a higher value means lower autonomy, and this variable was reverse coded when constructing teacher autonomy scale so that a higher value indicated higher teacher autonomy in line with two other items in the scale. Based on the didaktik/curriculum framework schools in didaktik countries were expected to use standardized tests less in favor of teacher developed tests and teacher judgmental ratings due to larger teacher autonomy teachers in didaktik countries enjoy. In turn, it was investigated whether any of these variables are associated with student science achievement in PISA 2009. The teacher autonomy measure here is both similar and different from measures developed by other educational researchers. As noted in Chapter 2, this study adopted the definition of teacher autonomy as put forth by Smith (2001), where autonomy means teachers’ autonomy to take professional action with regard to teaching and classroom practices free of external control. To some extent, the measure constructed here of variables that show the autonomy of teachers in selection of assessment practices follows the same argument as indeed the autonomy here also relates to teacher’s autonomy in selection of a classroom practice, in this case, assessment practices. The measure is different from those developed by Pearson and Moomaw (2005) who developed a curriculum autonomy and a general teaching autonomy measure. In their work curriculum autonomy “[…] was defined by the items that measured selection of activities and materials and instructional planning and sequencing […], and general teaching autonomy was defined by the items that measured classroom standards of conduct and personal on-the-job decision making […]” (Pearson & Moomaw, 2005, p. 44). They found that 84 lack of teacher autonomy pushes teachers to leave the profession. However, the literature is not clear with regard to teacher autonomy as what in some cases or what some teachers view as autonomy, others view as isolation or alienation from the rest of school context. One way to further explore the issue is to address the question whether teachers are technicians that implement ideas, rules and practices presented and suggested to them by others or autonomous professional individuals that are able to make professional school-related decisions on their own? Teacher responsibility / Curriculum-Instruction-Assessment – is measured by a question that asks whether principals, teachers, school board, regional education authority, or national education authority decide on the following: (a) Establishing student assessment policies; (b) Choosing which textbooks are used; (c) Determining course content; and (d) Deciding which courses are offered. In line with the didaktik/curriculum framework, teachers in didaktik countries were expected to show higher levels of responsibility on these aspects more than teachers in curriculum countries. However, no direction of association is meant for the analytical models since no hypothesis is made about the effects of these constructs on student science achievement. Here, teacher responsibility measure follows the definition provided by PISA, where responsibility is about teachers’ decision-making responsibility over issues related to courses offered and their content, assessment policies and textbooks to be used. However, this definition is different from responsibility measures present in the existing literature, which primarily relate teacher responsibility to teachers being responsible for students’ learning as part of teacher accountability (Del Schalock, 1998). 85 Prior to the data analysis, data were checked for any anomalies, in particular with regard to missing data. A description of missing data is provided in Appendix E. Overall, none of the key variables had more than 4 percent of data missing for N=9580, while some of the schooland student-controls had a higher percentage of missing data. To address the first research question on variation of school-level items across curriculum and didaktik countries, the complete case analysis was employed, i.e., only the observations with complete data were utilized for analysis. To address the second research question on relationships between teacher autonomy and responsibility items with student science scores, two approaches were employed, namely mean substitution and dummy variable adjustment. The mean substitution method made possible using all information for all cases in the samples, while the missing dummy variable adjustment requires creation of a new variable indicating missing data for the given variable and including it in the statistical model when conducting analyses and the approach eliminates the variance in the dependent variable that is attributable to missing data (McKnight, McKnight, Sidani, & Figueredo, 2007). In addition, for the two key scales pertaining to teacher autonomy and responsibility, factor analysis using varimax-rotated method was conducted to test whether overall all items together show any variation between curriculum and didaktik schools. PISA dataset provides five plausible values for student achievement in each of the three content areas and final student weights and school weights to correct for the design effects were used when conducting statistical analysis. Descriptive Statistics Table 8 shows the descriptive statistics only for the dependent variable (student science test scores) and the list independent student-level variables (SES, gender, age, grade, immigration status, and test language other than home language) and on independent school 86 school-level variable (Public schools). Since the means of the main independent school-level variables (teacher autonomy and teacher responsibility items) constitute the focus of the first research question about variation, their means are presented in detail in Chapter 5. Correlations tables of the variables in the study employing pairwise correlations with listwise deletion are provided in Appendix F for curriculum countries and Appendix G for didaktik countries, only for ten of the variables used in the analytical models, including the main independent variables of teacher autonomy and responsibility and three student level variables (science scores, SES and gender). The majority of correlations at within-country level using school weights were statistically significant but substantively weak. The strongest correlations both statistically and substantively were found between economic, social and cultural status (ESCS) and student science scores. Most teacher autonomy (items 2-4 in the correlation tables) and teacher responsibility (items 5-8 in the correlation tables) were statistically significant but substantively weak with a few being substantively moderate at about r=0.3. The dummy variable for Gender was almost in all cases for all countries statistically insignificant indicating that girls’ relationship with the variables in the study was not much different from that of boys as per the correlation results with the variables in the study. The remaining six school- and student-controls followed the same correlation patterns of being substantively weak even when statistically significant. 87 Table 8 Descriptive statistics of main student-level variables Country Science Scores Mean 527.27 Austria 494.33 Australia 528.71 Canada 520.41 Germany 499.34 Denmark 554.08 Finland Great Britain 513.71 507.98 Ireland 499.88 Norway New Zealand 532.01 495.11 Sweden United States 502 Age Mean Austria 15.77 Australia 15.81 Canada 15.83 Germany 15.83 Denmark 15.73 Finland 15.72 Great Britain 15.71 Ireland 15.71 Norway 15.80 New Zealand 15.76 Sweden 15.75 United States 15.79 SD 97.57 98.43 85.76 97.34 88.25 84.53 94.99 93.41 85.37 103.73 95.51 93.88 Economic, social and cultural status (ESCS) Min 76.41 196.14 145.97 186.91 140.94 218.05 129.28 104.75 188.68 96.36 135.71 131.61 Max 839.74 756.94 787.34 794.33 752.28 799.46 792.56 785.29 738.2 824.45 770.18 757.68 N 14251 6590 23207 4979 5924 5810 12179 3937 4660 4643 4567 5233 Mean 0.34 0.06 0.5 0.18 0.3 0.37 0.2 0.05 0.47 0.09 0.33 0.17 N 14251 6590 23207 4979 5924 5810 12179 3937 4660 4643 4567 5233 Grade Mean 0.08 -0.57 -0.15 0.20 -0.13 -0.12 0.00 0.50 0.00 -0.01 -0.02 0.09 Country SD 0.29 0.29 0.29 0.29 0.28 0.28 0.29 0.28 0.28 0.29 0.28 0.30 Min 15.25 15.33 15.33 15.33 15.25 15.25 15.25 15.25 15.25 15.25 15.25 15.25 Max 16.33 16.33 16.33 16.33 16.25 16.25 16.25 16.25 16.33 16.33 16.25 16.33 SD 0.75 0.84 0.83 0.9 0.87 0.78 0.79 0.85 0.74 0.79 0.81 0.93 SD 0.54 0.64 0.42 0.68 0.39 0.37 0.14 0.77 0.08 0.34 0.23 0.56 Min -3.4 -5.66 -4.77 -4.81 -3.87 -3.56 -3.43 -3.22 -2.83 -3.23 -6.04 -3.42 Min -3 -3 -3 -2 -2 -2 -1 -2 -1 -2 -2 -2 Max 2.98 3 3.1 3.16 3.29 3.06 2.9 2.86 2.57 2.71 2.99 2.89 Max 2 1 2 3 1 2 1 2 1 2 1 2 N 13933 6420 22616 4561 5797 5776 11936 3841 4620 4553 4512 5190 Proportion of girls Percent N 0.51 14251 0.51 6590 0.5 23207 0.49 4979 0.51 5924 0.5 5810 0.51 12179 0.49 3937 0.49 4660 0.49 4643 0.49 4567 0.49 5233 N 14251 6590 23207 4802 5924 5810 12179 3937 4660 4643 4567 5233 % of 1st Im. Ge. Percent N 0.11 14251 0.05 6590 0.10 23207 0.05 4979 0.03 5924 0.01 5810 0.05 12179 0.07 3937 0.03 4660 0.16 4643 0.04 4567 0.06 5233 88 Country % of Second Immigration Generation Percent SD N Australia 0.12 0.32 14251 Austria 0.10 0.30 6590 Canada 0.13 0.34 23207 Germany 0.11 0.31 4979 Denmark 0.06 0.23 5924 Finland 0.01 0.11 5810 Great Britain 0.06 0.23 12179 Ireland 0.01 0.12 3937 Norway 0.04 0.19 4660 New Zealand 0.08 0.27 4643 Sweden 0.08 0.27 4567 United States 0.13 0.33 5233 Source: OECD PISA 2009 database. Other than test language Percent SD N 0.09 0.29 14251 0.10 0.30 6590 0.14 0.34 23207 0.09 0.29 4979 0.04 0.20 5924 0.04 0.19 5810 0.06 0.24 12179 0.06 0.23 3937 0.07 0.26 4660 0.14 0.35 4643 0.08 0.27 4567 0.13 0.33 5233 % of Public schools Percent SD N 0.54 0.50 14251 0.79 0.41 6590 0.87 0.34 23207 0.89 0.31 4979 0.73 0.44 5924 0.95 0.21 5810 0.77 0.42 12179 0.39 0.49 3937 0.95 0.21 4660 0.92 0.27 4643 0.85 0.36 4567 0.78 0.42 5233 89 Methods First, to address the first research question on How do schools in didaktik and curriculum countries compare across the core constructs representing teacher and school characteristics, namely teacher autonomy and teacher responsibility using PISA 2009 data? A t-test procedure was conducted to assess whether the means of autonomy and responsibility scales of the two groups of schools were statistically different from one another. This procedure was helpful to test the first hypothesis whether schools in didaktik countries will show higher levels of teacher autonomy and teacher responsibility than schools in curriculum countries. For teacher autonomy individual items Mann-Whitney rank-sum test was employed to examine the differences between the schools in two traditions as the items were measured on an ordinal scale, while a difference of proportion test was used for teacher responsibility items to examine the differences. Second – given the results from the first question – to examine the relationship between teacher and school characteristics and student science test scores in PISA 2009 and to address the question on ‘What is the association of teacher and school characteristics with student science achievement in PISA 2009?’ a hierarchical linear modeling (HLM) procedure was conducted. Due to the lack of any past research to address this question using didaktik/curriculum framework, there was no hypothesis generated, and in that regard the analysis was more exploratory rather than confirmatory or dis-confirmatory. HLM was a more useful method for this study considering that PISA datasets have a nested structure of data with students nested in schools and schools nested in regions and/or countries, which arguably provides more precise estimates given that structure (Raudenbush & Bryk, 2002). Furthermore, HLM is preferred over simple ordinary least square (OLS) method 90 since the latter assumes independence of observations which is rather misleading in nested data where variance within group tends to remain dependent. To develop the HLM models, first an unconditional model was run for each country using the dependent variable. Here is the specified equation for science achievement. (1) Each school’s intercept, β0j, is then set equal to a grand mean, γ00, and a random error u0j. β 0 j = γ 00 + u0 j (2) where j represents schools and i represents students with a given country. Substituting (2) into (1) produces (3) where: ß0j = mean science achievement for school j γ 00 = grand mean for science achievement Var (eij ) = θ= within school variance in science achievement Var (u0j ) = τ00 = between school variance in science achievement This model explains whether there is variation in student’s standardized science scores across j schools for the given country. From here, a linear random-intercept model with covariates was set up. This model is an example of linear mixed effects model that splits the total residual or error into two error components. It starts with a multiple-regression model, as follows: Science scoresij = ß1 + ß2j x2ij+ …+ ßp xpij+ ξij (4) 91 Here ß1 is the constant for the model, while ß2j x2ij to ßp xpij represent covariates included in the given model. ξ is the total residual that is split into two error components: ij ξij Ξ uj + eij (5) where uj is a school specific error component representing the combined effects of omitted school characteristics or unobserved heterogeneity. It is a random intercept or the level-2 residual that remains constant across students, while level-1 residual eij is a student-specific error component, which varies across students i as well as schools j. Substituting ξ into the multipleij linear regression model (4), we obtain the linear random-intercept model with covariates Science scoresij = ß1 + ß2j x2ij+ …+ ßp xpij+ uj + eij (6) Again, ß x to ß x represent the covariates included in the model, and they vary depending 2j 2ij p pij on how many covariates are included in a specific model. Three linear random-intercept models are fit for each country in the study, where the first model includes three level-2 covariates representing teacher autonomy items, namely the use of standardized tests, the use teacherdeveloped tests and the use of teacher judgments. The second model adds another four level-2 covariates representing teacher responsibility items whether teachers were responsible for assessment policies, textbooks to be used, course content and courses offered. The final model is a full model that introduces one school-level covariates of school type (public vs. private) and a number of student level-1 covariates including SES, age, grade, immigration status (native vs. first generation vs. second generation), test language (native vs. another), and a dummy variable for gender, where female=1 and male=0, into the model with teacher autonomy and 92 responsibility items, and controlling for dummy missing variables. The same models are run for each of the twelve countries in the study. The linear random-intercept model is viewed as a regression model with a school-specific intercept ß1 + uj (Rabe-Hesketh & Skrondal, 2008). The model assumes that uj are independent over schools, the eij are independent over schools and students, and that the two error terms are independent of each other. Further, the model assumes that both uj and eij are uncorrelated with covariates. Regarding distributional assumptions, it is assumed that level-2 error component uj is normally distributed with a mean of zero and variance τθθ; and each eij is independent and normally distributed with a mean of zero and variance θ for every level-1 unit i within each level-2 unit j (i.e., constant variance in level-1 residuals across units). 93 Chapter 5. RESULTS AND ANALYSES Here the results found during the examination of the two main overarching research questions are presented. The research questions are: 1. What is the variation of teacher autonomy and responsibility across schools in curriculum and didaktik countries? 2. What is the association of teacher autonomy and responsibility items with students’ science performance? First, variation with regard to teacher autonomy items and scale is presented, followed by variation in teacher responsibility items and scale, and concluding this chapter with associations of variables representing the two constructs and other school- and student-level covariates with student performance applying within-country hierarchical linear modeling (HLM) statistical models. Means of Teacher Autonomy Measures Across Sample Countries The results for means of teacher autonomy measures, including use of standardized assessments, teacher-developed tests and teacher judgments across sample countries are presented in graph form in the following sections while detailed descriptive statistics are provided in Appendix H. Figure 4 shows the mean use of standardized tests for each country, a variable coded on a scale from 0 to 4. On this scale, a higher mean indicates lower teacher autonomy, as we expect less use of standardized tests in countries where teachers have more autonomy regarding employing student assessments instruments. 94 Use of standardized tests (by country) 0=Never, 1=1-2 a year, 2=3-5 a year, 3=monthly, 4=more than once a month Curriculum | Didaktik | OECD 1.6 1.4 1.2 1 0.8 0.6 0.4 0.2 0 1.47 1.40 1.38 1.21 1.20 1.19 1.06 0.95 0.92 0.90 0.80 0.45 0.43 Figure 4 Means of use of standardized tests by country Based on didaktik-curriculum framework, teachers in didaktik countries were expected to enjoy higher autonomy, thus the mean use on standardized tests were expected to be lower. As the graph shows, there are more didaktik countries on the upper side of the scale than curriculum countries. Within didaktik countries themselves there is a clear separation between the four Nordic countries (Sweden, Denmark, Norway, and Finland) and those from Continental Europe (Austria and Germany) with Nordic countries having the highest means. The OECD average for this scale is at 1.19 and only two of the curriculum countries (New Zealand and United States) are above the OECD average and the other four below it. A Mann-Whitney two-sample ranksum test was used to compare curriculum and didaktik countries as two groups. The overall analysis resulted with didaktik countries having a higher sum of ranks compared to the expected rank sums under the null hypothesis than curriculum countries and there was a significant difference of z = 3.306 with p<0.001. Since on this scale a higher mean indicates lower teacher autonomy, results show that taken together didaktik countries have a higher mean on the use of 95 standardized tests than curriculum countries thus indicating that teacher autonomy is higher in curriculum than didaktik. The next item on teacher autonomy scale was the use of teacher-developed tests. Here, a higher mean indicates higher teacher autonomy as the locus of control for student assessment purposes lies with teachers. Figure 5 shows the results for curriculum and didaktik countries. Here, the means for didaktik countries are around the OECD average, while those of curriculum countries are spread out on both ends of the continuum. Use of teacher-developed tests (by country) 0=Never, 1=1-2 a year, 2=3-5 a year, 3=monthly, 4=more than once a month Curriculum | Didaktik | OECD 4 3.5 3 2.5 2 1.5 1 0.5 0 3.57 3.52 3.33 3.31 3.18 2.94 2.89 2.75 2.73 2.70 2.28 2.28 2.18 Figure 5 Means of use of teacher-developed tests by country On average, apart from Ireland, United Kingdom and Denmark, all other countries have a mean close to 3, meaning teachers in those countries are reported to use tests developed by them on a monthly basis during the school year. In United States and Canada the mean is close to 4 indicating that in their schools teacher-developed tests are used more than once a month. 96 To test the difference between the two groups overall, the Mann-Whitney rank-sum test was employed. The results indicated that curriculum countries in total had a higher sum of ranks than the expected sum of ranks, while for didaktik countries the sum of ranks was lower than expected and there was a significant statistical difference of z = -6.533, with p<0.001. Here, teachers in curriculum countries enjoy higher autonomy, however, if teacher-developed tests represent a form of preparation for students to take standardized tests than, indirectly, a higher mean on this item might indicate less teacher autonomy. Use of teacher judgements (by country) 0=Never, 1=1-2 a year, 2=3-5 a year, 3=monthly, 4=more than once a month Curriculum | Didaktik | OECD 4 3.5 3 2.5 2 1.5 1 0.5 0 3.56 3.49 3.42 3.30 3.09 2.83 2.62 2.59 2.49 2.41 2.40 2.38 2.21 Figure 6 Means of use of teacher judgments by country The third item in teacher autonomy scale is the use of teacher judgments for ranking students as part of student assessment practices in respective countries. Figure 6 shows the means for each country, and here a clear separation between curriculum and didaktik countries is observed with all didaktik countries but Denmark positioned above the OECD average, and all curriculum countries but New Zealand falling below the OECD average. 97 The Mann-Whitney rank-sum test confirmed the statistical difference of z = 12.158 at p<0.001 between the curriculum and didaktik countries, with didaktik countries having higher sums of ranks than expected while curriculum countries’ sums of ranks were lower than expected. In summary, examining the three previous graphs, it is observed that since all of the countries in the study are OECD member states, their average is around the OECD and the differences from OECD are not large either for curriculum or didaktik countries. Second, all three forms of assessments are employed in all countries, but there is variation to the frequency of use of each form. Overall, it is observed that teachers in didaktik countries report using teacher judgments more than teacher-developed tests, while teachers in curriculum countries rely more on teacher developed tests than their judgments. This indicates that teachers in curriculum countries tend to rely more on objectiveness of their tests, while teachers in didaktik countries on subjectiveness of their judgments. Lastly, there are two sets of outliers with two countries each. One – the low reporting of the use of standardized tests in Austria and Germany, and two, high reporting of teacher-developed tests in Canada and United States. Next, an autonomy scale was created using varimax-rotated method out of three items related to the use of three different assessment practices. The scale reliability coefficient of Cronbach’s alpha was α = 0.31, which is lower than the expected value of α = .70 for a reliable scale, but not surprising considering that only three items were used. The item on the use of standardized tests was reverse coded prior to the analysis so that a higher value indicated higher teacher autonomy in line with two other items. The graph with means of each country is shown in Figure 7, and descriptive statistics for the scale are given in Appendix I. The standardized scale shows that when considering all three items together, teachers in didaktik countries enjoy higher autonomy than their counterparts in curriculum countries, however there is no clear 98 separation between curriculum and didaktik countries. United States, Canada and New Zealand have positive means, while Austria and Denmark from didaktik countries has negative means. Figure 7 Means of teacher autonomy scale by country A two-sample t-test for curriculum and didaktik groups was employed and the mean was higher for didaktik than curriculum countries with t(3450) = 4.97, p < 0.001. However, the strength of the difference between means was weak, since calculating R2 = t2/(t2+df) and plugging in t-value and degrees of freedom from above the result was R2= 0.007. So while statistically significant, the effect size for the difference between curriculum and didaktik was close to being small. Reportedly, values of 0.01 to 0.09 are considered small effect, 0.10 to 0.25 are medium effect, and over 0.25 are a large effect (Acock, 2008). 99 Proportions of Teacher Responsibility Measures Across Sample Countries The results for proportions of teacher responsibility measures across sample countries are presented in graph form in the following section while detailed descriptive statistics are provided in Appendix J. Proportion of teachers deciding about assessment policies (by country) Curriculum | Didaktik | OECD 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% 85.5% 84.0% 81.4% 81.1% 78.9% 77.7% 75.4% 72.4% 67.6% 61.2% 58.5% 57.9% 44.3% Figure 8 Proportion of teachers deciding about assessment policies by country Figure 8 shows the proportions of schools where teachers are reported to be responsible for making decisions about student assessment policies in respective countries. As per curriculum-didaktik framework, the hypothesis was that more schools report teachers to be responsible in didaktik than in curriculum countries. The graph shows that Austria (85.5%) and Ireland (84.0%) have the highest proportion of schools where teachers are responsible for assessment policies. United States has the lowest proportion of schools that have teachers who are responsible for assessment policies as reported by school principals with 44.3%. Most countries in the sample are above the OECD average. No clear separation is observed between 100 curriculum and didaktik countries. A difference-of-proportion test showed that didaktik countries together had a mean of 0.69 (overall 69% of teachers in didaktik countries are reported to be responsible for deciding about assessment policies) while curriculum countries had a mean of 0.74. The z statistic was z = -3.048, p < 0.01 indicating that the difference between the two groups was statistically significant. On average, there were about 5% more teachers reported to be responsible for assessment policies in curriculum than in didaktik countries. Proportion of teachers deciding about textbook use (by country) Curriculum | Didaktik | OECD 120% 100% 80% 99.9% 99.1% 97.9% 96.5% 96.2% 96.0% 91.7% 89.3% 84.0% 74.6% 70.9% 67.3% 64.4% 60% 40% 20% 0% Figure 9 Proportion of teachers deciding about textbooks to be used by country Figure 9 shows the proportion of schools with teachers reported to be responsible for deciding about textbooks used in their respective countries. Overall, the proportions are quite high and above 60% for all countries, and all but United States are above OECD average of 67.3%. In United Kingdom, Sweden, Ireland, Finland, New Zealand and Australia close to a 100% of schools are reported to have teachers who decide about textbooks to be used. Overall, there is no clear separation between the two groups. This was also confirmed by the proportionof-differences test, which showed that in both didaktik and curriculum countries in their total 101 school samples about 86% of teachers are reported to decide about textbooks used. The z statistic z = 0.245, p > 0.05 indicated that the difference in proportions between the two groups was not statistically significant. Proportion of teachers deciding about course content (by country) Curriculum | Didaktik | OECD 120% 100% 80% 60% 96.5% 93.4% 90.9% 87.1% 83.9% 80.6% 79.6% 74.3% 69.3% 67.7% 62.2% 55.0% 55.0% 40% 20% 0% Figure 10 Proportion of teachers deciding about course content by country Figure 10 shows the proportion of schools reported by principals to have teachers who decide about course content in their respective countries. Again there is no clear separation of curriculum and didaktik countries, and all countries in the sample are above the OECD average of 0.55. United Kingdom, New Zealand and Sweden have more than 90% of schools where teachers are reported to decide about course content. In other countries, the range is from a low of 55% in Canada to a high of 87% in Finland. The proportion-of-differences tests showed that on average about 77% of teachers in didaktik countries decide about course content compared to about 72% of teachers doing the same in curriculum countries. The z statistic was z = 3.619 and 102 p < 0.001 indicating that the difference in proportions between the two groups was statistically significant, with didaktik countries having higher proportions. Proportion of teachers deciding about courses offered (by country) Curriculum | Didaktik | OECD 90% 80% 70% 60% 50% 40% 82.9% 73.8% 69.2% 67.7% 67.2% 56.9% 55.6% 49.7% 44.5% 44.4% 44.0% 36.6% 30% 21.5% 20% 10% 0% Figure 11 Proportion of teachers deciding about courses offered by country Lastly, Figure 11 shows the proportion of schools where school principals reported that teachers decide about courses offered in their schools. The separation of didaktik and curriculum countries is most visible with regard to the proportions between the two groups, where all didaktik countries, apart from Finland, show that less than 50% of teachers decide about courses offered. Curriculum countries on the other hand show that the proportions vary from 55.6% in United States to 82.9% in United Kingdom. The OECD average for this item was 36.6% indicating that proportions in curriculum countries were significantly higher. Only Norway (21.5%) was below OECD average on this item. The proportion-of-differences test showed the proportion of teachers in curriculum countries was higher at 0.66 while that of didaktik countries was at 0.42, indicating that on average about 24% more teachers in curriculum countries are 103 reported to decide about courses offered than their colleagues in didaktik countries. The z statistic was z = -14.30 and p < 0.001, indicating that the difference in proportions between the two groups was statistically significant. Overall, out of four teacher responsibility items, the proportion was significantly higher in curriculum countries on two of them (deciding about student assessment policies and courses offered), the proportion was statistically higher in didaktik countries with regard to teachers who decide about course content, and there was no statistically significant difference between didaktik and curriculum countries with regard to choosing textbooks to be used. The overall picture across the four responsibility items shows that high proportions of teachers have decision-making power with regard to assessment policies, textbooks to be used and course content, and less power regarding course content. Also, the pattern shows that on average teachers enjoy more teacher responsibility in curriculum countries than didaktik countries, with New Zealand and United Kingdom among curriculum countries having the highest teacher responsibility in course content and courses offered items. Also the graphs show both curriculum and didaktik countries are above the OECD average across the four items. Next, a responsibility scale was created using varimax-rotated method out of four items related to teacher’s responsibility over assessment policies, textbooks used, course content and courses offered. The scale reliability coefficient of Cronbach’s alpha was α = 0.61, which was close to the expected value of α = .70 for a reliable scale. The graph with means of each country is shown in Figure 12, and descriptive statistics for the scale are given in Appendix K. 104 Figure 12 Means of teacher responsibility scale by country The standardized scale shows that when considering all four items together, teachers in curriculum countries enjoy higher responsibility than their counterparts in didaktik countries, however there is no clear separation between curriculum and didaktik countries. United Kingdom has the highest positive mean, while United States has the lowest negative mean. A two-sample t-test for curriculum and didaktik groups was employed and the mean was higher for curriculum than didaktik countries with t(3582) = -4.68, p < 0.001. However, the strength of the difference between means was weak, since calculating R2 = t2/(t2+df) and plugging in t-value and degrees of freedom from above the result was R2= 0.006. So while statistically significant, the effect size for the difference between curriculum and didaktik regarding teacher responsibility scale - same as in the teacher autonomy scale - was 105 comparatively small. Next, the association of autonomy and responsibility items, controlling for covariates, with within-country student science performance is examined. Association of Teacher Autonomy and Responsibility with Student Science Performance in PISA 2009 The second research question of the study pertained to the ‘so what’ question, i.e. given the variation in teacher autonomy and responsibility items across schools in curriculum and didaktik countries, the objective was to examine whether those items are associated with student science performance in PISA 2009. As no hypothesis was stated as to whether autonomy and responsibility items are associated with student scores, the question was entirely exploratory in nature. Linear random-intercept models were used to examine the relationship among autonomy and responsibility items with student science performance and to decompose the variance into between-school and within-school components. Initially, a null model for each country was run to estimate the intra-class correlation (ICC), which represents the variation in science achievement across schools within every given country. The ICC results, which are consistent with those of previous studies (for example Zhang et al., forthcoming), varied from a low of 0.09 in Finland to highs of 0.57 in Austria and 0.65 in Germany. A low ICC of 0.09 in Finland to high ICCs of 0.57 in Austria and 0.65 in Germany suggests that schools in Finland performed the same in PISA 2009 assessment, while in Austria and Germany there is a large variation in students’ science performance across schools. These variations in Austria and Germany are usually attributed to the tracking orientations within K-12 educational systems. In the other 9 countries, the ICC range was from 0.12 in Norway to 0.29 in United Kingdom and United States. 106 Next, three linear random-intercept models were fit: first, one with teacher autonomy items only; the second including both teacher autonomy (TA) and teacher responsibility (TR) items; and third, a full model that included both level-2 TA and TR items and level-2 and level-1 covariates. Tables 9 and 10 show estimates of the full model for curriculum and didaktik countries. Detailed results for each curriculum country are provided in Appendix L, and for didaktik countries in Appendix M. Overall, the results from the linear random intercept models show that only a few of the teacher autonomy and teacher responsibility items from 12 countries were statistically significant predictors of student science performance on PISA 2009. Tables 11 through 22 that appear in Appendix L and Appendix M provide the results for individual countries for the null model as well as three linear random-intercept models, which show fixed estimates, random estimates and derived estimates. Tables 9 and 10 show summary results for the full model only (TA + TR + Level-2 and Level-1 covariates) that are presented in three sections – “Fixed part”, “Random part” and “Derived estimates”. The main interest of the study pertains to the “Fixed part” estimated regression coefficients as the objective is to examine the association of covariates to student science performance. Across curriculum countries, in the full model with level-1 and level-2 covariates, there are significant coefficients of TA and TR items in Ireland, New Zealand and United States. The coefficients of the use of standardized test (β₂) were mostly negative in the full model, but not significant in any of the curriculum countries. However, β₂ was negative and significant in the United States in first (-31.8) and second (-23.86) random intercept models that included only 107 Table 9 Maximum likelihood estimates for science performance in curriculum countries Australia Fixed part Canada United Kingdom Ireland New Zealand United States Est. Est. Est. Est. Est. Est. 725.86*** 767.09*** 353.52*** 745.95*** 424.62*** 885.82*** β₂ (stand. tests) 1.06 -1.36 -2.97 1.67 -2.92 -12.98 β₃(teacher tests) 3.62 -0.86 -2.01 -7.28 8.25* 29.60*** β₄(teach. judgments) -0.57 0.17 5.03 2.66 0.99 -2.12 β5(assess. policies) 1.83 0.15 10.15 -5.17 1.36 -10.99 β6(text. used) -0.73 2.78 23.11 -48.21* -106.37*** 11.27 β7(course content) 4.43 -2.59 -4.47 -11.06 95.72*** -26.23 β8(courses offered) -2.43 -3.60 -11.18 1.87 15.49*** 10.65 β9(SES) 32.85*** 22.13*** 30.83*** 30.02*** 38.52*** 22.39*** β10(gender) β11(age) β12(grade) β13(1st immigration gen.) β14(2nd immigration gen.) β15 (Other test lang.) β16 (public school) -4.15*** -13.06*** 39.62*** -6.69*** -13.14*** 48.53*** -10.12*** 11.49*** 33.99*** -2.42 -10.76 19.80*** 7.17 6.19 29.87*** -14.70*** -26.63** 43.06*** -4.59 -11.22** 5.75 -3.28 0.25 -4.33 2.85 -13.47*** 14.26* 12.17 -3.45 -2.34 -19.92*** -26.21*** mixed -12.10*** -25.94** mixed -16.25* -46.59*** mixed -27.65** -25.77** mixed -42.51*** -18.35** mixed -6.72 -38.00** mixed 1188.83 1103.28 1489.69 1464.74 724.61 1297.04 6481.58 5124.95 5559.13 6033.58 6722.36 5008.22 0.29 806.47 -623957.37 12179(482) 0.24 1257.68 -92189.75 3937(144) 0.23 1678.6 -57752.06 4643(163) 0.29 1785.35 4095343.4 5233(165) β₁ (_cons) Random part Var. (random intercept) Var. (level-1 residual) ICC(rho)/xtreg Wald chi2(21^) Log likelihood Students(Schools) 0.24 0.23 1827.01 1498.44 -83124.87 -379026.79 14251(353) 23207(978) Derived estimates R2 (total variance) R12 (level-1 var.) 0.23 0.14 0.19 0.13 0.21 0.13 0.15 0.1 0.31 0.19 0.28 0.2 R22 (level-2 var.) 0.51 0.39 0.41 0.32 0.71 0.49 Note: * p < 0.05, ** p < 0.01, *** p < 0.001. Controlling for dummy missing variables. “^” indicates degrees of freedom for Wald chi-squared tests, which varied from country to country depending on how many variables were kept in the model. 108 level-2 covariates (see Table 16 in Appendix L), but it lost significance in the full model after adding level-1 covariates. The coefficients for United States in the models with level-2 covariates indicate that the use of standardized tests is expected to decrease student science performance significantly in United States. The coefficients for teacher-developed tests (β₃) were significant and positive only in New Zealand (8.25) and United States (29.6) in the full model controlling for other covariates and missing dummies among curriculum countries. The coefficient of teacher-developed tests for United States is particularly interesting as it is higher than SES coefficient (22.39) controlling for other covariates. At least in New Zealand and the United States, the monthly use of teacherdeveloped tests is estimated to increase student science performance. The US had the highest mean in the variable of teacher-developed tests. The third item representing teacher autonomy, the use of teacher judgments (β₄) was not significant in any of the curriculum countries. Among teacher responsibility items, textbooks used (β6) was significant and negative in Ireland (-48.21), while textbooks used and course content (β7) were significant and unexpectedly high in New Zealand, where if teachers were responsible for the textbooks to be used in schools then it was associated with a coefficient of -106.37 decrease of student performance. When teachers were responsible for the course content in New Zealand, it had a positive coefficient of 95.72, and courses offered variable (β8) was also significant and positive (15.49) in New Zealand. When teachers had responsibility for assessment policies (β5) it was not significant in any of the curriculum countries. Regarding level-1 covariates – SES (β9), as expected, was significant and positive in all curriculum countries, while gender (β10) was statistically significant and negative in five countries – Australia, Canada, United Kingdom, Ireland, and 109 United States – significant and positive only in New Zealand, in the model controlling for other covariates. From among the remaining level-1 covariates, age (β11) was significant and negative in Australia, Canada, and United States, significant and positive in United Kingdom, and not significant in Ireland and New Zealand. Grade (β12) was significant and positive in all six curriculum countries, while variables representing first immigration generation students (β13) and second immigration students (β14) were significant and negative in Canada, significant and positive in United Kingdom, and not significant in other countries. The results suggest that students with immigration background are expected to perform better than native students in United Kingdom and worse than native students in Canada. When the home language of the student is different than test language (β15) was significant and negative in all countries but United States. Lastly, level-2 covariate representing public schools (β16) was significant and negative in all six curriculum countries, indicating that within each respective country, students attending public schools are expected to perform worse than those in private schools, as per the model employed and covariates used here. The “Random part” section of the table shows the unexplained variance for level-2 (random intercept) and unexplained variance for level-1 after controlling for explanatory variables and missing dummies. The ICC shows variation across schools in science performance, while Wald chi-squared and Log likelihood estimates are tests of the null hypothesis that coefficients used in the model are equal to zero. The number of students and schools per country is also given under the “Random part”. Lastly, coefficients of determination or “R-squared” are provided in the “Derived estimates” section. Total R-squared indicates how much variation the model of interest explains. First, total variance of the null model and of two out of three target models (TA, and full model) was calculated as follows: 110 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛 𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣−𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 𝑜𝑜𝑜𝑜 𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖 𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣 R2 (total) = 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛 𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣 R-squared were not calculated for the TA-TR model as it didn’t differ substantively from TA model. Then, following Raudenbush and Bryk (2002) suggestion, the same procedure to partition the between-school and within-school level variance is applied for calculating R-squared for level-1 and level-2. As Table 9 shows, the proportion of variance explained by covariates in the full model was larger for level-2, i.e. variance across schools, than for level-1 or within-schools variance. Table 10 provides maximum likelihood estimates for science performance in didaktik countries controlling for the covariates and missing dummies based on the same models as for curriculum countries. Among, the variables of interest representing teacher autonomy, the use of standardized tests (β₂) was negative in all didaktik countries but Denmark and Finland. In Finland, the coefficient for use of standardized tests was significant and positive (16.74). However, it was significant and negative in Austria (-19.38) and Germany (-30.83), indicating that the use of standardized tests in these two countries was associated with lower student science performance in PISA 2009 after controlling for covariates. The use of teacher-developed tests (β₃) was significant only in Norway (6.07), while the use of teacher judgments (β₄) was significant only in Sweden (7.07). Regarding teacher responsibility items, when teachers were responsible for assessment policies (β5) it was negative and statistically significant in Austria (-30.97), positive and significant in Finland (12.90) and Norway (11.63), and it was not significant in other countries. When teachers were responsible for textbooks to be used (β6) and course content (β7) they were 111 Table 10 Maximum likelihood estimates for science performance in didaktik countries Fixed part β₁ (_cons) Austria Est. Germany Est. Denmark Est. Finland Est. Norway Est. Sweden Est. 693.63*** 763.21*** 650.94*** 685.43*** 199.60*** 572.18*** β₂ (stand. tests) -19.38** -25.97* 4.42 16.74*** -2.89 -10.41 β₃(teacher tests) 1.17 0.62 1.64 -4.82 6.07* 2.5 8.03 -1.48 1.99 3.41 0.12 -7.07* -30.97* -5.05 7 12.90* 11.63* 8.35 β6(text. used) 0.12 11.57 14.74 -0.03 -2.32 -15.72 β7(course content) -3.55 -6.97 1.14 -0.99 -0.58 0.34 β8(courses offered) 5.09 -36.16** -6.03 -0.67 14.28 2.65 β9(SES) 11.85*** 10.26*** 28.47*** 27.03*** 30.24*** 32.84*** β10(gender) -25.71*** -10.45* 39.50*** -20.37*** -13.35*** 37.32*** -15.15*** -10.94 40.19*** 13.98*** -9.68 39.17*** 4.48 15.90** 14.15 2.7 -1.24 57.10*** -21.21*** -12.78** -37.28*** -29.22 -7.68 -30.26** -30.58*** -18.48*** -34.82*** -27.88* -19.63 -28.67*** -12.91** -16.12*** -22.99*** -35.76*** -47.46*** -15.07 -13.04 mixed -23.27 mixed -2.17 mixed -17.16 mixed 14.77 mixed -21.31 mixed 3073.84 4332.85 618.4 410.45 624.45 873.57 3614.06 2634.14 5329.18 5707.73 5653.72 6361.53 0.57 2057.46 -259127.77 6590(282) 0.65 1222.86 -1454987.5 4979(226) 0.21 1312.38 -185910.1 5924(285) 0.09 --121355.85 5810(203) 0.12 --123893.3 4660(197) 0.17 909.21 -233090.04 4576(189) 0.33 0.16 0.31 0.25 0.31 0.22 0.15 0.12 0.14 0.12 0.21 0.17 0.46 0.35 0.66 0.39 0.3 0.44 β₄(teach. judgments) β5(assess. policies) β11(age) β12(grade) β13(1st immigration gen.) β14(2nd immigration gen.) β15 (Other test lang.) β16 (public school) Random part Var. (random intercept) Var. (level-1 residual) ICC(rho)/xtreg Wald chi2(22^) Log likelihood Students(Schools) Derived estimates R2 (Total variance) R12 (level-1 var.) R22 (level-2 var.) Note: * p < 0.05, ** p < 0.01, *** p < 0.001. "--" indicates estimate not reported by model. Controlling for dummy missing variables. “^” degrees of freedom for Wald chi-squared tests varied from country to country depending on how many variables were kept in the model. 112 not significant in any of the didaktik countries after controlling for covariates, while when teachers were responsible for courses offered (β8) it was significant only in Germany with a negative coefficient of -36.16. The same as in curriculum countries SES (β9) was significant and positive in all didaktik countries, while gender (β10) was significant and negative in Austria, Germany and Denmark, significant and positive in Finland, and not significant in Norway and Sweden. From the rest of level-1 covariates, age (β11) was significant and negative in Austria and Germany, significant and positive in Norway, and not significant in Denmark, Finland and Sweden. Grade (β12) was significant and positive in all countries but Norway. The variable of the first immigration generation students (β13) was significant and negative in all didaktik countries but Finland and Norway, where it was not significant. The variable of second immigration students (β14) was significant and negative in all countries but Norway. Differently from curriculum countries, the results suggest that students with immigration background are expected to perform worse than native students in the majority of didaktik countries, and in no country they are expected to performer better than native students. When the home language of the student is different than test language (β15) was significant and negative in all countries but Sweden. Lastly and interestingly, the level-2 covariate representing public schools (β16) was not significant in any of the didaktik countries, indicating that there are no significant differences in student science performance between public and private schools within each respective didaktik country, as per the model and covariates used in the study. The “random part” provides the unexplained variance for the random intercept and level1 residual, ICC values, Wald chi-squared and log likelihood estimates, and the number of 113 students and schools. The R-squared values, under Derived estimates, for didaktik countries are lower than for curriculum countries with exception of Denmark, suggesting that the randomintercept models were a better fit for curriculum than for didaktik countries. In Chapter 6, the results will be discussed in light of education structures and educational traditions in curriculum and didaktik countries. It addresses the findings pertaining to autonomy and responsibility variation and their association to science performance against national educational systems discussed in Chapter 3. Further, claims are made about implications of the findings with regard to teacher education policy, assessment systems and use of standardized tests, and education innovation. Next, I outline potential research ideas to deepen understanding of didaktik and curriculum traditions and how they might affect student performance. I conclude that blending of assessments for diagnostic purposes from curriculum empiricism with higher teacher autonomy in didaktik tradition could potentially lead to increased student science performance. 114 Chapter 6. CONCLUSIONS, DICUSSION, AND IMPLICATIONS A nationally representative sample of 12 nations from PISA 2009 data was used to address two research questions. First, what is teacher autonomy and responsibility variation in schools in 12 countries representing curriculum and didaktik education traditions? And second, what is the association of teacher autonomy and responsibility measures with student science performance? Applying mean comparison across countries to address the first and hierarchical linear modeling (HLM) to address the second question, the study compared schools in curriculum and didaktik nations along a set of teacher autonomy and teacher responsibility measures, and explored the relationships of those measures with student science performance in PISA 2009. The didaktik-curriculum framework guided the selection of variables with the purpose to narrow down the focus to teacher-specific items so that theoretical claims about teacher autonomy and responsibility could be empirically tested. In this Chapter, conclusions for the two research questions are presented first. Next, policy implications are outlined, recognizing the cross-sectional nature of the data and limitations for drawing causal inferences. Lastly, the chapter ends with a set of recommendations for further research and final conclusions. Teacher Autonomy and Responsibility in Schools in Didaktik and Curriculum Countries The findings for the first research question were mixed, since taken together as one sample, teachers in schools in didaktik countries seem to enjoy more autonomy, while teachers in schools in curriculum countries seem to enjoy more responsibility as shown through autonomy and responsibility scales. Also, the variation was not as large as expected. The differences between didaktik and curriculum countries were statistically significant, but substantively weak. 115 A more nuanced story and not as clear-cut division between the two groups of countries was found when examining individual variables, selected as proxies for teacher autonomy and responsibility constructs. Three variables – the use of standardized tests, teacher-developed tests and teacher judgments – were proxies for autonomy, and four variables – responsibility over assessment policies, textbooks to be used, course content and courses offered – were proxies for teacher responsibility. Schools in curriculum countries had a higher mean in the use of standardized tests, but a higher mean on this measure meant lower teacher autonomy. If standardized tests are more frequently used, it means that a country’s education system aims to track the quality of learning externally, which in turn indicates lower involvement of teachers and limited professional autonomy. Examining the findings individually for each country, it was surprising to find that more didaktik than curriculum countries were above the OECD mean of 1.15 for this measure. Those didaktik countries included the four Nordic countries in the sample – Sweden, which had the highest mean of entire sample at 1.51, then Denmark, Norway and Finland. New Zealand and the United States were also above the OECD average. The overall higher mean on this measure for curriculum nations over didaktik nations is explained by the significantly lower means of Austria (0.44) and Germany (0.42). Overall, the findings indicate that none of the countries has more than one or two standardized tests a year, on average. Considering the findings from the perspective of the assessment systems in the countries, it becomes clear that mean use of standardized tests could be misleading in interpreting them at the face value. For example, Sweden and New Zealand have the highest means while Denmark and the United States have third and fourth higher means and the means of all four are close to one another. However, in Sweden and New Zealand 116 standardized tests are used for diagnostic purposes only while in Denmark and the United States they become high-stakes tests with implications for evaluation of teachers and schools. Therefore, despite having a higher mean use of standardized tests, teachers in Sweden and New Zealand might in fact enjoy more autonomy, as they are not affected by standardized tests in the same way that teachers in Denmark and the United States might be. The same applies to Norway and Finland, which have about the same mean on the measure, but standardized tests are used for accountability purposes in Norway and only for diagnostic purposes in Finland. From the didaktik-curriculum framework lens, then, there is no clear division of didaktik and curriculum countries both in terms of mean use of standardized tests and the purpose of such tests. Theoretically, we expected that teachers in schools of didaktik countries would be far less affected by standardized tests, and taken together that is the case. However, when individual countries are considered, we find that Denmark and Norway are more similar to curriculum countries, both in terms of mean use of standardized tests and the purposes of those tests than with didaktik countries on the measure. Clearly, these two are outliers among didaktik countries. On the other hand, Ireland and New Zealand are outliers among curriculum nations as they only rely on standardized tests for diagnostic purposes, similar to the majority of didaktik countries in the study. The use of teacher-developed tests manifested the same complexities as the use of standardized tests. At group level, there was statistically significant higher mean in schools of curriculum than didaktik nations, but the picture is more complex at the individual country level. A higher mean is taken to represent higher teacher autonomy. Interestingly, the countries that had a higher mean use of standardized tests also tended to have a higher mean use of teacherdeveloped tests, suggesting a potential interaction between the two. Overall, in all countries use 117 of teacher-developed tests ranged from a low of 3-5 times a year in Denmark to a high of more than once a month in the United States and Canada. OECD average was 2.94/year, suggesting that teacher-developed tests are a frequent assessment practice in all OECD countries as a mean close to 3 indicates these tests are used by teachers on monthly basis, as per the scale used to measure this variable in PISA 2009. Comparatively higher means for United States and Canada, which indicate a more than once a month use of teacher-developed tests, could be interpreted as a result of the pressure teachers or schools experience to prepare students for standardized tests. If that is the case, the higher mean for United States and Canada, as well as for Norway, could imply lower instead of higher teacher autonomy. In these three countries, teachers’ professional actions might be affected by an external assessment practice that pressures them to use more teacher-developed tests than they would otherwise, if high-stakes standardized testing was not in place. The findings regarding the use of teacher judgments showed a clear division between didaktik and curriculum countries, both as groups and as individual countries, where didaktik countries had statistically significant higher mean. Arguably, the use of this assessment practice is the most representative expression of teacher autonomy and professional action as individual teachers rely on their understanding of learners and learning to judge learners’ progress. The findings here could be an outcome of didaktik and curriculum frameworks that guide teacher education in didaktik and curriculum countries in the sense that in didaktik countries teachers are educated to rely on and use their own professional autonomy in classroom practices. Conversely, in curriculum countries teachers are educated to implement externally-set educational standards that tend to be objectively measured through specific tests, while teacher judgments, as results on 118 this measure show, are used less as an assessment practice compared to its use in didaktik countries. The findings from analysis of teacher responsibility measures revealed that relatively high proportions of teachers are responsible for deciding about assessment policies, textbooks to be used, course content and courses offered. Still the variation between didaktik and curriculum nations shows that the proportions are lower for didaktik than curriculum countries; if we rely on the teacher responsibility scale with all four measures. However, examining the measures individually reveals that out of four teacher responsibility items, curriculum nations had larger proportions of teachers deciding about student assessment policies and courses offered; didaktik nations had larger proportions of teachers deciding course content; and there was no statistically significant difference between didaktik and curriculum nations with regard to proportion of teachers who decide about textbooks to be use. Scrutinized from a didaktik-curriculum framework perspective as well as from the educational structures in sample countries, the findings pertaining to teacher responsibility measures indicate that more teachers in curriculum than in didaktik countries have a say on decisions that affect daily teaching practice. This was a surprising finding and not in the direction of the initial hypothesis. Under didaktik systems, teachers are trained to be professionally responsible and autonomous on decisions that affect their classroom interactions with students depending on the meaning and matter of the courses they teach. While the results show that didaktik teachers are responsible about the practices examined in the study, their colleagues in curriculum countries are more so. On the other hand, considering the numerous policy-makers and stakeholders in every educational system in the world (e.g. school principals, school boards, local, regional, and national educational authorities) that are in charge of making decisions about 119 provision of educational programs, it is surprising how high some of the proportions of teachers reported to decide about responsibility measures are, as in some countries those proportions were close to a 100%. With regard to assessment policies, for example, 85.5% of teachers in Austria, 84% in Ireland and 81% in Finland and Australia are reported to decide about assessment policies in their schools. The lowest proportion was found in United States, and considering that the use of standardized tests for accountability measures is prevalent in United States, it could be worthwhile for future research to explore why such proportion is lower and in case of US teachers who design assessment policies to examine how those are adapted to fit into their standards- and testing-driven educational system. Regarding textbook use, course content, and courses offered, the most striking finding was the high percentage of teachers that have a say in New Zealand and United Kingdom as reported by school principals. Close to 100% of teachers in these two countries are reported to decide about textbooks to be used and course content, and about 80% are reported to have a say about courses offered respectively. On average, the proportions of teachers across responsibility items in in schools of didaktik and curriculum nations were higher than OECD average. The findings reveal the variability along the responsibility measures across countries that transcend didaktik-curriculum borders, except for courses offered, where curriculum countries are in clear advantage compared to teachers in didaktik countries. The graphs on TR measures show this variability with didaktik and curriculum countries mixing up along the continuum. Indeed, while on average, teachers in schools of curriculum countries seem to enjoy more responsibility, there are quite a few outliers on both ends of the continuum. Future research should examine how and why some of the proportions are so high in some nations, and lower in others. One way to undertake this research would be to make an in-depth analysis of single 120 country case studies and pair-country comparisons that explore national education systems systematically, both in terms of policies in place to recruit, educate and train teachers and practices as to how policies are implemented at the school level and how teachers are engaged or disengaged in school-level decision-making process. One critical question to ask here is why there were no substantive differences between teacher autonomy and responsibility practices across schools in didaktik and curriculum countries in PISA 2009, when the theory strongly suggested that would be the case. One plausible explanation for the lack of differences could be the global trends in education promoted by highly influential global education players such as OECD with PISA study or International Association for the Evaluation of Education (IEA) with TIMSS. As noted earlier, the sample countries in the study have been consistently participating in international assessments during the past 30 years, and, as shown with Germany’s example, they have introduced educational policies to align their educational systems with international assessments such as PISA. Same as when what gets tested gets taught at the classroom level, what gets tested in international assessments drives what countries do in order to keep up with the ranking race associated with international assessments. As a result, while the rhetoric and theory behind curriculum and didaktik frameworks may indeed be valid, the school practices follow a pattern of their own that is more international than national - where the borders between local and global become thin and school practices around the world become similar. Ultimately, there is more variation in school practices within countries than between countries. Therefore, while the study relied on clearly theoretical divide between curriculum and didaktik, it revealed practice convergence in teacher autonomy and responsibility. To some extent, the study adds more evidence to the isomorphic nature of educational practices around the world and further expansion of world culture of 121 schooling, where countries continue to adopt similar practices in hopes to get similar better results. Association of Teacher Autonomy and Responsibility Measures with Student Science Performance Findings from the analytical models provide evidence for the predictive power of teacher autonomy and responsibility items in explaining within-country students’ science performance. Relationships between autonomy and responsibility items were only found in a small number of countries using within-country models. The findings suggest that the use of standardized tests negatively affects students’ science performance in Austria and Germany. The use of standardized tests in United States was also negatively and significantly associated with science performance in the model with TA measures (-31.18) as well as in the model with TA and TR measures (-23.86). Furthermore, evidence was found to suggest that teacher-developed tests are positively associated with science performance in New Zealand, United States and Norway. With respect to teacher responsibility measures, the association of teachers being responsible for assessment policies was found to be significant and negative in Austria, and significant and positive in Finland and Norway. Textbooks used and course content coefficients were abnormally high and significant in New Zealand, and significant and negative in Ireland. With respect to responsibility over courses offered, the findings show that there was a significant and negative association with science performance in Germany, and significant and positive in New Zealand, while there was no significant association in any of the other countries, accounting for school- and student-level covariates. 122 The analytical models were exploratory in nature, without any directional hypothesis with regard to how teacher autonomy (TA) and responsibility (TR) measures would be associated with science performance in didaktik versus curriculum countries. The findings do not reveal any particular trend to suggest that TA and TR items are associated differently with science performance in didaktik versus curriculum traditions, but they confirm that there is crossnational variation overall. Indeed, in the majority of countries representing both groups of countries, the analytical models did not identify any association among TA and TR measures with science performance. One plausible explanation for this finding, at least with regard to TR measures, could be related to the fact TR items are only a small part of total proportion of responsibility shared among numerous stakeholders affecting school policies and practices as part of school autonomy. Teachers represent one group of stakeholders who are involved in making decisions about such important school issues as assessment policies, textbooks to be uses, course content and courses offered. The same justification might apply to teacher autonomy. OECD itself emphasizes school autonomy as one of the features of educational systems that produce better performance results, but only when there is collaboration between school principals and teachers within the school (OECD, 2013). Therefore, teacher autonomy is also part of the broader definition of school autonomy that includes both other players (e.g. school principals) and other practices (e.g. autonomy in formulating school budget, allocating resources, etc.). To that end, the development of new and better measures and methods that capture TA and TR and that are distinguished from school autonomy measures would be valuable in examining the relationship of these constructs with students’ performance. The findings of second research question confirmed prior conclusions that when it comes to variation in national student performance, the largest variation exists between students, then 123 less variation between schools, and least variation between countries (Baker & LeTendre, 2005). In this study, this is shown by ICC values for within-country models. Indeed, for the sample of countries included in this study, the cross-national variation in science performance as indicated by an ICC value of 0.3 was dramatically low, meaning that between-country variation in science performance was only 3%. Therefore, as noted already, the findings here provide further evidence for the world culture of schooling, and the lack of single strong factors that explain cross-national variation. However, cross-national differences in how education systems are organized and run, or how teachers are educated, for example, with respect to how much autonomy and responsibility they enjoy, might have other effects on students non-cognitive skills that are not captured by measurement of cognitive skills in student performance tests such as PISA. In didaktik nations, the non-cognitive as well as cognitive skills are part of the contents of Bildung that aspire to educate students not only for the workplace or higher education but also to contribute to transformation of the workplace and advancement of themselves and whatever they are engaged in during their life trajectory. Furthermore, the contents of Bildung enable students not only to develop skills for certain professions, but also to gain significance and meaning about their place in the world as they engage in and with professions and others (Pinar, 2011; Hopmann, 2007). Implications In this section, I first discuss the frameworks used for the study and how they could potentially be consolidated into well-established educational theories. Then, I turn to implications for educational policy, practice, and research, and end with conclusions. 124 Educational theory Didaktik-curriculum frameworks served as guiding theories for the study. The extended review of the two frameworks in Chapter 2 shows that Bildung-centered didaktik theory is wellestablished in Continental Europe and that curriculum field in English speaking world is balkanized into several ideologies and furthermore dominated by theories of other disciplines, namely philosophy, history, psychology and sociology, which consider education as a field to be studied but not a field on its own (Biesta, 2011). However, as the review for this study revealed, curriculum as theory has been in place since early-20th century, despite complexities that have followed it and its fragmentation along several ideologies, most dominating of which was and is social efficiency (Deng & Luke, 2008). Many influential educators and curriculum scholars - most notably John Dewey and Joseph Schwab during the 20th century – have theorized about education and curriculum as a field and what its object of study might be. Dewey maintained that curriculum should focus on learners, society, and specialized knowledge. Schwab put forth a curriculum framework of common places that included subject matter, learners, milieu and teachers. Decades after Dewey’s and Schwab’s contributions, the curriculum field is still evolving into an integrated discipline that overcomes fragmentations along ideology lines. Both Dewey and Schwab emphasized the subject matter, an element that is missing from the current curriculum discourse. Subject matter, on the other hand, is central to the didaktik-influenced disciplines in Continental Europe. Current curriculum scholarship and research needs to revive ideals like those promoted by Dewey and Schwab, that advocate for tightly integrated curriculum approaches that overcome ideological divides. This research study demonstrated that educational frameworks might help us analyze and evaluate educational practices such as teacher autonomy and teacher responsibility. 125 Additionally, the research suggests that further development of curriculum theory and didaktik theory in their own right or as integrated would enhance opportunities to better understand educational practice and processes through educational frameworks. In Chapter 2 educational theories were cross-examined with theories from sociology of education. The initial review indicates that sociological theories of education capture very well the global education trends and the role of education for society. Also sociological theories seem to operate at a macro-level (i.e. societal) and educational theories seem to operate more at the micro-level (i.e. schools and classrooms). This dichotomy is an area that requires further analysis and revision to explore whether sociological and educational theories intersect at any point or operate independent of one another. Of particular importance is the need for research on curriculum/educational theory in English speaking nations, where sociological theories of education seem to be taken to mean “the” educational theories in absence of an established “educational” educational theory. For example, the sociological theories of education seem to constitute the main theoretical base for the educational theory and policy graduate programs across US universities. However, based on the findings of this study as per the dataset, variables and analyses employed, the continuum presented in Chapter 2 seems to be reversed in practice. Figure 13 shows the revised continuum that places curriculum closer to neo-institutional theory and didaktik closer to functional theory and conflict theory. This practice-based positioning of curriculum and didaktik theories across the continuum of sociological theories of education is the reverse of the theoretically-based one in Figure 1 of Chapter 2. 126 Figure 13 Revised continuum of three sociological and two educational theories Educational policy and practice OECD through PISA survey results provides the best available educational data for comparative research. Yet, it seems that educational researchers have had a love-hate relationship with PISA. On the one hand, there are scholars who criticize OECD/PISA for tending to promote neo-liberal ideas and operating as a “New Public Government”, and the other hand are those who laud PISA for making available such comprehensive educational datasets (Hopmann, Brinek, & Retzl, 2007, p. 9). Nevertheless, PISA has become an influential instrument for benchmarking student performance worldwide. A recent study found that PISA results have pushed many participating countries and economies to initiate policy reforms (Breakspear, 2012). One example is Germany, which undertook major policy reforms after low PISA 2000 results disturbed Germans - ‘PISA Shock’ - into action (OECD, 2011). One change the Germans introduced was integrating PISA assessment items into national assessments. By 2009 Germany’s scores in PISA improved significantly (OECD, 2011). Another area for further research is testing the hypothesis as to whether the rate of improvement in student performance was higher in countries that participate in international assessment programs such as PISA, TIMSS or PIRLS versus in the countries that have yet to participate in any of these assessments. While this comparison could reveal the potential “PISA effect” on national student performance across different years, it would be difficult to isolate 127 those effects and compare student performance cross-nationally if students did not take the same test, or at least comparably demanding and challenging assessment items. One major implication of this study with respect to understanding national student performance pertains to the way external assessments are used within nations. The review of assessment systems in participating countries revealed that didaktik countries use less standardized tests. This finding was also confirmed empirically through the first research question. The review also showed that even when standardized tests are used, they are usually part of gateway to the next educational level. For example, Austria, Germany, and Finland use national assessments at the end of middle school and at the end of high school, which do not interfere with teachers professional autonomy as is the case with countries where standardized tests are used every year, such as United States. Finland has been heralded as the exemplar for a successful educational system during past decade. However, this study reveals that Finland’s story is not only Finland’s story as other didaktik countries share key features of Finland’s system. Most notable are the minimum use of standardized tests, and teacher recruitment and education. Policies pertaining to teacher recruitment is another area for additional research. The review of teacher education and certification systems revealed that didaktik countries are more selective in recruiting future teachers and training them to be autonomous professionals with strong preparation in content matter. In Germany, for example, only students who have completed high school exit exams, Abitur, are allowed to apply for a teaching degree. In Finland future teachers are recruited from the top high school performers. With respect to teacher education programs, didaktik nations, in line with didaktik theory, focus on discipline specific preparation, while less emphasis is placed on the methods, which are considered of secondary 128 priority (Weniger, 2000). The empirical findings from this study suggest that teacher education policies in didaktik countries translate into more teacher autonomy, but not as much responsibility, on average, compared to curriculum countries. In that sense, curriculum countries could further examine teacher education policies in didaktik countries to identify long-term strategies that contribute to higher teacher autonomy, while didaktik countries could review how teacher education policies in curriculum countries lead to higher teacher responsibility. Higher teacher autonomy and responsibility might serve countries well in one key aspect of their education systems – innovation in education. In a recent study, OECD (2014) measured innovation in education covering a period from 2000 to 2011 in 28 countries and territories, relying on data from international educational assessments such as TIMSS, PISA and PIRLS. The findings suggested that education sector is fairly innovative when compared to other sectors, as the rate of changes introduced in education is as high as in other sectors. In the case of higher education, education is even more innovative than economy, which usually is perceived as most innovative sector. Within primary and secondary education, the study found innovative approaches in pedagogic practices through which teachers related lessons to higher order thinking skills, personalized learning, and integration of data and text interpretation (OECD, 2014). In the overall innovation index, Denmark scored the highest (see Appendix M for ranking of countries on overall innovation index and classroom and school practice innovation index). One key finding of the study was that the rate of innovation at classroom level was higher than at school level. Unquestionably, teachers have the potential to contribute most directly to classroom innovation. With more emphasis on the need for innovation in every sector, and considering that schooling is often called upon to solve societal problems and meet societal needs, we might expect that the rate of innovation in education will be accelerated. A better trained and 129 professionally autonomous teaching force could contribute to increased innovation in education systems. The revision of PISA Assessment Framework for 2015 and onwards will potentially have implications for science education across many countries around the world. Since 2006, PISA scientific literacy was defined – as also shown in Table 4 of Chapter 2 – by the following three competencies: (1) Identify scientific issues; (2) Explain phenomena scientifically; and (3) Use scientific evidence. However the revised framework lists the following three competencies to define scientific literacy: (1) Explain phenomena scientifically; (2) Evaluate and design scientific enquiry; and (3) Interpret data and evidence scientifically (OECD, 2012). In PISA 2015 Framework, competence no. 2 from 2006 was retained as competence no. 1, while competences no. 2 and 3 use different action verbs that belong to the higher level if considered from the Bloom’s Taxonomy perspective. Mastering these competences will require a different knowledge base. Osborne (In press) notes that students will need to develop specific, preferably domain-specific scientific reasoning that encapsulates content, procedural, and epistemic knowledge. Osborne argues that second and third competence in particular require not only knowledge of science, but also an understanding of how scientific knowledge is established and the degree of confidence with which it is held. Procedural knowledge is about standards procedures and methods used to establish scientific knowledge, while epistemic knowledge pertains to the rationale for the common practices of scientific inquiry (Osborne, In press). In other words, if content knowledge is about the ‘what’ question, procedural knowledge is about ‘how’ and epistemic knowledge about ‘why’ questions. If and when individual countries adopt the same competences to be mastered by their students in their K-16 education systems, it will have implications for 130 curriculum, instruction and assessment systems in place as well as for teacher education and teacher professional development programs, which in turn will transform the teaching and learning that takes place in school settings. Educational research The answers generated by the main research questions of the study shed light as to whether theoretical claims about teacher autonomy and responsibility in didaktik and curriculum traditions survive the empirical tests, and to what extent autonomy and responsibility measures contribute to students’ science performance in PISA 2009. However, this research study raises new questions concerning future research avenues, and how OECD through PISA might collect specific teacher data. In turn, the new teacher data could provide opportunities for researchers to get a richer understanding of teacher practices and characteristics among participating in OECD/PISA endeavors. Learning progression research in the US has gained momentum in efforts to achieve curriculum, instruction and assessment alignment and coherence in science education. Learning progressions (LPs) have been defined as research-based descriptions of how students construct their knowledge, and gain more expertise within and across a discipline over a broad span of time (Schweingruber, Duschl, & Shouse, 2007). The issues highlighted by the present study with respect to importance placed on order, choice and sequence of content for use in school settings as defined under didaktik theory (Hopmann, 2007) could complement efforts for developing effective LPs in science education. Specifically, LPs should consider the importance placed on the content matter and meaning attached to it for students’ present and future in didaktik theory so that students understand why specific content is relevant for them. On the other hand, didaktik nations seem to have already adopted some of the features of curriculum systems, such as 131 introduction of standardized assessments, which are adapted for domestic use and dominantly for diagnostic purposes to track effectiveness of their educational programs. A recurring theme in this study was variation. It set out to track variation across traditions, than countries within those traditions, than schools within those countries, and finally teaching practices within those schools. At the national level, it was intriguing to find examples of countries that behave contrary to the expected norm. For example, Ireland and New Zealand among curriculum nations were outliers in terms of non-use of standardized tests. Similarly, Denmark and Norway among didaktik countries were outliers for the opposite reason – they use standardized tests when expectation was they would not. From this perspective, one future research directions that could be explored is getting deeper into analysis of these pairs of countries in terms of either the types of standardized tests they use, or why they don’t use them. Further, it would be insightful to compare Denmark or Norway to any of the curriculum countries that use standardized tests heavily, such as the US or Australia. In turn, exploring how specific standardized tests affect student achievement in a small number of nations would provide valuable information on how countries could improve their assessment systems. Further, examining how standardized tests affect teachers and their sense of professional autonomy through a qualitative study is another intriguing research project. Another potential and promising research area is to examine students’ opportunities to learn (OTL) across curriculum and didaktik countries. OTLs indicate actual opportunities students have to learn what they will be assessed on (Schmidt, Zoido, & Cogan, 2014). PISA 2012 collected data on some OTL items for the first time and further exploration of those items and their relationship with teacher autonomy and responsibility as well as with student performance could be valuable. These OTL items collect data, for example, on what 132 opportunities students have in their school experiences to learn through word problems if later on word problems will be among assessed items. This line of research could provide further understanding about the intersection of educational traditions and teacher education programs in a country, teacher practices, OTL, and student performance. Yet another research domain that arguably will become relevant in the near future has to do with ways in which educational systems design educational interventions and policies with goals to increase creativity and innovation potential of students and teachers. As noted above, OECD has already collected data and developed innovation indices that indicate that contrary to public perception, education as a public system is quite innovative. Also, the OECD (2014) finding that educational systems have been more innovative at classroom change than school change reinvigorates the need for more strategic teacher autonomy and paints a more optimistic picture of the future of teaching and learning. Reportedly, teachers are capable of innovating their classroom practices – more so in some countries than others – and those capabilities need to be further utilized to enhance students’ creativity and innovation. OECD has already shown interest in measuring innovation in education across educational systems (OECD, 2014a) and in innovating assessments in education (Looney, 2009), and possibly it might extend that interest to initiate programs and surveys that measure students’ creativity and innovation skills, either through PISA or through a new separate survey. Creativity and innovation are deemed important outcomes of education and some work has been done to develop frameworks to assess creativity in schools (see for example Lucas, Claxton, & Spencer, 2013), however the expertise the OECD has developed through PISA and resources and network infrastructure it possesses to engage in major assessment activities could 133 be further utilized to start developing and administering a survey for assessing students’ creativity and innovation skills. One way that educational systems can start integrating creativity and innovation education in their curricula is to provide opportunities for both teachers and students to engage in frequent small scale experiments and projects so that students learn about risk-taking, failure as a knowledge-generation tool, and their creative and innovative potential (Tahirsylaj, 2012; Matson, 1996; 1992). Considering how influential PISA survey has been in initiating educational reforms cross-nationally, simply having a survey that measures creativity and innovation skills might directly or indirectly push nations to integrate these skills into their educational programs. Provision of such opportunities for students could potentially enhance their and their nations’ creativity and innovation output as well as educational achievement overall. Another issue OECD might consider regarding PISA survey pertains to how school-level and teacher data are collected. Currently, PISA school questionnaires are completed by school principals or a designate. OECD already collects teacher data through Teaching and Learning Survey (TALIS), administered in 2008 and 2013 (OECD, 2014b). However, far more countries and economies participate in PISA survey than in TALIS survey – only 34 countries and economies participated in TALIS 2013, while 65 participated in PISA 2012 and 71 are signed up to participate in PISA 2015 – and thus, far more teacher data would be collected if OECD added a teacher-specific questionnaire to PISA survey. Sample questions for such a survey could be focused on how school or other local, regional or national policies affect teachers day-to-day teaching practice; how they cope with externally-mandated requirements such as standardized tests; how they overcome challenges they face in implementing curriculum requirements; what challenges they face in enhancing students’ learning; what is their sense of autonomy and 134 responsibility; what opportunities do they have for professional development; and what approaches they follow to enhance their sense of success in their profession. Other research ideas researchers might tackle as per the findings of this study pertain to teacher responsibility. The findings of this study show that, overall, there is a high proportion of teachers who are responsible for crucial teaching and learning matters. On the other hand, only a few of responsibility measures were found to be significantly associated to student’s science performance. One way to explore this issue could be to view responsibility from a broader perspective, by examining the proportion of responsibility of other educational stakeholders, such as school principles and other local or national authorities, who are responsible for making decisions relevant to important issues such as assessment policies, textbooks to be used, course content and courses offered. Further, exploring the interactions of these stakeholders and how responsibility for decision-making is shared could provide further valuable insights as to how teachers are positioned in prevailing complex hierarchical decision-making structures within national educational systems. Finally, applying some of the more recent quasi-experimental quantitative research methods such as propensity score matching (PSM) or regression discontinuity (RD) and using other datasets in addition to PISA, such as TIMMS or TALIS, wherever applicable, would provide more precise estimates of teacher-specific effects on student performance. PSM in particular could be more applicable when using current international education datasets such as PISA and TIMSS since PSM in combination with multi-level modeling might isolate teacher autonomy (TA) causal effects if schools with high TA versus schools with low TA within a country are balanced based on a number of school-level characteristics that are considered as 135 pretreatment covariates (Hong & Raudenbush, 2005). Such models would yield average causal effect of schools with high TA on students’ performance. Delimitations and Limitations The findings of this study may be generalized to specific countries that were examined more thoroughly here as well as to schools within them. In particular, the study provides a set of policy recommendations that may apply at different levels of decision-making, such as national governments, regional and district level educational authorities, school governing boards and teacher education and teacher training institutions. However, there were a number of limitations to the study. First, the data to be used is cross-sectional and no causality is meant with the findings. Also, this is a secondary-data analysis, so it was not possible to control what data is collected. The items used were derived from variables available on the PISA 2009 dataset, which served as proxies to the constructs that the study examined. Variables that were selected attempted to narrow down teacher-specific items only. The analytical models employed were specifically focused on relationships of teacher autonomy and responsibility measures with student science performance, controlling for two student-specific variables, and the list of variables used for predicting science achievement is selective and in no way exhaustive. Conclusions The twelve participating countries in this study represent some of the most developed nations in the world. Australia, Canada, United Kingdom, Ireland, New Zealand and United States constituted the curriculum sample, and Austria, Germany, Denmark, Finland, Norway and Sweden constituted didaktik sample. Clearly, these nations are doing a lot of things right in their education systems, otherwise they would not have been esteemed as some of the most powerful 136 economies in the world. They do quite well in PISA rankings too – the majority of them consistently score at or above the OECD average in PISA science, mathematics and reading surveys. No need to declare a state of emergency! At the same time, these countries are at the forefront of technological developments and otherwise, pushing boundaries of the possible further. They are clearly committed to education. As shown through the funding mechanisms’ review, they allocate considerable portions of their high GDPs to education sector. Their very participation in PISA indicates that they want to find out both how their students are performing within country, and how they compare to other nations. They already share a sense of urgency to improve, to help their students perform higher on national as well as international assessments. To this end, this comparative study serves as an opportunity for these countries to find out to what extent their teachers enjoy autonomy and responsibility, and how they compare to other advanced nations. Furthermore, it showed to what extent these two constructs matter when it comes to student performance. In line with aspirations to improve, therefore, the study presents an opportunity for these countries to understand their national positioning and learn from other systems. Cross-national educational research and subsequent exchanges in education policy have been around since 1950s, when some of these twelve countries were major players in such initial undertakings. Indeed, the first international comparative education organization emerged after a visit of Torsten Husen from the University of Stockholm at the University of Chicago in the mid1950s, where among others he met with the famous Benjamin Bloom “ […] whose view was that the whole world should be seen as a single educational laboratory. From this meeting emerged 137 the International Association for the Evaluation of Educational Achievement (IEA), which, for diplomatic reasons was managed from Sweden” (Baker, Lee, & Heyneman, 2003, p. 332). We are witnesses of how far the world has come to become the educational laboratory that Bloom envisioned! However, as the review of American battle over curriculum showed in Chapter 2, the exchange of educational ideas, at least between United States and Germany, had occurred intensively more than a century ago. Reminiscent of that period, one of the current most vocal curriculum theorists reconfirmed it and wrote, “One hundred years ago, Americans traveled to Germany […] to study concepts of education. It seems to me it is time again to selectively incorporate German concepts in North American practices of education” (Pinar, 2011, p. xiv). Pinar primarily refers to elements of Bildung as German concepts that should be incorporated into American education. This is not to suggest that Germany should be the go-to destination to import education policies or concepts from. I bring this point up to evoke Joseph Schwab’s idea of “the eclectic curriculum” that picks and chooses what works best. In the same vein, the countries in this study, as well as any other country that participates in international assessments in particular, have the advantage of benchmarking themselves against the best performing nations in the world, and can eclectically pick and choose foreign policies and what to incorporate in their educational systems. To this end, this study offers a contribution on better understanding didaktik and curriculum traditions in their own right, and it enables countries to pick and choose what they might consider best of these two educational “superdiscourses”. In particular, this study offers an opportunity to countries to consider specific constructs of autonomy and responsibility, and depending on the country or when desirable, to adopt ways to advance them. 138 Arguably, teachers enjoy autonomy by default! When they close classroom doors and lessons start, they are in charge and responsible for everything that happens inside classrooms. The challenge and opportunity for national systems is to equip teachers with, what for the sake of argument I will refer to as, “strategic and transparent autonomy and responsibility”. What I mean here entails the idea that teachers’ autonomy and responsibility should be guaranteed and at the same time overt to educational stakeholders and general public. This almost generates a paradoxical, yet crucially necessary, scenario where teachers have to exercise their guaranteed autonomy and responsibility, always making transparent to the public the ways in which they use such autonomy and responsibility. In turn, this scenario could inadequately be viewed as too much public oversight, but in fact it should be a tacit contract between teachers and the public, where public means both the general public and educational-specific authorities. Returning to the idea of “the eclectic” and based on the findings of this study as well as on understanding of educational systems examined here, it seems that Nordic countries – Denmark, Finland, Norway, and Sweden – have been eclectic with respect to educational models for quite some time, more so since they engaged in system-wide reforms during 1980s. Being close to Continental Europe, they had already adopted didaktik theory as guiding philosophy and framework to organize their educational systems and especially to educate their teachers. But that didn’t stop them from being open to educational developments in Anglo-American world, from where they seem to have adopted and adapted new modes of system-wide assessments, local control, and to a lesser degree comprehensive compulsory schooling. For one Nordic country, to improve the education system means “[…] improving the teaching force, limiting student testing to a necessary minimum, placing responsibility and trust before accountability, and handing over school- and district-level leadership to educational professionals” (Sahlberg, 139 2010, p. 5). The country the quote refers to is Finland, but to some extent, the other three Nordic countries seem to have followed the same agenda. The eclectic education model has been tried and considering Nordic countries’ educational performance and overall welfare of their citizens, it has also delivered. The message for English and German speaking nations in the study is to book trips to Scandinavia and bring back their initially-home-grown-now-transformed educational policies and practices. 140 References Acock, A. C. (2008). A gentle introduction to Stata. Stata press. Adler, M. J. (1951). Labor, leisure, and liberal education. The Journal of General Education, 6(1), 35–45. Alexander, R. J. (2012). Moral Panic, Miracle Cures and Educational Policy: what can we really learn from international comparison? Scottish Educational Review, 4–21. Anderson, L. W. (1987). The decline of teacher autonomy: Tears or cheers? International Review of Education, 33(3), 357–373. Apple, M. W. (2004). Ideology and Curriculum (3rd ed.). New York: Routledge Falmer. Australian Government (2014). Australian education system. Retrieved from: http://www.studyinaustralia.gov.au/global/australian-education/education-system Australian Institute for Teaching and School Leadership (2014). Australian Professional Standards for Teachers. Retrieved from: http://www.aitsl.edu.au/australian-professionalstandards-for-teachers Autio, T. (2013). Subjectivity, curriculum, and society: Between and beyond the German Didaktik and Anglo-American curriculum studies. Routledge. Baker, D. P., Goesling, B., & LeTendre, G. K. (2002). Socioeconomic Status, School Quality, 141 and National Economic Development: A Cross-National Analysis of the “HeynemanLoxley Effect” on Mathematics and Science Achievement. Comparative Education Review, 46(3), 291–312. Baker, D. P., Lee, J., & Heyneman, S. P. (2003). Should America be more like them? Cross-national high school achievement and US policy. Brookings Papers on Education Policy, 309-338. Baker, D.P., & LeTendre, G.K. (2005). National differences, global similarities: World culture and the future of schooling. Stanford, California: Stanford University Press. Baker, D.P. (2009). The schooled society and beyond: the modernizing role of formal education as an institution. Retrieved from http://www.beyondcurrenthorizons.org.uk/wpcontent/uploads/ch3_final_baker_modernizingroleformaleducation_20081218-new.pdf Baker, D.P. (2011). Forward and backward, horizontal and vertical: Transformation of occupational credentialing in the schooled society. Research in Social Stratification and Mobility, 29(1), 5–29. Baker, D.P. (2014). The Schooled Society: The Educational Transformation of Culture. Stanford, California: Stanford University Press. Bakhtin, M. M. (1981). The dialogic imagination: Four essays (C. Emerson & M. Holquist, Transl.). Austin: University of Texas Press. Ballantine, J. H., & Spade, J. Z. (2007). Understanding Education Through Sociological Theory. 142 Schools and Society: A Sociological Approach to Education, 5. Baumert, J., Bos, W., Lehmann, R. (Eds.) (2000): Dritte Internationale Mathematik- und Naturwissenschaftsstudie: Mathematische und naturwissenschaftliche Bildung am Ende der Schullaufbahn. Bd. 1: Mathematisch-naturwissenschaftliche Grundbildung am Ende der Pflichtschulzeit. Opladen: Leske + Budrich. Becker, G. S. (1962). Investment in human capital: a theoretical analysis. The Journal of Political Economy, 9–49. Biesta, G. (2011). Disciplines and theory in the academic study of education: A comparative analysis of the Anglo-American and Continental construction of the field. Pedagogy, Culture & Society, 19(2), 175–192. Black, P., & Wiliam, D. (2006). Inside the black box: Raising standards through classroom assessment. Granada Learning. Bobbitt, F. (1918). Scientific method in curriculum-making. The curriculum studies reader, 2. Boli, J., Ramirez, F. O., & Meyer, J. W. (1985). Explaining the origins and expansion of mass education. Comparative Education Review, 29(2), 145–170. Bourdieu, P. (1977). 29. Cultural Reproduction and Social Reproduction. Power and ideology in education, 485. Bowles, S., & Gintis, H. (1976). Schooling in capitalist America: Educational reform and the 143 contradictions of American life. New York: Basic Books. Breakspear, S. (2012). “The Policy Impact of PISA: An Exploration of the Normative Effects of International Benchmarking in School System Performance”, OECD Education Working Papers, No. 71, OECD Publishing. http://dx.doi.org/10.1787/5k9fdfqffr28-en Bruner, J. S. (2009). The process of education. Harvard University Press. Byun, S., & Kim, K. (2010). Educational inequality in South Korea: The widening socioeconomic gap in student achievement. Research in the Sociology of Education, 17, 155–182. Carney, S., Rappleye, Jeremy, & Silova, I. (2012). Between Faith and Science: World Culture Theory and Comparative Education. Comparative Education Review, 56(3), 366–393. doi:10.1086/665708 Carnegie Corporation of New York, & Institute for Advanced Study. (2009). The Opportunity Equation: Transforming Mathematics and Science Education for Citizenship and the Global Economy. Retrieved from: http://opportunityequation.org/uploads/files/oe_report.pdf. Cohen, D. K., Raudenbush, S. W., & Ball, D. L. (2003). Resources, instruction, and research. Educational Evaluation and Policy Analysis, 25(2), 119–142. 144 Coleman, J.S., Campbell, E.Q., Hobson, C.J., McPartland, J.,Mood, A.M.,Weinfeld, F.D., York, R.L. (1966). Equality of Educational Opportunity. U.S. Government Printing Office, Washington, DC. Collins, R. (1971). Functional and conflict theories of educational stratification. American Sociological Review, 1002–1019. Collins, R. (1979). The Credentialist Society. New York: Academic. Connelly, M.F., Xu, S. (2008). The Landscape of curriculum and instruction: Diversity and Continuity. In F. M. Connelly, M. F. He & J. Phillion (Eds.), The Sage handbook of curriculum and instruction (pp. 514-533). Sage. Corcoran, T.B. (1995). Transforming professional development for teachers. A guide for state policymakers. Washington, DC: National Governors Association. Council of Ministers of Education Canada (n.d.). The Pan-Canadian Assessment Program (PCAP). Retrieved from: http://www.cmec.ca/240/ Counts, G. (1932). Dare the School Build a New Social Order. New York: John Day Co, 9–10. DeBoer, G. E. (2011). The globalization of science education. Journal of Research in Science Teaching, 48(6), 567 – 591. Del Schalock, H. (1998). Student progress in learning: Teacher responsibility, accountability, and reality. Journal of personnel Evaluation in Education, 12(3), 237-246. 145 Deng, Z., & Luke, A. (2008). Subject Matter: Defining and Theorizing School Subjects. In F. M. Connelly, M. F. He & J. Phillion (Eds.), The Sage handbook of curriculum and instruction (pp. 66-87). Sage. Depaepe, M. (2002). A Comparative History of Educational Sciences: the comparability of the incomparable?. European Educational Research Journal, 1(1), 118-122. Dewey, J., & Dewey, E. (1915). Schools of to-morrow. EP Dutton & Company. Dewey, J. (1916). Democracy and education: An introduction to the philosophy of education. The Macmillan Company. Dreeben, R. (1968). On what is learned in school. Addison-Wesley Reading, MA. Retrieved from http://en.scientificcommons.org/6882154 Durkheim, E. (1956). Education and sociology. Simon and Schuster. Durkheim, E. (2002). Moral education. Dover Publications. Duschl, R. (2008). Science education in three-part harmony: Balancing conceptual, epistemic, and social learning goals. Review of Research in Education, 32(1), 268–291. Duschl, R., Maeng, S., & Sezen, A. (2011). Learning progressions and teaching sequences: A review and analysis. Studies in Science Education, 47(2), 123-182. Ellis, A. K. (2004). Exemplars of curriculum theory. Routledge. Eisner, E. W., & Vallance, E. (1974). Conflicting concepts of curriculum. Berkeley: McCutchan. 146 Eurydice (2015a). Austria: Organisation of the education system and of its structure. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php?title=Austria:Organisation _of_the_Education_System_and_of_its_Structure&oldid=104129 Eurydice (2015b). Austria: Teachers and education staff. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Austria:Teachers_and_Edu cation_Staff Eurydice (2015c). Austria: Organisation and governance. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Austria:Organisation_and_ Governance Eurydice (2015d). Austria: Funding in education. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Austria:Funding_in_Educa tion Eurydice (2015e). Germany: Organisation and governance. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Germany:Organisation_an d_Governance Eurydice (2015f). Germany: Teachers and education staff. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Germany:Teachers_and_E ducation_Staff 147 Eurydice (2015g). Germany: Continuing professional development for teachers working in early childhood and school education. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Germany:Continuing_Prof essional_Development_for_Teachers_Working_in_Early_Childhood_and_School_Educa tion Eurydice (2015h). Germany: Funding in education. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Germany:Funding_in_Edu cation Eurydice (2015i). Denmark: Organisation and governance. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Denmark:Organisation_an d_Governance Eurydice (2015j). Denmark: Teachers and education staff. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Denmark:Teachers_and_E ducation_Staff Eurydice (2015k). Denmark: Continuing professional development for teachers working in early childhood education and school education. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Denmark:Continuing_Prof essional_Development_for_Teachers_Working_in_Early_Childhood_and_School_Educa tion Eurydice (2015l). Denmark: Quality assurance in early childhood and school education. 148 Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Denmark:Quality_Assuran ce_in_Early_Childhood_and_School_Education Eurydice (2015m). Denmark: Funding in education. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Denmark:Funding_in_Edu cation Eurydice (2015n). Finland: Organisation and governance. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Finland:Organisation_and_ Governance Eurydice (2015o). Finland: Teachers and education staff. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Finland:Teachers_and_Edu cation_Staff Eurydice (2015p). Finland: Continuing professional development for teachers working in early childhood and school education. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Finland:Continuing_Profes sional_Development_for_Teachers_Working_in_Early_Childhood_and_School_Educati on Eurydice (2015q). Finland: Early childhood and school education funding. Retrieved 149 from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Finland:Early_Childhood_ and_School_Education_Funding Eurydice (2015r). Finland: Quality assurance in early childhood and school education. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Finland:Quality_Assurance _in_Early_Childhood_and_School_Education Eurydice (2015s). Norway: Organisation and governance. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Norway:Organisation_and _Governance Eurydice (2015t). Norway: Teachers and education staff. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Norway:Teachers_and_Ed ucation_Staff Eurydice (2015u). Norway: Continuing professional development for teachers working in early childhood and school education. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Norway:Continuing_Profe ssional_Development_for_Teachers_Working_in_Early_Childhood_and_School_Educati on Eurydice (2015v). Norway: Quality assurance in early childhood and school education. 150 Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Norway:Quality_Assuranc e_in_Early_Childhood_and_School_Education Eurydice (2015x). Norway: Early childhood and school education funding. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Norway:Early_Childhood_ and_School_Education_Funding Eurydice (2015y). Sweden: Organisation and governance. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Sweden:Organisation_and _Governance Eurydice (2015z). Sweden: Teachers and education staff. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Sweden:Teachers_and_Ed ucation_Staff Eurydice (2015aa). Sweden: Continuing professional development for teachers working in early childhood and school education. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Sweden:Continuing_Profes sional_Development_for_Teachers_Working_in_Early_Childhood_and_School_Educati on Eurydice (2015ab). Sweden: Quality assurance in early childhood and school education. 151 Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Sweden:Quality_Assuranc e_in_Early_Childhood_and_School_Education Eurydice (2015ac). Sweden: Early childhood and school education funding. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Sweden:Early_Childhood_ and_School_Education_Funding Eurydice (2015ad). United Kingdom (England): Organisation and governance. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Sweden:Organisation_and _Governance Eurydice (2015ae). United Kingdom (England): Teachers and education staff. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/United-KingdomEngland:Teachers_and_Education_Staff Eurydice (2015af). United Kingdom (Scotland): Organisation and governance. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/United-KingdomScotland:Organisation_and_Governance Eurydice (2015ag). United Kingdom (Scotland): Teachers and education staff. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/UnitedKingdom-Scotland:Teachers_and_Education_Staff Eurydice (2015ah). United Kingdom (Scotland): Quality assurance. Retrieved from: 152 https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/United-KingdomScotland:Quality_Assurance Eurydice (2015ai). United Kingdom (England): Quality assurance. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/United-KingdomEngland:Quality_Assurance Eurydice (2015aj). Ireland: Organization of the education system and its structure. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Ireland:Organisation_of_th e_Education_System_and_of_its_Structure Eurydice (2015ak). Ireland: Teachers and education stuff. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Ireland:Teachers_and_Edu cation_Staff Eurydice (2015al). Ireland: Funding in education. Retrieved from: https://webgate.ec.europa.eu/fpfis/mwikis/eurydice/index.php/Ireland:Funding_in_Educat ion Gundem, B. B., & Hopmann, S. (1998). Didaktik and/or curriculum. New York: Peter Lang. Hanushek, E. (1971). Teacher Characteristics and Gains in Student Achievement: Estimation Using Micro Data. The American Economic Review, 61(2), 280 – 288. 153 Hanushek, E. A. (1986). The economics of schooling: Production and efficiency in public schools. Journal of Economic Literature, 1141–1177. Hanushek, E. A., Kain, J. F., & Rivkin, S. G. (2004). Why public schools lose teachers. Journal of Human Resources, 39(2), 326–354. Hanushek, E. A., & Raymond, M. E. (2005). Does school accountability lead to improved student performance? Journal of Policy Analysis and Management, 24(2), 297–327. Hanushek, E. A. (2006). Does Educational Tracking Affect Performance and Inequality? Differences-in-Differences Evidence Across Countries. The Economic Journal, 116(510), C63–C76. Hanushek, E. A., & Rivkin, S. G. (2006). Chapter 18 Teacher Quality. In E. H. and F. Welch (Ed.), Handbook of the Economics of Education (Vol. 2, pp. 1051–1078). Elsevier. Retrieved from http://www.sciencedirect.com/science/article/pii/S1574069206020186 Hanushek, E. A., & Woessmann, L. (2010). The economics of international differences in educational achievement. National Bureau of Economic Research. Retrieved from http://www.nber.org/papers/w15949 Helgøy, I., & Homme, A. (2007). Towards a new professionalism in school? A comparative study of teacher autonomy in Norway and Sweden. European Educational Research Journal, 6(3), 232–249. 154 Hirst, P.H. (1966). Educational theory. In J.W. Tibble (Ed.), The study of education (pp. 29–58). London: Routledge and Kegan Paul. Hlebowitsh, P. S., & Wraga, W. G. (1995). Social class analysis in the early progressive tradition. Curriculum Inquiry, 7–21. Hong, G., & Raudenbush, S. W. (2005). Effects of kindergarten retention policy on children’s cognitive growth in reading and mathematics. Educational Evaluation and Policy Analysis, 27(3), 205-224. Hopmann, S., & Riquarts, K. (1995). Didaktik and/or Curriculum (Vol. 147). Institut für die Pädagogik der Naturwissenschaften Kiel, Germany. Hopmann, S. (2007). Restrained teaching: The common core of Didaktik. European Educational Research Journal, 6(2), 109-124. Hopmann, S. T., Brinek, G., & Retzl, M. (2007). Pisa zufolge Pisa–Pisa According to Pisa. Wien: Lit-Verlag. House, R. J., Hanges, P. J., Javidan, M., Dorfman, P. W., & Gupta, V. (2004). Culture, leadership, and organizations: The GLOBE study of 62 cultures. Thousand Oaks, CA: Sage. Hudson, B. (2003). Approaching educational research from the tradition of critical-constructive Didaktik. Pedagogy, Culture and Society, 11(2), 173–187. Hudson, B., & Meyer, M.A. (Eds.). (2011). Beyond fragmentation. Didactics, learning 155 and teaching in Europe. Opladen: Barbara Budrich Publishers. Humboldt, v. W. (1793/2000). Theory of Bildung. In I. Westbury, S. Hopmann, & K. Riquarts (Eds.), Teaching as a Reflective Practice: The German Didaktik Tradition (pp. 57-61). Mahwah, New Jersey: Lawrence Erlbaum Associates. Kansanen, P. (1999). The Deutsche Didaktik and the American Research on Teaching. In B. Hudson, F. Buchberger, P. Kansanen & H. Seel (Eds.), Didaktik/Fachdidaktik as Science(-s) of the Teaching Profession? (pp. 21–36). Umeå, Sweden: Thematic Network of Teacher Education in Europe. Kansanen, P. (2002). Didactics and its relation to educational psychology: Problems in translating a key concept across research communities. International review of education, 48(6), 427-441. Katz, M. B. (1987). Reconstructing American Education. Harvard University Press: Cambridge, MA. Kilpatrick, W. (1918). The project method. The Teachers College Record, 19(4), 319–335. Kind, P. M. (2013). Conceptualizing the Science Curriculum: 40 Years of Developing Assessment Frameworks in Three Large-Scale Assessments. Science Education, 97(5), 671–694. Klafki, W. (1998). Characteristics of critical-constructive didaktik. In B. Gundem & S. Hopmann (Eds.), Didaktik and/or Curriculum (pp. 29-46). New York: Peter Lang. 156 Klafki, W. (2000a). The significance of classical theories of bildung for a contemporary concept of allgemeinbildung. In I. Westbury, S. Hopmann & K. Riquarts (Eds.), Teaching as a Reflective Practice: The German Didaktik Tradition (pp. 85-107). Mahwah, New Jersey: Lawrence Erlbaum Associates. Klafki, W. (2000b). Didaktik analysis as the core preparation of instruction. In I. Westbury, S. Hopmann & K. Riquarts (Eds.), Teaching as a Reflective Practice: The German Didaktik Tradition (pp. 139-159). Mahwah, New Jersey: Lawrence Erlbaum Associates. Klette, K. (2007). Trends in Research on Teaching and Learning in Schools: didactics meets classroom studies. European Educational Research Journal, 6(2), 147–160. Kliebard, H. M. (1970). The Tyler rationale. The School Review, 259–272. Kliebard, H. M. (2004). The struggle for the American curriculum, 1893-1958. Routledge. Künzli, R. (1998). The common frame and places of Didaktik. In B. Gundem & S. Hopmann (Eds.), Didaktik and/or Curriculum (pp. 29-46). New York: Peter Lang. Lagemann, E.C. (1989). The plural worlds of educational research. History of Education Quarterly, 29(2), 185-214. Lareau, A. (1989). Home advantage: Social class and parental involvement in elementary education. London: Falmer. 157 Labaree, D. F. (2005). Progressivism, schools and schools of education: An American romance. Paedagogica Historica, 41(1-2), 275–288. Little, D. (1995). Learning as dialogue: The dependence of learner autonomy on teacher autonomy. System, 23(2), 175–181. Looney, J. W. (2009), “Assessment and Innovation in Education”, OECD Education Working Papers, No. 24, OECD Publishing. http://dx.doi.org/10.1787/222814543073 Lucas, B., G. Claxton and E. Spencer (2013), “Progression in Student Creativity in School: First Steps Towards New Forms of Formative Assessments”, OECD Education Working Papers, No. 86, OECD Publishing. http://dx.doi.org/10.1787/5k4dp59msdwk-en Luke, A. (2004). Teaching after the market: From commodity to cosmopolitan. The Teachers College Record, 106(7), 1422–1443. Lüth, C. (1998). On Wilhelm von Humboldt’s Theory of Bildung. Journal of Curriculum Studies, 30(1), 43–59. Marx, K., & Engels, F. (1971). Capital: 3 Vol. Progress. Matson, J. V. (1992). The art of innovation: Using intelligent fast failure. Pennsylvania State University Press. Matson, J. V. (1996). Innovate or die: A personal perspective on the art of innovation. Paradigm Press. 158 Meghir, C., & Rivkin, S. (2011). Chapter 1 - Econometric Methods for Research in Education. In S. M. and L. W. Eric A. Hanushek (Ed.), Handbook of the Economics of Education (Vol. 3, pp. 1–87). Elsevier. Retrieved from http://www.sciencedirect.com/science/article/pii/B9780444534293000016 Meyer, J. W., Boli, J., Thomas, G. M., & Ramirez, F. O. (1997). World society and the nationstate. American Journal of Sociology, 103(1), 144–181. Meyer, J. W., Ramirez, F. O., Rubinson, R., & Boli-Bennett, J. (1977). The world educational revolution, 1950-1970. Sociology of Education, 242–258. Meyer, J. W., Ramirez, F. O., & Soysal, Y. N. (1992). World expansion of mass education, 1870-1980. Sociology of Education, 128–149. Meyer, J. W., & Rowan, B. (1977). Institutionalized organizations: Formal structure as myth and ceremony. American Journal of Sociology, 340–363. McKnight, P. E., McKnight, K. M., Sidani, S., & Figueredo, A. J. (2007). Missing data: A gentle introduction. Guilford Press. Morrow, R. A., & Torres, C. A. (1995). Social theory and education: A critique of theories of social and cultural reproduction. SUNY Press. National Center on Education and the Economy (2014a). Teacher and principal quality: 159 Australia. Retrieved from: http://www.ncee.org/programs-affiliates/center-oninternational-education-benchmarking/top-performing-countries/australiaoverview/australia-teacher-and-principal-quality/ National Center on Education and the Economy (2014b). Teacher and principal quality: Canada. Retrieved from: http://www.ncee.org/programs-affiliates/center-oninternational-education-benchmarking/top-performing-countries/canadaoverview/canada-teacher-and-principal-quality/ National Center on Education and the Economy (2014c). Teacher and principal quality: New Zealand. Retrieved from: http://www.ncee.org/programs-affiliates/center-oninternational-education-benchmarking/top-performing-countries/new-zealandoverview/3866-2/ National Research Council (NRC). (2003). Understanding Others, Educating Ourselves: Getting More from International Comparative Studies in Education. Committee on a Framework and Long-term Research Agenda for International Comparative Education Studies. C. Chabbott and E. J. Elliott, editors. Board on International Comparative Studies in Education, Board on Testing and Assessment, Center for Education, Division of Behavioral and Social Sciences and Education. Washington, DC: The National Academies Press. New Zealand Ministry of Education (2014). Parents and whanau. Retrieved from: http://www.minedu.govt.nz/Parents.aspx 160 New Zealand Teachers Council (2014). Studying to be a teacher. Retrieved from: http://www.teacherscouncil.govt.nz/content/studying-be-teacher Nye, B., Konstantopoulos, S., & Hedges, L. V. (2004). How large are teacher effects? Educational Evaluation and Policy Analysis, 26(3), 237-257. Oakes, J. (1985). Keeping track: How schools structure inequalities. New Haven, CT: Yale University Press. OECD. (1999). Measuring student knowledge and skills. A new framework for assessment. Paris: Author. OECD. (2003a). Definition and selection of competencies: Theoretical and conceptual foundations (DeSeCo), summary of the final report “Key competencies for a successful life and a well-functioning society. Paris: Author. OECD. (2003b). The PISA 2003 assessment framework—Mathematics, reading, science and problem solving knowledge and skills. Paris: Author. OECD. (2006). Assessing scientific, reading and mathematical literacy: A framework for PISA 2006. Paris: Author. OECD. (2009). PISA 2009 Assessment Framework: Key competencies in reading, mathematics and science. Paris: Author. OECD. (2010). PISA 2009 Results: What Students Know and Can Do – Student 161 Performance in Reading, Mathematics and Science (Volume I). Available at: http://dx.doi.org/10.1787/9789264091450-en OECD. (2011). Lessons from PISA for the United States, Strong Performers and Successful Reformers in Education. OECD Publishing. http://dx.doi.org/10.1787/9789264096660-en OECD. (2012). The PISA 2015 Assessment Framework: Key competencies in reading, mathematics and science. http://www.oecd.org/pisa/pisaproducts/pisa2015draftframeworks.htm OECD. (2013). Lessons from PISA 2012 for the United States, Strong Performers and Successful Reformers in Education, OECD Publishing. http://dx.doi.org/10.1787/9789264207585-en OECD. (2014a). Measuring Innovation in Education: A New Perspective. OECD Publishing. http://dx.doi.org/10.1787/9789264215696-en OECD. (2014b). TALIS 2013 results: An international perspective on teaching and learning. TALIS, OECD Publishing. http://dx.doi.org/10.1787/97899264196261-en Osborne, J. (In Press). Defining a knowledge base for reasoning in science: The role of procedural and epistemic knowledge. In R. A. Duschl & A. S. Bismack (Eds), Reconceptualizing STEM Education: The central role of practices. New York: Routledge, Taylor and Francis Group. 162 Parsons, T. (1959). General theory in sociology. Basic Books. Pearson, L. C., & Moomaw, W. (2005). The Relationship between Teacher Autonomy and Stress, Work Satisfaction, Empowerment, and Professionalism. Educational research quarterly, 29(1), 38-54. Pinar, W. (1995). Understanding curriculum: An introduction to the study of historical and contemporary curriculum discourses (Vol. 17). Peter Lang Pub Inc. Pinar, W., Reynolds, W., Slattery, P., & Taubman, P. (1995). Understanding curriculum: An Introduction to the study of historical and contemporary curriculum discourses. New York, NY: Peter Lang. Pinar, W. (2004). What is curriculum theory? Mahwah, New Jersey: Lawrence Erlbaum Associates Inc. Pinar, W. F. (2011). The character of curriculum studies: Bildung, currere, and the recurring question of the subject. Palgrave Macmillan. Pinar, W. F. (2012). What is curriculum theory?. Routledge. Rabe-Hesketh, S., & Skrondal, A. (2008). Multilevel and longitudinal modeling using Stata. STATA press. Raudenbush, S. W. (2008). Advancing educational policy by advancing research on instruction. American Educational Research Journal, 45(1), 206–230. 163 Raudenbush, S. W., & Bryk, A. S. (2002). Hierarchical linear models: Applications and data analysis methods (Vol. 1). Sage Publications, Inc. Ravitch, D. (2000). Left back: A century of failed school reforms. Simon and Schuster. Reid, W. A. (1997). Principle and pragmatism in English curriculum making 1868-1918. Journal of curriculum studies, 29(6), 667–682. Rivkin, S. G., Hanushek, E. A., & Kain, J. F. (2005). Teachers, schools, and academic achievement. Econometrica, 73(2), 417–458. Sahlberg, P. (2010). Finnish Lessons: What Can the World Learn from Educational Change in Finland?. New York: Teachers College Press. Schiro, M. (2008). Curriculum theory: Conflicting visions and enduring concerns. Sage Publications, Inc. Schiro, M. S. (2nd ed.) (2013). Curriculum theory: Conflicting visions and enduring concerns. Sage. Schmidt, W. H., Zoido, P., & Cogan, L. (2014). Schooling matters: Opportunity to learn in PISA 2012. OECD. Schubert, W. H. (1986). Curriculum: Perspective, paradigm, and possibility. Macmillan Pub. Co. Schubert, W. H. (1996). Perspectives on Four Curriculum Traditions. Educational Horizons, 164 74(4), 169–76. Schultz, T. W. (1961). Investment in human capital. The American Economic Review, 1–17. Schütz, G., Ursprung, H. W., & Wößmann, L. (2008). Education policy and equality of opportunity. Kyklos, 61(2), 279–308. Schwab, J. J. (1969). The practical: A language for curriculum. The School Review, 1–23. Schwab, J. J. (1973). The practical 3: Translation into curriculum. The School Review, 501–522. Schweingruber, H. A., Duschl, R. A., & Shouse, A. W. (Eds.). (2007). Taking Science to School: Learning and Teaching Science in Grades K-8. National Academies Press. Schwenk, B. (1996). Bildung. In D. Lenzen (Hg.), Pädagogische Grundbegriffe. Band 1 (pp. 208-221). Hamburg: Rowohlt Taschenbuch Verlag. Seel, H. (1999). “Allgemeine Didaktik” (“General Didactics”) and “Fachdidaktik” (“Subject Didactics”). In B. Hudson, F. Buchberger, P. Kansanen & H. Seel (Eds.), Didaktik/Fachdidaktik as Science(-s) of the Teaching Profession? (pp. 13–20). Umeå, Sweden: Thematic Network of Teacher Education in Europe. Siljander, P., & Sutinen (2012). Introduction. In P. Siljander, A. Kivelä & A. Sutinen (Eds.), Theories of Bildung and growth: Connections and controversies between continental educational thinking and American pragmatism (pp. 1-18). Springer. Smith, R. C. (2001). Teacher education for teacher-learner autonomy. Language in Language 165 Teacher Education. Retrieved from http://homepages.warwick.ac.uk/~elsdr/Teacher_autonomy.pdf Statistics Canada (2010). Structure of education and training in Canada. Retrieved from: http://www.statcan.gc.ca/pub/81-582-g/2010001/app-ann/app-ann1-eng.htm Stronge, J. H. (2007). Qualities of effective teachers. ASCD. Tucker, M. S. (2011). Surpassing Shanghai: An agenda for American education built on the world's leading systems. Cambridge, MA: Harvard Education Press. Tahirsylaj, A. S. (2012). Stimulating creativity and innovation through Intelligent Fast Failure. Thinking Skills and Creativity, 7(3), 265-270. Tahirsylaj, A., Brezicha, K., & Ikoma, S. (forthcoming). Unpacking teacher differences in Didaktik and Curriculum Traditions: Trends from TIMSS 2003, 2007, and 2011. In G. LeTendre & A. Wiseman (Eds.), Promoting and Sustaining a Quality Teaching Workforce. UK: Emerald Group Publishing. Tertiary Education Commission (2009). Types of tertiary education. Retrieved from: http://www.tec.govt.nz/Tertiary-Sector/Types-of-Tertiary-Education/ Tibble, J.W. (1966) Introduction. In J.W. Tibble (Ed.), The study of education (pp. vii-x). London: Routledge and Kegan Paul. Tort-Moloney, D. (1997). Teacher Autonomy: A Vygotskian Theoretical Framework. CLCS 166 Occasional Paper No. 48. Retrieved from http://www.eric.ed.gov/ERICWebPortal/recordDetail?accno=ED412741 Tyler, R. W. (1949). Basic Principles of Curriculum and Instruction. Chicago: University of Chicago Press. Uljens, M. (1997). School didactics and learning. A School Didactic Model Framing an Analysis of Pedagogical Implications of Learning Theory. Retrieved from http://samples.sainsburysebooks.co.uk/9781135481209_sample_525212.pdf Uljens, M. (2009). Critical School Didaktik–A theory and its features. Abo Akademi University. Retrieved from http://www.vasa.abo.fi/users/muljens/pdf/Uljens__Critical_School_Didactics_-_A_theory_and_its_features,_1.11_2009%20.pdf UNESCO (2014). Education finance. Retrieved from: http://www.uis.unesco.org/Education/Pages/education-finance.aspx UNESCO-IBE (2011). World Data on Education 2010/2011: Australia. Retrieved from: http://www.ibe.unesco.org/fileadmin/user_upload/Publications/WDE/2010/pdfversions/Australia.pdf U.S. Department of Education (2008). Structure of U.S. education. Retrieved from: http://www2.ed.gov/about/offices/list/ous/international/usnei/us/edlite-structure-us.html U.S. Department of Education (2014). Become a teacher. Retrieved from: 167 https://www.teach.org/pathway Vygotsky, L. S. (1978). Mind in society: The development of higher psychological processes. Cambridge, MA: Harvard University Press. Weber, M. (1961). Ethnic groups. Theories of society, 1, 305–309. Weniger, E. (2000). Didaktik as a theory of education. In I. Westbury, S. Hopmann & K. Riquarts (Eds.), Teaching as a Reflective Practice: The German Didaktik Tradition (pp. 85-107). Mahwah, New Jersey: Lawrence Erlbaum Associates. Westbury, I. (2000). Teaching as a reflective practice: What might didaktik teach curriculum? In I. Westbury, S. Hopmann & K. Riquarts (Eds.), Teaching as a Reflective Practice: The German Didaktik Tradition (pp. 85-107). Mahwah, New Jersey: Lawrence Erlbaum Associates. Westbury, I., Hopmann, S., & Riquarts, K. (2000). Teaching as a reflective practice: the German didaktik tradition. Lawrence Erlbaum Associates. Wössmann, L., & West, M. (2006). Class-size effects in school systems around the world: Evidence from between-grade variation in TIMSS. European Economic Review, 50(3), 695–736. Wössmann, L. (2011). Cross-country evidence on teacher performance pay. Economics of Education Review, 30(3), 404–418. 168 Zeichner, K. M. (1993). Traditions of practice in US preservice teacher education programs. Teaching and Teacher Education, 9(1), 1–13. Zhang, L., Khan, G., & Tahirsylaj, A. (forthcoming). Student performance, school differentiation, and world cultures: Evidence from PISA 2009. International Journal of Educational Development. 169 Appendices Appendix A: Key Components of Education Systems in Didaktik Countries Austria Age/Grades 3 to 6 year-old G1 to G4 Education Structure Teacher Education & Certification Teacher Professional Development Assessments Funding G5 to G8 School Option Kindergarten Primary School Lower Secondary (choice based on interest and/or ability) G9 to G13 Upper Secondary Undergraduate to Professional Tertiary Education Level Primary School Lower and upper secondary Vocational School All levels Requirement Required for all teachers Type High school matriculation exit exam (Matura) / recently revised into competence-, learning outcomes-based Source Federal government Notes Mandatory after age 5 Common for all General Secondary School/ New Secondary School/ Academic Secondary School Pre-vocational school; upper academic secondary school or vocational schools. The type of schools vary in length and degree received. Universities (public and private)/ Universities of Applied Sciences/ University Colleges of Teacher Education (Pädagogische Hochschulen (PH)) Requirements Teacher training college Teacher training college/University Teacher training college Unterrichtspraktikum—a year of concurrent courses with first year of teaching Providers University colleges for teacher education (PH) Notes German and Mathematics plus a foreign language are required Two-part exam in writing and oral a month apart (usually May and June) Organized and administered at the school level by teachers themselves in line with Bildungstandards/educational standards (teacher autonomy) Notes Total government expenditure on education in 2010: EUR 19.25 bn Government expenditure on education as % of GDP (%) 5.915 (2010) Note: Adapted from Eurydice (2015a, 2015b, 2015c, 2015d) and UNESCO (2014). 170 Germany Age/Grades Education Structure 3 to 6 year olds 1st to 4th grade 5th to 10th grade 10th to 12th grade onward 13th grade onward Level Early Childhood Teacher Education & Certification Teacher Professional Development Assessments Funding Grundschule Grundschule & all lower secondary school Upper Secondary Schools Vocational teacher All levels Requirement Required for all teachers by ordnance or law at Länder (provinces) level Type Gr. 3 & Gr. 8 Länder level comparative assessments High school exit exam (Abitur) Source Länder and federal funding School Option Notes Kindergarten Primary School Not Compulsory Compulsory from age 6 to 16 years old Options include basic general education, vocational education and Lower Secondary School academic education Upper Secondary Academic tracks with numerous vocational options Traditional universities; college of arts and music; universities of Higher Education applied sciences Requirements Training in Upper Secondary School / Pedagogical stuff for this level do not have the status of teachers 7 Semester training program at a Teachers College / 210 ECTS (European Credit Transfer System and First State Examination At least 7 Semester training program with emphasis in two subject areas at a Teachers College / 210 ECTS and First State Examination Bachelors + Masters / The course of study ends with 270 ECTS and First State Examination Bachelors + Masters with 12 months practical experience / 270 ECTS and First State Examination 12 to 24 months preparatory service which involves in classroom work coupled with continuing course work / Second State Examination Providers State-run in-service teacher training is organised in the Länder at central, regional and local level by staterun in-service training institutes which for the most part are subordinate to the Ministries of Education and Cultural Affairs as dependent Länder institutions. Notes Länder level assessments are diagnostic in nature to track the progress of all schools within a Länder, without any stake to students and teachers Abitur implemented at the school level based national Bildungstandards (educational standards) for German, Math and French/English Notes Total government expenditure on primary and secondary education in 2010: EUR 70 bn Government expenditure on education as % of GDP (%) 5.1 (2010) Note: Adapted from Eurydice (2015e, 2015f, 2015g, 2015h) and UNESCO (2014). 171 Denmark Education Structure Age/Grades 3 to 6 year olds School Option Kindergarten K to 10th grade Folkeskole 11th grade onward Upper Secondary 13th grade onward Higher Education Level Pre-primary Teacher Education & Certification Folkeskole Notes Compulsory only for 6 years old Consists of 1 year of kindergarten; 9 years of primary and lower secondary and 1 year 10th grade form 4 academic tracks with numerous vocational options Professional schools; University colleges; Universities; Arts Universities Requirements Bachelors of science degree Bachelors of science degree with specialization in either Danish, math, nature/technology or physics/chemistry and 1 or 2 other subjects Academic Upper Secondary Bachelors + Masters and year-long paedagogikum Vocational Upper Secondary Bachelors or Masters plus professional experience Teacher Requirement Providers Professional Recommended / Not University colleges for teacher education Development required by any Danish School of Education legislation Danish Union of Teachers Assessments Type Notes A number of frequent The Danish Evaluation Institute monitors the quality of teaching and learning by designing national tests national assessments throughout In Folkeskole, Danish is tested frequently every other year, mathematics twice and English, Folkeskole and upper Geography, Biology and Physics once each / Tests are computer based and adaptive secondary level Funding Source Notes Municipalities pay Government expenditure on education as % of GDP (%) 8.73 (2009) for Folkeskole (KPublic education is free 10th grade) State pays for upper secondary education Note: Adapted from Eurydice (2015i, 2015j, 2015k, 2015l, 2015m) and UNESCO (2014). 172 Finland Age/Grades 0 to 7 years Education Structure Teacher Education & Certification Teacher Professional Development Assessments 6-15 years 9th to 12th grade 13th grade onward Level Early Childhood Education and Primary Teachers Basic Education and Upper Secondary Teachers Upper Secondary Vocational Teachers Requirement No strict requirement School Option Early childhood education Pre-primary and basic education Secondary education Higher Education Notes Local authorities are obligated to provide education and care for all children Comprehensive education Separate tracks for general and vocational education Universities and Polytechnics Requirements A Bachelor’s degree in social pedagogy required A Master’s degree is required, with practical pedagogical training either during or after completion of the degree Subject teachers are trained at teacher education universities pedagogically and in subjectspecific departments for the content part Length of subject-teacher education is 5-6 years (300 ECTS) A Master’s or a Bachelor’s degree is required and length is usually 4 years (240 ECTS) Providers 1 to 5 days a year might be required depending on collective agreements Employers might assign teachers to PD opportunities offered by education providers (local authorities, universities, polytechnics) Notes All national assessments are implemented for data collection purposes to ensure equity and equality of education Type All assessments done at school level National exams Funding Source Notes About 99% of Government expenditure on education as % of GDP (%) 6.76 (2011) funding comes from the federal government Note: Adapted from Eurydice (2015n, 2015o, 2015p, 2015q, 2015r) and UNESCO (2014). 173 Norway Age/Grades 1st to 7th grade 8th to 10th grade School Option Nursery School and Kindergarten available Primary school Lower Secondary School 11th grade onward Upper Secondary School 13th grade onward Level Pre-primary Primary Primary or Lower Secondary Vocational Teacher Upper Secondary Teachers Requirement No strict requirements Higher Education 0 to 6 years old Education Structure Teacher Education & Certification Teacher Professional Development Assessments Notes Compulsory school starts at age 6 Provided as a single structure system No divisions between primary and lower secondary General education, vocational education and academic education and arts focused education Universities; Universities Colleges; Specialized Universities Requirements 3 year program 4 year program which includes one year of specialization in Math, Norwegian or English 4 year program which includes two year of specialization in Math, Norwegian or English 3 year program Integrated Masters Program Providers School owners at municipal level decide about in-service teacher education Local education authorities, teachers' associations, associations for special subjects in higher education institutions, regional officer’s educational departments, County Education Committees, national councils or national education authorities. Notes National tests of students’ skills (reading in Norwegian, and English and Mathematics) for students in grade 5 in primary and grade 8 and 9 in lower secondary school. Final examinations (Norwegian, Mathematics, English) for students in grade 10 in lower secondary schools. Examinations (Norwegian, Mathematics, English and other major subjects) for students in upper secondary school. Trade- and journeyman’s examinations for apprentices. Type National tests and final examinations in primary, lower secondary, upper secondary, and vocational education Funding Source Notes All education Government expenditure on education as % of GDP (%) 6.87 (2010) funding from the central government Note: Adapted from Eurydice (2015s, 2015t, 2015u, 2015v, 2015x) and UNESCO (2014). 174 Sweden Education Structure Teacher Education & Certification Age/Grades Age 1 to 5 1st to 10th Grade 11th grade onward 13th grade onward Level Pre-primary Primary Grades 1-3 School Option Pre-school class Primary School Upper Secondary Higher Education Notes Not mandatory Compulsory between age 7 and 16 Programs based on students’ interests and needs Universities; Universities Colleges; Post-Secondary Vocational Requirements 2 year program at pre-school training program 240 credit hours focus on math, reading and writing for young students Teachers specialize between social subjects, natural science subjects and technology, or one or more practical or artistic subjects. 270 Credits with specialization in three subject areas 330 credits with specialization in two subject areas Providers The Swedish National Agency for Education Higher education institutions and regional development centers Primary Grades 4-6 Primary Grades 7-9 Upper Secondary Teacher Requirement Professional No strict Development requirement, a school responsibility to ensure continued development Assessments Type Notes No exit exams for The Swedish National Agency for Education (Skolverket) is responsible for the national test compulsory and system. upper secondary education National diagnostic tests in grades 3, 6 and 9 Funding Source Notes Shared between state Government expenditure on education as % of GDP (%) 6.98 (2010) and municipalities Note: Adapted from Eurydice (2015y, 2015z, 2015aa, 2015ab, 2015ac) and UNESCO (2014). 175 Appendix B: Key Components of Education Systems in Didaktik Countries Australia Education Structure Age/Grades 5 and younger Grade K to 6/7 Grades 7/8 to 10 Grades 11 to 12 School Option Pre-school options Primary School Secondary School Senior Secondary School Notes Neither compulsory or regulated by the government Compulsory school begins at age 6 Compulsory school ends at age 16 Universities, Technical Schools and Vocational Education Grades 13 onward Tertiary Education Options Level Requirements Teacher Primary School Bachelors or Masters in Education – all teachers must have bachelors at least Education & Teachers An induction period for teachers follows after completion of formal education Certification Secondary School Teacher training at universities or accredited non-government college Teacher Requirement Providers Professional Required and tied to Australian Institute for Teaching and School Leadership (AITSL) develops professional Development accountability standards for teachers and provides PD opportunities for teachers to meet those standards Assessments Type Notes Bi-annual summative States have system-wide assessments to track student achievement assessments Annual tests in Years 3, 5, 7 and 9 include literacy and numeracy tests Annual testing in Years 3, The Australian Curriculum, Assessment and Reporting Authority (ACARA) helps to develop 5, 7 and 9 at school level national tests External examination in Years 11 and 12 Funding Source Notes Most funding for Government expenditure on education as % of GDP (%) 5.12 (2011) school education comes from state level, Federal government covers higher education Note: Adapted from Australian Government (2014), Australian Institute for Teaching and School Leadership (2014), National Center on Education and the Economy (2014a), UNESCO-IBE (2011) and UNESCO (2014). 176 Canada Age/Grades 4 or 5 years Education Structure Teacher Education & Certification 1st to 12th 13th Grade Onward Level School Option Pre-Grade 1 education Notes Mandatory only in Nova Scotia and New Brunswick Quebec is an exception with 6 years of elementary and 5 years of secondary education Elementary to secondary Some forms of vocational education offered in some provinces Degree-granting (primarily universities) and non-degree granting institutions (such as community colleges, regional colleges, Tertiary School centers, institutes, etc.) Requirements 4 year bachelor degree required Those teaching in secondary education need qualifications beyond bachelors Induction training during the first year of teaching Providers Decentralized and varies from province to province Elementary and secondary teachers Teacher Requirement Professional Required in all Development provinces Assessments Type Notes Large scale Provinces have standardized tests at different grades standardized testing at province and Pan-Canadian Assessment Program (PSAP) tests students across the nation to track student nation-wide level achievement Funding Source Notes Primarily provinces Government expenditure on education as % of GDP (%) 2.39 (2011) responsibility Note: Adapted from Statistics Canada (2010), National Center on Education and the Economy (2014b), The Council of Ministers of Education, Canada (n.d.), and UNESCO (2014) 177 United Kingdom (including England, Northern Ireland, Scotland, and Wales / Scotland’s education is somewhat different) Age/Grades School Option Notes 3 to 5 years Early years foundations Not compulsory before age 5 Primary and Secondary offered by a variety of providers, 5 to 11 years Primary School compulsory till 16 Compulsory school ends at age 16 Education In Scotland: At13/14 years old students choose subjects to Structure continuing study. At 15, decide to enter Upper Secondary or 11 to 16 years Secondary School workforce 16 to 19 years Upper Secondary School Offered in both vocational and academic courses Universities, Higher education institutions, and Vocational Age 19 Onward Tertiary Education Education Options Level Requirements Primary School Undergraduate degree Teacher Four-year bachelors degree (Scotland) Education & Teachers Certification Secondary School Postgraduate degree + school experience Note: Any teacher who holds the highly qualified status can teach any child or level (England) Teacher Requirement Providers Professional Required and Higher education institutions and regional development centers Development organized within The National College for Teaching and Leadership school, local level Tied to school development and evaluation and nationally In Scotland: Teaching Scotland’s Future Assessments Type Notes Standardized tests at Department of Education and OFSTED (The Office for Standards in Education, Children’s key stages 1, 2 and 4 Services and Skills) monitor school and student performance / There are less standardized tests (ages 7, 11 and 16) in Wales In Scotland, quality assurance responsibility rests with schools themselves and no standardized tests are administered Funding Source Notes UK government Government expenditure on education as % of GDP (%) 6.22 (2010) / Total for all four (England, Northern countries constituting UK Ireland, Wales) Scotland Government Note: Adapted from Eurydice (2015ad, 2015ae, 2015af, 2015ag, 2015ah, 2015ai) and UNESCO (2014) 178 Ireland Age/Grades 4 to 12 years Education Structure 12 -17/18 18 years onward Level Teacher Education & Certification Teacher Professional Development Assessments Funding Primary teachers Post-primary teachers Requirement Not required, voluntary Type Two key external examinations Source State funding School Option Primary schools Notes Compulsory age from 6 to 16 Three types: Secondary schools / Schools/community colleges / Post-primary schools Comprehensive and community schools Leaving Certificate required to get to higher education Tertiary School institutions Requirements Four-year Bachelor of Education degree required Induction and probation requirements need to be fulfilled within first 3 years of teaching Three/Four-year Bachelor of Education degree required + university one year post-graduate program leading to Professional Diploma in Education Post-Qualification Employment requirements must be fulfilled within 3 years Providers Teacher unions, universities and colleges of education, subject-based teachers, etc. Notes Exam leading to Junior Certificate (age15-16) Exam leading to Leaving Certificate (age 17-18) The State Examinations Commission by the Department of Education and Skills administers exams Notes Government expenditure on education as % of GDP (%) 6.15 (2011) Note: Adapted from Eurydice (2015aj, 2015ak, 2015al) and UNESCO (2014). 179 New Zealand Age/Grades Education Structure 1 to 5 years old 1st to 8th Grade (Year) 9th to 13th Grade (Year 13th onward Teacher Education & Certification Teacher Professional Development Assessments Funding Level 1 to 5 years old Primary to High school Requirement Part of the induction program Type School-based assessment National diagnostic assessments School Option Teacher and parent led options Primary/Middle School Notes 1 to 5 years old Differing schools provide varying grade levels Year 9 & 10 study the same curricula while 11 and 12th grade High School/College choose courses based on interests Universities, vocational programs, and transitional training Tertiary Education programs Requirements Three year undergraduate program 3 to 4 year undergraduate program and/or one year graduate program Providers Ministry of Education manages the professional development and provides guidelines for inservice teacher training All teachers undergo heavy professional development during their first two years of teaching career Notes National Education Monitoring Project (NEMP) is New Zealand’s national diagnostic assessment with a sample of students in grades 4 and 8 Source Notes National government Government expenditure on education as % of GDP (%) 7.38 (2012) funding Note: Adapted from New Zealand Ministry of Education (2014); Tertiary Education Commission (2009); New Zealand Teachers Council (2014); National Center on Education and the Economy (2014c) and UNESCO (2014). 180 United States Education Structure Teacher Education & Certification Teacher Professional Development Assessments Funding Age/Grades 3 to 5 years 1st to 5th grade 6th to 8th 9th to 12th grade School Option Pre-school options Elementary school Middle school High School 13th Grade Onward Level Primary, Secondary School teachers of technology, physical education and music Secondary School teacher Requirement Required at the state level Type State standardized Annual achievement tests National tests Tertiary School Notes Vary by state and local districts Compulsory age of attendance varies by state May be combined with elementary school Differing schools provide varying tracks Community colleges, colleges, universities and vocational education Requirements 4 year bachelor degree States set their certification requirements to get into teaching profession Bachelors degree plus school experience Providers Higher education institutions and teacher professional organizations Notes NCLB related tests to track annual progress of students in reading and math in Grades 3 through 8 and Grade 10 with implications for schools and teachers as part of accountability measures National Assessment of Education Progress (NAEP) administered at least every two years in reading and math in grades 4 and 8 Source Notes State/local district Government expenditure on education as % of GDP (%) 5.42 (2010) funding Note: Adapted from U.S. Department of Education (2008); U.S. Department of Education (2014); and UNESCO (2014). 181 Appendix C: Science Proficiency in Participating Countries in PISA 2009 Countries are ranked in descending order of the percentage of students at Levels 2, 3, 4, 5 and 6. Didaktik countries are squared and curriculum countries are circled. Source: OECD PISA 2009 Database, Table I.3.4. 182 Appendix D: Description for levels of proficiency on science scale Source: OECD 183 Appendix E: Percent of missing data in study variables Variable Missing Percent Missing Science scores Standardized tests Teacher tests Teacher judgments Assessment policies Textbooks used Course content Courses offered School type ESCS Girl Age Grade Immigration status 0 3,562 2,861 3,707 1,838 1,838 1,838 1,838 1,332 2,225 0 0 177 2,709 0 3.71 2.98 3.86 1.91 1.91 1.91 1.91 1.39 2.32 0 0 0.18 2.82 Home language other than test language 3,883 4.05 184 Appendix F: Correlation tables for curriculum countries Australia (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) -(1) Science scores (2) Use of standardized tests -0.0382*** (3) Use of teacher developed tests 0.0647*** 0.0437*** -- (4) Use of teacher judgments (5) Teachers responsible for assessment policies (6) Teachers responsible for choosing textbooks (7) Teachers responsible for course content (8) Teachers responsible for courses offered (9) Gender -0.0068 0.1030*** 0.3604*** -- -0.0009 0.0106 0.0716*** 0.0160 (10) ESCS -- -- 0.0007 0.0100 0.1086*** 0.0406*** 0.3818*** -- 0.0371*** -0.0626*** 0.1650*** 0.0770*** 0.1932*** 0.4040*** -- -0.0334*** -0.0514*** 0.1064*** 0.1877*** 0.3278*** 0.2544*** 0.4064*** -- 0.0322 -0.0185* 0.0567*** 0.0672*** 0.0354*** -0.0137 -0.0296*** -0.0239** -- 0.3867*** -0.0187* 0.0971*** -0.0504*** -0.0178 -0.0155 -0.0077 -0.0554*** 0.0044 *p < 0.05, ** p < 0.01, *** p < 0.001. Weighted. -- 185 Canada (1) Science scores (1) (2) (3) (4) (5) (6) (7) (9) (10) -- (2) Use of standardized tests 0.0029 (3) Use of teacher developed tests 0.0276*** 0.0069 (4) Use of teacher judgments -0.0397*** 0.1485*** 0.0002 (5) Teachers responsible for assessment policies 0.0349*** 0.0425*** 0.0109 ---0.0373*** -- (6) Teachers responsible for choosing textbooks 0.0579*** 0.0719*** -0.0830*** 0.0541*** 0.3766*** -- (7) Teachers responsible for course content 0.0422*** 0.1288*** -0.0341*** 0.1052*** 0.2146*** 0.4248*** -- (8) Teachers responsible for courses offered -0.0255*** 0.0023 -0.0713*** 0.0457*** 0.3028*** 0.2578*** 0.2238*** (9) Gender -0.0087 (10) ESCS (8) 0.3121*** -0.0107 -0.0380*** 0.0328*** 0.0294*** 0.0250*** -0.0128 *p < 0.05, ** p < 0.01, *** p < 0.001. Weighted. 0.0210** 0.0193** 0.0019 0.0300*** 0.0034 0.0706*** -0.0139* -0.0126 --0.0203** -- 186 United Kingdom (1) (2) (3) (4) (5) (6) (7) (8) (9) (1) Science scores -- (2) Use of standardized tests -0.0151 -- (3) Use of teacher developed tests 0.0219* 0.1547*** -- (4) Use of teacher judgments 0.0436*** 0.1417*** 0.4154*** -- (5) Teachers responsible for assessment policies -0.0140 0.0348*** -0.0547*** -0.0275** -- (6) Teachers responsible for choosing textbooks 0.0004 0.0325*** -0.0507*** -0.0409*** 0.0668*** -- (7) Teachers responsible for course content 0.0049 0.1210*** 0.1291*** 0.0708*** -0.0247** 0.1877*** -- (8) Teachers responsible for courses offered -0.1329*** -0.0084 -0.1330*** -0.0893*** 0.2371*** 0.0772*** 0.0016 -- (9) Gender -0.0063 0.0386*** 0.0056 -0.0420*** -0.1351*** 0.0051 0.0176 -0.0769*** -- (10) ESCS 0.4334*** -0.0520*** 0.0349*** 0.0261** -0.0412*** -0.0077 0.0103 -0.1430*** 0.0150 *p < 0.05, ** p < 0.01, *** p < 0.001. Weighted. (10) -- 187 Ireland (1) (2) (3) (4) (5) (6) (7) (8) (9) (1) Science scores -- (2) Use of standardized tests 0.0077 -- (3) Use of teacher developed tests -0.0222 0.1569*** -- (4) Use of teacher judgments 0.0226 0.1383*** 0.0870*** -- (5) Teachers responsible for assessment policies 0.0697*** -0.1169*** -0.0818*** -0.2230*** -- (6) Teachers responsible for choosing textbooks -0.0232 -0.0773*** -0.0598*** -0.0978*** 0.0740*** -- (7) Teachers responsible for course content -0.0257 0.0564** -0.0830*** 0.2016*** 0.1094*** 0.1463*** -- (8) Teachers responsible for courses offered -0.0565*** 0.0261 -0.0291 -0.0495** 0.3642*** 0.1419*** 0.2202*** -- (9) Gender 0.0093 -0.1083*** 0.1521*** 0.0227 -0.0615*** -0.0025 -0.0341* -0.0779*** -- (10) ESCS 0.3389*** 0.0421* 0.0084 -0.0090 -0.0565*** 0.0195 0.0144 -0.0559** -0.0243 *p < 0.05, ** p < 0.01, *** p < 0.001. Weighted. (10) -- 188 New Zealand (1) (2) (3) (4) (5) (6) (7) (8) (9) (1) Science scores -- (2) Use of standardized tests -0.03218 -- (3) Use of teacher developed tests 0.1550*** 0.0687*** -- (4) Use of teacher judgments 0.0519*** 0.1537*** 0.2499*** -- (5) Teachers responsible for assessment policies 0.0507*** 0.1170*** 0.0655*** 0.1682*** -- (6) Teachers responsible for choosing textbooks 0.0045 -0.0583*** -0.0539*** 0.0082 0.1605*** -- (7) Teachers responsible for course content 0.2305*** -0.1158*** 0.2625*** 0.1010*** 0.0949*** 0.4627*** -- (8) Teachers responsible for courses offered 0.1584*** -0.0031 0.1115*** 0.0363* 0.3083*** 0.1725*** 0.3728*** -- (9) Gender 0.0747*** 0.0120 0.1504*** 0.0385* 0.0328* 0.0290 0.1642*** 0.0375* (10) ESCS 0.4035*** -0.0175 0.0562*** -0.0208 0.0405** 0.0089 0.0527*** 0.1268*** -0.0014 (10) --- *p < 0.05, ** p < 0.01, *** p < 0.001. Weighted. United States (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) 189 (1) Science scores -- (2) Use of standardized tests -0.1457*** -- (3) Use of teacher developed tests 0.2002*** 0.0156 -- (4) Use of teacher judgments -0.0060 0.0642*** 0.1631*** -- (5) Teachers responsible for assessment policies -0.1247*** 0.2266*** -0.0298* 0.0451** -- (6) Teachers responsible for choosing textbooks -0.0757*** 0.1398*** 0.0938*** 0.0474*** 0.4530*** -- (7) Teachers responsible for course content -0.0885*** 0.0559*** -0.0859*** 0.3016*** 0.3488*** 0.5020*** -- (8) Teachers responsible for courses offered -0.0382** 0.1331*** -0.1394*** 0.2133*** 0.4325*** 0.5267*** 0.6500*** -- (9) Gender -0.1035*** 0.0110 -0.0253 0.0692*** 0.0108 0.0687*** 0.0860*** 0.0546*** -- (10) ESCS 0.4522*** -0.1640*** 0.1486*** 0.0386** -0.1713*** -0.1815*** -0.0651*** -0.0808*** *p < 0.05, ** p < 0.01, *** p < 0.001. Weighted. -0.0602*** -- 190 Appendix G: Correlation tables for didaktik countries (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) Austria -(1) Science scores (2) Use of standardized tests -0.0630*** (3) Use of teacher developed tests 0.0205 -0.0998*** (4) Use of teacher judgments (5) Teachers responsible for assessment policies (6) Teachers responsible for choosing textbooks (7) Teachers responsible for course content (8) Teachers responsible for courses offered 0.0981*** 0.0055 0.3932*** -0.1316*** -0.0154 0.0338** 0.0091 -0.0083 0.0368** 0.1776*** 0.1630*** 0.1925*** -0.1038*** 0.0944*** 0.1884*** 0.0806*** 0.2681*** 0.3014*** -- -0.0170 0.1323*** -0.0631*** 0.1418*** 0.2655*** 0.1439*** 0.1745*** -0.0224 -0.0918*** 0.0989*** 0.0859*** 0.0209 0.0959*** 0.0230 -0.0555*** -- 0.4382*** -0.0361*** 0.0126 0.0747*** -0.1231*** -0.0139 -0.0962*** -0.0708*** -0.0005 (9) Gender (10) ESCS --- *p < 0.05, ** p < 0.01, *** p < 0.001. Weighted. -- -- -- -- -- 191 Germany (1) (2) (3) (4) (5) (6) (7) (8) (9) (1) Science scores -- (2) Use of standardized tests -0.1711*** -- (3) Use of teacher developed tests -0.0160 0.0109 -- (4) Use of teacher judgments -0.0101 -0.0056 0.4366*** -- (5) Teachers responsible for assessment policies -0.0458** -0.0583*** 0.0638*** 0.1217*** -- (6) Teachers responsible for choosing textbooks -0.0362* -0.0969*** 0.0940*** 0.0684*** 0.3277*** -- (7) Teachers responsible for course content -0.0987*** 0.0192 -0.0525*** 0.0380* 0.3245*** 0.3187*** -- (8) Teachers responsible for courses offered -0.1659*** 0.0428** -0.0864*** -0.0655*** 0.2881*** 0.3131*** 0.3279*** -- (9) Gender -0.0047 -0.0224 0.0019 -0.0221 -0.0231 -0.0462** -0.0093 -0.0037 -- (10) ESCS 0.4350*** -0.1378*** 0.0066 0.0074 0.0025 -0.0087 -0.0371* -0.1036*** -0.0009 *p < 0.05, ** p < 0.01, *** p < 0.001. Weighted. (10) -- 192 Denmark (1) (2) (3) (4) (5) (6) (7) (8) (9) (1) Science scores -- (2) Use of standardized tests 0.0180 -- (3) Use of teacher developed tests 0.0127 0.4005*** -- (4) Use of teacher judgments 0.0191 0.2465*** 0.2881*** -- (5) Teachers responsible for assessment policies 0.0680*** -0.1376*** -0.1562*** 0.0444*** -- (6) Teachers responsible for choosing textbooks 0.0710*** -0.0492*** -0.0798*** -0.0386** 0.2853*** -- (7) Teachers responsible for course content 0.0437*** -0.1224*** -0.0097 -0.0886*** 0.2579*** 0.3189*** -- (8) Teachers responsible for courses offered 0.0348** -0.1311*** -0.1923*** -0.0565*** 0.4617*** 0.2485*** 0.2317*** -- (9) Gender -0.0624*** 0.0127 0.0118 0.0309* -0.0096 -0.0115 -0.0029 -0.0182 -- (10) ESCS 0.3916*** -0.0164 -0.0450*** -0.0030 0.0644*** 0.0521*** 0.0446*** 0.0594*** -0.024 *p < 0.05, ** p < 0.01, *** p < 0.001. Weighted. (10) -- 193 Finland (1) (2) (3) (4) (5) (6) (7) (8) (9) (1) Science scores -- (2) Use of standardized tests 0.0396** -- (3) Use of teacher developed tests -0.0074 0.3832*** -- (4) Use of teacher judgments 0.0490*** 0.1651*** 0.2084*** -- (5) Teachers responsible for assessment policies 0.0578*** -0.0720*** 0.1259*** 0.1529*** -- (6) Teachers responsible for choosing textbooks 0.0036 0.0088 0.0833*** -0.0595*** 0.1573*** -- (7) Teachers responsible for course content 0.0111 0.0018 0.1975*** 0.0655*** 0.2604*** 0.0406** -- (8) Teachers responsible for courses offered 0.0179 0.0135 0.1361*** 0.1116*** 0.2312*** -0.0039 0.3668*** -- (9) Gender 0.1059*** 0.0138 0.0151 0.0087 0.0098 0.0050 0.0135 0.0125 (10) ESCS 0.2719*** -0.0120 -0.0236 -0.0138 0.0448*** -0.0070 0.0246 0.0465*** 0.0123 *p < 0.05, ** p < 0.01, *** p < 0.001. Weighted. (10) --- 194 Norway (1) (2) (3) (4) (5) (6) (7) (8) (9) (1) Science scores -- (2) Use of standardized tests -0.0326* -- (3) Use of teacher developed tests 0.0515*** 0.0113 -- (4) Use of teacher judgments 0.0177 -0.1074*** 0.1595*** -- (5) Teachers responsible for assessment policies 0.0444** -0.0254 -0.0056 0.0694*** -- (6) Teachers responsible for choosing textbooks -0.0067 0.1646*** 0.0784*** -0.0862*** 0.1590*** -- (7) Teachers responsible for course content 0.0133 0.1485*** -0.0491*** -0.1260*** 0.2846*** 0.3060*** -- (8) Teachers responsible for courses offered 0.0268 -0.0728*** 0.0037 -0.0179 0.0235 0.1558*** 0.2003*** -- (9) Gender 0.0205 -0.0125 -0.0147 -0.0087 -0.0118 -0.0071 -0.0192 -0.0233 -- (10) ESCS 0.2937*** 0.0189 -0.0044 -0.0092 -0.0330* -0.0168 -0.0174 -0.0746*** -0.0286 *p < 0.05, ** p < 0.01, *** p < 0.001. Weighted. (10) -- 195 Sweden (1) (2) (3) (4) (5) (6) (7) (8) (9) (1) Science scores -- (2) Use of standardized tests -0.0456** -- (3) Use of teacher developed tests -0.0127 0.0583*** -- (4) Use of teacher judgments -0.0306* 0.0845*** 0.3542*** -- (5) Teachers responsible for assessment policies -0.0004 0.0765*** 0.0843*** -0.0251 -- (6) Teachers responsible for choosing textbooks -0.0394** 0.0136 -0.0340* 0.1227*** 0.0182 -- (7) Teachers responsible for course content -0.0394** -0.0444** -0.1222*** -0.0513*** 0.0840*** 0.0791*** -- (8) Teachers responsible for courses offered 0.0508*** 0.0052 -0.0668*** -0.1202*** 0.2252*** -0.0408** 0.2730*** -- (9) Gender 0.0303 0.0031 0.0025 -0.0029 -0.0025 -0.0175 -0.0209 -0.0019 -- (10) ESCS 0.3881*** -0.0394** -0.0088 0.0273 -0.0778*** -0.0490*** -0.0712*** 0.0312* 0.0002 *p < 0.05, ** p < 0.01, *** p < 0.001. Weighted. (10) -- 196 Appendix H: Descriptive statistics for teacher autonomy items 1) Use of standardized tests / Higher value here means less teacher autonomy Country Mean SD Min Max N SWE DNK NZL USA NOR OECD FIN CAN AUS IRL GBR AUT 1.473644 1.398434 1.375387 1.212103 1.196681 1.191897 1.063328 0.9470986 0.9195504 0.8956996 0.8035691 0.4469886 0.6393583 0.8017181 1.093501 0.5021535 0.5979463 0.959375 0.6331885 0.635598 0.7721146 0.8681785 0.6004895 0.7182859 0 0 0 0 0 0 0 0 0 0 0 0 3 4 4 4 4 4 4 4 4 4 4 4 185 276 156 164 189 10364 200 949 353 123 447 260 DEU 0.4342686 0.5227085 0 2 198 Max N 4 4 4 4 4 4 4 4 4 4 4 4 4 164 955 186 190 205 157 10534 265 353 201 125 454 274 Country USA CAN SWE NOR DEU NZL OECD AUT AUS FIN IRL GBR DNK 2) Use of teacher developed tests Mean SD Min 3.572816 3.523757 3.32627 3.311003 3.178582 2.944607 2.887775 2.750226 2.727661 2.698096 2.282017 2.275016 2.180358 0.7151496 0.7087081 0.8044478 0.8192112 0.9345095 0.8791524 1.009039 0.9476882 1.081928 0.7906053 0.9652089 0.8326167 0.924615 0 1 1 0 1 0 0 0 0 1 1 0 0 197 Country SWE AUT FIN DEU NOR NZL OECD USA CAN GBR IRL AUS DNK 3) Use of teacher judgments Mean SD 3.558783 3.489333 3.417115 3.300511 3.089203 2.833118 2.615704 2.587243 2.489609 2.414119 2.395453 2.383385 2.214317 0.834302 0.8146811 0.8822086 0.988247 0.9914069 1.099833 1.34737 1.494752 1.414863 0.9425681 1.049423 1.344235 1.040375 Min Max N 1 0 1 0 0 0 0 0 0 0 0 0 0 4 4 4 4 4 4 4 4 4 4 4 4 4 186 266 200 205 189 152 10431 161 932 451 125 353 276 Appendix I: Descriptive statistics for teacher autonomy scale by country Descriptive statistics for teacher autonomy scale by country/weighted Country Mean SD Min Max N Sweden 0.8065781 0.777753 -1.933588 2.132128 185 Norway 0.45159 0.7368017 -1.593086 1.719472 188 United States 0.3815064 0.9103744 -2.968056 2.613285 161 Canada 0.1939432 0.8709955 -2.861651 2.613285 920 New Zealand 0.1920103 1.012672 -2.521149 2.613285 152 Finland 0.1732547 0.7737676 -1.827184 2.613285 199 Germany 0.1087924 0.8743737 -2.414745 1.650972 197 Austria -0.0427982 0.8154978 -3.449213 2.613285 257 Australia -0.3389469 1.138171 -3.449213 2.613285 353 Denmark -0.5054214 1.077931 -3.449213 2.613285 274 Ireland -0.5936942 0.938241 -2.861651 2.613285 121 United Kingdom -0.6360779 0.8659084 -2.861651 2.613285 445 Total for 12 countries 0.171952 0.9588621 -3.449213 2.613285 3452 198 Appendix J: Descriptive statistics for teacher responsibility items Country AUT IRL FIN AUS DEU GBR SWE NZL CAN OECD DNK NOR USA 0.8548958 0.8397869 0.8140017 0.8113101 0.7890914 0.7772794 0.7539656 0.7237349 0.6761022 0.6122743 0.5851464 0.5790078 0.4433837 Country GBR SWE IRL FIN NZL AUS DNK AUT NOR CAN DEU OECD USA 1) Teachers decide about assessment policies Proportion SD Min Max 0.3528503 0.3682562 0.3900672 0.3918175 0.4089146 0.4165272 0.4318428 0.4485623 0.4682025 0.4872542 0.4935758 0.4950227 0.4982965 0 0 0 0 0 0 0 0 0 0 0 0 0 2) Teachers decide about textbook use Proportion SD Min 0.9989129 0.9908896 0.9792743 0.9651008 0.9623687 0.9599674 0.9172678 0.89327 0.8399381 0.7463463 0.709136 0.6727282 0.6442658 0.0329894 0.0952648 0.1430287 0.1839781 0.1909041 0.196314 0.2759684 0.3093347 0.3676322 0.4353257 0.4552308 0.4692393 0.4801925 0 0 0 0 0 0 0 0 0 0 0 0 0 N 1 1 1 1 1 1 1 1 1 1 1 1 1 274 127 203 353 213 458 189 159 972 10704 281 190 165 Max N 1 1 1 1 1 1 1 1 1 1 1 1 1 458 189 127 203 159 353 281 274 190 972 213 10704 165 199 Country GBR NZL SWE FIN AUS AUT DNK NOR USA IRL DEU CAN OECD Country GBR NZL FIN AUS IRL CAN USA AUT DNK DEU SWE OECD NOR 3) Teachers decide about course content Proportion SD Min 0.965216 0.9336713 0.9087851 0.8707023 0.8386581 0.8058212 0.7959795 0.742883 0.693344 0.6765307 0.6221224 0.5504863 0.5502009 0.1834326 0.2496421 0.2886791 0.3363586 0.3683681 0.396291 0.403703 0.4381991 0.462509 0.4696528 0.485999 0.4977007 0.4974967 0 0 0 0 0 0 0 0 0 0 0 0 0 Max N 1 1 1 1 1 1 1 1 1 1 1 1 1 458 159 189 203 353 274 281 190 165 127 213 972 10704 4) Teachers decide about the courses offered Proportion SD Min Max 0.8289207 0.7375137 0.6915849 0.6768424 0.6723017 0.5694265 0.5564182 0.4966712 0.4450938 0.4436296 0.4400977 0.3660412 0.2150922 0.37699 0.4413757 0.4629807 0.4683462 0.4712331 0.4954114 0.4983191 0.5009038 0.4978628 0.4979826 0.4977172 0.4817434 0.4119719 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 N 458 159 203 353 127 972 165 274 281 213 189 10703 190 200 Appendix K: Descriptive statistics for teacher responsibility scale by country Descriptive statistics for teacher responsibility scale by country/weighted Country Mean SD Min Max United Kingdom New Zealand Finland Australia Ireland Sweden Austria Denmark Germany Canada Norway United States Total for 12 countries 0.5797123 0.4061814 0.3896089 0.3436941 0.243439 0.2249226 0.1446985 -0.0767221 -0.3047042 -0.3195224 -0.3780905 -0.4932323 0.4875249 0.8111061 0.7174726 0.8366535 0.7677358 0.6471503 0.8323236 0.9495058 1.11964 1.161421 0.9359251 1.266347 -0.2553465 1.166916 -2.535996 -2.535996 -2.535996 -2.535996 -2.535996 -2.535996 -2.535996 -2.535996 -2.535996 -2.535996 -2.535996 -2.535996 N 0.9010759 0.9010759 0.9010759 0.9010759 0.9010759 0.9010759 0.9010759 0.9010759 0.9010759 0.9010759 0.9010759 0.9010759 458 159 203 353 127 189 274 281 213 972 190 165 -2.535996 0.9010759 3584 201 Appendix L: Maximum likelihood estimates for science performance in curriculum countries in PISA 2009 Table 11 Maximum likelihood estimates for science performance in Australia Null model Australia Est. Level-2 cov. (TA items) Level-2 cov. (TA+TR items) Full Model (Level-1 & Level-2 cov.) Est. SE Est. SE Est. SE 512.27*** 10.07 514.52*** 13.03 725.86*** 49.42 β₂ (stand. tests) -9.54 5 -9.38* 4.76 1.06 2.57 β₃(teacher tests) 6.86 3.63 6.58 3.81 3.62 2.36 β₄(teach. judgments) -0.12 2.73 Fixed part β₁ (_cons) 520.01*** (-2.72) 0.54 2.82 -0.57 1.64 β5(assess. policies) -1.43 11.4 1.83 6.30 β6(text. used) -8.3 17.04 -0.73 8.55 β7(course content) 15.9 14.39 4.43 7.60 β8(courses offered) -10.91 9.65 -2.43 6.53 32.85*** 1.22 β9(SES) β10(gender) -4.15** 1.58 β11(age) -13.06*** 3.12 β12(grade) 39.62*** 2.29 β13(1st immigration gen.) -4.59 3.08 β14(2nd immigration gen.) 2.85 2.83 -19.92*** 3.90 β15 (home lang. other than test lang.) β16 (public school) xtreg SD(random intercept) 49.06 Var(random intercept) 2384.93 371.27 2350.39 348.27 1188.83 191.47 SD (level-1 residual) 86.82 Var(level-1 residual) 7262.79 147.37 7262.75 147.35 6481.58 125.31 ICC(rho) 0.24 Wald chi2(3) mixed -26.21*** 4.18 mixed Random part mixed 10.09 Wald chi2(7) 15.58 Wald chi2(18) 1827.01 Log likelihood -83129.69 N=14251 Schools=353 -492731.52 -492716.15 -83124.868 Derived estimates R2 (Total variance) 0.029 0.23 R12 (level-1 var.) 0.009 0.14 R22 (level-2 var.) 0.036 0.51 Note: Standard errors in parentheses in null model. * p < 0.05, ** p < 0.01, *** p < 0.001. Controlling for dummy missing variables. 202 Table 11 Maximum likelihood estimates for student science performance in Canada Null model Canada Est. Level-2 cov. (TA items) Level-2 cov. (TA+TR items) Full Model (Level-1 & Level-2 cov.) Est. SE Est. SE Est. SE 500.75*** 13.36 496.06*** 14.86 767.09*** 52.09 β₂ (stand. tests) -0.81 3.64 -1.63 3.66 -1.36 2.95 β₃(teacher tests) 6.12 3.4 6.07 3.44 -0.86 3.06 β₄(teach. judgments) -1.15 1.67 -1.33 1.72 0.17 1.52 β5(assess. policies) 3.06 5.7 0.15 5.52 β6(text. used) 10.6 6.35 2.78 6.29 β7(course content) 2.99 5.35 -2.59 4.40 β8(courses offered) -10.1 6.22 -3.60 5.62 β9(SES) 22.13*** 1.27 β10(gender) -6.69*** 1.65 β11(age) -13.14*** 2.98 β12(grade) 48.53*** 2.43 β13(1st immigration gen.) -11.22** 4.32 β14(2nd immigration gen.) -13.47*** 3.60 β15 (home lang. other than test lang.) -12.10*** 3.72 Fixed part 517.26*** β₁ (_cons) (-1.48) β16 (public school) xtreg SD(random intercept) 42.51 Var(random intercept) 1750.18 162.87 1711.6 152.16 1103.28 111.53 SD (level-1 residual) 76.92 Var(level-1 residual) 5859.52 100.58 5858.63 100.65 5124.95 91.60 ICC(rho) 0.23 Wald chi2(6) mixed -25.94** 8.29 mixed Random part mixed 11.71 Wald chi2(11) 16.13 Wald chi2(21) 1498.44 Log likelihood -134687.69 N= 23207 Schools=978 -385711.63 -385675.12 -379026.79 Derived estimates R2 (Total variance) 0.015 0.19 R12 (level-1 var.) 0.01 0.13 R22 (level-2 var.) 0.031 0.39 Note: Standard errors in parentheses in null model. * p < 0.05, ** p < 0.01, *** p < 0.001. Controlling for dummy missing variables. 203 Table 12 Maximum likelihood estimates for student science performance in United Kingdom Null model United Kingdom Est. Level-2 cov. (TA items) Level-2 cov. (TA+TR items) Full Model (Level-1 & Level-2 cov.) Est. SE Est. SE Est. SE 511.40*** 20.6 513.37*** 27.02 353.52*** 65.56 β₂ (stand. tests) -2.86 7.02 -2.93 6.59 -2.97 4.39 β₃(teacher tests) -0.88 7.54 -2.44 6.94 -2.01 4.15 4.1 5.47 3.55 5.2 5.03 3.36 β5(assess. policies) 4.58 9.94 10.15 8.68 β6(text. used) 22.4 26.69 23.11 19.86 β7(course content) 5.53 17 -4.47 12.91 β8(courses offered) -34.22** 10.41 -11.18 9.10 β9(SES) 30.83*** 1.64 β10(gender) -10.12*** 2.99 β11(age) 11.49** 4.06 β12(grade) 33.99*** 5.27 β13(1st immigration gen.) 5.75 7.76 β14(2nd immigration gen.) 14.26* 5.89 β15 (home lang. other than test lang.) -16.25* 7.07 Fixed part β₁ (_cons) 508.53*** (-2.41) β₄(teach. judgments) β16 (public school) Random part xtreg SD(random intercept) 50.47 Var(random intercept) 3014.84 352.54 2864.02 332.24 1489.69 180.12 SD (level-1 residual) 79.76 Var(level-1 residual) 6209.05 182.48 6209.06 182.47 5559.13 158.61 ICC(rho) 0.29 Wald chi2(6) mixed -46.59*** 12.17 mixed mixed 5.05 Wald chi2(11) 30.74 Wald chi2(22) 806.47 Log likelihood -71191.81 N= 12179 Schools=482 -676150.44 -676040.63 -623957.37 Derived estimates R2 (Total variance) -0.035 0.21 R12 (level-1 var.) 0.024 0.13 R22 (level-2 var.) -0.184 0.41 Note: Standard errors in parentheses in null model. * p < 0.05, ** p < 0.01, *** p < 0.001. Controlling for dummy missing variables. 204 Table 13 Maximum likelihood estimates for student science performance in Ireland Null model Ireland Level-2 cov. (TA items) Level-2 cov. (TA+TR items) Full Model (Level-1 & Level-2 cov.) Est. Est. SE Est. SE Est. SE 506.27*** (-4.1) 491.96*** 20.92 520.19*** 20.52 745.95*** 110.25 Fixed part β₁ (_cons) β₂ (stand. tests) 1.15 5.4 0.62 5.39 -1.67 4.36 β₃(teacher tests) -3.35 6.07 -4.01 6.09 -7.28 4.47 β₄(teach. judgments) 4.61 6.22 4.56 6.99 2.66 4.92 β5(assess. policies) -5.74 22.91 -5.17 14.95 β6(text. used) -10.82 17.51 -48.21* 21.21 β7(course content) -7.2 12.03 -11.06 10.12 β8(courses offered) -8.17 14.4 1.87 10.58 30.02*** 1.92 β10(gender) -2.42 3.96 β11(age) -10.76 6.98 19.80*** 3.00 β13(1st immigration gen.) -3.28 7.42 β14(2nd immigration gen.) 12.17 15.24 β15 (home lang. other than test lang.) -27.65** 10.10 β16 (public school) -25.77** 9.25 mixed β9(SES) β12(grade) Random part xtreg SD(random intercept) 46.33 Var(random intercept) 2455.47 446.44 2375.09 458.03 1464.74 323.43 SD (level-1 residual) 81.69 Var(level-1 residual) 6801.11 238.27 6800.95 238.29 6033.58 235.52 ICC(rho) 0.24 Wald chi2(6) mixed mixed 16.07 Wald chi2(11) 47.67 Wald chi2(21) 1257.68 Log likelihood -23084.18 N= 3937 Schools=144 -105948.55 -105938.14 -92189.752 Derived estimates R2 (Total variance) -0.05 0.15 R12 (level-1 var.) -0.019 0.1 R22 (level-2 var.) -0.144 0.32 Note: Standard errors in parentheses in null model. * p < 0.05, ** p < 0.01, *** p < 0.001. Controlling for dummy missing variables. 205 Table 14 Maximum likelihood estimates for student science performance in New Zealand Null model New Zealand Level-2 cov. (TA items) Level-2 cov. (TA+TR items) Full Model (Level-1 & Level-2 cov.) Est. Est. SE Est. SE Est. SE 530.52*** (-4.15) 457.24*** 23.4 499.99*** 25.54 424.62*** 85.72 Fixed part β₁ (_cons) β₂ (stand. tests) -3.29 4.3 -1.71 3.37 -2.92 2.70 β₃(teacher tests) 18.94** 7.33 10.89 5.69 8.25* 3.90 4.66 5.64 2.62 4.21 0.99 2.86 5.16 10.97 1.36 7.64 β6(text. used) -144.10*** 22.06 -106.37*** 14.91 β7(course content) 110.68*** 12.46 95.72*** 13.25 β8(courses offered) -22.53* 11.09 15.49*** 7.81 38.52*** 2.59 β10(gender) 7.17 4.97 β11(age) 6.19 5.50 29.87*** 4.60 β13(1st immigration gen.) 0.25 5.72 β14(2nd immigration gen.) -3.45 5.71 β15 (home lang. other than test lang.) -42.51*** 5.91 β16 (public school) -18.35** 6.34 mixed β₄(teach. judgments) β5(assess. policies) β9(SES) β12(grade) Random part xtreg SD(random intercept) 49.83 Var(random intercept) 2636.95 459.89 2104.75 400.19 724.61 130.21 SD (level-1 residual) 90.83 Var(level-1 residual) 7963.18 240.28 7962.63 240.06 6722.36 180.10 ICC(rho) 0.23 Wald chi2(6) mixed mixed 39.7 Wald chi2(11) 924.5 Wald chi2(21) 1678.6 Log likelihood -27703.16 N= 4643 Schools= 163 -60832.24 -60791.03 -57752.06 Derived estimates R2 (Total variance) 0.012 0.31 R12 (level-1 var.) 0.035 0.19 R22 (level-2 var.) -0.062 0.71 Note: Standard errors in parentheses in null model. * p < 0.05, ** p < 0.01, *** p < 0.001. Controlling for dummy missing variables. 206 Table 15 Maximum likelihood estimates for student science performance in United States Null model United States Level-2 cov. (TA items) Level-2 cov. (TA+TR items) Full Model (Level-1 & Level-2 cov.) Est. Est. SE Est. SE Est. SE 499.41*** (-4.1) 421.52*** 31.89 420.30*** 30.38 885.82*** 161.28 Fixed part β₁ (_cons) β₂ (stand. tests) -31.18* 12.25 -23.86* 10.6 -12.98 8.61 β₃(teacher tests) 38.95** 11.65 42.19*** 9.53 29.60*** 5.92 -2.99 6.67 -4.57 5.67 -2.12 3.86 β5(assess. policies) -24.43 13.61 -10.99 11.19 β6(text. used) -17.35 20.94 11.27 13.84 β7(course content) -16.3 20.03 -26.23 15.12 β8(courses offered) 28.45 17.91 10.65 14.07 β9(SES) 22.39*** 3.18 β10(gender) -14.70*** 3.31 β11(age) -26.63** 9.47 β12(grade) 43.06*** 5.00 β13(1st immigration gen.) -4.33 6.99 β14(2nd immigration gen.) -2.34 5.98 β15 (home lang. other than test lang.) -6.72 5.48 β₄(teach. judgments) β16 (public school) Random part xtreg SD(random intercept) 50.49 Var(random intercept) 2500.75 454.81 2261.81 357.03 1297.04 239.70 SD (level-1 residual) 79.15 Var(level-1 residual) 5740.22 162.1 5737.79 162.53 5008.22 134.33 ICC(rho) 0.29 Wald chi2(5) mixed -38.00** 13.95 mixed mixed 300.96 Wald chi2(9) 506.65 Wald chi2(20) 1785.35 Log likelihood N= 5233 -30514 -4150558.6 -4149148.1 -4095343.4 Schools= 165 Derived estimates R2 (Total variance) 0.065 0.28 R12 (level-1 var.) 0.084 0.2 R22 (level-2 var.) 0.019 0.49 Note: Standard errors in parentheses in null model. * p < 0.05, ** p < 0.01, *** p < 0.001. Controlling for dummy missing variables. 207 Appendix M: Maximum likelihood estimates for science performance in didaktik countries in PISA 2009 Table 16 Maximum likelihood estimates for student science performance in Austria Null model Austria Level-2 cov. (TA items) Level-2 cov. (TA+TR items) Full Model (Level-1 & Level-2 cov.) Est. Est. SE Est. SE Est. SE 478.51*** (-4.66) 396.74*** 34.26 454.20*** 33.7 693.63*** 88.85 β₂ (stand. tests) -5.6 12.5 -5.64 13.48 -19.38** 6.72 β₃(teacher tests) 3.68 6.82 4.53 6.92 1.17 4.83 β₄(teach. judgments) 12.8 10.27 13.26 9.43 8.03 7.28 -42.07* 19.75 -30.97* 12.94 β6(text. used) -6.86 18.7 0.12 14.82 β7(course content) -19.57 18.53 -3.55 13.71 β8(courses offered) -8.79 14.21 Fixed part β₁ (_cons) β5(assess. policies) 5.09 10.33 β9(SES) 11.85*** 1.65 β10(gender) -25.71*** 3.54 β11(age) -10.45* 5.36 β12(grade) 39.50*** 2.87 β13(1st immigration gen.) -21.21*** 6.14 β14(2nd immigration gen.) -30.58*** 5.00 β15 (home lang. other than test lang.) -12.91** 4.61 β16 (public school) mixed -13.04 13.08 mixed Random part xtreg SD(random intercept) 75.4 Var(random intercept) 6359.19 696.33 5847.09 645.02 3073.84 362.01 SD (level-1 residual) 65.56 Var(level-1 residual) 4272.67 121.23 4270.43 121.09 3614.06 112.51 ICC(rho) 0.57 Wald chi2(6) mixed 13.51 Wald chi2(11) 29.93 Wald chi2(22) 2057.46 Log likelihood -37368.43 N= 6590 Schools= 282 -271593.58 -271478.36 -259127.77 Derived estimates R2 (Total variance) -0.065 0.33 R12 (level-1 var.) 0.006 0.16 R22 (level-2 var.) -0.119 0.46 Note: Standard errors in parentheses in null model. * p < 0.05, ** p < 0.01, *** p < 0.001. Controlling for dummy missing variables. 208 Table 17 Maximum likelihood estimates for student science performance in Germany Null model Germany Level-2 cov. (TA items) Level-2 cov. (TA+TR items) Full Model (Level-1 & Level-2 cov.) Est. Est. SE Est. SE Est. SE 511.30*** (-5.49) 494.70*** 29.13 521.73*** 29.21 763.21*** 74.01 β₂ (stand. tests) -36.67** 13.36 -33.69** 12.87 -25.97* 12.18 β₃(teacher tests) 0.72 7.63 -1.2 7.79 0.62 6.77 β₄(teach. judgments) 0.98 8.91 -0.71 8.12 -1.48 7.27 β5(assess. policies) 0.86 15.62 -5.05 14.11 β6(text. used) 10.58 15.98 11.57 14.25 β7(course content) -11.4 14.25 -6.97 12.09 β8(courses offered) -39.49** 15.19 -36.16** 12.95 β9(SES) 10.26*** 1.24 β10(gender) -20.37*** 1.79 β11(age) -13.35*** 3.81 β12(grade) 37.32*** 1.93 β13(1st immigration gen.) -12.78** 4.85 β14(2nd immigration gen.) -18.48*** 3.85 β15 (home lang. other than test lang.) -16.12*** 4.42 Fixed part β₁ (_cons) β16 (public school) Random part xtreg SD(random intercept) 81.45 Var(random intercept) 6593.99 627.19 6173.42 572.18 4332.85 496.24 SD (level-1 residual) 59.27 Var(level-1 residual) 3465.68 104.93 3465.37 104.91 2634.14 69.59 ICC(rho) 0.65 Wald chi2(5) mixed -23.27 20.21 mixed mixed 12.5 Wald chi2(10) 26.5 Wald chi2(20) 1222.86 Log likelihood -27810.63 N= 4979 Schools= 226 -1635456.9 -1634968 -1454987.5 Derived estimates R2 (Total variance) 0.009 0.31 R12 (level-1 var.) 0.013 0.25 R22 (level-2 var.) 0.006 0.35 Note: Standard errors in parentheses in null model. * p < 0.05, ** p < 0.01, *** p < 0.001. Controlling for dummy missing variables. 209 Table 19 Maximum likelihood estimates for student science performance in Denmark Null model Denmark Level-2 cov. (TA items) Level-2 cov. (TA+TR items) Full Model (Level-1 & Level-2 cov.) Est. Est. SE Est. SE Est. SE 481.10*** (-2.75) 492.15*** 11.84 470.59*** 12.04 650.94*** 97.08 β₂ (stand. tests) 1.89 3.51 2.42 3.7 4.42 2.86 β₃(teacher tests) -0.72 4.11 0.11 4.06 1.64 3.05 0.8 3.08 0.51 3.02 1.99 2.38 6.52 6.89 7.00 5.03 19.18* 9.85 14.74 8.68 β7(course content) -0.6 7.89 1.14 5.73 β8(courses offered) -2.56 7.11 -6.03 5.47 β9(SES) 28.47*** 1.52 β10(gender) -15.15*** 2.90 -10.94 5.85 β12(grade) 40.19*** 3.84 β13(1st immigration gen.) -37.28*** 6.25 β14(2nd immigration gen.) -34.82*** 4.57 β15 (home lang. other than test lang.) -22.99*** 5.73 Fixed part β₁ (_cons) β₄(teach. judgments) β5(assess. policies) β6(text. used) β11(age) β16 (public school) Random part xtreg SD(random intercept) 42.49 Var(random intercept) 1400.47 165.38 1266.52 146.03 618.40 84.27 SD (level-1 residual) 82.42 Var(level-1 residual) 6474.04 185.35 6472.45 185.63 5329.18 155.37 ICC(rho) 0.21 Wald chi2(5) mixed -2.17 6.58 mixed mixed 12 Wald chi2(10) 93.9 Wald chi2(20) 1312.38 Log likelihood -34804.82 N= 5924 Schools= 285 -191445.64 -191376.09 -185910.1 Derived estimates R2 (Total variance) 0.084 0.31 R12 (level-1 var.) 0.047 0.22 R22 (level-2 var.) 0.224 0.66 Note: Standard errors in parentheses in null model. * p < 0.05, ** p < 0.01, *** p < 0.001. Controlling for dummy missing variables. 210 Table 18 Maximum likelihood estimates for student science performance in Finland Null model Finland Level-2 cov. (TA items) Level-2 cov. (TA+TR items) Full Model (Level-1 & Level-2 cov.) Est. Est. SE Est. SE Est. SE 548.44*** (-2.16) 540.41*** 11.64 533.37*** 12.63 685.43*** 86.2 β₂ (stand. tests) 8.79 5.85 10.32 5.59 16.74*** 4.64 β₃(teacher tests) -4.67 3.7 -5.58 3.79 -4.82 3.16 β₄(teach. judgments) 5.09 3.36 4.19 3.23 3.41 2.30 14.62* 6.04 12.90* 5.29 β6(text. used) -0.94 20.19 -0.03 19.21 β7(course content) 2.17 6.43 -0.99 5.50 β8(courses offered) -2.85 6.51 -0.67 4.51 β9(SES) 27.03*** 1.60 β10(gender) 13.98*** 2.73 -9.68 5.10 39.17*** 5.69 β13(1st immigration gen.) -29.22 15.37 β14(2nd immigration gen.) -27.88* 12.34 -35.76*** 6.08 Fixed part β₁ (_cons) β5(assess. policies) β11(age) β12(grade) β15 (home lang. other than test lang.) β16 (public school) xtreg Random part mixed -17.16 10.05 mixed mixed SD(random intercept) 26 Var(random intercept) 598.29 113.55 576.36 112.06 410.45 73.53 SD (level-1 residual) 80.66 Var(level-1 residual) 6552.25 174.46 6549.67 174.06 5707.73 137.74 ICC(rho) 0.09 Wald chi2(5) NR Wald chi2(9) NR Wald chi2(20) NR Log likelihood -33887.24 N= 5810 Schools= 203 -122868.08 -122854.48 -121355.85 Derived estimates R2 (Total variance) 0.004 0.15 R12 (level-1 var.) -0.007 0.12 R22 (level-2 var.) 0.115 0.39 Note: Standard errors in parentheses in null model. * p < 0.05, ** p < 0.01, *** p < 0.001. Controlling for dummy missing variables. NR=not reported for model by software. 211 Table 19 Maximum likelihood estimates for student science performance in Norway Null model Norway Level-2 cov. (TA items) Level-2 cov. (TA+TR items) Full Model (Level-1 & Level-2 cov.) Est. Est. SE Est. SE Est. SE 499.97*** -2.46 479.05*** 14.95 472.41*** 17.08 199.60*** 78.82 Fixed part β₁ (_cons) β₂ (stand. tests) -5.06 6.13 -3.6 5.83 -2.89 5.41 β₃(teacher tests) 7.84* 3.28 7.91** 3.06 6.07* 3.23 -0.3 2.98 -0.55 2.86 0.12 2.48 11.93* 6 11.63* 5.10 β6(text. used) -4.14 7.71 -2.32 6.54 β7(course content) -0.41 6.13 -0.58 5.93 β8(courses offered) 12.41 9.81 14.28 9.12 30.24*** 2.37 4.48 2.96 15.90** 5.12 β12(grade) 14.15 17.27 β13(1st immigration gen.) -7.68 12.20 β14(2nd immigration gen.) -19.63 10.81 -47.46*** 9.36 β₄(teach. judgments) β5(assess. policies) β9(SES) β10(gender) β11(age) β15 (home lang. other than test lang.) β16 (public school) Random part xtreg SD(random intercept) 29.87 Var(random intercept) 864.28 143.24 836.39 146.6 624.45 109.65 SD (level-1 residual) 80.17 Var(level-1 residual) 6453.3 161.48 6445.99 161.34 5653.72 144.83 ICC(rho) 0.12 Wald chi2(5) mixed 14.77 11.28 mixed mixed NR Wald chi2(9) NR Wald chi2(19) NR Log likelihood -27182.53 N= 4660 Schools= 197 -128025.15 -128000.97 -123893.3 Derived estimates R2 (Total variance) 0.0003 0.14 R12 (level-1 var.) -0.004 0.12 R22 (level-2 var.) 0.031 0.3 Note: Standard errors in parentheses in null model. * p < 0.05, ** p < 0.01, *** p < 0.001. Controlling for dummy missing variables. NR=not reported for model by software. 212 Table 20 Maximum likelihood estimates for student science performance in Sweden Null model Sweden Level-2 cov. (TA items) Level-2 cov. (TA+TR items) Full Model (Level-1 & Level-2 cov.) Est. Est. SE Est. SE Est. SE 497.97*** (-2.75) 537.88*** 26.71 573.11*** 27.44 572.18*** 83.24 Fixed part β₁ (_cons) β₂ (stand. tests) -13.6 8.01 -14.09 8.18 -10.41 5.65 β₃(teacher tests) 1.29 6.54 0.33 6.64 2.50 4.05 β₄(teach. judgments) -4.33 6.88 -3.66 6.93 -7.07* 3.37 5.6 13.75 8.35 8.96 β6(text. used) -30.18 15.98 -15.72 18.92 β7(course content) -8.33 16.79 0.34 12.78 β8(courses offered) -0.92 10.43 2.65 7.07 32.84*** 2.21 β10(gender) 2.70 2.83 β11(age) -1.24 5.13 β12(grade) 57.10*** 7.67 β13(1st immigration gen.) -30.26** 9.76 β14(2nd immigration gen.) -28.67*** 6.85 β15 (home lang. other than test lang.) -15.07 8.33 β16 (public school) -21.31 12.82 mixed β5(assess. policies) β9(SES) Random part xtreg SD(random intercept) 39.45 Var(random intercept) 2148.68 422.47 2125.5 417.65 873.57 198.63 SD (level-1 residual) 87.31 Var(level-1 residual) 7691.12 199.81 7691.65 200.23 6361.53 184.20 ICC(rho) 0.17 Wald chi2(5) mixed mixed 4.49 Wald chi2(9) 10.86 Wald chi2(20) 909.21 Log likelihood -27055.62 N= 4567 Schools= 189 -237393.89 -237386.88 -233090.04 Derived estimates R2 (Total variance) -0.072 0.21 R12 (level-1 var.) -0.009 0.17 R22 (level-2 var.) -0.381 0.44 Note: Standard errors in parentheses in null model. * p < 0.05, ** p < 0.01, *** p < 0.001. Controlling for dummy missing variables. 213 Appendix N: Innovation in education, 2000-2011 Overall composite innovation index (Source: OECD 2014) School and classroom composite innovation index, 2000-2011 (Source: OECD 2014) VITA Armend Tahirsylaj Education 2015 (expected) Ph.D. in Educational Theory & Policy, The Pennsylvania State University 2007 M.Ed. in Curriculum, Teaching & Learning, University of Calgary 2003 B.A. in English Language and Literature, University of Prishtina Grants, Assistantships, Fellowships, and Scholarships 2014-2015 Center for Online Innovation in Learning Grant, The Pennsylvania State University 2011-2014 Waterbury Graduate Assistantship, The Pennsylvania State University 2010-2011 Hubert Humphrey Fellowship, The Pennsylvania State University 2004-2007 Canadian International Development Agency Scholarship, University of Calgary Professional Experience 2014-2015 Program Manager, The Pennsylvania State University 2011-2014 Graduate research assistant, The Pennsylvania State University 2009-2010 Education Adviser & Coordinator, Ministry of Education (Kosovo) 2004-2010 Lecturer, University of Prishtina (Kosovo) Selected Publications Tahirsylaj, A., Brezicha, K., & Ikoma, S. (In press). Unpacking teacher differences in Didaktik and Curriculum Traditions: Trends from TIMSS 2003, 2007, and 2011. In G. LeTendre & A. Wiseman (Eds.), Promoting and Sustaining a Quality Teaching Workforce. UK: Emerald Group Publishing. Bismack, A., Ong, Y.S., Tahirsylaj, A., & Duschl, R. (In press). Driving change forward. In R. A. Duschl & A. S. Bismack (Eds), Reconceptualizing STEM Education: The central role of practices. New York: Routledge, Taylor and Francis Group. Zhang, L., Khan, G., & Tahirsylaj, A. (forthcoming). Student Performance, School Differentiation, and World Cultures: Evidence from PISA 2009. International Journal of Educational Development. Tahirsylaj, A. (2013). Educational Change in Post-war Kosovo: Perceptions of Kosovo Educators as Presented in Shkëndija. Interchange, 44(1-2) 1-13. Tahirsylaj, A. (2012). Stimulating creativity and innovation through Intelligent Fast Failure. Thinking Skills and Creativity, 7(3), 265-270. Tahirsylaj, A. (2010). Higher Education in Kosovo: Major Changes, Reforms and Development Trends in Post-conflict Period at University of Prishtina. Interchange, 41(2), 171-183.
© Copyright 2026 Paperzz