Reading Renaissance Testing the Reading Renaissance Program Theory: A Multilevel Analysis of Student and Classroom Effects on Reading Achievement Geoffrey D. Borman N. Maritza Dowling University of Wisconsin-Madison 1 Reading Renaissance 2 In the recent National Reading Panel report, the panel concluded that guided oral reading is an important component for developing reading fluency (NICHD, 2000). In the panel’s review, it found that guided oral reading helped students across a wide range of grade levels to learn to recognize new words, read accurately and easily, and comprehend what they read. In contrast to these benefits associated with guided oral reading, the panel found considerably less research to support the practice of silent independent reading. Though the Panel’s conclusions did not refute that encouraging children to read more improves reading achievement, the report did state that guided oral reading procedures are more effective in this regard than is free, voluntary silent reading. One of the major benefits of the oral reading approach is that it can serve as an important diagnostic tool for teachers to assess a student’s word attack skills, fluency, and comprehension by listening to the child’s emphasis and phrasing. By overemphasizing oral reading, though, the teacher may divert a student’s attention from comprehension, especially if the teacher stresses reading the text absolutely correctly over understanding its meaning (Graves, Juel, & Graves, 1998). On the other hand, there is evidence suggesting that encouraging a greater amount of silent reading time in which students read extensively and have ownership over the materials they choose to read can contribute to higher intrinsic motivation to read (Ivey & Broaddus, 2001), and increased vocabulary and comprehension skills (Cunningham & Stanovich, 1998). The Reading Renaissance (RR) program developed by Renaissance Learning, offers the structure and diagnostic feedback that is a key strength of the oral reading approach along with the flexible and student-centered emphasis of silent reading. Diagnostic feedback is provided by the computerized Accelerated Reader (AR) Reading Renaissance 3 information system, which was first made available to schools in 1986 and, according to the developers, is currently used in more than half of the schools in the U.S. (Paul, 2003). The AR provides both students and instructors with immediate feedback on student reading practice through short, computer-based quizzes that assess students’ comprehension of the books they have read. The teacher then uses the information to guide further reading by helping students choose books that are appropriate to their ability levels and interests and diagnosing reading difficulties for individual students and the classroom as a whole. With more than 65,000 AR Reading Practice Quizzes, there are assessments available for a wide range of books. The independent reading program coupled with the systematic feedback provided by AR form the core of the RR guided independent reading program. Using a large-scale database of students and classrooms, this analysis tested the central premises that underlie the RR guided independent reading program. These premises are that higher-quality implementations of the program are marked by a greater amount of reading, a high reading success rate, use of reading material that is appropriately matched to a student’s ability and, ultimately, that these indicators of implementation are related to improved reading achievement. Specifically, applying a two-level hierarchical linear model, with students nested within RR classrooms, we assess the relationships between students’ individual reading behaviors and achievement and the relationship between classroom-level implementation of RR and its overall and compensatory effects on achievement outcomes. Reading Renaissance 4 Background RR uses the computer-provided information to track and manage the development of students’ reading skills and to motivate students of all ages to read more and better books. Although the program is available for K-12, most of the sites using RR are elementary and middle schools. The RR program has 4 main components: reading practice; the computerized AR system; a motivational system; and the teacher’s role to motivate, instruct, monitor, and intervene. Teachers adopting the RR program design independent reading as part of classroom instruction along with the diagnostic feedback provided by the AR system. Using information from AR, teachers set goals for each student on an individual basis. Teachers monitor students’ progress and confirm that students are reading appropriate books that match their reading level and that they are making progress. Teachers also provide instruction on reading skills needed to improve reading comprehension and they intervene when students are not successful, helping them choose appropriate books and use more appropriate reading strategies. RR recommends three goals for independent reading practice: quality, quantity, and challenge. Quality is measured by the average percent correct on the AR quizzes. The program stresses high levels of successful reading, with an average percent correct of 85% or more on the AR quizzes of students’ comprehension of the books they have read. The quantity goal is determined by the amount of in-school time that students have spent engaged in guided independent literature-based reading of self-selected books. The last factor or goal, challenge is related to students being guided to select reading material that is at the appropriate difficulty level for their reading abilities. These goals of quality, quantity, and challenge that drive high-quality implementations of the program have been Reading Renaissance 5 informed by research and by the developers’ practical experiences implementing AR and RR across the country. Quality: Percent Correct The average percent correct on AR quizzes is the key indicator that students are enjoying success as independent readers and is an important indicator of high-quality implementation of the RR program. There is extensive research showing how high levels of success result in large reading achievement gains (Allington, 1984; Rosenshine & Stevens, 1984; Betts, 1946). It has been argued that success rate on tasks, as measured by percent correct, is a powerful indicator not only assessing comprehension but also engagement and, to some extent, motivation and interest (Cunninghan & Cunninghan, 2002). High success also indicates that students are receiving sufficient help to meet their needs, context clues necessary to learn new material, and appropriate scaffolding (Beck & MacKeown, 1991). Low-achievers appear to benefit most from high rates of success compared to high-achievers. For example, Marliave & Filby (1985) found that with low-achieving students, high rates of success have larger effects on learning than engaged time on task. The most important achievement goal for all students in RR programs is to maintain a minimum average of 85% correct on the AR reading practice quizzes. In fact, in the grades 4-12, the developers suggest maintaining an average in the 90% range to optimize reading growth (Paul, 2003). Initial minimum percent correct goals are often set at 85% to prevent students from getting discouraged at the outset of the program. Since the RR model stresses formative evaluation and continuous student monitoring, diagnosis, feedback, and reading adjustment, if students have difficulties maintaining the Reading Renaissance 6 minimum average percent correct, instructors may decide on an appropriate strategy to help them succeed. Guidance is often provided on selecting appropriate books and students also are taught strategies, such as re-reading or taking and reviewing notes before taking the quizzes. Quantity: Number of Words Read The most fundamental indicator of student progress in a guided independent reading program designed to motivate students to read more is, of course, the quantity of text read. As noted in the National Reading Panel report, there are literally hundreds of correlational studies that have shown that the best readers read the most and the worst readers read the least (NICHD, 2000). This suggests that the more children read, the better their literacy outcomes. As the panel noted, though, it could also be the case that better readers simply choose to read more. In a comprehensive meta-analysis and literature review on independent reading practices, reading exposure, and other correlates of reading achievement, Lewis & Samuels (2003) also found strong empirical evidence that students who read more are better readers and obtain higher overall achievement scores than students who read less and spend less time reading. When this independent reading is guided or monitored, as it is in RR, it leads to even greater gains in achievement (Yu, 1993). The RR developers suggest that 60 minute of in-school time per day be allocated to guided independent reading across all grades, 1-12 (Paul, 2003). Though 60 minutes per day is more that some reading experts suggest, Allington (2001) has recommended that students may benefit from up to 90 minutes per day. Challenge: Zone of Proximal Development Reading Renaissance 7 The idea that the potential for reading development depends upon the zone of proximal development (ZPD) is based on principles introduced by Vygotsky (1978) in his social development theory. Cognitive development, states Vygotsky, is limited to a certain range at any given age. Translated into the RR philosophical context, optimal learning takes place when students read books with readability levels that match their ZPD. This idea has been supported in a number of research studies conducted since the late 1970s applying Vygotsky’s theoretical principles (Din, 1998; Slavin, Karweit, & Madden, 1989; Gickling & Armstron, 1978). The ZPD is then conceptualized as a recommended book-level range for the student geared to maximize the effectiveness of the reading program. It is a level of difficulty that is not too hard or too easy for the student. In setting student learning goals, RR provides the flexibility for teachers to choose a new range or create a wider one that better matches the student’s abilities if books in the recommended range seem too hard or easy for the student. After identifying an initial ZPD, teachers monitor students’ reading practice and adjust their ZPD if necessary. The intention is to adequately match the ZPD with the actual book reading ability for the student. Usually the starting point is near the low end of the student’s ZPD. This practice allows students to select book levels within a wider overall ZPD range. Method Sample and Data The sample and data for this study were obtained from the Reading Practice Database (RPD) assembled and maintained by the Research and Evaluation Department of Renaissance Learning. The RPD is a large database that contains information on Reading Renaissance 8 student guided independent literature-based reading. It was developed primarily for the purpose of analyzing student reading practice patterns in schools implementing RR schoolwide. The RPD contains the reading records of 50, 807 students in 139 U.S. schools in grades 1-12 using the Accelerated Reader (AR) computerized information system in the 2001-2002 school year. The data contained in RPD are not a nationally representative sample of U. S. schools but rather a typical sample of schools participating in the RR program. To conduct the multilevel analyses, two files were created from the RPD: a student-level file and a classroom-level file. Records in the student-level file were included only if a student (a) belonged to a classroom with five or more student records in the RPD, (b) had both pre- and posttest STAR Reading scores, (c) had some activity in the AR between the pre- and posttest dates, and (d) had no missing information on the grade level variable. Only six records in the student-level file had no information on grade level. The sample was further restricted to students in grades 2 to 12, as students in grade 1 do little independent reading. Therefore, 5,150 first-grade students were not included the HLM analyses. The final sample consisted of 45,108 student records. This sample spanned 2,434 classrooms with an average class size of 19 students. The demographic characteristics of the students in the analytic sample are listed in Table 1. ___________________________ Insert Table 1 about here ___________________________ The classroom-level file was created from the student-level file by aggregating the data by classroom and school identification code. Additional data regarding teacher Reading Renaissance 9 certification status was added to the aggregated file based on data from Renaissance Learning customer records. The explanatory variables were created similarly by taking classroom-level averages (aggregates) of the student data. Separate HLM analyses were conducted for the following three clusters of grades: 2nd to 5th graders, 6th to 8th graders, and 9th to 12th graders. Measures Dependent variable. The dependent variable in the multilevel analysis was the normal curve equivalent (NCE) posttest score obtained from the STAR Reading test. NCEs are normalized percentile scores with a population mean of 50 and standard deviation of 21.06. The STAR Reading test is a nationally normed standardized computer-adaptive test developed by Renaissance Learning with test-retest reliabilities, by grade, ranging from 0.70 to 0.91, and an overall reliability of 0.94. The scores on the STAR reading achievement test in the present study ranged from an NCE of 1 to a maximum of 99. Student-level variables. Five student-level variables, summarized in Tables 2 to 4 for each cluster of grade-levels analyzed, were of interest in this study. Two explanatory variables were continuous: (a) actual number of words read (calculated from the actual number of words in books that students read for which they also passed the quiz), and (b) NCE pretest scores from the fall STAR Reading achievement test. The natural logarithm of the variable “actual number of words read” was taken to normalize the distribution of the variable. ___________________________ Insert Tables 2-4 about here Reading Renaissance 10 ___________________________ The remaining three explanatory variables, grade-level, percent correct, and zone of proximal development range (ZPD), were categorical. After the student-level file was split into the three grade-level clusters, the variable grade was dummy coded using the lowest grade in the cluster as the reference category. The percent correct variable was developed through a two-step process. First, we calculated each student’s average score based on all quizzes attempted by the student between the pre- and posttest. If the percent correct was greater than or equal to 85%, then the variable was coded as “1” and if the percent correct was less than 85% it was coded as “0.” The ZPD variable was created from two variables: each student’s average book level read and STAR pretest grade equivalent score. These two scores were used to create book level ranges called ZPD ranges. Each ZPD included a range of book levels, suggested by the developer, at which students should be able to read independently while being challenged enough but without reaching a state of frustration that will block a healthy motivation level. See Appendix for the specific ZPD ranges corresponding to each pretest grade equivalent score. According to the ZPD range, students were coded as (1) low ZPD or low challenge, (2) optimum ZPD or optimum challenge, or (3) high ZPD or high challenge. For instance, if a student obtained a pretest grade-equivalent score from 0.0 to 1.4 then s/he had an “optimum” ZPD or challenge range of 1.0 to 2.0. If the student’s actual average book level assigned was between this range, then a code of 2, optimum challenge, was assigned. If the actual book level assigned was below 1.0, then a code of 1, or low challenge, was assigned, and if it was above 2.0 it was coded as 3, meaning Reading Renaissance 11 high ZPD or high challenge. A series of three dummy codes represented each category, low, optimum, and high. In our analyses, the optimum category was treated as the omitted and referent category. Classroom-level variables. Most of the classroom-level predictor variables were obtained by aggregating (averaging) student-level variables by classroom. Six variables were of interest: (a) classroom mean STAR Reading NCE pretest score; (b) percentage of students in the classroom with average quiz scores greater than 85%; (c) classroom mean of log of number of words read; (d) number of students in the classroom; (e) percentage of students in the classroom receiving assignments at the optimum reading challenge or ZPD; and (f) the RR teacher’s master certification status. A summary of these variables is also presented in Tables 2 to 4. Two variables, the number of students in the classroom and the RR teacher’s certification status, were obtained from the classroom-level data created by Renaissance Learning. The teacher certification variable was dummy coded so that master certification equaled 1 and all other non-master certified teachers were coded as 0. Procedure Using a hierarchical linear modeling (HLM) approach (Raudenbush & Bryk, 2002), the analyses distinguished between student-level and classroom-level effects of a greater amount of reading, a high reading success rate, and use of reading material that is appropriately matched to students’ abilities. The analysis helped us understand how much variation in achievement is at the classroom versus student level and how much variation can be explained using both student and classroom-level predictors of the outcome. This information allowed us to identify more clearly the effects that quality Reading Renaissance 12 classroom-level implementations of the RR tenets had on students’ test score gains that were above and beyond the student-level effects of their own reading behaviors. In addition, the analyses examined aptitude-by-treatment (ATI) interaction effects by assessing whether higher-quality guided independent reading implementations closed the within-classroom achievement gaps between students who scored higher and lower on the baseline pretest. In these ways, our analyses assessed the extent to which the model practices of RR were associated with both excellence and equity in students’ achievement outcomes. We developed three separate series of two-level HLM models for: the elementary school sample (grades 2-5), middle school sample (grades 6-8), and high school sample (grades 9-12). In each case, student performance on the spring reading test was the outcome predicted by students’ fall pretest score, grade level, number of words read, the dummy code indicating that the child’s average percent correct was above 85 on the STAR quizzes, and two dummy codes indicating that the assigned reading material was, on average, below or above the ability indicated by the student’s baseline pretest level. At the second level of the model, we used the classroom-level aggregates of four student variables to form the classroom average pretest score, number of words read, the classroom-mean percent correct above 85 percent correct, and the percentage of students in the class assigned reading material at the optimum ability level. In addition, the class size was used as a covariate and the teacher’s status as a master RR teacher was used as an additional measure of high-quality implementation. For each of the three groups we studied, elementary, middle, and high school, we began by specifying an unconditional multilevel model, with no student or classroom Reading Renaissance 13 predictors of the reading achievement outcome. This model decomposed the variance in the outcome into its within- and between-classroom components and provided an upperbound estimate of the proportion of variability in reading achievement that can be explained by differences across classrooms. The second set of multilevel models that we estimated introduced the fall pretest and the students’ specific grade levels as covariates to predict achievement. These models estimated inequalities in students’ outcomes and accounted for within-school variability that was associated with students’ initial achievement and grade level. After assessing the variability in achievement associated with these student-level characteristics, the third model turned to measures of students’ individual reading behaviors as predictors of achievement. These models estimated the relationships between achievement and the number of words a student read, whether the child attained the 85% correct standard on STAR quizzes, and whether each student was assigned reading material below or above his/her optimum ability level, after controlling for pretest and grade level. In the fourth model, along with the student-level predictors included in the third model, we modeled the classroom-level mean pretest score and the class size as predictors of between-classroom variability in the outcome. Fifth, after controlling for the class size and the classroom-level average pretest score, we included the classroomlevel aggregates of student reading behaviors: classroom mean words read; classroom percentage of students achieving at or above the 85% correct standard; and percentage of students in the classroom assigned reading material that is at their optimum reading level. These variables, along with an indicator of whether or not the classroom teacher was a Reading Renaissance 14 master RR instructor that was added in the sixth and final model, provided classroomlevel measures of the quality of the RR implementation. In models 5 and 6, we also used this collection of implementation measures to model classroom-to-classroom differences in the within-school pretest-posttest slope. That is, did better implementations of the program help attenuate the relationship between pretest and posttest and, thus, improve equality of educational outcomes within classrooms? Results Outcomes for Students and Classrooms from Grades 2-5 Tables 5 and 6 display the maximum likelihood (ML) results for grades 2-5 starting with Model 1, the empty model, to Model 6. The first analytical model, the empty or unconditional multilevel model with no student- or classroom-level predictors, shows the overall average value on the outcome measure. This model partitions the variance in the outcome into its between and within classroom components and tests whether there is a statistically significant amount of between-classroom variance to model with independent variables. For the reading achievement outcome, the unconditional model yielded an average score of 51.78 NCEs. In other words, students were, on average, scoring just above the national average NCE, or the percentile score of 50. The intraclass correlation coefficient, which represents the proportion of variance in the outcome that is between classrooms was 0.26. This result indicated that 26% of the variance in reading achievement was between classrooms and that there was a statistically significant, χ2 (1337, N = 1338) = 9,269.85, amount of variability in the posttest outcome potentially explainable by classroom-level characteristics. ___________________________ Reading Renaissance 15 Insert Tables 5-6 about here ___________________________ Our next steps involved attempting to control pretest reading achievement and the grade level of the child. With the exception of the pretest predictor, our HLM models treated all other student-level indicators and measures as fixed slopes. That is, it was assumed that the effect of most student-level predictors was homogeneous across classrooms. We chose this model due to both practical and theoretical considerations. From a practical standpoint, classrooms typically served only a single grade level, 2, 3, 4, or 5. Having no variability on these student-level grade indicators for the vast majority of classrooms, it was not possible to model these grade level indicators as sources of random variation within classrooms. In addition, from an analytical and theoretical perspective, the primary focus of the current study was on the sources of between-school differences in mean verbal achievement rather than on processes of within-school achievement differentiation. The one exception was, of course, the within-classroom inequalities associated with students starting the academic year at a higher or lower pretest level. In the initial prediction model, Model 2, the pretest and grade level explained 61.70% of the between-classroom variance. Therefore, the student-level predictors did account for considerable between-classroom variability, but a statistically significant amount of between-classroom variability remained even after controlling for these variables. All of the variables were statistically significant predictors of reading achievement. On average, students beginning the academic year during the fall with Reading Renaissance 16 higher test scores obtained posttest scores that were also higher. For every one point increase on the pretest, students showed a 0.74 point increase on the posttest. After controlling for pretest, students from the second grade tended to outperform students from the higher elementary grades. The HLM results also showed that there were statistically significant, χ2 (df=1337, N = 1337) = 254.08, differences across schools in the relationship between pretest and posttest. These results provided evidence that the distribution of achievement varied across schools. That is, some schools were more equitable and some were less equitable with respect to the relationship between students’ pretest scores and their subsequent posttest outcomes. Model 3 introduced the student-level measures of reading behaviors. All 3 variables were statistically significant predictors of posttest reading achievement. Achieving the 85% correct standard was related to higher posttest scores. The difference between a student scoring at or above 85% correct and a student performing below the standard was associated with a nearly 4 NCE point difference. Also, the difference between a student who read a considerable amount of text and a student who read only an average amount of text (i.e., a 2 standard deviation difference on the actual number of words read variable) was equivalent to an advantage of over 10 NCE points for the student who read more text. Finally, after controlling for the other variables in the model, students who were assigned readings that were, on average, below their optimum challenge level tended to perform 3.19 NCEs lower on the posttest than those students who were assigned readings at their optimum level. When students were assigned books that were above their optimum reading level, students tended to score 3.65 NCEs higher Reading Renaissance 17 on the reading posttest than students who were reading at their optimum level. These reading behaviors explained nearly 5% of additional within-classroom variability on the posttest. In Model 4 in Table 6, we added the classroom-level mean pretest score and class size as predictors of differences in the true classroom means. Both variables were statistically significant predictors of posttest. For each one point increase on the mean classroom-level pretest, classrooms were predicted to gain an additional 0.88 points on the posttest. Students attending larger classes actually tended to achieve better posttest outcomes than students attending smaller classes, though the result was not of considerable practical significance. For each additional 10 students in the class, the classrooms attained mean posttest scores that were 0.80 points higher. Together, the classroom-level pretest and the number of students within the classroom explained over 86% of the variance that lied between classrooms. Next, in Model 5, we included the mean words read, percentage of students reading at the optimum level, and the percentage of students achieving 85% correct or greater as predictors of the classroom mean achievement outcome and the NCE pretest slope. For classroom mean achievement, the number of words read and the percent correct greater than 85% were the two classroom-level indicators to attain statistical significance. The outcome for the number of words read variable suggested that classrooms that engaged in a considerable amount of reading (i.e., 2 standard deviations above the mean for the classroom-mean log number of words read) outperformed classrooms that engaged in an average amount of reading by approximately 4.5 NCE points. Also, the HLM predicted a 1.8 NCE point difference between a classroom with Reading Renaissance 18 100% of its students attaining the 85% correct standard and a classroom in which no students met the standard. After controlling for these variables, the number of students in the classroom was no longer a statistically significant predictor of reading achievement. For the other primary outcome of Model 5, the NCE pretest slope, this model indicated a statistically significant relationship between classrooms that produced a greater amount of reading and the widening of the achievement gap between initially lower and higher achieving students. However, this relationship was of little practical significance. Classrooms that were two standard deviations above the mean on the classroom-mean log number of words read variable widened the achievement gap by only a fraction of an NCE point, 0.04 NCEs. Finally, Model 6 added the indicator variable that distinguished master RR teachers from those who had not achieved this status. Relative to the non-master teacher, master teachers’ classrooms scored nearly 1 NCE higher, even after taking into account the other classroom-level indicators of high-quality RR implementation. Outcomes for Students and Classrooms from Grades 6-8 The estimated coefficients from each of the six two-level models for the middle school grades, 6 to 8, are shown in Tables 7 and 8. The unconditional model, Model 1 in Table 7, yielded an expected mean reading test score for all classrooms of 44.58 NCEs. Therefore, the group of 6th to 8th graders in this sample, on average, scored somewhat below the national average percentile score of 50. The intraclass correlation, or measure of dependence between level-2 (classroom) units, was 0.38. That is, 38% of the total variance in reading test scores for this sample was associated with classrooms as opposed to individuals. This amount of variability was statistically significant, χ2 (df= 415, N = Reading Renaissance 19 416) = 4490.77. In this case, we rejected the null hypothesis that mean test scores of students from all classrooms under study were equal, and concluded that there was significant variability in means across classrooms to justify a multilevel analysis. ___________________________ Insert Tables 7-8 about here ___________________________ Controlling for pretest reading achievement and the grade level of the child in Model 2, helped explain 59.72% of the total within-classroom variance. However, the χ2 test of the level-2 residual variance revealed that even after controlling for these variables, a significant amount of between-classroom variance remained unexplained. The grade level dummy variables were not statistically significant predictors of reading achievement. NCE pretest score, however, was statistically significant. Students beginning the academic year during the fall with higher pretest scores, on average, also performed better on posttest scores. For the middle school group, a one-point increase on the pretest scores was associated with a 0.77 point increase in the posttest reading score. In addition, the test statistic for the variance component estimate for the pretest slope was statistically significant, χ2 (df = 413, N = 416) = 496.79, indicating variation in pretest achievement across all classrooms. In Model 3, we introduced the student-level indicators of reading achievement progress: the 85% correct indicator; the actual number of words read; and the two dummy indicators for ZPD, or challenge: low challenge and high challenge. Among the added predictors, several statistically significant effects emerged. Having an average percent correct greater than 85% had a positive association with posttest scores. After Reading Renaissance 20 controlling for the other student-level characteristics, students who met the standard held a 3.34 NCE point advantage over students who did not meet the 85% standard. The coefficient estimate for the number of words read indicated that the difference between a student who read a considerable amount of text and a student who read only an average amount of text (i.e., a 2 standard deviation difference on the actual number of words read variable) was equivalent to an advantage of 4.33 NCE points favoring the student who read more text. Students who were assigned average book levels that were below their optimum challenge or ZPD level tended to score approximately 2.59 NCE units lower on the posttest compared to students who were reading books at an optimum challenge level. On the other hand, students reading books above their optimum challenge or ZPD level, on average, tended to score 1.74 NCE units higher on the posttest than students reading books at their optimum challenge level. Adding these individual-level measures of reading achievement progress accounted for an additional 1.73% of the initial withinclassroom variation in posttest scores. Having established that there was statistically significant across-classroom variation, both in term of average posttest scores and the pretest slopes, Models 4 to 6 in Table 8 were developed to account for this variation. In Model 4, we first added two level-two covariates: the classroom mean pretest score and the number of students in the classroom. Classroom mean pretest score was a statistically significant predictor of reading achievement for the middle-school sample. On average, for each unit increase in classroom mean pretest score, middle school classrooms were expected to gain 0.93 NCE points on the posttest. Model 4 accounted for approximately 97% of the initial betweenclassroom variance in posttest scores. Reading Renaissance 21 In the next model, Model 5, we included three more level-2 variables, the mean number of words read, percentage of students reading at the optimum level, and the 85% correct indicator for the classrooms, as predictors of both the classroom mean achievement outcome and the NCE pretest slope. For classroom-level mean achievement, the classroom-level 85% correct variable was the one statistically significant predictor. The model predicted a 3.08 NCE point difference between a classroom with 100% of its students attaining the 85% correct standard and a classroom in which no students met the standard. None of these variables, though, were statistically significant predictors of the NCE pretest slope. This model helped explain only an additional 0.65% of the total between-classroom variation. Finally, Model 6 introduced a dummy variable that distinguished teachers who had received RR master status from those who had not achieved this status. After controlling for the other student- and classroom-level indicators of high-quality RR implementation, master teachers’ classrooms for the middle school sample scored over 2 NCE units higher on reading achievement than the non-master teachers’ classrooms. Adding this variable to the model accounted for an additional 0.17% of the total betweenclassroom variance in posttest scores. Outcomes for Students and Classrooms from Grade 9-12 The ML estimates from Model 1 to Model 6 for the high-school sample (grades 9 through 12) are displayed in Tables 9 and 10. The unconditional model for the high school sample produced an average reading achievement score of 47.50 NCEs, which is slightly below the national average NCE of 50. The estimate of the intraclass correlation, revealed that 28% of the total variance in reading achievement resided between Reading Renaissance 22 classrooms. This variance was statistically significant, χ2 (df= 349, N = 350) = 2302.22, indicating the need to model meaningful classroom-level properties that would help explain sources of variability. ___________________________ Insert Tables 9-10 about here ___________________________ With the exception of the dummy variable “Grade 12,” all predictors in Model 2 were statistically significant. On average, students with higher NCE pretest scores obtained NCE posttest scores that were also higher. For every one point increase on the pretest, students showed a 0.75 point increase on the posttest. After controlling for the pretest, students from the 10th and 11th grade tended to outperform students from the referent 9th grade level. This model accounted for approximately 56% of the initial within-classroom variance in NCE scores on the STAR Reading test. Additionally, statistically significant variation across classrooms both in terms of the intercept, or average test scores, (χ2 = 4895.01, df= 347), and the pretest slope, (χ2 = 464.68, df= 347), lead us to reject the hypothesis of constant distribution of achievement across classrooms. The next model, Model 3, introduced the three student-level measures of reading behaviors. All of these variables proved to be statistically significant predictors of posttest outcomes. First, the difference between a student scoring at or above 85% correct and a student performing below 85% correct was associated with a 3.17 NCE point difference. Second, the difference between a student who read a large amount of text and a student who read an average amount of text (i.e., a 2 standard deviation difference on the actual number of words read variable) was equivalent to an advantage Reading Renaissance 23 of 2.66 NCE points. Third, after controlling for the other variables in the model, students who were assigned readings that were, on average, below their optimum challenge level tended to perform over 3 NCEs lower on the posttest than those students who were assigned readings at their optimum level. Finally, when students were assigned books that were above their optimum reading level, students tended to score 1.83 NCEs higher on the posttest than students who were reading at their optimum level. These reading behaviors explained 1% of additional within-classroom variability on the posttest. The level-2 predictor variables introduced in Models 4 to 6 in Table 10 attempted to account for the significant variation in variance components across classrooms in average test scores and the pretest slope. First, in Model 4, we added the classroom-level mean pretest score and class size as predictors of differences in expected classroom means. Only classroom mean pretest score was a statistically significant predictor of the reading achievement posttest. The coefficient estimate indicated that, on average, a oneunit increase in a classroom’s mean pretest score was associated with a 0.93-point increase in its posttest reading achievement score. Model 4 accounted for approximately 94% of the initial variance in posttest scores that lied between classrooms. In Model 5, we included the mean number of words read, percentage of students reading at the optimum level, and the percentage of students achieving 85% correct or greater as predictors of both the classroom mean achievement outcome and the NCE pretest slope. For classroom mean achievement, the average optimum reading challenge and the percentage of students achieving 85% correct or greater both reached statistical significance. After controlling for all the other variables in the model, students in classrooms with a higher average number of students reading books at their optimum Reading Renaissance 24 challenge or ZPD level, showed greater improvement in their reading skills compared to students in classrooms in which this was not the case. If one compared a class in which all students were challenged at the optimum level to a class in which none of the students were challenged at the optimum level, the model predicted a difference of 6.69 NCE points. In addition, classrooms in which 100% of the students were achieving 85% or greater on the AR quizzes outperformed classrooms in which none of the students were meeting the 85% standard by 4.21 NCEs. Examining the estimates for the pretest slope, or the average within-classroom relationship between students’ pretest and posttest outcomes, the results indicated that classrooms in which a higher percentage of students were reading at the optimum level tended to close the achievement gap between initially lower and higher achieving students. Also, classrooms with a higher percentage of students attaining the 85% correct standard tended to widen the achievement gap between initially lower and higher achieving students. However, as the results in Model 6 revealed, this relationship was no longer statistically significant in the final model, which controlled teachers’ master certification status. In the final model, Model 6, the indicator variable that distinguished master RR teachers from those who had not achieved this status was introduced. Interestingly, master certification status was not a significant predictor of reading achievement for the high school sample. However, the sample of master teachers at the high-school level was so small that this coefficient was estimated with considerable error. The final model for the pretest slope continued to show that classrooms with a higher percentage of students reading at the optimum challenge closed the achievement gap. However, as stated above, Reading Renaissance 25 after controlling for master teacher status the percentage of students attaining 85% correct on AR quizzes was no longer a statistically significant predictor of the relationship between pretest and posttest across the classrooms in the sample. Discussion The results of our analyses show that student performance on a spring reading achievement test is predicted by both students’ individual reading behaviors, which are shaped by the guided independent reading program, and by classroom-level features of the RR program implementation. First, we find that even after using rigorous statistical controls for the students’ initial reading ability levels, their reading success rate, and their reading challenge level, the amount of text that an elementary or middle-school child reads is a key predictor of his or her literacy development over the school year. This result provides further support for the intuitive, yet often disputed, idea that regardless of a child’s initial ability level, more reading is associated with greater learning. Also consistent with the RR program theory, the student-level results suggest that a high success rate over the course of the school year predicts better outcomes at the end of the year. This finding is consistent across all samples, the elementary, middle-school, and high-school groups. In contrast to the general theory of the model, though, after controlling for students’ baseline scores, number of words read, and reading success rate, students who were assigned reading material that was, on average, beyond their baseline ability performed better on the posttest than did students who were assigned material within their optimum reading range. Consistent with the theory, though, students who were assigned material below their optimum reading range performed worse on the outcome than did students who read material that tended to be within the optimum range. Reading Renaissance 26 This result suggests that if students’ success rates are not suffering, teachers should modify their plans and assign material to students that is above their apparent baseline ability. In this respect, the finding supports suggestions provided to teachers by RR to adjust book levels if the suggested optimum range appears to be too easy or difficult for the student. This result was relatively uniform across all three groups—elementary, middle school, and high school—we studied. As much as 38% of the variance in the reading achievement outcome was attributable to the classroom level. That is, beyond the individual behaviors of students, as much as 38% of the variability in achievement is associated with differences across RR classrooms. In the elementary grades, RR classrooms that effectively encouraged a greater overall amount of guided independent reading and that maintained a high success rate showed statistically significant improvements in the overall achievement level of the classroom. Beyond these results, there were statistically significant positive effects for classrooms taught by master RR teachers. This finding held for both the elementary and middle-school samples. Even after taking into account students’ initial abilities and their individual reading behaviors along with the range of classroom-level measures of highquality implementation, these teachers brought to bear additional skills that promoted student learning beyond the levels achieved by non-master teachers. These abilities are likely ones that go beyond assuring that students are reading quality assignments and are experiencing a high success rates while reading an array of literature. These skills may include adept diagnosis of students’ reading problems, high-quality feedback, skilled use Reading Renaissance 27 of incentives, and effectively teaching students appropriate strategies to help them become better readers. In the middle and high-school grades, improved posttest outcomes were encouraged by those teachers who helped ensure that a higher percentage of students in their classrooms were achieving the 85% correct standard. Among high school classrooms, it was also the case that teachers who ensured that more of their students were reading at the optimum reading level achieved a statistically significant and positive difference on the posttest relative to teachers who maintained an optimum level for fewer of their students. In contrast to the elementary school and middle school outcomes, master teachers did not achieve outcomes that were different from those of non-master RR teachers. The reason for this latter finding, though, was simply due to the fact that only 1% of the high school teachers held a master certification and the outcomes for this small group were estimated with considerable error. Teachers must be sure that all students are moving ahead productively and that lower-performing students are not falling behind in otherwise successful classrooms. This point was raised by the result found for high school classrooms. Specifically, those classrooms in which a higher percentage of students were receiving assignments targeted toward their optimum reading challenge level closed the achievement gaps separating students who entered the RR classroom with lower and higher initial reading abilities. Overall, these results provide support for the RR guided independent reading program theory. High-quality implementations of the program do appear to make a difference, especially when the program is offered by a skilled master RR teacher. Further, in the early grades, greater exposure to literature supports improved achievement Reading Renaissance 28 and, in the later grades, a greater reading success rate makes a positive difference. These findings have important implications for the RR guided independent program and for future efforts to develop teachers’ skills in implementing the program and in promoting improved student learning. Reading Renaissance 29 References Allington, R. L. (1984). Content coverage and contextual reading in reading groups, Journal of Reading Behavior, 16, 85-85. Allington, R.L. (2001). What really matters for struggling readers: Designing researchbased programs. New York: Longman. Beck, I. L. & MacKeown, M. G. (1991). Social studies texts are hard to understand: Mediating some of the difficulties. Language Arts, 68, 482-490. Betts, E. A. (1946). Foundations of reading instruction with emphasis on differentiated guidance. New York: American Book Company. Cunninghan, P. M. & Cunninghan, J. W. (2002). What we know about how to teach phonics. In Farstrup, A. E. & Samuels, S. J. (Eds), What research has to say about reading instruction. Newark, DE: International Reading Association. Cunningham, A. E. & Stanovich K. E. (1998). What reading does for the mind. American Educator, 22(1&2), 8-15. Din, F.S. (1998, March). Use Direct Instruction to quickly improve reading skills. Paper presented at the Annual National Conference on Creating the Quality School. Arlington, VA. Gickling, E. E. & Armstron, D. L. (1978). Levels of instructional difficulty as related ontask behavior, task completion, and comprehension. Journal of Leaning Disabilities, 1, 559-566. Graves, M.J., Juel, C., & Graves, B. (1998). Teaching reading in the 21st century. Boston: Allyn and Bacon. Reading Renaissance 30 Ivey G. & Broaddus, K. (2001). Just plain reading: A survey of what students want to read in middle school classrooms. Reading Research Quarterly, 36, 350-377. Lewis, M. & Samuels, S. J. (2003). Read more--read better? A meta-analysis of the literature on the relationship between exposure to reading and reading achievement. Minneapolis, MN: University of Minnesota. Available online at http://www.tc.umn.edu/~samue001/publications.htm. Marliave, R. S. & Filby, N. (1985). Success rate: A measure of task appropriateness. In Fisher, C. W. & Berliner, D. C. (Eds.), Perspectives on instructional time. White Plains, NY: Longman. National Institute of Child Health and Human Development. (2000). Report of the National Reading Panel. Teaching children to read: An evidence-based assessment of the scientific research literature on reading and its implications for reading instruction (NIH Publication No. 00-4769). Washington, DC: U.S. Government Printing Office. Paul, T. (2003). Guided independent reading: An examination of the Reading Practice Database and the scientific research supporting guided independent reading as implemented in Reading Renaissance. Wisconsin Rapids, WI: Renaissance Learning, Inc. Rosenshine, B. & Stevens, R. (1984). Classroom instruction in reading. In Pearson, P. D. (Ed.), Handbook of reading research (pp. 745-798). New York: Longman. Slavin, R.E., , Karweit, N. L. & Madden, N. A. (1989). Effective programs for students at risk. Boston: Allyn & Bacon. Vygotsky, L.S. (1978). Mind in Society. Cambridge, MA: Harvard University Press. Reading Renaissance Yu, V. (1993). Extensive reading programs: How can the best benefit the teaching and learning of English? TESL Reporter, 26, 1-9. 31 Reading Renaissance 32 Table 1 Student Race/Ethnicity and Gender for the Analytic Sample by Grade Level. Grade Race Gender Asian Male Female Total African American Male Female Unknown Total Hispanic 2 3 4 5 6 7 8 9 10 11 Total 12 6 6 12 13 8 21 8 6 14 8 9 17 5 6 11 4 3 7 6 9 15 12 13 25 9 6 15 5 4 9 0 5 5 76 75 151 231 202 0 433 283 265 4 552 191 208 0 399 153 158 1 312 102 83 5 190 80 81 1 162 54 73 0 127 67 55 1 123 41 35 0 76 26 27 0 53 15 23 0 38 1,243 1,210 12 2,465 Male 87 84 73 82 154 65 106 186 46 32 30 945 Female 85 69 58 66 146 65 98 176 42 48 21 874 Unknown Total 0 2 0 1 0 0 0 0 0 0 0 3 172 155 131 149 300 130 204 362 88 80 51 1,822 2 5 7 2 1 3 3 9 12 1 4 5 3 1 4 38 40 78 4 3 7 4 7 11 4 3 7 2 1 3 2 1 3 65 75 140 Native American Male Female Total White Male Female Unknown Total 676 626 2 1,304 732 628 4 1,364 576 545 1 1,122 441 379 0 820 225 266 0 491 194 175 1 370 297 301 0 598 330 295 0 625 314 296 0 610 237 228 1 466 163 159 0 322 4,185 3,898 9 8,092 Unknown Male 2,108 2,023 1,999 1,784 1,488 990 711 518 366 291 279 12,557 Female 1,850 1,716 1,716 1,558 1,362 848 694 520 339 300 292 11,195 Unknown 1,746 1,475 1,408 1,140 980 968 262 256 166 142 143 8,686 Total 5,704 5,214 5,123 4,482 3,830 2,806 1,667 1,294 871 733 714 32,438 7,632 7,309 6,801 5,785 4,826 3,553 2,618 2,440 1,667 1,344 1,133 45,108 Grand Total Reading Renaissance 33 Table 2 Descriptive Statistics for Student- and Classroom-Level Variables for Grades 2 to 5 (Analytical Sample) Student-Level Descriptive Statistics Variable Name n M SD Min Max NCE Pretest 24,925 48.98 21.46 1 99 NCE Posttest 24,925 52.27 20.78 1 99 Average Percent Correct Above 85 24,925 0.69 0.46 0 1 Actual Number of Words Read 24,925 358,038.75 409,091.90 0 5,750,622.00 Log of Actual Number of Words 24,925 12.16 1.30 0 15.56 Read Low Challenge 24,925 0.04 0.19 0 1 High Challenge 24,925 0.26 0.44 0 1 Grade 3 24,925 0.27 0.44 0 1 Grade 4 24,925 0.25 0.43 0 1 Grade 5 24,925 0.21 0.41 0 1 Classroom-Level Descriptive Statistics Classroom-Mean Number of Words 1,338 335,044.66 266,566.89 1,019.31 1,648,440.63 Read Classroom-Mean of Log of Number 1,338 12.07 1.07 5.48 14.27 of Words Read Percent Optimum Reading 1,338 0.70 0.19 0 1 Challenge Classroom-Mean Pretest Score 1,338 48.16 11.83 3.42 79.88 Classroom-Mean Percent Correct Above 85 1,338 0.69 0.22 0 1 Master Certification 1,338 0.23 0.42 0 1 Number of Students in Classroom 1,338 18.71 4.58 5 44 Reading Renaissance Table 3 Descriptive Statistics for Student- and Classroom-Level Variables for Grades 6 to 8 Analytical Sample Student-Level Descriptive Statistics n M SD Variable Name NCE Pretest 8,402 46.88 21.63 NCE Posttest 8,402 46.24 21.26 Average Percent Correct Above 85 8,402 0.57 0.50 Actual Number of Words Read 8,402 614,000.13 542,076.04 Log of Actual Number of Words Read 8,402 12.78 1.57 Low Challenge 8,402 0.04 0.21 High Challenge 8,402 0.14 0.34 Grade 7 8,402 0.32 0.47 Grade 8 8,402 0.22 0.41 Classroom-Level Descriptive Statistics Classroom-Mean Number of 416 Words Read 573,595 340,260.89 Classroom-Mean of Log of Number of Words Read 416 12.59 1.39 Percent Optimum Reading Challenge 416 0.79 0.20 Classroom-Mean Pretest Score 416 44.80 15.22 Classroom-Mean Percent Correct Above 85 416 0.55 0.26 Master Certification 416 0.09 0.29 Number of Students in Classroom 416 20.38 11.23 Min Max 1 1 99 99 0 0 1 5,641,805 0 0 0 0 0 15.55 1 1 1 1 1,096.50 2,092,026.75 1.69 14.45 0 1 1 84.17 0 0 1 1 5 135 34 Reading Renaissance Table 4 Descriptive Statistics for Student- and Classroom-Level Variables for Grades 9 to 12 Analytical Sample Student-Level Descriptive Statistics n M SD Variable Name Min Max NCE Pretest 5,817 48.38 21.82 1 99 NCE Posttest 5,817 48.18 21.99 1 99 Average Percent Correct Above 85 5,817 0.28 0.45 0 1 Actual Number of Words Read 5,817 312,696.97 337,908.23 0 4,022,247 Log of Actual Number of Words Read 5,817 11.49 2.96 0 15.21 Low Challenge 5,817 0.06 0.24 0 1 High Challenge 5,817 0.11 0.31 0 1 Grade 3 5,817 0.24 0.43 0 1 Grade 4 5,817 0.20 0.4 0 1 Grade 5 5,817 0.18 0.39 0 1 Classroom-Level Descriptive Statistics Classroom-Mean Number of Words Read 350 304,267.78 183,183.90 8,832.14 925,142.69 Classroom-Mean of Log of Number of Words Read 350 11.41 1.44 5.71 13.66 Percent Optimum Reading Challenge 350 0.82 0.17 0 1 Classroom-Mean Pretest Score 350 47.47 13.44 1 78.23 Classroom-Mean Percent Correct Above 85 350 0.28 0.17 0 1 Master Certification 350 0.01 0.08 0 1 Number of Students in Classroom 350 16.7 5.64 5 34 35 Reading Renaissance 36 Table 5 HLM Models for Predicting Posttest NCE Reading Achievement: Grades 2 to 5 Sample Effect Effects of Within Classroom Variables NCE Posttest NCE Pretest Grade 3 Grade 4 Grade 5 Average Percent Correct Above 85 Actual Number of Words Read Low Challenge High Challenge Effects of Between Classroom Variables Classroom-Mean Pretest Score Number of Students in Classroom Classroom Average Daily Engaged Reading Time Classroom Mean Percent Correct Above 85 Average Optimum Reading Challenge Master Certification NCE Pretest Slope Model 1 se ***51.78 0.32 t 164.13 Effect ***54.83 ***0.74 ***-3.36 ***-4.04 ***-6.57 Model 2 se 0.56 0.005 0.67 0.81 0.85 t Effect 97.24 152.39 -5.01 -5.02 -7.70 ***55.22 ***0.65 ***-3.90 ***-4.32 ***-7.51 ***3.95 ***4.07 ***-3.19 ***3.65 df Estimate Model 3 se 0.54 0.01 0.64 0.79 0.81 0.17 0.14 0.47 0.21 t 101.98 93.94 -6.09 -5.47 -9.23 22.60 28.22 -6.86 17.36 Classroom Average Daily Engaged Reading Time Classroom Average Percent Correct Average Optimum Reading Challenge Professional Development Sessions Master Certification Variation Between Classrooms Intercept NCE Pretest Slope Variation Within Classrooms Model Statistics Percentage of Parameter Variance Explained Intra-class Correlation 2 Within-Classrooms: R Between-Classrooms: R2 Note: * p < .05; ** p < .01; *** p < .001 Estimate χ2 114.45 ***9269.85 328.23 df 1337 Estimate χ2 123.64 ***22960.07 0.01 ***2042.23 125.71 1337 1337 124.69 ***26007.69 0.01 ***2044.41 110.94 25.85% 61.70% - χ2 66.20% - df 1337 1337 Reading Renaissance 37 Table 6 HLM Models for Predicting Posttest NCE Reading Achievement: Grades 2 to 5 Sample (Cont.) Model 4 Effect Effects of Within Classroom Variables NCE Posttest NCE Pretest Grade 3 Grade 4 Grade 5 Average Percent Correct Above 85 Actual Number of Words Read Low Challenge High Challenge Effects of Between Classroom Variables Classroom-Mean Pretest Score Number of Students in Classroom Mean Actual Number of Words Read Average Optimum Reading Challenge Classroom Mean Percent Correct Above 85 Master Certification NCE Pretest Slope Mean Actual Number of Words Read Average Optimum Reading Challenge Classroom Average Percent Correct Above 85 Master Certification Variation Between Classrooms Intercept NCE Pretest Slope Variation Within Classrooms Model Statistics Percentage of Parameter Variance Explained Within-Classrooms: R2 2 Between-Classrooms: R Note: * p < .05; ** p < .01; *** p < .001 se Model 5 t Effect se Model 6 t Effect se t ***54.56 ***0.65 ***-3.06 ***-3.84 ***-6.28 ***3.94 ***4.06 ***-3.23 ***3.65 0.31 0.01 0.38 0.38 0.39 0.17 0.14 0.47 0.21 176.19 93.87 -8.03 -10.22 -16.28 22.54 28.08 -6.94 17.33 ***55.80 ***0.65 ***-4.67 ***-5.51 ***-8.44 ***3.93 ***2.09 ***-3.06 ***3.69 0.33 0.01 0.39 0.45 0.50 0.18 0.20 0.47 0.21 170.00 92.17 -11.88 -12.37 -16.81 22.34 10.58 -6.49 17.29 ***55.74 ***0.65 ***-4.61 ***-5.40 ***-8.29 ***3.93 ***4.10 ***-3.06 ***3.69 0.33 0.01 0.39 0.45 0.51 0.18 0.14 0.47 0.21 169.32 92.19 -11.69 -12.09 -16.41 22.34 24.47 -6.49 17.29 ***0.88 **0.08 0.01 0.03 67.85 2.87 ***0.79 0.03 ***2.09 -1.40 0.02 0.03 0.20 0.82 46.08 1.18 10.58 -1.71 ***0.79 0.03 ***2.03 -1.41 0.02 0.03 0.20 0.82 46.26 1.24 10.14 -1.73 **1.83 0.69 2.64 *1.39 **0.87 0.68 0.31 2.04 2.81 **0.02 0.01 3.01 **0.02 0.01 3.04 0.002 0.03 0.07 0.002 0.03 0.06 -0.003 0.02 -0.13 0.001 -0.01 0.02 0.01 0.03 -0.50 Estimate 12.14 0.01 110.92 χ2 ***3980.48 ***2028.67 df 1332 1334 Estimate 12.04 0.01 110.92 χ2 ***3956.66 ***2028.11 df 1331 1332 Estimate 15.77 0.01 110.91 χ2 ***4760.50 ***2044.84 df 1335 1337 66.21% 66.21% 66.21% 86.22% 89.39% 89.48% Reading Renaissance 38 Table 7 HLM Models for Predicting Posttest NCE Reading Achievement: Grades 6 to 8 Sample Effect Effects of Within Classroom Variables NCE Posttest NCE Pretest Grade 7 Grade 8 Average Percent Correct Above 85 Actual Number of Words Read Low Challenge High Challenge Effects of Between Classroom Variables Classroom-Mean Pretest Score Number of Students in Classroom Classroom Average Daily Engaged Reading Time Classroom Mean Percent Correct Above 85 Average Optimum Reading Challenge Master Certification NCE Pretest Slope Model 1 se ***44.58 0.70 t Effect 63.62 ***44.71 ***0.77 -0.71 -0.39 df Estimate Model 2 se 0.88 0.01 1.17 1.49 t Effect 50.69 93.01 -0.61 -0.26 ***44.68 ***0.73 -0.71 -0.28 ***3.34 ***1.38 ***-2.59 ***1.74 df Estimate Model 3 se 0.88 0.01 1.17 1.46 0.30 0.17 0.69 0.40 t 50.83 75.03 -0.61 -0.19 11.07 8.31 -3.78 4.39 Classroom Average Daily Engaged Reading Time Classroom Average Percent Correct Average Optimum Reading Challenge Professional Development Sessions Master Certification Variation Between Classrooms Intercept NCE Pretest Slope Variation Within Classrooms Model Statistics Percentage of Parameter Variance Explained Intra-class Correlation 2 Within-Classrooms: R Between-Classrooms: R2 Note: * p < .05; ** p < .01; *** p < .001 Estimate χ2 185.86 ***4490.77 306.46 415 χ2 205.14 ***11004.66 0.01 ***496.79 123.44 413 413 205.59 ***11495.19 0.004 **483.38 118.13 37.75% 59.72% - χ2 61.45% - df 413 413 Reading Renaissance 39 Table 8 HLM Models for Predicting Posttest NCE Reading Achievement: Grades 6 to 8 Sample (Cont.) Model 4 Effect Effects of Within Classroom Variables NCE Posttest NCE Pretest Grade 7 Grade 8 Average Percent Correct Above 85 Actual Number of Words Read Low Challenge High Challenge Effects of Between Classroom Variables Classroom-Mean Pretest Score Number of Students in Classroom Mean Actual Number of Words Read Average Optimum Reading Challenge Classroom Mean Percent Correct Above 85 Master Certification NCE Pretest Slope Mean Actual Number of Words Read Average Optimum Reading Challenge Classroom Average Percent Correct Above 85 Master Certification Variation Between Classrooms Intercept NCE Pretest Slope Variation Within Classrooms Model Statistics Percentage of Parameter Variance Explained Within-Classrooms: R2 2 Between-Classrooms: R Note: * p < .05; ** p < .01; *** p < .001 se Model 5 t Effect se Model 6 t Effect se t ***43.70 ***0.72 **1.33 *0.89 ***3.36 ***1.39 ***-2.57 ***1.68 0.24 0.01 0.41 0.42 0.30 0.17 0.68 0.40 184.31 74.75 3.21 2.11 11.08 8.31 -3.79 4.24 ***43.66 ***0.73 **1.42 *0.91 ***3.35 ***1.38 ***-2.60 ***1.72 0.23 0.01 0.42 0.40 0.30 0.17 0.69 0.41 190.01 68.56 3.45 2.25 11.09 8.22 -3.76 4.16 ***43.64 ***0.73 ***1.51 *0.89 ***3.36 ***1.38 ***-2.60 ***1.73 0.23 0.01 0.41 0.40 0.30 0.17 0.69 0.41 191.77 68.08 3.67 2.22 11.11 8.22 -3.79 4.18 ***0.93 -0.01 0.01 0.01 70.55 -0.75 ***0.90 -0.01 0.03 0.70 0.02 0.01 0.16 1.44 45.30 -1.01 0.20 0.49 ***0.90 -0.01 -0.01 0.73 0.02 0.01 0.16 1.42 43.13 -0.75 -0.06 0.52 ***3.08 0.74 4.17 **2.36 ***2.11 0.75 0.58 3.13 3.67 -0.004 0.01 -0.33 -0.005 0.01 -0.45 0.005 0.06 0.08 0.01 0.06 0.18 -0.01 0.04 -0.20 -0.02 0.05 0.04 0.03 -0.59 1.91 Estimate 5.53 0.004 118.00 χ2 ***777.39 **483.69 df 408 410 Estimate 5.22 0.004 118.10 χ2 ***757.80 ***480.87 df 407 409 Estimate 6.19 0.004 117.97 χ2 ***827.81 **484.62 df 411 413 61.51% 61.50% 61.46% 96.67% 97.02% 97.19% Reading Renaissance 40 Table 9 HLM Models for Predicting Posttest NCE Reading Achievement: Grades 9 to 12 Sample Effect Effects of Within Classroom Variables NCE Posttest NCE Pretest Grade 10 Grade 11 Grade 12 Average Percent Correct Above 85 Actual Number of Words Read Low Challenge High Challenge Effects of Between Classroom Variables Classroom-Mean Pretest Score Number of Students in Classroom Classroom Average Daily Engaged Reading Time Classroom Mean Percent Correct Above 85 Average Optimum Reading Challenge Master Certification NCE Pretest Slope ***47.50 Model 1 se 0.68 t Effect 69.76 ***46.00 ***0.75 ***3.29 **2.96 -0.39 df Estimate Model 2 se 0.89 0.01 0.86 0.99 1.15 t Effect 51.67 66.48 3.82 3.01 -0.34 ***45.92 ***0.73 ***3.22 **3.09 0.01 ***3.17 ***0.45 ***-3.08 **1.83 df Estimate Model 3 se 0.88 0.01 0.82 0.97 1.14 0.39 0.07 0.77 0.67 t 52.23 59.52 3.90 3.20 0.01 8.11 6.53 -3.99 2.74 Classroom Average Daily Engaged Reading Time Classroom Average Percent Correct Average Optimum Reading Challenge Professional Development Sessions Master Certification Variation Between Classrooms Intercept NCE Pretest Slope Variation Within Classrooms Model Statistics Percentage of Parameter Variance Explained Intra-class Correlation 2 Within-Classrooms: R Between-Classrooms: R2 Note: * p < .05; ** p < .01; *** p < .001 Estimate χ2 138.05 ***2302.22 362.92 349 159.38 0.01 160.16 χ2 ***4895.01 ***464.68 347 347 159.11 ***4991.06 0.01 **453.16 156.46 27.65% 55.87% - χ2 56.89% - df 347 347 Reading Renaissance 41 Table 10 HLM Models for Predicting Posttest NCE Reading Achievement: Grades 9 to 12 Sample (Cont.) Model 4 Effect Effects of Within Classroom Variables NCE Posttest NCE Pretest Grade 10 Grade 11 Grade 12 Average Percent Correct Above 85 Actual Number of Words Read Low Challenge High Challenge Effects of Between Classroom Variables Classroom-Mean Pretest Score Number of Students in Classroom Mean Actual Number of Words Read Average Optimum Reading Challenge Classroom Mean Percent Correct Above 85 Master Certification NCE Pretest Slope Mean Actual Number of Words Read Average Optimum Reading Challenge Classroom Average Percent Correct Above 85 Master Certification Variation Between Classrooms Intercept NCE Pretest Slope Variation Within Classrooms Model Statistics Percentage of Parameter Variance Explained Within-Classrooms: R2 2 Between-Classrooms: R Note: * p < .05; ** p < .01; *** p < .001 se Model 5 t Effect se Model 6 t Effect se t ***47.11 ***0.73 *1.19 0.88 *-1.86 ***3.06 ***0.46 ***-2.85 1.31 0.35 0.01 0.52 0.54 0.77 0.39 0.07 0.78 0.67 132.83 59.70 2.30 1.62 -2.41 7.79 6.59 -3.67 1.94 ***46.87 ***0.73 **1.34 **1.41 -1.31 ***3.08 ***0.46 ***-3.05 **1.69 0.34 0.01 0.51 0.54 0.74 0.39 0.07 0.77 0.69 138.54 61.96 2.65 2.61 -1.78 7.82 6.56 -3.99 2.42 ***46.88 ***0.73 *1.34 *1.40 *-1.32 ***3.08 ***0.46 ***-3.05 **1.69 0.36 0.01 0.53 0.59 0.63 0.41 0.07 0.79 0.71 129.23 59.27 2.54 2.38 -2.10 7.49 6.79 -3.88 2.40 ***0.93 0.05 0.02 0.05 49.31 0.98 ***0.84 0.02 0.37 ***6.69 0.03 0.05 0.20 1.66 31.63 0.40 1.87 4.02 ***0.84 0.02 0.37 **6.74 0.03 0.04 0.20 2.00 32.13 0.44 1.84 3.39 **4.21 1.41 2.97 *4.47 -0.83 1.51 3.02 2.83 -0.27 -0.004 0.01 -0.49 -0.004 0.01 -0.42 ***-0.32 0.07 -4.66 ***-0.32 0.08 -3.90 *0.14 0.07 1.97 0.14 0.05 0.08 0.15 1.88 0.32 Estimate 7.45 0.01 156.16 χ2 ***606.24 **428.72 df 342 344 Estimate 7.49 0.01 156.16 χ2 ***605.94 ***428.54 df 341 343 Estimate 8.63 0.01 156.29 χ2 ***661.70 **456.83 df 345 347 56.94% 56.97% 56.97% 93.75% 94.60% 94.57% Reading Renaissance Appendix 42 Reading Renaissance ZPD Ranges by Grade Equivalent Scores Pretest Grade Equivalent Scores .00 .10 .20 .30 .40 .50 .60 .70 .80 .90 1.00 1.10 1.20 1.30 1.40 1.50 1.60 1.70 1.80 1.90 2.00 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 3.50 3.60 3.70 3.80 3.90 Suggested ZPD 1.0 to 2.0 1.5 to 2.5 2.0 to 3.0 2.3 to 3.3 2.6 to 3.6 2.8 to 4.0 43 Reading Renaissance 4.00 4.10 4.20 4.30 4.40 4.50 4.60 4.70 4.80 4.90 5.00 5.10 5.20 5.30 5.40 5.50 5.60 5.70 5.80 5.90 6.00 6.10 6.20 6.30 6.40 6.50 6.60 6.70 6.80 6.90 7.00 7.10 7.20 7.30 7.40 7.50 7.60 7.70 7.80 7.90 3.0 to 4.5 3.2 to 5.0 3.4 to 5.4 3.7 to 5.7 4.0 to 6.1 4.2 to 6.5 4.3 to 7.0 4.4 to 7.5 44 Reading Renaissance 8.00 8.10 8.20 8.30 8.40 8.50 8.60 8.70 8.80 8.90 9.00 9.10 9.20 9.30 9.40 9.50 9.60 9.70 9.80 9.90 10.00 10.10 10.20 10.30 10.40 10.50 10.60 10.70 10.80 10.90 11.00 11.10 11.20 11.30 11.40 11.50 11.60 11.70 11.80 11.90 4.5 to 8.0 4.6 to 9.0 4.7 to 10.0 4.8 to 11.0 45 Reading Renaissance 12.00 12.10 12.20 12.30 12.40 12.50 12.60 12.70 12.80 12.90 13.00 4.9 to 12.0 46
© Copyright 2026 Paperzz