The intelligibility of speech in elementary school classrooms J. S. Bradleya兲 National Research Council, Montreal Road, Ottawa, K1A 0R6, Canada H. Sato Institute for Human Science & Biomedical Engineering, National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Higashi, Tsukuba, Ibaraki 305-8566, JapanII 共Received 16 July 2007; revised 11 November 2007; accepted 8 January 2008兲 This is the second of two papers describing the results of acoustical measurements and speech intelligibility tests in elementary school classrooms. The intelligibility tests were performed in 41 classrooms in 12 different schools evenly divided among grades 1, 3, and 6 students 共nominally 6, 8, and 11 year olds兲. Speech intelligibility tests were carried out on classes of students seated at their own desks in their regular classrooms. Mean intelligibility scores were significantly related to signal-to-noise ratios and to the grade of the students. While the results are different than those from some previous laboratory studies that included less realistic conditions, they agree with previous in-classroom experiments. The results indicate that +15 dB signal-to-noise ratio is not adequate for the youngest children. By combining the speech intelligibility test results with measurements of speech and noise levels during actual teaching situations, estimates of the fraction of students experiencing near-ideal acoustical conditions were made. The results are used as a basis for estimating ideal acoustical criteria for elementary school classrooms. 关DOI: 10.1121/1.2839285兴 PACS number共s兲: 43.55.Hy, 43.71.Gv 关RYL兴 I. INTRODUCTION There has recently been renewed interest in achieving improved acoustical conditions in classrooms. Although there is a general consensus as to acoustical criteria for good classrooms,1 the supporting evidence from studies in actual classrooms is limited. The renewed interest in classroom acoustics is related to our growing understanding of the negative effects of ambient noise and poor room acoustics on children’s ability to learn in schools 共see also Anderson2 for an extensive review兲. There is evidence that increased levels of noise affect memory3 and are associated with decreased reading scores.4–6 It is reasonable to assume that a mechanism to explain the association of decreased educational progress, such as the effect of noise on reading scores, is simply that noise interferes with verbal communication 共especially that between teachers and students兲, which is the predominant mechanism for learning in elementary schools. It is well known that younger children have greater difficulty understanding speech in even modest levels of ambient noise.7–9 In fact several authors have reported results showing that the ability to recognize speech in noise improves systematically with age.7,10,11 Although it is clear that children need quieter conditions and corresponding larger signal-to-noise ratios than adults to achieve high speech recognition scores,8 and that the younger the children, the quieter the conditions should be, the results of the various previous studies reveal large differences and do not agree well with previous results of tests carried out in actual classrooms.12 a兲 Electronic mail: [email protected] 2078 J. Acoust. Soc. Am. 123 共4兲, April 2008 Pages: 2078–2086 Figure 1 compares results from Marshal11 and Elliott7 with previous classroom measurements of speech intelligibility scores versus A-weighted speech–noise level differences, S / N共A兲. Marshal and Elliott’s results are both based on simple word recognition tasks and both indicate large variations in intelligibility scores with the age of the listener. However, when examined in detail, these two sets of results show quite different effects of age and show some much lower intelligibility scores than the previous classroom results for 12 year olds. The previous classroom results were from a 1986 study that included ten classes of 12 year old students12 and used a rhyme test in which subjects identified the correct response of several possible rhyming words. The subjects were seated in their regular seats in their own classrooms and listened to the recorded speech material from a small loudspeaker with directionality similar to a human talker. The differences in Fig. 1 are thought to be due to the laboratory studies using monaural headphone presentation of speech and noise signals, which increases the negative effects of noise and reverberation. Listening naturally with two ears leads to a “binaural advantage” that can make it easier to understand speech in noise. MacKeith and Coles13 measured binaural advantages equivalent to as much as an 18 dB improvement in signal-to-noise ratio for extreme cases in free field conditions. Neuman and Hochberg9 tested 5 to 13 year old children, finding small binaural advantages for all ages in reverberant conditions. They pointed out that the binaural advantages would probably be much larger in lower signalto-noise situations. Nábflek and Robinson14 tested subjects with ages from 10 to 72 years and found an average binaural advantage corresponding to a 5% increase in intelligibility scores in reverberant conditions but without significant masking noise. They did not include the effects of lower 0001-4966/2008/123共4兲/2078/9/$23.00 Downloaded 21 Jun 2011 to 150.29.215.199. Redistribution subject to ASA license or copyright; see http://asadl.org/journals/doc/ASALIB-home/info/terms.jsp used to estimate the fraction of students in the measured classrooms who would experience these ideal conditions. That is, a new basis for classroom acoustics criteria as well as estimates of the negative impact of common existing conditions were produced. 100 90 80 Intelligibility, % 70 60 II. MEASUREMENT PROCEDURES 50 40 30 Marshall, age: Elliott, age: 5 7 7 9 9 11 11 13 1986 classrooms 20 10 0 -15 -10 -5 0 5 10 15 20 S/N(A), dB FIG. 1. Laboratory word intelligibility test results from Marshall—Ref. 11 共5, 7, 9, and 11 year olds兲 and Elliott—Ref. 7 共7, 9, 11, and 13 year olds兲 showing the effect of listener age on intelligibility scores vs A-weighted speech–noise level differences, S / N共A兲, and compared with previous inclassroom intelligibility test results from 12 year olds 共Ref. 12兲. signal-to-noise ratios where larger effects would be expected. It is clear that the older laboratory based speech intelligibility studies in which monaural headphone presentation was used would exaggerate the negative effects of noise on speech recognition but the magnitude of these effects cannot be accurately estimated. Similarly there are previous studies indicating that younger children’s ability to understand speech is more adversely affected by room reverberation.9,14,15 Some early studies used monaural headphone presentation of the test signals,10 which would exaggerate the negative effects of reverberation on speech recognition scores.16 Because shorter reverberation times led to improved speech recognition scores in these tests, they have led some to recommend very short reverberation times for classrooms. Of course, excessive absorptive treatments may control reverberation but will also decrease effective speech levels and exaggerate the more common problem of inadequate signal-to-noise ratios in classrooms. The limitations of the various previous studies carried out in laboratory settings can be avoided by testing children in their classrooms where the children listen in realistic conditions with both ears. The work reported in this paper and in a companion paper17 was intended to provide a more complete basis for deriving acoustical criteria for classrooms from speech intelligibility tests of students in their own classrooms and acoustical measurements in the same classrooms. More specifically it was intended to show younger children’s ability to understand speech in real classroom conditions as a function of signal-to-noise ratios, varied room acoustics, and the age of the children. At the same time it was planned to determine speech and noise levels in classrooms during actual teaching situations in order to understand the actual signal-to-noise ratios experienced in active classrooms. By combining these two types of information, this paper estimates ideal acoustical conditions in classrooms for children of various ages. At the same time the data are J. Acoust. Soc. Am., Vol. 123, No. 4, April 2008 The measurements included speech recognition tests of complete classes of grades 1, 3, and 6 students 共nominally 6, 8, and 11 year olds兲 in their own classrooms. The signal-tonoise ratios experienced by the students were varied over a wide range by varying the speech playback level. Room acoustics conditions were varied by testing in a number of different classrooms. Speech and noise levels were measured during the speech tests as well as during normal teaching activities. Room acoustics measurements were made with the classroom occupied and also unoccupied. A. Speech recognition tests The speech intelligibility scores were obtained using the Word Identification by Picture Identification 共WIPI兲 test11,18,19 that includes four lists of 25 phonetically balanced simple nouns. This was selected as an easy test that 6 year olds and older students could quickly learn and respond to individually in a classroom situation. It consists of simple test words said to be familiar to 5 year olds and these were presented in the carrier phrase, “Please mark the _ _ _ _ now.” While sitting at their desks in their regular classroom, the students responded by placing a sticker on one of six pictures to indicate the correct word. Each sentence was approximately 3 s long and the next sentence was played to the students when all were ready to proceed. The tests were carried out in 41 classrooms evenly distributed among grade 1, grade 3, and grade 6 students in 12 different schools. The schools were in relatively quiet rural areas and small towns in Eastern Ontario, Canada. A total of 840 students were evaluated in 41 classrooms. Grade 1 students were each tested at two different signal-to-noise values and the other students at three different signal-to-noise values. A total of 2285 individual speech recognition tests were obtained. All students in each class with parental permission participated in the tests. The parental permission form asked whether the student had any known hearing problems. Almost all students with known hearing impairment did not receive parental permission and did not participate. As a result the test results of the few students with reported hearing problems were not included in these analyses because of the very small numbers of these students. The speech source was a small loudspeaker with similar directionality to that of a human talker. Digital recordings of the WIPI test material were made in an anechoic room so that they were reflection free and with negligible noise. A female talker was used and the recordings were edited to use exactly the same version of the carrier phrase for all test words and to have the same sound levels for all test words. Varied S / N共A兲 values were obtained by changing the playback level of the speech material relative to the existing J. S. Bradley and H. Sato: Speech in classrooms 2079 Downloaded 21 Jun 2011 to 150.29.215.199. Redistribution subject to ASA license or copyright; see http://asadl.org/journals/doc/ASALIB-home/info/terms.jsp natural ambient noise in the classroom. This included some tests with 20– 30 dB S / N共A兲 values to determine intelligibility scores in truly ideal conditions for these students. This made it possible to obtain speech recognition scores for a very wide range of signal-to-noise ratios with natural ambient noises. Room acoustics conditions were varied by carrying out the tests in a number of different classrooms. It was not possible to artificially modify room acoustics conditions as part of the experiments. B. Acoustical measurements Speech and noise levels were recorded during the speech recognition tests at four positions in the fully occupied classrooms. There were on average 20.4 students in each classroom 共s.d.⫾ 3.96兲 and hence there were about five students near each measuring microphone. These recordings were used to determine speech and noise levels during the tests by statistical analysis of the distributions of recorded sound levels.17 Speech and noise levels were also recorded at the same four microphone positions during a normal teaching activity in each classroom. Recordings were made during a period when the teacher planned to be mostly talking to the class of students as a complete group. Finally, ambient noise levels were also recorded at these same four positions when the classroom was unoccupied. Room acoustics parameters were also measured from impulse responses obtained at the same microphone locations for both occupied and unoccupied conditions. These included decay times and energy ratios as described in the companion paper17 along with the various speech and noise level measurements. All sounds were digitally recorded on a portable computer. To avoid running cables through the classrooms, the signals from the four microphones were connected to a central computer via 16 bit digital transmitters and receivers. III. MEAN TRENDS OF SPEECH RECOGNITION TEST RESULTS FIG. 2. Mean intelligibility scores of groups of approximately five students vs A-weighted speech–noise level difference, S / N共A兲. The error bars indicate the standard deviations of the scores of each group of students. There is also a large amount of scatter about the mean trends that tends to increase with decreasing S / N共A兲 values. The scatter is partly due to the approximately five students near each microphone being slightly different distances from the microphone. The larger scatter at lower S / N共A兲 values may be indicative of how students react to more difficult listening conditions. At lower S / N共A兲, some students can still do quite well, but others may more or less give up and get much lower scores. The best-fit regression lines to the data in Fig. 2 are repeated in Fig. 3 and are compared with the previous speech intelligibility scores from 12 year olds in classrooms. The previous results12 used a rhyme test with simple rhyming words presented in a carrier phrase. Given the amount of scatter in both sets of data, and possible larger inaccuracies in the older results, there is remarkable agreement between the old results for 12 year olds and the results for 11 year olds 共i.e., the grade 6 students兲. This agreement suggests that A. Speech recognition scores as a function of S / N„A… 2080 J. Acoust. Soc. Am., Vol. 123, No. 4, April 2008 90 80 70 Intelligibility, % Speech recognition scores were first examined as a function of S / N共A兲 as this was the key independent variable that was varied in these experiments. The speech recognition scores were averaged over the scores of the approximately five students located close to each microphone in each classroom. These average scores are plotted versus the measured signal-to-noise ratios 关S / N共A兲兴 in Fig. 2. They are plotted versus S / N共A兲 separately for the results of the grades 1, 3, and 6 students. An analysis of variance of the scores showed that there were highly significant main effects of age and S / N共A兲 共p ⬍ 0.001兲 as well as a highly significant interaction effect of these two independent variables, age and S / N共A兲 共p ⬍ 0.001兲. That is, although there is significant scatter in the results, there are highly significant effects related to S / N共A兲 and the age of the listeners. The younger children need significantly higher S / N共A兲 values to obtain the same intelligibility scores as the older children in these classrooms. 100 60 50 40 30 Grade 6 Grade 3 Grade 1 1986 Schools 20 10 0 -10 -5 0 5 10 15 S/N(A), dB FIG. 3. Comparison of the best-fit regression lines from Fig. 2 with the mean trend of previous speech intelligibility scores for 12 year olds in classrooms 共Ref. 12兲. J. S. Bradley and H. Sato: Speech in classrooms Downloaded 21 Jun 2011 to 150.29.215.199. Redistribution subject to ASA license or copyright; see http://asadl.org/journals/doc/ASALIB-home/info/terms.jsp TABLE I. Summary of mean values and standard deviations 共兲 of key room-acoustics parameters for occupied conditions. C50, EDT, and T60 are midfrequency results for the combined 500 and 1000 Hz octave bands. S / N共A兲 is the signal-to-noise ratio in terms of A-weighted speech–noise level differences. 100 Intelligibility, % 95% 90 8.5 dB grade 6 12.5 dB 15.5 dB grade 3 grade 1 80 Grade 6 Grade 3 Grade 1 70 0 5 10 15 20 Grade 1 Mean Grade 3 Mean Grade 6 Mean C50共500– 1000兲 EDT共500– 1000兲 T60共500– 1000兲 S / N共A兲 8.9 2.19 0.40 0.10 0.43 0.09 6.5 7.30 8.6 2.41 0.42 0.12 0.44 0.11 3.8 5.2 8.8 2.94 0.37 0.13 0.40 0.10 3.4 9.0 S/N(A), dB FIG. 4. 共Color online兲 Expanded view of the mean trends from Fig. 2 showing the point at which each regression line reaches a 95% speech intelligibility score. the results are a more representative indication of children’s ability to understand speech as a function of S / N共A兲 in real classroom conditions than some previous laboratory speech tests. Figure 4 shows an expanded view of the best-fit regression lines from Fig. 2. The performance of the three age groups can be compared by considering the required S / N共A兲 for each group to achieve near-ideal conditions for speech communication. For the results of simple word intelligibility tests such as the WIPI test, a speech intelligibility score of 95% correct is used to indicate near-ideal conditions, because 95% correct scores are readily achievable in high S / N共A兲 conditions. For example, Fig. 2 shows that for very high S / N共A兲 共+23 to +30 dB兲, the grade 1 and 3 students scored ⬃98% correct and the grade 6 students scored ⬃99.5% correct. That is, although the younger children might be expected to find the test a little more difficult, all three age groups are capable of getting higher scores than 95% in very high S / N共A兲 conditions. The mean trends in Fig. 4 show that the grade 6 students could, on average, achieve 95% correct scores for a S / N共A兲 of +8.5 dB. However, the grade 3 students required +12.5 dB S / N共A兲 and the grade 1 students required +15.5 dB S / N共A兲 to obtain a mean score of 95% correct. In this case there is a 7 dB difference between the average needs of grade 1 and grade 6 students. That is, the grade 1 students would need a 7 dB greater S / N共A兲 value, or a corresponding 7 dB quieter ambient noise level, to obtain the same intelligibility scores as the grade 6 students. Further, it is likely that the grade 6 students would have somewhat lower speech intelligibility scores than would young adult listeners in the same situations.9 frequency values of early-to-late arriving sound ratios 共C50兲, early decay times 共EDT兲, reverberation times 共T60兲, and A-weighted signal-to-noise ratios 关S / N共A兲兴. The acquisition of these data are described in the companion paper.17 It was hoped that there would be sufficient variation of room acoustics conditions among the classrooms to determine the additional effects of room acoustics on speech intelligibility scores. In practice, Table I indicates relatively small variations about the mean conditions that were close to ideal. To investigate possible additional effects of room acoustics on intelligibility scores, multiple regression analyses were performed regressing speech intelligibility scores on values of S / N共A兲, S / N共A兲2, and one of the room acoustics parameters. Table II summarizes the results in terms of the resulting R2 共coefficient of determination兲 values. The R2 values for the combination of the S / N共A兲 and S / N共A兲2 are first given for each grade level of students. If there are significant additional effects of room acoustics, then when values of one of the room acoustics parameters were added to the regression analysis, the R2 value would be expected to increase. For the grade 1 results none of the room acoustics parameters added significantly to the prediction and the R2 values did not increase. For the grade 3 and 6 results, adding one of the room acoustics parameters to the prediction did result in modest but significant increases in the prediction accuracy of the intelligibility scores. Of the three room acoustics parameters considered, C50 values tended to be slightly more effective in increasing the R2 values. Figure 5 illustrates the resulting multiple regression equations for combinations of S / N共A兲, S / N共A兲2, and C50 TABLE II. R2 values from multiple regression analyses of intelligibility scores on the predictors shown at the top of each column. The subscript ns, indicates that the room acoustics predictor variable 共C50, EDT, or T60兲 did not add significantly to the prediction 共p ⬍ 0.05兲. N S / N共A兲 S / N共A兲2 S / N共A兲 S / N共A兲2 C50 S / N共A兲 S / N共A兲2 EDT S / N共A兲 S / N共A兲2 T60 136 96 156 0.707 0.625 0.355 0 . 710ns 0.660 0.508 0 . 707ns 0.655 0.475 0 . 707ns 0.663 0.474 B. Effects of room acoustics on speech recognition scores Table I lists the mean values of key room-acoustics parameters20 along with the standard deviations of these parameters for each grade level group. These included midJ. Acoust. Soc. Am., Vol. 123, No. 4, April 2008 Grade 1 Grade 3 Grade 6 J. S. Bradley and H. Sato: Speech in classrooms 2081 Downloaded 21 Jun 2011 to 150.29.215.199. Redistribution subject to ASA license or copyright; see http://asadl.org/journals/doc/ASALIB-home/info/terms.jsp TABLE III. Average intelligibility scores for each grade and overall average standard deviations 共兲 for all grades determined for each S / N共A兲 interval. Speech intelligibility, % 100 80 S / N共A兲 共dB兲 Avg. grade 1共%兲 Avg. grade 3共%兲 Avg. grade 6共%兲 共%兲 60 −10 −5 0 5 10 15 20 25 40 C50 = 6 dB C50 = 9 dB C50 = 12 dB 20 (a) grade 3 Speech intelligbility, % 0 100 80 38.0 56.5 73.2 84.7 91.3 94.8 96.4 97.3 ¯ 69.0 81.5 89.3 93.6 95.9 97.0 ¯ 70.5 80.5 87.9 92.6 95.0 97.3 98.3 ¯ 16.50 12.30 9.00 5.68 3.50 2.20 1.29 0.70 60 Canadian schools, these results suggest that room acoustics in these elementary school classrooms tend to be homogeneously reasonably acceptable. 40 C50= 6 dB C50 = 9 dB C50 = 12 dB 20 (b) grade 6 0 -15 -10 -5 0 5 10 15 S/N(A), dBA FIG. 5. Multiple regression results for prediction of speech intelligibility scores from S / N共A兲, S / N共A兲2, and C50 values for 共a兲 grade 3 results and 共b兲 grade 6 results. Results are given for C50 values of 6, 9, and 12 dB roughly corresponding to the range of frequently found conditions in the measured classrooms. values for the grade 3 and grade 6 results. The regression equations in Fig. 5 are given by the following: Grade 3, SI = 2.495 S/N共A兲 − 0.110 S/N共A兲2 + 0.98C50 + 70.37, Grade 6, SI = 0.772 S/N共A兲 − 0.0189 S/N共A兲2 + 1.53C50 + 74.46. The above-mentioned grade 3 results suggest that a 1 dB change in C50 values would result in about a 1.0% change in intelligibility scores. However, the grade 6 results indicate a 1.53% change in intelligibility scores would result for a 1 dB change in C50 values. If most of the data are within ⫾1 s.d. of the mean, most C50 values would be within a range of about 6 – 12 dB depending on the grade. The largest effect would be for the grade six results where changes in C50 values are likely to result in changes to intelligibility scores of up to about 9% 共obtained by multiplying the standard deviation value in Table I for C50 values in grade 6 classrooms by the coefficient of C50 in the equation for the abovepresented grade 6 results, and doubling the result to include positive and negative deviations about the mean兲. Of course, this is much smaller than the effect of S / N共A兲 values illustrated in Fig. 2. These data do not show large effects of room acoustics parameters because there were not large variations in room acoustics conditions among the various classrooms. This is illustrated in Fig. 5 that plots the regression equations for the case when C50 was used as the added room acoustics parameter. Similar results were obtained using EDT and T60 values as the added room acoustics parameter. Since these classrooms are assumed to be “typical” of many classrooms in 2082 J. Acoust. Soc. Am., Vol. 123, No. 4, April 2008 IV. DISTRIBUTION OF INTELLIGIBILITY SCORES ABOUT THE MEAN TRENDS While Sec. II discusses the mean trends of speech intelligibility scores, it is clear that many individual student results deviate significantly from the mean trends. That is, while the average student may for some condition be able to understand reasonably well, many cannot. It is therefore important to also examine the distribution of intelligibility scores about the mean trend. The ultimate goal is to determine the fraction of the students that can understand speech well at each S / N共A兲 value and the required conditions to enable most children to understand speech well. The distribution of the intelligibility scores about the mean trends seen in Fig. 2 was analyzed by dividing the results into 5 dB wide S / N共A兲 intervals for the data from the students of each grade. It was then possible to examine the distribution of scores within each of these S / N共A兲 segments. Although in most cases there were approximately normal distributions of scores in each segment, in some cases there were not adequate numbers of data points to provide regular distributions. A procedure was required to approximate the distributions in all segments of the data. To do this, the mean scores and the standard deviations of the scores about the mean values were calculated for each S / N共A兲 segment. These values were then plotted versus the mean S / N共A兲 values for each interval. The mean values are listed in Table III and are different for each grade and follow trends almost identical to those in Fig. 2. The standard deviations of the scores in each S / N共A兲 interval were similarly plotted versus the mean S / N共A兲 value for each interval. As illustrated in Fig. 6, the results for all three grades follow an approximately similar trend but with some considerable uncertainty in the lowest S / N共A兲 interval. The mean trend in Fig. 6 gives a good estimate of the variation in the standard deviation of scores over a wide range of S / N共A兲 values and is quite adequate for the purposes of these analyses in which the focus is on very good conditions for speech communication. Therefore this mean trend is used as an estimate of the standard deviations for all grades. 关There is some uncertainty in the mean trend at the lowest S / N共A兲 category where the mean S / N共A兲 is −10 dB. This is J. S. Bradley and H. Sato: Speech in classrooms Downloaded 21 Jun 2011 to 150.29.215.199. Redistribution subject to ASA license or copyright; see http://asadl.org/journals/doc/ASALIB-home/info/terms.jsp 0.3 S/N(A) grade 1 grade 3 grade 6 20 dB Fraction of responses standard deviation of SI, % 18 12 6 0.2 15 dB 0.1 10 dB -5 dB -10 dB 0 dB 5 dB 0.0 0 0 -15 -10 -5 0 5 10 15 20 20 25 40 60 80 100 Speech intelligibility, % S/N(A), dB to be expected as conditions get more difficult and at extremely low S / N共A兲 values intelligibility scores must eventually all approach 0% with a standard deviation of zero.兴 The mean trend standard deviations from Fig. 6 are included in Table III. Assuming a normal distribution is a reasonable approximation to the distribution of scores in each S / N共A兲 interval, one can estimate the distributions of scores for each interval from the mean and standard deviation of the scores in each interval. That is, the number of speech intelligibility scores in each S / N共A兲 interval can be estimated from the following expression for a normal distribution:21 y = N/兵冑2其e−共x − 兲 2/22 , 共1兲 where N is the total number of intelligibility scores in the distribution for one S / N共A兲 interval, y is the number of scores at intelligibility x, x is the speech intelligibility score, is the standard deviation of the intelligibility scores in each S / N共A兲 interval, and is the mean speech intelligibility score in each S / N共A兲 interval. With the mean scores and standard deviations for each S / N共A兲 interval given in Table III, one can construct a simple mathematical model of the speech intelligibility scores for the responses of students in each grade. This model can describe the distribution of scores as well as the mean trend of the scores for each age of student. Figure 7 shows the resulting speech intelligibility distributions in each S / N共A兲 interval for the grade 1 results. One can more easily describe the fraction of students experiencing some high level of speech intelligibility by replotting the information as cumulative probability plots. This is done in Fig. 8 for the data from all three grade levels. The goal is to determine the required S / N共A兲 at each grade level for students to experience near-ideal conditions. Near-ideal is again defined as corresponding to speech intelligibility scores of 95% or better. Figure 8 can be used to determine the fraction of students at each S / N共A兲 category that would experience nearideal conditions with intelligibility scores of 95% or better. For the example of the grade 6 students at a 20 dB S / N共A兲, essentially all would experience 95% intelligibility or better. J. Acoust. Soc. Am., Vol. 123, No. 4, April 2008 FIG. 7. Distribution of the speech intelligibility scores in S / N共A兲 intervals from −10 to +20 dB for the grade 1 results. At a S / N共A兲 of +15 dB only 25.4% of the grade 6 students would not experience 95% or better speech intelligibility. One might therefore argue that the common recommendation for a +15 dB S / N共A兲 value12,22 is satisfactory for the grade 6 students because at this S / N共A兲, 74.6% of the students would experience near-ideal conditions. For the grade 3 students at a +15 dB S / N共A兲 value, only 54.5% would experience 95% or better speech intelligibility and for the grade 1 students, only 36.6% would experience this near-ideal speech intelligibility for conditions of +15 dB S / N共A兲. However, for a +20 dB S / N共A兲, 74.8% of the grade 1 students would experience 95% or better speech intelligibility. It is therefore important to note that a +15 dB S / N共A兲 does not provide near-ideal conditions for most of the grade 1 students. V. COMPARISON OF ACTUAL S / N„A… VALUES WITH IDEAL REQUIREMENTS A. Ideal S / N„A… goals The data in Fig. 8 can be replotted as the percentage of students who would experience 95% speech intelligibility or 1.0 0.8 Grade 6 0.6 0.4 S/N(A) 0.2 Fraction of responses FIG. 6. Mean trend of standard deviations of speech intelligibility scores for all three grades as a function of the S / N共A兲 values. 0 dB -5 dB 20 dB 15 dB 10 dB 5 dB 0.0 0.8 Grade 3 0.6 0.4 S/N(A) 0.2 5 dB 0 dB -5 dB 20 dB 15 dB 10 dB 0.0 0.8 Grade 1 20 dB 0.6 15 dB 0.4 0.2 0.0 S/N(A) 0 dB -5 dB 40 60 Intelligibility score, % 5 dB 10 dB 80 100 FIG. 8. 共Color online兲 Cumulative probability distributions of intelligibility scores by S / N共A兲 category and student grade level. Vertical dashed line indicates 95% speech intelligibility. J. S. Bradley and H. Sato: Speech in classrooms 2083 Downloaded 21 Jun 2011 to 150.29.215.199. Redistribution subject to ASA license or copyright; see http://asadl.org/journals/doc/ASALIB-home/info/terms.jsp for intermediate grade results and perhaps even assume that grade 7 and 8 results would yield a curve only slightly to the left of the grade 6 results in Fig. 9. To accommodate the needs of younger children and/or to include even a modest safety factor leads to required S / N共A兲 values greater than the common recommendation for an ideal S / N共A兲 of +15 dB. Fraction better than 95% SI 1.0 0.8 Grade1 Grade3 Grade6 0.6 0.4 B. Conditions in active classrooms 0.2 0.0 -10 -5 0 5 10 15 20 As part of this work, speech and noise levels were recorded in active classrooms and separate speech and noise levels were determined using a statistical procedure.17 共“Active” indicates the classrooms were fully occupied with students active in educational activities with their teacher.兲 The mean teacher speech level at the microphone locations near the students was 59.5 dBA with a standard deviation of ⫾5.5 dB. The distribution of measured S / N共A兲 values in active classrooms is shown in Fig. 12 of Ref. 17. The mean S / N共A兲 was approximately +11 dB. Although a S / N共A兲 of +11 dB is often said to correspond to somewhat acceptable conditions for speech communication, Sec. V A demonstrated that much higher S / N共A兲 values are required for good speech communication for younger children. In the actual teaching situations shown in Fig. 12 of Ref. 17, S / N共A兲 values of +15 dB or more only occurred in 6.25% of the measured cases. The S / N共A兲 conditions during the measurements of actual teaching sessions were considerably inferior to the S / N共A兲 values required for near-ideal conditions in Sec. IV. Knowing children’s ability to understand speech at various S / N共A兲 values and knowing the distribution of S / N共A兲 values found in active elementary school classrooms, one can now estimate the proportion of the students in the measured classrooms who would experience near-ideal conditions for speech communication. First, the distribution of measured S / N共A兲 values in the active classrooms was recalculated with the same 5 dB intervals as were used in Table III and Figs. 7 and 8. These values are listed in the left-hand 3 columns of Table IV and show that most of the measured conditions 共61.6%兲 had S / N共A兲 values in the category with a mean S / N共A兲 of +10 dB. The middle three columns in Table IV show the percentage of students who would experience 95% speech intelligi- 25 S/N(A), dB FIG. 9. Percentage of students who would experience 95% speech intelligibility or higher as a function of the S / N共A兲 value. better as a function of the existing S / N共A兲 value, as illustrated in Fig. 9. This again assumes that 95% speech intelligibility on a simple word intelligibility test represents nearideal conditions for speech communication. In this format one can directly determine the S / N共A兲 required for a certain percentage of the students to experience 95% speech intelligibility or better. That is, one could choose a desired S / N共A兲 goal so that some large percentage of the students would experience these near-ideal conditions for speech communication. For example, one might aim for acoustical conditions in which at least 80% of the students would experience 95% speech intelligibility or better. Figure 9 shows that for the grade 6 results a S / N共A兲 value of just above +15 dB is required. The grade 3 results indicate a minimum S / N共A兲 of +18.5 dB would be required and the grade 1 results indicate a minimum required S / N共A兲 of close to +20.5 dB. One could require that a larger or smaller percentage of the students should experience such near-ideal conditions. For example, requiring that 90% of the students should experience such near-ideal conditions rather than 80% would increase the minimum S / N共A兲 values needed by about 2 dB. One might also include a small safety factor to be sure that the desired S / N共A兲 value is actually achieved in real classrooms. Some differences in approach are possible, but Fig. 9 can serve as the basis for setting acceptable S / N共A兲 criteria for elementary school students. One could readily interpolate TABLE IV. Calculation of the percentage of students at each grade level who would experience near-ideal acoustical conditions for speech communication in the measured active classrooms. Columns 1–3, distribution of S / N共A兲 values while teachers were talking. Columns 4–6, percentages of students who would experience 95% speech intelligibility or better in each S / N共A兲 category. Columns 7–9, total percentages of students who would experience near-ideal conditions in the measured active classrooms. Mean S / N共A兲 共dB兲 0 5 10 15 20 Total 2084 N cases 0 9 69 34 0 112 % cases 0.0 8.0 61.6 30.4 0.0 100.0 Percent scoring ⬎95% SI %cases⫻ % students Grade 1 Grade 3 Grade 6 Grade 1 Grade 3 Grade 6 3.9 13.0 36.6 12.0 26.6 54.5 21.4 40.5 74.6 0.31 8.03 11.12 0.96 16.39 16.55 1.72 24.93 22.63 19.5 33.9 49.3 J. Acoust. Soc. Am., Vol. 123, No. 4, April 2008 J. S. Bradley and H. Sato: Speech in classrooms Downloaded 21 Jun 2011 to 150.29.215.199. Redistribution subject to ASA license or copyright; see http://asadl.org/journals/doc/ASALIB-home/info/terms.jsp TABLE V. Calculation example to determine maximum acceptable ambient noise levels. 59.5 dB A −5.5 dB A −20 dB A 34 dB A 35 dB A Mean classroom speech level from teachers Standard deviation of measured speech levels Required S / N共A兲 for grade 1 students Required maximum ambient noise level ANSI S12.60 maximum ambient noise level criteria bility or better in each of these same S / N共A兲 categories and these were obtained from Fig. 8. By multiplying the percentage of cases in each S / N共A兲 category by the corresponding percentage of students who would experience 95% speech intelligibility or better in that category, the total percentage of all students who would experience near-ideal conditions is obtained in the three right-hand columns for each age group. These final percentages are summed over all three S / N共A兲 categories at the lower right-hand side of Table IV. The quite surprising result is that only 19.5% of the grade 1 students would experience near-ideal conditions during the measured teaching activities. This rises to 33.9% of the grade 3 students and 49.3% of the grade 6 students. This is in spite of the fact that room acoustics parameters were near-ideal and the rooms seemed to have only minimal acoustical problems to the 共adult兲 experimenters carrying out these tests. C. Maximum ambient noise level criteria By combining the various results from this study, one can estimate maximum acceptable ambient noise levels that would provide near-ideal speech communication for students of various ages. Table V illustrates the process for grade 1 students for whom a minimum S / N共A兲 of 20 dB was found to be necessary in the previous sections of this paper. This estimate results in a recommended maximum ambient noise level that is almost identical to that recommended in the ANSI S12.60 classroom acoustics standard. This is strong confirmation of the validity of the recommendation in ANSI S12.60. Similar estimates could be made for grade 3 and 6 students. If these estimates included a small safety factor of 2 – 4 dB to account for various uncertainties, the results would again be close to the recommendations of ANSI S12.60. It is therefore justifiable and practical to recommend a maximum ambient noise level in all elementary school classrooms of no more than 35 dBA. It is important to realize too, that although occupied noise levels may be higher than 35 dBA, lower unoccupied levels lead to lower occupied levels17 and hence it is particularly important to make unoccupied ambient noise levels as low as possible. Unfortunately the current data are not adequate to estimate ideal room acoustics criteria for classrooms. There was simply very little variation in room acoustics conditions. This means that the conclusions with respect to desirable S / N共A兲 and ambient noise levels are directly applicable to classrooms with room acoustics conditions similar to those in these classrooms. However, the room acoustics measurement results in Table I show that conditions in these classJ. Acoust. Soc. Am., Vol. 123, No. 4, April 2008 rooms were close to most recommended values for classrooms. Thus, the recommended S / N共A兲 values can be said to be an important component of an ideal classroom. VI. DISCUSSION AND CONCLUSIONS This study has provided data that better describe the abilities of elementary school children to understand speech in noise in real classrooms of schools near Ottawa, Canada. The results are better because they are from a large sample of children and are based on natural binaural listening in actual classrooms with realistic ambient noises. These results also realistically include the other distractions that are expected to occur in actual classrooms such as those from other students both in the classroom and in adjacent spaces. The form of the relationships between intelligibility scores and S / N共A兲 values is similar to previous in-classroom speech intelligibility tests but different than earlier laboratory studies using monaural headphone playback of the speech material. The mean trends of the results indicate that grade 1 students 共6 year olds兲 require 7 dB higher S / N共A兲 values to achieve the same speech intelligibility scores as would grade 6 students 共11 year olds兲. Although no adult data were obtained, it is likely that young adult listeners would get somewhat higher speech intelligibility scores than the grade 6 students at the same S / N共A兲 conditions.9 There is also much scatter in the speech intelligibility scores about the mean trends indicating that many students would often have more difficulty understanding speech than indicated by the mean trends. The distribution of speech intelligibility scores about the mean trends was therefore also examined and a mathematical model of the means and distributions of scores was developed to more completely define children’s abilities to understand speech in noise. From this model estimates of the S / N共A兲 values required for grade 1, grade 3, and grade 6 students to experience near-ideal conditions for speech communication were made. Near-ideal conditions were defined as 95% speech intelligibility scores on simple word intelligibility tests and all ages of student could do better than this in very high S / N共A兲 conditions. For 80% of the students to experience such nearideal conditions, S / N共A兲 values of +20, +18, and +15 dB would be required for grade 1, grade 3, and grade 6 students, respectively. Measurements during actual teaching activities showed an average S / N共A兲 of 11 dB and in only 6.25% of the cases were the S / N共A兲 +15 dB or higher. In the actual teaching situations only 19.5% of the grade 1, 33.9% of the grade 2, and 49.3% of the grade 6 students would experience nearideal conditions for speech communication. That is, in the measured classrooms that appeared to have acceptable acoustical conditions to the adult experimenters, less than half of the students would experience near-ideal speech communication. The inability of younger children to understand many of the words that a teacher is saying must make it more difficult for the children to learn new concepts. There is a growing literature of results indicating that increased noise levels are J. S. Bradley and H. Sato: Speech in classrooms 2085 Downloaded 21 Jun 2011 to 150.29.215.199. Redistribution subject to ASA license or copyright; see http://asadl.org/journals/doc/ASALIB-home/info/terms.jsp associated with a number of educational factors such as delayed reading ability, effects on memory, and student behavior.2 Further calculations based on the new measurements led to estimates of maximum acceptable ambient noise levels that were very close to the 35 dBA recommendation in ANSI S12.60. The range of room acoustics conditions measured in the classrooms was quite small and close to values thought to be optimum. Although there were significant effects of room acoustics parameters, the limited range of the data made it impossible to produce new estimates of ideal room acoustics conditions for speech communication in classrooms as a function of student age. Further research is required to consider the question of optimum room acoustics criteria to maximize intelligibility and the quality of speech communication. ACKNOWLEDGMENTS This work was supported by a grant from the Canadian Language and Literacy Research Network. The very helpful cooperation of the teachers and administration of the Upper Canada District School Board made this work possible. 1 American National Standards Institute, “Acoustical performance criteria, design requirements, and guidelines for schools,” ANSI S12.60, New York, 2002. 2 K. Anderson, “The problem of classroom acoustics: The typical classroom soundscape is a barrier to learning,” Semin. Hear. 25, 117–129 共2004兲. 3 S. Hygge, “Classroom experiments on the effects of different noise sources and sound levels on long-term recall and recognition in children,” Appl. Cognit. Psychol. 17, 895–914 共2003兲. 4 A. L. Bronzaft and D. P. McCarthy, “The effect of elevated train noise on reading ability,” Environ. Behav. 7, 517–528 共1975兲. 5 A. L. Bronzaft, “The effect of a noise abatement program on reading ability,” J. Environ. Psychol. 1, 215–222 共1981兲. 2086 J. Acoust. Soc. Am., Vol. 123, No. 4, April 2008 6 G. W. Evans and L. Maxwell, “Chronic noise exposure and reading deficits,” Environ. Behav. 29, 638–656 共1997兲. 7 L. L. Elliott, S. Connors, E. Kille, S. Levin, K. Bell, and D. Katz, “Children’s understanding of monosyllabic nouns in quiet and in noise,” J. Acoust. Soc. Am. 66, 12–21 共1979兲. 8 L. L. Elliott, “Performance of children aged 9 to 17 years on a test of speech intelligibility in noise using sentence material—With controlled word predictability,” J. Acoust. Soc. Am. 66, 651–653 共1979兲. 9 A. Neuman and I. Hochberg, “Children’s perception of speech in reverberation,” J. Acoust. Soc. Am. 73, 2145–2149 共1983兲. 10 T. Finitzo-Hieber and T. W. Tillman, “Room acoustics effects on monosyllabic word discrimination ability for normal and hearing-impaired children,” J. Speech Hear. Res. 21, 440–458 共1978兲. 11 N. B. Marshall, “The effects of different signal-to-noise ratios on the speech recognition scores of children,” Ph.D. thesis, University of Alabama, Tuscaloosa, AL, 1987. 12 J. S. Bradley, “Speech intelligibility studies in classrooms,” J. Acoust. Soc. Am. 80, 846–854 共1986兲. 13 N. W. MacKeith and R. R. A. Coles, “Binaural advantages in hearing of speech,” J. Laryngol. Otol. 85, 213–232 共1971兲. 14 A. K. Nábflek and P. K. Robinson, “Monaural and binaural speech perception in reverberation for listeners of various ages,” J. Acoust. Soc. Am. 71, 1242–1248 共1982兲. 15 A. K. Nábělek and J. M. Pickett, “Reception of consonants in a classroom as affected by monaural and binaural listening, noise, reverberation and hearing aids,” J. Acoust. Soc. Am. 56, 628–639 共1974兲. 16 J. P. Moncur and D. Dirks, “Binaural and monaural speech intelligibility in reverberation,” J. Speech Hear. Res. 10, 186–195 共1967兲. 17 H. Sato and J. S. Bradley, “Evaluation of acoustical conditions for speech communication in active elementary school classrooms,” J. Acoust. Soc. Am. 123, 2064–2077 共2008兲. 18 M. Ross and J. Lerman, “A picture identification test for hearing impaired children,” J. Speech Hear. Res. 13, 44–53 共1970兲. 19 M. Ross and J. Lerman, Word Intelligibility by Picture Identification 共Stanwix House, Pittsburgh, 1971兲. 20 International Organisation for Standardisation, “Acoustics—Measurement of the reverberation time of rooms with reference to other acoustical parameters,” ISO3382, Geneva, Switzerland. 21 G. F. Ferguson and Y. Takane, Statistical Analysis in Psychology and Education, 6th ed. 共McGraw Hill, New York, 1989兲. 22 T. Houtgast, “The effect if ambient noise on speech intelligibility in classrooms,” Appl. Acoust. 14, 15–25 共1981兲. J. S. Bradley and H. Sato: Speech in classrooms Downloaded 21 Jun 2011 to 150.29.215.199. Redistribution subject to ASA license or copyright; see http://asadl.org/journals/doc/ASALIB-home/info/terms.jsp
© Copyright 2026 Paperzz