Language and Speech http://las.sagepub.com/ Tone Perception Ability of Cantonese-Speaking Children Kathy Yuet Sheung Lee, Sung Nok Chiu and Charles Andrew van Hasselt Language and Speech 2002 45: 387 DOI: 10.1177/00238309020450040401 The online version of this article can be found at: http://las.sagepub.com/content/45/4/387 Published by: http://www.sagepublications.com Additional services and information for Language and Speech can be found at: Email Alerts: http://las.sagepub.com/cgi/alerts Subscriptions: http://las.sagepub.com/subscriptions Reprints: http://www.sagepub.com/journalsReprints.nav Permissions: http://www.sagepub.com/journalsPermissions.nav Citations: http://las.sagepub.com/content/45/4/387.refs.html >> Version of Record - Dec 1, 2002 What is This? Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 LAN GUAGE AND SPEECH, 2002, 45 (4), 387 – 406 387 Tone Perception A bility of Cantonese-Speaking Children* Kathy Yuet Sheung Lee1 Sung Nok Chiu2 Charles Andrew van Hasselt1 1 The Chinese University of Hong Kong, Hong Kong Kong Baptist University, Hong Kong 2 Hong Key words Cantonese F0 onset and offset lexical knowledge tone perception Abstract Studies have shown that while children acquire all Cantonese tones by age two, they are not able to label them reliably until approximately age 10. One possible explanation for the large age discrepancy may be t he different methodologies used. This study aimed to (1) investigate a new research design for the collection of reliable tone perception data from young children; (2) compare lexical and nonlexical items for testing tone perception ability; and (3) identify the relative ease of perceiving the three basic tone contrasts in Cantonese, that is, high level/ high rising (T1 / T2), high level/ low falling (T1/ T4), and, high rising / low falling tones (T2 / T4). The three tone pairs were presented to 31 children in the form of word and nonword stimuli. It was found that the research design could be used to assess the tone perception knowledge of children as young as 2;09. Significant differences were found between word and nonword stimuli and also in the identification of the T2 / T4 contrast in comparison with the other two pairs. Children’s overall tone perception abilities are discussed in detail with reference to the role of lexical knowledge and the potential for tone perception confusions arising from differences in fundamental frequencies for tone onset and offset. 1 Introduction Cantonese, a dialect spoken in H ong Kong, M acau and the province of G uangdong in China, is one of the world’s many tone languages, which use differences in the pitch of the speaker’s voice to distinguish one word from another (Bauer & Benedict, 1997). * Acknowledgments: The research was supported by a grant from the Health Services Research Committee (G rant N umb er 831019). We thank D r. G odfrey H arrison for his helpful comments on an earlier draft of this paper, the two referees and D r. M arilyn Vihman for their detailed and thorough review which improved the paper substantially, M s. Pheobe Lo for recruiting subjects, M s. Livia Wong for data collection, M s. Pamela Cheung for drawing pictures and M s. Johanna Barry for the final editing work. We are also grateful to all the parents and children who took part in the study. Address for correspondence: Prof. C. A. van H asselt / Kathy Y. S. Lee, D ept. of Surgery, The Chinese U niversity of H ong Kong, Prince of Wales H ospital, Shatin, H ong Kong; e-mail: < [email protected]> / < [email protected]> . Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 388 Tone perception ability of Cantonese-speaking children There are six contrastive tones in Cantonese (M atthews & Yip, 1994); they are distinguished by fundamental frequency (F 0) in terms of height, contour and direction (G andour, 1981). U sing the computer software D r. Speech (D r. Speech — software group, Tiger D R S, 1998), F igure 1 was constructed to illustrate the different fundamental frequency patterns of the six tones spoken by a female experimenter in this study. Figure 1 The different fundamental frequency patterns of the six Cantonese tones on the vowel /a/ As shown in F igure 1, Tone 1 is high level. H owever, it can also take the form of a high falling contour. According to Chao’s interpretation (1947), the high level and high falling fundamental frequency patterns of Tone 1 are allophones of the high level tone. Tone 2 is high rising while Tone 5 is low rising. Tones 3 and 6 are mid and low level tones, respectively. Tone 4 is low falling. Previous studies of Cantonese tones in children have mainly focused on acquisition and perception. Based on the principle that comprehension precedes production in child language development, a child who produces tones correctly should also be able to perceive them correctly. However, a comparison of research data from the perception and production studies reveals a marked discrepancy in observations. Several studies have demonstrated that children can produce all six tones in Cantonese by age two (D odd & So, 1994; Tang & M aidment, 1996; Tse, 1978), but they have been found to be unable to identify them all correctly before the age of 10 (Ching, 1984; Ching, 1990; Lui, 2000). One possible explanation for the discrepancies between the two study types may be different data collection methods. Speech acquisition studies generally involve the collection of spontaneous speech samples; hence they do not require children to follow Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 K. Y. S. Lee, S. N. Chiu, and C. A. van Hasselt 389 difficult test instructions. Tones produced in speech samples may be vocalizations that do not form meaningful words. In perception studies, on the other hand, participants have to follow specific test instructions. The testing requires the child to overcome stranger anxiety, to be compliant to testing instructions, to be cooperative and to have a good attention span. A stimulus-response test paradigm which requires the child to point to a picture / word card after the presentation of the stimulus has the significant limitation that a reliable response pattern can only be consistently established for child ren aged fro m 30 – 35 m ont hs and older (H o dgson , 1994; M cCor m ick, 1992). F urthermore, such a task is linguistically demanding and is only appropriate for use with participants who are already familiar with the words in the task. There is thus a significant limitation on the age and type of child who can be recruited to participate in such perception studies. Four-year-old children have so far been the youngest age group to have participated in tone perception studies with Cantonese (Aisha, 2000; Ching, 1984; Ching, 1990; Lui, 2000). The present study aimed at investigating the tone perception abilities of threeyear-olds and further aimed to expand current understanding of children’s tone perception abilities. This age range was considered suitable for such testing since M cCormick (1992) and H odgson (1994) have found that this age group can perform reliably on the stimulus-response task. Inclusion of all six tones was considered too difficult for children in this age range. Thus fewer tones were used to ensure successful test completion and reliable response rates. A number of studies have revealed that three tones are produced earlier and are easier to identify (Cheung, 1995; Ching, 1988; Ching, 1990; Fok, 1984; Lee, Cheung, Chan, & van H asselt, 1997; Tse, 1978; Varley & So, 1995). They are high level (T1), high rising (T2) and low falling (T4) tones. The relative ease of identification reflects the fact that they have higher frequencies of occurrence, have relatively higher intensity levels and are perceptually more salient in terms of their marked differences in the contour and pitch heights (Ching, 1984; Fok, 1974). We will refer to the three tones as basic. The three tone contrasts— T1/ T2, T1/ T4 and T2/ T4 —were formed by combining the three basic tones. We were interested in determining their relative ease of identification. U sing the two dimensions of pitch height and contour identified by G andour (1981, 1983), tone identification difficulty has been addressed in a number of investigations. In studies with hearing-impaired children Fok (1984) adopted a high falling contour as her T1 stimulus to contrast with T2 (different contour) and T4 (different height). Children found it easier to identify pitch height (T1 / T4) than contour movements (T1/ T2). Ching (1988), also working with impaired hearing participants, applied ’sequential information analysis’ and reported that pitch height accounted for most of the information transmitted. Thus both Fok (1984) and Ching (1988) concluded that pitch height was perceptually more salient than pitch contour for listeners with impaired hearing. U sing the same sequential information analysis on alaryngeal speech with normal hearing adults as listeners, Ching, Williams, & van H asselt (1994) found that pitch contour was the more salient cue, however. As for the studies with normal hearing participants, Ching (1984) did not analyze the two dimensions in detail but pointed out that children confused tones 3 and 6 due to the similarity of pitch height, while the difficulty with tones 5 and 6 stemmed from Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 390 Tone perception ability of Cantonese-speaking children a confusion of pitch contour. Based on Ching’s (1984) study, the data of Lui (2000) showed that while some tone pairs involving differences in pitch height had higher scores (T1/ T6, T1 / T3), others (T 3 / T6, T2 / T5) had lower scores when compared with tone pairs differing in contour (T1 / T2, T5 / T6, T4 / T5, T4 / T6). For adults, Varley and So (1995) found that tone pairs with similar onset pitch heights and / or pitch contour resulted in comparable error rates. They concluded that both dimensions were equally important for tone perception. In this study, T1 / T2 share the same pitch height but different pitch contour (level vs. rising) while T1 / T4 and T2 / T4 differ in both height and contour. We expected that tone identification on T1 / T2 would be more difficult than the latter two contrasts as T1 / T2 differ in one dimension only. When testing young children the linguistic factor tends to confound results (Tyler, 1993). In picture identification tasks children need to be able to relate each word to the appropriate picture (Barton, 1980). The syllable /ji/ occurs with all six tones in Cantonese and is commonly used for tone identification studies (Aisha, 2000; Ching, 1984; Ching, 1988; Ching, 1990; Fok, 1984; Lui, 2000). H owever, the validity of its use when testing young children is questionable since not all children are equally familiar with all six words, for example, /ji 1 / ‘clothes’, /ji 2 / ‘chair’ and /ji 4 / ‘son’ are used in written Chinese but are not colloquial Cantonese terms and could be foreign to young children. Others, like /ji 3/, are only meaningful when combined with other syllables to form multisyllabic words such as / ji3 tai6 lei6 fán2 / ‘spaghetti’. The database compiled by F letcher, Stokes, Leung, & Weizman (2000) included 332 different word tokens produced by five two-yearolds and 594 from 10 three-year-olds. The word /ji/ occurred on the list only with T6 (meaning ‘two’), suggesting that the /ji/ stimuli were outside children’s common experience. Even if children undergo a training session to learn all the words before the test commences, the distortion due to differences in linguistic familiarity will remain. Such effects have been found to be closely related to the number of errors made in perception tasks (Barton, 1980). One way to minimize the effect is to use stimuli that stay within the participants’ receptive lexical inventory. The lexical advantage for linguistically more precocious children will thus be minimized. Another way to control differences in word familiarity among test participants is to use nonwords as stimuli. When presenting the test stimuli, children are forced to respond using perceptual ability alone. Words and nonwords have been widely used in the literature to examine phonological processing, lexical representation and memory (Bowey, 1997; G athercole & Baddeley, 1997; G erken, M urphy, & Aslin, 1995; M cKone, 1995; Pitt, 1995). Similar studies on tone perception are rare, however. One such study was by Cutler and Chen (1997), who compared the performance of D utch and Cantonese listeners making samedifferent judgments on Cantonese words and nonwords. The assumption was that linguistic knowledge would facilitate tone processing, and native speakers of Cantonese would perform significantly better than non-native speakers on the task. The hypothesis was not supported, however, as the two groups performed similarly. By contrast, Lee, Vakoch, and Wurm (1996) did find a positive effect for linguistic experience on tone processing. U sing Cantonese, M andarin and English speaking adults as respondents, they found that native speakers were more successful at discriminating tones from their own languages for both words and nonwords. Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 K. Y. S. Lee, S. N. Chiu, and C. A. van Hasselt 391 The different conclusions arrived at the two studies may be related to the time factor. The procedure used by Cutler and Chen (1997) was a speeded-response task where only two seconds were allowed for participants to make responses. The interval between presenting the two stimuli and making a same-different judgment was 400 ms. In Lee, Vakoch, and Wurm’s study (1996), no time limit was imposed on participants for making a response though they were asked to respond as accurately and quickly as possible. The interstimulus interval was either five seconds or immediate presentation. Lee, Vakoch, and Wurm (1996) reported significantly higher error rates with shorter interstimulus interval. The 400 ms interstimulus interval used by Cutler and Chen (1997) was even shorter than five seconds used by Lee, Vakoch, and Wurm (1996). Test participants thus had less time for storing the first stimulus for later comparison. The imposition of the two second rule for making a response may also have effectively inhibited participants’ access to linguistic representations, hence masking any positive effects due to lexical knowledge. There have been no studies using both words and nonwords to examine the role of linguistic knowledge in Cantonese tone perception for children. Clumeck (1980) investigated tone recognition abilities and attempted to find minimal pairs of real words that were familiar to the children learning Mandarin. However, he failed to find one with the rising tone to contrast with the low-dipping tone. As a result, he used a nonword instead. H is two participants were unable to distinguish the tone pair. On being tested for other tone contrasts where words were used, correct identification was consistently exhibited. It was unclear what effect the use of a nonword had in contributing to the observation that the rising and low-dipping tone distinction was difficult to perceive. As far as the authors are aware, no further studies using words and nonwords for testing tone perception in M andarin have been published. The effect of linguistic knowledge on tone processing remains unclear for Cantonese-speaking children as well. Based on the fact that children master tone production by the age of two, we believe children should be able to reliably identify all tones in Cantonese at less than four years of age, and that it is due to a limitation in research methodology that young children’s tone perception ability has previously been underestimated. In this study we aimed to test the tone perception abilities of children as young as three by limiting the number of tones from six to three and by controlling for the confounding variable of linguistic knowledge. The latter was achieved by using words within the child’s vocabulary and by including a series of nonwords as test stimuli. The use of word and nonword stimuli was also important for determining the effect of linguistic knowledge on tone processing. Three main goals motivated this research: (1) to determine if the research design was suitable for assessing the tone perception ability of children younger than four years of age; (2) to determine if lexical or nonlexical items are better for assessing tone perception, and (3) to determine the relative ease of identifying the three basic tone contrasts of T1 / T2, T1 / T4, and T2 / T4. Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 392 Tone perception ability of Cantonese-speaking children 2 Method 2.1 Subjects Participants were recruited on a voluntary basis from preschool centers and from among children who had previously been involved in a screening program for otitis media with effusion (Tong, Yue, Ku, Lo, & van H asselt, 2000). The selection criteria were: (1) age range from 2;09 to 3;03; (2) Cantonese as first language; (3) no abnormalities in overall cognitive development reported by teachers or parents; and, (4) normal hearing and language development as assessed during hearing and language screenings conducted by qualified audiologists and speech therapists. For hearing screening, children were tested with tone frequencies at 500, 1000, 2000, and 4000H z at 20dB H L (A scale). For language screening, the Cantonese Receptive Vocabulary Test (Lee, Lee, & Cheung, 1996), a test standardized on 609 Cantonese-speaking children, was used. The passing criterion was that the receptive vocabulary age equivalent score was greater than or equal to the child’s chronological age. Among the recruited participants (n = 71), 32 (45%) fulfilled the criteria. Of these, one child did not complete testing, leaving a total of 16 girls and 15 boys in the participant pool. 2.2 Test stimuli Tone pairs. Three tone pairs were selected. These were T1 / T2, T1 / T4, and T2 / T4. Tone 1 wa s presented as a high level tone throughout the test. Each tone pair was presented using both word and nonword stimuli. Word stimuli. A list of tone pairs was developed to meet three criteria: (1) minimal pairs contrasting in tone only, (2) familiarity to three-year-old children, and (3) readiness for representation in picture format. A total of 32 pairs involving 49 words were suggested (fifteen T1 vs. T2, nine T1 vs. T4, and eight T2 vs. T4). Out of the 32 pairs, 15 pairs were made into five trios using 15 words. The remaining 17 pairs involved 34 words. The words were reviewed by two speech therapists with six or more years of experience in managing pediatric speech and language cases who agreed that the pool could be regarded as an exhaustive list and no more pairs could be added. The minimal pairs of words were then represented in line drawings m easuring 15 cm ´ 12 cm. Appendix 1 displays the list of test items. Nonword stimuli. The same vowel /a/ in the three basic tones were used in the nonword stimuli. The syllable structure of a vowel alone, instead of (consonant) - vowel- (consonant), was selected so as to avoid coarticulation variations that might arise from different consonant contexts. Mode of presentation. Since test participants were three-year-olds with relatively limited language ability and attention span, we chose live voice presentation to maximize response success rate. Children have been shown to perform significantly better with live voice presentation than with recorded speech (K irk & Lento, 2000). Varley and So (1995) also commented that participants with a range of cognitive factors such as difficulties with attention control respond better to live than to recorded presentations. Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 K. Y. S. Lee, S. N. Chiu, and C. A. van Hasselt 393 The use of live voice mode made it possible to present the stimuli in isolation. Tone represents relative pitch information and listeners need to access the speaker’s overall fundamental frequency range before making a correct tone perception judgment. For this reason, stimuli used in recorded mode need to be presented in framed sentences. The necessity of using a sentence does not apply in the case of live voice presentation since participants have had sufficient opportunity to interact with the speaker to accommodate to his/ her relative pitch range before having to make tone perception judgments. An advantage of using stimuli in isolation is that pitch differences among tones are greater in citation form. Fok (1974) found that tones spoken in isolation covered a wider frequency range than those spoke within a sentence frame since speakers were more conscious of the tonal opposition in the former case. The use of tone in isolation also overcomes the confounding variable of intonation patterns such as the sentence-final tone lowering effect (Vance, 1976) and increases in tone levels in emphatic speech (Fok, 1974). Citation form tones would thus be predicted to be easier for young children to label correctly due to their reduced pitch variability and greater between-tone pitch differences. 2.3 Test procedures The test protocol involved three stages: (1) word comprehension check, (2) tone test using words, and (3) tone test using nonwords. 2.3.1 Stage 1— Comprehension check Any speech perception test involving the use of words has to ensure that a participant’s failure to identify cannot be ascribed to his limited vocabulary (H nath-Chisolm, & Boothroyd, 1998). There was no Cantonese database to indicate the extent of children’s vocabulary in the age range of this study. The comprehension check was designed to control for children’s vocabulary knowledge. The 49 words to be used as stimuli were arranged into 13 test plates, each comprised of four line drawings depicting the four words. Words in each test plate were arranged so that they represented different tones and different phonemic segments. In the test children were asked to point to the named pictures. The results obtained in the comprehension check formed the basis for selecting items for the second stage of the study. The item list comprising the comprehension check is provided in Appendix 2. 2.3.2 Stage 2 —Tone test using words At this stage, the “tone test form,” which consisted of 32 minimal pairs, was employed in test administration. U sing the results from the preceding comprehension test, the only items administered were those for which the child knew both words in the minimal pair contrast. While each tone pair was randomly presented in five trials with the target words, the actual number of total trials varied from child to child depending on the number of available testable pairs. Table 1 (overleaf) summarizes the number of testable pairs for each participant based on the comprehension check. For instance, one child could be tested on seven test pairs for the T1 / T2 contra st; the total number of trials he Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 394 Tone perception ability of Cantonese-speaking children attempted for the T1 / T2 contrast was thus 35 (7 pairs ´ 5 trials each). Another child could not be tested for any T1 / T4 or T2 / T4 contrast; this child was not tested on the two contrast types. D uring the test, the experimenter read aloud selected tone pairs from the tone test form. The participants pointed to the pictures that they believed represented the words. For each tone pair, each child was scored based on the number of correct responses out of the total number of trials presented. TABLE 1 Number of testable pairs based on the comprehension check Tone pairs Number of testable pairs H igh level versus high rising (T1 / T2) 3 4 5 6 7 Number of subjects 11 16 3 0 1 Total 31 H igh level versus low falling (T1/ T4) 0 3 4 5 1 13 15 2 Total 31 H igh rising versus low falling (T2 / T4) 0 1 2 3 4 5 1 6 2 10 11 1 Total 31 Stage 3 —Tone test using nonwords As in the second stage, the three tone pair contrasts (T1 / T2, T1 / T4, and T2 / T4) were used, but nonwords instead of words were used as test stimuli in this stage. Each tone pair contrast (T1 / T2, T1 / T4, and T2 / T4), presented as a repeat of the vowel /a/ , was randomly presented in five trials, leading to a total of 15 trials (3 tone contrasts ´ 5 trials each). Three dolls from the toy set “M r. Potato H eads” were used as test materials. The dolls possessed distinctive facial expressions and costumes. Each doll was given a name corresponding to a tone. M ore precisely, the names were / a a / in tone 1, 2 or 4. Before the responses were scored, the names of the two dolls were each presented twice for the participants to learn. The children indicated understanding by either pointing to the Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 K. Y. S. Lee, S. N. Chiu, and C. A. van Hasselt 395 corresponding doll or by spontaneous imitation after the experimenter. After the practice trials, participants were asked to feed the named doll with a color marble. The order of presentation for the target stimuli within each tone pair was randomized. For each nonword tone pair, each child was scored based on the proportion of trials in which correct responses were made. 2.4 Order of presentation As children found the names of the dolls in the nonword condition highly confusing when presented repeatedly, an alternating presentation of word and nonword stimuli was used as follows: word stimuli T1 / T2 ® nonword stimuli T1 / T2 ® word stimuli T1 / T4 stimuli T1 / T4 ® word stimuli T2 / T4 ® nonword stimuli T2 / T4. ® nonword 2.5 Test environment The testing procedure wa s conducted in a soundproof booth. The two testers were qualified female speech therapists who had undergone a 30 mins training and practice session in monitored live voice presentation. The practice aimed at achieving production of all stimuli, regardless of tone, at 65dB (A scale) ± 5dB levels. The training target was achieved with the aid of a sound level meter (Bruel & K jaer 2238 M ediator integrating sound level meter — basic SLM software BZ 7126, Bruel, & K jaer Sound & Vibration Measurement A / S, N oerum, Denmark, 1998), which was also used to monitor voice intensity during testing sessions. 3 Results A logistic regression (Agresti, 1990; F isher & van Belle, 1993; Rasch, 1960) ln [pijk / (1 – pijk ) ] = a i + wj + Tk , was used to analyze the dichotomous data (success and failure) from all trials to examine whether the scores observed depended on the participants’ tone perception ability, the lexical status of the stimulus (i.e., word vs. nonword) or the tone contrast pair (T1 / T2, T1/ T4, and T2/ T4). In the model, pijk is the probability that the i th participant correctly identified the kth tone pair (k = 1, 2, and 3 for T1 / T2, T1 / T4, and T2 / T4 respectively) of words if j = 0 or nonwords if j = 1, a i is the tone perception ability measure for the ith participant and, wj and Tk are the difficulty measures of the j th lexical status and the kth tone pair, respectively. There were 31 participants and six types of questions (wo rd vs. nonword for T1 / T2, T1 / T4, and T2 / T4). Appendix 3 summarizes the scores obtained for the study sample (n = 31). U sing the G EN M OD procedure in SAS 8.0, a statistically significant difference in the participants’ ability to make correct responses was found (p < .0001). The probability pijk was influenced by the lexical status (p < .0001) as well as the combination of tone pairs (p = .0004). N o significant interaction was observed between the lexical status and combination of tone pair (p = .2850). Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 396 Tone perception ability of Cantonese-speaking children Table 2 summarizes the descriptive statistics for the participants’responses which were obtained from the logistic regression. We used the model’s predicted values because they smoothed the data and provided improved estimates. The highest and lowest possible scores for each participant were 1 and 0, respectively. TABLE 2 Descriptive statistics for the model-based scores for the study sample (n = 31) Tone Pairs Word Nonword H igh Level (T1) M ean 0.93 0.77 vs. H igh R ising (T2) M in 0.76 0.42 M ax 1.00 1.00 Range 0.24 0.58 SD 0.06 0.15 95% CI 0.91– 0.95 0.72– 0.83 H igh Level (T1) M ean 0.92 0.75 vs. Low Falling (T4) M in 0.73 0.39 M ax 1.00 1.00 Range 0.27 0.61 SD 0.06 0.16 95% CI 0.90– 0.94 0.69– 0.80 H igh R ising (T2) M ean 0.87 0.66 vs. Low Falling (T4) M in 0.62 0.28 M ax 1.00 1.00 Range 0.38 0.72 SD 0.09 0.20 95% CI 0.84– 0.91 0.59– 0.73 SD = standard deviation; CI = confidence interval. As shown in Table 2, the participants had a higher overall average probability of making correct responses for words than for nonwords. The ranges and the standard deviations for the scores indicated that intersubject variability was higher with nonwords than that with words. Likelihood ratio statistics were employed to examine intergroup differences between the three tone pairs in the logistic regression model. The differences between identifying T2/ T4 with T1/ T2 and with T1/T4 were significant (p= .0002 and .0028 respectively). The lowest mean score was obtained for the T2 / T4 tone contrast, indicating that this was the most difficult contrast to label correctly. N o significant difference was found when contrasting T1 / T2 and T1 / T4 identification (p = .4396). M ore precise fundamental frequency measurements were obtained to further investigate the observed differences in the relative ease with which the contrasts were identified. Table 3 summarizes the average F 0 values obtained for the onsets and offsets of the three Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 K. Y. S. Lee, S. N. Chiu, and C. A. van Hasselt 397 tones based on measurements on the vowel /a / averaged over 10 productions for each of the two female experimenters in this study. Values reported by Bauer and Benedict (1997) on two female Cantonese speakers in H ong Kong are included for comparison. TABLE 3 Average F 0 values at onset and offset for the three tones T1 ( HL) T2 ( HR) T4 ( LF) Average F0 at: onset offset onset offset onset offset Speaker 1 in this study 245.4 236.6 169.2 253.4 173.1 131.9 Speaker 2 in this study 290.8 298.9 200.3 293.8 197.7 158.3 Speaker 1 in Bauer & Benedict (1997) 245.2 266.7 218.8 285.2 223.0 178.0 Speaker 2 in Bauer & Benedict (1997) 255.4 331.9 205.4 346.5 206.7 186.5 TABLE 4 D ifferences in F 0 values at onset and offset for the three tone contrasts T1 / T4 T1 / T2 T2 / T4 Difference in F0 at: onset offset onset offset onset offset Speaker 1 in this study 72.3 104.7 76.2 16.8 3.9 121.5 Speaker 2 in this study 93.1 140.6 90.5 5.1 2.6 135.5 Speaker 1 in Bauer & Benedict (1997) 22.2 88.7 26.4 18.5 4.2 107.2 Speaker 2 in Bauer & Benedict (1997) 48.7 145.4 50.0 14.6 1.3 160.0 Table 4 shows the differences in F 0 values at onset and offset for the three tone contrasts. Substantial differences in F 0 values for both onset and offset were observed for the T1/ T4 pair. The T1/ T2 differed significantly in onset F 0 value but much smaller differences in F 0 offset values were measured. T2 / T4 has the reverse pattern, that is, a small onset F 0 difference but a marked difference in F 0 offset value. 4 Discussion 4.1 Overall tone perception performance This study has found an average probability of correct tone identification of 0.91, which is substantially higher than the means of 0.32 and 0.66 given in Ching (1984) and Lui Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 398 Tone perception ability of Cantonese-speaking children (2000). An attempt was made to establish the reasons for these differences by comparing the methodological approaches in the three studies (Table 5). TABLE 5 Comparison of the three related studies investigating Cantonese tone perception in normal hearing children Present study Ching, 1984 Lui, 2000 Age of subjects used for comparison 2;09 – 3;03 4;00 4;00– 4;11 N umber of subjects in the group 31 N ot specified 15 N umber of tones studied 3 6 6 In isolation In isolation Embedded in the middle of a carrier phrase In the child’s vocabulary inventory# The six tone contrasts on /ji/ The six tone contrasts on /ji/ M ode of presentation M onitored live voice Recorded presentation Recorded presentation M ode of response required from subjects Picture pointing# Picture pointing Picture pointing N umber of choices given in each test item 2 6 2 Overall tone identification score on all six tones No information 0.33 0.69 Overall tone identification score on T1, T2 and T4 0.91# 0.32 Around 0.66* Format of stimuli presentation Stimulus Words *F igure obtained by estimation on graph, exact figure not available. # Refer to the experimantal condition of using word stimuli in the present study. Lui (2000) based her study design on Ching (1984) and made two major modifications. F irstly, the number of choices given to the participants was reduced from six to two. Secondly, she presented the target word in a carrier phrase instead of in isolation. The superior tone identification score in Lui’s study suggests that modifications to the research design were well suited to the task abilities of four-year-olds. We believe that the use of the two-choice format in testing young children is an appropriate design since it greatly reduces the cognitive load required. The effect of reduced response choice and of presenting the target word in a carrier phrase can be explored by comparing the present study with Lui’s. Participants in this study had a higher average score suggesting that there was no advantage in using medial positioning for stimulus presentation as used by Lui (2000), that is, children seemed to Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 K. Y. S. Lee, S. N. Chiu, and C. A. van Hasselt 399 have no problem in extracting speakers’ relative pitch level during monitored live voice presentation even when the stimuli were presented in isolation. On the contrary, presenting in isolation may focus children’s attention on the stimuli and thus yield more accurate judgments. U sing words that were in the child’s inventory as test stimuli also had an important effect. We believe that employing a comprehension check to screen test stimuli was an effective means for controlling the confounding variable of vocabulary knowledge. As discussed in the introduction, the presentation of the six tones in Cantonese on the syllable /ji/ (Ching, 1984; Lui 2000) is not optimal. It is possible that children learned the /ji/ stimuli as new words in the course of testing. In other words, they were effectively nonwords in the receptive lexicons of the test participants. The close proximity of scores observed in Lui (2000) and the nonword stimuli in this study (0.66 vs. 0.73) offers some support for this hypothesis. The restriction in the number of tones tested to only three in the present study may also have contributed to the high perception scores observed. This relates directly to the number of items involved, reducing the attentional demands of the task. In summary, the superior tone performance in this study implies that a methodology involving a reduced number of tones, a two-choice format and the use of words within a child’s vocabulary inventory is well adapted to the test capacities of children as young as 2;09. 4.2 Lexical versus nonlexical items in tone perception Although there have been a few studies examining the effect of linguistic knowledge on Cantonese tone processing, none of them have involved children. With the design of word and nonword tokens in this study, the children’s superior performance on words provides strong evidence that lexical knowledge facilitates tone perception, which agrees with Lee, Vakoch, and Wurm (1996) but differs from Cutler and Chen (1997), who imposed time constraints on participants for making responses. Tonal information is hard to represent in working memory (Taft & Chen, 1992) and is auditorily coded in short term memory (Lee, Vakoch, & Wurm, 1996). One possible explanation for the superior response to words is that lexical information helps to code the rapidly fading tone information into long term memory. In the nonword response, no lexical information can be used as a retrieval cue. Although, following Peynircioglu (1995), the nonword vowel /a/ was presented in the form of reduplicated syllables to assist short term memory, the scores in the nonword condition were still much lower than those in the word condition. One may speculate that young children do not treat tones separately from segmental information, as an abstract representation (Taft & Chen, 1992). Instead, tone by itself is treated as an aspect of the syllable as a whole and may consequently be difficult to separate, indeed may be inseparable, from the segmental information and hence not readily stored for future retrieval separate from a specific lexical carrier. H owever, we need to interpret the superior scores on lexical over nonlexical items with caution. The different outcomes from the two conditions might simply reflect research design constraints. In the nonword environment, children were only permitted two trials for learning the dolls’names. Some children may have had difficulty memorizing Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 400 Tone perception ability of Cantonese-speaking children the names in the time permitted. If this was the case then the results do not reflect perception problems. In fact, it is interesting to note at this point that some of the children did indeed point to the incorrect doll while at the same time successfully imitating the tone produced by the experimenter. F urthermore, Barton (1980) noted that experimental tasks involving nonsense syllables were too difficult for young children. It is thus not entirely clear to what extent failures of task performance were due to task difficulty rather than tone perception difficulty. It is possible that task problems of this nature could be addressed by increasing the number of practice trials allowed, perhaps to the point where the children can name the dolls spontaneously. Successful attempts at such training on children between 3 and 4.5 years have been reported by G raham and H ouse (1971) and Barton (1980). Judging from the superior performance in the word condition and the related problems of utilizing nonwords, we believe that the former is a better choice in studying young children’s tone perception ability, provided that the words used are in the child’s inventory. 4.3 The relative ease of identifying the three basic tones The relative importance of pitch height and contour in tone perception as demonstrated by G andour (1981; 1983) has been a major focus for researchers in the field. N evertheless, no conclusive result can be arrived at based on studies of both the normal-hearing and the hearing-impaired population (Ching, 1984; Ching, 1988; Fok, 1984; Lui, 2000; Varley & So, 1995). Applying the two dimensions in analyzing the three tone contrasts in this study suggests that T1 / T4 and T2 / T4 should be easier than T1 / T2 as they differ from each other both in terms of pitch height and contour. The prediction wa s only partially supported, however. While children found T1 / T4 easy to perceive, the correct tone identification on T2 / T4 was much lower. The use of the generic terms pitch height and contour appears to be unsatisfactory in explaining the relative ease of tone identification. We therefore analyzed the tones in terms of a more precise fundamental frequency measurement and put forward the hypothesis that the greater the difference in fundamental frequency between the two tones, the easier it would be for children to make correct identification. U nder this hypothesis the relative ease of identifying T1/ T4 is understandable in view of the marked discrepancies in pitch at both onset and offset. Performance comparable to that found on T1/ T4 was noted on T1/ T2 identification. There is a significant difference in the value of F 0 at onset on both tone contrasts — over 70 and 90 H z in speakers 1 and 2, respectively. The marked difference in F 0 onset may be the reason for the relative ease of identification in the two tone contrasts. As for T2 / T4, scores are consistently lower than those in the other two pairs. The values of F 0 at onset, unlike those for T1 / T4 and T1 / T2 contra sts, are very similar. The difference is within 4 H z only across the two speakers. The similarity in F 0 onset may result in decreased tone discriminability, which is not diminished by subsequent changes in contour shape and offset differences. Children’s overall performance on this task suggests that the similarity of F 0 at onset has a more important impact on tone identification than that of F 0 at offset. The role of differences in tone onset and offset in determining tone perception performance merits further exploration. Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 K. Y. S. Lee, S. N. Chiu, and C. A. van Hasselt 401 5 Conclusions In this study, we have simplified the methodologies used for testing tone perception so that the perception abilities of children as young as three years of age can be tested. This has, in part, involved including a word comprehension test to exclude words that are not in the child’s receptive vocabulary. By controlling for linguistic knowledge we were able to deploy the time saved by not testing unfamiliar words on examining perception of more tone pairs in the experimental setting. Children were observed to be more successful at perceiving tones on words than on nonwords. Although we are not entirely sure if the difference was due to the experimental design, we believe the inherent problems in using nonwords may unduly bias the data. We, hence, suggest utilizing words as stimuli in further studies concerning young children’s tone perception ability. The use of meaningful words instead of nonsense syllables is also recommended for training items for learning Cantonese tone. F inally, we found that children are equally successful at identifying contrasts between T1 / T2 and T1 / T4, but have more difficulty on the T2/ T4 contrast. We propose that the relative similarity of F 0 at onset is more important in predicting ease of perception of tone contrasts than is that of similar F 0 at offset. The hypothesis certainly needs further verification in future research with larger sample sizes and with more tone contrasts. Received: March 6, 2001; revised: February 6, 2002, September 12, 2002, December 4, 2002; accepted: December 23, 2002 References AG R ESTI, A. (1990). Categorical data analysis. N ew York: John Wiley & Sons. AISH A, R. (2000). Tone perception of Cantonese-speaking children with cochlear implant. Bachelor of Science (Speech and Hearing Sciences): F irst degree dissertations. Volume. 1. H ong Kong. BARTON, D. (1980). Phonemic perception in children. In G. H . Yeni-Komshain, J. F. Kavanagh, & C. A. Ferguson (Eds.), Child phonology, Volume 2, Perception. N Y: Academic Press. BAU ER , R . S., & BEN ED ICT, P. K . (1997). Modern Cantonese phonology. Berlin: M outon de G ruyter. BOWEY, J. A. (1997). What does nonwo rd repetition measure? A reply to G athercole and Baddeley. Journal of Experimental Child Psychology, 67(2), 295 – 301. CH AO, Y. R . (1947). Cantonese primer. H arvard: H arvard U niversity Press. CH EU N G, P. S. P. (1995). A speech therapy activity book on Hong Kong Cantonese phonology. H ong Kong: Transaudio. CH IN G, T. Y. C. (1984). Lexical tone pattern learning in Cantonese children. Language Learning and Communication, 3(3), 243– 414. CH IN G, Y. C. (1988). Lexical tone perception by Cantonese deaf children. In I. M . Liu, M . J. Chen, & H . C. Chen (Eds.), Cognitive aspects of the Chinese language, 1, 93–102. Hong Kong: Asian Research Service. CH IN G, Y. C. (1990). Tones for profoundly deaf tone-language speakers. CUHK. Papers in Linguistics, 2, 1 – 22. CH IN G, T. Y. C., WILLIAM S, R ., & van H ASSELT, A. (1994). Communication of lexical tones in Cantonese alaryngeal speech. Journal of Speech and Hearing Research, 37, 557– 571. Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 402 Tone perception ability of Cantonese-speaking children CLU M ECK , H . (1980). The acquisition of tone. In G. H . Yeni-Komshain, J. F. Kavanagh, & C. A. Ferguson (Eds.), Child phonology, Volume 1, Production. New York: Academic Press. CU TL ER , A., & CH EN, H . C. (1997). Lexical tone in Cantonese spoken-word processing. Perception and Psychophysics, 59(2), 165 – 79. D OD D, B. J., & SO, L. K. H . (1994). The phonological abilities of Cantonese-speaking children with hearing loss. Journal of Speech and Hearing Research, 37, 671 – 679. F ISH ER , L. D., & Van BELLE, G. (1993). Biostatistics. A methodology for the health sciences. New York: John Wiley & Sons. F LETCH ER , P., STOK ES, S., LEU N G, C. S., & WEIZM AN, Z. (2000). A guide to Cantonese language development. Report to the H ong Kong G overnment Language F und, January. F OK , A. C. Y. Y. (1984). The teaching of tones to children with profound hearing impairment. British Journal of Disorders of Communication, 19, 225– 236. F OK , C. Y. Y. (1974). A perceptual study of tones in Cantonese. H ong Kong: U niversity of H ong Kong. GAND OUR, J. (1981). Perceptual dimensions of tones: Evidence from Cantonese. Journal of Chinese Linguistics, 9, 20 – 36. GANDOUR, J. (1983). Tone perception in Far Eastern languages. Journal of Phonetics, 11, 149–175. G ATH ERCOLE, S. E., & BAD D ELEY, A. D. (1997). Sense and sensitivity in phonological memory and vocabulary development: A reply to Bowey (1996). Journal of Experimental Child Psychology, 67(2), 290– 94. G ER K EN, L., M U R PH Y, W. D., & ASLI N, R . N. (1995). Three-and four-year-olds’perceptual confusions for spoken words. Perception and Psychophysics, 57(4), 475– 486. G R AH AM , L. W., & H OU SE, A. S. (1971). Phonological oppositions in children: A perceptual study. Journal of the Acoustical Society of America, 49, 559– 566. H NATH -CH ISOLM , T., & BOOTH ROYD, A. (1998). Age-related changes on a children’s test of sensory-level speech perception capacity. Journal of Speech Language and Hearing Research, 41, 94 – 106. H OD GSON, W. R . (1994). Evaluating infants and young children. In J. K atz (Ed.), Handbook of clinical audiology. 4th ed. Baltimore: Williams & Wilkins. K IR K , K . I., & LEN TO, C. L. (2000). Assessing spoken word recognition in children: Effects of presentation mode and age at testing. Presented at the 5th European Symposium on Pediatric Cochlear Implantation, June 4 – 7, Antwerp, Belgium. LEE, K . Y. S., CH EUN G, D. M . C., CH AN, B. Y. T., & van H ASSELT, C. A. (1997). Cochlear implantation: Implication for tone language. In I. H onjo & H . Takahashi (Eds.), Cochlear implant and related sciences update. Advances in Oto-rhino-laryngology(pp.254– 257). Basel: Karger. LEE, K . Y. S., LEE, L. W. T., & CH EU N G, P. S. P. (1996). Hong Kong Cantonese Receptive Vocabulary Test. Hong Kong: The H ong Kong Society for Child H ealth and Development. LEE , Y. S., VAKOC H , D. A., & WU R M , L. H . (1996). Tone perception in Cantonese and Mandarin: A cross-linguisticcomparison. Journal of Psycholinguistic Research, 25(5), 527–542. LUI, J. Y. Y. (2000). Cantonese tone perception in young children. Bachelor of Science (Speech and Hearing Sciences): F irst degree dissertation. Volume 3. H ong Kong: U niversity of H ong Kong. M ATTH EWS, S., & YIP, V. (1994). Cantonese: A comprehensive grammar. London: Routledge. M cCOR M ICK , B. (Ed.). (1992). Behavioral hearing tests six months to five years. Pediatric Audiology 0 – 5 years, 97 – 115. London: Whurr. McKONE, E. (1995). Short-term implicit memory for words and nonwords. Journal of Experimental Psychology: Learning, Memory and Cognition, 21(5), 1108 – 1126. PEYN IRCIOG LU, Z. F. (1995). Covert rehearsal of tones. Journal of Experimental Psychology: Learning, Memory and Cognition, 21(1), 185– 192. PITT, M . A. (1995). The locus of the lexical shift in phoneme identification. Journal of Experimental Psychology: Learning, Memory and Cognition, 21(4), 1037 – 1052. Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 K. Y. S. Lee, S. N. Chiu, and C. A. van Hasselt 403 R ASCH , G. (1960). Probabilistic models for some intelligence and attainment tests. Copenhagen: N ielson and Lydiche. TAF T, M ., & CH EN, H . C. (1992). Judging homophony in Chinese: The influence of tones. In H . C. Chen & O. J. L. Tzeng (Eds.), Language processing in Chinese. Amsterdam: N orthH olland. TAN G, J. S. Y., & M AID M EN T, J. A. (1996). Prosodic aspects of child-directed speech in Cantonese. Speech-Hearing-and-Language, 9, 257 – 276. TON G, M . C. F., YU E, V., K U, P. K. M ., LO, P. S. Y., & van H ASSELT, C. A. (2000). Screening for otitis media with effusion to measure its prevalence in Chinese children in H ong Kong. Ear Nose and Throat Journal, 79(8), 626– 630. TSE, J. K. P. (1978). Tone acquisition in Cantonese: A longitudinal case study. Journal of Child Language, 5, 191 – 204. TYLER , R. S. (1993). Speech perception by children. In R . S. Tyler (Ed.), Cochlear implants: audiological foundations. San D iego, CA: Singular. VAN CE, T. J. (1976). An experimental investigation of tone and intonation in Cant onese. Phonetica, 33, 368– 392. VAR LEY, R ., & SO, L. K. H . (1995). Age effects in tonal comprehension in Cantonese. Journal of Chinese Linguistics, 23(2), 76 – 97. Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 404 Tone perception ability of Cantonese-speaking children Appendix 1 Item list in word stimuli Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 K. Y. S. Lee, S. N. Chiu, and C. A. van Hasselt 405 Appendix 2 Item list in the comprehension check Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014 406 Tone perception ability of Cantonese-speaking children Appendix 3 The scores (number of correct responses/ number of trials) obtained by the 31 participants in this study Word Participant Nonword T1 / T2 T1 / T4 T2 / T4 T1 /T2 T1 / T4 T2 / T4 1 15 / 15 15 / 15 15 / 15 5/5 3/5 3/ 5 2 15 / 15 15 / 15 15 / 15 5/5 5/5 5/ 5 3 21 / 25 25 / 25 22 / 25 3/5 4/5 5/ 5 4 13 / 15 15 / 15 15 / 15 5/5 5/5 5/ 5 5 16 / 20 16 / 20 11 / 20 1/5 5/5 4/ 5 6 15 / 15 14 / 15 14 / 15 2/5 3/5 5/ 5 7 25 / 25 25 / 25 13 / 15 5/5 5/5 5/ 5 8 20 / 20 15 / 15 4/5 5/5 5/5 5/ 5 9 15 / 15 14 / 15 5/5 1/5 2/5 3/ 5 10 11 / 15 5/ 15 4/5 4/5 2/5 5/ 5 11 29 / 35 15 / 15 9/ 10 5/5 5/5 2/ 5 12 12 / 15 14 / 15 11 / 15 0/5 4/5 3/ 5 13 20 / 20 20 / 20 15 / 20 4/5 5/5 2/ 5 14 15 / 15 15 / 15 14 / 15 2/5 4/5 4/ 5 15 14 / 15 15 / 15 15 / 15 4/5 3/5 2/ 5 16 9 / 15 13 / 15 10 / 15 5/5 5/5 0/ 5 17 14 / 15 13 / 15 15 / 15 4/5 3/5 0/ 5 18 19 / 20 17 / 20 14 / 15 3/5 4/5 2/ 5 19 24 / 25 19 / 20 19 / 20 5/5 1/5 2/ 5 20 20 / 20 20 / 20 19 / 20 5/5 5/5 5/ 5 21 18 / 20 17 / 20 3/5 4/5 5/5 4/ 5 22 20 / 20 18 / 20 5/5 3/5 5/5 3/ 5 23 20 / 20 18 / 20 20 / 20 5/5 5/5 5/ 5 24 20 / 20 15 / 20 18 / 20 5/5 2/5 0/ 5 25 19 / 20 17 / 20 17 / 20 3/5 3/5 2/ 5 26 20 / 20 16 / 20 16 / 20 4/5 3/5 3/ 5 27 20 / 20 18 / 20 10 / 20 5/5 5/5 5/ 5 28 20 / 20 19 / 20 20 / 20 5/5 5/5 5/ 5 29 20 / 20 20 / 20 17 / 20 4/5 3/5 4/ 5 30 18 / 20 – 4/5 4/5 5/5 2/ 5 31 20 / 20 19 / 20 – 1/5 2/5 1/ 5 Language and Speech Downloaded from las.sagepub.com at Guangdong University of Foreign Studies on March 7, 2014
© Copyright 2026 Paperzz