Does Horse Activate Mother? Processing Lexical

C.-Y.– 123
Lee
LANGUAGE AND SPEECH, 2007, 50 (1), 101
101
Does Horse Activate Mother?
Processing Lexical Tone
in Form Priming
Chao-Yang Lee
School of Hearing, Speech and Language Sciences,
Ohio University, Athens, Ohio, U.S.A.
Key words
Abstract
Lexical tone languages make up the majority of all known languages of
the world, but the role of tone in lexical processing remains unclear. In
the present study, four form priming experiments examined the role of
lexical tone
Mandarin tones in constraining lexical activation and the time course of
the activation. When a prime and a target were related directly in form (e.g.,
Mandarin
lou3 ‘hug’ — lou2 ‘hall’), competitors that differed from the prime in tone
failed to be activated, indicating the use of tonal information to distinguish
spoken word
between segmentally identical words. When a prime and a target were not
recognition
form-related but were related through a third word that was not actually
presented (e.g., lou3 ‘hug’ — jian4zhu0 ‘building’, where lou3 is form-related
to lou2 ‘hall’, which was semantically related to jian4zhu0), a mismatch in tone prevented activation of minimal tone pairs at 250ms interstimulus interval (ISI) but did not prevent activation
at 50 ms ISI. These results indicate that tonal information is used on-line to reduce the number
of activated candidates, but does not prevent the minimal tone pairs from being activated in the
early phase of lexical activation.
form priming
1 Introduction
Recognizing spoken words constitutes the basis for auditory language comprehension.
To decode a spoken message, a listener extracts segmental and prosodic information
from the acoustic signal and maps it onto some form of internal representation in the
mental lexicon. Cognitive models of spoken word recognition such as Cohort (MarslenWilson, 1987), TRACE (McClelland & Elman, 1986), Neighborhood Activation model
(Luce & Pisoni, 1998), and Shortlist (Norris, 1994) all characterize the mapping as
an activation-competition process (Frauenfelder, 1996; Lively, Pisoni, & Goldinger,
1994; McQueen, Dahan, & Cutler, 2003). However, only Shortlist (Norris, 1994)
Acknowledgments: This research was supported in part by the Corinna Borden Keene Dissertation
Fellowship and by NIH grant DC00142 to Brown University. The author would like to thank
Sheila E. Blumstein for her guidance and Denis Burnham, Annett Schirmer, Arthur Samuel,
and Chilin Shih for their comments on earlier versions of the text.
Address for correspondence. Chao-Yang Lee, School of Hearing, Speech and Language Sciences,
Grover Center W225, Ohio University, Athens, OH 45701, U.S.A.; e-mail: <[email protected]>.
‘Language and Speech’ is © Kingston Press Ltd. 1958 – 2007
Language and Speech
102
Lexical tone in form priming
explicitly incorporated prosody as a cue to speech segmentation, which is a precursor
to word recognition. The absence of prosodic specifications in other models most
likely reflects the minor role of prosody for word distinctions in Indo-European
languages. For example, English has few stress minimal pairs such as FORbear and
forBEAR (Cutler, 1986) that contrast solely in lexical stress. For stress contrasts that
involve concurrent segmental changes, listeners may not need to rely on prosodic
information for word recognition.
However, lexically contrastive prosody is used extensively in lexical tone languages.
The information value of lexical tones may be too high to ignore in word recognition
(Cutler & Chen, 1997). In Mandarin Chinese, for example, each syllable (which normally
corresponds to a monosyllabic word or free morpheme) is assigned one of four tones.
Substituting the tone of a syllable changes the meaning of the word or morpheme. For
example, the syllable ma with Tone 1 (a high-level tone, denoted as ma1) means ‘mother’,
whereas ma2 (a mid-rising tone), ma3 (a low-dipping tone), and ma4 (a high-falling
tone) mean ‘hemp’, ‘horse’, and ‘to scorn’, respectively. In other words, lexical tone is
an integral part of a lexical entry as much as segmental structure is.
Since lexical tone is functionally phonemic, is tone used in a similar way as
segmental structure in processing spoken Mandarin words? Intuitively, lexical tone
must be implicated in Mandarin word recognition at some point since each syllable has
a distinct tone and listeners have to rely on tonal information to distinguish between
minimal tone pairs. Fox and Unkefer (1985) found a “lexical effect” on Mandarin
tone categorization (cf. Connine, Clifton, & Cutler, 1987; Ganong, 1980; McQueen,
1991). When presented with series of synthetic tokens that were ambiguous between
two tones, Mandarin listeners gave tone responses that would make an ambiguous
token a real word. Ching (1985) also showed that lip-reading Cantonese words is
much more accurate when fundamental frequency (F 0) information is available.
Nonetheless, the studies by Fox and Unkefer (1985) or Ching (1985) did not answer
how and when tone is implicated in lexical processing. The “off-line” nature of the
tasks did not address whether lexical tone is used as an “access code” in reducing the
number of candidates in the early phase of lexical activation, or as a “phonological
code” to be consulted for word identity verification only after lexical activation is
completed (Cutler, 1986).
A number of studies used “on-line” speeded-response tasks to compare tonal and
segmental processing. Cutler and Chen (1997) found that tone processing in Cantonese
is more error-prone than consonant or vowel processing. Listeners made more lexical
decision errors when presented with nonword items that differed from real words in
tone. In contrast, there were fewer errors when those nonword items differed from
real words in consonant or vowel. In a same-different judgment task, listeners were
also slower and less accurate when the mismatch between two stimulus items was
tone compared to consonant or vowel. Ye and Connine (1999) showed that vowel-tone
monitoring was slower when a stimulus item deviated from the target-bearing syllable
in tone. Cutler and Chen (1997) suggested that the processing difference is due to the
acoustic characteristics of prosodic and segmental information. In particular, tone
is processed later than segmental phonemes because tone is carried by vowels and
cannot arrive earlier than segmental information. Even though listeners process both
tonal and segmental information as soon as they become available, the temporallyLanguage and Speech
C.-Y. Lee
103
distributed nature of tonal information results in relatively late processing of tone. On
this account, prosodic information may not be processed as efficiently as segmental
information in spoken word recognition.
However, this conclusion has recently been challenged by electrophysiological
evidence showing comparable processing of tonal and segmental information. Schirmer,
Tang, Penney, Gunter, and Chen (2005) presented sentences that contained words
that were semantically congruous or incongruous with the sentential context. Eventrelated potentials (ERPs) elicited by the semantic incongruity as well as traditional
behavioral measures of response time and accuracy were taken. The incongruous
words were constructed by modifying the segmental structure, the tone, or both
segmental structure and tone of a word that would be congruous with the sentential
context. Participants were asked to judge whether or not the sentences they heard were
semantically congruous. The results showed, while the response to tonal violations
was less accurate, tonal and segmental violations elicited comparable time course
and amplitude of the N400-like negativity, a correlate of semantic incongruity. Since
traditional behavioral measures require an explicit response (e.g., pressing buttons),
the less accurate detection of tonal information as opposed to segmental information
found in previous studies (Cutler & Chen, 1997; Ye & Connine, 1999) may reflect
postaccess processes such as response strategies rather than the initial activationcompetition process (Schirmer et al., 2005).
Despite the issue of potential response strategies, much experimental data
regarding spoken word recognition has been accumulated based on behavioral
measures, which are sensitive to aspects of word processing (Schirmer et al., 2005). One
source of such evidence regarding the role of sound structure in spoken word recognition has come from form priming studies (Zwitserlood, 1996). In this experimental
paradigm, listeners are presented with a prime followed by a target that is related to
the prime in phonological form. Listeners are required to make a speeded response to
the target, and the latency and accuracy of the response are used to evaluate the effect
of the form relationship on word recognition. The widespread use of this paradigm
in the segmental literature has generated fairly mixed results and interpretations. In
particular, both facilitatory and inhibitory priming have been found (Zwitserlood,
1996). As Zwitserlood (1996) noted, the direction of form priming effects depends
on factors such as task, interstimulus-interval, proportion of related trials, lexical
status of primes, and position of overlap. In addition, response strategies have been
implicated in the interpretation of facilitatory priming results (Goldinger, Luce,
Pisoni, & Marcario, 1992). Despite these potential methodological issues, facilitation is
normally considered a prelexical effect arising from form-overlap between prime and
target (Radeau, Morais, & Segui, 1995; Slowiaczek & Hamburger, 1992; Slowiaczek
& Pisoni, 1986) and inhibition a lexically-based effect reflecting competition among
activated word candidates (Hamburger & Slowiaczek, 1996; Radeau, Morais, &
Dewier, 1989; Slowiaczek & Hamburger, 1992). Taken together, form priming is
likely to reflect the balance between form-based facilitation and inhibitory lexical
competition (Slowiaczek & Hamburger, 1992; Zhou & Marslen-Wilson, 1995).
Form priming has been used to explore the role of lexical prosody in spoken
word recognition. Cutler and Otake (1999) showed that Japanese words contrasting
minimally in pitch accent did not prime each other in an auditory lexical decision
Language and Speech
104
Lexical tone in form priming
task. The absence of priming was interpreted as listeners’ successful use of accentual
information to constrain lexical activation. That is, a mismatch in accentual information was used by the listeners to resolve segmental ambiguity in the initial activation
phase, inhibiting the prosodically incompatible candidate. The reasoning is that, if
pitch accent information were not used by the listener to rule out the other member
of the accent pairs, both members would have been treated as homophones and
would have primed each other (cf. Cutler, 1986). In a study of lexical stress in Dutch,
Cutler and Donselaar (2001) found that Dutch words contrasting minimally in lexical
stress did not prime each other. Following the same line of reasoning as in Cutler
and Otake (1999), Cutler and Donselaar (2001) interpreted this result as indicating
listeners’ use of stress information to inhibit activation of the incompatible member
of the stress pair. Similarly, Cooper, Cutler, and Wales (2002) in a study on lexical
stress in English found that a stress mismatch between a disyllabic auditory prime
and a visual target in English did not generate priming. This finding was likewise
interpreted as listeners’ successful use of stress information to inhibit the incompatible
member of the stress pair.
The use of form priming to address the role of lexical tone in word recognition
has been studied in Cantonese. Cutler and Chen (1995, also reported in Chen &
Cutler, 1996) examined the effect of form overlap between disyllabic Cantonese words.
Lexical decision latency was evaluated in prime-target pairs with mismatch in tone
or rhyme in the first or second syllable. The results showed that tone and rhyme had
similar priming effects but the effects varied as a function of the mismatch position:
When the tone or rhyme mismatch was in the second syllable (e.g., ji6liu4 — ji6liu5,
to4fa1— to4foo1), response was slowed down. When the mismatch was in the first
syllable (e.g., to4wa6 — to2wa6, si6yip6 — sue6yip6), response was faster. The authors
attributed the facilitation to prelexical form overlap and the inhibition to lexical
competition. In particular, when the mismatch was in the second syllable, the prime
and target were identical in the first syllable. Therefore, there was inhibition due to
lexical competition. Such inhibition did not exist when the mismatch was in the first
syllable, where facilitation was able to surface due to form overlap in the second syllable.
Using a shadowing task, Yip, Leung, and Chen (1998) examined priming in
monosyllabic Cantonese word pairs overlapping in onset, rhyme, and / or tone. They
found significant facilitation only when the prime-target overlap was in onset+ rhyme,
that is, when the prime and target were minimal tone pairs. Yip (2001) replicated the
1998 study with a smaller proportion of related trials (cf. Goldinger, Luce, Pisoni, &
Marcario, 1992) and found reliable facilitation when the overlap was in onset+ rhyme
or rhyme + tone. It was concluded that Cantonese listeners were more sensitive to
segmental than suprasegmental information, presumably because the tone overlap
did not consistently produce facilitation. On Cutler and colleagues’ account, however,
the onset+ rhyme (i.e., minimal tone pair) priming would not be expected if listeners
were able to use the mismatch in tone to inhibit activation of the tonally incompatible
member. In other words, the findings from Yip et al. (1998) and Yip (2001) were not
consistent with Cutler and colleagues’ notion that tone is part of the lexical “access
code” and that tonal contrasts are processed at the early activation phase of word
recognition (Cutler, 1997; Cutler & Chen, 1997, Cutler, Dahan, & van Donselaar,
1997). Can the findings from Yip et al. (1998) and Yip (2001) generalize to other
Language and Speech
C.-Y. Lee
105
lexical tone languages, suggesting a universal processing difference between lexical
prosodies, or are these results only applicable to Cantonese, suggesting languagespecific properties at work?
The current study extended previous form priming studies on lexical prosody
in three ways. First, existing form priming studies on lexical tone were all conducted
in Cantonese. A different tone language, Mandarin Chinese, was used in the present
study. Secondly, in addition to the direct priming task (Experiments 1 and 2) that
allows a direct comparison with previous studies, the mediated priming task was also
used in the present study (Experiments 3 and 4). In the direct priming paradigm, the
prime and target are directly related in form (e.g., ma3 ‘horse’ — ma1 ‘mother’); in
mediated priming, the prime is not directly related to the target in form (e.g., ma3
‘horse’ — fu4qin0 ‘father’, where 0 denotes an unstressed “neutral” tone), but is formrelated to a third word (ma1 ‘mother’) that is semantically related to the target but is not
actually presented. The motivation for using the mediated priming task was twofold.
First, mediated priming may be less susceptible to participants’ response strategies.
Since the prime and target in direct priming are directly related in form, it may be
easier for participants to detect the form relationships, which could invoke postaccess
processes. In mediated priming, the prime and target are not directly form-related. It
is less likely for participants to develop response strategies based on their detection
of the relationships. Furthermore, effects obtained with mediated priming are more
likely to have a lexical locus. Since the prime and target in mediated priming are not
directly related in form, any priming effect must be mediated through the activation
of a third word that is not actually presented (Zwitserlood, 1996).
Finally, the use of two interstimulus intervals (ISIs) allowed the present
experiments to address not only the role of tonal versus segmental overlap in lexical
activation, but also the time course of the activation. In both variants of the priming
task, two ISIs were used to evaluate the time course of tone processing. The 250 ms
ISI (Experiments 1 and 3) was a direct comparison with Yip (2001). The 50 ms ISI
(Experiments 2 and 4) was selected in light of previous priming studies in the segmental
literature suggesting that priming effects are rapid and short-lived. For example,
Andruski, Blumstein, and Burton (1994) found that subphonetic variation in voice
onset time had an effect on the magnitude of semantic priming at 50 ms but not at
250ms ISI. The time course of the priming effect at 50 versus 250ms ISI has also been
replicated in patients with aphasia (Utman, Blumstein, & Sullivan, 2001).
To evaluate the effects of tonal and segmental overlap in form priming, the primes
and targets selected in the present experiments had four types of relationships. In direct
priming (Experiments 1 and 2), the prime and target shared segmental structure (e.g.,
lou3 ‘hug’— lou2 ‘hall’), tone (e.g., cang2 ‘hide’— lou2 ‘hall’), both segmental structure
and tone (e.g., lou2 ‘hall’— lou2 ‘hall’), or did not overlap at all (e.g., pan1 ‘climb’— lou2
‘hall’). In mediated priming (Experiments 3 and 4), the prime was form-related to a
third word (not actually presented) that is related to the target. The prime and the
third word bore the same relationships as in direct priming, but the targets were
words that are semantically related to the third word (e.g., replacing the target in the
above examples with jian4zhu0 ‘building’). It was predicted that the complete overlap
(tone +segments) would produce facilitatory priming relative to the unrelated baseline.
In addition, if an overlap in tone or segmental structure was sufficient in activating
Language and Speech
106
Lexical tone in form priming
target representation (Experiments 1 and 2) or the mediating word (Experiments 3
and 4), facilitation would also be expected in these two conditions.
2 Experiment 1
In Experiment 1, the prime and target words had four types of form relationships.
They shared either segmental structure only (S) (e.g., lou3 ‘hug’— lou2 ‘hall’), tone
only (T) (e.g., cang2 ‘hide’— lou2 ‘hall’), both segmental structure and tone (ST) (e.g.,
lou2 ‘hall’ — lou2 ‘hall’), or were unrelated (UR) (e.g., pan1 ‘climb’— lou2 ‘hall’). In
this and the following experiments, all stimuli were presented auditorily. Participants
made lexical decisions on the target and the effect of the prime-target relationships
was evaluated by response time and accuracy.
2.1
Method
2.1.1
Participants
Forty graduate students at Brown University (age: 22 – 30 years; gender: 30 male, 10
female) participated in the experiment. The participants in this and the following
experiments are all native speakers of Mandarin. None reported history of speech or
hearing difficulties. Participants came from various geographical regions in China but
were all screened by the experimenter prior to the experiments to ensure Mandarin
is the first language for all participants.
2.1.2
Materials
Eighty monosyllabic Mandarin words were selected as targets. For each target, four
types of primes were identified: ST primes share both segmental structure and tone
with the target (e.g., lou2 ‘hall’ — lou2 ‘hall’); S primes share only segmental structure
with the target (e.g., lou3 ‘hug’— lou2 ‘hall’); T primes shares only tone with the target
(e.g., cang2 ‘hide’— lou2 ‘hall’); UR primes are not related to the target (e.g., pan1
‘climb’— lou2 ‘hall’). In addition to the word targets, 80 monosyllabic, pronounceable
nonwords were generated. They were paired with the same set of primes and served
as fillers. The complete set of stimuli is listed in Appendix A.
Since monosyllabic homophones are prevalent in Mandarin, an effort was made
to select stimuli from words that did not have homophones, based on a Mandarin
word frequency dictionary (Wang, 1986). The definition of a “word” in Chinese has
been controversial (Packard, 2000). Zhou and Marslen-Wilson (1994, 1995) provided
experimental evidence for a word-level representation in Chinese in addition to the
morpheme-level representation. As a working definition, a “word” in this study has
to have an independent entry and word frequency count in Wang’s (1986) dictionary.
A morpheme without a word frequency entry is therefore not counted as a “word.”
For example, lou3 ‘hug’ is not considered homophonous with the morpheme lou3 ‘a
container made from bamboo’ because the latter, even though a morpheme, does not
have an independent word frequency count in the dictionary. Frequency counts of
Language and Speech
C.-Y. Lee
107
the stimulus words were obtained from this corpus of 1.3 million words. The average
word frequencies for the stimulus words were 257 (ST), 282 (S), 284 (T), and 280 (UR).
There is no statistically significant difference among these averages.
Prime type was intended to be a within-subject factor. However, it was determined that participants were not to hear the same stimulus item more than once
during the experiment. To these ends, four stimulus lists were constructed. For a
given target, each of the four primes was assigned to a different list such that each
list would include all four prime types without repeating any stimulus item in a list.
Although the fillers were paired with the same set of primes, each of the four primes
was similarly assigned to a different list. If a prime was already assigned to a word
target in a given list, the prime would be assigned to a different list to pair with a
nonword target. Therefore, no primes or targets were repeated in any list. In sum, each
list included all 160 targets (80 words and 80 nonwords) with the four types of primes
equally distributed. Therefore, the proportion of related trials was 75% (cf. 75% in
Yip et al., 1998, and 50% in Yip, 2001). Each participant was randomly selected to be
tested on one list only. The four lists were presented equally often across participants
such that the presentation of lists was counterbalanced across participants. The ISI
between the prime and target was 250ms to allow a direct comparison with Yip (2001).
The intertrial interval was three seconds.
2.1.3
Procedure
The stimuli were recorded by a male graduate student at Brown University from
Beijing, China. The recording was made in a sound-treated room with a portable
digital recorder (Sony TCD-D7) and a high-quality microphone (Sony ECM-909).
The recorded speech was digitized onto a Windows PC at a sampling rate of 20kHz
and 14-bit quantization. The stimulus items were identified from the waveform display
of the Brown Lab Interactive Speech System (BLISS; Mertus, 2000) and were saved
as individual audio files. The audio files were then imported to a subject-testing
program, AVRunner of BLISS, for stimulus presentation.
Participants were tested individually in a sound-treated room. They sat in a
testing booth and listened to the stimuli presented by AVRunner through a pair of
headphones (KOSS PRO / 4XTC). The participants were instructed to judge whether
or not the second item in a stimulus pair was a real Mandarin word. They were
asked to respond by pressing the buttons labeled ‘YES’ and ‘NO’ on a custom-made
response box. They were also told to respond as quickly and accurately as possible.
The presentation of trials in a given list was randomized for each participant by
AVRunner. Prior to the test list, a practice list was presented including eight practice
trials (4 with word targets and 4 with nonword targets) to make sure the participants
understood the procedure. An experiment session took approximately 15 mins.
2.2
Results and Discussion
In the present and following experiments, response time, measured from target
onset, and response accuracy to word targets were recorded and exported to the
Language and Speech
108
Lexical tone in form priming
StatView® software for statistical analysis. Separate analyses of variance (ANOVAs)
were conducted of response time and number of errors, with prime type as the fixed
factor and subject and item as random factors. F statistics obtained from the analyses
by subjects (F1) and by items (F2) are both reported. When a main effect from the
ANOVAs was significant, the Bonferroni post hoc test was used for all six possible
means comparisons to keep the family-wise Type I error rate at 5%.
The top panel of Figure 1 shows the mean response times to targets preceded
by the four types of primes. One-way repeated measure ANOVAs revealed a main
effect of prime type: F 1 (3, 39) = 13.91, p <.0001; F 2 (3, 79) = 17.96, p <.0001. Pair-wise
means comparisons by the Bonferroni test showed significant facilitation (85 ms) by
ST primes compared to the UR baseline (p =.0001 by subjects, p <.0001 by items).
This repetition priming was expected, as ST primes were identical to the targets
in both tone and segmental structure. For S primes, which shared only segmental
structure with the targets, reliable facilitation was found only in item analysis (p <.05)
but not in subject analysis. The result from the subject analysis is in contrast to Yip
(2001), who showed that minimal tone pairs in Cantonese did produce significant
facilitation. For T primes, which shared only tone with the targets, no reliable
priming was found.
In addition to the priming effects relative to the baseline, multiple comparisons
also showed significant or near significant differences among the ST, S, and T conditions. In particular, ST was 64 ms faster than S (p <.05 by subjects, p =.06 by items)
and 129ms faster than T (p <.0001 by subjects and by items). S was 65ms faster than T
(p <.05 by subjects, p <.005 by items). These results suggest that listeners were sensitive
to the phonological difference among the three types of primes.
Participants made few errors in this experiment. The average numbers of errors
were 1.8 (ST), 2.2 (S), 2.9 (T) and 3.2 (UR) out of 20. The pattern of errors is similar
to that of the response time, indicating no tradeoff between speed and accuracy. Oneway repeated measures ANOVAs showed a main effect of prime type: F 1 (3, 39) = 5.37,
p <.005; F 2 (3, 79) = 5.59, p =.0001. Pair-wise comparisons showed significant differences
between ST and UR (p <.005 by subjects and by items) and between ST and T (p <.05
by subjects and by items).
In sum, the results from Experiment 1 showed that only a complete overlap
was sufficient to facilitate target response. Segmental or tonal overlap alone did not
facilitate target response reliably. However, it was possible that the priming effect
could not be detected by the use of the ISI. In particular, it has been suggested that
lexical activation takes place rapidly in the initial phase of word recognition and
fades away rapidly (e.g., Marslen-Wilson, 1987). At 250 ms ISI, potential activation
might have disappeared before it could make an impact on the target. To explore
this possibility, a shorter ISI (50 ms) was used in Experiment 2 to examine the time
course of form priming.
3 Experiment 2
The purpose of Experiment 2 was to examine whether any evidence of lexical activation by minimal tone pairs could be detected with a shorter ISI between the prime and
Language and Speech
Response Time (ms)
C.-Y. Lee
1150
Figure 1
1100
Top panel:
109
Mean lexical decision response times
(in milliseconds) with standard error
bars to targets preceded by primes
sharing segmental structure and tone
(ST), segmental structure only (S),
tone only (T), and neither segmental
structure nor tone (unrelated baseline,
UR) in Experiment 1 (250 ms ISI)
1050
1000
950
900
850
800
Response Time (ms)
ST
S
T
UR
1150
Bottom panel:
1100
Mean lexical decision response times
in Experiment 2 (50 ms ISI)
1050
1000
950
900
850
800
ST
S
T
UR
target. The experimental setup was identical to that of Experiment 1, except that the
ISI was 50 ms. If there was priming other than repetition priming at this shorter ISI,
it would suggest that partial overlap could activate the target but that the activation
is short-lived.
3.1
Method
3.1.1
Participants
Forty graduate students at Brown University (age: 22 – 30 years; gender: 23 male,
17 female) participated in the experiment. None of them had participated in
Experiment 1.
3.1.2
Materials
The materials were identical to those used in Experiment 1 except that the ISI used
for this experiment was 50 ms.
Language and Speech
110
Lexical tone in form priming
3.1.3
Procedure
The procedure was identical to that of Experiment 1 except that no additional
recording was made.
3.2
Results and Discussion
As in Experiment 1, response time and accuracy were recorded and used for separate
ANOVAs. One item (ci3 ‘this’) was excluded from the analyses because all 10 participants receiving List 4 responded incorrectly to this item.
The bottom panel of Figure 1 shows the average response times to targets
preceded by the four types of primes. One-way repeated measure ANOVAs revealed
a main effect of prime type: F 1 (3, 39) = 15.88, p <.0001; F 2 (3, 78) = 20.29, p <.0001.
Pair-wise means comparisons showed significant facilitation (108 ms) by the ST
primes compared to the unrelated baseline (p =.0001 by subjects and by items). As
noted earlier, this repetition priming was expected. For S primes, which shared only
segmental structure with the targets, no significant facilitation was found. With the
50ms ISI, primes with a mismatch in tone still failed to facilitate target response. For T
primes, which shared only tone with targets, no reliable priming was found either.
In addition to the priming effect, multiple comparisons also revealed reliable
differences among the ST, S, and T conditions. In particular, ST was 82ms faster than
S (p <.0001 by subjects and by items) and 133 ms faster than T (p <.0001 by subjects
and by items) S was 51ms faster than T (p <.05 by subjects, p <.005 by items). As noted,
this indicates that the listeners were sensitive to the phonological difference among
the three types of primes at 50 ms ISI.
Participants in this experiment also made few errors and there was no indication
of any speed-accuracy tradeoff. The average numbers of errors were 1.4 (ST), 1.7 (S),
2.9 (T), and 2.2 (UR) out of 20. One-way repeated measures ANOVAs revealed a
main effect of prime type: F 1 (3, 39) = 6.15, p <.001; F 2 (3, 78) = 8.29, p <.0001. Pair-wise
comparisons showed significant differences between ST and UR (p <.05 by subjects,
p <.005 by items), ST and T (p <.0001 by subjects and by items), and S and T (p <.005
by subjects, p <.001 by items).
In sum, the pattern of results from this experiment is identical to that obtained
from Experiment 1. The only reliable effect was repetition priming. There was no
evidence that tonal or segmental overlap alone could facilitate target response despite
the fact that listeners were sensitive to the phonological difference between the ST, S,
and T primes. However, the finding that minimal tone pair primes, despite a complete
segmental overlap, still failed to activate the target is consistent with findings of
Cutler and Otake (1999), Cutler, and Donselaar (2001), and Cooper et al. (2002), who
interpreted the lack of priming as listeners’ use of accentual or stress information to
inhibit activation of the incompatible member of the accentual or stress pair. Following
the same logic, the absence of priming could be interpreted as listeners’ use of tonal
information to inhibit activation of the incompatible member of the tone pair.
Language and Speech
C.-Y. Lee
111
Nonetheless, the question remains whether tonal or segmental overlap was truly
unable to facilitate target response, or the priming effect could not be detected by
the direct priming paradigm. As noted in the introduction, behavioral measures that
involve explicit response may invoke participants’ response strategies (Schirmer et
al., 2005). Direct priming could be particularly susceptible to such strategies also
because the prime-target relationships may be transparent to the participants. Mediated
priming, while still requiring participants’ behavioral response, may be less susceptible
to response strategies arising from the detection of the prime-target relationships.
Experiment 3 and 4 further explored form priming with the mediated priming task.
4 Experiment 3
In Experiments 3 and 4, the mediated priming paradigm (Zwitserlood, 1996) was used
to further evaluate the results from Experiments 1 and 2. In this paradigm, the prime
(e.g., streak) and target (e.g., road) are not directly related, but the prime is form-related
to a third word (e.g., street) that is not actually presented, but is semantically related
to the target. If there is facilitation of the target road by the prime streak, the effect
is most likely mediated by the third word street (not actually presented) since there is
no direct relationship between the prime and target. By the same token, if ma3 ‘horse’
primes fu4qin0 ‘father’, the effect is most likely mediated by ma1 ‘mother’ based on
the phonological overlap between ma3 and ma1 and the semantic association between
ma1 ‘mother’ and fu4qin0 ‘father’, even though the mediating word ma1 ‘mother’ is
not actually presented. Since the prime and target were not directly related in form,
mediated priming may be less susceptible to participants’ response strategies. As in
Experiments 1 and 2, the 250 ms ISI was used in Experiment 3 and the 50 ms ISI in
Experiment 4 to evaluate the time course of tone processing in priming.
4.1
Method
4.1.1
Participants
Forty graduate students at Ohio University (age: 22 – 30 years; gender: 19 male, 21
female) participated in the experiment. None of them had participated in the previous
experiments.
4.1.2
Materials
The same set of primes used in Experiments 1 and 2 served as primes for the current
experiment. In addition, 80 disyllabic Mandarin words, which were semantically
related to the ST primes, were selected as targets. Therefore, for each target, there
were four types of primes: ST primes were semantically related to the target (e.g.,
lou2 ‘hall’ — jian4zhu0 ‘building’); S primes shared segmental structure with ST
primes but were not related to targets (e.g., lou3 ‘hug’ — jian4zhu0 ‘building’); T
primes shared tone with ST primes but were not related to targets (e.g., cang2
Language and Speech
112
Lexical tone in form priming
‘hide’ — jian4zhu0 ‘building’); UR primes were not related to the ST primes or the
targets (e.g., pan1 ‘climb’ — jian4zhu0 ‘building’). In addition to the word targets, 80
disyllabic, pronounceable nonwords were generated by combining the first syllable
of the word targets and the monosyllabic nonword targets used in Experiments 1
and 2. The nonwords were paired with the same set of primes and served as fillers.
The complete set of stimuli is listed in Appendix B.
As in Experiments 1 and 2, prime type was intended as a within-subject factor
and participants were not to hear the same stimulus item more than once during the
experiment. For these purposes, four stimulus lists were constructed. For a given
target, each of the four primes was assigned to a different list such that each list would
include all four prime types without repeating any stimulus items. Therefore, each
list included all 160 targets (80 words and 80 nonwords) with the four types of primes
equally distributed. Each participant was randomly selected to be tested on one list
only. The ISI between the prime and target was 250 ms for a direct comparison with
Experiment 1. The intertrial interval remained three seconds.
4.1.3
Procedure
The disyllabic word and nonword targets were recorded by the same speaker used in
Experiment 1. The procedures for recording, digitizing, sound file processing, item
randomization, and subject assignment and testing were also identical to those in
Experiment 1.
4.2
Results and Discussion
The top panel of Figure 2 shows the mean response times to targets preceded by the
four types of primes. One-way repeated measure ANOVAs revealed a main effect
of prime type: F 1 (3, 39) = 22.64, p <.0001; F 2 (3, 79) = 12.16, p <.0001. Pair-wise means
comparisons showed significant facilitation (49 ms) by the ST primes (p <.0001 by
subjects and by items). Since the ST primes were semantically related to the targets,
the facilitation based on semantic association was expected. There was no other
priming effect. The average response times in the S, T, and UR conditions were
virtually identical: No statistically significant differences were found in other pairwise mean comparisons.
Compared to Experiments 1 and 2, participants made even fewer errors in this
experiment. The average numbers of errors were 0.4 (ST), 0.7 (S), 1.0 (T) and 1.1 (UR)
out of 20. One-way repeated measures ANOVAs revealed a main effect of prime
type: F 1 (3, 39) = 5.39, p <.005; F 2 (3, 78) = 5.1, p <.005. Pair-wise comparisons showed
significant differences between ST and UR (p <.01 by subjects and by items) and
between ST and T (p <.05 by subjects and by items).
In sum, the only priming effect obtained in this experiment was semantic priming
by the ST primes. There was no evidence that the S or T primes produced facilitation mediated by the ST primes through semantic association with the targets. As
in Experiments 1 and 2, the absence of priming from S primes suggests that tonal
Language and Speech
C.-Y. Lee
113
Response Time (ms)
minimal pairs were not activated. That is, the S primes did not activate the mediating
word, which in turn failed to activate the target. However, it was possible that the
250ms ISI was too long for the priming effect to emerge due to the transient nature
of lexical activation. The similarity in results between Experiments 1 and 2 seem to
argue against this possibility. Nonetheless, it was an empirical question of whether
ISI would have a different effect on the mediated priming task. In particular, lexical
decision in mediated priming is not made on a target that is a direct competitor with
the prime. Evidence of lexical activation may be able to emerge at a shorter ISI when
residual activation is still effective and when the prime is not a direct competitor of
the target. To evaluate this possibility, Experiment 4 was conducted with the same
set of materials used in Experiment 3 but with a shorter ISI (50ms).
1150
Figure 2
1100
Top panel:
Mean lexical decision response
times (in milliseconds) with standard error bars to targets preceded
by primes overlapping with ST
primes in segmental structure
and tone (ST), segmental structure only (S), tone only (T),
and neither segmental structure
nor tone (UR) in Experiment 3
(250 ms ISI)
1050
1000
950
900
850
800
Response Time (ms)
ST
S
T
UR
1150
Bottom panel:
1100
Mean lexical decision response
times in Experiment 4 (50 ms ISI)
1050
1000
950
900
850
800
5 Experiment 4
The purpose of Experiment 4 was to examine whether any mediated priming from the S
or T primes could be detected with a shorter ISI. The experimental setup was identical
to that of Experiment 3 except the ISI was 50 ms in the current experiment.
Language and Speech
114
Lexical tone in form priming
5.1
Method
5.1.1
Participants
Forty graduate students at Brown University (age: 22 – 30 years; gender: 22 male, 18
female) participated in the experiment. Twenty-six of the participants also participated
in Experiment 2, which took place two months before Experiment 4.
5.1.2
Materials
The materials were identical to those used in Experiment 3 except that the ISI used
for this experiment was 50 ms.
5.1.3
Procedure
The procedure was identical to that of Experiment 3 except that no additional
recording was made.
5.2
Results and Discussion
The bottom panel of Figure 2 shows the mean response times to targets preceded
by the four types of primes. One-way repeated measure ANOVAs revealed a main
effect of prime type: F 1 (3, 39) = 18.69, p <.0001; F 2 (3, 79) = 13.65, p <.0001. Pair-wise
means comparisons showed significant facilitation (51 ms) by ST primes (p <.0001 by
subjects and by items). Since the ST primes were semantically related to the targets, the
facilitation based on semantic association was expected. In addition to the semantic
priming, there was also significant facilitation (18 ms) by S primes (p <.05 by subjects
and p =.05 by items), which was not found in previous experiments.
In addition to the priming effects, multiple comparisons showed a number of
reliable differences among the ST, S, and T conditions. In particular, ST was 34 ms
faster than S (p <.0001 by subjects and by items) and 53 ms faster than T (p <.0001 by
subjects and by items) S was 19 ms faster than T (p <.05 by subjects, nonsignificant by
items). Again, these results suggest that the listeners were sensitive to the phonological
difference among the three types of primes at this ISI.
Participants made few errors in this experiment. The average numbers of errors
were 0.6 (ST), 1.0 (S), 1.0 (T) and 1.1 (UR) out of 20. One-way repeated measures
ANOVAs revealed a main effect of prime type, significant only by subjects,
F 1 (3, 39) = 2.7, p <.05. Pair-wise comparisons showed significant difference between
ST and UR (p =.01) and between ST and T (p <.05).
In sum, semantic priming remained evident at 50 ms ISI, as was found in
Experiment 3. However, contrary to the previous three experiments where no other
priming effects were found, reliable facilitation by S primes was found in this experiment with the shorter ISI. The S primes were minimal tone pairs of ST primes, which
were semantically related to the target. Since the S primes and targets were not
Language and Speech
C.-Y. Lee
115
directly related in form or meaning, the priming effect was most likely mediated by
the ST primes, which differed from the S primes only in tone. This suggests that the
S primes did activate the ST primes, which in turn activated the targets to produce
the facilitation.
6 General Discussion
This study aimed to explore the role of lexical tone in spoken word recognition by
investigating the effects of segmental and tonal overlap in form priming. Facilitation
in direct priming was found when the prime and target were identical in both tonal
and segmental structure. Similarly, facilitation in mediated priming was found when
the prime and the mediating word were identical. Furthermore, mediated facilitation
was also found when the prime and the mediating word were different in tone only
and when the ISI was short (50 ms). Overall, these results showed that a complete
overlap in tonal and segmental structure generated facilitation consistently, but
tonal or segmental overlap alone was not sufficient to generate reliable facilitation
with one exception: the minimal tone pair prime (segmental overlap) did produce
mediated priming at 50 ms ISI. If facilitation reflects target activation, these results
indicate that primes sharing only segmental structure (i.e., tonal mismatch) or tone
(i.e., segmental mismatch) with the target were not able to activate the target except
when the ISI was short. As Zwitserlood (1996) suggested, mediated priming has a
lexical locus because the activation of the mediating word is probed by a semantically related target. In other words, the facilitation found in Experiment 4 indicates
that lou3 ‘hug’ did activate lou2 ‘hall’, which in turn activated its semantic associate
jian4zhu0 ‘building’. However, the facilitation was reliable only at 50 ms ISI, which
indicates the activation was short-lived.
How would these results compare to previous form priming studies examining
the processing of words contrasting in lexical prosody? As noted, similar studies using
the form priming paradigm have consistently found that primes with a mismatch in
lexical prosody did not produce priming, which has been interpreted by the authors as
the use of lexical prosody to constrain lexical activation, that is, to reduce the number
of activated candidates (Cooper et al., 2002; Cutler & Donselaar, 2001; Cutler & Otake,
1999; Soto-Faraco, Sebastian-Galles, & Cutler, 2001). In other words, these minimal
prosodic pairs are not treated as homophones in lexical processing (cf. Cutler, 1986).
Under this view, the “failure” of minimal tone pairs (i.e., S primes) to facilitate target
response in Experiments 1, 2, and 3 can be interpreted as listeners’ active use of tonal
information to constrain lexical activation. That is, tonal information allowed listeners
to rule out tonally incompatible candidates in the early phase of lexical activation.
If listeners failed to use tonal information to constrain lexical activation, according
to this reasoning, minimal tone pairs would be treated as homophones and would
produce facilitatory priming.
Facilitatory priming, however, was indeed observed in minimal tone pair condition (i.e., S primes) when the ISI was short, which suggests that a mismatch in tone
did not prevent activation at this ISI (Experiment 4). The use of the two ISIs allowed
the current experiments to evaluate not only the effects of segmental and tonal
overlap, but also the time course of minimal tone pair activation. Taken together,
Language and Speech
116
Lexical tone in form priming
results from the experiments indicate that minimal tone pairs were activated in the
early phase of lexical access based on form overlap, but listeners were able to use
tonal information to quickly rule out the tonally incompatible candidates. Another
evidence for the listeners’ successful use of tonal information is that the response
time to minimal tone pair (S) primes was significantly slower compared to identity
(ST) primes in all four experiments. This result indicates that the minimal tone pair
primes (S) were not treated as homophones to the segmentally identical ST primes
or targets (cf. Cutler, 1986).
If minimal tone pairs were indeed activated early, why did we not observe
statistically reliable evidence of such activation in Experiment 2, where the ISI was
also 50 ms? Factors specific to the direct priming task may have played a role. In
particular, lexical decision was made only after a target word was recognized. For
a word decision to be made, all other activated competitors need to be ruled out.
Under current models of spoken word recognition such as TRACE (McClelland &
Elman, 1986), Cohort (Marslen-Wilson, 1987), and the Neighborhood Activation
model (Luce & Pisoni, 1998), a minimal tone pair prime should be a strong competitor
with the target based on their form similarity. When a lexical decision was made
on a target preceded by a prime that was a minimal tone pair (Experiment 2), the
decision was likely to be slowed down due to direct competition with the prime. In
fact, the literature on segmental processing has shown inhibitory priming by word
primes sharing more than one or two segmental phonemes with word targets, and
the inhibitory priming is normally taken as evidence for lexical competition between
phonologically similar words (Goldinger, 1999; Hamburger & Slowiaczek, 1996;
Monsell & Hirsh, 1998; Radeau, Morais, & Dewier, 1989; Slowiaczek & Hamburger,
1992). The minimal tone pair prime in Experiment 2 could have activated the target,
but the activation was inhibited by direct competition with the target, on which the
lexical decision was made. In mediated priming, lexical decision was not made on a
target directly competing with the minimal tone prime and the form-based facilitation
was able to be detected. This possibility, however, needs to be evaluated by further
research designed to examine the role of tone in lexical competition.
The use of a different tone language also allowed a comparison with previous
tone priming studies. Experiment 1 in the current study was in many ways analogous
to Yip (2001). Both Cantonese and Mandarin have a syllable-based tone system. Both
experiments examined direct priming in monosyllabic words. Both used the ISI of
250 ms. However, in contrast to Yip (2001), no evidence of minimal tone priming was
found in the present study. A number of factors could be the source of the contrasting
results. Cantonese has more tones (6) than Mandarin (4). The proportion of related
trials was lower (50%) in Yip (2001) compared to the current study (75%). Yip (2001)
used the shadowing task while the current study used the lexical decision task. With
regard to the number of tones, tonal contrasts may be harder to perceive in Cantonese
because five of the six tones in Cantonese have a similar onset on the F 0 scale and
have similar F 0 contours, whereas the four tones in Mandarin have fairly distinct
F 0 contours (with the exception between the rising and the low tone). Cutler and
Chen (1997) found a difference in response accuracy between perceptually easy and
hard tonal contrasts in Cantonese. It is not clear if the word pairs used in Yip (2001)
involved perceptually hard tone contrasts. If they did, the listeners might not be able
Language and Speech
C.-Y. Lee
117
to distinguish minimal tone pairs as effectively as the Mandarin counterparts, which
could result in minimal tone pair priming.
The difference in the proportion of related trials was unlikely to be the source of
contrast between Yip (2001) and the current study. In particular, Yip (2001) intended to
replicate Yip et al. (1998) with a lower proportion of related trials in order to evaluate
the extent to which response strategies contribute to facilitatory priming (Goldinger,
Luce, Pisoni, & Marcario, 1992). If the facilitatory priming in Yip et al. (1998) were
due to response strategies developed from detecting the prime-target relationships,
a lower proportion should have also reduced or removed the facilitation. On the
contrary, Yip (2001) showed that not only the facilitation by the minimal tone pair
(onset+rhyme primes) remained, there was also reliable facilitation by the rhyme +tone
prime. Moreover, the proportion of related trials in the current study was even higher
than that in Yip (2001). If facilitation were due to the strategies, facilitatory priming
would have been evident in the present study as well.
That having been said, response strategies due to general task demands could be
an issue for studies using behavioral measures (Schirmer et al., 2005). In particular,
all behavioral measures involve explicit responses by listeners, which may invoke
postaccess processes that reflect task demands in addition to the activation-competition processes. In the segmental literature on form priming, facilitation has been
consistently observed in studies using the shadowing task, in which listeners are
asked to repeat the target word (Goldinger, 1999; Hamburger & Slowiaczek, 1996;
Slowiaczek & Hamburger, 1992). However, it has been pointed out that the facilitation
may reflect a task-specific strategy (Goldinger, 1999; Hamburger & Slowiaczek, 1999;
Norris, McQueen, & Cutler, 2002). That is, listeners seem to learn that prime and
target often start with the same segments and are more ready to repeat the target word.
Schirmer et al. (2005) showed comparable results for segmental and tonal violations
in the ERP and reaction time measures but not in the behavioral accuracy measure.
One way to avoid such task-induced response strategies is to use temporally sensitive
electrophysiological measures such as ERP (Schirmer et al., 2005).
At least three issues remain to be examined regarding the role of lexical tone in
form priming. First, what is the relative contribution of tone and segmental structure
in form priming? The prime and target in the current study were either maximally
similar (S prime) or maximally distinct (T prime) in segmental structure. It is not
known whether the priming effect would vary as a function of the degree of overlap
(e.g., overlap in consonant+tone or vowel+tone). Cutler and Chen (1995) compared the
effects of tone versus rhyme overlap in disyllabic words, but further studies following
Cutler and Chen’s (1997) systematic manipulation of tone and segmental mismatch
are needed to evaluate the individual and joint contribution of segmental structure
and tone in form priming.
The second issue concerns the role of tonal similarity in form priming. Cutler and
Chen (1997) and Ye and Connine (1999) showed that performance in same-different
judgment, lexical decision, and tone-phoneme monitoring all varied as a function
of the perceptual similarity between tones. This finding was taken as evidence for
the graded nature of lexical activation based on similarity between the input and
lexical representation (Ye & Connine, 1999). If the mapping from the acoustic signal
Language and Speech
118
Lexical tone in form priming
is graded, lexical activation should also vary as a function of tonal similarity. In
particular, tone perception studies have consistently shown that the rising tone and
the low tone in Mandarin are more confusing than other tonal contrasts (Gandour,
1983; Kiriloff, 1969) presumably because of their acoustic similarity (i.e., the two tones
share a similar onset on the F 0 scale and have similar F 0 contours). It is of interest to
see if tonal similarity plays a role in form priming as well.
Finally, although tone in isolated syllables appears to be processed more slowly
and less accurately compared to consonant and vowel (Cutler & Chen, 1997; Ye &
Connine, 1999), Ye and Connine (1999) also found that the disadvantage of tone
disappeared when the tone-phoneme target to be monitored was embedded in an
idiom in Mandarin. It seems that the disadvantage of tone in perceptual processing
was neutralized by the presence of a semantic context. Furthermore, Schirmer et al.
(2005) found that semantic incongruity with a sentential context generated comparable N400-like negativity between segmental and tonal modifications, suggesting
that sentence context may be used to constrain lexical activation. Taken together,
the later arrival of tonal information in isolated syllables may not hinder the relative
usefulness of tone in a more realistic processing environment after all.
The conclusion to be drawn from the current study is that auditory presentation of a Mandarin word activates its minimal tone pairs in the early phase of
lexical activation, but tonal information is used soon afterwards to resolve segmental
ambiguity, ruling out candidates that are mismatching in tone. Evidence from the
present experiments suggests that ‘horse’ did activate ‘mother’, but Mandarin listeners
quickly figured out that ‘horse’ was not ‘mother’ after all. Listeners are able to do so
by utilizing tonal information on-line to disambiguate the minimal tone pairs.
manuscript received:
02. 16. 2005
manuscript accepted: 03. 14. 2006
References
ANDRUSKI, J. E., BLUMSTEIN, S. E., & BURTON, M. (1994). The effect of subphonetic
differences on lexical access. Cognition, 52, 163 – 187.
CHEN, H.-C., & CUTLER, A. (1996). Auditory priming in spoken and printed word recognition. In H.-C. Chen (Ed.), Cognitive processing of Chinese and related Asian languages
(pp. 77 – 81). Hong Kong: Chinese University Press.
CHING, Y. C. (1985). Lipreading Cantonese with voice pitch. Journal of the Acoustical Society
of America, 77, S39 – 40.
CONNINE, C. M., CLIFTON, C., Jr., & CUTLER, A. (1987). Effects of lexical stress on phonetic
categorization. Phonetica, 44, 133 –146.
COOPER, N., CUTLER, A., & WALES, R. (2002). Constraints of lexical stress on lexical
access in English: Evidence from native and non-native listeners. Language and Speech,
45, 207 – 228.
CUTLER, A. (1986). Forbear is a homophone: Lexical prosody does not constrain lexical access.
Language and Speech, 29, 201 – 220.
CUTLER, A. (1997). The comparative perspective on spoken-language processing. Speech
Communication, 21, 3 – 15.
Language and Speech
C.-Y. Lee
119
CUTLER, A., & CHEN, H.-C. (1995). Phonological similarity effects in Cantonese word
recognition. Proceedings of the 13th international Congress of Phonetic Sciences, 106 – 109.
Stockholm.
CUTLER, A., & CHEN, H.-C. (1997). Lexical tone in Cantonese spoken word processing.
Perception and Psychophysics, 59, 165 – 179.
CUTLER, A., DAHAN, D., & van DONSELAAR, W. (1997). Prosody in the comprehension
of spoken language: A literature review. Language and Speech, 40, 141 – 201.
CUTLER, A., & van DONSELAAR, W. (2001). Voornaam is not a homophone: Lexical prosody
and lexical access in Dutch. Language and Speech, 44, 171 – 195.
CUTLER, A., & OTAKE, T. (1999). Pitch accent in spoken-word recognition in Japanese. Journal
of the Acoustical Society of America, 105, 1977 – 1988.
FOX, R. A., & UNKEFER, J. (1985). The effect of lexical status on the perception of tone.
Journal of Chinese Linguistics, 13, 69 – 90.
FRAUENFELDER, U. H. (1996). Computational models of spoken word recognition. In T.
Dijkstra & K. de Smedt (Eds.), Computational psycholinguistics: AI and connectionist models
of human language processing (pp. 114 – 138). London: Taylor and Francis.
GANDOUR, J. (1983). Tone perception in far eastern languages. Journal of Phonetics, 11,
149 – 175.
GANONG, W. F. (1980). Phonetic categorization in auditory word perception. Journal of
Experimental Psychology: Human Perception and Performance, 6, 110 – 125.
GOLDINGER, S. D. (1999). Only the Shadower knows: Comment on Hamburger and Slowiaczek
(1996). Psychonomic Bulletin and Review, 6, 347 – 351.
GOLDINGER, S. D., LUCE, P. A., PISONI, D. B., & MARCARIO, J. K. (1992). Formbased priming in spoken word recognition: The roles of competition and bias. Journal of
Experimental Psychology: Leaning, Memory, and Cognition, 18, 1211 – 1238.
HAMBURGER, M., & SLOWIACZEK, L. M. (1996). Phonological priming reflects lexical
competition. Psychonomic Bulletin and Review, 3, 520 – 525.
HAMBURGER, M., & SLOWIACZEK, L. M. (1999). On the role of bias in dissociated phonological priming. Psychonomic Bulletin and Review, 6, 352 – 355.
KIRILOFF, C. (1969). On the auditory perception of tones in Mandarin. Phonetica, 20,
63 – 67.
LIVELY, S. E., PISONI, D. B., & GOLDINGER, S. D. (1994). Spoken word recognition. Research
and theory. In M. A. Gernsbacher (Ed.), Handbook of psycholinguistics (pp.265 – 301). San
Diego: Academic Press.
LUCE, P. A., & PISONI, D. B. (1998). Recognizing spoken words: The Neighborhood Activation
model. Ear and Hearing, 19, 1 – 36.
MARSLEN-WILSON, W. D. (1987). Functional parallelism in spoken word recognition.
Cognition, 25, 71 – 102.
McCLELLAND, J. L., & ELMAN, J. L. (1986). The TRACE model of speech perception.
Cognitive Psychology, 18, 1 – 86.
McQUEEN, J. M. (1991). The influence of the lexicon on phonetic categorization: Stimulus
quality in word-final ambiguity. Journal of Experimental Psychology: Human Perception
and Performance, 17, 433 – 443.
McQUEEN, J. M., DAHAN, D., & CUTLER, A. (2003). Continuity and gradedness in speech
processing. In N. O. Schiller & A. S. Meyer (Eds.), Phonetics and phonology in language
comprehension and production: Differences and similarities (pp.39 – 78). Berlin: Mouton de
Gruyter.
MERTUS, J. A. (2000). The Brown Lab Interactive Speech System (BLISS). Providence: Brown
University.
MONSELL, S., & HIRSH, K. W. (1998). Competitor priming in spoken word recognition. Journal
of Experimental Psychology: Learning, Memory, and Cognition, 24, 1495 – 1520.
Language and Speech
120
Lexical tone in form priming
NORRIS, D. G. (1994). SHORTLIST: A connectionist model of continuous speech recognition.
Cognition, 52, 189 – 234.
NORRIS, D., McQUEEN, J. M., & CUTLER, A. (2002). Bias effects in facilitatory phonological
priming. Memory & Cognition, 30, 399 – 411.
PACKARD, J. L. (2000). The morphology of Chinese: A linguistic and cognitive approach.
Cambridge: Cambridge University Press.
RADEAU, M., MORAIS, J., & DEWIER, A. (1989). Phonological priming in spoken word
recognition: Task effects. Memory & Cognition, 17, 525 – 535.
RADEAU, M., MORAIS, J., & SEGUI, J. (1995). Phonological priming between monosyllabic
spoken words. Journal of Experimental Psychology: Human Perception and Performance,
21, 1297 – 1311.
SCHIRMER, A., TANG, S.-L., PENNEY, T. B., GUNTER, T. C., & CHEN, H.-C. (2005). Brain
responses to segmentally and tonally induced semantic violations in Cantonese. Journal of
Cognitive Neuroscience, 17, 1 – 12.
SLOWIACZEK, L. M., & HAMBURGER, M. (1992). Prelexical facilitation and lexical interference in auditory word recognition. Journal of Experimental Psychology: Learning, Memory,
and Cognition, 18, 1239 – 1250.
SLOWIACZEK, L. M., & PISONI, D. B. (1986). Effects of phonological similarity on priming
in auditory lexical decision. Memory & Cognition, 14, 230 – 237.
SOTO-FARACO, S., SEBASTIAN-GALLES, N., & CUTLER, A. (2001). Segmental and suprasegmental mismatch in lexical access. Journal of Memory and Language, 45, 412 – 432.
UTMAN, J. A., BLUMSTEIN, S. E., & SULLIVAN, K. (2001). Mapping from sound to meaning:
Reduced lexical activation in Broca’s aphasics. Brain and Language, 79, 444 – 472.
WANG, H. (1986). A frequency dictionary of Modern Chinese. Beijing: Beijing Language Institute
Press.
YE, Y., & CONNINE, C. M. (1999). Processing spoken Chinese: The role of tone information.
Language and Cognitive Processes, 14, 609 – 630.
YIP, M., LEUNG, P.-Y., & CHEN, H.-C. (1998). Phonological similarity effects in Cantonese
spoken-word processing. Proceedings of ICSLP’98, 5, 2139 – 2142. Sydney, Australia.
YIP, M. (2001). Phonological priming in Cantonese spoken-word processing. Psychologia, 44,
223 – 229.
ZHOU, X., & MARSLEN-WILSON, W. (1994). Words, morphemes and syllables in the Chinese
mental lexicon. Language and Cognitive Processes, 9, 393 – 422.
ZHOU, X., & MARSLEN-WILSON, W. (1995). Morphological structure in the Chinese mental
lexicon. Language and Cognitive Processes, 10, 545 – 600.
ZWITSERLOOD, P. (1996). Form priming. Language and Cognitive Processes, 11, 589 – 596.
Language and Speech
C.-Y. Lee
121
Appendices
A. Stimuli used in Experiments 1 and 2, transcribed in Chinese characters and the Pinyin
Romanization system.
Targets
Word
wai
ceng
lun
chuang
heng
shui
lou
leng
mie
zhe
zuo
niu
qia
cao
zeng
yong
ai
zang
chan
rao
peng
wen
cai
mou
hai
rang
che
lu
mai
shai
san
min
zhou
chou
ran
tun
4
4
4
3
1
3
2
3
4
2
3
3
4
3
1
4
3
1
1
2
4
4
1
3
3
3
1
2
2
4
1
2
2
4
3
1
Nonword
外
ken 4
蹭
lia 4
论
fo 4
闯
sai 3
哼 mang 1
水
nen 3
楼
wai 2
冷
mie 3
灭 qiong 4
折
cuo 2
左
run 3
扭
miu 3
恰
neng 4
草
ruo 3
增
nu 1
用
zei 4
矮
lue 3
髒
jiong 1
搀
rong 1
饶 shuan 2
碰
gei 4
问
de 4
猜
nuan 1
某
nue 3
海
en 3
嚷 zhuang 3
車
min 1
炉
yue 2
埋
nie 2
晒
qun 4
三
fou 1
民
zang 2
轴
suan 2
臭
dei 4
染
te 3
吞
mou 1
Primes
ST
wai
ceng
lun
chuang
heng
shui
lou
leng
mie
zhe
zuo
niu
qia
cao
zeng
yong
ai
zang
chan
rao
peng
wen
cai
mou
hai
rang
che
lu
mai
shai
san
min
zhou
chou
ran
tun
S
4
4
4
3
1
3
2
3
4
2
3
3
4
3
1
4
3
1
1
2
4
4
1
3
3
3
1
2
2
4
1
2
2
4
3
1
外
蹭
论
闯
哼
水
楼
冷
灭
折
左
扭
恰
草
增
用
矮
髒
搀
饶
碰
问
猜
某
海
嚷
車
炉
埋
晒
三
民
轴
臭
染
吞
wai
ceng
lun
chuang
heng
shui
lou
leng
mie
zhe
zuo
niu
qia
cao
zeng
yong
ai
zang
chan
rao
peng
wen
cai
mou
hai
rang
che
lu
mai
shai
san
min
zhou
chou
ran
Tun
1
1
1
2
4
2
3
2
1
3
2
2
1
2
4
1
2
4
4
3
1
1
4
2
2
2
4
3
3
1
4
3
3
1
2
4
T
UR
歪
leng 4 愣
噌
rao 4 绕
抡
nei 4 内
床
zen 3 怎
横
tai 1 胎
谁
an 3 俺
搂
cang 2 藏
棱
qia 3 卡
咩 chuang 4 创
者
nin 2 您
昨
gai 3 改
牛
qing 3 请
掐
nong 4 弄
槽
ying 3 影
赠
suan 1 酸
拥
te 4 特
癌
duan 3 短
葬
kou 1 抠
颤
gu 1 孤
扰
xiang 2 降
砰
kuo 4 阔
温
mei 4 妹
菜
shuo 1 说
谋
ruan 3 软
还
dong 3 懂
瓤
bei 3 北
撤
zuan 1 钻
橹
pin 2 贫
买
ye 2 爷
筛
zong 4 纵
散
que 1 缺
抿
po 2 婆
肘
beng 2 甭
抽
geng 4 更
燃
ku 3 苦
褪
dao 1 刀
jiong
tiao
pie
liu
wa
cheng
pan
sao
kuang
nai
nen
hen
nu:
sun
tian
lia
yun
hu
hou
lang
tan
zha
ren
bai
kun
cou
qian
sa
suo
hong
re
kua
ben
man
tou
sha
3
3
3
1
2
4
1
1
2
4
4
4
3
1
3
3
1
3
3
4
3
3
3
1
4
4
3
4
1
3
3
1
1
3
1
3
窘
挑
撇
溜
娃
秤
攀
搔
狂
耐
嫩
恨
女
孙
舔
俩
晕
虎
吼
浪
毯
砟
忍
掰
困
凑
浅
卅
缩
哄
惹
夸
奔
满
偷
傻
Language and Speech
122
pin
zan
meng
nie
shuai
pao
reng
za
guo
fa
tie
kang
nu
chun
tui
luan
er
yuan
ao
tao
can
bi
bie
kuan
niang
nao
mo
ru
cun
pen
quan
huan
mao
piao
lei
sui
xue
kan
song
ci
guang
han
niao
Lexical tone in form priming
1
2
4
1
3
3
1
1
1
3
1
2
4
1
1
3
3
3
2
1
3
1
1
3
2
4
1
4
4
1
4
3
1
4
2
1
4
3
4
3
3
3
3
拼
咱
梦
捏
甩
跑
扔
咂
锅
法
贴
扛
怒
春
推
卵
耳
远
熬
掏
惨
逼
鳖
款
娘
闹
摸
入
寸
喷
劝
缓
猫
票
雷
虽
血
砍
送
此
广
喊
鸟
guai 3 拐
ran
le
nin
ruan
se
die
kao
nuo
niao
xiong
rui
ka
diu
kuai
gun
teng
niang
zhei
zun
rou
huai
shun
nei
chai
zhun
kei
lie
ping
cang
nai
ca
kuo
re
keng
pou
lai
sou
chui
zhuo
ce
ze
cen
nong
1
2
4
1
3
3
1
1
1
3
1
2
4
1
1
3
3
3
2
1
3
1
1
3
2
4
1
4
4
1
4
3
1
4
2
1
4
3
4
3
3
3
3
tuan 3
Language and Speech
pin
zan
meng
nie
shuai
pao
reng
za
guo
fa
tie
kang
nu
chun
tui
luan
er
yuan
ao
tao
can
bi
bie
kuan
niang
nao
mo
ru
cun
pen
quan
huan
mao
piao
lei
sui
xue
kan
song
ci
guang
han
niao
1
2
4
1
3
3
1
1
1
3
1
2
4
1
1
3
3
3
2
1
3
1
1
3
2
4
1
4
4
1
4
3
1
4
2
1
4
3
4
3
3
3
3
拼
咱
梦
捏
甩
跑
扔
咂
锅
法
贴
扛
怒
春
推
卵
耳
远
熬
掏
惨
逼
鳖
款
娘
闹
摸
入
寸
喷
劝
缓
猫
票
雷
虽
血
砍
送
此
广
喊
鸟
guai 3 拐
4
3
1
4
4
1
2
3
2
4
3
1
3
3
3
4
4
1
1
3
1
2
3
1
4
2
3
3
2
2
3
1
3
2
1
2
3
4
3
1
4
1
4
聘
攒
蒙
镍
率
抛
仍
咋
国
发
铁
糠
努
蠢
腿
乱
二
冤
凹
讨
餐
鼻
瘪
宽
酿
挠
抹
乳
存
盆
犬
欢
卯
瓢
勒
随
雪
看
耸
呲
逛
酣
尿
guai 1
乖
pin
Zan
Meng
nie
shuai
pao
reng
za
guo
fa
tie
kang
nu
chun
tui
luan
er
yuan
ao
tao
can
bi
bie
kuan
niang
nao
mo
ru
cun
pen
quan
huan
mao
piao
lei
sui
xue
kan
song
ci
guang
han
niao
1
2
4
1
3
3
1
1
1
3
1
2
4
1
1
3
3
3
2
1
3
1
1
3
2
4
1
4
4
1
4
3
1
4
2
1
4
3
4
3
3
3
3
靴
蒙
坏
招
跛
匪
抓
熏
拉
嘴
村
俗
倔
通
桑
体
哪
宿
贼
闷
狗
捞
拴
处
凿
配
摔
袜
告
搭
落
扯
切
愧
挪
擦
让
得
难
转
否
举
妥
she 3
舍
xue
meng
huai
zhao
bo
fei
zhua
xun
la
zui
cun
su
jue
tong
sang
ti
na
xiu
zei
men
gou
lao
shuan
chu
zao
pei
shuai
wa
gao
da
luo
che
qie
kui
nuo
ca
rang
dei
nan
zhuan
fou
ju
tuo
3
1
3
3
1
4
3
2
3
1
2
1
2
2
2
1
4
1
4
3
1
3
2
4
1
3
3
3
2
3
3
1
3
2
4
3
2
4
3
4
4
1
1
卡
揣
窄
宠
葱
俊
我
坟
选
丢
拔
吹
忙
聊
薄
眯
胖
撩
棍
逞
揪
暖
勺
乐
收
损
晃
剐
佛
耍
吮
粗
寝
驴
晃
免
绳
退
喘
润
日
追
僧
rou 4
肉
ka
chuai
zhai
chong
cong
jun
wo
fen
xuan
diu
ba
chui
mang
liao
bao
mi
pang
liao
gun
cheng
jiu
nuan
shao
le
shou
sun
huang
gua
fo
shua
shun
cu
qin
lu:
huang
mian
sheng
tui
chuan
run
ri
zhui
seng
C.-Y. Lee
123
B. Stimuli used in Experiments 3 and 4. Only targets are shown. The primes were identical to
those used in Experiments 1 and 2.
Word
li3mian4
zou3lu4
shuo1hua4
ji2mang2
chang4ge1
ye4ti3
jian4zhu2
wen1nuan3
xiao1shi1
wan1qu1
you4bian1
tiao4wu3
zheng4hao3
lu4se4
jian3shao3
hua1fei4
shen1gao1
gan1jing4
tian1jia1
yuan2liang4
yu4jian4
hui2da2
mi2yu3
ren2wu4
he2liu2
han3jiao4
gong1lu4
chu2fang2
yin3cang2
tai4yang2
shu4zi4
ren2kou3
he2xin1
xiang1wei4
yan2se4
shi2wu4
zu3he2
ta1men2
shui4jiao4
ji3ya1
shou3bi4
Nonword
裡面
走路
說話
急忙
唱歌
液體
建筑
溫暖
消失
弯曲
右边
跳舞
正好
绿色
減少
花费
身高
干净
添加
原谅
遇见
回答
谜语
人物
河流
喊叫
公路
厨房
隐藏
太阳
数字
人口
核心
香味
颜色
食物
组合
他们
睡觉
挤压
手臂
li3ken4
zou3lia4
shuo1fo4
ji1sai3
chang4mang1
ye4nen3
jian4wai2
wen1mie3
xiao1qiong4
wan1cuo2
you4run3
tiao4miu3
zheng4neng4
lu4ruo3
jian3nu1
hua1zei4
shen1lue3
gan1jiong1
tian1rong1
yuan2shuan2
yu4gei4
hui2de4
mi2nuan1
ren2nue3
he2en3
han3zhuang3
gong1min1
chu2yue2
yin3nie2
tai4qun4
shu4fou1
ren2zang2
he2suan2
xiang1dei4
yan2te3
shi2mou1
zu3ran1
ta1le2
shui4nin4
ji3ruan3
shou3se3
Word
zhui1gan3
la1ji1
zui3ba1
chao3cai4
lu4shi1
jiao1shui3
yun4song4
sheng1qi4
dong1tian1
la1che3
jing1zi3
yan3jing1
kao4jin4
nan2shou4
ba2chu1
bei1shang1
qiang2po4
wu1gui1
qian2bi4
fu4qin1
hun4luan4
jie1chu4
chu1lai2
chang2du4
zheng1qi4
shui4fu2
ji2zao4
xiao3gou3
xi4yuan4
xia4yu3
dan4shi4
hong2se4
dao1zi0
juan1zeng4
na4ge0
kuan1chang3
da4sheng1
fei1xiang2
zhuan3wan1
Nonword
追赶
垃圾
嘴巴
炒菜
律师
胶水
运送
生气
冬天
拉扯
精子
眼睛
靠近
难受
拔出
悲伤
强迫
乌龟
钱币
父亲
混乱
接触
出来
长度
蒸气
说服
急躁
小狗
戏院
下雨
但是
红色
刀子
捐赠
那个
宽敞
大声
飞翔
转弯
zhui1die3
la1kao1
zui3nuo1
chao3niao3
lu4xiong3
jiao1rui1
yun4ka2
sheng1diu4
dong1kuai1
la1gun1
jing1teng3
yan3niang3
kao4zhei3
nan2zun2
ba2rou1
bei1huai3
qiang2shun1
wu1nei1
qian2chai3
fu4zhun2
hun4kei4
jie1lie1
lai2ping4
chang2cang4
zheng1nai1
shui4ca4
ji2kuo3
xiao3re1
xi4keng4
xia4pou2
dan4lai1
hong2sou4
dao1chui3
juan1zhuo4
na4ce3
kuan1ze3
da4cen3
fei1nong3
zhuan3tuan3
Language and Speech