JOURNAL OF MEMORY Continuous AND 27,368-381 LANGUAGE and Discontinuous Access in Spoken Word-Recognition: The Role of Derivational Prefixes LORRAINE MRC Language Institute KOMISARJEVSKY and Speech WILLIAM Max-Planck (1988) for Psycholinguistics, Group, TYLER University of Cambridge MARSLEN-WILSON Nijmegen; and MRC Applied Psychology Unit, Cambridge AND JAMES RENTOUL AND PETER HANNEY MRC Language and Speech Group, University of Cambridge Theories of spoken word-recognition differ in the claims they make about continuous and discontinuous mapping of the speech input onto mental representations of lexical form. The research reported here contrasts the claims of a discontinuous, prefix-stripping model, and of a cohort-based continuous access model. In three experiments, using the gating, auditory lexical decision, and auditory naming tasks, we compared recognition-points for prefixed words (such as miscount) and their corresponding free stems (such as count). The results disconfirm the pretix-stripping model, showing that the recognition-point for a prefixed word was determined by the properties of the word as a complete, full form, and not, as a prefu-stripping model requires, by the properties of its stem. 6 1988 Academic PRSS, IIIC. tinuous mapping of the sensory input onto representations of lexical form (Warren & Marslen-Wilson, 1987). Based on the principle of maximal processing efficiency, the model states that each word is recognized at that point, starting at word-onset, at which it becomes uniquely distinguishable from all other words in the language beginning with the same sound sequence. During the recognition process a set of word-candidates is activated on the basis of the initial sensory input. This set includes all of the words in the language beginning with that initial sound sequence (the “word-initial cohort”). Only those wordcandidates which continue to match the incoming sensory input remain activated. This process continues until only a single word-candidate is left matching the input. This point in the word is called the “recognition point. ” This model of sequential access and selection (Marslen-Wilson, 1987) places The mental representation of lexical form constitutes the search space within which the listener conducts the on-line process of lexical access. The research we report here is concerned with the properties of these mental representations and with the manner in which they are accessed over time. It contrasts, in particular, theories of continuous and of discontinuous access, and their associated claims about the representation of lexical form. The theoretical framework for the current research is the “cohort” model of spoken word recognition (Marslen-Wilson, 1987; Marslen-Wilson & Tyler, 1980; Marslen-Wilson & Welsh, 1978). This model claims that spoken words are recognized by a process which involves the conAddress correspondence and requests for reprints to Dr. Lorraine Komisajevsky Tyler, University of Cambridge, Department of Experimental Psychology, Downing Site, Cambridge CB2 3EB. 368 0749-5%x/88 Copyright Ml rights $3.00 0 1988 by Academic Press, Inc. of reproduction in any form reserved. CONTINUOUS AND DISCONTINUOUS strong constraints on the properties of the recognition lexicon; that is, on the properties of the representations of lexical form onto which the speech signal is mapped in the initial stages of lexical access. Specifitally, the cohort model requires what Butterworth (1983) has labeled a full listing hypothesis: that for each word recognized there is a corresponding representation in the mental lexicon of the full acoustic-phonetic form of that word. Unless there is a full listing of lexical forms, the system will not be able to access the full word-initial cohort as the beginning of each new word is heard. These views of representation and access contrast with theories which do not specify strictly sequential access, and where words are not necessarily accessed from their onset, but rather from some sublexical unit-for example, the word’s stem (Taft, Hambiy, and Kinoshita, 1986), or its initial strong syllable (Grosjean & Gee, 1987; Norris & Cutler, 1985). For polysyllabic words, and for morphologically complex words (stems plus affixes), access to the lexicon can only begin when the access unit is encountered. This will lead to discontinuities in lexical access when the stem or strong syllable is not the first syllable of the word. These theories do not require full listing of word-forms in the recognition lexicon. The search space is organized in terms of the access units in question (such as stems or syllables), and the speech input has to be broken down into these units for the purpose of contacting the lexicon. We will refer to these as decomposition hypotheses of access and representation. The competing claims of full-listing and decomposition hypotheses have been exhaustively, though inconclusively, studied in the visual domain (cf. Butterworth, 1983; Henderson, 1985), in research looking at the perception of morphologically complex words. These studies, however, tell us little about the representation and access of lexical form during spoken word recognition. ACCESS 369 Although there has also been some research on the role of morphological factors in the spoken domain (e.g., Fowler, Napps, & Feldman, 1985; Kempley & Morton, 1982; Tyler & Marslen-Wilson, 1986), almost all of this work has looked at the effects of inflectional and derivational suffixes. For these cases, where the affix follows the stem (in words likejumping and happiness), the predictions of continuous and discontinuous access models do not differ. The critical comparison is for prefixed words, where the &ix precedes the stem (in words like unhappy or displease), and where the full-listing hypothesis predicts immediate access and the decomposition hypothesis predicts delayed access. There are only two studies in the current literature looking at the perception of prefixed spoken words. One of them, the study by Jarvella and Meijers (1983) of inflectionally prefixed verb-forms in Dutch, does not provide a clear answer to the questions at issue here. More central to our concerns is the research by Taft et al. (1986) on derivationally prefixed English words, which transposes into the auditory domain the types of theory and experiment that Taft and his colleagues had previously developed for visual word-recognition (e.g., Taft, 1981, 1985; Taft & Forster, 1975). For spoken as well as for written words, Taft et al. (1986) propose a strictly stembased access theory, which requires the listener to strip off any prefixes in order to identify the word’s stem, which is then used to access representations in recognition lexicon. Taft et al. (1986) base this claim on the results of an auditory lexical decision experiment in which subjects heard nonwords consisting of various combinations of real and nonexistent prefixes and stems. In the critical comparisons, real prefixes combined with real stems (as in dejoice) were contrasted with nonprefixes combined with real stems (as in tejoice), and real prefixes combined with nonstems (as in dejouse) were contrasted with non- 370 TYLER prefvtes combined with a nonstem (as in tejouse). Taft et al. (1986) found that response latencies were slower when the nonword began with a real prefix as opposed to a nonprefix, and that this difference was larger when the stem was a real stem. This was interpreted as evidence that the listener strips the prefix off a prefixed word and attempts to accessthe lexicon on the basis of the stem. When the stem is a real stem, it takes the listener longer to decide that the sequence is a nonword than when the stem is not a real stem and therefore cannot access any lexical representation. This result is inconsistent with the continuous access theory and its associated full listing hypothesis. It supports, instead, a discontinuous access theory, together with a view of lexical representation which claims that the recognition lexicon is organized around stems rather than full forms. Taft et al. (1986) argue, in fact, that even bound stems (they give the example of joice from rejoice) are represented in this lexicon, and can be activated as part of a word-initial cohort. There are, however, a number of problems with the Taft et al. (1986) study. In particular, it is likely that the task of making nonword decisions about morphologically complex nonwords will induce processing strategies that are not part of normal listening. In our own research using nonword detection tasks (Marslen-Wilson, 1984), the only departure from strictly sequential processing came when subjects heard nonwords constructed by combining very common prefixes with nonsense stems. Response latencies to six items preceded by the prefix in were 150 ms longer than responses to items not preceded by a prefix. We argued (Marslen-Wilson, 1984) that the correct strategy for listeners to follow in these cases, given the productivity of this prefix in the language, was to wait to see whether the sequence to which the in was attached formed a word or not, rather than to base their response on the ET AL. entire sequence, including the in. This, however, was clearly a secondary strategy, induced by the task and by the absence of a listing in the recognition lexicon of the [in + stem] sequence being heard. Henderson, Wallis, and Knight (1984) have suggested a similar task-induced account of lhft’s evidence for prefix-stripping in the visual domain. In the research reported here, we avoid these problems by looking at the processing of derivationally prefixed real words. The cohort model, with its claims for continuous accessand for the full listing of word-forms in the recognition lexicon, treats the recognition of derivationally prefixed words in the same way as the recognition of morphologically simple word-forms. The prefixed form is processed as an undecomposed sequence, which will be recognized at the point at which it diverges from all other candidates beginning with the same initial sound sequence. Derivationally prefixed words like remind or miscount will be identified at the point at which they diverge from their closest competitors, such as remark or misconduct (as we noted above, this point of divergence is referred to as the recognition point). A stem-based access theory, in contrast, requires that a prefixed word cannot be identified until the stem (the access unit) has been identified. This has two consequences. First, that the presence of the prefix will induce a temporal discontinuity in lexical access, and, second, that the recognition point for the word as a whole will depend on the recognition point for the stem on its own. A word like miscount will be identified, not when the entire sequence diverges from its competitors, but only after the stem (count) has diverged from ifs competitors (words like council). l ’ On the affii-stripping view, there will presumably be additional costs associated with the processes of search, etc., that need to take place after the stem has been identified. These should simply add a constant, on average, to recognition time. CONTINUOUS AND DISCONTINUOUS In Experiment 1 we will use a gating task to determine directly whether the recognition points for prefixed words are controlled by their stems, as a stem-based access theory predicts, or whether, as the cohort model predicts, recognition point is determined by the properties of the word as a whole. In Experiments 2 and 3 we will reexamine these and additional questions, using the auditory lexical decision and auditory naming tasks. EXPERIMENTI This experiment will compare recognition-points for pairs of words such as count and miscount, where the prefixed member of the pair consists of a free (or unbound) morpheme preceded by a prefix, and where the other member of the pair is the same free morpheme standing in isolation. According to a stem-based access theory, the recognition-points for the two words will be highly correlated, since both words are initially accessed via the same representation in the recognition lexicon: the representation of the stem. In fact, the recognition-point for the prefixed member of the pair should fall at the same location in the stem as it does when the stem is heard in isolation. If, for example, the recognition-point for the free morpheme count falls on the /t/, then the prefixed word miscount will also be recognized on or after the /t/. This means that recognition-points for prefixed words can be directly computed by adding the length of the prefix to the recognition-point calculated for the stem heard in isolation. Measuring from word-onset, prefixed words will always be recognized later than their corresponding free stems, because of the delay induced by the presence of the prefix. Note that the nature and the length of this delay necessarily follows from the sequential nature of speech. If word-forms are accessed from their stems, then the nrocessor simnlv _ < has to wait until the full ACCESS 371 auditory stimulus for the prefix has been heard before accessof the stem can begin. The cohort model predicts that there should be no processing dependencies between the two words during lexical access, since each should be recognized on the basis of distinct representations in the recognition lexicon. The recognition-points for the free morphemes in isolation have no necessary implications for the recognitionpoints of the prefixed words containing these stems, since each is recognized on the basis of different word-initial cohorts. The recognition-point for count, for example, falls on the /t/, but for miscount it should fall on the /au/. Nor does the presence of a prefix mean that a prefixed word will always be recognized later than its corresponding isolated stem. Whether there is a relative delay or not depends entirely on the properties of the word-initial cohorts for the two word-forms. To measure recognition-point for these stimulus pairs, we used the gating task (Grosjean, 1980; Tyler & Marslen-Wilson, 1986; Tyler & Wessels, 1983, 1985). This is a task in which listeners hear successive small increments of a word, and at each increment they are asked to state what they think the word is, or is going to be. This gives a precise estimate of how much sensory information listeners need to hear before they can identify a word. These estimates correlate highly with performance in tasks such as word monitoring, which are assumed to tap more directly the processes involved in normal on-line language processing (Marslen-Wilson, 1984). Method Materials. We selected 36 pairs of words, consisting of a free morpheme and a matched prefixed form which incorporated this morpheme as its stem (as in frost-defrost or turn-return). The following criteria constrained the selection of these stimuli: 1. Derivational morphology: Since this experiment was in part intended as a test of the Taft et al. (1986) prefix-stripping hy- 372 TYLER pothesis, we followed their criteria for what constituted a prefixed item-namely that the word should be listed in the Shorter Oxford Dictionary as being derived from a prefix plus a stern.* 2. Theoretical recognition-points: So that the competing hypotheses would make distinct predictions, we needed to select word pairs whose recognition-points were the same on a prefix-stripping view but different on a cohort analysis. We therefore chose word pairs where the separation point of the prefixed form, on the full listing hypothesis, should occur one or two segments earlier in the stem than the separation point of the same stem when heard as a free morpheme. On average, this meant that the theoretical uniqueness points for the prefixed words fell about 90 ms earlier than the uniqueness points for the corresponding free stems. For example, the word rebuild becomes unique at the /I/ whereas its stem build becomes unique at the /d/. These points were determined by searching Chambers Twentieth Century Dictionary and Jones and Gimson’s (1984) pronouncing dictionary. 3. Word frequency: It was not possible to construct a stimulus set that met the other criteria and where the pairs were also fully matched in frequency. We chose, therefore, to construct a list where the frequency (Francis & Kucera, 1982) of the free morpheme was always higher (median frequency = 97) than the frequency of the prefixed form (median frequency = 7.5). The effect of this is to bias the results in favor of the stem-access theory. If frequency affects word-recognition, then higher frequency words should be recognized earlier than lower frequency words. This would have the effect of increasing the differences in the recognition-points of * Note that on the prefix-stripping hypothesis, as transposed to the auditory domain, it does not matter whether the listener is hearing a true prefixed word or not. So long as the first syllable sounds like a prefix, then the system must strip it. ET AL. stems and prefixed words that the stem-access model predicts.3 4. Prefixes: We used a variety of common monosyllabic prefixes with the constraint that no more than six words in the total set contained the same prefix. 5. Stress: Several current theories claim that lexical access is driven by stressed, or strong, syllables and that unstressed, or weak, syllables are not used in initial access (Grosjean & Gee, 1987; Norris & Cutler, 1985). This raises the possibility of a confound with the effects of prefixing. If a prefix is unstressed, then this may be an additional reason-or even the sole reason-for a delay in lexical access. To control for this possibility, we varied the stress pattern of the prefixed words, such that 16 items began with an unstressed prefix (containing a reduced vowel), and 20 began with prefixes containing full vowels, carrying either primary or secondary stress. Design and procedure. The 36 pairs of words were assigned to two lists, such that each list contained only one member of each pair, with an equal number overall of prefixed words and free morphemes. Each list contained a maximum of three words with the same prefix. The prefixed words and stems were pseudorandomly ordered within a list, with the same order of items maintained across lists. These lists, plus five practice items, were recorded by a female speaker of British English. The test words were then digitized at a sampling rate of 20 kHz. The wave-form for each word was displayed on a CRT screen and 50-ms segments were marked off starting from the beginning of the word. The word was then output from the computer in a sequence of fragments, each of 3 It is, in fact, not quite clear what a stem-base access theory predicts here. If initial access is via the same stem representation for both the prefixed form and for the free stem, then it may be the frequency of the free stem that determines the frequency effect. If so, then the differences in frequency between the prefixed full-forms and the free stem would be irrelevant. CONTINUOUS AND DISCONTINUOUS which increased by 50 ms in duration. The first fragment consisted of the first 50 ms of the word, the second consisted of the first 100 ms, and so on until the whole word had been output. A smoothing function was applied at the offset of each fragment to reduce splicing artifacts. The number of fragments required for each word varied according to the total duration of the word, ranging from 7 to 17 gates. There was a 5-s interval between each fragment to give subjects time to write down their response. In preparing the stimuli we measured the length of each word, and, for the prefixed words, the length of their component prefixes and stems. These measurements were made directly from the wave-forms, using auditory information where necessary. We also established, for each stimulus, where the theoretical uniqueness points fell, measured in milliseconds from the word-onset. These theoretical uniqueness points were based on dictionary counts, and the measurement-point was defined as the onset of the segment at which the word became unique. Subjects were tested in groups of four, each seated in a separate carrel. They heard the stimuli over closed-ear headphones and were instructed to write down what word they thought they were hearing after each fragment, together with a rating of their confidence in this judgement. The rating scale ran from 1 to 10, with 1 being labeled as “Pure Guess” and 10 as “Completely Sure.” Each testing session lasted 1 V2hr. Subjects. Thirty subjects (15 on each of the two lists of items) were tested. They were taken from the MRC Language and Speech subject panel and were paid for their participation. Results and Discussion Following the procedures used in other gating studies (e.g., Grosjean, 1980; Tyler & Wessels, 1983, 1985) we calculated recognition-points for each item. The recognition-point is defined as the mean gate at ACCESS 373 which subjects correctly identify the word they are hearing, and assign a confidence rating of 80% or more to their response. For seven items, the subjects’ responses failed to meet the recognition-point criterion, and these items, together with the other member of the pair, were discarded from the analysis .4 All of the analyses we report are based on the remaining 29 pairs of items.5 These items are listed in Appendix A. A stem-based access theory makes three main predictions here: that the presence of a prefix will necessarily cause a delay in the recognition of a prefixed word as compared to its stem heard in isolation; that the recognition-point for the prefixed word can be simply calculated by adding together the duration of the prefix and the recognitionpoint for the stem in isolation; and that there will be a strong correlation between recognition-points for stems heard in isolation and recognition-point for the same stems heard as part of a prefixed word. All three of these predictions are disconfirmed by the results. First, there is no difference in the mean recognition-points for the prefixed words and for their corresponding free stems. The mean recognition-point for the prefixed forms is 434 ms as opposed to 444 ms for the isolated stems. This difference does not approach significance (F2 < 1). The exact relationship in length between a given set of prefixed and free stems is of course arbitrary, reflecting as it does the independent relationships of the free stems and the prefixed 4 The discarded items were primarily cases where the speech editing process had created “click” artifacts at the onset of the stimulus, leading to difficulties in identifying the initial segment of the item. This does not reflect any general degradation of the speech signal by the digitizing and playback procedures used here. J The analyses of variance were carried out on item means only. Mean recognition-points could not be meaningfully computed for subjects, since the critical comparisons involved contrasts between matched pairs of items. 374 TYLER forms to their respective-and quite different-word-initial cohorts. What is important about the result is that it shows that there is no necessary increase in recognition-time associated with the addition of a prefix. Second, as this first result implies, recognition-point for the prefixed words is not the sum of the length of the prefix and the recognition-point for the stem. The mean length of the prefixes was 183 ms. Subtracting this from the total recognitionpoint for the prefixed items, we can estimate that the mean recognition point for these items fell 251 ms after the onset of the stem. This is nearly 200 ms earlier than the mean recognition-point (444 ms) for the isolated stems. This difference cannot be explained in terms of durational differences between the stems in the prefixed words and the same stems in isolation, since they only differ in length by 28 ms. The mean duration of the isolated stems is 548 ms, as opposed to 520 ms for the stems in prefixed words. Third, there is no relationship between recognition-point for stems heard as free morphemes and stems heard in prefixed words. To establish this, we first subtracted the length of each prefix from the overall gating recognition-point for that word. On the stem-based access theory, access only begins after the prefix has been heard, so that variations in the length of the prefix are simply noise as far as the question at issue is concerned. These corrected recognition-points for the prefixed words were then correlated with the recognition-points for the corresponding free stems. The result was a nonsignificant correlation of + .019 (df = 27), indicating that the two sets of recognition-points have essentially no variance in common. The final possibility we investigated was whether the overall effects concealed an interaction with the stress value of the prefix, If syllabic stress does play a role in lexical access, then it is possible that prefixes carrying stress behave differently to unstressed prefixes. We therefore separated ET AL. out the 29 pairs according to the stress value of the prefix. Twelve of the prefixed words began with unstressed syllables containing reduced vowels and 17 began with stressed syllables containing full vowels. The mean recognition-point for items containing unstressed prefixes was earlier (401 ms) than for items containing prefixes with either secondary or primary stress (457 ms).‘j Note that the direction of this effect is the opposite to what is predicted by theories which claim that lexical access is driven by stressed or strong syllables (Grosjean & Gee, 1987; Norris & Cutler, 1985). These theories predict that words containing a stressed prefix should be recognized earlier than those containing an unstressed prefix, because of the delay in contacting the lexicon induced by the unstressed prefix. The difference we found here between stressed and unstressed prefixed words in fact has little to do with the value of stressed or unstressed syllables as access units. Instead, it corresponds almost exactly to the difference in average length (62 ms) between stressed and unstressed prefixes. The stimulus set contained, for example, six items beginning with the prefix re-: the unstressed cases averaged 139 ms in duration, while the stressed cases averaged 201 ms. This increase is primarily due to an increase in the length of the vowel. From the perspective of lexical access and selection, this has the effect of delaying the point in the signal, measuring from wordonset, at which the listener starts to get information about the next segment. This, in turn, has the consequence of delaying the point at which, eventually, the listener receives the discriminatory acoustic-phonetic information necessary to identify the word. To correct for these differences, we sub- 6 On a separate analysis, including only the prefuted items, there was a significant main effect of stress (F2 (2,28) = 4.691). On an overall analysis, including the free stems as well. There was no overall effect of stress, nor a significant interaction between stress and word-type. CONTINUOUS AND DISCONTINUOUS tracted prefix length from recognition-point for each item (overall mean duration was 213 ms for stressed prefixes, and 141 ms for unstressed). This yielded a mean recognition-point for the stressed prefixes of 244 ms from the onset of the stem, and of 260 ms for the unstressed prefixes. This difference did not approach significance (F2 < 1). There is no evidence here to suggest that the stress value of a prefix has any consequences for its status in the access process. In contrast to the problems these results pose for stem-based access theories, they fit very well with a continuous access, fulllisting hypothesis of the cohort type. The prefixed forms and their matched free stems begin with different speech sounds, and are recognized, therefore, on the basis of completely different word-initial cohorts. This is why the recognition-points for the two kinds of items have no variance in common, and why there is no necessity for the recognition-point for a prefixed word to occur later in the word, measuring from word-onset, than the recognitionpoint for its stem heard in isolation. The information provided by the prefix need be no less effective in constraining the left-toright on-line decision-space than the information provided by any other type of word-initial syllable. And any variations in the length of this first syllable-whether a prefix or not- will have the effect of increasing or decreasing average recognition time. Additional support for the continuous access hypothesis comes from the relationship between the recognition-points we obtained in the gating task and the theoretical recognition-points we calculated on the basis of dictionary counts. If the cohort model is correct in claiming that the recognition points for both free stems and prefixed words are determined by the properties of their respective word-initial cohorts, then there should be a significant correlation between the observed gating recognition points and the recognition-points calculated on theoretical grounds. This is 375 ACCESS what we found. The correlation between the gating recognition-points and theoretical recognition-points was +0.797 (df = 27, p < .OOl) for the free stems, and + 0.562 (df = 27, p < .OOl) for the prefixed words. We now turn to a second experiment in which we test the same stimuli in an auditory lexical decision task. EXPERIMENT 2 Although the results of the gating study were unequivocal in rejecting the prefixstripping hypothesis, we decided to run an additional test, using the auditory lexical decision task with the same stimuli. The previous experiment differs from Taft et al. (1986) not only in the type of stimulus it used (real words as opposed to nonwords), but also in the kind of experimental paradigm (gating as opposed to lexical decision). The goal of this second experiment, therefore, was to establish the role of these two different tasks in generating such different outcomes. In particular, we wanted to rule out the possibility that the gating task itself, with its strictly sequential and incremental mode of presentation, might be artificially inducing a continuous left-toright access strategy. Method Materials. We used the acoustic tokens of the 36 test pairs recorded for the gating study. In addition, we selected 96 nonwords and 60 filler real-words. Of the nonwords, 60 were monosyllabic, 26 were prefixed bi- or trisyllabic strings, and 10 were unprefixed bisyllables. The real-word tillers consisted of 40 monosyllables and a further 20 monomorphemic two or three syllable words. An additional practice list, of 50 real words and nonwords, was also constructed. These materials were then recorded by the same speaker who had recorded the original gating test pairs. There were no detectable differences between the earlier and the later recording. 376 TYLER mo test versions were constructed by cross-recording from master tapes. Each version contained half the prefixed items and half the free stems. These test items were pseudorandomly interspersed with filler items. Timing pulses were then placed (to within 10 ms accuracy) on the nonspeech channel of the tape at the onset of each test word. These pulses served to trigger a timing device which was stopped when the subject pressed a response key. Procedure. Twenty-five subjects (from the MRC Language and Speech subject pool) were tested in groups of four, each seated in separate carrels. They heard the stimuli over closed ear headphones. Given the problems associated with making No responses in this context (as we pointed out in the Introduction), the subjects were required to make Yes responses only. They were asked to press the response key in front of them as rapidly as possible whenever the sequence they heard was a real word. Subjects were paid for the 30-min testing session. Results and Discussion The data from three subjects were discarded because their responses were slow and erratic (their average latencies were over 1 s and they had high error rates). This left 11 subjects for each of the two experimental versions. Extreme outliers (defined as RTs over 1500 ms) and missing values were omitted from the data analysis. These totaled 2.3% of the total. We then computed midmean RTs for each item and each subject, including only the data for the 29 pairs used in the gating analyses. These were then entered into two sets of analyses of variance, on subjects and on items, with the factors of Word Type (Prefixed vs. Free Stem) and of Stress (primary and secondary stress vs. unstressed). As in our discussion of the gating results, we begin by examining the three main predictions made by a stem-access, prefixstripping hypothesis. The first two of these concern the length of the RTs to the pre- ET AL. fixed words as opposed to the free stems. There is now an overall difference between the two sets of 45 ms, with mean response times of 795 ms for free stems as opposed to 840 ms for prefixed words. This difference is significant on the subject (Fl (1,21) = 11.328, p < .Ol) and the item (I?2 (1,27) = 5.053, p < .05) analyses, but not on the Min F’ statistic (Min F’ (1,45) = 3.494, p < .lO). In so far as there is a systematic effect here, it seems to be entirely due to the prefixed items with primary or secondary stress. The unstressed prefixes (mean RT of 806 ms) do not differ from their matched free stems (mean of 795 ms). Where there is a difference is between the stressed prefixes (mean RT of 864 ms) and their matched free stems (mean of 794 ms).7 The difference between stressed and unstressed prefixed words is numerically almost identical to the differences in mean recognitionpoint for these two classes of prefixed items in the gating study, and undoubtedly derives from the same source-namely, the 62-ms difference in mean duration between stressed and unstressed prefixes. The overall difference in lexical decision time between free stems and prefixeswhatever its source-is in any case far smaller than any prefix-stripping account could predict. Given the mean length of the prefixes (183 ms), response-time to the prefixed words should have been of the order of 980 ms, rather than 840. The other main prediction of the stemaccess theory concerned the relationship between recognition-times in prefixed words and in the matched free stems. Following the same procedure as in Experiment 1, we subtracted prefix duration from the mean RT for each prefixed word, and correlated these corrected RTs with the ’ On the overall analysis, these effects were only significant on the subjects analysis, where there was a main effect of Stress (Fl (1.21) = 4.876, p < .05) and an interaction with Word Type (Fl (1,21) = 4.775, p < .05). Neither of these was significant on the items analysis or on the Min F’ statistic (Min F’ < 1). CONTINUOUS AND DISCONTINUOUS RTs for the free stems. As in the gating study, the outcome (r = +.017, df = 27) suggests that the two sets of RTs have no variance in common. This dissociation is confirmed by the frequency effects for the two groups. Overall, the correlation between RT and word-frequency is - .358 (df = 27)*. This also holds within the prefixed group alone, with a correlation between the frequency of the full form and RT of - .360 (& = 27, p = .05). What we do not find, however, is a significant correlation between prefixed word RT and the frequency of the free stem (df = 27, r = - .153). On a stem-access hypothesis, where morphologically complex forms are initially accessed via their stems, one might expect the frequency of the stem to contribute to the frequency effect at least as much as the frequency of the form as a whole. Given these results, we can exclude the possibility that the failure of a discontinuous access model in Experiment 1 was in any way due to the special properties of the gating task. Beyond this, the results confirm that the recognition points computed in the gating task accurately reflect the point at which listeners can recognize prefixed words and free stems in ordinary listening. If the recognition points computed in Experiment 1 are correlated with the mean RTs for the corresponding items in Experiment 2, we obtain a correlation of + .81 for the free stems and + .69 for the prefixed forms. The gating and the auditory lexical decision tasks provide very similar estimates of the basic timing of spoken word-recognition processes. We now turn to a third experiment, testing the same stimuli in an auditory naming task. EXPERIMENT Despite the agreement 3 between the s This is another potential source for slower responses to prefixed items, since these were much lower in frequency than the free stems. ACCESS 377 gating and lexical decision tasks, there are two potential criticisms that could be levied against lexical decision as a test of the prefix-stripping hypothesis. First, there is the now quite widely held belief (e.g., Seidenberg, Waters, Sanders, & Langer, 1984) that the task has a strong postaccess component, so that it is not a pure reflection of the immediate recognition process. Second, there is the possibility that the task encourages a strategy where subjects wait until the end of the word to respond, independent of when they actually recognize it, in order to make sure that is does not become a nonword. If these criticisms are correct, so that the lexical decision response is primarily determined by a late, postaccess representation of the input, then this might make it insensitive to the early processes of decomposition and stem-based search predicted by a discontinuous access hypothesis of the prefixstripping type. To meet these possible objections, we ran the same stimuli on a third task (auditory naming). This is a task which-at least for the naming of visual stimuliseems to be relatively less sensitive to postaccess strategic processes (e.g., Balota & Chumley, 1984, 1985), and where there is no necessity for subjects to wait until the end of the word to respond. They do not hear any nonwords, and can respond as soon as they have identified the word they are hearing (Marslen-Wilson, 1985). Method Materials. For this experiment we crossrecorded the prefixed words, the free stems, and the associated real word tillers used in the lexical decision study (together with their timing pulses) onto two new tapes, omitting the nonwords, to create two new experimental versions. This meant that we could use the identical acoustic tokens as in the first two experiments. The timing pulses (as before, placed at word onset) activated a timer which was connected to a voice-key. The timer stopped when the subject named the word. 378 TYLER Twenty-two subjects were tested individually in a quiet room. Subjects heard the stimuli over closed ear headphones and were asked to say each word out loud as soon as they recognized it. Each experimental session lasted 20 min and subjects were paid for their participation. ET AL. TABLE Procedure. PERFORMANCE 1-3 (MS) stems Unstressed prefixes Stressed prefures 444 401 458 794 629 806 630 864 702 Free Gating recognition-points Lexical decision reaction-times Naming latency 1 IN EXPERIMENTS Results and Discussion Out of the 11 subjects tested on each of the two versions of the stimuli, two from each group were omitted from the data analyses because their responses were slow and erratic. The data from the remaining 18 subjects were anlyzed after missing and extreme values (Cl%) had been removed. Midmeans for each item and each subject were computed and entered into two two-way ANOVAs (on Word-Type and Stress) which formed the basis for the Min F’ statistic. The pattern of results was almost identical to the lexical decision results, except that overall response time was considerably faster (650 as opposed to 817 ms)-indicating that listeners were responding much earlier, and in some caseswell before they reached the end of the word. There was again an overall difference between prefixed words and free stems, with mean latencies, respectively, of 672 and 628 ms. This difference was significant on both subject (Fl (1,17) = 11.934, p < .Ol) and item (F2 (1,27) = 5.808, p < .05) analyses, but not on the Min F’ statistic (Min F’ (1,43) = 4.907, p < .lO). The size of this effect is identical to what we observed for the same items in the lexical decision study, and is again entirely due to the stressed prefixed items, with naming latencies of 702 ms, as opposed to the 630 ms mean latencies for 9 On the overall analysis there is no main effect of Stress (Min F’ < l), but there is a marginally significant interaction between Stress and Word Type for the subject (Fl (1,17) = 6.473, p < .05) and for the item (F2 (1,27) = 3.760, p < .lO) analyses, although not on the Min F’ statistic. the unstressed items.9 This seems to be the same effect that we saw in the gating and the lexical decision studies, reflecting the slight increase in duration of stressed prefixes (see Table 1) There is no evidence here either to support a prefix-stripping account, or to suggest that the gating and the lexical decision tasks give an inaccurate picture of the properties of the spoken word-recognition process. In fact, as Table 1 illustrates, the overall pattern is very similar across the three tasks, despite the superficially very different kinds of response that each task requires. CONCLUSIONS The research we reported here was designed to contrast the predictions of continuous and discontinuous theories of lexical access, and the differing implications of these claims for the representation of lexical form. The results, for a stimulus set tested in three quite different experimental situations, provide no evidence here for any discontinuity in the access process, whether associated with derivational morphology, or with the stress value of the initial syllable. The presence of a prefix, whether stressed or unstressed, does not to Other research (Hall, 1987; Marsien-Wilson, Tyler, Hall, & Warren, unpublished) shows, in a similar set of experiments, that the prefix induces no delays in the access process, irrespective of the relationship between the prefix and its stem. Cases like rebuild, where the relationship between stem and prefix is transparent, behave in the same way as cases like delight, where the relationship is opaque. CONTINUOUS AND DISCONTINUOUS introduce any necessary delay into the access process, lo and there was nothing in the results to support the claim that lexical access is delayed until the stem can be identified. Words are simply recognized too early for the system to be waiting that long. The results, instead, are much more compatible with the type of continuous access process suggested by the cohort model. Prefixed and nonprefixed words appear to be recognized in the same way, by continuously mapping the sensory input onto representations of lexical form and identifying the word at the point at which it becomes unique, relative to its word-initial cohort. The significant correlations between theoretical recognition-points and gating recognition, for both free stems and for prefixed words, reflect this dependence of the on-line recognition process on the decision-space defined by the full-forms of the words involved. l1 We turn now to the implications of this result for the representation of lexical form. We should stress immediately that we are concerned here only with what we have called the recognition lexicon: those representations of lexical form that are the direct targets of the lexical access process, and onto which the listener maps the incoming speech signal during the on-line process of lexical choice. We have nothing to say here about higher levels of lexical representation, and in particular about their semantic properties. Any claims we make for or against decomposition in the access process do not necessarily apply to the subsequent process of syntactic and semantic interpretation. In our earlier discussion of the representational requirements of the cohort model, and of a continuous access theory, we I1 Note that we also obtained significant correlations between theoretical uniqueness-points and response latencies in the two response-time studies. For free stems and for prefixed words, the correlations were, respectively, + 26 and + 20 in the lexical decision experiment, and + .37 and + 40 in the naming experiment (df = 27 throughout). ACCESS 379 claimed that these required afull listing hypothesis: that to construct the word-initial cohort, the system in some sense requires all the possible candidates to be already listed. In the discussion here, we explore some of the limits on this notion, beginning with the question of the abstractness of the representation we are postulating. We have talked so far as if “full-listing” meant the storage in the recognition lexicon of a specific acoustic-phonetic realization of a given word-form. This cannot be literally true. The specific realization of a word-form will vary radically as a function of differences in speakers, in speech rate, and in the phonological environment in which the item occurs. The representation in the recognition lexicon must abstract away from these sources of variation. It is an urgent question for the study of spoken word-recognition in general to determine just what the nature of this abstract representation might be. Second, even if full-listing applies to derivationally prefixed words, do we want to claim that it also applies to derivationally or inflectionally suffixed words in English, and to inflectionally prefixed words in languages like Dutch or German? On general processing grounds, there seems to be less benefit for the processing system to insist on full-listing for suffixed forms. If anything like a cohort model is correct, then continuous access would be possible without full listing for caseswhere the stem is encountered first. Functional arguments in this domain can be quite misleading, however. In fact, as far as English is concerned, the evidence currently available in the spoken domain tends to support full listing of derivationally suffixed words, as well as of the more frequent inflectionally suffixed words (e.g., Fowler et al,, 1985; Stemberger & MacWhinney, 1986). There is very little work on languages other than English in the spoken domain. One recent exception, the research by Jarvella and Meijers (1983) looking at inflectionally pre- 380 TYLERETAL. fixed words in Dutch, presents a complex pattern of results for which it is difficult to find a unitary interpretation. On a priori grounds, however, it is for inflectional prefixation, if anywhere, that one might expect the full-listing hypothesis to fail for prefixed words, and this is clearly an area which requires more research. Finally, we should make it clear that a full listing hypothesis for the purposes of lexical access is by no means incompatible with a decomposition hypothesis for the purposes of lexical integration (MarslenWilson, 1987)-that is, for the purposes of mapping lexical semantic and syntactic information onto higher levels of interpretation. As we have argued elsewhere (Tyler and Marslen-Wilson, 1986), it is likely that there is on-line decomposition of inflectionally suffixed words as they are heard in an utterance context. In this earlier research we were able to demonstrate the differential sensitivity of stems and inflectional aflixes to the properties of the structural semantic and syntactic environment in which they were being heard. By the same token, it is also possible that at least some derivationally prefixed words-for example, relatively transparent forms like rebuild or unhappy-may be decomposed as they are heard for the purposes of semantic interpretation. But if this kind of decomposition does occur, our evidence here shows that it occurs after access, and not as part of the access process itself. ACKNOWLEDGMENTS Experiment 1 in this report was carried out by Rentom and Hanney as an undergraduate research project in the Department of Experimental Psychology, University of Cambridge, under the supervision of the first two authors. We thank Vera Kamphuis and Marie Jefsioutine for canying out Experiments 2 and 3, and Howard Cobb, Chris H&l, and Paul Warren for their help with many aspects of this study. The research was supported in part by an MRC program grant to Tyler andMarslen-Wilson. APPENDIX A The following items were used in Experiment with their recognition-points (in ms). Prefixed words Stems RP Amoral Delight Asleep Enlarge Remind Rebound Discount Adrift Discharge Provision Pretext Outclass Deform Rebuild Misfortune Outnumber Rethink Profile Mislead Defrost Discard Disgrace Prearrange Refund Return Proclaim Prejudge Depart Ahead 1, 439 368 390 454 479 446 388 425 414 423 518 545 450 467 454 477 467 450 383 483 438 487 408 508 355 336 458 346 333 RP Moral Light Sleep Large Mind Bound Count Drift Charge Vision Text Class Form Build Fortune Number Think File Lead Frost Card Grace Arrange Fund B.tm Claim Judge Fart Head 404 383 421 558 550 477 405 362 533 358 500 529 511 379 500 362 354 535 446 465 533 409 446 555 417 427 383 362 323 REFERENCES BALOTA, D. A., & CHUMLEY, J. I. (1984). Are lexical decisions a good measure of lexical access? The role of word-frequency in the neglected decision stage. Journal of Experimental Psychology: Human Perception and Performance, 10, 340-357. BALOTA, D. A., & CHUMLEY, J. I. (1985). The locus of word-frequency effects in the pronunciation task: Lexical access and/or production? Journal of Memory and Language, 24,89-106. BU~ERWORTH, B. (1983). Lexical representation. In B. Butterworth (Ed.), Language production, Vol. 2. New YorWLondon: Academic Press. FOWLER, C. A., NAPPS, S. E., & FELDMAN, L. (1985). Relations among regular and irregular morphologically related words in the lexicon as revealed by repetition priming. Memory and Cognition, 13, 241-255. CONTINUOUS AND DISCONTINUOUS FRANCIS, W. N., & KUCERA, H. (1982). Frequency analysis of English usage: Lexicon and grammar. Boston: Houghton Mifflin. GROSJEAN, F. (1980). Spoken word recognition cesses and the gating paradigm. Perception Psychophysics, proand 28, 267-283. GROSJEAN, F., & GEE, J. (1987). Prosodic structure and spoken word recognition. Cognition, 25, 135-155. HALL, C. J. (1987). Language structure and explanation. PhD thesis, University of Southern California. HENDERSON, L. (1985). Towards a psychology of morphemes. In A. W. Ellis (Ed.), Progress in the psychology of language, Vol. 1. London: LEA. HENDERSON, L., WALLIS, J., & KNIGHT, K. (1984). Morphemic structure and lexical access. In H. Bouma & D. G. Bouwhuis (Eds.), Attention and performance, Vol. X. London: LEA. JARVELLA, R. J., & MELTERS, G. (1983). Recognizing morphemes in spoken words: Some evidence for a stem-organised mental lexicon. In G. B. Flores d’Arcais & R. J. Jarvella (Eds.), The process of language understanding, Chichester: Wiley. JONES, D., & GIMSON, A. C. (1984). Everyman’s English pronouncing dictionary. London: Dent. KEMPLEY, S., & MORTON, J. (1982). The effects of priming with regularly and irregularly related words in auditory word recognition. British Journal of Psychology, 13, 441-454. MARSLEN-WILSON, W. D. (1984). Function and process in spoken word-recognition. In H. Bouma L D. G. Bouwhuis (Eds.), Attention and Performance, X. London: LEA. MARSLEN-WILSON, W. D. (1985). Speech shadowing and speech comprehension. Speech Communication, 4, 55-73. MARSLEN-WILSON, W. D. (1987). Functional parallelism in spoken word-recognition, Cognition, 25, 71-102. MARSLEN-WILSON, W. D., & TYLER, L. K. (1980). The temporal structure of spoken language understanding. Cognition, 8, l-71. MARSLEN-WILSON, W. D., & WELSH, A. (1978). Processing interactions during word recognition in 381 ACCESS continuous speech. Cognitive Psychology, 10, 29-63. NORRIS, D., & CUTLER, A. (1985). Juncture detection. Linguistics, 23, 689-705. SEIDENBERG, M. S., WATERS, G. S., SANDERS, M., & LANGER, P. (1984). Pre- and post-perceptual loci of contextual effects on word recognition. Memory and Cognition, 12, 315-328. STEMBERGER, J. P., & MACWHINNEY, B. (1986). Frequency and the lexical storage of regularly inflected forms. Memory and Cognition, 14, 17-26. TAFT, M. (1981). Prefix stripping revisited. Journal of Verbal Learning and Verbal Behavior. 20, 289-297. TAFT, M. (1985). The decoding of words in lexical access: A review of the morphographic approach. In D. Besner, T. G. Wailer, 8t G. E. Mackinnon (Eds.), Reading research: Advances in theory and practice, Vol. V. New York: Academic Press. TAG, M., & FORSTER, K. I. (1975). Lexical storage and retrieval of prefixed words. Journal of Verbal Learning and Verbal Behavior, 14, 630-647. TAFT, M., HAMBLY, G., & KINOSHITA, S. (1986). Visual and auditory recognition of prefixed words. Quarterly Journal 38A, 35 I-366. of Experimental Psychology, TYLER, L. K., & MARSLEN-WILSON, W. D. (1986). The effects of context on the recognition of polymorphemic words. Journal of Memory and Language, 25,241-256. TYLER, L. K., & WESSELS, J. (1983). Quantifying contextual contributions to word recognition processes. Perception and Psychophysics, 34, 409-420. TYLER, L. K., & WESSELS, J. (1985). Is gating an online task? Evidence from naming latency data. Perception and Psychophysics, 38,217-222. WARREN, P., & MARSLEN-WILSON, W. D. (1987). Continuous uptake of acoustic cues in spoken word-recognition. Perception and Psychophysics, 41, 262-275. (Received May 13, 1987) (Revision received February 1, 1988)
© Copyright 2026 Paperzz