Open Research Online The Open University’s repository of research publications and other research outputs The attention-grammar interface: eye-gaze cues structural choice in children and adults Journal Article How to cite: Ibbotson, Paul; Elena, Lieven and MIchael, Tomasello (2013). The attention-grammar interface: eyegaze cues structural choice in children and adults. Cognitive Linguistics, 24(3) pp. 457–481. For guidance on citations see FAQs. c 2013 Walter de Gruyter Version: Accepted Manuscript Link(s) to article on publisher’s website: http://dx.doi.org/doi:10.1515/cog-2013-0020 Copyright and Moral Rights for the articles on this site are retained by the individual authors and/or other copyright owners. For more information on Open Research Online’s data policy on reuse of materials please consult the policies page. oro.open.ac.uk The Attention-Grammar Interface Running Head: THE ATTENTION-GRAMMAR INTERFACE. The Attention-Grammar Interface: Eye-Gaze Cues Structural Choice in Children and Adults 1 The Attention-Grammar Interface Abstract We investigated whether children (3- and 4-year-olds) and adults can use the active passive alternation – essentially a choice of subject– in a way that is consistent with the eye-gaze of the speaker. Previous work suggests the function of the subject position can be grounded in attentional mechanisms (Tomlin, 1995; 1997). Eye-gaze is one powerful source of directing attention that we know adults and young children are sensitive to; furthermore, we know adults are more likely to look at the subject of their sentence than any other character (Gleitman, January, Nappa & Trueswell, 2007; Griffin & Bock, 2000). We demonstrate that older children and adults are able to use speaker-gaze to choose a felicitous subject when describing a scene with both agent-focused and patient focused cues. Integrating attentional and grammatical information in this way allows children to limit the degrees of freedom on what the function of certain linguistic constructions might be. Key words Attention-Grammar Interface, Eye-Gaze, Subject, Argument-Structure Constructions, Social Cognition. 2 The Attention-Grammar Interface Introduction Understanding the interface between attention and grammar is a central concern in many functional linguistic theories (e.g., Langacker, 1991; Talmy, 2007) and an important topic in psycholinguistic research (e.g., Henderson and Ferreira, 2004; Trueswell and Tanenhaus, 2005). Directing attention via referential and perceptual priming causes people to construe a scene in a particular way and typically this is reflected in the linguistic structures people use (Prentice, 1967; Turner and Rommetveit, 1968; Posner, 1980; Tomlin 1995, 1997; see also review by Myachykov et al., 2011). Here we investigate the extent to which eye-gaze influences speakers’ construal of a scene in both adults and children. We offer a developmental perspective as children have to learn the particulars of the form-construal mapping in their language and using social-cognitive information such as eye-gaze might be one cue (among many others) as to how this works. First, we introduce the cognitive linguistic perspective on construal and the attention-grammar interface. Second, we discuss the hypothesis that the grammatical notion of subject can be grounded in the cognitive concept of attention. Third, we outline the development of subject knowledge, attention and eye-gaze sensitivity. Finally, we set out how this previous work relates the hypothesis that social-cognitive cues such as eye-gaze might scaffold construction learning. Construal and the attention-grammar interface The same state of affairs in the world can be encoded by the use of different linguistic devices to communicate a nuanced range of perspectives. For example: 1a. the roof slopes gently downwards 1b. the roof slopes gently upwards 3 The Attention-Grammar Interface The two scenes referring to the same roof can be mentally viewed or ‘construed’ from either above (1a) or below (1b) (cf. Langacker 1988: 62). Languages have of course evolved many different ways to alter how a particular concept is construed in the mind including different structural frames: 2a. the dog chased the cat 2b. the cat was chased by the dog Note that the scenes in 1a-1b and 2a-2b are truth-conditionally equivalent in the sense that the state of affairs in the world which requires the statements to be true is the same for both. For example, in 1a and 1b there is a roof that exists such that it is angled at x degrees. The different expressions are therefore not describing different facts about the world rather they are conventionalized and prefabricated ways of expressing different perspectives. In some functional linguistic approaches (e.g., Goldberg, 2006) more abstract forms such as argument-structure constructions carry their own meaning independent of the items that appear in them; thus the hearer’ s attention is still guided to what happened to the gazzer in the passive ‘the gazzer was mibbed by the pubber’ regardless of the fact that we do not know what the words mean. Contrast this with theoretical frameworks that analyse 2b as the movement of the cat from the post verbal position into subject position (e.g., Radford et al., 1999). In the functionalist approach, “surface grammatical form does not conceal a ‘truer’, deeper level of grammatical organization; rather, it itself embodies the conventional means a language employs for the structuring and symbolization of semantic content” (Langacker 1987: 46-7). Some of the most extensively investigated attention-directing devices are those components of dynamic events, including those that highlight the conceptual distinctions of 4 The Attention-Grammar Interface containment/support, path/manner, source/goal and figure/ground (Choi and Bowerman, 1991; Jackendoff, 1983; Lakoff, 1987; Talmy, 1985, 2000, 2007). The notion of figure/ground – whereby some information is highlighted with respect to relatively stable background– has been particularly well studied in relation to the English subject (for a similar conceptual division see Fillmore’s (1976) "frame/highlighting" and Langacker’s (1987) "base/profiling") The English Subject and Attention The subject of an English1 clause is a ‘mosaic’ of prototypical coding and behavioural features (Keenan, 1976). For example, the subject typically comes before the verb and triggers agreement with it (e.g., She smiles, not She smile), has a special pronominal form (e.g., She smiles, not Her smiles) and entails certain structural properties (e.g., only the subject can leave in She smiled at him and left). The subject position is also associated with certain discourse properties2; most relevantly for this study, the foregrounding of items that appear in that position and, by definition, backgrounding other items in the clause (depending on the choice of linguistic terminology this function is also been variously referred to as figure-ground, perspective, theme, aboutness and prominence e.g., Talmy, 1985; MacWhinney, 1977; Langacker, 1991). Tomlin has shown that the foregrounding function of subject can essentially be re-described in terms of the cognitive concept of attention (1995, 1997). The key idea here is that attentional mechanisms privilege some forms of information over others by ‘gating’ perceptual input, sustaining focus on what is 1 Cross-linguistically speaking, there have been about 30 different grammatical features that have been variously attributed to the concept of “subject” (for example controlling verb agreement, determining the actor in a subjectless second coordinate clause and so on) and any one language ‘subject’ is only a subset of these features which do not necessarily overlap. 2 For example, the subject position is associated with given information whereas the object position is associated with new information (e.g., Halliday, 1985). 5 The Attention-Grammar Interface foregrounded. What is foregrounded then becomes the subject of the sentence (see Figure 1). This is most clearly exemplified by the active passive alternation in English, the function of which allows speakers to focus attention on what the agent did (active) in contrast to what happened to the patient (passive), examples 2a-2b and Figure 1. The utility of defocusing the agent’s role in an action or state of affairs has not escaped many politicians, most famously realized in the non-apology apology “...mistakes were made…” Figure 1. A schematic of the relationship between the subject role and the active passive voice alternation. Participants that appear in the subject role receive attention. Active sentences are the unmarked form and are relatively more frequent transitive sentences than passives, as indicated by the heavier outline. Arrows indicate the direction of transitivity. To test the hypothesis that subject-is-theme-is-attention Tomlin (1995) asked participants to watch prototypical transitive scenes of two fish approaching each other until one swallows the other and swims away. Tomlin manipulated the attention of participants by placing a 6 The Attention-Grammar Interface flashing red arrow above one or other of the fishes 75 ms before the eating action was completed. Participants were asked to keep their eyes on the character the arrow pointed at and describe what they saw. The majority of adult speakers performed as the subject-istheme-is-attention hypothesis predicted; on the cued agent animations (the arrow was above the fish that was doing the eating) the agent was assigned the subject position, and the clause was active. On the cued patient animations (the arrow was above the fish being eaten) the patient was assigned subject position and the overall clause was passive. Gleitman, January, Nappa & Trueswell, (2007) obtained similar results even when the cue was implicit (participants were largely unaware of the cue because it appeared so briefly), although with a decreased effect size of passivisation. Items that appear at the start of an utterance occupy a salient slot and thus could trigger structural organization somewhat independently of grammatical status (MacWhinney, 1977). Because the sentence initial position is confounded with Subject role in English transitive sentences it is difficult to differentiate between a linear-ordering versus a grammatical-role account of the priming effects. To do so we must turn to languages that permit, under certain pragmatic contexts, more flexible word orders. Three recent studies analyzed perceptually primed structural choice in Russian (Myachykov and Tomlin, 2008), Finnish (Myachykov et al. 2010), and Korean (Hwang and Kaiser 2009). Overall the studies suggested that in flexible word-order languages the extent of perceptual priming is consistently weaker than in the fixed word-order languages. Myachykov et al (2010) propose that speakers universally attempt to employ the grammatical-role assignment mechanism in order to represent the perceptually salient referent but this interacts in complex ways with the availability and reliability of the linguistic resources of the particular language. In languages like Russian and Finnish, for example, passives are rare or largely 7 The Attention-Grammar Interface dispreferred and as a result, a linear-ordering mechanism is used to accommodate referential salience in terms of word order. The exact nature of the attention-grammar interface is still uncertain and of course, subject position is one attentional cue among many others. For example, in unmarked cases, English tends to correlate theme with given information and subject position, and focus with new information and object position (and usually also prosodic stress). The famous Moses illusion takes advantage of this pattern: when asked “How many animals of each kind did Moses take on the Ark,” most people respond “two,” even though it can be independently established that they know that it was Noah, not Moses, who took the animals on the Ark (Erickson & Mattson, 1981). The fundamental role of attention in this process is underlined by the fact the illusion can be ameliorated when attention is focused on the incongruent item using structures such as clefts (3a) and there-insertions (3b) (Traxler, 2012) 3a. It was Moses who took two of each kind of animal on the Ark. 3b. There was a guy called Moses who took two of each kind of animal on the Ark. While acknowledging the role that focus and other salience cues play at the attentiongrammar interface, we concentrate here on the subject position for the theoretical reasons outlined above. Recognizing that these and other factors do influence structural choice, the counterbalancing procedure controls for these effects, see Design. The development of subject knowledge, attention and eye-gaze sensitivity The overall developmental picture is that children acquire the different features of the English subject gradually and at different times in a ‘mosaic’ fashion (Rispoli, 1991). Young children identify ‘subject’ as the most animate participant, or the first-mentioned 8 The Attention-Grammar Interface participant, or the agent, which suggests something less abstract than an adult-like notion of subject (see Corrigan, 1998 for a review). Experimentally Braine et al. (1993) have shown that mastery over the notion of English subject appears at around 5-6 years of age (see also Corrigan, 1998). Consistent with this, typically children do not produce full passives in spontaneous speech until about 4-5 years-of-age however performance can be significantly boosted when the passive form is supported with case marked pronouns (Ibbotson et al., 2009) with training (Pinker, Lebeaux & Frost 1987; Brooks & Tomasello, 1999) and when the passive form is more frequent in the ambient language (relative to Indo-European languages: Allen & Crago, 1996 for Inuktitut; Suzman, 1985 for Zulu; Pye & Poz, 1988 for K’iche’ Mayan). Following Croft (2001) one explanation for this relatively late and piecemeal acquisition pattern is that in reality, abstract constructions such as intransitive, transitive, passive and there-constructions actually have their own subject. They may only be united by analogy later on in development under something like a highly schematic subject-predicate construction (Tomasello, 2003). In terms of the development of attention, unsurprisingly, younger children are more easily distracted by task-irrelevant events (Enns & Girgus, 1985; Well, Larch, & Anderson, 1980; Carlson, 2005), they search less efficiently for a specified target (Day, 1978; Gibson & Yonas, 1966), they are less able to sustain attention on a given task (Corkum et al., 1995; Kannass and Oakes, 2008) and they are less able to switch efficiently from task to task (Guttentag, 1985; Pick & Frankel, 1974). The general developmental picture for infants is one of increasing cognitive control so that by around their fourth year they have begun to orientate their attention with volition and flexibility (Hughes, 1998; Ruff & Rothbart, 1996). Orienting of attention in this context refers to the alignment of some internal mechanism 9 The Attention-Grammar Interface with an external sensory input source that results in the preferential processing of that input. The external sensory input source we are most interested in here is eye-gaze. Humans show a strong sensitivity to eye-gaze from birth (Farroni, Csibra, Simion, & Johnson, 2002). Neonates can follow gaze if the pupils are seen to move (Farroni, Massaccesi, et al., 2004) and at around 5-months-of-age they can discriminate between very small horizontal deviations (5°) of eye gaze (Symons et al., 1998). Like adults, infants process facial features in deeper way when gaze is directed towards them as compared with averted gaze (Farroni et al., 2002; Farroni, Johnson, & Csibra, 2004; Farroni, Massaccesi, Menon, & Johnson, 2007; Hood et al., 2003). Clearly, the capacity to use another person’s eye gaze as a cue to attention develops very early in life, however, to begin with this might be achieved with rather low-level, low-mentalising mechanisms, for example, the perceptual geometry and luminance of the eye (Ando, 2002). Compared with other primates, humans have a relatively large white sclera surrounding a small dark pupil and iris making eye-gaze discrimination relatively easier in humans than in other animals (Kobayashi & Kohshima, 1997). Supporting the low-mentalising interpretation of eye-gaze sensitivity, a wide range of species have a very accurate ability to determine whether they are being looked at (e.g., Burger, Gochfeld, & Murray, 1992; Burghardt & Greene, 1990; Perrett & Mistlin, 1991) and nonhuman primates such as adult rhesus monkeys can discriminate between photographs depicting direct gaze and gaze averted by 5°, the same ability that has been reported in human infants (Campbell, Heywood, Cowey, Regard, & Landis, 1990; Symons, Hains, & Muir, 1998). Although infants’ eye-gaze sensitivities may be based on relatively simple mechanisms (gaze perception), young children soon begin to integrate eye-gaze information into a more sophisticated picture of how other people work including their future intentions 10 The Attention-Grammar Interface and mental states (Baron-Cohen, 1994; see Striano & Reid, 2006, for a recent review). The development of joint attention at 20 months predicts theory of mind abilities at 44 months (Charman et al., 2001) underlining eye-gaze as a key component in the development of social cognition in early life. Baron-Cohen, Campbell, Karmiloff-Smith, Grant, and Walker (1995) found that children ages 3 and 4 years old deduce the direction of gaze of a schematic face and they can ascribe mental states such as desires on the basis of the direction of gaze (see also Lee, Eskritt, Symons, & Muir, 1998). Thus, understanding that direction of gaze can indicate which objects a person knows exists, is currently attending to, and holds a mental state about can help a child infer much about the current visual world (although this understanding may not be as flexible as adults when cues conflict e.g., Friere, Eskritt, & Lee, 2004; Pellicano & Rhodes, 2003; Pellicano and Rhodes, 2003). Eye-gaze following at 6 months has been shown to correlate with vocabulary size at 18 months (Morales et al., 2000; Morales, Mundy, & Rojas, 1998) and in noun learning, children can use eye-gaze, head posture and gesture to infer speakers’ referential intention (e.g., Baldwin, 1991; Carpenter , Akhtar & Tomasello, 1998; Gergely, Bekkering & Király, 2002; Woodward & Sommerville, 2000). Nappa, Wessel, McEldoon, Gleitman & Trueswell (2009) showed that 3-, 4- and 5-year-olds used the eye-gaze of the speaker to infer the meaning of novel relational verbs (of the type chase vs. flee) in linguistically uninformative contexts (e.g., He’s mooping him). Thus children who saw a speaker looking at the chaser when they uttered the novel verb were more likely to attribute ‘chase-like’ semantics to the novel verb. The opposite effect was found when a speaker looked at the flee-er. Change in gaze direction is one of several behavioral cues that individuals use in combination with changes in facial and vocal displays and body posture to mark the 11 The Attention-Grammar Interface intention to act on an object (Mumme et al., 2007). Crucially, just prior to speaking, adults are more likely to look at the subject of their sentence than any other character (Gleitman, January, Nappa & Trueswell, 2007; Griffin & Bock, 2000). This raises the possibility that children could use this cue in the input, probabilistically at least, to build a correspondence between the perspective of an event and how that perspective is expressed in their language (cf. Nappa et al., 2009). Just as verbs such as chase and flee can lead to different construals of the same (perceptual) scene so can argument-structure constructions like the activepassive alternation, which are basically perspective-taking devices. In summary, the above evidence raises the possibility that young children could use the social-cognitive cue of eye-gaze – which directs attention – to infer the function of grammatical subject – which is grounded in attention. We investigated this by exploring if they can use the active passive alternation (essentially a choice of subject) in a way that is consistent with the eye-gaze of the speaker. The developmental hypothesis is that if the function of subject position is grounded in attentional mechanisms (Tomlin, 1995; 1997), then we would expect that developing attentional abilities should interact with developing linguistic ability to assign a subject. Thus we would expect different age groups to perform differently. Our second hypothesis is more of a general theoretical point and is relevant to adults and children. As the evidence we have reviewed suggests, some cognitive linguistic frameworks suggest a close relationship between attention and linguistic performance (e.g., Givon 1992; Landau and Jackendoff 1993; MacWhinney 1977; Osgood and Bock 1977; Talmy 2007). This general assumption leads to the specific hypothesis that a speaker’s attention can be guided by the eye-gaze of their interlocutor and that this in turn affects the structural choice of their utterances, specifically whether to use an active or passive sentence. 12 The Attention-Grammar Interface Methods Participants A total of 91 participants were tested. For each age group we report the numbers in each group (N) the mean age in years (M) and the standard deviation of the group as calculated on months-of-age (SD). For the Test condition: 3-year-olds (N=21, M=3.55, SD±3.66), 4year-olds (N=23, M=4.39, SD±2.79) and adults (N=11, M=23.31, SD±87.60). For the Control Condition: 3-year-olds (N=13, M=3.45, SD±3.63), 4-year-olds (N=12, M=4.38, SD±3.60) and adults (N=11, M=19.08, SD±41.25). 7 children were excluded from the analysis because of fussiness (4) or not producing a novel verb (3) – for example they persisted with “the dog’s hitting the cat” even after correction, instead of the intended “the dog’s tamming the cat”. Design and Materials The between-subjects factors were Age (3-year-old, 4-year-old and adults) and Condition (Test, Control) and the within-subjects factor for the Test Condition was Gaze (Agent-Focus, Patient-Focus). Ideally, to increase the power we would have treated each child as its own control with a within-subjects design. We assumed that most children and adults would provide answers in the experiment with an active voice if not cued to do otherwise (as indeed they did). We reasoned that if we ran a cued and non-cued trial with the same individual there was a risk that we would structurally prime the active response to such a degree that the already weakly represented passive would not get produced at all. We were interested in the degree to which participants could switch between active and passive responses in a way that was congruent with eye-gaze. By the time a participant answered with 8 actives responses (as they most likely would have done in the test condition) it would 13 The Attention-Grammar Interface be much harder to elicit a passive than if they have just finished our training phase (see below). Likewise, running the control after the test might have artificially boosted passive responses in the control condition. Means and standard deviations of age between Test and Control were comparable (see Participants). Increasing the activation levels/ease of retrieval of the either active or passive was avoided by keeping control and test participants separate. Although this reduced the power, as it turned out the effect was still strong enough to detect. If participants came to the experiment with no preference for describing a scene with either actives or passives and responded at random then we could expect the proportion of congruent answers to be 50%, that is, an active response to agent-focused question and a passive response to a patient-focused question. If participants came to the experiment with a preference for describing a scene with only actives then again we could expect the proportion of congruent answers to be 50% – participants score 4/4 on the active-focused questions but 0/4 on passive focused questions. The design therefore takes account of baseline preference with respect to congruency. Despite this methodical treatment of participant’s preference we still ran a control to check there was nothing strange about the puppets used or experimental set-up and to measure the different age groups’ ‘default’ response that might vary for whatever reason. We therefore present the control data alongside the test data in the results section as it impacts on how easily participants can switch from their ‘default’ response to an eye-gaze cued response. There were 8 test questions per participant, with an equal number of Agent-Focused and Patient-Focused trials, the orders of which were randomised. Over the experiment, the order of appearance of model sentences (training phase, see below), the left/right appearance of the participant in focus and the participant acting as agent were counterbalanced. There were six hand puppets used, randomly assigned to either the training phase or the test phase. 14 The Attention-Grammar Interface Referential salience and conceptual accessibility affect the degree to which a participant might attend to a particular character. Care was taken to match the puppets on size, general colourfulness and animals that could be considered prototypical instances of their own kind, (a monkey, a pig, a horse, a dog, a cat, and a bear). All actions were reversible transitives and the experimenter took care to make the actions as comparable as possible between trials (as far is possible with the act-out method). Importantly however, the counterbalancing procedure controls for salience. For example, if participants considered the pig more salient than the cat and correspondingly chose this as the subject, the pig appeared both as the focus of attention in active and passive conditions, and the unfocused participant of active and passive conditions over the course of the experiment. Procedure Training Phase The experimenter modelled novel verb, (tamming) using animal hand puppets. This was done twice in the active voice (e.g., “look! the horse is tamming the goat”) and twice in the passive voice (e.g., “look! the frog is getting tammed by the cow !”). After each model the experimenter asked the child or adult to repeat the sentence (e.g., “can you say the horse is tamming the goat?”). The experiment did not progress until the participants had correctly repeated all four of the model sentences at least once. The action consisted of an agent hitting the top of a patient’s head with a cardboard tube (see Figure 1). We used the cardboard tube to increase the distance between participants and thus make it easier to determine whether the experimenter was looking at the agent or patient (yet still remain a causal transitive scene). Throughout the training phase, the experimenter did not use eyegaze to direct attention onto either of the characters. 15 The Attention-Grammar Interface Test Phase The training puppets were abandoned and four new hand puppets were introduced. The experimenter asked the participant to name the animals, which all did correctly. The experimenter then said “we’re going to play a different game now, this time you have to tell me what the animals are doing”. The experimenter looked at the participant, engaged eye contact, looked at the target animal (either the agent or patient of the impending action) looked back at the participant and then finally looked back to the target animal again. Thus the idea was to establish triadic joint-attention between speaker, addressee and referent. The experimenter then performed the action while continuing to look at the target referent. Shortly after the action had begun the experimenter asked, “what’s happening?” If the participant did not respond, the experimenter persisted for a while with the same question taking care not to linguistically cue them into an active or passive response (e.g., “what is the cat doing”? or “what’s happening to the dog”?) – thus we tried to create an analogous situation to the Nappa et al. ‘linguistically uninformative’ condition for verb learning (2009). Children’s and adult’s responses were recorded and later coded for active or passive voice and the order in which participants were mentioned. We also recorded whether the subject of the sentence (either agent or patient) was congruent or incongruent with the eye-gaze of the speaker/experimenter. For example, if the speaker was looking at the agent of the action and the participant responded with the agent in subject position (i.e., active: agent-V-patient) it was coded as congruent. If the participant responded with the patient in subject position (i.e., passive: patient-Aux-V-(agent)) it was coded as incongruent. It is important to note that either an active or a passive answer to the question “what’s happening” is equally grammatical with respect to who is doing what to whom (assuming the arguments map onto what is actually happening in the scene). We are interested in what could be considered the 16 The Attention-Grammar Interface most pragmatically felicitous response (following adult performance in Tomlin, 1995; 1997 and anticipated adult performance in this study), namely those responses where the subject of the utterance maps onto where the attention has been directed, in this case via the eyegaze of the speaker. Adults and children experienced the same experimental procedure with the same materials. While the hand-puppet procedure might have appeared strange to some adults it was necessary to make the results directly comparable – if we had not used hand puppets any difference between adults and children could be attributable to the different methodologies used. The control condition contained a procedure that was identical to that described above except for the fact that the experimenter did not attempt to focus the attention of the child or adult onto any participant before the action began and looked straight ahead when asking “what’s happening?” (Figure 2). Agent-Focus Control Patient-Focus Figure 2. In this example ‘the cat is tamming the dog’ or, depending on your perspective, the ‘dog is getting tammed’. The eye-gaze of the experimenter cues attention towards either the agent of the action (left), neither agent nor patient (centre) or the patient of the action (right). 17 The Attention-Grammar Interface Results First we present a comparison between the Control Condition and the Test Condition. We underline that the control condition was identical to the test condition except for the fact that the experimenter did not attempt to focus the attention of the child or adult onto any participant before the action began and looked straight ahead when asking “what’s happening?” The results are presented in terms of Eye Gaze (Test Condition) and No Eye Gaze (Control Condition) as a proportion of active responses, displayed in Figure 3 below. Figure 3. Mean Proportion of Active Responses by Age Group. 18 The Attention-Grammar Interface To determine whether there was a significant difference between Test and Control conditions with respect to the mean proportion of active utterances we conducted a 2 (Eye Gaze Focus, No Eye Gaze Focus) by 3 (3-Year-Olds, 4-Year-Olds, Adults) ANOVA with mean proportion of active responses as the dependent variable. There was a significant effect of Condition F(1,85) = 6.07, p=.016 ηp2 = .067, Age F(2,85) = 3.68, p=.029 ηp2 = .08 and no significant interaction between Age and Condition F(2,85) = 0.85, p=.428 ηp2 = .02. This means Eye Gaze significantly affected the active/passive ratio for all age groups. What is most relevant for the discussion is that the youngest age group show the strongest preference for describing the scene (in the absence of eye-gaze focus) with active sentences. This means the youngest age group has the strongest ‘default’ to overcome if they are to switch to describing the scene with a passive. We now present the analysis of the Test data with respect to the mean proportion of congruent responses to eye-gaze (obviously we cannot analyse the control data in this way as there is no eye gaze manipulation in the Control – note the change in y-axis from Figure 3 to 4). Figure 4 shows the mean proportion of congruent responses as a function of whether the experimenter was looking at the agent of the action or the patient of the action for different age groups. 19 The Attention-Grammar Interface Figure 4. Mean Proportion of Congruent Responses as a function of Eye-Gaze and Age. Note the change in y-axis from Figure 3. Overall Congruency Comparisons To determine the overall effect of eye gaze on congruent responses, we first conducted a 2 (Congruent, Incongruent) by 3 (3-Year-Old, 4-Year-Old, Adults) mixed model analysis of variance (ANOVA). This showed a significant main effect of Congruency F(1,52) = 97.94, p<0.001, ηp2 = .653 (M .737 > .263) and a significant interaction between Congruency and Age F(2,52) = 14.36, p<0.001, ηp2 = .356. This means that overall there were more 20 The Attention-Grammar Interface congruent responses than incongruent. The interaction suggests that the difference between congruent and incongruent responses was not the same across all age groups. To analyze this interaction further we conducted an ANOVA by Age. We found significant differences between congruent and incongruent responses for 4-Year-Olds F(1, 22) = 36.59, p<0.001, ηp2 = .625 and Adults F(1, 10) = 306.25, p<0.001 ηp2 = .968 but not for 3-Year-Olds F(1, 20) = 3.50, p = 0.07 ηp2 = .149. This means that regardless of the type of eye-gaze focus (agent/patient) only the 3-Year-Olds were unable to consistently match a congruent response. We now analyze how this overall effect of eye-gaze on congruency is influenced by whether the gaze is focused on the agent or patient. Agent-Focused We analyzed Agent-Focused trials with a 2 (Congruent, Incongruent) by 3 (3-Year-Old, 4Year-Old, Adults) mixed model analysis of variance (ANOVA). There was a significant main effect of Congruency F(1, 52) = 52.49, p<0.001 ηp2 = .502 (M .817 > .183) and no interaction between Congruency and Age F(2, 52) = 1.82, p=0.172 ηp2 = .066. This means all age groups consistently produced active sentences when the experimenter was looking at the agent. Patient-Focused Second, we analyzed Patient-Focused trials with a 2 (Congruent, Incongruent) by 3 (3-YearOld, 4-Year-Old, Adults) mixed model ANOVA. There was a significant main effect of Congruency F(1, 52) = 9.25, p <0.001 ηp2 = .151 (M .657 > .343) and a significant interaction between Congruency and Age F(2, 52) = 5.72, p<0.001 ηp2 = 180. The interaction suggests that the difference between congruent and incongruent responses for Patient-Focused trials was not the same across all age groups. To analyze this further we 21 The Attention-Grammar Interface conducted an ANOVA by Age. We found significant differences between congruent and incongruent responses for 4-Year-Olds F(1, 22) = 5.80, p=0.025, ηp2 = .209 and Adults F(1, 10) = 49.23, p<0.001 ηp2 = .831 but not for 3-Year-Olds F(1, 20) = 0.56, p=.460, ηp2 = .028. This means only the 3-Year-Olds could not consistently produce passive sentences when the experimenter was looking at the patient. Finally we report that for both Control and Test conditions the vast majority of passive responses from children and adults were truncated (e.g., “the cat got tammed”) rather than full passives (e.g., “the cat got tammed by the dog”), Table 1. Table 1. Proportion of Full Passives by Age Group and Condition (%) 3-Year-Olds Test Control 4/168 (2.38) 0/104 (0) 4-Year-Olds 11/184 (5.97) Adults 2/96 (2.08) 19/88 (21.59) 12/88 (13.63) This did not have a significant impact on the main analyses as we are primarily interested in whether the (correct, i.e., congruent) agent/patient appeared in the subject role as a function of speaker’s eye gaze. In principle there are other ways of describing the scene, where the subject would be congruent with eye-gaze but the structure would not be active or passive, such as Dog is hurting from cat tamming it. We did not witness any examples of these types of sentences, perhaps because the training phase primed/reduced the degrees of freedom on what participants thought is an appropriate response to the question “what’s happening?”, i.e. an active or passive was modeled by the experimenter in the training phase (and participants were asked to repeat them) and once participants had begun to answer with these options they continued to do so. In sum, all ages gave more congruent responses than 22 The Attention-Grammar Interface incongruent responses to both agent-focused and patient-focused trials, except for the 3Year-Olds with patient-focused trials. Discussion We set out to investigate whether children can use the active passive alternation – essentially a choice of subject/focus – in a way that is consistent with the eye-gaze of the speaker – a task that requires integrating social and syntactic knowledge. We established that 4-Year-Olds and adults are able to use speaker-gaze to choose a felicitous subject when describing a scene; 3-Year-Olds can do so consistently only when the focus is on the agent. Our interpretation of how participants achieve this performance and of the relatively impaired performance of the 3-Year-Olds on the passive follows. To succeed at this task is not trivial. We don’t know for sure all the steps needed to produce the appropriate response but at the very least, the following processes are all credibly involved. First, both children and adults need to understand that following gaze establishes reference. Subcomponents of this ability are recognizing that looking is intentional behavior directed to external objects and events; that looking results in the mental experience of seeing an object or event; and that others share in the capacity to see things (D'Entremont, et. al., 2007). Second, participants need to coordinate where their attention has been focused with a linguistic representation. This involves selecting the construction that best serves the function of foregrounding a participant, which in this case is the subject position (perhaps something like Fig. 1). Part of this ability requires suppressing the preferred information structure and most heavily entrenched form (the active) when one needs to describe a scene from the perspective of the patient. There is evidence that this maybe more of a challenge for the 3-Year-Olds because they have the strongest preference for describing the scene with an active in the absence of social cues (see Control Condition). In addition, success on the 23 The Attention-Grammar Interface experiment not only requires attention for a given trial but the ability to switch attention between trials. Thus inhibitory control, attentional flexibility, and working memory are all implicated in giving the correct response, which as noted in the introduction, are still in development around 4 to 5 years-of-age (Hughes, 1998). Finally, participants need to produce a string of nouns, verbs and auxiliaries that not only satisfy the grammatical requirements of who did what to whom but also conform to the social-pragmatic demands of the context. Salamo, Lieven and Tomasello (2010) note that children have more difficulty giving pragmatically appropriate responses to sentence-focus questions of the type “what’s happening?” than either argument-focus questions “who is VERB-ing?” or predicate-focus questions “what is AGENT doing?” This pattern corresponds with sentence-focus questions being relatively less frequent to the other types in child directed speech. In this experiment we deliberately chose an unbiased question (linguistically-speaking) to assess the role of social cueing in isolation (i.e., “what’s happening?”). To the extent that sentence-focus questions are more pragmatically difficult, this seems to affect the youngest age group in this study the most. Again, one possible explanation for this is in terms of the development of underlying domain-general capabilities – the idea is that argument-focus questions and predicate-focus questions help to anchor the relevant piece of information (either verb or agent) in short term-memory, from which the appropriate response to the question is constructed (an advantage which is not present in sentence-focus questions). Although more infrequent in child directed speech, sentence-focus questions may benefit the most from the support of social cues precisely because they are linguistically uninformative. Indeed Nappa et al., (2009) found the strongest effect of social cues on verb learning in the linguistically uninformative condition. In reality children face a complex matrix of social and linguistic 24 The Attention-Grammar Interface cues and they have to work out how reliable these are across multiple situations. Using the design in this study, one might predict an even stronger effect of subject choice alternations (and perhaps at younger age) where linguistic anchoring (e.g., “what’s happening to the cat?”) acts in coalition with social cues (e.g., speaker looks at cat). As noted in the introduction, in English the active is the preferred information structure for describing a causal transitive scene: actives outnumber passives 9:1 in written text and the ratios are even higher for spoken discourse. Not only this, the token frequency for passives is low. In one analysis of child directed speech, the passive-per-utterance rate is 0.36% (Gordon and Chafetz, 1990). Thus all age groups have had relatively less experience with the passive than the active, and the results show that there are more congruent active responses than passive responses for all age groups. The active construction also aligns with a non-linguistic preference for conceptually framing events as source-to-goal or instigatorpatient rather than the reverse (Lakusta and Landau, 2005; Lakusta, Wagner, O’Hearn & Landau, 2007). Nappa et al. (2009) note that in many languages, verbs that align with this preference (e.g., chase) outnumber their asymmetric partners (e.g., flee). They also note the instigator-patient bias is present pre-linguistically (Csibra, Biro, Koos, & Gergely, 2003; Woodward, 1998) and continues through to adulthood (Gleitman et al., 2005). Increased experience (use) of the passive with time, particularly with a literate undergraduate population from which the adult sample was mainly drawn (see Dąbrowska, 2006), acts in opposition to this tendency. This experience would make it relatively easier to retrieve the passive construction from memory when it is required (i.e., on the patient-focused trials). Also, in the design of this experiment, all participants were explicitly shown in the training phase (and perhaps reminded in the case of adults) that actives and passives were legitimate ways to describe the scene. This part of the design may have increased the conceptual 25 The Attention-Grammar Interface accessibility (Osgood & Bock, 1977) of the passive and boosted the level of passives description from that which we would normally see in spontaneous speech. Experimentally it has been shown children around their second year of life can use the attentional and emotional states of speakers as cue to their communicative intentions (e.g., Baldwin, 1991; Tomasello and Farrar, 1986). One might wonder why it takes participants in this study until their fifth year of life to link the linguistic and the social domains in a way that allows them to succeed on both agent and patient focused trials. As discussed above, the complexity of the task and the relative frequency of the passive construction hints at one answer but another possibility, as mentioned in the introduction, is that it takes time for young children to integrate eye-gaze information (as a basic perceptual cue) into a more sophisticated picture of how other people work including their future intentions and mental states (Baron-Cohen, 1994; see Striano & Reid, 2006, for a recent review). The difference between the 3 and 4 year olds in our experiment might also be partly attributable to development in this social-cognitive capacity. Without further tests that can disassociate the lower-level perceptual cues from intentional eye-gaze cues (for example a non-voluntary eye-gaze shift) it is not clear to what extent the children in this experiment are using in the intentions of the speaker to assign a subject. For that reason we use the term the attention-grammar interface rather than the social-grammar interface. It is also worth noting that the experiments that have focused on the relationship between attentional states of speakers and communicative intentions (e.g., Baldwin, 1991; Tomasello and Farrar, 1986) have mainly focused of word learning. The adult-like function of subjecthood makes it a much more abstract and less concrete learning challenge than learning words, which, in the usage-based framework at least, means that mastery of subjecthood requires more evidence and more experience with using it. While the 3-year26 The Attention-Grammar Interface olds in this study probably possess many of the social-cognitive foundations that the 4-yearolds do, success on the task needs competence in linguistic and executive control domains as well as effective connections between these domains. Priming studies show that children already have some representation of the passive younger than 4 years-of-age (e.g., Huttenlocher, Vasilyeva & Shimpi, 2004). One might expect that the 3-Year-Olds in this study might be able to succeed on both agent-focused and patient-focused trials. However, the factors we have reviewed above – the steps needed to succeed at this task, the relative frequency of the passive, the conceptual bias for instigator-patient, the relative difficulty with sentence-focus questions, the developing executive control, the abstractness of ‘subject’ relative to word learning – could all disproportionally affect the youngest age group. The performance of the 3-, 4-year-olds and the adults in this study provides further support for the subject-is-theme-is-attention attention hypothesis (Tomlin 1995; 1997). What is more, we have demonstrated this using a more ecologically valid cue than a red arrow hovering above a participant, namely, eye-gaze. Our interpretation of these results is that eye-gaze plays a similar role to the flashing arrow in the Tomlin studies, namely, it is an attention-directing cue that foregrounds one character and, by definition, backgrounds the other. Importantly, we know adults are more likely to look at the subject of their sentence than any other character (Gleitman, January, Nappa & Trueswell, 2007; Griffin & Bock, 2000). This raises the possibility that young children could be using the social cue of eyegaze in situ (which directs attention) to infer the function of grammatical subject (which is grounded in attention). More generally, the methodology we have used here advocates exploring linguistic cues in combination with the social-pragmatic context. By using eye-gaze we have been 27 The Attention-Grammar Interface able to consider a broader range of cues than a traditional corpus-based approach to the development of language. By doing so, we have been able to get closer to reconstructing the rich social-pragmatic-linguistic world in which the child actually grows up. The challenge is to explore ways in which social-pragmatic skills interact with prodigious pattern-finding abilities in a way that which explains the emergence of linguistic knowledge. Yu & Ballard (2007) found that a computational model of word-learning performed better when it used social information (joint attention and prosody) in combination with statistical cues (crosssituational learning) than when it relied on purely statistical information alone. There is no reason to doubt that a combination of social and linguistic cues also helps in learning syntactic relations (e.g., Nappa et al., 2009). We noted in the introduction that, typically, children do not produce full passives in spontaneous speech until about 4-5 years-of-age however performance can be significantly boosted when the passive is supported with case marked pronouns (Ibbotson et al., 2009) with training (Pinker, Lebeaux & Frost 1987; Brooks & Tomasello, 1999) and when the passive form is more frequent in the ambient language (relative to Indo-European languages) (Allen & Crago, 1996; Suzman, 1985; Pye & Quixtan Poz, 1988). To this list we might add that felicitous passive use is also boosted when supported with the kind of social-cognitive cues that children might actually experience, such as eye-gaze. Our understanding of children’s social cognition has progressed significantly, however, it has yet to be worked through in any detail how this knowledge interacts with emerging syntactic representations. This paper is a step in that direction and investigates whether eye-gaze influences the choice of grammatical subject for young children and adults. The linguistic notion of subjecthood is in principle an abstract one. Previous work suggests the function of the subject position can be grounded in terms of attention and 28 The Attention-Grammar Interface information structure. One powerful source of directing attention that we know young children are sensitive to is eye-gaze. We have demonstrated that 4-year-olds and adults (but not conclusively 3-year-olds) are able to use speaker-gaze to choose a felicitous subject. It has been shown before that social-cognitive cues help children learn words but this is the first demonstration that eye-gaze could be important in learning something as abstract as subject role. Integrating attentional and grammatical information in this way allows children to limit the degrees of freedom on what the function of certain linguistic constructions might be (cf. Nappa et al., 2009) and allows linguistic theory to ground abstract functions in deeper cognitive and communicative principles (Tomlin, 1995,1997; Goldberg, 2006; Langacker, 1991). 29 The Attention-Grammar Interface References Allen, S. & Crago, M. (1996). Early passive acquisition in Inuktitut. Journal of Child Language 23(1). 129-155. Ando, S. (2002). Luminance-induced shift in the apparent direction of gaze. Perception, 31, 657–674. Baldwin, D. A. (1991). Infant contribution to the achievement of joint reference. Child Development, 62, 875–890. Baron-Cohen, S. (1994). How to build a baby that can read minds: Cognitive mechanisms in mindreading. Cahiers de Psychologie Cognitive [Current Psychology of Cognition], 13, 513–552. Baron-Cohen, S., Campbell, R., Karmiloff-Smith, R., Grant, J., & Walker, J. (1995). Are children with autism blind to the mentalistic significance of the eyes? British Journal of Developmental Psychology, 13, 379–398. Braine, M.D.S., Brooks, P.J., Cowan, N., Samuels, M.C., & Tamis-LeMonda, C. (1993). The Development of categories at the semantics/syntax interface. Cognitive Development, 8, 465-494. Brooks, P. J. and Tomasello, M. (1999). Young children learn to produce passives with nonce verbs. Developmental Psychology, 35(1), 29-44. Burger, J., Gochfeld, M., & Murray, B. G. (1992). Risk discrimination of eye contact and directness of approach in black iguanas (Ctenosaura similis). Journal of Comparative Psychology, 106, 97–101. 30 The Attention-Grammar Interface Burghardt, G. M., & Greene, H. W. (1990). Predator simulation and duration of death feigning in neonate hognose snakes. Animal Behaviour, 36, 1842–1844. Campbell, R., Heywood, C. A., Cowey, A., Regard, M., & Landis, T. (1990). Sensitivity to eye gaze in prosopagnosic patients and monkeys with superior temporal sulcus ablation. Neuropsychologia, 28, 1123–1142. Carlson, S. M. (2005). Developmentally sensitive measures of executive function in preschool children. Developmental Neuropsychology, 28, 595-616. Carpenter, M., Akhtar, N., & Tomasello, M. (1998). Fourteen- through 18-month-old infants differentially imitate intentional and accidental actions. Infant Behavior and Development, 21(2), 315–330. Charman, T., Baron-Cohen, S., Swettenham, J., Baird, G., Cox, A., & Drew, A. (2001). Testing joint attention, imitation, and play as infancy precursors to language and theory of mind. Cognitive Development, 15, 481–498. Choi, S., & Bowerman, M. (1991). Learning to express motion events in English and Korean: The influence of language-specific lexicalization patterns. Cognition, 43, 83–121. Corkum, V., Byrne, J. M., & Ellsworth, C. (1995). Clinical assessment of sustained attention in preschoolers. Child Neuropsychology, 1, 3–18. Corrigan, R. (1988). Children's identification of actors and patients in prototypical and nonprototypical sentence types. Cognitive Development, 3, 285-297. Croft, W. (2001). Radical Construction Grammar: syntactic theory in typological perspective. Oxford: Oxford University Press. 31 The Attention-Grammar Interface Csibra, G., Biro, S., Koos, O., & Gergely, G. (2003). One year old infants use teleological representations of actions productively. Cognitive Science, 27, 111–133. Dąbrowska, E. and Street, J. (2006). Individual differences in language attainment: Comprehension of passive sentences by native and non-native English speakers. Language Sciences 28, 604-615. Day, M. C. (1978). Visual search by children: The effect of background variation and the use of visual cues. Journal of Experimental Child Psychology, 25, 1–16. D'Entremont, B., Yazbeck, A., Morgan, A. & MacAulay, S. (2007). Early gaze-following and the understanding of others. In R. Flomm, K. Lee & D. Muir (Eds.), Gaze-following: It's development and significance (pp. 77–94). Mahwah, NJ: Lawrence Erlbaum Associates. Enns. J. T., & Girgus, J. S. (1985). Developmental changes in selective and integrative visual attention. Journal of Experimental Child Psychology, 40, 319-337. Erickson, T. A. and Mattson, M. E. (1981). From words to meaning: A semantic illusion. Journal of Verbal Learning and Verbal Behavior, 20: 540-552. Farroni, T., Csibra, G., Simion, F., & Johnson, M. H. (2002). Eye contact detection in humans from birth. Proceedings of the National Academy of Sciences, 99, 9602–9605. Farroni, T., Johnson, M. H., & Csibra, G. (2004). Mechanisms of eye gaze perception during infancy. Journal of Cognitive Neuroscience, 16, 1320–1326. Farroni, T., Massaccesi, S., Menon, E., & Johnson, M. H. (2007). Direct gaze modulates face recognition in young infants. Cognition, 102, 396–404. 32 The Attention-Grammar Interface Farroni, T., Massaccesi, S., Pividori, D., & Johnson, M. H. (2004). Gaze following in newborns. Infancy, 5, 39–60. Fillmore, C. (1976). Frame Semantics and the Nature of Language. In "Annals of the New York Academy of Sciences: Conference on the Origin and Development of Language and Speech". Vol. 280, 20-32. Friere, A., Eskritt, M., & Lee, K. (2004). Are eyes windows to a deceiver’s soul? Children’s use of another’s eye gaze cues in a deceptive situation. Developmental Psychology, 40, 1093–1104. ergely, ., ekkering, H., & Kir ly, I. (2002). Rational imitation in preverbal infants. Nature, 415, 755. Gibson, E. J., & Yonas, A. (1966). A developmental study of the effects of visual and auditory interference on a visual scanning task. Psychonomic Science, 5, 163–164. Givon, T. 1992. The grammar of referential coherence as mental processing instructions. Linguistics 30. 5–55. Gleitman, L. R., January, D., Nappa, R., & Trueswell J. C. (2007). On the give and take between event apprehension and utterance formulation. Journal of Memory and Language, 57(4), 544–569. Gleitman, L., Cassidy, K., Nappa, R., Papafragou, A., & Trueswell, J. (2005). Hard words. Language Learning and Development, 1(1), 23–64. Goldberg, A. (2006). Constructions at Work: The nature of generalisations in language. Oxford: Oxford University Press. 33 The Attention-Grammar Interface Gordon, P. & J. Chafetz. (1990). Verb-based versus class-based accounts of actionality effects in children’s comprehension of passives. Cognition, 36, 227-254. Griffin, Z. M., & Bock, K. (2000). What the eyes say about speaking. Psychological Science, 11, 274–279. Guttentag, R. E. (1985). Memory and aging: Implications for theories of memory development during childhood. Developmental Review, 5, 56-82. Halliday, M. A. K. (1985). An introduction to functional grammar. London: Edward Arnold. Henderson, J., and F. Ferreira. 2004. The integration o and f language, vision, and action: eye movements and the visual world. New York: Psychology Press. Hood, B. M., Macrae, C. N., Cole-Davies, V., & Dias, M. (2003). Eye remember you: The effects of gaze direction on face recognition in children and adults. Developmental Science, 6, 67–71. Hughes, C. (1998). Executive function in preschoolers: Links with theory of mind and verbal ability. British Journal of Developmental Psychology, 16, 233-253 Huttenlocher, J., Vasilyeva, M. & Shimpi, P. (2004). Syntactic priming in young children. Journal of Memory and Language, 50, 182-195. Hwang, H., and E. Kaiser. (2009). The effects of lexical vs. perceptual primes on sentence production in Korean: an on-line investigation of event apprehension and sentence formulation. Talk presented at the 22nd CUNY conference on sentence processing, Davis, CA. 34 The Attention-Grammar Interface Ibbotson, P. (2010). A Cognitive Constructivist Approach to Early Syntax Acquisition. PhD thesis (unpublished), submitted to the University of Manchester. Ibbotson, P., Theakston, A., Lieven, E., & Tomasello, M. (2011). The Role of Pronoun Frames in Early Comprehension of Transitive Constructions in English. Language, Learning and Development. 7: 24–39. Jackendoff, R. (1983). Semantics and cognition. Cambridge, MA: MIT Press. Kannass, K. N., & Oakes, L. M. (2008). The development of attention and its relations to cognitive skills in infancy and toddlerhood. Journal of Cognition and Development, 9, 222– 246. Keenan, E. (1976). Towards a universal definition of “subject”. C.N. Li (ed.), Subject and topic, 303 – 33. New York: Academic Press. Kobayashi, H., & Kohshima, S. (1997, June 19). Unique morphology of the human eye. Nature, 387, 767–768. Lakoff, G. (1987). Cognitive models and prototype theory. In U. Neisser (Ed.), Concepts and conceptual development: Ecological and intellectual factors in categorization. Emory symposia in cognition (Vol. 1, pp. 63–100). Cambridge, England: Cambridge University Press. Lakusta, L, Wagner, L., O’Hearn, K., & Landau, . (2007) Conceptual foundations of spatial language: Evidence for a goal bias in infants. Language Learning and Development, 3(3), 179–197. Lakusta, L., & Landau, B. (2005). Starting at the end: The importance of goals in spatial language. Cognition, 96, 1–33. 35 The Attention-Grammar Interface Landau, ., and R. Jackendoff. 1993. ‘‘What’’ and ‘‘where’’ in spatial language and spatial cognition. Behavioral and Brain Sciences 16. 217–65. Langacker, R.W. (1991). Foundations of cognitive grammar: Vol. 2–Descriptive application. Stanford: Stanford University Press. Langacker, Ronald (1987). Foundations of cognitive grammar: Theoretical prerequisites (vol. 1). Stanford: Stanford University Press. Langacker, Ronald 1991. Foundations of Cognitive Grammar, vol. 2.: Descriptive Application. Stanford: Stanford University Press. Langacker, Ronald W. (1988). Autonomy, agreement, and cognitive grammar. Agreement in Grammatical Theory, ed. by Diane Brentari, Gary Larson, and Lynn MacLeod, 147-180. Chicago: Chicago Linguistic Society. Lee, K., Eskritt, M., Symons, L. A., & Muir, D. (1998). Children’s use of eye gaze for “mind reading”. Developmental Psychology, 34, 525–539. MacWhinney, B. (1977). Starting points. Language, 53, 152-168. MacWhinney, B. 1977. Starting points. Language 53(1). 152–68. Morales, M., Mundy, P., & Rojas, J. (1998). Following the direction of gaze and language development in 6-month-olds. Infant Behavior and Development, 21, 373–377. Morales, M., Mundy, P., Delgado, C. E. F., Yale, M., Neal, A. R., & Schwartz, H. K. (2000). Gaze following, temperament, and language development in 6-month-olds: A replication and extension. Infant Behavior and Development, 23, 231–236. 36 The Attention-Grammar Interface Mumme, D., Bushnell, E., DiCorcia, J. & Lariviere, L. (2007). Infants' use of gaze cues to interpret others' actions and emotional reactions. In R. Flomm, K. Lee & D. Muir (Eds.), Gaze-following: It's development and significance (pp. 143–170). Mahwah, NJ: Lawrence Erlbaum Associates Myachykov, A, and R. S. Tomlin. (2008). Perceptual priming and structural choice in Russian sentence production. Journal of Cognitive Science 6(1). 31–48. Myachykov, A, S. Garrod, and C. Scheepers. (2010). Perceptual priming of structural choice during English and Finnish sentence production. Language & Cognition: state of the art, ed. by R. K. Mishra and N. Srinivasan,54–72. Munich: Lincom Europa. Myachykov, A., Thompson, D., Scheepers, C., & Garrod, S. (2011). Visual attention and structural choice in sentence production across languages. Language and Linguistic Compass, 5(2), 95-107. DOI: 10.1111/j.1749-818X.2010.00265.x Nappa, R., Wessell, A., McEldoon, K.L., Gleitman, L.R. & Trueswell, J.C. (2009). Use of Speaker’s aze and Syntax in Verb Learning. Language Learning and Development, 5(4), 203-234. Osgood, C. E., and ock, J. K. (1977). “Salience and sentencing: some production principles,” in Sentence Production: Developments in Research and Theory, ed. S. Rosenberg (Hills- dale, NJ: Erlbaum), 89–140. Papafragou, A., Cassidy, K., & Gleitman, L. (2007). When we think about thinking: The acquisition of belief verbs. Cognition, 105, 125–165. Pellicano, E., & Rhodes, G. (2003). The role of eye-gaze in understanding other minds. British Journal of Developmental Psychology, 21, 33–43. 37 The Attention-Grammar Interface Perrett, D. I., & Mistlin, A. J. (1991). Perception of facial characteristics by monkeys. In W. C. Stebbins & M. A. Berkley (Eds.), Comparative perception (pp. 187–215). New York: Wiley. Pick, A. D., and Frankel, G. W. (1974) A developmental study of strategies of visual selectivity. Child Dev, 45: 1162-1165. Pinker, S., Lebeaux, D. and Frost, L. (1987). Productivity and constraints in the acquisition of the passive. Cognition 26. 267. Posner, M. I. 1980. Orienting of attention. Quarterly Journal of Experimental Psychology 32. 3–25. Poulin-Dubois, P., & Forbes, J. (2002). Toddlers’ attention to intentions-in-action in learning novel action words. Developmental Psychology, 38, 104–114. Prentice, J. L. 1967. Effects of cuing actor vs. cuing object on word order in sentence production. Psychonomic Science 8. 163–4. Pye, C. & Poz, P. . (1988). Precocious passives (and antipassives) in uich Mayan. Papers and Reports on Child Language Development 27. 71-80. Radford, A. et al. (1999) Linguistics: an introduction. Cambridge:CUP. Rispoli, M. (1991). The mosaic acquisition of grammatical relations. Journal of Child Language, 18, 517-551. Ruff, H. A., & Rothbart, M. K. (1996). Attention in early development: Themes and variations. New York: Oxford University Press. 38 The Attention-Grammar Interface Salomo, D., Lieven, E., & Tomasello, M. (2010). Young children's sensitivity to new and given information when answering predicate-focus questions. Applied Psycholinguistics, 31(1), 101-115. Striano, T., & Reid, V. M. (2006). Social cognition in the first year. Trends in Cognitive Sciences, 10, 471–476. Suzman, S. (1985). Learning the passive in Zulu. Papers and Reports on Child Language Development 24. 131-37. Symons, L. A., Hains, S. M. J., & Muir, D. W. (1998). Look at me: Five-month-old infants’ sensitivity to very small deviations in eye-gaze during social interactions. Infant Behavior and Development, 21, 531–536. Talmy, L. (1988). Force dynamics in language and cognition. Cognitive Science, 12, 49– 100. Talmy, L. (2000). Toward a cognitive semantics. Cambridge, MA: MIT Press. Talmy, Leonard. (1985). Lexicalization patterns: semantic structure in lexical forms. In T. Shopen (ed.), Language typology and syntactic description III, 57-149. New York: Cambridge University Press. Talmy,L. (2007). Attention phenomena. In Dirk Geeraerts & Hubert Cuyckens (eds.). Oxford Handbook of Cognitive Linguistics Oxford University Press, 264-293. Taylor, J. (2002). Cognitive grammar. Oxford, England: Oxford University Press. Tomasello, M. (2003). Constructing a Language: A Usage-Based Theory of Language Acquisition. Harvard University Press. 39 The Attention-Grammar Interface Tomasello, M., & Barton, M. (1994). Learning words in nonostensive contexts. Developmental Psychology, 30, 639–650. Tomasello, M., & Ferrar M. J. (1986). Joint attention and early language. Child Development, 57, 1454–1463. Tomlin, R. S. (1995). Focal attention, voice, and word order: an experimental, crosslinguistic study. In P. Downing & M. Noonan (Eds.), Word order in discourse (pp. 517-554). Amsterdam: John Benjamins. Tomlin, R. S. (1997). Mapping conceptual representations into linguistic representations: The role of attention in grammar. In J. Nuyts & E. Pederson (Eds.), Language and conceptualization (pp. 162-189). Cambridge: Cambridge University Press. Traxler, Matthew. 2011. Introduction to Psycholinguistics: Understanding Language Science. Wiley-Blackwell: Chicester, UK. Trueswell, J. C., and M. K. Tanenhaus. 2005. Approaches to studying world-situated language use. Cambridge, MA: The MIT Press. Turner, E. A., and R. Rommetveit. 1968. Focus of attention in recall of active and passive sentences. Journal of Verbal Learning and Verbal Behavior 7(2). 543–8. Well, A.D., Larch, E.P., & Anderson, D.R. (1980). Developmental trends in distractibility: Is absolute or proportional decrement the appropriate measure of interference? Journal of Experimental Child Psychology, 30, 109-124. Woodward, A. L. (1998). Infants selectively encode the goal object of an actor’s reach. Cognition, 69, 1–34. 40 The Attention-Grammar Interface Woodward, A. L., & Sommerville, J. A. (2000). Twelve-month-old infants interpret action in context. Psychological Science, 11, 73–77. Yu, C. & Ballard, D. (2007). A unified model of early word learning: Integrating statistical and social cues. Neurocomputing 70, 2149-2165. 41 The Attention-Grammar Interface Acknowledgements Thank you to St Vincent’s Infant School, Freshfields Nursery and all the children who took part in this study; to Anna Coates and Mickie Glover for also assisting in this experiment; to Andriy Myachykov, Ewa Dabrowska and one anonymous reviewer who provided helpful comments on the manuscript. This research was supported by a postdoctoral fellowship from the Max Planck Institute for Evolutionary Anthropology, Leipzig. 42
© Copyright 2026 Paperzz