Artificial grammar learning of shape-based noun classification Jennifer Culbertson ([email protected]) Linguistics Program, 4400 University Drive Fairfax, VA 22030 USA Colin Wilson ([email protected]) Cognitive Science Department, 3400 N. Charles Street Baltimore, MD 21218 USA Abstract Similarly, in the classifier system of Navajo (Mithun, 1986) nouns are classified according to animacy and shape (among other properties); class marking in this language is found on the verb. Signed languages also commonly have noun classification systems based on shape and other functional properties (Supalla, 1986). Systems of noun classification serve to categorize entities based on a set of semantic and/or phonological features. Previous work, for the most part focused on gender-based classes, has suggested that learners acquiring such systems rely primarily on phonological cues, while semantic cues are used only weakly. We show, using an artificial language learning task with adults, that semantic information alone is sufficient to learn a realistic shape-based classification system, challenging the view of phonology bias. Further, our results show that compared to learners exposed to semantically cohesive categories, learners trained on randomly assigned classes are less successful at recalling the category of exposure items. This finding suggests that, contrary to memory-based theories of learning, categories are not necessarily formed by abstraction from memorized exemplars, but can instead be constructed from lower-level properties that category members share. Keywords: classifiers; noun classes; language acquisition; artificial language learning; semantic features Acquisition of noun classification systems Introduction Systems of noun classification—such as gender, noun class, and classifier systems—distinguish or categorize objects according to salient semantic and/or phonological features. Though such systems may differ in their formal realization, the semantic features on which they are based draw from a common pool that includes physical features (e.g. shape, size), function (e.g. food, tool, habitation), as well as animacy and sociocultural status (Denny, 1976; Dixon, 1986; Lakoff, 1987; Comrie, 1989; Aikhenvald, 2000; Senft, 2000). For example, in Cantonese, the use of a classifier morpheme is required in constructions involving a numerical or definite noun phrase, as in example (1) below.1 The choice of classifier in Cantonese is largely determined by the head noun; for example the classifier go[3] is used for people, while the classifier zek[3] is used primarily with animals. Additional classifiers target shape properties like length, dimension, and flexibility. (1) a. sam[1] go[3] jan[4] three CL people ‘three people’ b. sam[1] zek[3] gau[2] three CL dogs ‘three dogs’ 1 Although English does not use them productively, there are nevertheless a number of nouns which can appear with a classifier, e.g. “four strands of hair”, “two sheets of paper”, “a school of fish”. Previous work on the acquisition of systems of noun classification has largely focused on genders and noun classes. Such studies have documented developmental stages including a period of phonological underspecification, and overgeneralization of frequent or default marking, and have highlighted the apparently weak role of semantic (as opposed to phonological or distributional) information (KarmiloffSmith, 1981; Perez-Pereira, 1991; Demuth & Ellis, 2008; Mariscal, 2009; Gagliardi, 2012). The acquisition of classifier systems, although perhaps less well-studied, indicates a similar developmental trajectory. For example, Tse, Li, and Leung (2007) report that Cantonese-speaking children (3;0–5;0) tend to show early use of classifiers in required contexts but are not adult-like in their choice of classifier until quite late. In particular, children tend to over-use the classifier go3—used for people, but also sometimes referred to as a ‘general’ classifier (C. Li & Thompson, 1989)—and to over-generalize other more frequent classifiers. Although P. Li, Huang, and Hsiao (2010) show that Mandarin-speaking children generalize classifiers to novel nouns on the basis of shape features, Tsang and Chambers (2011) argue that adult speakers of Cantonese tend to rely on cues other than the semantic features of the nouns when processing classifiers. In this paper we investigate the extent to which adult learners can use semantic information alone to acquire category distinctions instantiated in a miniature classifier system. Previous work on artificial language learning suggests that, although the population of most interest may be children within any sensitive period for language acquisition, behavioral patterns exhibited by adults can shed light on both general and language-related learning mechanisms (Wilson, 2006; Culbertson, Smolensky, & Legendre, 2012; Finley & Badecker, 2010). The motivation for using an artificial language learning task rather than natural language learning data in this case comes from our hypothesis of why it has been found that phonological cues—even when these are less statistically reliable than semantic properties—are preferentially used by 2118 learners acquiring noun classification systems (Braine, 1987; Frigo & McDonald, 1998; Gagliardi, 2012). It seems likely that children process a great deal of phonological information about dependencies between nouns and nominal modifiers (such as gender-marked determiners or classifiers) before they acquire the meanings of these elements (Polinsky & Jackson, 1999). In some sense, then, it is unsurprising that children privilege phonological information at first during language development. Adults may continue to privilege phonological cues, not because they fail to attend to semantics, but simply because their knowledge of noun classes was initially based in phonological processing. Here, crucially, we use adult English-speakers and construct a miniature language from known objects and their linguistic labels. This removes the problem of acquiring the semantics of nouns and, if our hypothesis is correct, should expose an ability to learn cohesive noun categories on the basis of semantic features alone. While some previous work has suggested that adults can use semantic information to learn classification systems in an artificial language, these studies have exclusively focused on gender-based noun classes (Braine, 1987; Brooks, Braine, Catalano, Brody, & Sudhalter, 1993). Here we target instead shape-based classifiers, which are likely to be less familiar to English-speaking college students (the population typically targeted). The system is modeled on Cantonese (sortal) classifiers, in particular those which pick out shape properties of objects. As mentioned above, the particular shape properties indicated by Cantonese classifiers—related to the length, flexibility, and dimensions of objects—are representative of those found in classifier systems typologically (Craig, 1986; Dixon, 1986; Comrie, 1989). Table 1 shows the two Cantonese classifiers, along with the semantic features with which they are associated, on which our system was modeled. The examples provided represent nouns which take the relevant classifier in Cantonese, and are also nouns actually used in the task. random-assignment condition was used to establish an experimental baseline against which performance in the shapebased condition can be compared, and in particular to assess the role of memory for individual category members in this task. Exemplar-based models of learning argue that category formation begins with a set of memorized exemplars, abstract categories emerging later due to, e.g., computation of featural similarity among exemplars in a given category (Nosofsky, 1986). This predicts that learners exposed to conditioned and random classifier categories should perform equally well when tested on familiar items—in both cases, the set of exemplars presented during exposure should be stored—but should of course differ on their ability to generalize to novel items. Participants Participants were 20 native English-speaking undergraduates from the Johns Hopkins University. They received a small amount of course credit or extra credit for their participation. No subjects reported difficulties hearing or seeing the stimuli. Materials The miniature language was comprised of the English numeral words “one” and “two”, two nonce classifier morphemes “ka” and “po”, and 96 English nouns representing familiar objects. Utterances in the language consisted of a numeral word directly followed by a classifier morpheme, and a noun, as in example (2) below. Utterances were auditorily— using mac text-to-speech, speaker “Alex”—and orthographically presented and were accompanied by a visual image. The image was a single object for numeral “one” or two of the same objects for numeral “two”. (2) b. two-po towel two-CL towel Table 1: Shape-based classifiers tested Classifier Semantic features Examples zi[4] jeung[4] rigid, narrow, long broad, flat, flexible knife, twig, candle sheet, card, table a. one-ka hammer one-CL hammer ‘two towels’ Experiment 1 In Experiment 1, we tested whether adults could learn and generalize categories of nouns, distinguished by their use of the classifiers in Table 1. We compare learning of a system in which classifier use is conditioned on shape-based semantic properties of nouns to learning a random assignment of nouns to classifier categories. We hypothesized that if learners perceive and make use of semantic information in acquiring noun classification systems, they should succeed in learning the semantically-conditioned language. The Figure 1: Example trial 2119 1.0 0.2 0.4 0.6 0.8 seen novel 0.0 Participants were seated in front of a computer, and were instructed that the task was about learning a language similar to English but with two ways of saying the words “one” and “two”. They then listened to examples of “one-po”, “one-ka” and “two-po”, “two-ka”. This was followed by 48 familiarization trials, half with objects using the classifier “ka” and half using “po”. Half of the trials featured a single object and the other half two objects. On each trial, a visual image appeared with four choices below it, one for each possible numeral-classifier combination followed by the object noun pictured. Participants listened to the auditory stimulus and were required to click the choice which matched what they heard. Figure 1 shows an example trial. After familiarization, participants took a brief break, and were then instructed that they would see a visual image and four choices below it, as in the familiarization phase, but they would hear no audio. Instead they were required to choose the phrase they thought was most likely to be used in the language. This testing phase was made up of 96 trials, including all the objects seen during familarization, and 48 novel objects. The seen objects were the same as those seen in the familiarization phase, but appeared with the other numeral (e.g. if a participant heard “one-ka hammer” and saw a single hammer during exposure, they saw two hammers at test). No feedback was given. Participants were randomly assigned to one of two conditions. In the shape condition, the use of “ka” and “po” was conditioned on the semantic properties shown in Table 1 above. The object nouns in each class were a subset of those which actually use the corresponding classifier in Cantonese. As such, although they generally exhibited the relevant properties, there was some amount of variation in the extent to which they did so. For example, the noun “table” takes the classifier jeung[4] in Cantonese even though it does not perfectly exemplify the semantic features of the class. In the random condition, the use of “ka” and “po” was unconditioned, and nouns were randomly paired with a particular classifier. dom effects) reveals a significant effect of condition (β = 1.47, z = 5.32, p < 0.01), with participants in the shape condition choosing the correct classifier on seen items much more often than those in the random condition (0.86 vs. 0.45). A significant interaction between condition and number was also found (β = −0.29, z = −2.63, p < 0.01), indicating the participants in the random condition tended to be less accurate on items with the number “two” compared to “one”. We are also interesting in the extent to which participants in the shape condition could generalize the categorization information they learned during familiarization to novel (unseen) objects at test. As Figure 2 suggests, there was little difference in participants’ choice of the correct classifier on seen item, and their choice of the classifier which matched the relevant semantic features on novel nouns. Analysis using mixed-effects logistic regression revealed no significant effect of item familiarity (β = 0.27, z = 1.13, p = 0.26). A significant interaction between item familiarity and number was found however (β = −0.47, z = −1.98, p < 0.05), indicating the participants tended to be less accurate on seen items with the number “one” compared to “two”. Note that for participants in the random condition, there is no expected correct classifier for novel items, as the noun categories used in familiarization were random, containing no semantic cues. Correct response Design & Procedure Shape Random Condition Results In analyzing the results of this experiment we were interested in two main questions: (i) Do learners in the shape condition—in which classifier choice is determined by semantic features of nouns—succeed in learning and generalizing the correct categories? (ii) Are the categories learned those which were intended, namely the shape-based categories shown in Table 1? To address the first question, we compared first the performance on seen items across the two conditions. Performance on seen items gives an indication of how well the familiarization set was learned by a given participant. The light colored bars in Figure 2 shows proportion choice of the correct classifier on average for participants in each condition. Analysis of this data using mixedeffects logistic regression (with participants and items as ran- Figure 2: Correct choice of classifier for seen and novel nouns in the shape condition, and seen items in the random condition (NB: there is no correct choice for novel nouns in the random condition, since nouns were categorized randomly). If participants in the shape condition in fact consistently inferred the same set of shape-based categories, we expect to see that their responses on novel test items are highly correlated. On the other hand, participants in the random condition were not expected to infer cohesive categories, and thus we do not expect correlated responses. To assess this, for each pair of participants in the shape condition, we computed the proportion of novel test items that they assigned to the same category. The average agreement proportion for this condi- 2120 1.0 Shape Random Condition Participants Participants were 24 native English-speaking workers recruited through Amazon Mechanical Turk. They received $1.00 for their participation in the study. Figure 3: Correct choice of classifier for seen and novel nouns in the shape condition, and seen items in the random condition in Experiment 2 (NB: there is no correct choice for novel nouns in the random condition). Materials The materials were the same as those used in Experiment 1, and participants were again randomly assigned to either the shape condition or the random condition. Results The results of Experiment 2 replicate the major findings of Experiment 1, as shown in Figure 3. Analysis of this data reveals a significant effect of condition (β = 0.91, z = 3.77, p < 0.01), with participants in the shape condition choosing the correct classifier on seen items much more often than those in the random condition (0.82 vs. 0.55). A significant interaction between condition and number was also found (β = 0.17, z = 2.08, p < 0.05), indicating that the participants in the random condition tended to be less accurate on items with the number “one” compared to “two”. This interaction is in the opposite direction as what was found in Experiment 1, suggesting that the effect of number may not be reliable. In terms of generalization to novel items, participants in the shape condition again show a relatively modest but significant increase in accuracy of classifier choice for seen items in comparison to novel items (β = 0.38, z = 2.08, p < 0.05). No other significant effects were observed, again suggesting that differences in performance driven by number in Experiment 1 may not be reliable. As in Experiment 1, for each pair of participants in a given condition we computed the proportion of novel test items that were assigned to the same category. Average agreement was above chance for the shape condition (0.65, SE = 0.04), but note that this represents a lower level of agreement than that found in Experiment 1 for the same condition. Just as in Experiment 1, average agreement for the random condition was at the expected chance level (0.50, SE = 0.02). In the experiments reported above, we exposed adult Englishspeakers to a miniature artificial noun classification system. 0.6 0.0 0.2 In Experiment 2, we sought to replicate our findings in a more diverse population, namely workers on Amazon Mechanical Turk (a service pairing workers with tasks over the internet). This population includes a range of ages and socio-economic backgrounds that may be more representative of the population at large (Mason & Suri, 2012). In addition, this experiment serves to add to the growing body of linguistic and cognitive research using Mechanical Turk. 0.4 Correct response Experiment 2 Discussion seen novel 0.8 tion was high (0.74, SE = 0.04). In contrast, a parallel analysis revealed much lower agreement among participants in the random condition (0.50, SE = 0.02); note that 50% agreement would be expected from purely random responding. In order to investigate the role of semantic features of nouns in the acquisition of classification systems, we used English words, removing an obstacle present in natural language learning. Child language learners likely go through a stage of development in which phonological but not semantic information is available for the acquisition of noun classification and other grammatical features. The results of our experiments indicate that, when exposed to a realistic classification system (based on two Cantonese sortal classifiers) over known nouns, participants are able to learn the correct categories based on semantic information alone, and can readily generalize this information to new nouns. Learning did not extend to participants exposed to randomly generated noun categories which lacked supporting semantic cues. Our findings were robust in both a population of college students, and among the more diverse population found on Amazon Mechanical Turk—despite a relatively small sample size. This finding suggests that semantic features of nouns can be quickly used by learners as the basis of a classification system, calling into question the apparently privileged role of phonology cues argued to hold in previous work on this topic (Karmiloff-Smith, 1981; Perez-Pereira, 1991; Tsang & Chambers, 2011; Gagliardi, 2012). While here we have shown that semantically based noun classification can be learned in the absence of phonological cues, in future work we will ask whether phonological information is nevertheless used preferentially over semantic information when both are simultaneously accessible. We believe our results are also relevant to understanding the initial stages of category formation. In particular, the dramatic difference in performance for seen items—items which were part of a participant’s exposure set—between the two conditions calls into question theories of learning in which 2121 categories are formed by abstraction over a set of stored exemplars (Nosofsky, 1986) (see also (Rouder & Ratcliff, 2004) for relevant discussion and detailed model comparison). Under such a view, the prediction would be that learners should store the set of exemplars presented during familiarization regardless of whether the particular classifier-noun pairings are random or semantically conditioned. It would then remain unexplained why participants in the random condition fail to use the stored pairings to perform with high accuracy on seen items at test. Our participants succeeded at remembering (or reconstructing) particular examples only when those conformed to a more abstract generalization across items. Acknowledgments The authors would like to acknowledge Kelly Glazer and Allison Widen for assistance in constructing the experimental materials and running participants. References Aikhenvald, A. (2000). Classifiers: A typology of noun categorization devices. New York, NY: Oxford University Press. Braine, M. (1987). What is learned in acquiring word classes: A step toward an acquisition theory. In B. MacWhinney (Ed.), Mechanisms of language acquisition (pp. 65–87). Hillsdale, NJ: Lawrence Erlbaum Associates. Brooks, P. J., Braine, M. D. S., Catalano, L., Brody, R. E., & Sudhalter, V. (1993). Acquisition of gender-like noun subclasses in an artificial language: The contribution of phonological markers to learning. Journal of Memory and Language, 32(1), 76 - 95. Comrie, B. (1989). Language universals and linguistic typology: Syntax and morphology. Chicago, IL: University of Chicago press. Craig, C. (1986). Noun classes and categorization: proceedings of a symposium on categorization and noun classification, Eugene, Oregon, october 1983. Philadelphia, PA: John Benjamins. Culbertson, J., Smolensky, P., & Legendre, G. (2012). Learning biases predict a word order universal. Cognition, 122, 306-329. Demuth, K., & Ellis, D. (2008). Revisiting the acquisition of Sesotho noun class prefixes. Mahwah, NJ: Lawrence Erlbaum. Denny, J. (1976). What are noun classifiers good for? In Papers from the 12th regional meeting, chicago linguistic society (pp. 122–132). Dixon, R. (1986). Noun classes and noun classification in typological perspective. Noun classes and categorization, 105–112. Finley, S., & Badecker, W. (2010). Linguistic and nonlinguistic influences on learning biases for vowel harmony. In S. Ohlsson & R. Catrambone (Eds.), Proceedings of the 32nd annual conference of the Cognitive Science Society (p. 706-711). Austin, TX: Cognitive Science Society. Frigo, L., & McDonald, J. (1998). Properties of phonological markers that affect the acquisition of gender-like subclasses. Journal of Memory and Language, 39(2), 218– 245. Gagliardi, A. (2012). Input and intake in language acquisition. Unpublished doctoral dissertation, PhD dissertation, University of Maryland. Karmiloff-Smith, A. (1981). A functional approach to child language: A study of determiners and reference (Vol. 24). New York, NY: Cambridge University Press. Lakoff, G. (1987). Women, fire and dangerous things–what categories reveal about the mind. Chicago, IL: University of Chicago Press. Li, C., & Thompson, S. (1989). Mandarin Chinese: A functional reference grammar. Los Angeles, CA: University of California press. Li, P., Huang, B., & Hsiao, Y. (2010). Learning that classifiers count: Mandarin-speaking children’s acquisition of sortal and mensural classifiers. Journal of East Asian Linguistics, 19(3), 207–230. Mariscal, S. (2009). Early acquisition of gender agreement in the Spanish noun phrase: starting small. Journal of Child Language, 36(1), 143–171. Mason, W., & Suri, S. (2012). Conducting behavioral research on Amazon’s Mechanical Turk. Behavior Research Methods, 44(1), 1–23. Mithun, M. (1986). The convergence of noun classification systems. Noun classes and categorization, 379–397. Nosofsky, R. (1986). Attention, similarity, and the identification–categorization relationship. Journal of Experimental Psychology: General; Journal of Experimental Psychology: General, 115(1), 39. Perez-Pereira, M. (1991). The acquisition of gender: What Spanish children tell us. Journal of Child Language, 18(03), 571–590. Polinsky, M., & Jackson, D. (1999). Noun classes: Language change and learning. In B. A. Fox, D. Jurafsky, & L. Michaelis (Eds.), Cognition and function in language (pp. 29–50). Stanford, CA: CSLI. Rouder, J., & Ratcliff, R. (2004). Comparing categorization models. Journal of Experimental Psychology: General, 133(1), 63. Senft, G. (2000). Systems of nominal classification. Cambridge: Cambridge University Press. Supalla, T. (1986). The classifier system in American sign language. In C. Craig (Ed.), Noun classes and categorization (pp. 181–214). Philadelphia, PA: John Benjamins. Tsang, C., & Chambers, C. (2011). Appearances aren’t everything: Shape classifiers and referential processing in Cantonese. Journal of Experimental Psychology: Learning, Memory, and Cognition, 37(5), 1065. Tse, S., Li, H., & Leung, S. (2007). The acquisition of Cantonese classifiers by preschool children in Hong Kong. Journal of child language, 34(3), 495. Wilson, C. (2006). An experimental and computational study 2122 of velar palatalization. Cognitive Science, 30(5), 945-982. 2123
© Copyright 2026 Paperzz