Faculty of Psychology and Educational Sciences Second Master Theoretical and Experimental Psychology Master thesis PROCESSING NEUTRAL WORDS, NEGATIVE WORDS AND TABOO WORDS: AN ERP STUDY Anneleen Mortier Masterthesis Supervisor: dr. Elisah D’Hooge Promotor: Prof. dr. Robert Hartsuiker Department of Experimental Psychology 2012-2013 Aknowledgments First of all, I would give a BIG thanks to my supervisor, dr. Elisah D’Hooge. She taught me to set up ERP experiments, administer and analyse them. Further, she was a big help for my thesis, and also for other matters. Next, I would like to thank my promotor, Professor dr. Robert Hartsuiker, for his guidance and support. I also would like to thank the other students from 2nd master Experimental Psychology. Of course, thanks to my parents, for supporting me those five years and for believing in me. As last, I want to thank Martijn for his support and for keeping up with me when things were not going in the right direction. PROCESSING NEUTRAL WORDS, NEGATIVE WORDS AND TABOO WORDS: AN ERP STUDY Abstract Bilinguals have reported that they have a stronger emotional experience in their native language than in their second language. Behavioural studies are inconclusive about this question. However, studies that used skin conductance levels as a dependent variable are clearer about this effect: participants elicit a stronger skin conductance response on emotional and taboo words in the native language compared to the second language (e.g. Harris, Ayçiçegi and Gleason, 2003). Moreover, this skin conductance response was higher for taboo words than for negative words, indicating that taboo words are a special type of emotional words. This study tried to investigate the difference in emotional experience between the native language and the second language with a more direct and online measurement: event related potentials. It consisted of two experiments. The first experiment used neutral and negative words in the first language and the second language, the second experiment used neutral and taboo words in the first and second language. Both experiments used Dutch-English bilinguals that performed a delayed naming task. The results of the event related potentials showed that negative and taboo words elicited a P300 followed by a late positive potential. These components were attenuated when neutral words were being presented. Moreover, the difference in amplitude between negative/taboo words and neutral words was more pronounced in the native language. These findings are in line with our hypothesis. The results are discussed with reference to previous findings on bilingual emotion experience. Keywords: Bilingualism, Emotion, Taboo, ERP, P300, LPP Contents Taboo and negative words . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Qualitative Research . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 Quantitative Research . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 Bilinguals and physiological studies . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Present study 9 Experiment 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 Results and discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 Experiment 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19 Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19 Results and discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 General Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 Appendix A 37 Appendix B 38 Appendix C 39 Appendix D 40 Appendix E 49 Appendix F 50 Appendix G 51 1 Maria (L1: Spanish, L2: English): “I never swear in Spanish. I simply cannot. The words are too heavy and are truly a taboo for me.” (Dewaele, 2004b:95). The above quote illustrates how most of the bilinguals feel about their two (or more) languages: they experience greater emotional intensity when using their native language (L1) than when using their second language (L2) or other languages (LX). This intuition has been investigated by several different paradigms and different stimuli. Various studies have shown the importance of including different types of stimuli; since they can result in different outcomes (e.g., Guillet & Arndt (2009) and Jay, Caldwell-Harris, & King, 2008). In this paper, we will first discuss the different findings between taboo and negative words in bilingual research. Then, we will give a short overview of the qualitative research in this domain; next we will discuss previous quantitative research. Furthermore, we will present a short overview about the different physiological studies that have been conducted in this field. Finally, we proceed to our own investigation. Taboo and negative words Taboo words are a special category of negative words. They include terms referring to bodily products, ethnic and racial insults, profanity, slang, vulgarity, body parts and sexual acts (Jay, 1992, 2000). Several studies have found their influence on different types of tasks such as memory tasks (e.g., Guillet, & Arndt, 2009; Jay, et al., 2008), rapid serial representation task (e.g., Mathewson, Arnell & Mansfield, 2008), lexical decision (Zeelenberg, Bocanegra & Pecher, 2011) and language production (Dhooge, De Baene, & Hartsuiker, submitted; Severens, Janssens, Kûhn, Brass & Hartsuiker, 2011). Additionally, various studies have shown the effect of taboo words in different domains: e.g., enhanced attention (e.g., Eilola, Havelka & Sharma, 2007; Mathewson et al., 2008), superior recall (e.g., Ayçiçegi & Harris, 2004) and heightened skin conductance responses (e.g,. Harris, Ayçiçegi & Gleason, 2003). Furthermore, there is evidence that taboo words may involve in a specialized neural mechanism: patients with aphasia are still able to swear (Van Lancker & Cummings, 1999). On the contrary, inability to swear has also been reported when the right hemispheric basal ganglia is impaired (Speedie, Wertman, Ta’ir & Heilman, 1993). It has been suggested that taboo words are a special type of emotional words. Several 2 studies have confirmed this suggestion. Jay et al. (2008) have administered several memory tasks. They found that participants were better at recalling taboo words than negative words, even when they had to report those taboo words as last. Guillet & Arndt (2009) also found that participants had a better memory for taboo words than for negative words. However, they used a different task than Jay et al. (2008). Participants had to remember certain words (central information) in the presence of a distracter (peripheral information). This distracter could be either taboo or negative. They found that the central information was remembered better when the distracter was a taboo word, not when it was a negative word. In a study of Mathewson et al., (2008), a single-target rapid serial visual representation task (RSVP) was used. Their results showed that the taboo words received more attention than negative words. Thus, it seems that taboo words evoke a different response than negative words. As a result, when a study investigates the emotionality in different languages (or just in a single language), it is not sufficient to only use emotional words, taboo words should be included as well. Qualitative Research The difference in emotional intensity between L1 and L2 has already been reported in early clinical studies. Buxbaum (1949) reported that L1 could help to remember lost memories, in comparison to L2. If memories are retrieved in L2, they are colourless and lifeless. Additionally, when patients were talking about negative memories, they used their L2 more often, but when they recalled warm and happy memories, they would use their L1 (Javier, 1996. For an overview: Schrauf, 2000). In contrast with this, some bilinguals can experience detachment from their L1 when they underwent negative experiences (e.g., German with World War II). Further, when people are more attracted to their ‘new’ language, they can experience this language as more emotional as their L1. This indicates that we cannot simply conclude that L1 is more emotional than L2: other factors can still influence this process. Dewaele (2004a, 2004b and 2008) tried to investigate the difference in emotional intensity between the two languages in a more systematically way. In his series of studies, he administered questionnaires from multilinguals, to investigate the perception of their expression of taboo words and swearwords (2004a, 2004b) and their expression of love (2008). The studies in 2004 showed that multilinguals found the emotional force of the taboo words and swearwords were stronger in their L1. This effect diminished for languages that are learned at a later time. Additionally, 3 multilinguals tend to swear in their dominant language. This dominant language is determined by age of acquisition, frequency of use and the learned setting (naturalistic, (e.g.,: parents), or instrumentally, (e.g.,: classroom)). The dominant language is most of the time L1: participants found that L1 has more force to express themselves. Also, they felt more emotionally involved when using L1. Others said that they could not swear in their native language because the emotional weight is too heavy or the words are too taboo. Some participants also mentioned that they could not express their emotional feelings in another language than the native language (i.e., LX), because they cannot find the emotional load of that language. Additionally, multilinguals might swear in their other languages when they want to soften the force that those words are carrying or when they do not want to be associated with their expression. In his study in 2008, Dewaele investigated whether the emotional intensity of the emotion ‘love’ depended on language. He again administered an online questionnaire about experiencing love to a sample of multilinguals. They had to answer open and closed questions about the perceived emotional weight of ‘I love you’ in the different languages the participants spoke. He found that the majority of the respondents found this phrase to have the strongest emotional value in their L1. However, this effect was dependent on certain other variables: age of acquisition, self-perceived language dominance and context of the acquisition of the L2. Participants felt that the emotional weight is the strongest in their L1 and that no other language could match this feeling. Thus, multilinguals prefer to express one of the most universal emotions in their L1. The findings by Dewaele (2004a, 2004b, 2008), Buxbaum (1949) and Javier (1996) seem to suggest that L1 is a personal language with emotional affiliation and is used to express feelings. In contrast with this, L2 is a more emotionally distant language and is used when we do not want to be associated with that expression (Pavlenko, 2002). However, the results of Dewaele (2004a, 2004b, 2008) were based upon self-report, which can be inaccurate. The emotional processes that are associated to the different languages are not necessarily consciously accessible to the speaker (Eilola, 2009). Also, clinical case studies are not representative for the entire population. Hence, the next section discusses studies that explored the emotional difference between L1 and L2 in an experimental way. Quantitative Research Studies with monolinguals have shown that we have a better and more detailed memory for negative words (e.g. Kensinger & Corkin, 2003) and for taboo words (e.g. Grosser & 4 Walsch, 1966; Hadley & MacKay, 2006; Jay et al., 2008; MacKay et al., 2004, among others). Furthermore, as the above studies suggest, the recall can be influenced by the kind of language we use. The study of Anooshian and Hertel (1994) is one of the first who investigated this effect in bilinguals. They asked late Spanish-English bilinguals to rate emotional and neutral words presented in English or Spanish. The presentation of these words was blocked for each language. The experiment was followed by a surprise recall task. Anooshian and Hertel found that their participants were better at recalling emotional words, but only when those words were presented in their L1. The authors suggested that when we learn an L2 at a later time, this wouldn’t be associated with emotional experiences and because of this, will be less emotionally intense. Ayçiçegi and Harris (2004) tried to replicate and extend the effect that Anooshian and Hertel (1994) found. They included reprimands and taboo words in their stimulus set. Additionally, they altered the design. The presentation of the words in each language was mixed for each block. In contrast to the results of what they hypothesized (and thus in contrast with the results of Anooshian and Hertel), they found the opposite result with late Turkish-English participants (learned L2 at the age of 12 to 18 and migrated to the US at the average age of 22). Participants were equally good at recalling negative English and Turkish words. Moreover, this effect was stronger in the L2 for some stimuli. Concerning taboo words, participants had better recall and recognition for these words in comparison with negative words. Ayçiçegi-Dinn and Caldwell-Harris (2009) did a follow up study: perhaps their findings could be related to the level of processing (shallow processing or deep processing). They had four conditions: word rating on emotional intensity (replication of previous study), counting letters (shallow), translation (deep) and word association (deep). Averaging over the four tasks, they found that recall was equally strong for L1 and L2, except for the reprimands (more recall in L2). For the taboo words, the same conclusions hold as their previous study (Ayçiçegi & Harris, 2004): participants had a better recall and recognition for them compared to negative and neutral words. Those two studies supports the suggestion that taboo words should be seen as a ‘special’ type of emotional words. Ferré, García, Fraga, Sánchez-Casas and Molero (2010) investigated whether this effect was dependent of age of acquisition or the setting where bilinguals learned their second language. This last variable had two factors. Either the participants had learned their L2 outside a classroom, i.e., naturalistic context, or the participants had learned it in the classroom, i.e. instrumental context. Early Spanish-Catalan bilinguals (naturalistic context – experiment 2) and late Spanish-English bilinguals (instrumental context – experiment 3) had to rate emotional or neutral words. The language in which the words were presented was blocked (one block 5 for each language). In both experiments, there was no difference in recall between the two languages. Also, when they compared both groups, there was no difference in the magnitude of the emotional effect in L2. These four studies indicate that there is a great variability of results in the studies on memory recall in bilinguals. All of them found different effects: Anooshian and Hertel (1994) found better recall in L1, Ayçiçegi and Harris (2004) found better recall in L2 (for some stimuli), Ayçiçegi-Dinn and Caldwell-Harris (2009) and Ferré et al. (2010) found no differences between the two languages. Furthermore, Ferré et al. showed that language dominance, age of acquisition or context of acquisition of L2 does not influence the emotional experience in L2. This is in contrast with the studies of Dewaele (2004a, 2004b) and the results of Anooshian & Hertel (1994). For taboo words, these studies show that participants have an advantage in recall for these words, but on average, there is no difference between L1 and L2. The findings of those four studies give support to the notion that the emotional intensity between different languages is influenced by other variables. This could be due to characteristics of the participants and/or of the stimuli. Another line of research is investigating attention towards emotional words. A frequently used paradigm is the rapid serial visual presentation (RSVP). This is a paradigm where stimuli are presented very brief to participants. When distractor and target a presented within 500 ms from each other, an attentional blink can occur, resulting in worse performance on the target. Studies with monolinguals have already shown that when this distractor is an emotional or a taboo word, people have a slower reaction on the target (e.g. Zeelenberg et al.). Moreover, Mathewson et al., 2008 found that taboo words led to more interference than emotional words (even negative words). Additionally, when the emotional word is a target, the attentional blink is reduced (Anderson, 2005). This suggests that emotional words capture the attention more than neutral words and that there are less attentional resources required to process emotional words. Colbeck and Bowers (2012) investigated this effect with bilinguals. They showed a stream of 18 words, with 1 critical distractor (English taboo or sexual word) and 1 color target to Chinese-English bilinguals and native English speakers. The participants had to identify the color-word. For the native English speakers, they found the classical finding: they were worse at identifying the color word when it was preceded by an emotional or a sexual distractor. For the bilinguals, this effect was reduced: they performed better on the task than the native speakers when there was an emotional or a sexual distractor. Even when they only took the ‘best scoring’ bilinguals in account, the effect still persists. 6 Another paradigm where the attention for emotional words is investigated is the Stroop task (Stroop, 1935). The emotional Stroop task is a modified version of the original Stroop task. In this task, participants are shown emotional and neutral words in different colours. Their task is to say the colour of the words aloud, and ignore the written word. The general finding is that, when the target word is an emotional word, there is more interference when saying the colour of the word (e.g. Pratto & John, 1991). This interference is caused by the emotional value the word has, and not by incongruency (as in the classical Stroop task). Sutton, Altarriba, Gianico and Basnight-Brown (2007) administered this emotional Stroop task from Spanish-English bilinguals. However, note that the bilinguals were more proficient in their second language than in their first. The results showed that there was an emotional Stroop effect: the participants were slower when they had to pronounce the color of an emotional word. Nevertheless, this effect was not moderated by language: there was no difference between L1 and L2. This could conclude that this null-effect could be due to the proficiency of the L2: the participants used their L2 at that time more than their first, which could have altered their emotional experience in their L2. Eilola et al., (2007) did an emotional Stroop task as well, but their participants were late Finnish-English bilinguals. However, they were still very proficient in their L2. The authors used emotional and taboo words as distractors. Participants were slower on taboo words and negative words compared to neutral words. Likewise, they found no difference in effect between the L1 and L2. Eilola et al. concluded that the emotional effect is dependent on the proficiency of the language: the more proficient you are, the more emotionally attached and the less discrepancy between L1 and L2 in the emotional Stroop task. The authors suggested that proficiency does play a role in the emotional experience towards a certain language. However, these two studies are again in contrast with what Ferré et al. (2010) and Colbeck and Bowers (2012) found and concluded. This again shows that there are mixed results concerning the emotional intensity of L1 and L2 in behavioural studies. Moreover, there is no clear evidence towards what actually influences this effect. In short, these behavioural studies are trustworthier than the self-reported studies from Dewaele (2004a, 2004b, 2008). These studies also suggest that there is a difference in emotional experience between L1 and L2, but this difference can be influenced by various factors. Some of the above studies have proposed several variables that might influence this effect: e.g., age of acquisition, proficiency,. . . However, other studies dismissed these variables. The results of the behavioural studies can also be influenced by other processes (e.g. attention, memory,. . . ). Hence, it is recommended to use a different dependent variable than reaction time. The brain-based 7 perspective suggests that all subjective experiences have a measurable physiological correlate (Harris, 2004). Thus, physiological measurements can investigate the emotional difference between L1 and L2 in a purer and clearer way (e.g. this measurement can pick up effects that are not even conscious). Additionally, another advantage is that this can investigate the underlying mechanisms. Thus, the following studies investigated if L1 and L2 differ from each other in emotional intensity using physiological correlates. Bilinguals and physiological studies The study of Harris et al. (2003) is one of the first that investigated the emotional experience of bilinguals with skin conductance levels (SCL’s, formerly known as Galvanic skin response). They presented English and Turkish target words to Turkish-English bilinguals using the auditory and visual modality. These target words consisted of neutral words, positive and negative words, reprimands and taboo words. The reprimands were childhood reprimands such as ‘Shame on you!’. The language of the words was mixed per block. Results showed that the taboo words elicited the strongest autonomic response; these were even stronger when the taboo words were presented in their L1. Another finding was that when the stimuli were presented in the auditory modality, the skin conductance level was higher than when they were presented in the visual modality. In contrast with this, when the words were presented in L2, there was no difference between modalities. Harris (2004) found that age of acquisition mediated the relationship between emotional intensity in a language and the skin conductance level. She showed the same English target words as in Harris et al. (2003) with their Spanish translations to Spanish-English bilinguals. She had two groups of participants: early bilinguals (average age of acquisition: 3.7 years old) and late bilinguals (average age of acquisition: 7.9 years old). The skin conductance level was measured as well. Results showed that late bilinguals had a stronger skin conductance response when they saw childhood reprimands in their first language (L1 – Spanish). In contrast with this, early bilinguals had no difference in skin conductance response if the childhood reprimands were presented in L1 or L2. Harris explained this by age of acquisition: the early Spanish-English bilinguals had acquired their L2 at an early age. Also, they moved to the US at an early age (average: 3.1 years old), thus they were very familiar with this culture and emotional value of it. Harris and Ayçiçegi-Dinn (2009) also used skin conductance responses, but they let Turkish-English bilinguals read Turkish and English true or false statements. They found a 8 stronger skin conductance response when the participants read false statements and this response was even stronger for L2 (English). These findings supported the ‘double stressor’ account: for most participants it was harder to lie in L2, because speaking a second language requires more effort than speaking a native language. Moreover, lying also requires more effort and cognitive resources than speaking the truth. Thus, it is not sure what causes these stronger responses: it could be the result of the additional cognitive resources (‘I want to do my best, because it is in English, and I do not want to look like a fool’), or it could be the result of a more emotional experience to their L2. Eilola and Havelka (2010) did a follow up on their previous emotional Stroop experiment (Eilola et al., 2007). Native English speakers and Greek-English bilinguals had to react to the colour of an English target word, and ignore the meaning of it. However, what was new to their experiment was that they measured skin conductance levels. The behavioural results showed no difference between monolinguals and bilinguals, but the SCLs clearly showed a difference: it was significantly higher for the monolinguals in the condition of emotional and taboo words than for the Greek-English bilinguals. This study is very crucial because it suggests that at the behavioural level, there is no difference between the emotional perception of L1 and L2 (no difference in RT), but the physiological measurements are suggesting the opposite. Thus, it is very important to study the underlying mechanisms. To summarize, the physiological studies show more clarity: overall, emotional and taboo words in L1 elicited stronger skin conductance responses than in L2. Moreover, as Eilola et al. (2010) showed, this is independent from behavioural results. However, skin conductance levels cannot give information why these levels are higher in L1 than in L2 (as in Harris and Ayçiçegi-Dinn (2009)). As the authors suggested, it could simply be explained by the emotional strength of L1. Nevertheless, SCLs are not time-locked to the stimuli and is a more diffuse measurement, so more general and non-task related factors could influence this response (e.g. mood of the participant, external conditions,. . . ). Moreover, SCLs measure the reaction of the sympathetic nervous system. Thus, it would be better to study the origin of this reaction (the brain). A last remark is that SCLs habituate when stimuli are repeated (Codispoti, & De Cesarei, 2007). This habituation does not happen with ERPs (LPP – Codispoti, & De Cesarei, 2007). Also, ERPs are a continuous, direct and online measurement. It makes it possible to investigate every change between the stimulus presentation and the response. Another advantage of ERPs is that this measurement is time-locked to the processing of the stimuli, so we know with certainty that an event has elicited that response in the ERPs. Thus, when using ERPs, we 9 can be more conclusive about why these differences between L1 and L2 exist. As a conclusion, this is a better measurement to investigate these differences. Present study Emotional stimuli have the property to automatically capture our attention more than neutral stimuli, indicating differential processing of both types of stimuli (see previous studies: e.g. Sutton et al., 2007; Zeelenberg, et al., 2011). This difference is also reflected in the ERPs: when an emotional stimulus is presented, two components can be found. First, emotional stimuli elicit a larger peak within the 200 – 300 ms range (e.g. Dhooge, et al., (submitted); for an overview, see Hajcak, MacNamara & Olvet, 2010; Schupp et al., 2006). Additionally, the P300 might be followed by a Late Positive Potential (LPP), which also shows more positive potentials for emotional stimuli. This is a broad superior-posterior positivity (Hajcak et al., 2010). The difference between the P300 and the LPP is that the P300 is seen as a reflection of the increase in attention towards task-relevant stimuli. The LPP on the other hand is a reflection of the vigilance and processing of intrinsically motivating stimuli (Hajcak et al., 2010). Previous studies already showed that ERPs are effective at measuring the emotional value of a certain language. Wu and Thierry (2012) used ERPs in a translation-priming paradigm. Chinese-English bilinguals were presented English and Chinese emotional and neutral words. The first word served as a prime and they had to judge the the second one whether it was related in meaning to the prime. The crucial part of their experiment was that for some words pairs, prime and target shared the same sounds when they were translated into Chinese, but they were unrelated in meaning and form in English (e.g. failure – poet : shi bai – shi ren). The results of the ERPs showed that when prime and target are related in sounds in Chinese, the ERPs in the negative valence condition were significantly different from the ERPs in the positive and neutral conditions. Moreover, the negative valence failed to elicit the ERPs amplitude. These results suggest that bilinguals use the L1 (Chinese) translation to judge whether the target was related to the prime, but this only happened with neutral and positive words. This indicates that negative valenced words have a ‘special status’ when compared to the positive valenced words. The authors suggested that we are trying to protect ourselves against the negative connotation of these words by not activating their meaning in our L1. This might indicate that this meaning is too emotional or too offensive in our L1. Opitz and Degner (2012) investigated the different sense of emotionality in L1 and 10 L2 with ERPs. They presented positive, negative and neutral nouns to German-French and French-German bilinguals. The participants had to make a lexical decision on the target words. Their dependent measure was the early posterior negativity (EPN). What they found was no difference in amplitude for the EPN in L1 and L2, but it was delayed in L2. However, this experiment has some shortcomings, so that the results might be distorted. The authors matched their stimuli pairwise, but it has not been mentioned if this was also done for every condition. Previous studies have shown that the frequency can influence the amplitude: words with a low frequency can elicit a smaller P300 and high frequency words can elicit a smaller P1. For the EPN, low frequency neutral words elicit a larger EPN than high frequency neutral words. High frequent negative words elicit a larger EPN than low frequency negative words. Moreover, this difference is not observed for positive words (Scott, O’Donnell, Leuthold & Sereno, 2009). So, frequency can influence the amplitudes of the ERPs. Thus, the difference between the amplitude for positive and negative words could be due to the frequency difference than to the emotionality between the two languages. A second remark is that they used stimuli from a database according to valence and arousal rated by monolinguals (German and French). When a study tries to investigate the emotionality of words in L1 and L2, several words will be more emotional in L1 than the translation equivalent in L2. They tried to solve this problem by using a balanced bilingual design. However, it would have been better if they have rated their stimuli by a similar sample of the population on valence and arousal to control for this effect. Another remark is that in their ERPs, they only looked at the effect on the early time course. However, there can be some differences later in that time course (e.g. in the LPP). As a last remark, the authors used a lexical decision task. However, the retrieval of the word can influence the effect the word has when it was presented. Thus, the results of their study could be biased by factors they did not control. The present study tried to control for these confounds. The goal of this study was to investigate with ERPs whether there is a difference in emotional experience in L1 and in L2. Based on the previous studies, we hypothesize that there is a difference in emotional intensity between L1 and L2. More specifically, L1 will elicit a stronger emotional response in comparison to L2. (e.g., Harris, 2004; Harris et al., 2003). We used a delayed-naming task to investigate this matter. In this task, participants are presented with the to-be named target. They can prepare their response prior to the onset of the target word. Since we also wanted to investigate the difference between negative and taboo words, our critical stimuli were negative and neutral words (experiment 1) and taboo and neutral words (experiment 2). The reason why we did not include any positive stimuli is because of the 11 difference in processing between positive and negative words: e.g. subjects are faster on positive words compared to negative and neutral words (Kissler & Koessler, 2011; Palazova, Mantwill, Sommer & Schacht, 2011). Also, we investigated the difference between taboo and emotional words in L1 and L2. Thus, negative words are more in line with the taboo words. We examined this effect with an objective, online and direct method – ERP signals. The dependent measures were the P300 and the long positive potential (LPP). Our hypothesis was that negative and taboo words would elicit a P300 and an LPP. The amplitude of the P300 and the LPP would be larger in L1 than in L2. Experiment 1 In the first experiment, we wanted to investigate if there was a difference in the emotional intensity for negative and neutral words. Second, we wanted to establish whether this difference differed between L1 and L2. A delayed naming paradigm with Dutch and English words was used to study this difference. We chose to use ERPs as the dependent measure, since it is very sensitive, online and direct method. More specifically, we focussed on the P300 and the LPP. We hypothesized that the amplitudes would be larger for negative words in comparison to neutral words. Moreover, the difference in amplitude between negative words and neutral words would be larger in L1 than in L2. Method Participants. Twenty psychology students from Ghent University participated in this experiment (2 male and 18 female). They received a small compensation. Before they were accepted to this study, they had to fill out the LEXTale English and Dutch (Lemhöfer & Broersma, 2011) to make sure that all participants had a sufficient knowledge of English (L2). Only participants that had a score higher than 85% on Dutch and 60% on English were allowed to participate. Also, participants that scored higher on the English version than on the Dutch version were excluded. The average score on the Dutch version was 93.62% and for the English version 78.44%. None of the participants had a history of neurological or psychiatric disorders. All participants were right handed (The Dutch handedness questionnaire, Van Strien, 2002). They reported that Dutch was their native language (L1), and the language they used the most (table 1). All participants received English (L2) education in secondary school and they lived in a Dutch dominant environment. All of them still came in contact on a regular basis 12 with English by internet, tv, films, radio, etc. For their L2, most of the participants learned this at home (n = 16) and some of them at school (n = 2). The other 2 participants learned English on holidays. Table 1: Self-Assessed Ratings (7-point Likert Scale) of L1, L2 and L3 Proficiency (Experiment 1). Standard deviations are included in parentheses Language L1 (Dutch) L2 (English) L3 (French) Stimuli. Skill Experiment 1 Writing 6.20 (.53) Reading 6.70 (.47) Understanding 6.60 (.60) Speaking 6.50 (.69) General proficiency 6.5 (.61) Writing 4.89 (.88) Reading 5.75 (.64) Understanding 5.75 (.64) Speaking 5.40 (.75) General proficiency 5.16 (.51) General proficiency 3.70 (1.22) Before we administered the experiment, we chose to rate our stimuli. Volunteers, who received a small monetary contribution, performed this rating. The volunteers were similar to the subjects in the actual experiment in terms of education, age and L1 and L2 knowledge. First, a huge pool of potential emotional nouns was gathered, using the ANEW and previous studies (Ayçiçegi-Dinn & Caldwell-Harris, 2009; Belezza, Greenwald & Banaji 1986; Bertels, Kolinsky & Morais, 2009; Bertels, Kolinsky & Morais, 2010; Eilola et al., 2007; Ferré et al., 2010; Guillet & Arndt, 2009; Harris et al., 2003; Harris, 2004; Janschewitz, 2008; Pavlenko, 2002; Thomas, Johnstone & Gonsalvez, 2007 and Zeelenberg et al., 2011). English words were being translated to Dutch and vice versa. Then, we omitted all cognates and homophones from our stimulus set, because those words could enhance the bilingual lexical processing (e.g. Costa, Caramazza & Sebastian-Galles, 2000). Further, using the SUBTLEX-Dutch (Keuleers, Brysbaert & New, 2010) and SUBTLEX-English (Brysbaert & New, 2009), we searched for 13 control neutral words, which are matched on frequency, word category (nouns) and number of letters (identical). Using a paired t-test, we found no significant difference between all categories of words in terms of frequency (p > .1). Eventually, only 50 words each category remained (in total: 200). Those stimuli were randomized for each participant in our rating study, so that every participant received a different order. Participants had to judge the stimuli on a 7-point likert scale on valence (1 = very negative, 7 = very positive) and arousal (1 = not arousing, 7 = very arousing). To make sure we had negative stimuli, we selected all Dutch negative words that had a rating lower than 2.5. For the neutral words, Dutch words were selected that had a rating higher than 4 and lower than 5. We only selected on Dutch words, because else we would have influenced the difference of perceived emotional intensity of both languages. If we would have used the same criteria to select on the English words as well, then the perceived difference between Dutch and English words would vanish. From our rated data pool, 26 stimuli per category were selected. Those critical stimuli were being used in the delayed naming task. They were still matched in terms of frequency, number of letters and word category (p > .1) (see table 2). For the full list of critical items: see appendix A. Table 2: The characteristics of the stimuli in experiment 1. Standard deviations are displayed in parentheses. Reported p-values: paired t-test between different conditions. # letters p value letters English Dutch Negative 7.27 (2.32) 7.00 (2.08) Control 7.27 (2.32) 7.00 (2.08) p-value Identical Identical Procedure. word freq p-value freq English Dutch >.5 2.63 (.54) 2.67 (.57) >.4 >.5 2.72 (.55) 2.69 (.51) >.7 >.25 >.6 Before the participants begun with the experiment, they were asked to fill out the informed consent and the relevant questionnaires. Next, they were being prepared for the ERP-experiment. Participants were seated at 60cm in front of a computer screen. Responses were collected by means of a voice key. We emphasized on the fact that they should not blink during the trial (from fixation cross until after the response). At the beginning of the experiment, Dutch instructions were presented on the screen. The subjects started with a practice block in one language, followed by 3 experimental blocks in the same language. Then again a practice block 14 in the other language, followed by 3 experimental blocks in that language. The language blocks were contra-balanced over participants Every trial started with a fixation cross presented in the middle of the screen for 700 ms. Immediately the target word was presented for 1000 ms. Next, an exclamation mark was shown in the centre of the screen, which was the cue for the participants to pronounce the word they have just seen. This remained on the screen until the participants pronounced the word, or until the maximum response time was expired (= 5000 ms). The intertrial interval was 1500 ms, which is a sufficient amount of time to blink. Between each block, subjects could take a self-paced break. Every practice block contained 20 trials (10 emotion words and 10 control words). Those words did not occur in the actual experiment. The experimental blocks contained 52 trials (26 emotion words and 26 neutral words). Each block contained the same words, but they were randomized every time. An additional restriction was included so that an emotional word could not be presented on more than three consecutive trials. After the experiment, the participants had to make backwards and forwards translations on the critical stimuli. Also, they had to judge each word on emotionality on a 7-point likert scale (1 = negative, 4 = neutral, 7 = positive). This was done so that unknown or wrongly translated words could be removed from the analyses. Additionally, words that were rated into a different category were also excluded from analyses. EEG Recording. EEG was recorded continuously at a sampling rate of 512Hz from 64 channels with the Biosemi Active Two system referenced online to the CMS-DRL ground. To record vertical eye-movements, four unipolar electrodes were placed 1 cm below and above the eyes. Two other unipolar electrodes were placed beyond the outer canthus of the eyes, to register horizontal eye-movements. Additionally, two electrodes were placed on the mastoids. Results and discussion Behavioural Results. Trials with a RT slower than 150 ms and faster than 3 standard deviations above the mean RT for each condition of each subject were excluded from analysis. Participants that were excluded from the EEG analyses, were also removed here (4 participants, see below). Additionally, wrongly or not translated words were also excluded. In total, 24.05% of data were not included in the analyses. A repeated measures ANOVA was performed with emotion (2; negative vs neutral) and language (2; Dutch vs English) as crossed variables. There 15 was a marginal significant interaction between language and emotion, F1 (1, 15) = 3.84, p = 0.069; F2 (1, 24) = 10.66, p = .003. 1 Further, there was a main effect of language, F1 (1, 15) = 7.15, p = .017, F2 (1, 24) = 33.50, p = .000 but no significant main effect of emotion, F1 (1, 15) = .24, p = .629, F2 (1, 24) = 0.01, p = .907. Table 3: Average naming latency in experiment 1 (in ms) for negative and neutral stimuli in L1 and L2 in the delayed naming paradigm. Neutral L1 Negative L1 Neutral L2 Negative L2 Mean 523.87 512.74 569.77 588.72 Standard Deviation 142.49 125.96 139.18 149.92 EEG Results and discussion. All EEG data were analyzed using EEPROBE (ANT inc.). The data were offline re-referenced to the average of the right and the left mastoids. The EEG data were filtered offline with a bandpass filter of .01Hz – 30 Hz. Additionally, the data were corrected for blinking, using PCA-transformed EOG components (Nowagk & Pfeifer, 1996). The ERPs were time-locked to the onset of the target, with epochs being generated in a 1200ms time window (baseline: -200ms – 0ms), with 0ms being the onset of the target word. Before averaging, the epochs containing errors (e.g. EEG artefacts,...) and wrongly translated words were removed. Outliers that were removed from the behavioural analyses were still include here, apart from the trials were the voicekey did not trigger. Four participants were excluded due to insufficient (i.e. fewer than 15% in one of the conditions) after artifact rejection. The total data loss was 53.23%. Visual inspection of the grand average waveforms showed a large difference between negative and neutral words, which appeared to be larger for Dutch than English. The effect appeared to be at its maximum between 300ms and 600 ms. To determine the moment in time where the waves started to differ significantly, a repeated measures ANOVA with emotion (negative vs neutral) x language (Dutch vs English) x frontality (posterior vs mid posterior vs central vs mid frontal vs frontal electrodes) x laterality (left hemisphere vs mid left hemisphere vs midline vs mid right hemisphere vs right hemisphere) was performed for 20ms bins. In a series of 3 consecutive time bins, two had to reveal significant effects in order to be taken into account when determining larger time windows. Given our hypotheses, all effects with emotion 1 We did not hypothesize to find any effect in our reaction time, so this was an unexpected finding. Even though it was only marginal significant, we performed a paired samples t-test to investigate this interaction. However, there were no significant results for the relevant pairs (negative Dutch – neutral Dutch: t(15) = 1.00, p = .333 ; negative English – neutral English: t(15) = -1.73, p = .104. 16 (i.e. main effect and interactions) were taken into account when determining the larger time window. The electrodes that were being used were, represented over frontality: F3/F1/Fz/F2/F4; FC3/FC1/FCz/FC2/FC4; C3/C1/Cz/C2/C4; CP3/CP1/CPz/CP2/CP4; P3/P1/Pz/P2/P4. In all analyses, the Greenhouse-Geisser correction was applied when the sphericity assumption was violated. Here, the uncorrected degrees of freedom are reported. Analyses of the bins revealed reliable effects between 400ms until 540ms and 600ms until 880ms. Therefore, a repeated measures ANOVA with the same factors as before was performed in these time windows. In the 400ms – 540ms time window, there was a significant interaction between emotion and frontality, F(4, 60) = 3.87, ε = .205, p = .036). This interaction showed a marginal significant effect on the electrodes on the posterior part of the brain (P3/P1/Pz/P2/P4): F(1, 15) = 3.71, ε = .198, p = .073. There was no significant effect for the frontal part of the brain F(1, 15) = .00, ε = .00, p = .985, the mid-frontal part of the brain, F(1, 15) = 1.22, ε = .075, p = .287, the central part of the brain, F(1, 15) = 2.58, ε = .147, p = .129, and the mid – parietal part of the brain, F(1, 15) = 2.92, ε = .163, p = .108. Important for our hypotheses, there was also a marginal significant interaction between language and emotion, F(1, 15) = 4.40, ε = .227, p = .053. Here, a significant difference for the contrast Dutch neutral words and Dutch negative words, F(1, 15) = 8.05, ε = .349, p = .013, was found, but no significant difference for English neutral words and English negative words, F(1, 15) = 1.13, ε = .07, p = .304. There were no significant effects for the main effect of emotion, F(1, 15) = 2.08, ε = .122, p = .169. Further, the other interactions were not significant (no interaction between emotion and lateralization, F(4, 60) = 2.28, ε = .132, p = .110, no interaction between emotion, frontality and lateralization, F(16, 240) = 1.01, ε = .063, p = .420, no interaction between language, emotion and frontalization, F(4, 60) = .70, ε = .045, p = .528, no interaction between language, emotion an lateralization, F(4, 60) = 1.60, ε = .096, p = .217 and no 4-way interaction between language, emotion, frontality and lateralization, F(16, 240) = 1.27, ε = .078, p = .295). In the second time window between 600ms and 880 ms, we found a significant interaction between language and emotion (F(1, 15) = 5.03, ε = .251, p = .04). Looking into this interaction, we found a significant difference for the contrast Dutch neutral words and Dutch negative words (F(1, 15) = 6.88, ε = .314, p = .019), but no significant difference for English neutral words and English negative words (F(1, 15) = .43, ε = .028, p = .521). The analyses for the other effects showed no significant effects (no main effect of emotion, F(1, 15) = 2.08, ε = .122, p = .170, no interaction between emotion and frontalization, F(4, 60) 17 = 1.24, ε = .076, p = .306, no interaction between emotion and lateralization, F(4, 60) = 1.47, ε = .089, p = .241, no interaction between emotion, frontality and lateralization, F(16, 240) = 1.32, ε = .081, p = .4257, no interaction between language, emotion and frontalization, F(4, 60) = 1.26, ε = .078, p = .299, no interaction between language, emotion an lateralization, F(4, 60) = 1.17, ε = .072, p = .328 and no 4-way interaction between language, emotion, frontality and lateralization, F(16, 240) = 1.08, ε = .067, p = .352). See Appendix B and Appendix C for the average and standard deviation of the amplitude of the waveform over participants per condition per electrode for each significant time window. Additionally, see Appendix D for the grand averages for each electrode. Figure 1. Grand averages for negative words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrode Fz. Figure 2. Grand averages for negative words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrode Cz. 18 Figure 3. Grand averages for negative words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrode Pz. We hypothesized that we would not find any significant results on the reaction time. However, we did find significant results here. The main effect of language can be explained by the task that the participants had to execute. This effect showed that participants were slower for English words in comparison to Dutch words. Even though it was a delayed naming task, it might be that participants found it hard to formulate the words in L2 and needed more time to pronounce the words. This could explain why they were slower to name L2 words. There was also a marginal significant interaction between language and emotion. However, further analyses showed that the relevant contrasts were not significant. Concerning the ERPs, we found a large positive amplitude emerging around 400ms until 540ms and another positive wave between 600ms and 800ms. Both components are distributed at the posterior part of the brain, which was also reflected in the analyses: the interaction between emotion and frontality revealed a marginal significant interaction between emotion and language on the posterior part of the brain. According to the timing and the distribution of these peaks, they can be interpreted as the P300 and the LPP, respectively. The P300 has been shown to reflect the allocation of resources that have limited capacity for stimuli that are task-relevant (Sutton, Braren, Zubin & John, 1965). In this experiment, we used negative words. Emotion stimuli have an intrinsic motivational significance and thus can be automatically processed as task-relevant (Hajcak et al., 2010), reflecting in a P300. Next, the LPP emerges when emotional stimuli are being presented (e.g., Cuthbert, Schupp, Bradley, Birbaumer & Lang, 2000 and Hajcak et al., 2010). Thus, the observation of those two components is valid in this study. There was also a marginal significant interaction between language and emotion that was significant over all the electrodes. Further analyses showed that the contrast for negative L1 19 words and neutral L1 words was significant. In contrast to this, the contrast for negative L2 words and neutral L2 words was not significant. In other words, negative Dutch words have a larger positive amplitude in comparison to neutral Dutch words. This difference was significant for L1, but not for L2. These results support our hypothesis that L1 is more emotional intense in comparison to L2. This experiment only investigated this effect with one type of emotional words. To elaborate the results, the next experiment investigated if this effect also holds for taboo words. As previously mentioned, studies that have included taboo words in their stimuli set found that these stimuli differ from other emotion stimuli (e.g. Guillet & Arndt, 2009; Mathewson et al., 2008). As a result, with experiment 2 we wanted to replicate the findings of experiment 1 with taboo words. Experiment 2 In this experiment, we wanted to investigate if there is any difference in the emotional experience when participants see a taboo word in either Dutch or English. Also, with the two experiments executed in this study, we could compare the differences in experience between emotional and taboo words. We wanted to investigate whether there was a difference in brain activity between taboo and neutral words and whether this was modulated by language. Jay et al. (2008) already showed that there is a physiological difference between taboo and neutral words: taboo words elicited a skin conductance response more frequently than other types of words. However, ERPs are a more sensitive and online and direct method as skin conductance responses, so we chose to investigate this difference with ERPs. Because of this, we could follow the process that happened between the stimulus presentation and the response in real time. Again, we set up a rating study for our stimuli set. We also used the same procedure and method as in experiment 1. Method Participants. Eighteen volunteers participated in this experiment (6 male and 12 female) and received a small monetary compensation. The same criteria were used as in experiment 1. Additionally, they had not participated in the first experiment. Before they were accepted to this study, they filled out the same questionnaires as in experiment 1. Participants were all right handed (Van Strien, 2002). Their all reported that their L1 was Dutch, L2 was English 20 and L3 was French (table 4). All of them still came in contact on a regular basis with English by internet, tv, films, radio, etc. For their L2, most of the participants learned English at school (n = 8), followed by at home (n = 6). The other participants learned English on holidays (n = 3) and on language camp (n = 1). Table 4: Self-Assessed Ratings (7-point Likert Scale) of L1, L2 and L3 Proficiency (Experiment 2). Standard deviations are included in parentheses Language L1 (Dutch) L2 (English) L3 (French) Stimuli. Skill Experiment 1 Writing 6.20 (.51) Reading 6.67 (.59) Understanding 6.72 (.46) Speaking 6.61 (.61) General proficiency 6.67 (.59) Writing 4.78 (.81) Reading 5.72 (.46) Understanding 5.72 (.46) Speaking 5.17 (.71) General proficiency 5.39 (.70) General proficiency 4.00 (0.91) Stimuli were also gathered from previous studies (Ayçiçegi-Dinn & Caldwell- Harris, 2009; Bertels et al., 2009; Eilola, et al., 2007; Guillet & Arndt, 2009; Harris et al., 2003; Harris, 2004; Janschewitz, 2008; MacKay et.al., 2004 and Zeelenberg et al., 2011) and from websites that indicated some words as taboo and inappropriate. Control words were matched the same was as in experiment 1. Here, also a rating study was used similar to experiment 1. In contrast with experiment 1, participants had to to judge the stimuli on a 7-point likert scale for different dimensions similar as in Janschewitz (2008): personal use (1 = never use this word, 7 = use this word all the time), offensiveness (1 = not offended, 7 = very offensive), familiarity (1 = never encountered, 7 = encounter this all the time), tabooness personal (1 = not taboo, 7 = very taboo), tabooness interpersonal (1 = not taboo, 7 = very taboo), arousal (1 = not arousing, 7 = very arousing) and imaginability (1 = does not recall a mental picture, 7 = recall a very vivid mental picture). Despite our efforts get a good taboo rating, participants tended to give 21 low scores on the taboo words when rating for tabooness (overall average rating taboo: 2.58, overall average rating neutral: 1.09; on a likert scale ranging from 1 to 7). Thus, we decided to reject the taboo words with the lowest ratings. Additionally, verbs were also included to have a sufficient amount of stimuli. Eventually, 30 stimuli in each condition were constructed (taboo Dutch, neutral Dutch, taboo English and neutral English). These critical stimuli were used in the delayed naming task. They were still matched on the same criteria as in experiment 1 (see table 5). For the full list of critical items: see appendix B. Table 5: The characteristics of the stimuli in experiment 2. Standard deviations are displayed in parentheses. Reported p-values: paired t-test between different conditions. # letters p value letters English Dutch Taboo 5.40 (1.94) 6.10 (2.26) Control 5.40 (1.94) 6.10 (2.26) p-value Identical Identical Procedure. word freq p-value freq English Dutch >.1 2.54 (.52) 2.74 (.56) >.1 >.1 2.72 (.57) 2.69 (.50) >.5 >.1 >.6 The same procedure was used as in experiment 1 with the exception that the participants saw 60 words per block. EEG Recording. The recording and analyses were identical to experiment 1. Results and discussion Behavioural Results. The same criteria as in experiment 1 were used for data-reduction. In total, 24.05% of data were not included in the analyses. A repeated measures ANOVA was performed with emotion (negative vs neutral) and language (Dutch vs English) as crossed variables. There were no significant interactions, F1 (1, 14) = .78, p = .393, F2 (1, 29) = .06, p = .808), a marginal main effect of emotion for participants, F1 (1, 14) = .3.166, p = .097, F2 (1, 29) = .825, p = .371, and no significant main effect of language, F1 (1, 14) = .064, p = .804, F2 (1, 29) = 1.461, p = .237. 22 Table 6: Average naming latency in experiment 2 (in ms) for taboo and neutral stimuli in L1 and L2 in the delayed naming paradigm. Neutral L1 Taboo L1 Neutral L2 Taboo L2 Mean 607.92 604.56 613.89 603.94 Standard Deviation 123.07 127.11 122.32 115.68 EEG Results and discussion. Before the server crashed, we only had 11 participants to perform the analyses on, which is not sufficient to draw any conclusions on. However, to give an indication, analyses and discussion are still reported below. The same pre-processing was used as in experiment 1. Visual inspection of the grand average waveforms showed a large difference between taboo Dutch and neutral Dutch words. This difference was smaller for taboo English and neutral English words. The effect appeared to be at its maximum between 400ms and 800 ms. The same method was used as in experiment 1 to determine the moment in time where the waves started to differ significantly. Also, the same effect were considered and the same electrodes were used. In all analyses, the Greenhouse-Geisser correction was applied when the sphericity assumption was violated. Here, the uncorrected degrees of freedom are reported. Analyses of the bins revealed reliable effects between -200ms until -140ms, -60ms until 40ms and 380ms until 820ms. Therefore, a repeated measures ANOVA with the same factors as before was performed in these time windows. In the -200ms until -140ms time window, there was a significant main effect of emotion, F(1, 11) = 8.58, ε = .438, p = .014). There was no interaction between language and emotion, F(1, 11) = 1.76, ε = .138, p = .212, a marginal significant interaction between emotion and frontalization, F(4, 44) = 2.97, ε = .213, p = .068 no interaction between emotion and lateralization, F(4, 44) = .75, ε = .064, p = .481, no interaction between emotion, frontality and lateralization, F(16, 176) = 1.14, ε = .094, p = .348, no interaction between language, emotion and frontalization, F(4, 44) = .90, ε = .076, p = .414, no interaction between language, emotion an lateralization, F(4, 44) = .52, ε = .045, p = .633 and no 4-way interaction between language, emotion, frontality and lateralization, F(16, 176) = 1.19, ε = .098, p = .328. In the -60ms until 40ms window, there was a significant main effect of emotion F(1, 11) = 9.19, ε = .455, p = .011). There was no interaction between language and emotion, F(1, 11) = .01, ε = .001, p = .931, no significant interaction between emotion and frontalization, F(4, 23 44) = 1.22, ε = .100, p = .315 no interaction between emotion and lateralization, F(4, 44) = .59, ε = .051, p = .532, no interaction between emotion, frontality and lateralization, F(16, 176) = 1.14, ε = .094, p = .348, no interaction between language, emotion and frontalization, F(4, 44) = .24, ε = .021, p = .725, no interaction between language, emotion an lateralization, F(4, 44) = 1.57, ε = .125, p = .236 and no 4-way interaction between language, emotion, frontality and lateralization, F(16, 176) = 1.32, ε = .107, p = .281. In the crucial time window between 380ms and 820ms, we found a significant main effect of emotion, F(1, 11) = 8.98, ε = .450, p = .012. There was also a significant interaction between emotion and lateralization, F(4, 44) = 3.50, ε = .241, p = .046. Further analyses showed that there was an effect on the left hemisphere (electrodes F3, FC4, C3, CP3, P3), F(1, 11) = 7.65, ε = .410, p = .018, on the mid left hemisphere (electrodes F1, FC1, C1, CP1, P1), F(1, 11) = 11.52, ε = .511, p = .006, on the midline (electrodes Fz, FCz, Cz, CPz, Pz), F(1, 11) = 12.93, ε = .540, p = .004, and on the mid right hemisphere (electrodes F2, FC2, C2, P2), F(1, 11) = 6.37, ε = .367, p = .028. There was only a marginal significant effect on the right hemisphere (electrodes F4, FC4, C4, CP4, P4), F(1, 11) = 3.70, ε = .252, p = .081. The main analyses showed also a significant interaction between language, emotion and frontalization, F(4, 44) = 3.84, ε = .259, p = .009. Further analyses showed an interaction of emotion and language on the posterior part of the brain (electrodes P3, P1, Pz, P2, P4), F(1, 11) = 5.173, ε = .320, p = .044. There was no interaction of emotion and language on the anterior part of the brain (electrodes F3, F1, Fz, F2, F4), F(1, 11) = 1.90, ε = .147, p = .196, the mid anterior part (electrodes FC3, FC1, FCz, FC2, FC4), F(1, 11) = .741, ε = .063, p = .408, the central part (electrodes C3, C1, Cz, C2, C4), F(1, 11) = 2.11, ε = .161, p = .174 and the mid posterior part (electrodes CP3, CP1, CPz, CP2, CP4), F(1, 11) = 1.87, ε = .131, p = .223. We further analyzed the interaction between language and emotion at the posterior part of the brain. This showed a significant difference for the contrast Dutch neutral words and Dutch taboo words, F(1, 15) = 14.03, ε = .561, p = .003, but no significant difference for English neutral words and English taboo words, F(1, 15) = 1.31, ε = .107, p = .276. The main analyses of this window revealed no significant interaction between emotion and frontalization, F(4, 44) = 2.29, ε = .172, p = .112, no interaction between emotion, frontality and lateralization, F(16, 176) = 1.31, ε = .107, p = .277, no interaction between language, emotion an lateralization, F(4, 44) = .63, ε = .054, p = .519 and no 4-way interaction between language, emotion, frontality and lateralization, F(16, 176) = 1.66, ε = .131, p = .186. To see the averages and standard deviations of each electrode in the significant time 24 window, see Appendix F. To see the figures of the grand averages, see Appendix G. Figure 4. Grand averages for taboo words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrode Fz. Figure 5. Grand averages for taboo words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrode Cz. 25 Figure 6. Grand averages for taboo words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrode Pz. In contrast with the previous experiment, we did not find any significant results in the reaction time. This is what we hypothesized in the beginning of this study: there would be no significant differences between reaction times, whereas there would be a significant effect in the ERPs. This dissociation between RTs and ERPs might seem arbitrary, however previous studies have found the same pattern of results. For example, Carretié et al. (2008) did not always find a difference between neutral and negative words in a lexical decision task in the behavioural results. The ERPs, on the other hand, showed that negative words are processed differently from neutral words. Comparing these findings with the previous experiment, we did not find an interaction between language and emotion for taboo words. This could mean that the marginal interaction in the first experiment was indeed not significant, as the further analyses already showed. On the other hand, it could also mean that the effect of emotional difference between the two languages is not present when taboo words are used. Future studies should investigate this matter further. Concerning the ERPs, we found some effects in our baseline. This is probably due to the lack of participants in this experiment. Due to the server crash, it was impossible to do the analyses with more participants. This should be resolved when more participants were added in the analyses. For our main effect, we found a large positive peak that emerged around 380ms. The significant effect lasted until 820ms after target onset. Even though the timing is not at 300ms precise, the first component can be interpreted as the P300. The timing of the P300 can shift to a later moment in time (e.g. McCarthy & Dunchin, 1981 and Polich, 1987), which could have 26 happened in this experiment as well. The grand averages also showed that the P300 blends into the LPP, which would explain why such a large window revealed a significant effect. Also, the lack of participants can explain this large window as well. The P300 and the LPP are related to the processing of emotional words (Hajcak et al., 2010), so the observation of those two components is valid in this experiment. We found an interaction between language, emotion and frontality, which revealed an interaction between emotion and language in the posterior part of the brain. This last interaction showed that the difference in amplitude of the P300 is larger for taboo Dutch words in comparison to neutral Dutch words. This was not the case for taboo English words and neutral English words. This is a similar finding to the first experiment with negative words. This larger difference between taboo words and neutral words in L1 is again evidence for a more emotional experience in L1 in comparison to L2. There was also an interaction between emotion and lateralization, which showed an effect of emotion on the entire electrodes, except for the most right row of electrodes (marginal significant). This shows that the difference between taboo and neutral words is picked up. This is independent from language: just seeing a taboo word elicited a very strong response in the brain in comparison to seeing a neutral word. This shows us that having taboo words have a very strong impact in the brain. General Discussion Research on bilingualism has revealed that bilinguals differ from monolinguals in several aspects, such as language processing (even in their native language) and cognitive control (Anooshian & Hertel, 1994). However, language is not only used to convey propositional information from a speaker to a listener, but of course also allows us to express emotions (Leventhal & Scherer, 1987). An intuition shared by many bilinguals is that the emotional experience of a word in a second language (L2) is considerably less intense than in a first language (L1) (Dewaele, 2004a, 2004b, 2008). The goal of this paper was to investigate whether L1 and L2 differ in emotional intensity. This was done by conducting two experiments: in the first experiment, negative and neutral words were used in L1 and L2. In the second experiment, the results were replicated with taboo and neutral words in L1 and L2. In the first experiment, there was a significant effect of language in the reaction times, namely there was a difference between the two languages. Participants were slower to react on their L2. This effect was no longer present in experiment 2. We believe that this is due to our stimuli. In the first experiment we used negative and neutral words in both language. However, 27 in every day life, L2 emotional words are not often used or encountered. Thus, it can be harder to name those words for bilinguals. On the other hand, in experiment 2 we used taboo and neutral words in L1 and L2. Here, participants encountered and used the taboo words more often than the negative words. It is quite common that Flemish people swear in English (“shit” or “fuck”). Hence, it could be that the participants found those words easier to pronounce. This then resulted in a non significant effect. Considering the ERPs, in both experiments we found a positive wave emerging around 300ms and a late positive complex at 600ms. These can be interpreted as the P300 and the late positive complex (LPP) respectively. Additionally, the topography of the P300 and LPP is distributed in the posterior part of the brain, which is in accordance to our results. Both have already been found in emotion literature (e.g. Opitz & Degner, 2012; Wu & Thierry, 2012): emotional words draw more attention towards them in comparison with neutral words (Cuthbert, et al., 2000 and Hajcak et al., 2010). The finding of these two components is in line with our predictions. Participants were presented with neutral and negative or taboo words. The larger positive amplitude for taboo and negative words demonstrated that our manipulation was successful: taboo and negative words elicited a larger emotional response in comparison to neutral words. In addition to this, the difference in amplitude between negative and taboo words with neutral words was larger in L1 (Dutch) when compared to the L2 (English). These findings support our hypothesis: L1 has a stronger emotional experience than L2. This is supported as well with negative words as with taboo words. Experiment 1 is in accordance with previous behavioural findings (e.g., Anooshian & Hertel, 1994) and clinical studies (e.g., Javier, 1996). Here, we found that negative words in L1 elicited a stronger emotional response in comparison to negative words in L2. Experiment 2 is in accordance with previous physiological findings (e.g., Harris et al., 2003 and Harris, 2004) and self report (Dewaele, 2004a, 2004b). In experiment 2 we found that taboo words elicited a stronger emotional response in L1 in comparison to L2. Moreover, when we asked the participants afterwards about their experience with the experiment, most of them said that they found it weird to say the taboo words out loud in their L1. They also reported to say them less loud in comparison to the L2 taboo words. The findings of these two studies support the notion that L1 is considered as a personal language, the language you use to express emotions with, while L2 is seen as more emotional distant, and more convenient to say taboo words in (Pavlenko, 2002). In experiment 2, we also found an interaction between emotion and lateralization. This was not present in experiment 1. In the second experiment, the presentation of a taboo word 28 elicited a strong response in the brain and this was independent from language. This was not the case with negative words. This could indicate that taboo words have a stronger emotional intensity in comparison to negative words. In the first experiment, we had a lot of data loss. This was mainly due to EEG artefacts. The second reason for this data loss was because participants did not know the word, or incorrectly translated them. We tried to prevent this by rating our stimuli in advance, so that commonly unknown words would not be included in the stimuli set. Additionally, we screened our participants on proficiency in L1 and L2, so that participants would have sufficient knowledge of L2. Nevertheless, some trials were lost due to wrong or incorrect translations. When we compare our results to the previous studies, they are in agreement with the studies of Dewaele (2004a, 2004b, 2008), Anooshian & Hertel (1994) and Harris (2004). Unfortunately, other studies did not find any difference between L1 and L2 when using emotional stimuli. We believe that one of the main reason for this lack of results is the characteristics of the participants. Previous research did not systematically consider these person variables. For example, in memory studies, Anooshian & Hertel (1994) found better recall in L1, Ayçiçegi & Harris (2004) found better recall in L2 for some stimuli and Ayçiçegi-Dinn & Caldwell-Harris (2009) found no differences between the two languages. However, they did not control person variables: Anooshian & Hertel and Ayçiçegi-Dinn & Caldwell-Harris used late L2 bilinguals, while Ayçiçegi & Harris used L2 proficient bilinguals. It could be the case that person variables influence the results (age of acquisition, learning context, proficiency and language environment). Hence, future research should investigate if these variables indeed have an influence on the results. If this is the case, then these variables should be taken into account. Another explanation of the mixed results in the literature is the stimuli set that was used in the previous experiments. Either the amount of words per condition was low (e.g. Ayçiçegi & Harris, 2004; Harris, 2004), the stimuli set contained cognates (e.g. Harris, 2004), the stimuli set was not been rated by other participants (e.g. Wu & Thierry, 2012), or the stimuli set was not controlled for different variables (Ayçiçegi & Harris, 2004; Ayçiçegi-Dinn, & Caldwell-Harris, 2009; Harris et al., 2003). These biases could have influenced the results of the previous studies. Additionally, some emotion words can have different meanings in their translation equivalents (Altarriba & Bauer, 2004). Moreover, there are terms of emotions that only exists in one language (e.g., ‘schadenfreude’ in German, ‘saudade’ in Portugese – “Emotions and Culture”, 2012). In this experiment, we tried to control our stimuli as best as possible. There were a sufficient amount of words per category (ca. 27 in each category). Cognates and 29 homographs were left out the stimuli set, since they could activate the other language (Costa et al., 2000). Additionally, our stimuli set in both experiments is matched over categories on frequency, amount of letters and word category. Moreover, we did not make use of a database like ANEW, since it was not been constructed for Dutch words when the study started (now Moors et al., 2013, have constructed a database for Dutch words). Nevertheless, when using a database, several words can be more emotional in L1 than the translation equivalent in L2. This can be solved by rating the stimuli. Here, all our stimuli were rated before the experiment begun, so that we would have a stimuli set with the desired characteristics, rated by a sample that is similar to the participants in the ERP experiments. With these caretakings, we hope that we ruled out any (or most) biases that could have influenced the results. To summarize, the mixed results found in the literature could be due to the person variables and/or due to the characteristics of the stimuli set. This experiment tried to control these variables: we recruited late L2 bilinguals, who lived in an L1 dominant environment and were proficient in their L2. For our stimuli sets, we controlled for different variables (see above) and rated the sets by other participants. Unfortunately, there are also some flaws in these experiments. First of all, it is difficult to literally translate taboo words, and even if there were literal translation available, those translations can differ in emotional experience in the two languages. However, in this experiment we made sure that the two targets had the equal amount of emotional experience in L1 and L2. Additionally, because literally translated taboo words are rare, this experiment made use of targets with different synonyms (e.g. ‘screw’ and ‘fuck), which could be exchanged with other targets. This has been done to make sure that there were sufficient amount of targets in each condition. Also, in the taboo experiment we made use of verbs as well. This was done because otherwise there would be an insufficient amount of targets per condition. Nonetheless, the experiment would have been cleaner if the stimuli set would only consists of nouns, similar to the first experiment. Second, when we constructed the stimuli set, we used the frequency of the words from the SUBTLEX-Dutch database (Keuleers et al., 2010) and the SUBTLEX-English database (Brysbaert & New, 2009). Yet, the construction of the SUBTLEX-English database relies on the frequency of native English speakers. The frequency of these words can differ from the frequency of L2 English speakers. We tried to accommodate to this shortcoming by rating the stimuli. Nevertheless, it would also be preferred to have frequency ratings from an L2 database. However, no such databases exist up to now. For follow-up studies, we first of all suggest to try to replicate these results with positive 30 words. Palazova et al. (2011) have demonstrated that positive words are being processed faster in comparison to negative and neutral words. Although several studies did not find any significant effects for positive words (e.g. Harris, et al., 2003 and Scott et al., 2009), the effect should also be present with positive words if bilinguals indeed have different emotional experience in L1 and L2. Additionally, the stimuli can also be presented in the auditory modality. Harris et al. (2003) already showed that when they presented stimuli in auditory, the skin conductance levels were higher than in the visual stimuli. Thus, it could be stimuli presented auditory in L1 and L2 have an even stronger effect. Another follow-up experiment could be to replicate these findings with pictorial stimuli. Previous studies have shown that using pictures elicited a stronger emotional response in comparison with emotional words (e.g. Hinojosa, Carretié, Valcárcel, Méndez-Bértolo,& Pozo, 2009). Thus, using pictures and/or presenting stimuli in an auditory modality could provide us more more information about this difference in emotional intensity between L1 and L2. A second line of research is to investigate which person variables influence the emotional experience in L1 and L2. The four most important ones are age of acquisition, L2 proficiency, learning context of L2 and the language environment. Harris et al., (2006) have already proposed an emotional context of learning theory. When L2 is learned after a certain age, this will influence the emotional responsiveness in L2. Also, when bilinguals are less proficient in their L2, this also results into a general decline in emotional responsiveness. The same can be stated for language environment. When bilinguals are immersed into their L2 environment, it can have a beneficial effect on the emotional experience in the second language. However, the learning context of L2 is not straightforward. One could say that learning L2 in a naturalistic way would benefit the emotional experience in L2. Yet, in this experiment taboo words were used – words that are often learned in a naturalistic environment – . Thus, following this reasoning, one would expect to find no difference in experiment 2 between L1 and L2 in the difference in amplitudes for taboo words and neutral words. But this is not what we have found. Hence, it is not so clear how strong this learning context have to be. Perhaps it has more influence on the emotional experience if the entire language is learned in a naturalistic setting. To conclude these two studies, we have shown that there is a difference in emotional experience between L1 and L2. This has been demonstrated by negative and taboo words. We have compared our results to previous findings and have proposed some follow-up studies to elaborate this effect. 31 References Altarriba, J., & Bauer, L.M. (2004). The distinctiveness of emotion concepts: a comparison between emotion, abstract and concrete words. American Journal of Psychology, 117, 389 – 410. http://www.jstor.org/stable/4149007 Anderson, A. K. (2005). Affective influences on the attentional dynamics supporting awareness. Journal of Experimental Psychology: General, 134, 258 – 281. DOI: 10.1037/00963445.134.2.258 Anooshian, L., & Hertel, P. (1994). Emotionality in free recall: Language specificity in bilingual memory. Cognition and Emotion, 8 (6), 503 – 514. DOI: 10.1080/02699939408408956 Ayçiçegi, A. & Harris, C.L. (2004). Bilinguals’ recall and recognition of emotion words. Cognition and Emotion, 18 (7), 977 – 987. DOI: 10.1080/02699930341000301 Ayçiçegi-Dinn, A. & Caldwell-Harris, C.L. (2009). Emotion-memory effects in bilingual speakers: a levels-of-processing approach. Bilingualism: Language and Cognition 12 (3), 291 – 303 DOI: 10.1017/S1366728909990125 Belezza, F.S., Greenwald, A.G., & Banaji, M.R. (1986). Word High and Low in Pleasantness as Rated by Male and Female College Students. Behaviour Research Methods, Instruments, & Computers, 18 (3), 299 – 303. DOI: 10.3758/BF03204403 Bertels, J., Kolinsky, R. & Morais, J. (2009). Norms of Emotional Valence, Arousal, Threat Value and Shock Value for 80 Spoken French Words: Comparison between Neutral and Emotional Tones of Voice. Psychologica Belgica, 49 (1), 19 – 40. http://www .ingentaconnect.com/content/acad/psyb/2009/00000049/00000001/art00002 Bertels, J., Kolinsky, R. & Morais, J. (2010). Emotional Valence of Spoken Words Influences the Spatial Orienting of Attention. Acta Psychologica 134, 264 – 278. DOI: 10.1016/j.actpsy.2010.02.008. Brysbaert, M. & New, B. (2009). Moving beyond Kucera and Francis: a critical evaluation of current word frequency norms and introduction of a new and improved word frequency measure for American English. Behaviour Research Methods, 41 (4), 997 – 990. DOI: 10.3758/BRM.41.4.977. Buxbaum, E. (1949). The Role of Second Language in the Formation of Ego and Superego. Psychoanalytic Quarterly, 18, 279 – 289. Carretié, L., Hinojosa, J.A., Albert, J., López-Martin, S., de la Gándara, B.S., Igoa, J.M., & Sotillo, M. (2008). Modulation of ongoing cognitive processes by emotionally intense 32 words. Psychophysiology, 45 (2), 188 – 196. DOI: 10.1111/j.1469-8986.2007.00617.x Codispoti, M., & De Cesarei, A. (2007). Arousal and attention: Picture size and emotional reactions. Psychophysiology, 44, 680–686. DOI: 10.1111/j.1469-8986.2007.00545.x Colbeck, K.L. & Bowers, J.S. (2012). Blinded by taboo words in L1 but not in L2. Emotion, 12 (2), 217 – 222. DOI: 10.1037/a0026387 Costa, A., Caramazza, A., & Sebastian-Galles, N. (2000). The cognate facilitation effect: Implications for models of lexical access. Journal of Experimental Psychology Learning Memory and Cognition, 26, 1283 – 1296. DOI: 10.1037/0278-7393.26.5.1283 Cuthbert, B.N., Schupp, H.T., Bradley, M.M., Birbaumer, N., & Lang, P.J. (2000). Brain potentials in affective picture processing: covariation with autonomic arousal and affective report. Biological Psychology, 52, 95 – 111. DOI: 10.1016/S0301-0511(99)00044-7, Dewaele, J.-M. (2004a). The Emotional Force of Swearwords and Taboo Words in the Speech of Multilinguals. Journal of multilingual and multicultural development, 25 (2&3), 204 – 222. DOI: 10.1080/01434630408666529 Dewaele, J.-M. (2004b). Blistering Barnacles! What Language do Multilinguals Swear in?! Estudios de Sociolingüística 5 (1), 83 – 105. DOI: 10.1558/sols.v5i1.83 Dewaele, J.-M. (2008). The Emotional Weight of I love you in Multilinguals’ Languages. Journal of Pragmatics, 40, 1753 – 1780. DOI: 10.1016/j.pragma.2008.03.002 Dhooge, E. , De Baene, W., Severens, E., & Hartsuiker, R. J. (submitted). Lifting the ban: How social context affects the brain’s response to taboo words. Eilola, T.M. (2009). Processing emotionally charged words in the first and second language. Unpublished doctoral dissertation, University of Kent, Canterbury, United Kingdom. Eilola, T.M., & Havelka, J. (2010). Behavioural and physiological responses to the emotional and taboo Stroop tasks in native and non-native speakers of English. International Journal of Bilingualism, 15 (3), 353 – 369. DOI: 10.1177/1367006910379263 Eilola, T.M., Havelka, J. & Sharma, D. (2007). Emotional Activation in the First and Second Language. Cognition & Emotion, 21(5), 1064 – 1076. DOI: 10.1080/02699930601054109 Emotions and Cultures (2012). In Wikipedia. Retrieved May 11, 2012, from Wikipedia Ferré, P., Garcia, T., Fraga, I., Sanchez-Casus, R. & Molero, M. (2010). Memory for Emotional Words in Bilinguals: Do Words Have the Same Emotional Intensity in the First and in the Second Language? Cognition and Emotion 24 (5), 760 – 785. Full accDOI: 10.1080/02699930902985779 Grosser, G.S., Walsh, A.A. (1966). Sex differences in the differential recall of taboo and neutral 33 words. Journal of Psychology, 63 (2), 219 – 227. DOI: 10.1080/00223980.1966.10543035 Guillet, R. & Arndt, J. (2009). Taboo Words: The Effect of Emotion of Memory for Peripheral Information. Memory & Cognition, 37 (6), 866 – 879. DOI: 10.3758/MC.37.6.866. Hadley, C.B. & MacKay, D.G. (2006). Does emotion help or hinder immediate memory? Arousal versus priority-binding mechanisms. Journal of Experimental Psychology: Learning, Memory, and Cognition, 32 (1), 79 – 88. DOI: 10.1037/0278-7393.32.1.79 Hajcak, G., MacNamara, A., & Olvet, D.M. (2010). Event-related potentials, emotion and emotion regulation: an integrative review. Developmental neuropsychology, 35 (2), 129 – 155. DOI: 10.1080/87565640903526504. Harris, C.L. (2004). Bilingual Speakers in the Lab: Psychophysiological Measures of Emotional Reactivity. Journal of Multilingual and Multicultural Development, 25 (2 & 3), 223 – 247. DOI: 10.1080/01434630408666530 Harris, C.L. & Ayçiçegi-Dinn, A. (2009). Emotion and lying in a non-native language. International Journal of Psychophysiology, 71, 193 – 204. DOI: 10.1016/j.ijpsycho.2008.09.006. Harris, C.L., Ayçiçegi, A. & Gleason, J.B. (2003). Taboo Words and Reprimands Elicit Greater Autonomic Reactivity in a First Language than in a Second Language. Applied Psycholinguistics, 24 (4), 561 – 579. DOI: 10.1017/S0142716403000286 Hinojosa, J.A., Carretié, L., Valcárcel, M.A., Méndez-Bértolo, C., & Pozo, M.A. (2009). Electrophysiological differences in the processing of affective information in words and pictures. Cognitive, Affective& Behavioural Neuroscience, 9, 173 – 189. DOI: 10.3758/CABN.9.2.173 Janschewitz, K. (2008). Taboo, Emotionally Valenced, and Emotionally Neutral Word Norms. Behaviour Research Methods, 40 (4), 1065 – 1074. DOI: 10.3758/BRM.40.4.1065 Javier, R.A. (1996). In Search of Repressed Memories in Bilingual Individuals. In R.P. Foster, M., Moskovitz & R.A. Javier (Eds.), Reaching across boundaries of culture and class: Widening the scope of psychotherapy (225 – 241). Northvale, NJ: Aronson. Jay, T.B. (1992). Cursing in America. Philadelphia: John Benjamins. Jay, T.B. (2000). Why We Curse: A Neuro-Psycho-Social Theory of Speech. Philadelphia: John Benjamins Pub Co. Jay, T., Caldwell-Harris, C. & King, K. (2008). Recalling Taboo and Nontaboo Words. American Journal of Psychology, 121 (1), 83 – 103. https://www.mcla.edu/Undergraduate/ uploads/textWidget/1457.00016/documents/RecallTabooNonTabooWords.pdf Kensinger, E.A. & Corkin, S. (2003). Memory enhancement for emotional words: Are emotional 34 words more vividly remembered than neutral words? Memory & Cognition, 31 (8), 1169 – 1180. DOI: 10.3758/BF03195800 Keuleers, E., Brysbaert, M. & New, B. (2010). SUBTLEX-NL: A new frequency measure for Dutch words based on film subtitles. Behavior Research Methods, 42(3), 643-650. DOI: 10.3758/BRM.42.3.643. Kissler, J., & Koessler, S. (2011). Emotionally positive stimuli facilitate lexical decisions – An ERP study. Biological Psychology, 86, 254 – 264. DOI: 10.1016/j.biopsycho.2010.12.006. Lemhöfer, K., & Broersma, M. (2011). Introducing LexTale, a quick and valid lexical test for advanced learners of English. Behaviour Research Methods DOI: 10.3758/s13428-0110146-0 Leventhal, H. & Scherer, K. (1987). The relationship of emotion and cognition: a functional approach to a semantic controversy. Cognition & Emotion, 1 (1), 3 – 28 DOI: 10.1080/02699938708408361 MacKay, D.G., & Ahmetzanov, M.V. (2005). Emotion, Memory and Attention in the Taboo Stroop Paradigm – An Experimental Analogue of Flashbulb Memories. Psychological science, 16 (1), 25 – 32. DOI: 10.1111/j.0956-7976.2005.00776.x MacKay, D.G., Shafto, M., Taylor, J.K., Marian, D.E., Abrams, L. & Dyer, J.R. (2004). Relations between emotion, memory and attention: Evidence from taboo Stroop, lexical decision, and immediate memory tasks. Memory & Cognition, 32 (3), 474 – 488. DOI: 10.3758/BF03195840 Mathewson, K.J., Arnell, K.M. & Mansfield, C.A. (2008). Capturing and holding attention: The impact of emotional words in rapid serial visual presentation. Memory and Cognition, 36 (1), 182 – 200. DOI: 10.3758/MC.36.1.182 McCarthy, G., & Donchin, E. (1981). A metric for thought: a comparison of P300 latency and response time. Science, 211, 77 – 80. DOI: 10.1126/science.7444452 Moors, A., De Houwer, J., Hermans, D., Wanmaker, S., van Schie, K., Van Harmelen, A.L., De Schryver, M., De Winne, J., & Brysbart, M. (2013). Norms of valence, arousal, dominance and age of acquisition for 4,300 Dutch words. Behaviour Research Methods, 45 (1), 169 - 177. DOI: 10.3758/s13428-012-0243-8. Nowagk, R., Pfeifer, E. (1996). UNIX implementation of the ERP evaluation package (EEP 3.0). In A.D. Friederici & D.Y. Von Cramon (Eds.), Annual report of Max-Planck Institute of Cognitive Neuroscience. Liepzig, Germany: Max-Planck-Institute. Opitz, B., & Degner, J. (2012). Emotionality in a second language: It’s a matter of time. 35 Neuropsychologia, 50 (8), 191 - 1967. DOI: 10.1016/j.neuropsychologia.2012.04.021 Palazova, M., Mantwill, K., Sommer, W., & Schacht, A. (2011). Are effects of emotion single words non-lexical? Evidence from event-related brain potentials. Neuropsychologia, 49, 2766 – 2775. DOI: 10.1016/j.neuropsychologia.2011.06.005. Pavlenko, A. (2002). Bilingualism and Emotions. Multilingua, 21. 45 – 78. DOI: 10.1515/mult.2002.004 Polich, J. (1987). Task difficulty, probability, and interstimulus interval as determinants of P300 from auditory stimuli. Electroencephalography and Clinical Neurophysiology, 68, 311 – 320. DOI: 10.1016/0168-5597(87)90052-9 Pratto, F. & John, O.P. (1991). Automatic vigilance: The attention-grabbing power of negative social information. Journal of Personality and Social Psychology, 61, 380 – 391. DOI: 10.1037/0022-3514.61.3.380 Scott, G.C., O’Donnell, P.J., Leuthold, H., & Sereno, S.C. (2009). Early emotion word processing: evidence from event-related potentials. Biological Psychology, 80, 95 – 104. DOI: 10.1016/j.biopsycho.2008.03.010. Schrauf, R.W. (2000). Bilingual Autobiographical Memory: Experimental Studies and Clinical Cases. Culture & Psychology, 6 (4), 387 – 417. DOI: 10.1177/1354067X0064001 Schupp, H. T., Cuthbert, B. N., Bradley, M. M., Hillman, C. H., Hamm, A. O., & Lang, P. J. (2004). Brain processes in emotional perception: Motivated attention.Cognition and Emotion, 18, 593–611. DOI:10.1080/02699930341000239 Severens, E., Janssens, I., Kühn, S., Brass, M., Hartsuiker, R.J. (2011). When the brain tames the tongue: Covert editing of inappropriate language. Psychophysiology, 86, 1252 – 1257. DOI: 10.1111/j.1469-8986.2011.01190.x. Speedie, L.J., Wertman, E., Ta’ir, J., & Heilman, K.M. (1993). Disruption of automatic speech following a right basal ganglia lesion. Neurology, 43 (9), 1768 – 1774. DOI: 10.1212/WNL.43.9.1768 Stroop, J.R. (1935). Studies of interference in serial verbal reactions. Journal of Experimental Psychology, 18, 643 – 662. DOI: 10.1037/0096-3445.121.1.15 Sutton, T.M., Altarriba J., Bianico, J.L. & Basnight-Brown, D.M. (2007). The automatic access of emotion: Emotional Stroop effects in Spanish-English bilingual speakers. Cognition & Emotion, 21 (5), 1077 – 1090. DOI: 10.1080/02699930601054133 Sutton, S., Braren, M., Zubin, J., & John, E.R. (1965). Evoked-potential correlates of stimulus uncertainty. Science, 150, 1187 – 1188. DOI: 10.1126/science.150.3700.1187 Thomas, S.J., Johnstone, S.J. & Gonsalvez, C.J. (2007). Event-related Potentials During an 36 Emotional Stroop Task. International Journal of Psychophysiology, 63, 221 – 231. DOI: 10.1016/j.ijpsycho.2006.10.002 Van Lancker, D., & Cummings, J.L. (1999). Expletives: neurolinguistic and neurobehavioral perspectives on swearing. Brain Research Reviews, 31, 83 – 104. DOI: 10.1016/S01650173(99)00060-0 Van Strien, J.W. (2002). The Dutch handedness questionnaire. Based on Van Strien, J.W. (1988). Handedness and hemispheric laterality. Thesis, Vrije universiteit Amsterdam, The Netherlands. http://1011.psyweb.nl/homepage/jan_van_strien_files/ hquestionnaire_article.pdf Wu, Y.J., & Thierry, G. (2012). How reading in a second language protect your heart. The Journal of Neuroscience, 32 (19), 6485 – 6489. DOI: 10.1523/ JNEUROSCI.6119-11.2012 Zeelenberg, R., Bocanegra, B.R. & Pecher, D. (2011). Emotion-Induced Impairments in Speeded Word Recognition Tasks. Experimental Psychology, 58 (5), 400 – 411. DOI: 10.1027/16183169/a000109. 37 Appendix A Table 7: Critical items in experiment 1 English Negative Abuse Addict Avalanche Bullet Crime Damage Danger Despair Disease Disgust Dungeon Fear Grief Humiliation Injury Manslaughter Massacre Misery Mutilation Paralysis Plague Poison Rage Resignation Suffocation Violence Control Pulse Switch Broadcast Border Entry Weight Amount Descent Current Bracket Lobster Meat Speed Expectation Player Refrigerator Corridor Mirror Replacement Publisher Button Cattle Soil Inheritance Hibernation Evidence Dutch Negative Misbruik Verslaafde Lawine Kogel Misdaad Schade Gevaar Wanhoop Ziekte Afkeer Kerker Angst Verdriet Vernedering Letsel Doodslag Bloedbad Ellende Verminking Verlamming Pest Gif Woede Ontslag Suffocation Geweld Control Hartslag Schakelaar Omroep Grens Toegang Belang Aantal Afkomst Stroom Beugel Kreeft Vlees Snelheid Verwachting Speler Koelkast Doorgang Spiegel Vervanging Uitgeverij Knop Vee Bodem Erfenis Winterslaap Bewijs 38 Appendix B Table 8: Averages and standard deviation of the amplitude of the waveform in the critical time window 400ms - 540ms in experiment 1. F3 F1 Fz F2 F4 L1 neutral Mean 0,99278 -0,26930 -0,27258 0,18616 -2,71765 Stdev 1,37943 2,11429 1,57795 1,91881 11,92100 L1 negative Mean 1,09213 0,38673 0,37596 0,55266 -1,50621 Stdev 1,75569 1,99819 1,58566 1,99978 8,17608 L2 neutral Mean 0,29932 -0,36802 0,25965 0,37243 -1,77550 FC3 FC1 FCz FC2 FC4 Stdev 2,16610 3,52334 3,25674 2,75042 10,01782 L2 negative Mean 0,38841 -0,47379 0,39067 0,14134 -0,62635 Stdev 2,14759 2,41585 1,81831 1,73593 2,56415 0,66627 0,22438 -0,13217 -0,32029 -0,39369 1,16127 1,42856 2,05770 2,16530 2,75201 1,36716 1,25182 0,53903 0,81383 0,55413 1,80479 1,93083 2,05608 2,27608 2,77868 1,00455 0,86452 0,76614 0,57338 0,36553 2,35944 2,77838 2,68919 3,10241 2,41620 0,95057 0,66656 0,82415 0,16241 0,04337 1,80915 2,03556 2,34313 2,35515 2,12826 C3 C1 Cz C2 C4 1,80853 -0,26131 0,09137 -0,28010 0,47788 1,87405 2,54506 2,19231 2,20576 1,91876 2,94998 0,90166 1,07862 1,13203 1,52720 2,18175 2,67411 3,05945 2,72414 2,12243 2,37495 0,62780 0,58143 0,80783 0,24932 2,67510 3,24280 2,92091 2,87179 2,75661 2,30742 0,55301 0,77959 0,87430 0,55818 2,14304 2,54596 3,02018 2,22104 2,66235 CP3 CP1 CPz CP2 CP4 1,77403 0,81620 0,43943 0,78572 1,64188 2,58534 2,81475 2,16491 2,06729 2,31414 3,07727 2,14542 1,81276 1,98782 2,30776 3,20901 2,88055 2,88965 2,59301 2,65516 1,81126 1,76326 1,50917 1,62433 1,38814 3,83035 2,46802 3,43296 2,96418 3,23413 2,38795 2,03441 1,28484 1,71138 1,36736 2,78664 2,30494 3,16935 2,65119 2,62942 P3 P1 Pz P2 P4 1,54686 2,11391 1,55957 1,15094 1,84461 2,98707 1,95784 2,01366 2,65422 3,34149 3,02241 3,27787 3,21988 2,41034 2,95479 3,72484 2,72912 2,98329 2,74582 3,17788 2,11155 2,66562 1,73545 1,51620 1,94117 3,47351 3,27591 3,31021 3,21477 2,54075 2,59748 2,97223 2,10690 1,88220 2,17973 3,11896 2,92815 3,19391 2,91173 4,13656 39 Appendix C Table 9: Averages and standard deviation of the amplitude of the waveform in the critical time window 600ms - 880ms in experiment 1. F3 F1 Fz F2 F4 L1 neutral Mean -0,24151 -1,51777 -1,22519 -0,40744 -3,76361 Stdev 2,50877 2,52406 2,35666 2,12173 14,43403 L1 negative Mean 0,14285 -1,09979 -0,79006 -0,41384 -2,21507 Stdev 2,91734 2,84958 3,61421 3,14340 9,26976 L2 neutral Mean -0,28833 -1,06424 -0,10124 0,38287 -2,41401 FC3 FC1 FCz FC2 FC4 Stdev 2,44251 4,26945 3,55663 2,86952 13,31273 L2 negative Mean -0,10233 -1,31538 -0,03236 0,30843 -0,09176 Stdev 2,82969 3,81686 2,38268 2,14758 2,43001 -1,00620 -1,25869 -1,09078 -1,13435 -1,30545 2,48349 2,42150 2,83157 3,31278 3,29075 -0,36019 -0,22167 -0,95416 -0,36124 -0,79821 3,16826 3,84772 3,67631 3,98512 3,80934 0,63092 0,75908 0,51364 0,76560 1,00096 2,69856 3,27646 2,88926 3,41783 2,54552 0,68909 0,52710 0,77067 0,41092 1,00280 2,00531 2,57287 3,26268 3,10931 2,29505 C3 C1 Cz C2 C4 -0,18579 -2,08415 -0,88581 -1,32391 -0,77997 3,20756 3,31670 2,72994 2,86092 2,48934 0,54556 -1,47273 -0,51475 -0,59945 -0,25479 4,00206 4,26005 4,75378 3,99250 2,51596 2,18937 0,38205 0,76917 1,06387 0,54381 3,19354 4,39163 3,57425 3,50125 3,02785 1,95551 0,28724 1,09562 1,42351 0,88826 2,53948 3,22052 3,69328 2,66997 2,90898 CP3 CP1 CPz CP2 CP4 -0,73061 -1,30646 -1,07675 -0,81100 -0,04177 3,67812 3,52017 2,74536 2,35905 2,83846 0,21773 -0,37472 -0,43975 -0,32678 0,27645 4,90126 3,85175 4,27595 3,69473 4,29503 1,50923 1,60834 1,76142 1,77383 1,40493 4,04210 2,95759 4,22230 3,43636 3,68027 1,83424 1,82565 1,40662 1,96746 1,30432 3,17893 2,08496 3,74979 2,82976 2,81214 P3 P1 Pz P2 P4 -1,62776 -0,43240 -0,90815 -1,19913 -0,28532 4,18591 2,79894 2,71175 3,10985 4,06261 -0,79973 0,03048 0,22284 -0,70783 -0,39594 5,18372 4,03504 3,96271 3,85340 3,39667 1,24234 2,03956 1,15495 1,05482 1,99409 3,76014 3,65705 4,09625 4,17755 3,69446 1,59026 2,34867 1,64044 1,54743 1,81299 3,29705 3,08576 3,64373 3,09586 4,58834 40 Appendix D Figure 7. Grand averages for emotion words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes F3, F1 and Fz. 41 Figure 8. Grand averages for emotion words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes F2, F4 and FC3. 42 Figure 9. Grand averages for emotion words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes FC1, FCz and FC2. 43 Figure 10. Grand averages for emotion words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes FC4, C3 and C1. 44 Figure 11. Grand averages for emotion words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes Cz, C2 and C4. 45 Figure 12. Grand averages for emotion words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes CP3, CP1 and CPz. 46 Figure 13. Grand averages for emotion words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes CP2, CP4 and P3. 47 Figure 14. Grand averages for emotion words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes C1, Pz and P2. 48 Figure 15. Grand averages for emotion words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodee P4. 49 Appendix E Table 10: Critical items in experiment 2 English Taboo Ass Bitch Blowjob Boner Boobs Cocksucker Crap Cunt Dickhead Fart Fatso Sissy Fuck Fucker Jerk Loser Moron Motherfucker Nigger Puke Queer Rape Retard Screw Shit Shithead Tits Wanker Whore Control Reet Teef Pijpbeurt Stijve Borsten Rotzak Kak Kut Eikel Scheet Vetzak Mietje Neuken Klootzak Smeerlap Sukkel Debiel Hoerenjong Neger Kots Janet Verkrachting Achterlijke Poepen Schijten Lul Tieten Rukker Hoer Dutch Taboo Ant Paint Artwork Peach Watch Profession Poke Push Customer Frog Diver Value Send Answer Twin Cover Whale Satisfaction Shovel Glue Candy Cure Change Erase Jump Granddad Hill Carpet Guide Control Mier Verf Kunstwerk Perzik Horloge Beroep Por Duw Klant Kikker Duiker Waarde Sturen Antwoord Tweeling Omslag Walvis Voldoening Schop Lijm Snoep Geneesmiddel Verandering Wissen Springen Opa Heuvel Tapijt Gids 50 Appendix F Table 11: Averages and standard deviation of the amplitude of the waveform in the critical time window 380ms - 820ms in experiment 2. F3 F1 Fz F2 F4 L1 neutral Mean 1,60014 2,16024 1,93882 1,76736 0,67580 Stdev 3,14775 4,14551 3,37358 4,56258 2,86920 L1 negative Mean 2,33117 3,09324 3,27483 3,39822 1,30237 Stdev 3,22654 4,14443 4,03340 5,51658 2,42457 L2 neutral Mean 1,63414 1,82506 1,94311 1,27487 0,41760 FC3 FC1 FCz FC2 FC4 Stdev 2,68101 2,21441 2,39488 2,17824 3,40613 L2 negative Mean 1,83958 2,39583 3,06417 1,61930 1,02912 Stdev 3,07845 2,21083 2,19999 2,13258 3,61294 2,69668 1,87671 1,62029 2,52633 2,64178 4,10560 4,38361 3,70422 3,89061 4,20885 3,10994 3,58435 3,04763 3,07410 3,83410 4,65398 5,33704 4,79089 4,58183 5,22411 2,61743 2,07417 1,09905 1,90575 1,98833 2,65975 2,99077 2,39983 2,33038 2,91540 2,95962 3,11657 2,07534 2,75983 2,05681 3,00411 2,88856 2,34462 2,34214 3,15018 C3 C1 Cz C2 C4 2,71836 2,06443 1,77621 1,83536 2,68924 3,86746 4,05321 4,29959 4,13679 4,14092 3,80990 3,67644 3,28497 3,62526 3,97603 5,06650 4,94041 5,61486 5,08198 4,86204 2,71556 1,95137 1,63669 1,87680 2,71592 3,04742 2,50684 2,49290 2,66170 3,13307 3,56581 2,77942 2,48318 2,63188 3,05407 3,62755 2,45898 2,71233 2,80040 3,35882 CP3 CP1 CPz CP2 CP4 2,74269 2,56472 2,13010 3,03039 3,63792 4,12760 4,54244 4,74765 4,09203 5,64263 4,32158 4,40484 4,29847 4,55287 4,84378 5,29785 5,14201 5,65638 5,13839 6,45185 3,51245 2,45080 2,49835 2,51319 3,26101 4,88377 2,80993 2,92720 3,19800 3,61060 3,50621 3,53938 3,75404 3,22433 4,62735 3,02503 3,27443 3,29099 3,63860 3,85284 P3 P1 Pz P2 P4 2,47852 2,87847 2,30730 2,96265 2,48219 4,05894 5,91812 4,86513 4,48866 3,48167 4,67772 5,02280 4,50934 4,99206 5,04609 5,40456 6,57932 5,98991 5,43712 5,18523 2,30631 2,72942 2,69882 2,69735 4,06102 3,48684 3,08792 2,99706 3,53262 5,77600 3,85779 3,64779 3,74797 3,60059 2,52528 3,92068 3,61525 3,67632 3,97194 4,40752 51 Appendix G Figure 16. Grand averages for taboo words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes F3, F1 and Fz. 52 Figure 17. Grand averages for taboo words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes F2, F4 and FC3. 53 Figure 18. Grand averages for taboo words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes FC1, FCz and FC2. 54 Figure 19. Grand averages for taboo words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes FC4, C3 and C1. 55 Figure 20. Grand averages for taboo words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes Cz, C2 and C4. 56 Figure 21. Grand averages for taboo words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes CP3, CP1 and CPz. 57 Figure 22. Grand averages for taboo words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes CP2, CP4 and P3. 58 Figure 23. Grand averages for taboo words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes P1, Pz and P2. 59 Figure 24. Grand averages for taboo words (solid lines) and neutral words (dotted lines) for L1 (green lines) and L2 (red lines) of electrodes P4.
© Copyright 2026 Paperzz