Annotating Topics of Opinions Veselin Stoyanov Claire Cardie Talk Overview Fine-grained sentiment analysis Definitions Examples Opinion topic annotation Definitions Issues Approach and Corpus IA agreement May 29, 2008 LREC 2008, Marakech, Morocco Background Sentiment Analysis: Extraction and representation of attitudes, evaluations, opinions, and sentiment in text. Fine-grained Sentiment Analysis: At the level of individual expressions of opinions. May 29, 2008 LREC 2008, Marakech, Morocco Fine-grained vs. Coarse-grained Sentiment Analysis Coarse-grained May 29, 2008 Sentiment classification Useful in the product review domain Review 1 Positive Review 2 Negative Fine-grained Individual expressions of opinions Multiple opinions per document (even sentence) The [SThe Australian Australian press press] has has launched launched a bitter a bitter attack on attack Italy on after [TItaly] seeing after their seeing beloved their Socceroos beloved eliminated [TSocceroos] on aeliminated controversial on alate controversial penalty. Italian late coach [Tpenalty]. Lippi has [S+Tbeen Italian blasted coach for Lippi] his has favorable also been comments blasted fortoward his favorable the penalty. comments toward [Tthe penalty]. Lippi is preparing his side for the upcoming clash with LippiUkraine. is preparing He hailed his side 10-man for the Italy's upcoming clash determination with Ukraine. to [SHe] beathailed Australia 10-man and reiterated [TItaly]'s that the determination penalty was to rightly beat given. Australia and reiterated that the [Tpenalty] was rightly given. LREC 2008, Marakech, Morocco Fine-grained opinions: Example The Australian press has launched a bitter attack on Italy. May 29, 2008 LREC 2008, Marakech, Morocco Fine-grained opinions: Example The Australian press has launched a bitter attack on Italy. Definitions differ, but five main components: Opinion trigger (opinion words) Source (opinion holder) Polarity – positive/negative Strength Topic (target) May 29, 2008 LREC 2008, Marakech, Morocco Fine-grained opinions: Example The Australian press has launched a bitter attack on Italy. Definitions differ, but five main components: Opinion trigger (opinion words) Source (opinion holder) Polarity – positive/negative Strength Topic (target) May 29, 2008 launched a bitter attack LREC 2008, Marakech, Morocco Fine-grained opinions: Example [SThe Australian press] has launched a bitter attack on Italy. Definitions differ, but five main components: Opinion trigger (opinion words) Source (opinion holder) Polarity – positive/negative Strength Topic (target) May 29, 2008 launched a bitter attack The Australian press LREC 2008, Marakech, Morocco Fine-grained opinions: Example [SThe Australian press] has launched a bitter attack on Italy. Definitions differ, but five main components: Opinion trigger (opinion words) Source (opinion holder) Polarity – positive/negative Strength Topic (target) May 29, 2008 launched a bitter attack The Australian press negative LREC 2008, Marakech, Morocco Fine-grained opinions: Example [SThe Australian press] has launched a bitter attack on Italy. Definitions differ, but five main components: Opinion trigger (opinion words) Source (opinion holder) Polarity – positive/negative Strength Topic (target) May 29, 2008 launched a bitter attack The Australian press negative high LREC 2008, Marakech, Morocco Fine-grained opinions: Example [SThe Australian press] has launched a bitter attack on [TItaly] Definitions differ, but five main components: Opinion trigger (opinion words) Source (opinion holder) Polarity – positive/negative Strength Topic (target) May 29, 2008 launched a bitter attack The Australian press negative high Italy LREC 2008, Marakech, Morocco Fine-grained opinions Five components Source (opinion holder) Opinion trigger (opinion words) As above Strength e.g. [Yu and Hatzivassiloglou, 2003] [Riloff and Wiebe, 2003] Polarity – positive/negative e.g. [Bethard et al., 2004] [Choi et al., 2005] [Kim and Hovy, 2006] e.g. [Wilson et al. 2004] Topic (target) May 29, 2008 ???? LREC 2008, Marakech, Morocco Annotating Topics of Fine-grained Opinions Definitions Issues Approach and Corpus IA agreement May 29, 2008 LREC 2008, Marakech, Morocco Examples (1)[OH John] likes Marseille for its weather and cultural diversity. (2)[OH Al] thinks that the government should tax gas more in order to curb CO2 emissions. May 29, 2008 LREC 2008, Marakech, Morocco Definitions (1)[OH John] likes Marseille for its weather and cultural diversity. May 29, 2008 LREC 2008, Marakech, Morocco Definitions (1)[OH John] likes Marseille for its weather and cultural diversity. Topic: city of Marseille Topic - the real-world object, event or abstract entity that is the subject of the opinion as intended by the opinion holder May 29, 2008 LREC 2008, Marakech, Morocco Definitions (1)[OH John] likes [TOPIC SPAN Marseille] for its weather and cultural diversity. Topic: city of Marseille Topic - the real-world object, event or abstract entity that is the subject of the opinion as intended by the opinion holder Topic span - the closest, minimal span of text that mentions the topic May 29, 2008 LREC 2008, Marakech, Morocco Definitions (1)[OH John] likes [TARGET+TOPIC SPAN Marseille] for its weather and cultural diversity. Topic: city of Marseille Topic - the real-world object, event or abstract entity that is the subject of the opinion as intended by the opinion holder Topic span - the closest, minimal span of text that mentions the topic Target span - the span of text that covers the syntactic surface form comprising the contents of the opinion May 29, 2008 LREC 2008, Marakech, Morocco Definitions (2)[OH Al] thinks that the government should tax gas more in order to curb CO2 emissions. May 29, 2008 LREC 2008, Marakech, Morocco Definitions (2)[OH Al] thinks that [TARGET SPAN the government should tax gas more in order to curb CO2 emissions]. May 29, 2008 LREC 2008, Marakech, Morocco Definitions (2)[OH Al] thinks that [TARGET SPAN [TOPIC SPAN? the government] should [TOPIC SPAN? tax gas] more in order to [TOPIC SPAN? curb [TOPIC SPAN? CO2 emissions]]]. May 29, 2008 LREC 2008, Marakech, Morocco Definitions (2)[OH Al] thinks that [TARGET SPAN the government should tax gas more in order to curb CO2 emissions]. Context: (3) Although he doesn’t like government imposed taxes, he thinks that a fuel tax is the only effective solution. May 29, 2008 LREC 2008, Marakech, Morocco Definitions (2)[OH Al] thinks that [TARGET SPAN the government should [TOPIC SPAN tax gas] more in order to curb CO2 emissions]. Context: (3) Although he doesn’t like government imposed taxes, he thinks that a fuel tax is the only effective solution. May 29, 2008 LREC 2008, Marakech, Morocco Related Work Product reviews E.g. Kobayashi et al. (2004), Yi et al. (2003), Popescu and Etzioni (2005), Hu and Liu (2004 Limit “topics” to mentions of product names, components, and their attributes Lexicon look-up Focused on methods for lexicon acquisition MPQA corpus (Wiebe, Wilson, Cardie, 2004) Fine-grained opinions Topic annotation deemed too difficult Target span annotation is underway Kim & Hovy (2006) Target span extraction using semantic frames Limited evaluation May 29, 2008 LREC 2008, Marakech, Morocco Issues in Opinion Topic Identification Multiple potential topics mentioned within a single target span (2)[OH Al] thinks that [TARGET SPAN [TOPIC SPAN? the government] should [TOPIC SPAN? tax gas] more in order to [TOPIC SPAN? curb [TOPIC SPAN? CO2 emissions]]]. Requires context Topic of an opinion is the entity that comprises the main information goal of the opinion based on the discourse context. May 29, 2008 LREC 2008, Marakech, Morocco Issues in Opinion Topic Identification Opinion topics are not always explicitly mentioned (4) [OH John] believes the violation of Palestinian human rights is one of the main factors. Topic: ISRAELI-PALESTINIAN CONFLICT (5) [OH I] disagree entirely! May 29, 2008 LREC 2008, Marakech, Morocco A Coreference Approach Hypothesize that the notion of topic coreference will facilitate identification of opinion topics Two opinions are topic-coreferent if they share the same opinion topic. Easier than specifying the topic of each opinion in isolation May 29, 2008 LREC 2008, Marakech, Morocco Opinion Topic Corpus Build on the MPQA corpus: (www.cs.pitt.edu/mpqa) 535 Documents manually annotated for finegrained opinions No opinion topic annotation Our goal: Add the opinion topic information on top of the existing opinion annotations Created and used a GUI May 29, 2008 LREC 2008, Marakech, Morocco Annotation Process List of opinions to be processed May 29, 2008 Set of current clusters LREC 2008, Marakech, Morocco Document text Annotation Process May 29, 2008 LREC 2008, Marakech, Morocco Annotation Process fuel tax May 29, 2008 LREC 2008, Marakech, Morocco Interannotator Agreement Annotator 1 Annotator 2 150 of the 535 MPQA documents 20 of these 150 IAG measures from noun phrase coreference resolution B3 a CEAF all opinions .64 .55 .69 sentiment-bearing opinions .72 .73 .80 strong opinions .74 .77 .82 May 29, 2008 LREC 2008, Marakech, Morocco Interannotator Agreement Annotator 1 Annotator 2 150 of the 535 MPQA documents 20 of these 150 IAG measures from noun phrase coreference resolution B3 a CEAF all opinions .64 .55 .69 sentiment-bearing opinions .72 .73 .80 strong opinions .74 .77 .82 May 29, 2008 LREC 2008, Marakech, Morocco Baselines all-in-one 1 opinion per cluster assigns all opinions to the same cluster assigns each opinion to its own cluster same paragraph opinions in the same paragraph are assigned to the same cluster May 29, 2008 LREC 2008, Marakech, Morocco Results Baselines B3 a CEAF all-in-one .37 -.10 .30 1 opinion per cluster .29 .22 .27 same paragraph .55 .31 .50 vs. Interannotator agreement all opinions .64 .55 .69 sentiment-bearing .72 .73 .80 strong opinions .74 .77 .82 May 29, 2008 LREC 2008, Marakech, Morocco Thank you Questions? Annotation instructions + more information available at: www.cs.cornell.edu/~ves May 29, 2008 LREC 2008, Marakech, Morocco Example The Australian press has launched a bitter attack on Italy after seeing their beloved Socceroos eliminated on a controversial late penalty. Italian coach Lippi has been blasted for his favorable comments toward the penalty. Lippi is preparing his side for the upcoming clash with Ukraine. He hailed 10-man Italy's determination to beat Australia and reiterated that the penalty was rightly given. May 29, 2008 LREC 2008, Marakech, Morocco Example – fine-grained opinions [SThe Australian press] has launched a bitter attack on [TItaly] after seeing [Stheir] beloved [TSocceroos] eliminated on a controversial late [Tpenalty]. [S+TItalian coach Lippi] has also been blasted for his favorable comments toward [Tthe penalty]. Lippi is preparing his side for the upcoming clash with Ukraine. [SHe] hailed 10-man [TItaly]'s determination to beat Australia and reiterated that [Tthe penalty] was rightly given. May 29, 2008 LREC 2008, Marakech, Morocco Motivation Sentiment analysis: Useful as stand-alone application Product reviews Tracking opinions in the press Flame detection, etc. Opinion information can benefit many NLP applications Multi-Perspective Question Answering [Stoyanov, Cardie, Litman and Wiebe. AAAI WS 2004] and [Stoyanov, Cardie and Wiebe. HLT-EMNLP 2005] Opinion-Oriented Information Retrieval Clustering, etc. May 29, 2008 LREC 2008, Marakech, Morocco Annotation Process May 29, 2008 LREC 2008, Marakech, Morocco Annotation Process May 29, 2008 LREC 2008, Marakech, Morocco ` May 29, 2008 LREC 2008, Marakech, Morocco
© Copyright 2025 Paperzz