PDF

Semantic Parsing for Single-Relation Question Answering
Scott Wen-tau Yih, Xiaodong He & Christopher Meek
{scottyih, xiaohe, meek}@microsoft.com
Open-Domain QA using Knowledge Base
semantic parsing
πœ†π‘₯. sisterβˆ’of(justinβˆ’bieber, π‘₯)
query
inference
Convolutional Deep Semantic Similarity Model [Shen et al., 2014]
Siamese neural networks
Input
Who is Justin Bieber’s sister?
Jazmyn Bieber
Task & Problem Definition
β€’ A KB as a collection of triples (π‘Ÿ, 𝑒1 , 𝑒2 )
β€’ A single-relation question, describing a relation and
one of its entity arguments
β€œWhen were DVD players invented?”
β€’ Input is mapped to two π‘˜-dimensional vectors
β€’ Probability is determined by softmax of their cosine similarity
500
Semantic projection matrix: Ws
Max pooling layer: v
500
...
exp cos(𝑦𝑅 , 𝑦𝑃 )
𝑃 𝑅𝑃 =
𝑅′ exp cos(𝑦𝑅′ , 𝑦𝑃 )
Output
β€’ An entity that has the relation with the given entity
Semantic layer: y
𝑦𝑃 ∈ Rπ‘˜
Max pooling operation
max
max
...
...
max
...
𝑦𝑅 ∈ Rπ‘˜
...
500
500
...
500
15K
15K
15K
...
15K
15K
<s>
w1
w2
wT
<s>
Convolutional layer: ht
Convolution matrix: Wc
siblingβˆ’of(justinβˆ’bieber, jazmynβˆ’bieber)
gender(jazmynβˆ’bieber, female)
Knowledge
Base
High-level Approach: Semantic Parsing
Word hashing layer: ft
π‘€β„Žπ‘’π‘› π‘€π‘’π‘Ÿπ‘’ 𝑋 𝑖𝑛𝑣𝑒𝑛𝑑𝑒𝑑
beβˆ’inventβˆ’in2
Word hashing matrix: Wf
Word sequence: xt
𝑄 =β€œWhen were DVD players invented?”
οƒ˜ Most common questions in the search query logs
β€’ β€œHow old is Kirk Douglas, the actor?”
β€’ β€œWhat county is St. Elizabeth MO in?”
β€’ β€œWhat year was the 8 track invented?”
β€’ β€œWho owns the Texas Rangers?”
πœ†π‘₯. beβˆ’inventβˆ’in(dvdβˆ’player, π‘₯)
οƒ˜ Foundation for answering complicated questions
β€’ β€œName a director of movies starred by Tom Hanks.”
β€’ CKY parsing that chains answers of single-relation
questions [Bao et al., 2014]
οƒ˜ Challenge: lots of ways to ask the same question
β€’ β€œWhat was the date that Minnesota became a state?”
β€’ β€œMinnesota became a state on?”
β€’ β€œWhen was the state Minnesota created?”
β€’ β€œMinnesota's date it entered the union?”
β€’ β€œWhen was Minnesota established as a state?”
β€’ β€œWhat day did Minnesota officially become a state?”
β€’ β‹―
Key Ideas & Related Work
οƒ˜ Simple Context-Free Grammar
β€’ Separate a question into a relation pattern and an entity
mention
οƒ˜ Paraphrase detection using convolutional neural net
οƒ˜ Inspired by Paralex [Fader et al. 2013]
β€’ 35M question paraphrase pairs from WikiAnswers
β€’ Learn weighted lexical matching rules
Procedure: Enumerate All Hypotheses
𝑄 = β€œWhen were DVD players invented?”
𝑃 β†’ π‘€β„Žπ‘’π‘› X π‘π‘™π‘Žπ‘¦π‘’π‘Ÿπ‘  𝑖𝑛𝑣𝑒𝑛𝑑𝑒𝑑
𝑀 β†’ π‘€π‘’π‘Ÿπ‘’ 𝐷𝑉𝐷
𝑄 = β€œWhen were DVD players invented?”
𝑃 β†’ π‘€β„Žπ‘’π‘› π‘€π‘’π‘Ÿπ‘’ X 𝑖𝑛𝑣𝑒𝑛𝑑𝑒𝑑
𝑀 β†’ 𝐷𝑉𝐷 π‘π‘™π‘Žπ‘¦π‘’π‘Ÿπ‘ 
π‘ƒπ‘Ÿπ‘œπ‘(beβˆ’inventβˆ’in2|π‘€β„Žπ‘’π‘› π‘€π‘’π‘Ÿπ‘’ X 𝑖𝑛𝑣𝑒𝑛𝑑𝑒𝑑) = 0.5
π‘ƒπ‘Ÿπ‘œπ‘(dvdβˆ’player|𝐷𝑉𝐷 π‘π‘™π‘Žπ‘¦π‘’π‘Ÿπ‘ ) = 0.7
π‘ƒπ‘Ÿπ‘œπ‘ πœ†π‘₯. beβˆ’inventβˆ’in(dvdβˆ’player, π‘₯) 𝑄 = 0.35
Paraphrase Detection via
Siamese Neural Networks!
Experiments: Data & Task
Experiments: Results
β€’ Same test questions in the Paralex dataset
β€’ 698 questions from 37 clusters
Knowledge base: ReVerb [Fader et al., 2011]
Relation
be-official-language
be-second-largest-city-in
be-tallest-mountain-in
Entity Argument #1
chinese-and-english
arequipa
ararat
Entity Argument #2
hong-kong
peru
armenia
have-population-of
provide
use-for
…
city-of-vancouver
microsoft
laser
…
587,891
office-software
lasik
…
1
Ours
0.95
Precision
Single-Relation Questions
𝑄 β†’π‘ƒβˆ§π‘€
𝑃 β†’ π‘€β„Žπ‘’π‘› π‘€π‘’π‘Ÿπ‘’ X 𝑖𝑛𝑣𝑒𝑛𝑑𝑒𝑑
𝑀 β†’ 𝐷𝑉𝐷 π‘π‘™π‘Žπ‘¦π‘’π‘Ÿπ‘ 
π‘€β„Žπ‘’π‘› π‘€π‘’π‘Ÿπ‘’ 𝑋 𝑖𝑛𝑣𝑒𝑛𝑑𝑒𝑑 β†’ beβˆ’inventβˆ’in2
𝐷𝑉𝐷 π‘π‘™π‘Žπ‘¦π‘’π‘Ÿπ‘  β†’ dvdβˆ’player
0.9
0.85
0.8
Paralex
0.75
Paralex dataset [Fader et al., 2013]
β€’ 1.8M (question, single-relation queries)
0.7
0
0.05
When were X invented?
beβˆ’inventβˆ’in2
β€’ 160k (mention, entity)
Saint Patrick day
stβˆ’patrickβˆ’day
Task: Question Answering
β€’ What language do people in Hong Kong use?
beβˆ’speakβˆ’in english, hongβˆ’kong
beβˆ’predominantβˆ’languageβˆ’in cantonese, hongβˆ’kong
β€’ Where do you find Mt Ararat?
beβˆ’highestβˆ’mountainβˆ’in(ararat, turkey)
beβˆ’mountainβˆ’in(ararat, armenia)
0.15
0.2
0.25
0.3
0.35
0.4
Recall
When were DVD players invented?
πœ†π‘₯. beβˆ’inventβˆ’in(dvdβˆ’player, π‘₯)
β€’ 1.2M (relation pattern, relation)
0.1
Conclusions
οƒ˜ A new semantic parsing framework for single-relation
questions
β€’ Use a semantic similarity function to match patterns and
relations, as well as mentions and entities
β€’ Semantic similarity model – Convolutional neural
networks with letter-trigram vector input
β€’ Go beyond bag-of-words and handle OOV better
β€’ Outperform previous work using lexical matching rules
οƒ˜ Future work
β€’ Apply this approach to more structured KB (Freebase)
β€’ Extend this work to handle multi-relation questions