Semantic Parsing for Single-Relation Question Answering
Scott Wen-tau Yih, Xiaodong He & Christopher Meek
{scottyih, xiaohe, meek}@microsoft.com
Open-Domain QA using Knowledge Base
semantic parsing
ππ₯. sisterβof(justinβbieber, π₯)
query
inference
Convolutional Deep Semantic Similarity Model [Shen et al., 2014]
Siamese neural networks
Input
Who is Justin Bieberβs sister?
Jazmyn Bieber
Task & Problem Definition
β’ A KB as a collection of triples (π, π1 , π2 )
β’ A single-relation question, describing a relation and
one of its entity arguments
βWhen were DVD players invented?β
β’ Input is mapped to two π-dimensional vectors
β’ Probability is determined by softmax of their cosine similarity
500
Semantic projection matrix: Ws
Max pooling layer: v
500
...
exp cos(π¦π
, π¦π )
π π
π =
π
β² exp cos(π¦π
β² , π¦π )
Output
β’ An entity that has the relation with the given entity
Semantic layer: y
π¦π β Rπ
Max pooling operation
max
max
...
...
max
...
π¦π
β Rπ
...
500
500
...
500
15K
15K
15K
...
15K
15K
<s>
w1
w2
wT
<s>
Convolutional layer: ht
Convolution matrix: Wc
siblingβof(justinβbieber, jazmynβbieber)
gender(jazmynβbieber, female)
Knowledge
Base
High-level Approach: Semantic Parsing
Word hashing layer: ft
π€βππ π€πππ π πππ£πππ‘ππ
beβinventβin2
Word hashing matrix: Wf
Word sequence: xt
π =βWhen were DVD players invented?β
ο Most common questions in the search query logs
β’ βHow old is Kirk Douglas, the actor?β
β’ βWhat county is St. Elizabeth MO in?β
β’ βWhat year was the 8 track invented?β
β’ βWho owns the Texas Rangers?β
ππ₯. beβinventβin(dvdβplayer, π₯)
ο Foundation for answering complicated questions
β’ βName a director of movies starred by Tom Hanks.β
β’ CKY parsing that chains answers of single-relation
questions [Bao et al., 2014]
ο Challenge: lots of ways to ask the same question
β’ βWhat was the date that Minnesota became a state?β
β’ βMinnesota became a state on?β
β’ βWhen was the state Minnesota created?β
β’ βMinnesota's date it entered the union?β
β’ βWhen was Minnesota established as a state?β
β’ βWhat day did Minnesota officially become a state?β
β’ β―
Key Ideas & Related Work
ο Simple Context-Free Grammar
β’ Separate a question into a relation pattern and an entity
mention
ο Paraphrase detection using convolutional neural net
ο Inspired by Paralex [Fader et al. 2013]
β’ 35M question paraphrase pairs from WikiAnswers
β’ Learn weighted lexical matching rules
Procedure: Enumerate All Hypotheses
π = βWhen were DVD players invented?β
π β π€βππ X ππππ¦πππ πππ£πππ‘ππ
π β π€πππ π·ππ·
π = βWhen were DVD players invented?β
π β π€βππ π€πππ X πππ£πππ‘ππ
π β π·ππ· ππππ¦πππ
ππππ(beβinventβin2|π€βππ π€πππ X πππ£πππ‘ππ) = 0.5
ππππ(dvdβplayer|π·ππ· ππππ¦πππ ) = 0.7
ππππ ππ₯. beβinventβin(dvdβplayer, π₯) π = 0.35
Paraphrase Detection via
Siamese Neural Networks!
Experiments: Data & Task
Experiments: Results
β’ Same test questions in the Paralex dataset
β’ 698 questions from 37 clusters
Knowledge base: ReVerb [Fader et al., 2011]
Relation
be-official-language
be-second-largest-city-in
be-tallest-mountain-in
Entity Argument #1
chinese-and-english
arequipa
ararat
Entity Argument #2
hong-kong
peru
armenia
have-population-of
provide
use-for
β¦
city-of-vancouver
microsoft
laser
β¦
587,891
office-software
lasik
β¦
1
Ours
0.95
Precision
Single-Relation Questions
π βπβ§π
π β π€βππ π€πππ X πππ£πππ‘ππ
π β π·ππ· ππππ¦πππ
π€βππ π€πππ π πππ£πππ‘ππ β beβinventβin2
π·ππ· ππππ¦πππ β dvdβplayer
0.9
0.85
0.8
Paralex
0.75
Paralex dataset [Fader et al., 2013]
β’ 1.8M (question, single-relation queries)
0.7
0
0.05
When were X invented?
beβinventβin2
β’ 160k (mention, entity)
Saint Patrick day
stβpatrickβday
Task: Question Answering
β’ What language do people in Hong Kong use?
beβspeakβin english, hongβkong
beβpredominantβlanguageβin cantonese, hongβkong
β’ Where do you find Mt Ararat?
beβhighestβmountainβin(ararat, turkey)
beβmountainβin(ararat, armenia)
0.15
0.2
0.25
0.3
0.35
0.4
Recall
When were DVD players invented?
ππ₯. beβinventβin(dvdβplayer, π₯)
β’ 1.2M (relation pattern, relation)
0.1
Conclusions
ο A new semantic parsing framework for single-relation
questions
β’ Use a semantic similarity function to match patterns and
relations, as well as mentions and entities
β’ Semantic similarity model β Convolutional neural
networks with letter-trigram vector input
β’ Go beyond bag-of-words and handle OOV better
β’ Outperform previous work using lexical matching rules
ο Future work
β’ Apply this approach to more structured KB (Freebase)
β’ Extend this work to handle multi-relation questions
© Copyright 2026 Paperzz