Verbal Multiword Expressions

Language Technology:
Research and Development
Topic: Verbal Multiword Expressions
Fabienne Cap
What is this bear doing?
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
What is this bear doing?
The red bear marches to a di↵erent drummer.
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
What is this bear doing?
The red bear marches to a di↵erent drummer.
Der rote Bär tanzt aus der Reihe. (= dances out of the order)
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
What is this bear doing?
The red bear marches to a di↵erent drummer.
Der rote Bär tanzt aus der Reihe. (= dances out of the order)
L’ours rouge nage à contre-courant. (= swims against the stream)
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
What is this bear doing?
The red bear marches to a di↵erent drummer.
Der rote Bär tanzt aus der Reihe. (= dances out of the order)
L’ours rouge nage à contre-courant. (= swims against the stream)
El oso rojo sale de la fila. (= step out of the line)
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
What is this bear doing?
The red bear marches to a di↵erent drummer.
Der rote Bär tanzt aus der Reihe. (= dances out of the order)
L’ours rouge nage à contre-courant. (= swims against the stream)
El oso rojo sale de la fila. (= step out of the line)
But:
Where is the drummer?
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
What is this bear doing?
The red bear marches to a di↵erent drummer.
Der rote Bär tanzt aus der Reihe. (= dances out of the order)
L’ours rouge nage à contre-courant. (= swims against the stream)
El oso rojo sale de la fila. (= step out of the line)
But:
Where is the drummer?
Is it really dancing?
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
What is this bear doing?
The red bear marches to a di↵erent drummer.
Der rote Bär tanzt aus der Reihe. (= dances out of the order)
L’ours rouge nage à contre-courant. (= swims against the stream)
El oso rojo sale de la fila. (= step out of the line)
But:
Where is the drummer?
Is it really dancing?... or swimming?
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
What is this bear doing?
The red bear marches to a di↵erent drummer.
Der rote Bär tanzt aus der Reihe. (= dances out of the order)
L’ours rouge nage à contre-courant. (= swims against the stream)
El oso rojo sale de la fila. (= step out of the line)
But:
Where is the drummer?
Is it really dancing?... or swimming?
Is it a line it was standing in?
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Verbal Multiword Expressions
Such expressions are called verbal multiword expressions
(VMWEs).
They form a unit that crosses word boundaries (whitespaces).
Just like the bear, VMWEs march to a di↵erent drummer
they behave unlike literal expressions of the same kind.
Consider for example their meaning:
to kick the bucket ! to die (VMWE)
to kick the ball ! to kick the ball (literal combination)
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Verbal Multiword Expressions
Such expressions are called verbal multiword expressions
(VMWEs).
They form a unit that crosses word boundaries (whitespaces).
Just like the bear, VMWEs march to a di↵erent drummer
they behave unlike literal expressions of the same kind.
Consider for example their meaning:
to kick the bucket ! to die (VMWE)
to kick the ball ! to kick the ball (literal combination)
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Verbal Multiword Expressions
Such expressions are called verbal multiword expressions
(VMWEs).
They form a unit that crosses word boundaries (whitespaces).
Just like the bear, VMWEs march to a di↵erent drummer
they behave unlike literal expressions of the same kind.
Consider for example their meaning:
to kick the bucket ! to die (VMWE)
to kick the ball ! to kick the ball (literal combination)
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Verbal Multiword Expressions
Such expressions are called verbal multiword expressions
(VMWEs).
They form a unit that crosses word boundaries (whitespaces).
Just like the bear, VMWEs march to a di↵erent drummer
they behave unlike literal expressions of the same kind.
Consider for example their meaning:
to kick the bucket ! to die (VMWE)
to kick the ball ! to kick the ball (literal combination)
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Verbal Multiword Expressions
Such expressions are called verbal multiword expressions
(VMWEs).
They form a unit that crosses word boundaries (whitespaces).
Just like the bear, VMWEs march to a di↵erent drummer
they behave unlike literal expressions of the same kind.
Consider for example their meaning:
to kick the bucket ! to die (VMWE)
to kick the ball ! to kick the ball (literal combination)
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Definitions
Selected definitions from the literature:
Multiword Expressions (MWEs) are idiosyncratic
interpretations that cross word boundaries. (Sag et al. 2002)
A Multiword Expression is usually taken to be any word
combination (adjacent or otherwise) that has some feature
(syntactic, semantic or purely statistic) that cannot be
predicted on the basis of its component words and/or the
combinatorial process of language. (Bannard, 2007)
... there are many more definitions and terms used to describe
the phenomenon, e.g. Idioms, Fixed Expressions,
Collocations.
For here and now, we stick with verbal MWEs
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Definitions
Selected definitions from the literature:
Multiword Expressions (MWEs) are idiosyncratic
interpretations that cross word boundaries. (Sag et al. 2002)
A Multiword Expression is usually taken to be any word
combination (adjacent or otherwise) that has some feature
(syntactic, semantic or purely statistic) that cannot be
predicted on the basis of its component words and/or the
combinatorial process of language. (Bannard, 2007)
... there are many more definitions and terms used to describe
the phenomenon, e.g. Idioms, Fixed Expressions,
Collocations.
For here and now, we stick with verbal MWEs
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Definitions
Selected definitions from the literature:
Multiword Expressions (MWEs) are idiosyncratic
interpretations that cross word boundaries. (Sag et al. 2002)
A Multiword Expression is usually taken to be any word
combination (adjacent or otherwise) that has some feature
(syntactic, semantic or purely statistic) that cannot be
predicted on the basis of its component words and/or the
combinatorial process of language. (Bannard, 2007)
... there are many more definitions and terms used to describe
the phenomenon, e.g. Idioms, Fixed Expressions,
Collocations.
For here and now, we stick with verbal MWEs
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Definitions
Selected definitions from the literature:
Multiword Expressions (MWEs) are idiosyncratic
interpretations that cross word boundaries. (Sag et al. 2002)
A Multiword Expression is usually taken to be any word
combination (adjacent or otherwise) that has some feature
(syntactic, semantic or purely statistic) that cannot be
predicted on the basis of its component words and/or the
combinatorial process of language. (Bannard, 2007)
... there are many more definitions and terms used to describe
the phenomenon, e.g. Idioms, Fixed Expressions,
Collocations.
For here and now, we stick with verbal MWEs
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Definitions
Selected definitions from the literature:
Multiword Expressions (MWEs) are idiosyncratic
interpretations that cross word boundaries. (Sag et al. 2002)
A Multiword Expression is usually taken to be any word
combination (adjacent or otherwise) that has some feature
(syntactic, semantic or purely statistic) that cannot be
predicted on the basis of its component words and/or the
combinatorial process of language. (Bannard, 2007)
... there are many more definitions and terms used to describe
the phenomenon, e.g. Idioms, Fixed Expressions,
Collocations.
For here and now, we stick with verbal MWEs
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
“Special” at di↵erent levels
MWEs are special at one or more levels of linguistic description:
non-compositional semantics:
let the cat out of the bag(VMWE) 6=
fixed syntax:
to go bananas (VMWE) vs. *bananas are gone
fixed morphology:
feed to the lions (VMWE) vs.
Lit feed
to the lion
frequency: combination occurs more frequently than expected
vaguely remember (VMWE) vs. *vaguely recall
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
“Special” at di↵erent levels
MWEs are special at one or more levels of linguistic description:
non-compositional semantics:
let the cat out of the bag(VMWE) 6=
fixed syntax:
to go bananas (VMWE) vs. *bananas are gone
fixed morphology:
feed to the lions (VMWE) vs.
Lit feed
to the lion
frequency: combination occurs more frequently than expected
vaguely remember (VMWE) vs. *vaguely recall
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
“Special” at di↵erent levels
MWEs are special at one or more levels of linguistic description:
non-compositional semantics:
let the cat out of the bag(VMWE) 6=
fixed syntax:
to go bananas (VMWE) vs. *bananas are gone
fixed morphology:
feed to the lions (VMWE) vs.
Lit feed
to the lion
frequency: combination occurs more frequently than expected
vaguely remember (VMWE) vs. *vaguely recall
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
“Special” at di↵erent levels
MWEs are special at one or more levels of linguistic description:
non-compositional semantics:
let the cat out of the bag(VMWE) 6=
fixed syntax:
to go bananas (VMWE) vs. *bananas are gone
fixed morphology:
feed to the lions (VMWE) vs.
Lit feed
to the lion
frequency: combination occurs more frequently than expected
vaguely remember (VMWE) vs. *vaguely recall
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
“Special” at di↵erent levels
MWEs are special at one or more levels of linguistic description:
non-compositional semantics:
let the cat out of the bag(VMWE) 6=
fixed syntax:
to go bananas (VMWE) vs. *bananas are gone
fixed morphology:
feed to the lions (VMWE) vs.
Lit feed
to the lion
frequency: combination occurs more frequently than expected
vaguely remember (VMWE) vs. *vaguely recall
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Identification of VMWEs
Their special behaviour can be used to identify them!
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Identification of VMWEs
Their special behaviour can be used to identify them!
feature
identification
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Identification of VMWEs
Their special behaviour can be used to identify them!
feature
non-compositional semantics
Fabienne Cap
identification
word alignment
Language Technology:Research and DevelopmentTopic: Verbal
Identification of VMWEs
Their special behaviour can be used to identify them!
feature
non-compositional semantics
fixed syntax
fixed morphology
Fabienne Cap
identification
word alignment
count occurring variants
count occurring variants
Language Technology:Research and DevelopmentTopic: Verbal
Identification of VMWEs
Their special behaviour can be used to identify them!
feature
non-compositional semantics
fixed syntax
fixed morphology
fequency
Fabienne Cap
identification
word alignment
count occurring variants
count occurring variants
statistical association measures
Language Technology:Research and DevelopmentTopic: Verbal
Compositionality
The most prominent feature of VMWEs is their non-compositionality:
you cannot tell about the whole based on it’s parts.
to cut the mustard
to sweep under the rug
to spill the beans
to bring somebody down
to go ahead
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Compositionality
The most prominent feature of VMWEs is their non-compositionality:
you cannot tell about the whole based on it’s parts.
But: not all MWEs are completely non-compositional.
It can be difficult to draw the line:
to cut the mustard
to sweep under the rug
to spill the beans
to bring somebody down
to go ahead
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Compositionality
The most prominent feature of VMWEs is their non-compositionality:
you cannot tell about the whole based on it’s parts.
But: not all MWEs are completely non-compositional.
It can be difficult to draw the line:
to cut the mustard
to sweep under the rug
to spill the beans
to bring somebody down
to go ahead
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Compositionality
The most prominent feature of VMWEs is their non-compositionality:
you cannot tell about the whole based on it’s parts.
But: not all MWEs are completely non-compositional.
It can be difficult to draw the line:
to cut the mustard ! to perform well
to sweep under the rug
to spill the beans
to bring somebody down
to go ahead
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Compositionality
The most prominent feature of VMWEs is their non-compositionality:
you cannot tell about the whole based on it’s parts.
But: not all MWEs are completely non-compositional.
It can be difficult to draw the line:
to cut the mustard ! to perform well
to sweep under the rug
to spill the beans
to bring somebody down
to go ahead
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Compositionality
The most prominent feature of VMWEs is their non-compositionality:
you cannot tell about the whole based on it’s parts.
But: not all MWEs are completely non-compositional.
It can be difficult to draw the line:
to cut the mustard ! to perform well
to sweep under the rug ! to hide
to spill the beans
to bring somebody down
to go ahead
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Compositionality
The most prominent feature of VMWEs is their non-compositionality:
you cannot tell about the whole based on it’s parts.
But: not all MWEs are completely non-compositional.
It can be difficult to draw the line:
to cut the mustard ! to perform well
to sweep under the rug ! to hide
to spill the beans
to bring somebody down
to go ahead
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Compositionality
The most prominent feature of VMWEs is their non-compositionality:
you cannot tell about the whole based on it’s parts.
But: not all MWEs are completely non-compositional.
It can be difficult to draw the line:
to cut the mustard ! to perform well
to sweep under the rug ! to hide
to spill the beans ! to reveal
a secret
to bring somebody down
to go ahead
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Compositionality
The most prominent feature of VMWEs is their non-compositionality:
you cannot tell about the whole based on it’s parts.
But: not all MWEs are completely non-compositional.
It can be difficult to draw the line:
to cut the mustard ! to perform well
to sweep under the rug ! to hide
to spill the beans ! to reveal (spill) a secret (the “beans”)
to bring somebody down
to go ahead
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Compositionality
The most prominent feature of VMWEs is their non-compositionality:
you cannot tell about the whole based on it’s parts.
But: not all MWEs are completely non-compositional.
It can be difficult to draw the line:
to cut the mustard ! to perform well
to sweep under the rug ! to hide
to spill the beans ! to reveal (spill) a secret (the “beans”)
to bring somebody down
to go ahead
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Compositionality
The most prominent feature of VMWEs is their non-compositionality:
you cannot tell about the whole based on it’s parts.
But: not all MWEs are completely non-compositional.
It can be difficult to draw the line:
to cut the mustard ! to perform well
to sweep under the rug ! to hide
to spill the beans ! to reveal (spill) a secret (the “beans”)
to bring somebody down ! to make somebody sad
to go ahead
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Compositionality
The most prominent feature of VMWEs is their non-compositionality:
you cannot tell about the whole based on it’s parts.
But: not all MWEs are completely non-compositional.
It can be difficult to draw the line:
to cut the mustard ! to perform well
to sweep under the rug ! to hide
to spill the beans ! to reveal (spill) a secret (the “beans”)
to bring somebody down ! to make somebody sad
to go ahead
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Compositionality
The most prominent feature of VMWEs is their non-compositionality:
you cannot tell about the whole based on it’s parts.
But: not all MWEs are completely non-compositional.
It can be difficult to draw the line:
to cut the mustard ! to perform well
to sweep under the rug ! to hide
to spill the beans ! to reveal (spill) a secret (the “beans”)
to bring somebody down ! to make somebody sad
to go ahead ! to start/proceed
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Context
In some contexts, expressions that are taken to be VMWEs might
be used compositionally. Imagine:
A soccer player kicking a bucket
A cat that has been caught in a bag and is later released
A janitor sweeping some dirt under some rug
A child spilling some beans on the floor
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Context
In some contexts, expressions that are taken to be VMWEs might
be used compositionally. Imagine:
A soccer player kicking a bucket
A cat that has been caught in a bag and is later released
A janitor sweeping some dirt under some rug
A child spilling some beans on the floor
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Context
In some contexts, expressions that are taken to be VMWEs might
be used compositionally. Imagine:
A soccer player kicking a bucket
A cat that has been caught in a bag and is later released
A janitor sweeping some dirt under some rug
A child spilling some beans on the floor
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Context
In some contexts, expressions that are taken to be VMWEs might
be used compositionally. Imagine:
A soccer player kicking a bucket
A cat that has been caught in a bag and is later released
A janitor sweeping some dirt under some rug
A child spilling some beans on the floor
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Context
In some contexts, expressions that are taken to be VMWEs might
be used compositionally. Imagine:
A soccer player kicking a bucket
A cat that has been caught in a bag and is later released
A janitor sweeping some dirt under some rug
A child spilling some beans on the floor
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Context
In some contexts, expressions that are taken to be VMWEs might
be used compositionally. Imagine:
A soccer player kicking a bucket
A cat that has been caught in a bag and is later released
A janitor sweeping some dirt under some rug
A child spilling some beans on the floor
! not very likely, probably not very frequent, but not impossible!
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Context
In some contexts, expressions that are taken to be VMWEs might
be used compositionally. Imagine:
A soccer player kicking a bucket
A cat that has been caught in a bag and is later released
A janitor sweeping some dirt under some rug
A child spilling some beans on the floor
! not very likely, probably not very frequent, but not impossible!
! VMWE decisions have to be made context-dependently
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
High Variety
VMWEs come in many di↵erent shapes, e.g.:
Idioms: let the cat out of the bag, spill the beans
Light-verb constructions: make a mistake vs. *do a mistake
Particle verbs: to bring down vs.
Inherently reflexive verbs:
sich enthalten (to abstrain) vs.
Fabienne Cap
Lit to
pick up
Lit enthalten
(to contain)
Language Technology:Research and DevelopmentTopic: Verbal
High Variety
VMWEs come in many di↵erent shapes, e.g.:
Idioms: let the cat out of the bag, spill the beans
Light-verb constructions: make a mistake vs. *do a mistake
Particle verbs: to bring down vs.
Inherently reflexive verbs:
sich enthalten (to abstrain) vs.
Fabienne Cap
Lit to
pick up
Lit enthalten
(to contain)
Language Technology:Research and DevelopmentTopic: Verbal
High Variety
VMWEs come in many di↵erent shapes, e.g.:
Idioms: let the cat out of the bag, spill the beans
Light-verb constructions: make a mistake vs. *do a mistake
Particle verbs: to bring down vs.
Inherently reflexive verbs:
sich enthalten (to abstrain) vs.
Fabienne Cap
Lit to
pick up
Lit enthalten
(to contain)
Language Technology:Research and DevelopmentTopic: Verbal
High Variety
VMWEs come in many di↵erent shapes, e.g.:
Idioms: let the cat out of the bag, spill the beans
Light-verb constructions: make a mistake vs. *do a mistake
Particle verbs: to bring down vs.
Inherently reflexive verbs:
sich enthalten (to abstrain) vs.
Fabienne Cap
Lit to
pick up
Lit enthalten
(to contain)
Language Technology:Research and DevelopmentTopic: Verbal
High Variety
VMWEs come in many di↵erent shapes, e.g.:
Idioms: let the cat out of the bag, spill the beans
Light-verb constructions: make a mistake vs. *do a mistake
Particle verbs: to bring down vs.
Inherently reflexive verbs:
sich enthalten (to abstrain) vs.
Fabienne Cap
Lit to
pick up
Lit enthalten
(to contain)
Language Technology:Research and DevelopmentTopic: Verbal
Problematic for many NLP applications
Everyone knows that VMWEs cause problems in NLP,
but they are still often ignored in many applications.
many di↵erent shapes
long-distance dependencies between parts
data sparsity: many types not so many tokens
compositionality: where to draw the line?
translation: VMWEs do not always translate into VMWEs
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Problematic for many NLP applications
Everyone knows that VMWEs cause problems in NLP,
but they are still often ignored in many applications.
many di↵erent shapes
long-distance dependencies between parts
data sparsity: many types not so many tokens
compositionality: where to draw the line?
translation: VMWEs do not always translate into VMWEs
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Problematic for many NLP applications
Everyone knows that VMWEs cause problems in NLP,
but they are still often ignored in many applications.
many di↵erent shapes
long-distance dependencies between parts
data sparsity: many types not so many tokens
compositionality: where to draw the line?
translation: VMWEs do not always translate into VMWEs
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Problematic for many NLP applications
Everyone knows that VMWEs cause problems in NLP,
but they are still often ignored in many applications.
many di↵erent shapes
long-distance dependencies between parts
data sparsity: many types not so many tokens
compositionality: where to draw the line?
translation: VMWEs do not always translate into VMWEs
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Problematic for many NLP applications
Everyone knows that VMWEs cause problems in NLP,
but they are still often ignored in many applications.
many di↵erent shapes
long-distance dependencies between parts
data sparsity: many types not so many tokens
compositionality: where to draw the line?
translation: VMWEs do not always translate into VMWEs
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Problematic for many NLP applications
Everyone knows that VMWEs cause problems in NLP,
but they are still often ignored in many applications.
many di↵erent shapes
long-distance dependencies between parts
data sparsity: many types not so many tokens
compositionality: where to draw the line?
translation: VMWEs do not always translate into VMWEs
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Projects
Research on VMWEs:
di↵erent languages
di↵erent linguistic description levels
di↵erent applications
di↵erent approaches for identification
(e.g. linguistics-based, heuristic, machine learning, ...)
identification of VMWEs
improved processing of VMWEs (e.g. in SMT)
determine the degree of compositionality of VMWEs
determine the presence of VMWEs in di↵erent contexts
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Projects
Research on VMWEs:
di↵erent languages
di↵erent linguistic description levels
di↵erent applications
di↵erent approaches for identification
(e.g. linguistics-based, heuristic, machine learning, ...)
identification of VMWEs
improved processing of VMWEs (e.g. in SMT)
determine the degree of compositionality of VMWEs
determine the presence of VMWEs in di↵erent contexts
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Projects
Research on VMWEs:
di↵erent languages
di↵erent linguistic description levels
di↵erent applications
di↵erent approaches for identification
(e.g. linguistics-based, heuristic, machine learning, ...)
identification of VMWEs
improved processing of VMWEs (e.g. in SMT)
determine the degree of compositionality of VMWEs
determine the presence of VMWEs in di↵erent contexts
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Projects
Research on VMWEs:
di↵erent languages
di↵erent linguistic description levels
di↵erent applications
di↵erent approaches for identification
(e.g. linguistics-based, heuristic, machine learning, ...)
identification of VMWEs
improved processing of VMWEs (e.g. in SMT)
determine the degree of compositionality of VMWEs
determine the presence of VMWEs in di↵erent contexts
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Projects
Research on VMWEs:
di↵erent languages
di↵erent linguistic description levels
di↵erent applications
di↵erent approaches for identification
(e.g. linguistics-based, heuristic, machine learning, ...)
identification of VMWEs
improved processing of VMWEs (e.g. in SMT)
determine the degree of compositionality of VMWEs
determine the presence of VMWEs in di↵erent contexts
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Projects
Research on VMWEs:
di↵erent languages
di↵erent linguistic description levels
di↵erent applications
di↵erent approaches for identification
(e.g. linguistics-based, heuristic, machine learning, ...)
Possible Projects:
identification of VMWEs
improved processing of VMWEs (e.g. in SMT)
determine the degree of compositionality of VMWEs
determine the presence of VMWEs in di↵erent contexts
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Projects
Research on VMWEs:
di↵erent languages
di↵erent linguistic description levels
di↵erent applications
di↵erent approaches for identification
(e.g. linguistics-based, heuristic, machine learning, ...)
Possible Projects:
identification of VMWEs
improved processing of VMWEs (e.g. in SMT)
determine the degree of compositionality of VMWEs
determine the presence of VMWEs in di↵erent contexts
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Projects
Research on VMWEs:
di↵erent languages
di↵erent linguistic description levels
di↵erent applications
di↵erent approaches for identification
(e.g. linguistics-based, heuristic, machine learning, ...)
Possible Projects:
identification of VMWEs
improved processing of VMWEs (e.g. in SMT)
determine the degree of compositionality of VMWEs
determine the presence of VMWEs in di↵erent contexts
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Projects
Research on VMWEs:
di↵erent languages
di↵erent linguistic description levels
di↵erent applications
di↵erent approaches for identification
(e.g. linguistics-based, heuristic, machine learning, ...)
Possible Projects:
identification of VMWEs
improved processing of VMWEs (e.g. in SMT)
determine the degree of compositionality of VMWEs
determine the presence of VMWEs in di↵erent contexts
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Projects
Research on VMWEs:
di↵erent languages
di↵erent linguistic description levels
di↵erent applications
di↵erent approaches for identification
(e.g. linguistics-based, heuristic, machine learning, ...)
Possible Projects:
identification of VMWEs
improved processing of VMWEs (e.g. in SMT)
determine the degree of compositionality of VMWEs
determine the presence of VMWEs in di↵erent contexts
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
PARSEME
PARSEME = PARSing and Multi-word Expressions,
a European project devoted to multiword expressions.
http://typo.uni-konstanz.de/parseme/
upcoming shared task on identification of VMWEs
dates: t.b.a. February-March 2017
Workshop at EACL2017 in Valencia, Spain (April 2017)
Languages:
Germanic: English, German, Swedish, Yiddish
Romance: French, Italian, Romanian, Spanish, Brazilian Portuguese
Balto-Slavic: Bulgarian, Czech, Croatian, Lithuanian, Polish, Slovene
Other: Farsi, Greek, Hebrew, Hungarian, Maltese, Turkish
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
PARSEME
PARSEME = PARSing and Multi-word Expressions,
a European project devoted to multiword expressions.
http://typo.uni-konstanz.de/parseme/
upcoming shared task on identification of VMWEs
dates: t.b.a. February-March 2017
Workshop at EACL2017 in Valencia, Spain (April 2017)
Languages:
Germanic: English, German, Swedish, Yiddish
Romance: French, Italian, Romanian, Spanish, Brazilian Portuguese
Balto-Slavic: Bulgarian, Czech, Croatian, Lithuanian, Polish, Slovene
Other: Farsi, Greek, Hebrew, Hungarian, Maltese, Turkish
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
PARSEME
PARSEME = PARSing and Multi-word Expressions,
a European project devoted to multiword expressions.
http://typo.uni-konstanz.de/parseme/
upcoming shared task on identification of VMWEs
dates: t.b.a. February-March 2017
Workshop at EACL2017 in Valencia, Spain (April 2017)
Languages:
Germanic: English, German, Swedish, Yiddish
Romance: French, Italian, Romanian, Spanish, Brazilian Portuguese
Balto-Slavic: Bulgarian, Czech, Croatian, Lithuanian, Polish, Slovene
Other: Farsi, Greek, Hebrew, Hungarian, Maltese, Turkish
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
PARSEME
PARSEME = PARSing and Multi-word Expressions,
a European project devoted to multiword expressions.
http://typo.uni-konstanz.de/parseme/
upcoming shared task on identification of VMWEs
dates: t.b.a. February-March 2017
Workshop at EACL2017 in Valencia, Spain (April 2017)
Languages:
Germanic: English, German, Swedish, Yiddish
Romance: French, Italian, Romanian, Spanish, Brazilian Portuguese
Balto-Slavic: Bulgarian, Czech, Croatian, Lithuanian, Polish, Slovene
Other: Farsi, Greek, Hebrew, Hungarian, Maltese, Turkish
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
PARSEME
PARSEME = PARSing and Multi-word Expressions,
a European project devoted to multiword expressions.
http://typo.uni-konstanz.de/parseme/
upcoming shared task on identification of VMWEs
dates: t.b.a. February-March 2017
Workshop at EACL2017 in Valencia, Spain (April 2017)
Languages:
Germanic: English, German, Swedish, Yiddish
Romance: French, Italian, Romanian, Spanish, Brazilian Portuguese
Balto-Slavic: Bulgarian, Czech, Croatian, Lithuanian, Polish, Slovene
Other: Farsi, Greek, Hebrew, Hungarian, Maltese, Turkish
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal
Questions?
Fabienne Cap
Language Technology:Research and DevelopmentTopic: Verbal