Ling 181 Lexical Semantics - University of California, Berkeley

Ling 181 Lexical Semantics
Classification of the Semantic Relations in Noun Compounds
Barbara Rosario
([email protected])
Introduction
This paper is about the classification of noun-compounds (NCs) into classes that reflect the
semantic relationships between the nouns. I am ultimately interested in the automatic classification
of NCs with machine learning techniques. In order to learn the classification (to eventually classify
previously unseen NCs) these techniques need as input noun compounds that have been previously
classified. I created a collection of NCs extracting them from abstracts of medical journals and I
classified them. I developed the classification schema without any knowledge of the linguistic
literature on the subject. This paper is an attempt to overcome this ignorance. I will review a few
linguistic accounts of the semantics of NCs, and compare in detail the differences of my
classification with the ones proposed by [Levi 1978] and by [Warren 1978]. I am also interested in
making sure that my classification schema can be justified from a linguistic point of view, in
exploring ways to improve it, in seeing how (and if) NCs in the medical domain differ (from the
point of view of the semantic relationships) from NCs of common usage. My analysis will take into
account the fact that my goal is the automatic classification of NCs.
This paper is organized as follows: in the next Section I introduce NCs and in Section 2 I
explain my motivations for studying NCs. In Section 3 I briefly review (some of) the linguistic
literature on semantics of NCs. Section 4 describes [Levi 1978], in Section 5 I compare my
classification schema with Levi’s, in Section 6 Warren’s theory [Warren1978] is briefly introduced
and my classification schema compared with hers in Section 7.
1) Introducing Noun Compounds
In this Section I will (briefly) introduce some facts about NCs .
There are a multitude of possible definitions for NCs. [Lauer 1995] says the most popular are:
1. Noun premodifier: Any constituent can appear before a noun to form a NC: out-in-thewilds cottages is therefore considered a NC
2. Compounds (phonological definition due to Chomsky): words preceding a noun form a
compound if they receive primary stress, Thus blackboard is a compound, while black
board is not.
3. Complex Nominals: Levi (1978) chooses to include certain adjective along with nouns
as possible compounding elements that she calls Complex Nominals. The adjectives
included are non-predicative adjectives like in electrical engineer.
4. Noun Noun Compounds: any sequence of nouns that itself functions as a noun.
5. Noun Noun Compounds: any sequence of nouns at least two words in length that itself
functions as a noun, but which contains no genitive markers and is not a name.
Compounds with hyphens are called dvandva compounds. Levi doesn’t talk about them, and
Warrens says she found only ten dvandva compounds in her corpus (for example, founder-director,
poet-painter, banker-editor). She says that these compounds are different from the others in that the
modifier does not predicate something about the second constituent and it doesn’t modifies it’s
meaning. The motivation for combining poet and painter is different from the motivation of
combining girl and friend in girl friend. In the former case, we want to convey that someone is not
only a poet but also a painter, and we are not trying to identify which painter, or to define what kind
of painter we are talking about. In the latter case, we want make our reference more precise. Poetpainter means “more” that just painter, we have an expansion of reference scope, while for girl
friend we narrow the reference scope and girl friend “means less” than friend.
For my application, I use definition 5 with the difference that I consider also names (like
Mexico City). Warren (1978) includes also orthographically joined morphemes such as gunman.
She doesn’t differentiate the semantics of continuous compounds vs. the semantics of compounds
of separate words. For the moment, I am considering only sequences of separate nouns, assuming
that the words in orthographically joined morphemes are joined because they are sufficient
common to warrant inclusion in the lexicon, and thus do not require dynamic processing. In the
future, I’ll need to address this issues and I also plan of analyzing the semantics of adjective-noun
pairs.
There are (many) other important issues about NCs such as lexicalisation, syntactic analysis
(parsing), prosody, linguistic function, grammar (nominalizations vs. non-verbal-nexus
compounds), ambiguity, but they are beyond the scope of this paper that focuses on the semantic
analysis of NCs.
2) Motivation of (my) Noun Compound research
Understanding NCs is a vital component of any sophisticated natural language system. It has
been shown that their frequency has been increasing over the last two centuries and they are
especially common in technical text.
I am particularly interested in the automatic analysis of NCs in the bio-medical domain. This is
a very important application of NLP. Some project in this domain that are currently going on are,
for example, the automatic extraction of medical concepts from medical record texts and combining
genome sequence information with the published biological record, “reading” the scientific
2
literature and successfully predict gene interactions1. Very ambitiously, in general, analysis of text
can lead to hypotheses about diseases (see [Swanson97]) and of course if we want to formulate
hypothesis, we need to determine the relationships that hold between different concepts; for
example we need to know that in the phrase “stress can lead to loss of magnesium” stress and
magnesium are connected by a cause-effect kind of relationship.
One of the important challenges of biomedical text, along with most other technical text, is the
proliferation of noun compounds. This is a real title of a medical article:
Open-labeled long-term study of the
subcutaneous sumatriptan efficacy and
tolerability in acute migraine treatment
The real concern in analyzing such a title is in determining the relationships that hold between
different concepts: we need to know that for example, that migraine treatment is a treatment for the
disease migraine (purpose kind of relationship). My research is about finding automatic algorithms
that can learn /find these relationships and obviously in order to do this , we first need to know
what are these relationships.
To create a collection of noun compounds, I performed searches from MedLine, which contains
references and abstracts from 4300 biomedical journals. I used several query terms, intended to
span across different sub fields. I retained only the titles and the abstracts of the retrieved
documents. On these titles and abstracts I ran a part-of-speech tagger and a program that extracts
only sequences of units tagged as nouns. I extracted NCs with up to 6 constituents, but for this
paper I consider only NCs with 2 constituents.
In order to develop a classification schema for the NCs, I used a corpus-based methodology,
i.e. after acquiring the NCs from the corpus I used every NC2 for the development of the
classification schema (this approach allows also to measure the relative frequency of different
semantic relations, which ensures that emphasis is given to semantic behaviors that are more likely
to be encountered3). The relations were found by iterative refinement based on looking at 2245
extracted compounds and finding commonalities among them. The work of this paper is a study of
some of theories in the linguistic literature on the subject and a comparison of my classification
with 2 of such theories.
3) Range of semantic theories4
The semantic properties of NCs have been hotly debated in linguistics, with numerous
contradictory views being proposed. At the optimistic end of the scale, it has been suggested that
that there exists a small set of semantic relationships that NCs may imply. Levi (1978) proposes a
1
See http://www.newscientist.com/dailynews/news.jsp?id=ns9999678 for a description of this project.
I actually retained only those NCs for which both words could be found in a medical ontology.
3
Another methodology is example-based which consists in collecting a set of example compounds, most usually
without regard to the source, with the aim of arriving at as large a list of examples as possible. The list that results
contains compounds, but no contexts, so that the theory is based of ‘out of context interpretation’ of the examples. Of
the two theories that I analyze for this paper, Levi (1978) takes this approach, Warren (1978) the corpus-based one.
4
This short review is based on [Lauer 1995]
2
3
theory along these lines. If these theories are correct, they provide a firm basis for computational
interpretation, since they specify a concrete list of possibilities.
In contrast, the most pessimistic views claim that the implicit relationships between head and
modifier are entirely unconstrained. For example, Dowing (1977) performed a series of
psychological experiments to support her argument that the semantics of NC cannot be exhausted
by any finite listing of relationships. However, she does observe that certain types of relationship
are more common than others. If Downing’s hypothesis is true in a strong sense, then noun
compound interpretation requires complete pragmatic context-dependent inference and therefore
any attempt to handle noun compounds will involve detailed manual knowledge coding. However,
noun compound semantics do form certain patterns. There is certainly hope that adopting a more
restrictive theory that relies on these patterns will prove useful, even if there are cases that it is
unable to handle.
Between the two ends of this scale, there exists a range of noun compound establishes
nomenclatures. [Lauer 95] briefly describes four such theories5.
In order of the degree to which they constrain the possible semantics, they are:
•
•
•
•
Levi’s (1978) recoverably deleteable predicates and nominalizations
Leonard’s (1984) typology of compounds in fictional prose
Warren’s (1978) analysis of Brown corpus compounds
Finin’s (1980) role-filling theory, incorporating role nominals.
For this paper I consider Levi’s and Warren’s works. I chose to study [Levi 1978] because she
is cited in virtually every paper on computational analysis on NCs, because her theory that all NCs
can express only a very small set of semantic relations (13) is very attractive and one would like to
believe it and because her theory derives from a theory of language and not from empirical
investigation (like mine) and I am interested in seeing a different approach to this problem. I
decided to study Warren’s thesis for exactly the opposite reason, that is, as she says, her thesis “is
primarily a report of the results of an empirical investigation. In other words, my main interest has
been directed towards finding out facts about the semantic patterns of a particular type of
compound, rather than towards the question of how to account for the compounding process in
accordance with a particular theory of language”. This approach is very similar to mine; she
analyzed 4557 different compounds (while Levi’s collection consists only of about 200 examples)
and classified them according to the covert semantic relation they expressed and her far less
constraining theory seemed, at a first glance, more realistic than Levi’s.
4) Judith N. Levi: The Syntax and Semantics of Complex Nominals
In this Section I briefly review Levi’s theory and in the following one I compare her
classification to mine.
5
I should mention the fact that Lauer’s thesis is about the syntactic analysis of NCs (attachment) and, to a lesser extent,
about the semantic analysis using probabilistic methods.
4
From page 5, “The principal purpose of this book is the exploration of the syntactic and
semantic properties of complex nominals (CNs)6 in English, and the elaboration of detailed
derivations for these forms within a theory of generative semantics. The book also represents an
attempt to incorporate into a grammar of English a model of the productive aspects of complex
nominal formation”. The emphasis is therefore on the formation of NCs. On page 6 she claims:
“One of the most important claims of the present work is that all CNs must be derived by just two
syntactic processes: predicate nominalization and predicate deletion. The analysis of CNs derived
by the latter process will entail the proposal of a set of Recoverably Deletable Predicates (RDPs)
representing the only semantic relations which can underlie CNs. This set whose members are
small in number, specifiable, and probably universal, consists of these nine predicates: CAUSE,
HAVE, MAKE, USE, BE, IN, FOR, FROM and ABOUT.” The analysis of nominalization brings
four types of nominalized verbs (AGENT, PATIENT, PRODUCT and ACT) derived from two
types of underlying structures (Objective and Subjective).
The claims are big: a very small set of (universal)7 relationships, only two syntactic processes.
Moreover, she dismisses the ambiguity as being resolved by semantic, lexical and pragmatic clues:
“The fact that a given CN may be derived by the deletion of any of these predicates, or by the
process of nominalization, means that CNs typically exhibits multiple ambiguity. This ambiguity is,
however, reduced to manageable proportions in actual discourse by semantic, lexical and
pragmatic clues”. Unfortunately, automatic systems cannot take for granted such clues and must
deal with ambiguity (and indeed this is the hardest, unsolved problem of computational linguistics).
4.1) Derivation of CNs by Predicate Deletion
Levi claims that the variety of semantic relationships between head nouns and prenominal
modifiers in NCs is confined within a very limited range of possibilities. A first derivation of CNs
by predicate deletion is proposed to account for the larger part of these possible semantic
relationships for those NCs whose two elements are derived from the two arguments of an
underlying predicate, as in “apple pie” or “malarial mosquitoes”. This small set of specifiable
predicates that are recoverably deletable in the process of CN formation is made up of nine
predicates. These predicates, and only these predicates, may be deleted in the process of
transforming an underlying relative clause construction into the typically ambiguous surface
configuration of the CN.
The NCs derived by predicate deletion are derived by a sequence of transformation in which a
predicated was deleted to form a NC: for example, for viral infection (CAUSE) we have the
following set of transformations
1.
2.
3.
4.
Virus causes infection
Infection is caused by virus
Infection is virus-caused
Infection virus-caused
6
Complex nominals are “expressions with a head noun preceded by a modifying element which in some cases is a
noun, in others what appears to be an adjective” (page 2); “apple cake”, “presidential refusal” and “electric shock” are
examples of CNs. In contrast, I consider only sequences of nouns (for the time being).
7
In Section 6.4 Levi suggests that her principles are likely to reflect linguistic universals that constrain the use and
interpretation of NCs in all human language.
5
5. virus-caused Infection
6. viral infection
Similarly marginal note was derived by note in margin
Examples of CNs derived by deletion of these nine predicates are given in the following table.
RDP8
N1 < direct object of relative
clause
CAUSE1
N1 < subject of relative
clause
CAUSE2
tear gas
disease germ
malarial mosquitoes
traumatic event
mortal blow
drug deaths
birth pains
nicotine fit
viral infection
thermal stress
MAKE (productive;
constitutive, compositional)
MAKE1
MAKE2
honeybee
silkworm
musical clock
sebaceous glands
songbird
daisy chains
snowball
consonantal patterns
molecular chains
stellar configurations
HAVE (possessive/dative)
HAVE 1
HAVE 2
picture book
apple cake
gunboat
musical comedy
industrial area
voice vote
steam iron
manual labor
solar generator
vehicular transportation
soldier ant
target structure
professorial friends
consonantal segment
mammalian vertebrates
government land
lemon peel
student power
reptilian scales
feminine intuition
CAUSE (causative)9
USE (instrumental)
BE (essive/appositional)
IN (locative -spatial or
temporal)
field mouse
morning prayers
marine life
marital sex
autumnal rains
FOR (purposive/benefactive)
horse doctor
arms budget
avian sanctuary
8
9
Recoverably Deletable Predicate
In parenthesis, more traditional terms for the relationships expressed by the RDPs
6
FROM (source/ablative)
ABOUT (topic)
aldermanic salaries
nasal mist
olive oil
test-tube baby
apple seed
rural visitors
solar energy
tax law
price war
abortion vote
criminal policy
linguistic lecture
Table 1
4.2) Derivation of CNs by Predicate Nominalization
The second major group is composed of CNs whose head noun is a nominalized verb, and
whose prenominal modifier is derived from either the underlying subject or the underlying direct
object of this verb.
Levi classifies the CNs first according to categories determined by the head noun, and second,
according to categories determined by the pre nominal modifier. For the first case, Levi proposes
four nominalization types: Act Nominalizations (parental refusal, birth control), Product
Nominalizations10 (musical critiques, royal orders), Agent Nominalizations (city planner, film
cutter) and Patient Nominalizations11 (student inventions, mammalian secretions).
CNs whose head nouns are derived by nominalization (NOM CNs) may also be classified
according to the syntactic source of their prenominal modifier(s).
In this way the data may be divided into two major categories: Subjective NOM CNs, whose
prenominal modifier derives from the underlying subject of the nominalized verb, and Objective
NOM CNs, whose prenominal modifier derives instead from that verb’s underlying direct object.
SUBJECTIVE NOM CNs
Act:
Product:
parental refusal
manager attempts
cell decomposition
judicial betrayal
clerical error
peer judgments
10
Act can be paraphrased by “the act of ..(parents refusing)” while Product Nominalizations seem to represent some
object (usually though not always tangible) that is produced as the result of a specific action or event (outcome) and can
be paraphrased by “ that which is produced by (the act) of criticizing music
11
To clarify the distinction between Product Nominalizations and Patient Nominalizations, Levi presents the following
examples, in which (a) are Products and (b) Patients
a.
The managerial appointments infuriated the sales staff.
b.
The managerial appointees infuriated the sales staff.
a.
We expect to receive the senatorial nominations shortly.
b.
We expect to receive the senatorial nominees shortly.
7
Agent:
Patient:
faculty decisions
papal appeals
royal gifts
student inventions
presidential appointees
city employees
OBJECTIVE NOM CNs
Act:
Product:
Agent:
Patient:
birth control
heart massage
subject deletion
musical criticism
oceanic studies
chromatic analyses
stage designs
tuition subsidies
draft dodger
urban planner
acoustic amplifier
electrical conductor
-
Table 2
This classification is based on the meaning and source of the head noun alone and on the
syntactic source of their prenominal modifier; it does not depend on the semantic relationships
between the 2 nouns.
5) Classification Comparison
In this section, I compare the classification I developed for the medical NCs with Levi’s. I first
map to each of my classes, one (or more?) of hers and then I do the opposite. The first analysis will
allow me to see whether I can describe all of my classes under Levi’s classification (and if not,
when and, possibly, why) and if (and if so, in which cases) I have a one to many mappings. This
will also (hopefully) help me to detect problems in my classification.
The second analysis will allow me to see how broad her classes are and how specific (too?)
mine are and what are the classes of my classification that fall under the same ones in hers.
8
My 40 relations were found by iterative refinement based on looking at the 2245 extracted
compounds and finding commonalities among them. I developed a taxonomy of implicit semantic
relations with 2 levels of abstraction. The frequencies of the categories are very different12.
I have a definition (or paraphrase) for each relation and it was suggested that these definitions
should be different, for example, instead of N2 is such-and-such it might be better to say that the
whole NC is such and such. I agree that this formulation is more appropriate, however, how I will
define these relations will mainly depend on their use in the automatic task of text interpretation. I
still have to address this and for the moment I kept the definitions I already had.
TOP
CLASS
(when
applicable)
Name
Description
and Order
Lexicalized
(240)14
Examples
Levi’s
classifica
tion13
cd8 mab
crh neurons
vitamin k
genotype b
(not
included
in the
analysis)
As lexicalized NCs do not represent regular grammatical process and must be learned on an
item-by-item basis, Levi does not include them in her analysis. Her study is restricted to
endocentric complex nominals and does not consider, for example, metaphorical, lexicalized or
idiomatic meanings. Chapter 7 of [Levi 1978] deals with exception classes, i.e. types of NCs that
are exceptions to her theory and therefore not included (for example, CNs whose modifiers denote
units of measure, and CNs whose adjectival modifiers must be derived from adverbs rather than
nouns15).
12
Classes with very few examples are problematic for the classification algorithm and it would probably be convenient
to incorporate them into other more prolific categories. For this paper, however, I consider all the classes regardless of
the number of examples.
13
RDP stands for Recoverably Deletable Predicate, NON for Nominalization
14
The numbers in parenthesis are the numbers of NCs classified under this class
15
Other examples whose meanings (and hence, derivations) her theory does not predict are:
a.
b.
c.
d.
e.
f.
g.
h.
i.
coffee/leg/breast/pipe/baseball man
cat/morning/tennis person
volleyball/poetry/alfalfa/fitness/sitar freak (fiend, bug)
vodka/shopping/studying/yogurt/sweater binge
lunar/solar/fiscal /academic year
dental/medical/psychiatric/gynecological appointment
iron/bronze/industrial/space age
salad/hatcheck/coffee/copy girl
soap/diamond/brewery/textile heir(ess)
Her claims is that these NCs are formed not by the syntactic processes of predicate deletion or
nominalization but rather by a morphological process equivalent to suffixation
9
predicate
Another problematic category consists of those NCs for which a well-defined relationship could
not be determined16 (they are simply subtypes of one another like guinea pig17).
Subtype
(187)
(NC is
subtype of–
kind of head)
headaches migraine
weight peptides
albino mouse
gypsy woman
guinea pig
RDP: BE
This class can correspond to Levi’s BE. She identifies 3 major semantic subtypes in BE:
compositional (snowball, glass cup, bronze statue), “genus-species” (pine tree, cactus plant, winter
season, murder charge) and metaphorical (mother church, queen bee, finger lakes). My Subtype18
class roughly corresponds to her “genus-species” subtype, the metaphorical one being included in
Lexicalized and the “compositional” in another class defined later (Made of).
ACTIVITY: Consists of 4 subclasses: Activity/Physical process, Ending/reduction, Beginning,
Change, Produce (on a genetic level)
ACTIVITY
Change
(24)
n2 is a change
in/of n1
headache transformation
disease development
tissue reinforcement
tumor development
ventricle enlargement
cell growth
NOM:
Subjective
Act
tissue atrophy
tract stenoses
tissue tropism
?
My classification does not depend on whether the nouns are simply nouns or derived from a
verb by nominalization, rather, only the semantic relationship between the nouns is taken into
account. For this reason, I classify under the same class “disease development” and “tissue
atrophy”. For the nominalized nouns there is a corresponding class under Levi’s schema, namely,
NOM Subjective Act, but I could not find an appropriate RDP class for the other NCs. Levi’s RDP
schema cannot describe NCs such as “tissue atrophy”. The closest classes could be HAVE 2 (“the
tissue has a atrophy” ?) or IN (“atrophy in the tissue”) but this feels unsatisfactory to me. The first
noun is the subjective of the relative clause, but in the RDP classification the only cases in which
this happens are for CAUSE2, MAKE2, HAVE 2 (see Table1) neither of which is applicable here.
These NCs could be well described by the proposition “OF” that Levi does not include.
16
These two categories, Lexicalized and Subtype are very broad and, not surprisingly, they were the worst performers
for the algorithm.
17
This will prevent the system to classify “guinea pig” as a pig that lives in Guinea.
18
I indicate a class of my classification schema underlying the name
10
The same considerations apply to the following class:
Activity/Physical
process (63)
NC is an
activity of N2
type in which
N1 is involved
bile secretion
virus reproduction
infection transmission
bowel movement
headache activity
brain function
bile drainage
headache signs
cell necrosis
NOM:
Subjective or
Objective Act
?
Note that here the first noun can be either the object or the subject of the nominalized verb. For
example, the bile is secreted and the infection is transmitted (Objective Act) whereas the brain
functions and the viruses reproduce (Subjective Act). In the future I’ll probably need to divide this
class accordingly to this distinction.
Ending/reduction
(8)
n2 is ending/
reduction of n1
migraine relief
headache decrease
headache improvement
headache reduction
headache resolution
NOM:
Subjective
Product
Beginning of (1)
n2 is beginning
of n1
headache onset
?
I have only one example for this class but again I would need an OF RDP.
Produce (on a
genetic level) (40)
n2 produces n1
actin mrna
cmv dna
receptor mrnas
tyrosinase gene
chemokine genes
RDP:
MAKE1
CAUSE: Consists of 2 subclasses, Cause12 (where the first noun causes the second one) and
Cause21 (opposite direction). There is a good overlap with Levi’s CAUSE1 and CAUSE2
CAUSE
Cause 12 (102)
n1 is cause of n2
11
tension headache
aids death
RDP:
CAUSE2
Cause 21 (22)
n1 is effect on
body of n2 (n2 is
cause of n1)
automobile accident
heat shock
flu virus
influenza infection
immunodeficiency virus
diarrhoea virus
CAUSE2
RDP:
CAUSE1
PROPERTIES: Consists of 4 subclasses, Characteristic of, Physical property, Damage, Physical
make up
PROPERTIES
Characteristic of
n2 is property
of n1
Physical property
n2 is physical
property of n1
Defect (hindrance,
damage)
n2 defect/
hindrance/typ
e of damage
of n1
n2 is physical
make up of n1
Physical make up
cell immunity
disease prevalence
drug toxicity
headache persistence
gene polymorphism
artery diameter
artery pressure
artery hypoplasia
artery aneurysm
tract abnormalities
blood plasma
bile vomit
RDP:
HAVE2
(OF?)
RDP:
HAVE2
(OF?)
RDP:
HAVE2
(OF?)
RDP:
MAKE2
PURPOSE: Consists of 2 subclasses, Purpose and Function
PURPOSE
Purpose
(65)
Function
(2)
n1 is target of n2
n1 is the function of n2
anti-migraine drug
influenza immunization
flu vaccine
asthma therapy
RDP: FOR
headache medication
asthma treatment
tumor treatment
NOM:
Objective
Act
RDP: FOR
growth hormone
Under Levi’s classification “tumor treatment” and “asthma therapy” would be treated differently.
This would not be appropriate for the system I am developing for the automatic semantic analysis
of text.
12
PERSON AFFLICTED BY DISEASE: Person afflicted and Demographic attributes (of person
afflicted)
PERSON AFFLICTED
BY DISEASE
Person afflicted
(52)
n2 is person
afflicted
with n1
Demographic
attributes (19)
n1 is
demographic
attributes of
person
afflicted
with n2
migraine patient
bmt children
hiv populations
headache society
headache sufferer
aids patient
childhood migraine
childhood headache
infant infection
women migraineur
infant irritability
RDP:
HAVE1
RDP: IN
(or also
HAVE1)
TREAT/STUDY/ RESEARCH: Person/center who treats, Research on, Attribute of study,
Procedure
TREAT STUDY
RESEARCH
Person/center
who treats (19)
n2 is person,
center who
treats n1
Research on (8)
n2 is
research on
n1
Attribute of –
clinical- study
(103)
Procedure (60)
n2 is
procedure
for
d t ti /
13
headache specialist
headache center
diseases physicians
disease associations
headache unit
asthma nurse
asthma researchers
headache study
health research
RDP:
FOR
headache parameter
attack study
headache interview
biology analyses
biology laboratory
cell biology
headache pattern
bowel biopsy
genotype diagnosis
blood culture
headache diagnostics
RDP:
ABOUT
RDP:
ABOUT
RDP: ?
(OF?)
detecting/
studying n1
gallbladder pathology
disease pathophysiology
LOCATION: Located in, Part of, Occupied by
LOCATION
Located in
(274)
n2 is located in
n1
Is part of (5)
n2 is part of n1
Occupied by
(3)
n2 is occupied by
n1
liver abnormalities
brain abscess
breast carcinoma
lip mucosa
liver tumor
artery obstruction
dna element
artery segments
artery territory
brain region
RDP: IN
headache interval
attack frequency
football season
headache period
influenza season
fibrosis stage
morning headache
hour headache
weekend migraine
RDP: ?
(OF?)
RDP: IN
RDP:
HAVE 1
TIME: Time of (12), Frequency/duration/ time of
TIME
Frequency/durati
on/ time of 21
(35)
n2 is
frequency,
time of n1
Time of (4)
n1 is time of
n2 (opposite
of previous
class)
RDP: IN
Here again, I couldn’t find an appropriate RDP. OF would have been appropriate.
MEASURE: Measure of, Standard
MEASURE
Measure of (70)
n2 is a measure
of n1
Standard (5)
n2 is a standard
for n1
14
relief rate
asthma mortality
abortion rate
attack duration
hospital survival
headache criteria
society criteria
gold standard
society standard
hygiene standards
RDP: ?
(OF?)
RDP:
ABOUT
INSTRUMENT: Instrument (12), Instrument (21), Instrument (1)
INSTRUMENT
n1 is used
in/for n2
Instrument 12
(102)
Instrument 21 (4)
Instrument 1 (8)
n2 is used
for n1
n1 is used
injection method
laser surgeries
aciclovir therapy
vibration massage
lamivudine therapy
chloroquine treatment
ginseng treatment
laser treatment
biopsy needle
drug use
heroin use
internet use
tobacco use
drug utilization
RDP:
USE
NOM:
Subjectiv
e Act
RDP:
FOR
NOM:
Subjectiv
e Act
Here, like for “tumor treatment” and “asthma therapy” in Purpose (classified respectively under
Objective Act and FOR) “laser treatment” and “laser surgeries” (or “aciclovir therapy”) would be
classified differently (Subjective Act and USE) under Levi’s classification.
OBJECTIVE: Objective, Misuse
OBJECTIVE
Objective (33)
n1 is acted on by
n2 (n2 affects n2,
n2 acts on n1)
bowel transplantation
kidney transplant
liver transplantation
cell death
liver metastases
cell apoptosis
infant death
Misuse (11)
n2 is misuse(r) of
n1
drug abuse
substance abuse
ergotamine abuser
NOM:
Objective
Act
RDP: ?
(OF?)
NOM:
Objective
ACT
acetaminophen overdose
RDP: ?
(OF?)
15
Here again the RDP relations cannot describe, in my view, “infant death” nor “acetaminophen
overdose” for which OF would be a good description.
Other classes, with no superordinates.
Topic (46)
Modal (14)
Material (Made
of) (17)
Subjective (19)
Binds (4)
Activator 12 (6)
Activator 21 (4)
Inhibitor (13)
n1 is the
topic of n2
(n2 is
concerned
with n1)
n1 is how
n2 is done
n2 is
composed/
made by n1
n1 is subject
of n2
n2 binds to
n1
n1 activates
n2
(n1 is
activator of
n2)
n2 activates
n1
(n2 is
activator of
n1)
time visualization
headache questionnaire
tobacco history
vaccination registries
health education
RDP: ABOUT
emergency surgery
emergency nephrectomy
vacuum aspiration
trauma method
agarose gel
aloe gel
gelatin powder
gel capsule
latex gloves
calcium stones
RDP: BE
textile dust
formaldehyde vapors
headache presentation
headache response
FROM19
bile metabolism
drug metabolism
heat transfer
headache response
receptor ligand
carbohydrate ligand
receptor agonist
pain signal
food allergy
food infection20
receptor agonist
receptor antibodies
headache precipitant
headache trigger
receptor antagonist
19
RDP: HAVE 1
NOM:
Subjective Act
RDP: ? (OF)
RDP: FOR
RDP: ? (OF)
NOM:
Subjective
Product
RDP: FOR
NOM
Objective
Agent
RDP: ? (OF)
These nouns compounds could actually be divided into a FROM class and a MADE OF. For example, agarose gel,
aloe gel, and gelatin powder can be MADE OF and textile dust and formaldehyde vapors can be FROM.
20
“food infection” and “food allergy” are also Cause12. Many of my NCs have been classified with 2 (or more) labels.
It is in fact important to associate with a given NC all the possible meanings. For example, “eyelid abnormalities” is
both Located in and Defect, (while “growth abnormalities” is only Defect) and “tissue atrophy” is Subjective, Change,
Located in and Defect.
16
adrenoreceptor blockers
receptor blockers
virus inhibition
influenza prevention
NOM:
Objective
Agent
Food infection and food allergy are also Cause12. Many of my NCs have been classified with 2
(or more) labels. It is in fact important to associate with a given NC all the possible meanings. For
example, “eyelid abnormalities” is both Located in and Defect, (while “growth abnormalities” is
only Defect) and “tissue atrophy” is Subjective, Change, Located in and Defect.
Warren includes an interesting discussion about ambiguities. She says that ambiguity arises
because the semantic features of the constituent scan satisfy more than one semantic pattern and
she distinguishes three different kinds of ambiguous compounds (see page 67 of Warren 78).
In my application, I assume that all the different readings are possible, postponing the
disambiguation to the analysis of the whole text.
This analysis shows that the NCs (of my collection) belonging to the same semantic class
correspond to only one of the Levi relations21. I would hope that this is an indication of the internal
homogeneity of the classes I defined.
5.1) Exclusion of “OF”
I was surprised by the exclusion of the preposition “OF” from the set of recoverably deletable
predicates (especially because I wasn’t able to classify about 250 of my NCs).
On page 97, Levi lists some examples of CNs derived by FOR deletion (“bird sanctuary” from
“sanctuary FOR birds”, “nose drops” from “drops FOR nose”) and on page 95, examples of CNs
derived by IN deletion (“office friendships” from “friendships IN the office”, “marginal note”
from “note IN the margin”). It's not clear to me why she excludes CNs derived by OF deletion:
“headache activity” from “activity OF headache”, “brain function” from “function OF the brain”,
“cell growth” from “growth OF the cell”. On page 161, she discusses about the membership
requirements for the RDP set but she mainly defends her choice for the nine predicates rather than
explaining the reasons for the exclusion of others22. She claims that the predicates she proposes
“seem to embody some of the most rock-bottom-basic semantic relationships expressible in human
language; assuming that there are such things as semantic primes, these nine are surely
outstanding candidates”. This seems quite vague to me. She goes on saying that all the nine
predicates manifest “surface invisibility” and “grammaticality”. She defines “surface invisibility”
as a characterization of those predicate “that can be either deleted or incorporated before they
reach the surface without leaving a surface trace”. So for example the sign “employees only”
means “FOR employees only”, “Adar is as talented as Ariella” means “Adar is as talented as
Ariella IS” and “Max wants a lollipop” means “Max wants (TO HAVE) a lollipop”. I am not sure if
21
Except for the cases in which the different derivations of the nouns determines the classification as both NOM and
RDF (as we saw for Activity/Physical process, Change). I don’t deem this important for the semantic analysis I am
interested in.
22
She never explicitly mentions the preposition “OF”.
17
this applies to the preposition OF, but one example comes to mind: in a medical laboratory with
plates containing biopsies, a label “bowel” would mean “biopsy OF bowel”, and the label “brain”
would stand for “biopsy OF brain”.
The notion of grammaticality means that these predicates “are often expressed not by
independent lexical items but by bound grammatical morphemes”. For example, some predicates in
the RDP set are grammatized into marking of nouns, i.e. into case endings, and for example, in the
Proto-Indo-European system, CAUSE and FROM surface as ablative, FOR as dative, USE as
instrumental, IN as locative, MAKE as accusative, BE as nominative and HAVE as possessive
genitive. It is important to note here that the genitive case has been described only as possessive
genitive that can indeed be paraphrased by HAVE23. However, the genitive is not used only to show
possession but can also be indefinite, objective, partitive, descriptive24 in which cases the
preposition is OF25.
For these reasons, I would argue that the exclusion of “OF” is not well motivated.
I now compare the two classifications in the other direction.
Levi’s
classification
CAUSE1
CAUSE2
MAKE1
MAKE2
HAVE1
HAVE2
USE
BE
IN
My classification
•
•
•
•
•
•
•
•
•
•
•
•
•
•
Cause 12
Cause 21
Produce (on a genetic level) (Activity)
Physical make up (Properties)
Person afflicted (Person afflicted by disease)
Occupied by (Location)
Material, made of
Characteristic of (Properties)
Physical property (Properties)
Defect (hindrance, damage) (Properties)
Instrument 12 (Instrument)
Subtype of
Modal
Demographic attributes (Person afflicted by
23
And this was in fact the case for the classes Characteristic of, Physical property, Defect for which both HAVE2 and
OF are good descriptions.
24
(Example taken from http://www.nd.edu/~archives/gen.htm: Latin Dictionary and Grammar Aid)
genitive charge.He is guilty OF MURDER.
genitive. indefinite. A box OF [some] WEIGHT.
genitive. description. He was a man OF NO CHARACTER.
genitive. material. A statue made OF SILVER.
genitive. objective. He had no fear OF DEATH.
genitive. partitive. One out OF A MILLION.
25
For example, I would define “bowel biopsy” and “acetaminophen overdose” as a objective genitives and “asthma
mortality” as description genitive.
18
FOR
FROM
ABOUT
NOM: Subjective
Act
NOM: Subjective
Product
NOM: Subjective
Patient:
NOM: Objective
Act
NOM: Objective
Product
NOM: Objective
Agent
•
•
•
•
•
•
•
•
•
•
•
disease)
Located in (Location)
Is part of (Location)
Time of (Time)
Function (Purpose)
Purpose (Purpose)
Person/center who treats
(Treat/study/research)
Instrument 21 (Instrument)
Binds
Activator 21
Material, made of
Research on (Treat/study/research)
Attribute of clinical study
(Treat/study/research)
Procedure (Treat/study/research)
Topic
Standard (Measure)
Misuse (Objective)
Activity/physical process (Activity)
Change (Activity)
Instrument 12 (Instrument)
Instrument 1 (Instrument)
Subjective
Ending/reduction (Activity)
Activator 12
•
•
•
Purpose (Purpose)
Misuse (Objective)
Objective (Objective)
•
•
Activator 21
Inhibitor
•
•
•
•
•
•
•
•
•
•
•
•
An obvious observation is that to most of Levi’s classes correspond many categories of my
classification schema (and very different ones). For example Demographic attributes, Located in, Is
part of and Time of all correspond to the RDP IN. This is hardly surprising; the purposes of the two
classification schemas are very different and Levi justifies this vagueness (as she puts it on page
82) “on three major grounds. First, the identification of the RDP set which they constitute allows us
to make highly accurate predictions about which semantic structures can underlie CNs and which
19
can not; second, the specification of general predicates serves to include a number of more specific
relations that might otherwise have to be enumerated individually with a concomitant loss of
generality; third, this analysis creates no more confusion or vagueness than is observable in
periphrastic equivalents with overt lexicalizations of the RDPs (e.g., x has y, x is in y, x is for y),
where some fundamental distinctions are made but where a great deal is typically left unspecified.”
She lists as an example, some of the possible more specific relationships for IN: inhabits, grow-in,
according-to, during, found-in and occur-in. She claims that: “The positing of six or more separate
groups, however, would obscure the generalization that is captured in the present theory by
positing a single predicate IN for both spatial and temporal, both concrete and abstract location.
[...] we must recognize that (a) it cannot be accidental that all of these paraphrases must in any
case include the locative preposition IN, and (b) the more specific verbs are wholly predictable on
the basis of the semantic characteristics of the surface components of the CN. For example, almost
any CN whose head noun denotes an animate creature and whose modifier is a place noun will be
a suitable member for the INHABIT group. [...] Since, however, these differences not only are
predictable on independent grounds but also seem to have no effect whatsoever on the formation of
the corresponding CNs, our description need not include this of specificity.”
The important point is that the purpose of her work is the formation of CNs, while mine is the
semantic analysis of CN by a computer. In order for the computer to predict the specific
relationships on the basis of the semantics of the nouns, I would need to define the specific readings
anyway and to add rules such as:
I.
II.
If the (high level) relationship is IN
If the first noun belongs to the class A (to be defined..) and the second noun
to the class B (also to be defined..)
THEN Ÿ IN is INHABITS
In real life (unfortunately?) rules are always flawed and also it would be difficult to define the
memberships; I think an approach in which NCs are directly mapped to more specific categories is
undoubtedly superior.
An appropriate question would be then, why consider Levi’s work if the goals are so different?
As I said, I wanted to analyze a linguistic theory on this subject and, as already mentioned, most of
the papers on computational linguistics for the analysis of NCs cite and, more importantly, some of
them justify their classification schemas as based on Levi’s (for example [Vanderwende 1994]).
Based on this analysis, I argue that this justification doesn’t hold and different classification
schemas are needed.
6) Warren, Beatrice. Semantic Patterns of Noun-Noun Compounds
Warren’s theory [Warren1978] is far less constraining. Her purpose was to determine whether
there exists a limited number of semantic relations between the constituents of NCs, and to
determine the nature of these relations. As I said in the introduction, [Warren1978] is primarily a
report of the results of an empirical investigation. She made a comprehensive study of compound
20
nouns in 360 thousand words of the Brown corpus26 and she manually extracted 4557 different
compounds; she then developed a taxonomy of implicit semantic relations, with four hierarchic
levels of abstraction. Before getting into the semantic relations, I’d like first to say a few words
about some other aspects of NCs that Warren describes in her thesis. Chapter I is a brief but fairly
comprehensive account of what is known about the structure of NCs in general. Warren enumerates
4 different structures for NCs:
1.
2.
3.
4.
5.
Morphological Structure
Syntactic Structure
Phonological Structure
Information Structure
Semantic Structure
1. Morphological Structure
Warren says that the most important point is that, as a rule, the first noun does not take inflectional
endings, such as plural or genitive suffices, but then she lists exceptions to this rule: plural firstwords (sports center, narcotics law) and first-words with genitive –s (driver’s seat, women’s
colleges)27.
2. Syntactic Structure
Warren discusses several ways of parsing NCs with more that 2 constituents28 (that she calls
compounds-within-compounds) and shows the frequency and distributions of various
combinations.
3. Phonological Structure
Warren talks about the stress pattern and identifies 2 fundamental stress patterns: fore-stress (or
first-stress) and double-stress. Fore-stress involves primary stress on the first noun and secondary
stress on the second noun (cówbòy). Double-stress involves heavy stress on both the first and
second constituents: stóne wàll. She also observes that certain semantic relations are connected
with certain stress patterns and that stress may be used to signal syntactic and semantic differences:
for example, compounds expressing material-artifact are usually pronounced with double stress
(bronze screws, brass wire) and compounds expressing Purpose or Origin have been assigned forestress.
4. Information Structure
Elements in a sentence are not equally important. Some elements are, communicatively, more
important than others and have therefore a different degree of communicative dynamism (CD).
Elements that have a low degree of CD within a sentence are called thematic, and those that carry a
high degree of CD, are called rhematic29. Warren suggests that the same analysis is possible for
NCs and that the two constituents of a NC do not normally have equal informative force and that in
26
In particular, her collection comes from “The standard Corpus of Present-Day Edited American English” assembled
at Brown University during 1963 and 1964.
27
Warren includes a short discussion about the controversy regarding such constructs and she identifies the semantic
relations of these compounds as either Purpose (driver’s seat) or Possession (pastor’s cap).
28
For example spider-leg pedestal table is left-branching within a left-branching structure, whereas home workshop
tool is right-branching within a left-branching structure
29
Warren is referring here to the Prague school of linguistics (Fibras)
21
NCs the first position is reserved to the element with the highest degree of CD; the “information
structure” of a compound is therefore that the rhematic elements tends to precede the thematic
element30.
5. Semantic Structure
Comment-topic structure: NCs can be described as having a bi-partite structure, in that they consis
of a topic and a comment. The topic is what we are talking about (represented by the head of the
NC) and the comment is what we say about the comment. The function of the comment is to make
the reference of the topic more precise. It restricts the reference scope of the topic by either
classifying what kind of topic we have in mind, or by identifying which of a number of known
topics we are referring to. The first case refers to NCs in which the comment has classifying
function (paraphrasable by: “that kind of –topics- that (verb) –comment-”)31, and the second one, to
NCs in which comment has identifying function (paraphrasable by: “that -topics- that (verb) (some
specific) –comment-”).32 Warren hypothesizes that compounds with identifying comments don’t
allow generic reference, in contrast to compounds with classifying comments. Within the semantic
structure, Warren lists also Idiomatization, Established and Novel Compounds33 and finally the
Partecipant Roles: it is obvious that the constituent of the NC are semantically related and that the
semantic relations are not the same: it is indeed the purpose of [Warren 1978] to established
whether there is a limited number of possible types of semantic relations between the constituents
of compounds and what these semantic relations are. Accordingly to Warren’s results, there are six
major types of semantic relations than can be expressed in compounds:
1) A is something that wholly constitutes B34, or vice versa: this is the CONSTITUTE
class divided into Source-Result, Result-Source and Copula classes. Some examples:
metal coupon, paste wax, student group.
2) A is something of which B is a part or a feature or vice versa: POSSESSION class
divided into Part-Whole, Whole-Part and Size-Whole classes. Examples: board
member apple pie.
3) A is the location or origin of B in time or space: LOCATION. Participant roles are
Place-OBJ, Time-OBJ, Origin-OBJ. Examples: coast road, Sunday school, seafood.
4) The comment indicates the PURPOSE of B. Participant roles: Goal-Instrumental.
Examples: pie tin, tablecloth, football.
5) The comment indicates the activity or interest with which B is habitually concerned.
Participant roles: ACTIVITY-ACTOR. (cowboy, fire department)
6) RESEMBLANCE. A indicates something that B resembles. Participant roles are:
Comparant - Compared. Examples: egghead, bullet head.
The semantic relations can often be paraphrased35.
30
My results for the automatic classification seem to contradict this hypothesis (see Rosario 2001).
Like bilge water
32
Like: (the) bathroom door, the Podger cat (where Podger is a family name)
33
Established compounds are those combination that have become the generally accepted word for their referent (like
toothpaste instead of mouth hygiene paste) and novel compounds are combination that have been created for the
occasion
34
In Warren’s notation, A indicates the first noun, B the second one.
35
Paraphrasing does not work with idiomatization.
31
22
MAJOR CLASS
CONTITUTE
POSSESSION
LOCATION
PURPOSE
ACTIVITY-ACTOR
RESEMBLANCE
SUB CLASSES
Source-Result
Result-Source
Copula
Part-Whole
Whole-Part
Place-OBJ
Time-OBJ
Origin-OBJ
Goal-Instrumental
Activity-Obj
Comparant- Compared
PARAPHRASE
OF
IN
OF
WITH
IN, AT, ON
IN, AT, ON
FROM
FOR
LIKE
EXAMPLES
metal sheet
sheet metal
Girl friend
eggshell
armchair
coast road
Sunday school
seafood
pie tin
cowboy
cherry bomb
These classes are further subdivided. I won’t describe the classes in detail (there is a whole
chapter for each class) but in the following pictures I report the hierarchical structures for these
semantic classes and in the next section I’ll describe some of the classes in greater detail.
CONSTITUTE
Figure 1
23
POSSESSION
Figure 2
LOCATION
Figure 3
24
PURPOSE and ACTIVITY-ACTOR
Figure 4
Warren does not include NCs in which one noun is deverbal (the nominalizations in Levi’s
terminology). Her motivation for doing so is that a main goal of her work is to find out what
semantic relations between two nouns can be left unexpressed, or what verb connecting the two
nouns may be discarded. In compound containing a deverbal noun there is no verb deleted and the
type of semantic relation that exists between the constituents is explicit while this relation is
implicit in non-verbal compounds. An important consequence of this distinction is that
nominalizations do NOT appear to be restricted as to the type of relations that may exists between
the constituents (at least not in parallel fashion). Indeed, there is no need for such a restriction since
the type of relation between the nouns is explicitly indicated by the deverbal noun. In contrast,
there is a limited number of relations that are left unexpressed for non-verbal NCs.
As I already said, I don’t distinguish between verbal and non-verbal NCs (although some of the
classes contain mainly one type or the other) and as we’ll see for the classes with mainly verbal
NCs, I don’t find a correspondent class in Warren’s classification.
If the fact that in verbal NCs the relationships are explicit in the verb is true, in my application,
I could first divide NCs into verbal and non-verbal (with a binary classification system) and then
analyze the two classes separately. This would make the system much more complicated but it
would be useful if the number of possible types of relations in verbal NCs is indeed unconstrained
and if we knew how to infer the relation given the verb (which is another problem in it own!).
[Lapata 00] proposes algorithms for the classification of nominalizations according to whether the
modifier is the subject or the object of the underlying verb expressed by the head noun; my aim is
to have more specific relations, sufficiently general to cover a significant number of noun
compounds, but that can be domain specific enough to be useful in analysis and subject-object is
not specific enough.
7) Classification Comparison
In this section, I compare the classification I developed for the medical NCs with Warren’s.
25
TOP
CLASS
(when
applicable)
Name
Description
and Order
Lexicalized
(240)
Examples
Warren’s
classification
cd8 mab
crh neurons
vitamin k
genotype b
csf culture
Proper names
Warren says that the only NCs in her collection that can be said to lack a semantic relation between
the members are compounds that in some sense are proper names, like for example X-ray, Y-region,
Q-fever. She does not include them in the classification schema
Subtype
(187)
(NC is
subtype of–
kind of head)
headaches migraine
albino mouse
gypsy woman
CONTITUTE
(COPULA)
Copula NCs are those noun compounds in which both members cane be said to be two alternative
“names” for the same referent (girl friend, cypress tree). She further divides the copula class into
Attributive NCs (NCs that represent a cross-classification, like girl friend, houseboat) and
Subsumptive NCs (those that are based on a Species genius relation, like cypress tree, panel board.
(See Figure1). I don’t distinguish between Attributive and Subsumptive.
ACTIVITY
Activity/Physical
process (63)
NC is an
activity of N2
type in which
N1 is involved
bile secretion
virus reproduction
infection transmission
headache activity
brain function
cell necrosis
headache signs
bile drainage
?
I couldn’t find a class for these NC s which in fact are mainly nominalizations that are not covered
by Warren. The closest class could be ACTIVITY – ACTOR (Figure 4) of crime syndacate but
Warren says that for all the NCs in this class, the head refers to a human being, a group of people or
an organization which is not the case in my class Activity/Physical process and perhaps more
importantly, the order is different: the paraphrases she proposes for these NCs is “the head is
habitually concerned with the modifier” while I have just the opposite. Some of these NCs could
also be thought as LOCATION (brain function bile drainage) but in her LOCATION class, the
26
heads (called also topics) can only be objects (See Figure 3) while in my case, the heads are really
activities, not objects. The same comments apply to the following classes
n2 is a change
in/of n1
headache transformation
disease development
tissue reinforcement
tumor development
ventricle enlargement
cell growth
tissue atrophy
tract stenoses
tissue tropism
Ending/reduction
(8)
n2 is ending/
reduction of n1
migraine relief
headache decrease
headache improvement
?
Beginning of (1)
n2 is beginning
of n1
headache onset
?
Change (24)
?
Again, we have here mainly nominalizations.
CAUSE: Consists of 2 subclasses, Cause12 (where the first noun causes the second one) and
Cause21 (opposite direction).
CAUSE
Cause 12 (102)
n1 is cause of n2
Cause 21 (22)
n2 is cause of n1
tension headache
aids death
automobile accident
heat shock
flu virus
influenza infection
immunodeficiency virus
(LOCATION,
Origin-Object)
Causer-Result
(Just mentioned)
It interesting to note here that while for my class Cause 12 there is a corresponding class CauserResult of hay fever (Figure 3), Warren just mentions that “there are some compounds where the
topic can be assigned the role Causer: disease agent, lifeblood” (page189), but she found only two
NCs of this type does not include this class in her classification.
27
PROPERTIES
cell immunity
disease prevalence
drug toxicity
headache
persistence
gene polymorphism
artery diameter
artery pressure
BELONGING
TO (Whole-Part)
(Whole-feature)
artery hypoplasia
artery aneurysm
tract abnormalities
BELONGING
TO (Whole-Part)
(Whole-feature)
Characteristic of
n2 is property
of n1
Physical property
n2 is physical
property of n1
Defect (hindrance,
damage)
n2 defect/
hindrance/typ
e of damage
of n1
Physical make up
n2 is physical blood plasma
make up of n1 bile vomit
bile salt
BELONGING
TO (Whole-Part)
(Whole-feature)
LOCATION
(Origin-Obj)
Place of OriginOBJ
Warren describes the Whole-feature class as the class of NCs in which B represents an inherent
delimitable “part” of A. She further subdivides this class into Object-Quality (room temperature),
Object-Extension (particle size) and Object-Abrstact Shape (crime trend) (see Figure 2). My
classes don’t reflect these subclasses but I think that “Whole-feature” is a good description for
them.
PURPOSE
Purpose
(65)
anti-migraine drug
influenza immunization
flu vaccine
asthma therapy
n1 is target of n2
PURPOSE
(GoalInstrument)
headache medication
asthma treatment
tumor treatment
Here also, I think that Warren’ s class Goal-Instrument is similar to mine, but it’s subclasses are
less so although maybe “primary relation involves Instrumental” can work.
PERSON AFFLICTED BY DISEASE: Person afflicted and Demographic attributes (of person
afflicted)
PERSON AFFLICTED
BY DISEASE
Person afflicted
n2 is person
28
migraine patient
BELONGING
(52)
afflicted
with n1
bmt children
hiv populations
headache society
headache sufferer
aids patient
TO (Part Whole), (PartObj)
Belonging Possessor
Demographic
attributes (19)
n1 is
demographic
attributes of
person
afflicted
with n2
childhood migraine
childhood headache
infant infection
women migraineur
infant irritability
BELONGING
TO (WholePart),
Possessor Belonging
Belonging –Possessor (of gunman) and Possessor –Belonging (family estate) are opposite classes
within the BELONGING TO major class (see Figure 2). Belonging –Possessor is sub-class of Part
–Whole while Possessor –Belonging is sub-class of Whole- Part. This is logical, however, the
results is that these two classes that are the natural opposite of each other are “located” in two
positions in the hierarchy that are quite apart from each other. I found this quite confusing. Just
looking at the hierarchy is not immediately evident that these two classes are so close related.
TREAT/STUDY/ RESEARCH: Person/center who treats, Research on, Attribute of study,
Procedure
TREAT STUDY
RESEARCH
Person/center
who treats (19)
n2 is person,
center who
treats n1
Research on (8)
n2 is
research on
n1
headache specialist
headache center
diseases physicians
disease associations
headache unit
asthma nurse
asthma researchers
headache study
health research
headache parameter
attack study
headache interview
biology analyses
biology laboratory
cell biology
headache pattern
Attribute of –
clinical- study
(103)
ACTIVITY –
ACTOR (A is
activity
performed by B)
CONTITUTE
(Source Results)
SubjectMatter-Whole
CONTITUTE
(Source Results)
SubjectMatter-Whole
NCs in the Subject-Matter-Whole class can be paraphrased as “B that concern A” (gangster story,
tax issue)
29
LOCATION
Located in
(274)
n2 is located in
n1
Is part of (5)
n2 is part of n1
liver abnormalities
brain abscess
breast carcinoma
lip mucosa
liver tumor
artery obstruction
dna element
artery segments
LOCATION
(Place-Object)
(ghetto street,
school friends)
BELONGING TO
(Whole -Part)
Whole -Discrete,
Integral Part
(like spoon handle
and hillside)
For the Whole -Discrete, Integral Part NCs, B is a discrete delimitable part of A.
TIME: Time of (12), Frequency/duration/ time of
TIME
Frequency/durati
on/ time of 21
(35)
n2 is
frequency,
time of n1
Time of (4)
n1 is time of
n2 (opposite
of previous
class)
headache interval
attack frequency
football season
headache period
influenza season
fibrosis stage
morning headache
hour headache
weekend migraine
BELONGING TO
(Part- Whole)
OBJ-Time (golf
season)
BELONGING TO
(Size- Whole)
Duration-Whole
(3-day affair)
Also:
LOCATION,
Time-OBJ
(weekend guests,
Sunday paper)
For the “Time of” relation, Warren distinguishes NCs for which the comment indicates when the
topic takes place (morning concert) (LOCATION) with NCs for which the comments indicates the
duration of the topic (75-minute concert) (BELONGING TO -Size- Whole). I agree with this
distinction, however, (like for the Possessor-Belonging case) I find the classification of such close
classes into two different major classes quite confusing.
30
MEASURE: Measure of, Standard
MEASURE
Measure of (70)
n2 is a measure
of n1
relief rate
asthma mortality
abortion rate
attack duration
hospital survival
Standard (5)
n2 is a standard
for n1
headache criteria
society criteria
gold standard
society standard
hygiene standards
BELONGING TO
(Whole-Part)
Whole-Feature.
Could be OBJQuality of room
temperature or
OBJ-Extension of
particle size
BELONGING TO
(Whole-Part)
Whole-Feature
INSTRUMENT: Instrument (12), Instrument (21), Instrument (1)
INSTRUMENT
Instrument 12
(102)
n1 is used
in/for n2
Instrument 21 (4)
n2 is used
for n1
Instrument 1 (8)
n1 is used
injection method
laser surgeries
aciclovir therapy
vibration massage
lamivudine therapy
chloroquine treatment
ginseng treatment
laser treatment
biopsy needle
transport proteins
drug use
heroin use
internet use
tobacco use
drug utilization
COPULA,
Source -Result
PURPOSE
(GoalInstrument)
?
Instrument 12 could correspond to COPULA, Source –Result but looking at the sub-classes of
Source –Result (See Figure 1) I can’t find an appropriate one (the closest one is perhaps MaterialArtefact of clay bird and brass wire).
OBJECTIVE: Objective, Misuse
31
OBJECTIVE
Objective (33)
Misuse (11)
n1 is acted on by
n2 (n2 affects n2,
n2 acts on n1)
n2 is misuse(r) of
n1
bowel transplantation
kidney transplant
liver transplantation
cell death
liver metastases
cell apoptosis
infant death
drug abuse
substance abuse
ergotamine abuser
?
?
acetaminophen overdose
These two classes do not have a correspondence in Warren’s classification, again for the fact that
the NCs are mainly verbal-compounds. Perhaps, “PURPOSE Primary relation involves location”
(see Figure 4) could describe some NCs like kidney transplant, liver transplantation although her
examples for this class are quite different: water bucket, tablecloths.
Topic (46)
n1 is the
topic of n2
(n2 is
concerned
with n1)
time visualization
headache questionnaire
tobacco history
vaccination registries
health education
Modal (14)
n1 is how
n2 is done
Material (Made
of) (17)
n2 is
composed/
made by n1
emergency surgery
vacuum aspiration
trauma method
agarose gel
aloe gel
gelatin powder
gel capsule
latex gloves
calcium stones
textile dust
formaldehyde vapors
CONTITUTE,
Source-Results,
Non Material
Substance-Whole,
Subject-MatterWhole (detective
story)
CONTITUTE,
Source-Results
CONTITUTE,
Source-Results,
Material-Artefact
or also
BELONGING TOPart-Whole, PartObj
For Material, we would have the CONTITUTE for NCs in which what is indicated by one member
is what wholly constitutes the other (clay bird), however if the substance is only part of the whole
we should use the BELONGING TO class.
32
Subjective (19)
Binds (4)
Activator 12 (6)
headache presentation
headache response
bile metabolism
drug metabolism
heat transfer
headache response
receptor ligand
n2 binds to n1
carbohydrate ligand
receptor agonist
n1 activates n2
(n1 is activator of pain signal
food allergy
n1 is subject of
n2
BELONGING
TO (WholePart) Possessor
- Belonging)
?
?
n2)
Activator 21 (4)
Inhibitor (13)
food infection
receptor agonist
receptor antibodies
headache precipitant
headache trigger
receptor antagonist
adrenoreceptor blockers
receptor blockers
virus inhibition
influenza prevention
n2 activates n1
(n2 is activator of
n1)
?
?
The main problem I found in comparing my classification with Warren’s was for those relations
that have mainly verbal compounds that Warren does not include. I stress again the fact that non
including such compounds is not a viable option for my problem. It would be interesting to study
whether it is indeed straightforward to infer the relations from the verbs and what are the
“mechanisms” to do so. However, this would not solve the problem that I want headache
transformation and tissue atrophy to be classified as having the same semantic relation. Warren
acknowledges this fact and notice that her distinction would bring her to include sugar bowl while
excluding sugar container as a verbal combination in spite of the fact that the two NCs express the
same semantic relationships.
Other differences were due to the very different nature of our corpora. I find her classification
very detailed and I especially liked how she defined the sub-classes. I wasn’t always happy with the
organizations of her classes. I already mentioned the fact that she puts Possessor-Belonging and
Belonging-Possessor and Duration-Event (her Duration-whole) and Time-Event (her Time-Obj)
quite apart in the hierarchy, obscuring therefore the similarity of these relations. The fact that
Duration-Event is a size-whole (and therefore a BELONGING TO) kind of relationships was more
relevant in her classification than its similarity with Time-Event. If we don’t use the hierarchy but
only the classes in their own this is not very important, however, if we do want to take advantage of
the hierarchical structure, I think that the fact that 3-day affair ends up being a BELONGING
relation while Saturday-Sunday affair is a LOCATION relation can be problematic.
33
References
[Dowing 1977] Dowing, Pamela Ann, 1977. On the creation and Use of English Compound
Nouns. Language, 53:810-842.
[Finin, 1980] Finin, Tim 1980. The semantic Interpretation of Compound Nominals. PhD
Thesis, University of Illinois, Urbana, IL.
[Humphreys 1998] Humphreys et al. 1998. The unified medical language system: An
informatics research collaboration. Journal of the American Medical Informatics Association
5(1):1--13.
[Lapata, 2000] Maria Lapata, The Automatic Interpretation of Nominalizations, AAAI
Proceedings 2000.
[Lauer, 1995] Mark Lauer. Designing Statistical Language Learners: Experiments on Noun
Compounds. PhD Thesis.
[Leonard 1984] Leonard Rosemary, 1984. The Interpretation of English Noun Sequences on
the Computer. North-Holland,Amsterdam.
[Levi, 1978] Judith Levi, 1978. The Syntax and Semantics of Complex Nominals. Academic
Press, New York.
[Lowe 1994] Lowe and Barnett 1994. Understanding and using the medical subject headings
(MeSH) vocabulary to perform literature searches. Journal of the American Medical Association
(JAMA), 271(4):1103--1108.
[Rosario 2001] Barbara Rosario and Marti Hearst, 2001. “Classifying the Semantic Relations in
Noun Compounds via a Domain-Specific Lexical Hierarchy”. To appear in the Proceeding of
EMNLP 2001 (can be found at: http://www.sims.berkeley.edu/~rosario/papers.html)
[Swanson97] Don R. Swanson and N. R. Smalheiser, 1997. “An interactive system for finding
complementary
literatures:
a
stimulus
to
scientific
discovery”
Artificial Intelligence, 91:183-203.
[Vanderwende 1994] Lucy Vanderwende 1994. Algorithm for automatic interpretation of
noun sequences. Proceedings of COLING-94, pages 782—788.
[Warren, 1978] Warren, Beatrice, 1978. Semantic Patterns of Noun-Noun Compounds.
Gothenburg Studies in English 41, Gothenburg, Actr Universitatis Gothoburgensis.
34