Powerpoint templates for scientific poster

On Domain Minimization
Prepositional Phrase Ordering Revisited
Daniel Wiechmann — RWTH Aachen University
Arne Lohmann — Leibniz Universität Hannover
METHODS & RESULTS
INTRODUCTION
Language users have a choice concerning the relative ordering of
prepositional phrases.
Efficiency-based accounts (Hawkins 1994, 2000, 2004, also
Gibson 1998, 2000), assume that speakers prefer arrangements
that have optimal processing properties. The principle of domain
minimization (Hawkins 2004) predicts that more efficient variants
are preferred in proportion to the minimization difference between
the competing sequences.
Values < 0 mean
actual order is less optimal
-15
-10
The results show that both syntactic (PCD) and semantic
(LDD) domain minimization significantly influence the
order of prepositional phrases.
1226
1191
1156
1121
1086
1051
1016
981
946
911
876
841
806
771
736
701
666
631
596
561
526
491
456
421
386
351
316
281
246
211
176
141
106
71
36
1
i. The astronomer gazed [ into the sky ] [ through his telescope ].
ii. The astronomer gazed [ through his telescope ] [ into the sky ].
-5
Our comparison of the relative strengths of the two
principles suggests that Hawkins’ (2000) claim of a
generally higher importance of PCD (see above) should
be revised:
Values > 0 mean
actual order is more optimal
0
5
Values < 0 mean
actual order is less optimal
iii. John counted [ on your support ] [ in his old age.]
SUBJ Vdep
PPdep
PPindep
iv. John counted [ in his old age] [ on your support. ]
SUBJ Vdep
PPindep
PPdep
Verb entailment test
Does [X V PP] entail [X V]? If so, then assign Vindep.
If not, then assign Vdep & PPdep
15
20
25
1255
1222
1189
1156
1123
1090
1057
1024
991
958
925
892
859
826
793
760
727
694
661
628
595
562
529
496
463
430
397
364
331
298
265
232
199
166
133
100
67
34
1
Syntactic domain minimization principle:
Minimize Phrasal Combination Domains (PCD)
Semantic domain minimization principle:
Minimize lexical dependency domains (LDD)
10
30
35
Figure 2: Observed PCD length differentials (in words)
SYNTACTIC & SEMANTIC DOMAIN MINIMIZATION
Figure 1: PCD length difference resulting from different orders
CONCLUSIONS & OUTLOOK
-15
-10
-5
 It is true that PCD minimization is the most general
principle, which is reflected in a much greater
coverage (78% of the data versus 30% coverage of
semantic constraint)
 Yet it is the LDD minimization which is the strongest
principle, as it has a much greater effect size
(Regression coefficientSEM = 0.67 versus regression
coefficientSYN = 0.39)
The stronger effect size of LDD shows that speakers are
more reluctant to separate semantically dependent
combinations of verb+preposition, than to produce long
syntactic dependencies.
We are currently exploring if this predominance of
semantic minimization is a general one, in extending the
present study to other order alternations which are
influenced by both principles, e.g. particle placement.
Values > 0 mean
actual order is more optimal
0
5
10
15
20
25
Figure 3: Observed LDD length differentials (in words)
FURTHER ISSUES
STATISTICAL ANALYSIS:
Binomial logistic regression w/o intercept (Levy, in progress)
 model evaluates distributional information
 & tries to predict the ACTUAL PP-order as a function of
syntactic and semantic domain minimization
Verb specific behavior
(Do effects vary across different verbs?)
Regression Coefficients:
Estimate Std. Error z-value Pr(>|z|)
semantic minimization 0.65
0.06
11.32 <2e-16 ***
syntactic minimization 0.39
0.03
11.39 <2e-16 ***
v. John accounted [ for this fact ] [ in his book]
SUBJ Vindep
PPdep
PPindep
Model predictions (fitted values)
vi. John accounted [ in his book] [ for this fact ]
SUBJ Vindep
PPindep
PPdep
Pro-verb entailment test
Does [X V PP] entail [X Pro-V PP]*, then assign Pindep.
If not, assign PPdep.
_________________________________________________
*Pro-V sentences: X did something PP; X was PP; something
happened PP; something was done (by X) PP.
Model predictions from
syntactic
minimization
1.0
1.0
200
400
600
800
0.8
0.6
0.2
0.0
0.0
0
Individual verbs are effected differently:
Some words (e.g. be) are strongly affected by the
semantic constraint but not by the syntactic one.
Others (e.g. arrive) show a reverse image of that
behavior: these are strongly affected by the syntactic
constraint but not by the semantic one.
Still others (e.g. come) are equally affected by both
constraints.
0.4
jitter(sample(fitted(sem.model), 1060), amount = 0.03)
0.6
0.2
0.4
jitter(sample(fitted(syn.model), 1060), amount = 0.03)
0.6
0.4
0.2
most general and the strongest is Early Immediate
Constituents (~PCD), which defines a preference for minimal
domains of phrase structure recognition (and production). A
second factor is lexical dependency, […] “
(Hawkins 2000:257)
0.0
“The
0.8
0.8
Figure 3: Mean fitted values for all verbs w/ N ≥ 10
jitter(sample(fitted(model), 1060), amount = 0.03)
Research Question:
What are the relative strengths of syntactic (PCD) &
semantic (LDD) domain minimization?
Model predictions from
semantic
minimization
1.0
Overall model
predictions
1000
0
200
400
Index
600
800
1000
0
200
400
Index
600
800
WORKS CITED
1000
Index
Removing all cases where no predictions are being made
1.0
1.0
jitter(sample(fitted(sem.model)[fitted(sem.model) != 0.5], 406),
amount = 0.03)
0.4
0.6
0.8
jitter(sample(fitted(syn.model)[fitted(syn.model) != 0.5], 977),
amount = 0.03)
0.4
0.6
0.8
mean fitted = 0.62
0.2
0.2
0.8
0.2
0.4
0.6
mean fitted = 0.70
0
200
400
600
Index
800
1000
0
200
400
600
Index
800
406 fitted values ≠ 0.5
0.0
977 fitted values ≠ 0.5
0.0
1060 fitted values ≠ 0.5
0.0
CODING:
o Annotation of each example with respect to syntactic and
semantic properties:
o Syntax: Assess length of actual PCD & alternative PCD
o Semantics: Is there a semantic dependency?
o If so, assess length of actual & alternative LDD
mean fitted = 0.81
jitter(sample(fitted(model)[fitted(model) != 0.5], 1060), amount = 0.03)
CORPUS SAMPLE:
o Extraction of all ―V PP PP‖-sequences from ICE-GB
o  manual weeding out of false hits
o N FINAL DATA SET = 1,256
1.0
DATA
1000
0
100
200
Index
300
400
Hawkins, John. 1994. A performance theory of order
and constituency. Cambridge: CUP.
-- 2000. The relative order of prepositional phrases in
English: Going beyond manner-place-time. Language
Variation and Change 11: 231-266.
-- 2004. Efficiency and Complexity in Grammars.
Oxford: OUP.
Gibson, Edward. 1998. Linguistic complexity: Locality
and syntactic dependencies. Cognition 68:1-76.
-- 2000. The dependency locality theory: a distancebased theory of linguistic complexity. In A.
Marantz, Y. Miyashita, & W. O’Neil (Eds.), Image,
language, brain (pp. 95–126). Cambridge, MA:
MIT Press.
Levy, Roger. in progress. Probabilistic Models in the
Study of Language. Cambridge, MA: MIT Press.