CAS LX 522 Syntax I

CAS LX 522
Syntax I
Some things we know
n
Is this English?
Pat the book lifted.
Pat lifted the book.
n Lifted Pat the book.
n Pat book the lifted.
n
n
Week 1. Introduction
n
It’s surprisingly complicated
It’s surprisingly complicated
1)
2)
Tony threw out the couch.
Tony threw the couch out.
F Prepositions can go on
either side of the object?
3)
4)
Tony stormed out the door.
* Tony stormed the door out.
6)
What did Mary say Pat bought?
What did Mary say that Pat bought?
F
Ok, that is optional?
7)
Who did Mary say bought coffee?
*Who did Mary say that bought coffee?
5)
8)
It’s surprisingly complicated
9)
10)
11)
12)
13)
14)
15)
16)
Bill thinks Mary is a genius.
Her mother thinks Mary is a genius.
She thinks Mary is a genius.
I asked Mary to buy coffee.
What did you ask Mary to buy?
Who did you say bought coffee?
I borrowed the book Bill bought in DC.
*Who did you borrow the book bought in DC?
Why?
How do people know this?
n
All native speakers of English know this.
n
Little kids weren’t told these rules (or
punished for violating them)…
“You can’t question a subject in a
complement embedded with that”
n “You can’t use a proper name as an object if
the subject is co-referential.”
n
1
Two questions
n
What do people know about their
language?
n
n
Systematicity
n
What people eventually end up with is a
system with which they can produce (and rate)
sentences. A grammar.
n
18)
Even if you’ve never heard these before, you
know which one is “English” and which one
isn’t:
Eight very lazy elephants drank brandy.
Eight elephants very lazy brandy drank.
n
Kids say wugs.
Including things we know “unconsciously”
How do people come to know it?
n
Tricky question for things that we don’t know
we know.
Positive and negative evidence
n
n
n
Adults know if a given sentence S is
grammatical or ungrammatical. This is
part of the knowledge kids gain through
language acquisition.
Kids hear grammatical sentences
(positive evidence)
Kids are not generally told which
sentences are ungrammatical
(no negative evidence)
17)
Positive and negative evidence
n
One of the striking things about child language
is how few errors they actually make.
For negative feedback to work, the kids have to
make the errors (so that it can get the negative
response).
n
But they don’t make the errors.
n
n
But languages differ
The “Language instinct”
n
n
n
n
The linguistic capacity is part of being
human.
Like having two arms, ten fingers, a vision
system, humans have a language faculty.
The language faculty (tightly) constrains
what kinds of languages a child can learn.
=“Universal Grammar” (UG).
(Kids do make errors, but not of the kind that one
might expect if they were just trying to extract
patterns from the language data they hear)
n
English, French: Subject Verb Object (SVO)
n
n
n
Japanese, Korean: Subject Object Verb (SOV)
n
n
n
John ate an apple.
Pierre a mangé une pomme.
Taroo-wa ringo-o tabeta.
Chelswu-ka sakwa-lul mekessta.
Irish, Arabic (VSO), Malagasy (VOS), …
2
But languages differ
n
English: Adverbs before verbs
n
Mary quickly eats an apple.
n
*Mary eats quickly an apple.
n (also:
n
Parameters
n
n
Mary ate an apple quickly)
n
French: Adverbs after verbs
n
n
Geneviève mange rapidement une pomme.
*Geneviève rapidement mange une pomme.
n
n
We can categorize languages in terms of their
word order: SVO, SOV, VSO.
This is a parameter by which languages differ.
The dominant formal theory of first language
acquisition holds that children have access to a set
of parameters by which languages can differ;
acquisition is the process of setting those
parameters.
What are the parameters?
What are the “universal” principles of grammar?
The enterprise
n
The data we will primarily be concerned with are
native speaker intuitions.
n
Native speakers, faced with a sentence S, know
whether the sentence S is part of their language
or isn’t. These intuitions are highly systematic.
n
We want to uncover the system (which is
unconscious knowledge) behind the intuitions of
native speakers—their knowledge of language.
I-language
n
n
We are studying the system behind one person’s
pattern of intuitions.
Speakers growing up in the same community have
very similar knowledge, but language is an
individual thing (“I-language”).
n
n
I-languages of a community is can be characterized,
but it is external to the speaker (“E-language”), not
any one person’s knowledge, a generalization over
many people’s I-languages.
n
Competence
n
We are also concerned with what a person
knows. What characterizes a person’s
language competence. We are in general not
concerned here with how a person ends
up using this knowledge (performance).
n
You still have your language competence
when you are sleeping, in the absence of any
performance. Being drunk doesn’t make you
think “bought some John coffee” is English,
though perhaps one might say it.
One doesn’t need to ask the Académie française whether
Geneviève rapidement mange une pomme is a sentence of
French. One knows.
For example, Parisian French.
Prescriptive rules
n
Another thing we need to be cautious of
are prescriptive rules. Often prescriptive
rules of “good grammar” turn out to be
impositions on our native grammar which
run counter to our native competence.
n
After all, why did they need to be rules in
the first place?
3
Prescriptive rules
n
n
n
n
n
n
Prepositions are things you don’t end a
sentence with.
It is important to religiously avoid splitting
infinitives.
Remember: Capitalize the first word after a
colon.
Don’t be so immodest as to say I and John left;
say John and I left instead.
Impact is not a verb.
The book which you just bought is offensive.
Prescriptive rules
n
When making grammaticality judgments (or
when asking others to make grammaticality
judgments), we must do our best to factor out
prescriptive rules (learned explicitly, e.g., in
school).
n
Judgments
We’re not interested in studying the prescriptive rules;
we could just look them up, and it isn’t likely to tell
us anything deep about the makeup of the human
mind. They’re really just a “secret handshake,”
allowing educated people to detect one another.
Syntax as science
n
Another complicating fact is that a sentence can
be bad for any number of reasons, only some of
which we are interested in at a given point.
n
Syntax, as practiced here, is a scientific enterprise.
This means, in particular, approaching syntax
using the scientific method.
n
*Student the meditated happily.
The pebble meditated happily.
A Sun rose in the East.
John wondered who to go with.
n
Step 1: Gather observations (data)
Step 2: Make generalizations
Step 3: Form hypotheses
Step 4: Test predictions made by these
hypotheses, returning to step 1.
n
n
n
A simple introductory example
1)
2)
3)
4)
n
n
n
n
A simple introductory example
Bill kissed himself.
Bill kissed herself.
Sally kissed himself.
Sally kissed herself.
1)
Try these out. Which ones sound good,
which ones don’t?
n
2)
3)
4)
Bill kissed himself.
*Bill kissed herself.
*Sally kissed himself.
Sally kissed herself.
Hypothesis: An anaphor must have an
antecedent which agrees with it in
gender.
4
Hypothesis: An anaphor must have an
antecedent which agrees in gender.
n
Hypothesis: An anaphor must have an
antecedent which agrees in gender.
Let’s test the hypothesis against more data.
5)
5)
6)
7)
8)
9)
10)
11)
The robot saw itself in the mirror.
*John and Bill saw himself in the mirror.
*The boys saw himself in the mirror.
*Mary and Jane saw herself in the mirror.
John and Bill saw themselves in the mirror.
Mary and Jane saw themselves in the mirror.
The boys saw themselves in the mirror.
6)
7)
8)
9)
10)
11)
n
Our hypothesis only explains (5). What is the
generalization?
Hypothesis: An anaphor must agree in
gender and number with its antecedent
12)
13)
14)
15)
16)
17)
18)
n
The executives gave themselves a raise.
*I gave himself a cookie.
I gave myself a cookie.
You gave myself a cookie.
*You gave herself a cookie.
*You gave himself a cookie.
You gave yourself a cookie.
The robot saw itself in the mirror.
*John and Bill saw himself in the mirror.
*The boys saw himself in the mirror.
*Mary and Jane saw herself in the mirror.
John and Bill saw themselves in the mirror.
Mary and Jane saw themselves in the mirror.
The boys saw themselves in the mirror.
Person
n
n
n
n
n
I is “first person singular”
You is “second person singular” (you left) or “second
person plural” (you left = y’all left)
He, She is “third person”
We is “first person plural”
They is “third person plural”
Again, our hypothesis doesn’t successfully
predict which of these are grammatical and
which aren’t. What’s the generalization?
Anaphors seem to agree with person. Myself for
first person singular, ourselves for first person
plural, yourself for second person, himself, herself,
or itself for third person singular, themselves for
third person plural.
Hypothesis about anaphors
Levels of adequacy
n
An anaphor must agree in gender, number,
and person with its antecedent.
n
This is the hypothesis we will end with,
although there will be more to do with
anaphors.
n
Incidentally, gender, number, and person
very often go together.
n
n
n
If our hypotheses can predict the existence
of the grammatical sentences in a corpus
(a set of grammatical sentences), it is
observationally adequate.
If our hypotheses can predict the nativespeaker intuitions about which sentences
are grammatical and which are
ungrammatical, it is descriptively
adequate.
5
Levels of adequacy
n
n
If we can take a descriptively adequate set of
hypotheses one step further and account not
only for the native speaker judgments but also
for how children come to have these judgments,
our hypotheses are explanatorily adequate.
Categories
n
Words can be grouped into categories by part of
speech like noun, verb, adjective, preposition, …
n
Parts of speech are determined distributionally
(traditional “semantic” definitions don’t work)
It’s this last level that we are hoping to achieve.
n
n
n
n
Basic principles
Parameters of variation
How to set the parameters from child’s input
n
Constituents
n
The words that make up a sentence like…
n
n
n
Constituents
n
Functioning as a unit…
The students did their syntax assignment.
The students did the crossword puzzle.
n John did the crossword puzzle.
n The crossword puzzle is what John did.
n *Crossword puzzle is what John did the.
n John likes the crossword puzzle.
n John likes the jigsaw puzzle.
n John likes the theater.
n
The students did their syntax assignment.
n
…are grouped together into component
parts, constituents, which function together
as a unit.
Among them, [the students], the do-ers,
and [their syntax assignment], the done.
Phrases
n
n
A phrase is a constituent that has a central core
word (called the head of the phrase); other
words in the phrase generally relate to (or
modify the meaning of) the head.
The category of the head determines the
category of the phrase.
n
The happy students is a noun phrase, headed by the
noun students. Happy modifies students, the
specifies which students. Ran swiftly is a verb
phrase, swiftly modifies ran.
The yinkish dripner blorked quastofically.
Yinkish is an adjective, dripner is a noun, to blork is a
verb, quastofically is an adverb.
Sentences
n
Complete sentences need to have a subject
and a verb.
John left.
*John.
n *Left.
n The happy students left speedily.
n
n
n
So sentences are made of noun phrases
and verb phrases.
6
Sentences
n
n
n
John left (speedily).
The happy students left (speedily).
n
We could say a sentence is either:
n
n
n
n
n
Trees
a noun and a verb
a noun and a verb phrase
a noun phrase and a verb
a noun phrase and a verb phrase
We might start off by drawing the structure of a
sentence like this, which means: “John left is a
Sentence composed of a Noun Phrase
(composed of John) and a Verb Phrase
(composed of left). Note the heads.
S
Or we could say:
n
n
a sentence is a noun phrase and a verb phrase
a phrase always has a head, and sometimes that’s all.
Trees
n
NP
VP
N
V
John
left
Of the past and the future
In fact, the trees we will draw will end up looking more
like this (well, sort of), where the “phrase is implied”.
We know that John here is a noun phrase because it is
headed by a noun, John—it just happens not to have
anything else in the phrase. Generally, except for heads
that happen to be also phrases, we’ll see “NP”.
n
Serious scientific study of sentence
structure of this kind generally began in
the 50’s, driven to a great extent by the
work of Noam Chomsky.
n
It’s now half a century later, and we have
learned a lot about how syntax works.
S
N
V
John
left
Of the past and the future
n
n
Progress was incremental, and often
required revising our assumptions about
how sentences are really put together.
Data was examined, generalizations were
arrived at, hypotheses were formed,
predictions were tested—and often led to
revisions of the generalizations and the
hypotheses, and so forth.
Of the past and the future
n
Two goals of the class:
n
n
n
Think like a syntactician.
Be able to read (relatively recent) books,
articles, etc. about syntax.
It’s not really enough to just know what
people concluded, we need to understand
why they concluded what they did.
7
T
Some milestones
n
Until about the mid-70’s, phrase structure rules.
S Æ NP VP
n
n
n
VP Æ V (NP)
Mid-70’s, X-Bar Theory (a generalization about
what are possible PSRs).
In the 80’s, a fairly significant shift to Government
and Binding Theory (viewing grammar a little less
like a computer program). Very productive.
In the 90’s, another shift to the Minimalist
Program (an attempt at simplification, as well as
a change in philosophy).
n
T
T
T
T
T
T
T
T
T
We’ll be somewhere between MP and GB.
8