Context-specific erosion

-1-
Context-specific erosion of the tone contrast in the dialect of Roermond
Rachel Fournier1
and
Carlos Gussenhoven1,2
1
2
Radboud University Nijmegen
Queen Mary University of London
Corresponding author.
Carlos Gussenhoven
Email:
[email protected]
23 March 2010
Revised 1 February 2011
Abstract
The tone and intonation system of Roermond Limburgish distinguishes its two lexical tone
categories through pitch contrasts that are specific to each of six contexts. A perception
experiment with 22 listeners revealed that tone recognition in one of these six nonneutralizing contexts, IP-medial accented syllables spoken in interrogative sentences, was
seriously impaired in stimuli spoken by one of two speakers. Follow up production and
perception experiments with 17 speakers showed that in this context, the realization of one of
the two tones, Accent 2, is changing from its traditional form, a low level pitch in the
accented syllable followed by a rise-plus-fall after it, to a form in which the pitch begins to
rise inside the accented syllable. Since the form for Accent 1 in this context is a rise in the
accented syllable followed by a fall, the tendency to move the beginning of the pitch rise for
Accent 2 to an earlier point in time threatens the perceivability of the tone contrast in that
context. The explanation for the change is sought in the low functional load, combined with
the articulatory effort associated with sharp rises, required whenever an Accent 2 syllable
appears close to the IP-end in an interrogative contour. This is the first time empirical data
with contemporary speakers have shown that the Limburgish tone contrast is recessive.
1
-2-
1 Introduction
Dialects in the former Rheinland in Germany (the northern half of Rhineland-Palatinate and
the southern half of North Rhine-Westphalia) and the the larger part of the Belgian and Dutch
provinces of Limburg have a lexical tone contrast that is comparable to the Scandinavian
contrast between Accent 1 and Accent 2. Words have a tone contrast in the stressed syllable,
commonly referred to as TA1 versus TA2, where TA stands for German Tonakzent, or as
Accent 1 versus Accent 2. Unlike the Scandinavian tone contrast, this Central Franconian tone
contrast also exists in word-final syllables (Newman 1980). In addition to varying numbers of
minial pairs of words, every dialect investigated to date has between one and fifteen
monosyllabic nouns which only differ in that they have Accent 2 in the singular and Accent 1
in the plural. The dialects vary substantially in the details of their tonal grammars. In
particular, the underlying value of the lexical tone varies across the dialects, while the dialects
also vary in the number of different intonation contours they have.
This article reports an investigation into the perception of the tone contrast by native speakers
of one dialect. In order to appreciate the issues and findings, we first present the facts of this
dialect, Roermond Dutch, based on Gussenhoven (2000) and Fournier, Verhoeven, Swerts &
Gussenhoven (2006). Table I gives schematic representations of the contrasts in four sentence
prosodic contexts.
Table I. Schematic pitch contours of two intonation contours in accented and unaccented final and
nonfinal positions in the intonational phrase for Accent 1 (solid lines) and Accent 2 (dotted lines). The
shaded boxes represent the syllable in the position concerned. The parts of of the contours following
the boxes are the contours that occur when the final syllable in the IP has Accent 1.
[+accented,+final]
[+accented,−final]
[−accented,+final]
[−accented,−final]
Declarative
Interrogative
2
-3Just like all other Central Franconian tonal dialects that have been analysed in the
autosegmental-metrical framework (e.g. Gussenhoven & van der Vliet 1999, Peters 2006,
2007a,b, 2008), Roermond Dutch has a privative tone contrast. Specifically, Accent 1 has no
lexical tone, either underlyingly or on the surface, while Roermond Dutch Accent 2 is a Htone on the second sonorant mora of the syllable with main stress. This distribution implies
that words whose main stressed syllables have one sonorant mora, like [kɑt] ‘cat’ or [ɑ.ŋəl]
‘thorn’, cannot have the tone contrast. Examples of minimal pairs are [haasI] ‘hare’ vs. [haasII]
‘glove’, [pɪnI] ‘personal identity number’ vs. [pɪnII] ‘skewer’, and [ʃtɛinI] ‘stones’ vs. [ʃtɛinII]
‘stone’. While all words with two sonorant moras in their stressed syllable have either Accent
1 or Accent 2, the number of minimal pairs is not large. There are some ten minimal pairs of
monosyllabic nouns whose singular form has Accent 2 and whose plural form have Accent 1
(cf. the forms for ‘stone’).
Assuming utterances with one sentence accent, the intonational tones are provided by the
declarative intonation, H* Li, or the interrogative intonation, L* HiLi. In addition to a phraseinitial boundary Li, these are the only tones in the representation if all the words have Accent
1. For any word with Accent 2, the intonational tones combine with the lexical H on the
second mora of its stressed syllable. Unexceptionally, intonational tones are sequenced before
the lexical tone in any syllable in which they appear together. The accented forms are shown
in (1), where the singular form for ‘stone’ in (1a) has the lexical H of Accent 2 after the
intonational pitch accent H* and the boundary tone Li, while the plural in (1b) has Accent 1,
and thus only has intonational tones. The corresponding unaccented forms are given in (2a,b).
In (2a), the intonational pitch accent H* is on zag ‘says’, leaving the postfocal final syllable
for the intonational boundary tone and the lexical tone. In (2b), the intonational tones appear
without the lexical H on the Accetn-1 word ‘stones’.
(1) a.
b.
Hae zag ‘sjtein’
| |\
Li
H*LiH
‘He says “stone”’
Hae zag ‘sjtein’
||
Li
H*Li
‘He says “stones”’
3
-4(2) a.
b.
Hae zag ‘sjtein’
|
||
L
H* Li H
‘He SAYS “stone”’
Hae zag ‘sjtein’
|
|
Li H*
Li
‘He SAYS “stones”’
In the accented IP-medial condition, the lexical H again appears with H*, as in (3a). As a
result, the pitch fall occurs after the accented syllable sjtein ‘stone’, while in (3b) the pitch
falls inside the syllable sjtein ‘stones’. The sharp fall in (3b) arises from the association of
final Li with the second mora of the accented syllable. Boundary tones that display this type
of behaviour have been termed ‘phrase accents’ by Grice, Ladd & Arvaniti (2000).
(3) a.
b.
Ich höb ‘sjtein’ gezag
| |
Li
H*H
Li
‘I said “stone”’
Ich höb ‘sjtein’ gezag
|
Li
H*
Li
‘I said “stones”’
We now turn to the interrogatives contours, and begin with the IP-medial context. The
constraint NORISE (Poletto 1998), given as (4), bans phonological LH-contours inside
syllables (cf. Yip 2000: 80, 190, 280). NORISE has two effects in the Roermond dialect. First,
when L* and lexical H come together on the same syllable, as in sjtein ‘stone’ in (5a), the
lexical H assimilates to L, creating the low level pitch in accented nonfinal syllables with
Accent 2 (cf. Table I, second row, third column, dotted contour). Second, when L* is alone in
a syllable (Accent 1), a following Hi fails to associate with the free mora, causing the rise
from L* to Hi to typically extend beyond the accented syllable, as in (5b). The contrast
between (5a) and (5b) is therefore phonetically expressed by low level pitch vs a low-to-midrise in the accented syllable.1
(4) NORISE:
*
m
|
L
σ
m
|
H
4
-5(5) a.
b.
Höbse ‘sjtein’ gezag?
||
|
(Li) L*H (HiLi) → Li
L*L
HiLi
‘Did you say “stone”?’
Höbse ‘sjtein’ gezag?
|
|
Li
L*
HiLi
‘Did you say “stones”?’
The IP-final accented contours are given in (6), where the rise-fall of Accent 1 in (6b)
contrasts with a rise for Accent 2 in (6a). The underlying form of (6a) is L*HiLiH, which is
changed to L*LiLiH (effectively L*LiH) on account of NORISE (4). Lastly, the unaccented IPfinal syllables with interrogative intonation are again distinguished by the final H of Accent 2,
realized as a fall-rise, due to HiLiH, as shown in (7a). It contrasts with the plain fall due to
HiLi for Accent 1 in (7b).
(6) a.
b.
Zag hae ‘sjtein’?
| |\
Li
L*Li H
‘Did he say “stone”?’
(7) a.
Zag hae ‘sjtein’?
|
| |\
Li L*
HiLiH
‘Did he SAY “stone”?’
Zag hae ‘sjtein’?
|
Li
L* HiLi
‘Did he say “stones”?’
b.
Zag hae ‘sjtein’?
|
||
Li L*
HiLi
‘Did he SAY “stones”?’
The fourth data column of Table I reflects the fact that no tone contrast has been reported for
IP-medial unaccented syllables.
1.3 Indications of a recessive tone contrast
There are a number of indications that the present-day geographical distribution of the lexical
tone contrast is smaller than it was at the peak of its diffusion. For one thing, segmental
reinterpretations of the contrast can be found in now toneless dialects spoken on the periphery,
like Letzebuergësch (Gillis 1999) and Weert (Netherlands) (Heijmans 2002), or in toneless
enclaves near the periphery of the tonal area (Cajot 2006, Schouten & Peeters 1996). Also,
dialects with homophones for tonal minimal pairs are spoken just outside the tonal area east of
5
-6Venlo, which suggests that these may have been tonal at an earlier stage. Indications of a
receding pattern may also be seen in the fact that dialects may restrict the tonal opposition to
syllables with specific segmental conditions, as is the case in Maastricht (Gussenhoven &
Aarts 1999), or to particular sentence prosodic contexts, as in the dialects of Helden,
Roermond and Venlo (Gussenhoven 2000, Gussenhoven & van den Beuken ms). Fournier,
Verhoeven, Swerts & Gussenhoven (2006) report that the lexical tones of the dialect of
Roermond are identified virtually as reliably as the vowels of the dialect, provided the tones
occur in syllables that have an intonational pitch accent (henceforth ‘accented’ syllables) or
occur in the final syllable of the intonational phrase (IP), regardless of accentuation. In IPmedial unaccented syllables, whether pre-nuclear or post-nuclear, the contrast was effectively
neutralized.
So far, both the presence of the tone contrast in a dialect and its neutralization in certain
contexts has appeared categorical (cf. also Peters 2006, 2007a,b, 2008, Gussenhoven & Peters
2005). Nevertheless, if phonological features are subject to erosion, it is to be expected that
variable realizations should occur, either within speakers or across speakers from the same
speech community, reflecting a stage in which the contrast is on its way out.
1.4 Aims of the investigation
The goal of our investigation was to see to what extent the dialect of Roermond maintains it
contrast in each of the contexts specified in section 1.2 above. The perception experiment of
Fournier et al. (2006) was restricted to stimuli with declarative intonation. In these data, the
contrast appeared to be robustly present in accented syllables (both IP-final and IP-medial)
and IP-final unaccented syllables, while being effectively neutralized in IP internal
unaccented syllables, both before and after the focal pitch accent. However, we have no
information on the extent to which the contrast is maintained in interrogative intonation
contours. To this end, we ran a forced-choice identification experiment with auditory stimuli
produced by two speakers which included both interrogative and declarative contours. The
structure of the materials as well as the stimuli for the declarative intonation were identical to
those in Fournier et al. (2006), but a different group of listeners participated in the experiment.
Our experiment revealed that the tone contrast in IP-medial nuclear syllables in interrogative
intonation contours (cf. (5a,b)) was poorly discriminated. In this context, the contrast amounts
to a timing difference of a pitch rise. Separating the scores out on the basis of the speaker of
6
-7the stimuli further revealed that the poor scores were obtained for the stimuli produced by one
of our two speakers. Since this suggested that in this context, the contrast was variably lost or
that its salience was gradually reduced, we decided to collect data for the contrast in this
position from a group of 17 speakers below the age of 26. The results confirmed a picture of
variable tone loss, whereby the low pitch of the accented syllable with Accent 2 (cf. (5a)) is
not clearly distinct from the rising pitch movement in equivalent syllables with Acccent 1 (cf.
6b)), because some speakers begin the pitch rise for Accent 2, which in the conservative
speakers occurs after the accented syllable, inside the end of the accented syllable.
An unexpected result was a significant recognition score for the stimuli produced by one of
the two speakers in pre-nuclear position in declarative intonation contours. Fournier et al.
(2006) found no effect of lexical tone category for the same stimuli in this context. Since the
judges in the experiment reported here were older than those reported in the earlier
experiment, we reran part of the identification experiment with a group of younger subjects.
Their performance was comparable to that of the older group in the full experiment, which
suggests that in pre-nuclear declarative position cues to the contrast were present that were
missed by the younger group reported in Fournier et al. (2006). The better perfomance of our
subjects can be atributed to the test conditions, which allowed subjects to pay more attention
to the stimuli.
2 Experiment I
2.1 Experiment I: Materials and recordings
For the forced-choice identification task, we drew up a corpus of sentences in which the
lexical tone (Accent 1 vs Accent 2), the intonation contour (declarative vs interrogative) and
five sentence prosodic positions were varied orthogonally. The five positions corresponded to
the four columns of Table I, plus a pre-nuclear condition. We used minimal pairs for the
number contrast, as opposed to lexcially different members of minimal pairs, in order to
facilitate the formulation of a uniform task and score sheet. Plural and singular forms of the
four nouns bein (‘leg’), derm (‘intestine’), knien (‘rabbit’) and sjtein (‘stone’) were embedded
in the ten sentences each, giving 80 sentences in all, as illustrated for ‘rabbit(s)’ in Table II.
7
-8Table II: Carrier sentences used in Experiment I, exemplified with the target word knienII,I. Sentences
or sentence parts in square brackets were used during the recordings in order to elicit a correct focus
assignment, but were not presented to the listeners.
CONTEXT
[+accented,
+final]
[+accented,
–final]
[–accented,
+final]
[–accented,
–final],
prenucl.
[–accented,
–final],
postnucl.
DECLARATIVE INTONATION
INTERROGATIVE INTONATION
Zaes-se geweun "KNIEN"? [Nae, ich zae
In ’t Remunjs zaes-se geweun “KNIEN”.
"HAAS".]
In Roermond Dutch, you just say
Do you just say “RABBIT(S)”? No, I say
"RABBIT(S)".
"HARE".
[Wat höbs-se geheurd?] Ich höb
Höbs-se "KNIEN" geheurd? [Nae,
“KNIEN” geheurd.
"KIENDJ".]
[What did you hear?] I heard
Did you hear "RABBIT(S)? No, "CHILD".
“RABBIT(S)”.
[Eers ZAG hae “knien”,] toen
SJREEFDE hae "knien"? [Nae, hae ZAG
"knien".]
SJREEFDE hae “knien”.
[First he SAID “rabbit”,] then he WROTE Did he WRITE "rabbit(s)"? [No, he SAID
“rabbit(s)”.
"rabbit(s)".]
[Höbs-se “knien” GEZAG?] Nae, ich höb Höbs-se “knien” GEZÓNGEN? [Nae,
allein GEZAG.]
“knien” GEZÓNGE.
[Did you SAY “rabbit(s)”?] No, I SUNG Did you SING “rabbit(s)”? [No, I just SAID
“rabbit(s)”.
it.]
[Heurt allein WILSKE “knien” good?]
Heurt allein ANNIE “knien” good? [ Nae,
Nae, ouch MIEKE heurt “knien” good.
ouch MIEKE.]
[Does only WILSKE hear “rabbit(s)”
[Does only ANNIE hear “rabbit(s)” well?]
well?] No, also MIEKE hears “rabbit(s)”
No, also MIEKE.
well.
The recordings were made in a professional studio in two sessions. In the first session, six
speakers recorded a corpus with six minimal pairs in the declarative intonation. From these
recordings, we selected the recordings by two speakers who we judged to have good voices
and whose performance appeared to be both fluent and accurate and who were available for a
second session. One was male (PI) and one female (NG), who were between 40 and 60 years
old. In the second session, speakers PI and NG recorded the forty interrogative versions of
these materials. In both recording sessions, sentences were presented to speakers in four
blocks, in which sentences appeared in randomized orders. Accented words appeared in
capital letters; before each sentence, the grammatical number of the target word was specified
in brackets (e.g. eine knien ‘one rabbit’ or twee knien ‘two rabbits’). A native speaker of
Roermond Dutch, who was familiar with the materials and had earlier written an MA-thesis
on a related topic, was present during the recordings. She asked the speakers to repeat any
sentences which she judged to be mispronounced in any way at the end of each block. These
mispronunciations included minor disfluencies and misarticulations as well as unintended
lexcial tones and intonation contours. When inspecting the recordings after the second session,
we noticed that speaker PI had frequently pronounced target words in pre-nuclear position as
8
-9separate IPs, with accented realizations of these target words. This necessitated a third
recording session with PI, which was devoted to the subset of sentences with target words in
pre-nuclear position, in which he practised saying these sentences as single IPs before
recording them again. We thus had 80 stimuli that were judged both natural and correct by the
researchers and the native speaker. We used these stimuli in the perception experiment
reported in Fournier et al. (2006), with the exclusion of the interrogative stimuli, but with an
additional minimal pair based on the plural and singular forms for pin ‘skewer’. We excluded
the sentences with pin in the experiment reported here, since a number of subjects were not
familiar with the plural form pin I.
2.2 Experiment I: Procedure
Twenty-two native speakers of Roermond participated in the experiment, nine of whom were
male. Their ages ranged from 42 and 85 years, with a mean of 61. All subjects had spent the
greater part of their lives in Roermond and spoke the dialect at home and in social encounters
outside the home, except at work, where Standard Dutch was more often used. The
experiment was carried out in a quiet room where subjects listened to the randomized 80
stimuli through headphones in groups of five to eight. The test took 20 minutes. Subjects were
asked to judge whether the target word in each stimulus was singular or plural. On their
answer sheet, the target word for each stimulus was printed in its singular form together with
two boxes, labelled enkelvoud ‘singular’ and meervoud ‘plural’, always in that order, one of
which they were asked to tick on each trial.
2.3 Experiment I: Results
Since our dichotomous data neither suffered from low numbers of cases (cf. Myers, DiCecco,
White & Borden 1982) nor clustered at either the high or low ends of the range (cf.
D’Agostino 1971), we decided to run a repeated measures ANOVA on the raw scores as
pooled over the four nouns. We used the within-subject factors INTONATION (2 levels);
NUMBER (2 levels); CONTEXT (5 levels) and SPEAKER (2 levels). The Huyn-Feldt correction
was applied. Table III lists effects and interactions of effects that reached a 1% significance
level, plus effect sizes.
9
- 10 Table III. Significant main effects and interactions of effects on recognition scores from Experiment I.
Bold-faced rows represent the three strongest (interaction) effects.
Factor
intonation
number
context
intonation*context
number*context
intonation*number*context
intonation*speaker
context*speaker
intonation*context*speaker
number*context*speaker
df
1,21
1,21
4,84
4,84
2.5, 51.5
3.6, 75.3
1,21
3.2, 66.6
3.8, 80.4
3.3, 69.9
F
18,247
25,478
277,21
21,594
59,551
22,206
57,334
11,676
8,269
14,845
sig.
<.001
<.001
<.001
<.001
<.001
<.001
<.001
<.001
<.001
<.001
part. η2
0,465
0,548
0,93
0,507
0,739
0,514
0,732
0,357
0,283
0,414
The strongest factor by far affecting the recognition scores is CONTEXT, with an effect size
of .93. Pairwise comparisons between the five contexts revealed that the lexical tones in the
[+accented,+final] position are recognized significantly better than all others (p<.001), with
more than 98% correct answers. Also, scores are significantly lower in the [–accented, –final]
contexts than in the [+accented,–final] and [–accented,+final] contexts. CONTEXT
significantly interacts with all other effects, especially with NUMBER (η2 = .739). This
interaction reflects the fact that singular forms attract better scores than plural ones in the [–
accented, –final] contexts, in both pre-nuclear and post-nuclear positions. Figure 3 gives a
summary of the scores obtained for declarative and interrogative sentences, in which the
difference between singular and plural forms is apparent in both intonations. This difference
is larger in interrogative than in declarative sentences, as shown by the interaction
INTONATION*NUMBER*CONTEXT.
Interrogative sentences
100
pl
50
sg
0
a
b
c
Context
d1
d2
Recogn. rate
Recogn. rate
Declarative sentences
100
pl
50
sg
0
a
b
c
d1
d2
Context
Figure 1. Recognition rates for singular and plural forms of nouns in declarative (left panel) and
interrogative sentences (right panel) in Experiment I, pooled over both speakers, as a function of
sentence prosodic context (a = [+accented,+final]; b = [+accented,–final]; c = [–accented,+final]; d1 =
[–accented,–final], pre-nuclear; d2 = [–accented,–final], post-nuclear).
10
- 11 With a comparable effect size (η2 = .732), the interaction INTONATION*SPEAKER
highlights a considerable difference in recognition rates for the interrogative stimuli between
the speakers. As reflected in the three-way interaction INTONATION*SPEAKER*CONTEXT,
the differences concern specific contexts. First, compared with the 91% (84%) correct score
for NG's utterances, PI’s utterances elicited a considerably lower score of 66% (40%) in the
[+accented,–final] condition of the interrogative contour. Second, PI’s declarative utterances
in the [–accented,–final] position attracted a 77% correct score, as compared to the 56%
correct score for the equivalent utterances produced by NG. Figure 2 gives a breakdown of
the scores for the two speakers.
Recognition per speaker, intonation and context
100
80
decl NG
60
decl PI
interrog NG
40
interrog PI
20
0
a
b
c
d1
d2
Figure 2: Recognition rates from Experiment I per speaker, intonation contour and sentence prosodic
context.
Lastly, the interaction INTONATION*CONTEXT shows that, for both speakers, recognition
rates are higher for interrogative stimuli than for declarative stimuli in the [–accented,+final]
position (91% against 83%).
Broadly, these results were to be expected on the basis of the description of the dialect as
summarized in Table I as well as by the examples (1)-(3) and (5)-(7). However, there are also
unexpected findings, like the low recognition score in the IP-medial accented syllables of
sentences with interrogative intonations contours. Before attempting to explain these findings,
we present an acoustic analysis of the f0 contours of target words and segment durations of
our stimuli.2
11
- 12 2.3 Experiment I: Acoustic analysis
All stimulus utterances were segmented manually. We segmented the syllable rhymes and
vowels of the target words and ran a Praat script so as to extract rhyme and vowel durations as
well as the the maximum, minimum, beginning and ending f0 of the syllable rhymes of all
target words. Figure 3 shows the average f0 contours for statements (on the left) and questions
(on the right) in accented IP-final (panels (a) and (d)), accented IP-medial (panels (b) and (e))
and unaccented IP-final (panels (c) and ((f)) positions. In addition to beginning and end values,
we given f0 maxima and minima, on normalized time axes. The contours for the declarative
intonation (panels (a), (b) and (c)) correspond closely to the stylized contours in Table I and
(1), (2) and (3), which were based on Gussenhoven (2000) and Fournier et al. (2006). The
interrogative contours, too, are similar to the stylized contours in Table I and (4), (5) and (6),
with an exception for the Accent 2 contour (dotted line) in the [+accented,–final] position
(see panel (e)), where instead of a low flat contour a low-to-mid rise is observed, smaller than
that for Accent 1, but similar in shape.
12
- 13 Declarative intonation
Interrogative intonation
a. +accent/+final
a. +accent/+final
+focus/+final
d.
280
F0 (Hz)
F0 (Hz)
280
230
180
130
230
180
130
80
80
0
50
100
0
50
b. +accent/-final
e. +accent/-final
280
F0 (Hz)
F0 (Hz)
280
230
180
130
230
180
130
80
80
0
50
100
0
c. -accent/+final
50
100
f. -accent/+final
280
F0 (Hz)
280
F0 (Hz)
100
230
180
130
80
230
180
130
80
0
50
100
0
50
100
Figure 3: Average f0 contours of the rhymes of target words for the declarative (left) and the
interrogative (right) forms of [+accented] and [+final] realizations of Accent 1 (uninterrupted lines)
and Accent 2 (dotted lines) in the stimuli used in Experiment I, pooled over the four words and the
two speakers.
A breakdown by speaker of the data in panel (e) is given in Figure 4. PI’s data in panel (a) of
this figure reveal that, on average, a 42 Hz rise occurred on his Accent 2 contours. Although
the f0 excursion is clearly larger in Accent 1 (109 Hz), this weak rise must have led listeners to
perceive Accent 1 where Accent 2 was intended. The data for NG conform to the contrast
shown in Table I and (5a,b), but do show a slight rise of 27 Hz in the Accent 2 form of derm.
Tellingly, whereas there were virtually no recognition mistakes for NG’s Accent 2 versions of
knien, sjtein and bein, we observed a recognition rate of only 41% for derm.
13
- 14 -
Speaker PI
a
Speaker NG
b
Figure 4: Realizations of Accent 1 (bold lines) and Accent 2 (thin lines) in the [+accent, final]
context with interrogative intonation, for speaker PI (left) and speaker NG (right). Solid lines: knien;
dashed lines: derm; dotted lines: bein; dot-dashed lines: sjtein.
Before commenting on these data, we turn to the stimuli that led to a second unexpected result,
the high recognition rates in the pre-nuclear context with declarative intonation for speaker PI.
His plural forms had a recognition score of 70%, while 84% of his singular forms were
correctly identified, against 45% and 65% for the corresponding stimuli spoken by NG. As
observed in section 2.1, we used the same declarative stimuli as in the experiment reported by
Fournier et al. (2006), with the exception of the sentences with pin. In that experiment, the
scores for NG were comparable to those obtained in the experiment reported here, but only
56% of the plural forms and 60% of the singular forms in PI’s stimuli were recognized
correctly in the earlier experiment. Since the scores for pin ‘pin’ did not differ from those for
knien ‘rabbit’, derm ‘intestines’, bein ‘leg’ and sjtein ‘stone’, the difference must be attributed
to the judges, who were older in the present experiment. Apparently, there are cues in PI’s
utterances to which the older group were sensitive, but which failed to be picked up by the
younger group used by Fournier et al. (2006). A breakdown of PI’s f0 data for the two lexical
tone classes is given in Figure 5. This shows that Accent 1 has a weakly rising contour, but
that Accent 2 has a flat or slightly falling contour. Section 3 reprots the experiemtn that was
run in order to check the generality of speaker PI’s speech forms. In this section, we turn to
the second unexpected effect in Experiment I.
In order to see whether the null results for the younger listeners in Fournier et al. (2006) were
reproduceable, we ran a follow-up listening session with 17 younger speakers, with a mean
14
- 15 age of 21. They were presented with the stimuli with target words in the pre-nuclear [–
accented,–final] declarative context (cf. panel (a) of Figure 5). The results showed a tendency for
the weakly falling and flat contours to be identified as Accent 2 (64%) and rising contours as
Accent 1 (78%). This suggests that these pitch differences are in fact usable by younger
speakers, but only in a situation in which their attention is more clearly focused on the target
words than must have been the case in the experiment reported in Fournier et al. (2006),
where they stimuli appeared among stimuli in which their pitch contours were more clearly
different. Although the difference between the Accent 1 and Accent 2 contours shown in
Figure 5 is slight, it resembles the phonetic difference found in the [+accented,–final]
interrogative contours. Apparently, the pre-nuclear position was treated by speaker PI as
accentable. We return to this point in section 4.
Finally, we report vowel durations across accents, contexts and intonation contours. Earlier
research has reported longer vowels for Accent 2 than for Accent 1 in a number of dialects
(e.g. Peters 2008). 3 For Roermond, a small but significant difference was found for the
declarative data in Fournier et al. (2006), in addition to a significant difference between the
longer IP-finalrhymes and the shorter IP-medial ones. An univariate ANOVA of the data
reproted here revealed a significant effect of context only (F(4,140)=19.95, p < .001), with
significant differences between all contexts, except those between [+accented,+final] (178 ms)
and [–accented,+final] (171 ms) and between [+accented,–final] (149 ms) and the postnuclear [–accented,–final] (145 ms). This means that preboundary lengthening emerges as a
strong effect in both investigations, but that there is no effect of accentuation (‘accentual
lengthening’, cf. Cambier & Turk 2000). Neither did the choice of lexical tone affect syllable
or rhyme duration in the present data.
15
- 16 -
Accent 2, Speaker PI
Accent 1, Speaker PI
a
b
Figure 5: f0 contours of Accent 1 and Accent 2 in speaker PI’s stimuli in the pre-nuclear declarative
context used in Experiment I. Solid lines: knien; dashed lines: derm; dotted lines: bein; dot-dashed
lines: sjtein
3. Experiment II
3.1 Experiment II: Materials and recording
The two unexpected findings in the data for speaker PI were investigated in a production
experiment with a group of younger speakers. First, in order to see if younger speakers have
the same pronunciation in IP-medial syllables with interrogative intonation, we composed a
small corpus of sentences specifically designed to elicit the contours for Accent 1 and Accent
2 in IP-medial nuclear positon in interrogative sentences. If the lexical tone contrast in that
context is subject to erosion, we should expect younger speakers to variably use a weak rise
instead of a flat contour in Accent 2 rhymes. This corpus is given in the top row of Table III.
Second, we wanted to see if the subtle rise for Accent 1 in pre-nuclear syllables in declarative
contours which PI produced in a number of cases, was also present in the speech of younger
speakers. Given the general picture of a mildly recessive tone contrast, we should expect them
to neutralize it in pre-nuclear [–accented,–final] positions, both in declaratives and in
interrogatives. To this end, we composed a small corpus of sentences specifically designed to
elicit these contours, given in the bottom row of Table III. We used a non-minimal pair,
weakII and maondjI, so as to see if any effect would show up in a situation in which subjects’
attention is unlikely to be drawn to the position under investigation.
16
- 17 Seventeen native speakers of the Roermond dialect between 18 and 25 years old recorded
these sentences from a script in which they were intermingled with a number filler sentences.
Speakers were recorded individually with the help of a Sharp MiniDisc recorder. The
recordings were monitored by the first author for background noise. Where it occurred during
the pronuniation of a sentence, the speaker was asked to say that sentence again. In one case a
speaker was advised not to break up sentences into separate IPs, but otherwise speakers were
left free to pronounce each sentence as they did. Since grammatical questions may be
pronounced with delarative intonation contours, just as in standard Dutch and standard
German, we obtained a mix of contours in the interrogative condition.
Table III: Sentences used in Experiment II with the target words beinII,I and knienII,1 (nuclear
interrogative position) and waekII, maondjI (pre-nuclear declarative position).
INTERROGATIVE
INTONATION
[+accented,–final]
nuclear
INTERROGATIVE
AND DECLARATIVE
INTONATION
[–accented,–final]
pre-nuclear
Zit diene BEIN aan diene voot?
'Is your LEG attached to your foot?'
Zitte dien BEIN aan dien veut?
'Are your LEGS attached to your feet?'
Höb-se KNIEN gezag?
'Did you say RABBIT(S)?'
Waas hae vorige maondjI in MILAAN?
Nae, hae was vorige maondj in BERLIEN.
'Was he last month in Milan? No, he was last month in Berlin.'
Waas hae vorige waekII in MILAAN?
Nae, hae was vorige waek in BERLIEN.
'Was he last week in Milan? No, he was last week in Berlin.'
3.2 Experiment II: Results
Of the 34 interrogative sentence pairs that were recorded for the [+accented,–final] context,
eleven were accepted as instances of the tonal contrast concerned. A pair of utterances was in
principle included in the data if no segmental errors were found in or around the target words,
and if the f0 contours represented interrogative contours in which Accent 1 had a rising shape
and Accent 2 either a flat or rising shape, in both cases followed by a fall. In sixteen sentence
pairs, at least one target word had a fall of more than 40 Hz. These utterances concerned
syntactic questions spoken with declarative intonation contours, and were excluded. Out of
the remaining 18 pairs of utterances, seven had very similar rises over the target words, with
end points differing by less than 20 Hz. In the other eleven pairs, the difference was larger. Of
17
- 18 these eleven pairs, three were instances of the minimal pair represented by the singular and
plural forms of bein and eight of the minimal pair represented by the singular and plural forms
of knien. We computed average f0 tracks over the eight singular forms and the eight plural
forms of knien. For Accent 1, these turned out to be very similar to NG’s and PI's instances:
mean excursion, the average f0 difference between minimum and maximum, was 113 Hz. In
the forms with Accent 2, only two speakers produced an f0 excursion of less than 10 Hz; the
average excursion was 44 Hz, about the same as what we found for PI's contours. In sum, in
the interrogative contours we found rises for Accent 1 and a range between almost level
contours and rising contours for Accent 2.
The results of Experiment I, in which one of the two speakers showed an approaching merger
in the production data to which listeners responded with lower recognition rates, combined
with the range of variation in the new data between a clearly maintained contrast and a nearlymerged contrast, strongly suggest that the contrast is moving towards neutralization in
interrogative IP-medial accented syllables. Of the small proportion of speakers who produced
a clear contrast, only two actually used the flat Accent 2 contour that was described in
Gussenhoven (2000) and that was best recognized in Experiment I.
The analysis of the target words in the pre-nuclear [–accented,–final] position spoken with
interrogative intonation and declarative intonation confirmed the general neutralization of the
tonal contrast in pre-nuclear position. A mean f0 fall of 20 Hz was observed for both lexical
tones in the interrogative contours, reflecting the overall fall on the stretch of speech
preceding the nuclear word MILAAN. Differences from this mean were tested in an ANOVA,
with lexical tone (Accent 1 vs Accent 2) as the only factor. This factor was not significant
(p=.374).
The original data used in the perception experiment can thus be viewed as entirely
representative instances of prenuclear words pronounced with interrogative intonation. In the
target words in the corresponding declarative contours, we failed to find the weak rise for
Accent 1 which in PI’s stimuli was responsible for the relatively high recognition rate in this
context. Both maondjI and waekII had flat contours. Figure 6 shows the high degree of
agreement for the contours we measured in maondjI.
18
- 19 -
maondj, 16 speakers
Figure 6: f0 contours for maondjI in pre-nuclear position, in declarative intonation contours. N=16.
4 Summary and conclusions
Using a forced-choice identification task with 22 listeners, we found that the dialect of
Roermond neutralizes the lexical tone contrast (Accent 1 vs Accent 2) in IP-medial
unaccented syllables, both before and after the nuclear accented syllable in interrogative
intonation contours. In declarative intonation contours, the neutralization appeared to be
complete only in post-nuclear syllables. In pre-nuclear syllables, we found above-chance
recognition of both Accent 1 and Accent 2. This finding differs from the finding in Fournier et
al. (2006), who observed complete neutralization both before and after the nuclear syllable in
declarative contours. Second, we replicated Fournier et al.’s (2006) finding that he lexical
contrast is firmly maintained in IP-final syllables, whether or not accented, and in IP-medial
accented syllables, in declarative contours. However, we found that in interrogative contours,
firm maintenance of the contrast existed only in IP-final syllables, whether or not accented; in
IP-medial accented syllables the contrast was impaired. Since the stimuli in the experiment
were naturally spoken utterances by two speakers, we carried out acoustic analyses of all
stimuli to see if these two unexpected results could be attributed to the specific pronunciation
of the target words by one or both speakers.
First, the prenuclear declarative target words with Accent 1 spoken by PI had a slight f0 rise
that contrasted with a flat or falling F0 contour for Accent 2. Additional testing with a new
19
- 20 group of subjects suggested that the slight rise was in fact used as a cue for the identification
of pre-nuclear Accent 1 words, but that the speakers did not themselves produce this
distinction in their own realizations. Rather, they produced both lexical tones with a level f0
contour. Second, the IP-medial accented syllables with Accent 1 in interrogative contours
spoken by PI had f0 rises with excursions of more than 85 Hz, while target words with Accent
2 had a range of contours between a low plateau and rises with excursions of up to 64 Hz.
Subsequent recordings with 17 young speakers showed that the unexpected behaviour of the
male speaker in IP-medial accented syllables in interrogative contours was representative of
the speech of younger speakers. Between them, they produced a similar range of contours for
Accent 2 that we found in speaker PI’s utterances, from level to rising f0 contours. Perceptual
confirmation of the cue values of rise for Accent 1 and the level f0 for Accent 2 was provided
by the results of a listening session with the 17 young speakers.
Our interpretation of the small f0 difference in pre-nuclear position in declarative contours is
that speaker PI used pre-nuclear pitch accents on the target words, realizing them nonemphatically. Given their shapes, a weak rise for Accent 1 and a flat contour for Accent 2, the
pitch accents must have been L*Hi, which in combination with Accent 2 gives L*L Hi (by
NORISE from underlying L*H Hi, see (6a)). In this interpretation, shown in (9), the speaker
must have produced a (non-salient) IP boundary after the target word, in order for Hi to
appear. Given that speaker PI tended to produce IP-boundaries in the middle of sentences, it is
not improbable that he continued this behaviour in the third recording session, while playing
down the phonetic salience of the boundaries.
(9)
a. Ich höb ‘knien’ GEZÓNGE
||
|
Li
L*L Hi
H* Li
‘I SANG “rabbit”’
b.
Ich höb ‘knien’ GEZÓNGE
|
|
Li
L* Hi
H* Li
‘I SANG “rabbits”’
More importantly, our interpretation of the reduced contrast between Accent 1 and Accent 2
in IP-medial nuclear syllables with interrogative intonation is that this contrast is on its way to
being neutralized. It is phonologically there in speaker PI, since the difference between the
Accent 1 rises and the Accent 2 rises is phonetically clear. The cause of the reduced salience
of this contrast is the speaker’s abandonment of the late start of the f0 rise after the low level f0
contour in the target syllable, as used in the conservative variety described in Gussenhoven
20
- 21 (2000). If the nucleus is only one or two syllables removed for the boundary, this late rise
must at times be produced quickly, in order to leave time for the production of the fall
towards Li. This is graphically illustrated in (10), which is a repetition of (5) in which the
innovative behaviour has been added as the dotted section of the contour in (10a). Pitch rises
appear to be harder to execute than falls (Ohala & Ewan 1973, Ohala 1978, Sundberg 1979,
Xu & Sun 2002), and the conservative postponement of the rising movement until after the
accented syllable is potentially more effortful than beginning its execution inside it, assuming
that the time available after the accented syllable is limited. This early start of the rise in the
case of Accent 2 will of course endanger its contrast with the rising contour for Accent 1. Xu
& Sun (2002) indicate that the minimal time required for a rise of 130 Hz is 200 ms; a more
comfortable speech style might take 300 ms, and if we reserve 200 ms for the fall, the last two
syllables should take 500 ms. If the pressure to retain the lexical tone contrast falls below this
critical level, therefore, speakers may be expected to take shortcuts. If these shortcuts are
subsequently generalized to situations in which there is less time pressure, the neutralization
will be on its way. Additionally, the interrogative contour may be less frequent than the
declarative contour, reducing the speaker’s motivation to maintain the contrast.
(10) a.
b.
Höbse ‘stjein’ gezag?
||
|
Li
L*L
HiLi
‘Did you say “stone”?’
Höbse ‘sjtein’ gezag?
|
|
Li
L*
HiLi
‘Did you say “stones”?’
The subsequent lack of acoustic salience could lead to the complete disappearance of the
contrast. As Hume and Johnson (2001) put it: “If a contrast is perceptually weak in a certain
position, synchronic phonology works to enhance or sacrifice it by way of epenthesis,
metathesis, dissimilation, assimilation or deletion”. A question that arises here is why the
lexical contrast in unaccented IP-final positions is maintained, in particular in declarative
contours, where the acoustic difference is quite small, as illustrated in (2) and shown in panel
(c) in Figure 1. In fact, recognition scores here are significantly lower than in the
corresponding interrogative contours, where the acoustic difference appears larger, as
illustrated in (7) and shown in panel (f) in Figure 1, but still considerably above the
recognition scores for the IP-medial interrogative syllables. We suggest that this is due to the
greater salience of final syllables, where features are more easily detected (Beckman 1998).
21
- 22 Addionally, it may be the case that is is harder to hear the difference between versions of the
same general shape (in this case, a slow rise vs a faster rise, as in the case of IP-internal
syllables with interrogatives intonation) than between different general shapes (like a fall vs a
level or a rise, as in the case of IP-final unaccented syllables with declarative intonation).
A final comment is devoted to the fact that recognition scores were lower for Accent 1 than
for Accent 2, both in the experiment reported in Fournier et al. (2006) and in our Experiment I.
Fournier et al. (2006) offered two possible explanations. One is that Accent 2 words
systematically represent singular forms. The singular form may be seen as a default, the
option to which listeners resort when they are in doubt. The second interpretation takes us
back to the speech signal, which tends to have level f0 in [–accented, –final] positions. A level
contour may be more readily interpreted as an instance of Accent 2, which generally exhibits
smaller
pitch
excursions
than
Accent
1.
Both
the
morphological
and
the
phonetic/phonological explanations remain plausible, as they can both apply to the declarative
and the interrogative contours. An experiment with homophones might yield data that could
decide between the them.
22
- 23 References
Beckman, Jill N. (1998). Positional faithfulness. Electronic Doctoral Dissertations for
UMass Amherst.
Cajot, José (2006). Phonologisch bedingter Polytonieverlust. Eine tonlose Enklave südlich
von Maastricht. In: Michiel de Vaan (red.) Germanic Tone Accent. Proceedings of
theFirst International Workshop on Franconian Tone Accents. (Zeitschrift für
Dialektologie und Linguistik Beihefte). Stuttgart: Franz Steiner. 12-23.
Cambier-Langeveld, Tina and Turk, Alice (1999). A cross-linguistic study of accentual
lengthening: Dutch vs. English. Journal of Phonetics 27, 171-206.
D'Agostino, Ralph B. (1971). A Second Look at Analysis of Variance on Dichotomous
Data. Journal of Educational Measurement 8, 327-333.
Fournier, Rachel, Verhoeven, Jo, Swerts, Marc & Gussenhoven, Carlos (2006). Perceiving
word prosodic contrasts as a function of sentence prosody in two Dutch Limburgian
dialects. Journal of Phonetics 34, 29-48.
Fournier, Rachel, Gussenhoven, Carlos, Jensen, Ole & Hagoort, Peter (in press).
Lateralization of tonal and intonational pitch processing: An MEG study. Brain
Research.
Gilles, Peter (1999). Dialektausgleich im Letzebuergëschen. Zur phonetischphonologischen Fokussierung einer Nationalsprache.Tübingen: Niemeyer
Grice, Martine, Ladd D. Robert & Arvaniti, Amalia (2000). On the Place of Phrase Accents
in Intonational Phonology. Phonology 17, 143-185.
Gussenhoven, Carlos (2000). The lexical tone contrast of Roermond Dutch in Optimality
Theory. In Merle Horne (Ed.), Intonation: Theory and Experiment (pp. 129-167).
Amsterdam: Kluwer. [Also ROA-382]
Gussenhoven, Carlos & Aarts, Flor (1999). The dialect of Maastricht. Journal of the
International Phonetic Association 29 (2), 155-166.
Gussenhoven, Carlos & Peters, Jörg (2004). A tonal analysis of Cologne Schärfung.
Phonology 21, 251-285.
Gussenhoven, Carlos & van den Vliet (1999). The phonology of tone and intonation in the
dialect of Venlo. Journal of Linguistics 35, 99-135.
23
- 24 Heijmans, Linda (2003). The relationship between tone and vowel length in two
neighboring Dutch Limburgian dialects. In Paula Fikkert & Haike Jacobs (Eds.),
Development in Prosodic Systems (pp. 7-45). New York: Mouton de Gruyter.
Jerome L. Myers, Jerome, L., DiCecco, Joseph V., White, John B. & Borden, Viktor M.
(1982). Repeated Measurements on Dichotomous Variables:
Q and F Tests. Psychological Bulletin 92, 517-525.
Newton, Gerald (1990). Central Franconian. In: Charels V.J. Russ (ed.) The Dialects of
Modern German, A Linguistic Survey. London: Routledge. 136-209.
Ohala, John J. (1978). The production of tone. In: V. A. Fromkin (ed.), Tone: a linguistic
survey. New York: Academic Press. 5 - 39.
Ohala, John J. & Ewan, William G. (1973). Speed of pitch change. The Journal of the
Acoustical Society of America 53 (1), 345.
Peters, Jörg (2006). The dialect of Hasselt. Journal of the International Association of
Phonetics 36, 117-124.
Peters, Jörg (2007a). Lexical tones in the Limburgian dialect of Borgloon. In Tomas Riad &
Carlos Gussenhoven (eds.), Tones and tunes. Volume I: Typological studies in word
and sentence prosody. Berlin: Mouton de Gruyter, 167-198.
Peters, Jörg. (2007b). Tone and quantity in the Limburgian dialect of Neerpelt. Proceedings
of the 16th International Congress of Phonetic Sciences, Saarbrücken.1265-1268.
Peters, Jörg (2008). Tone and intonation in the dialect of Hasselt. Linguistics, 46, 983-1018.
Poletto, Robert (1998). Constraints on tonal association in Olusamia: An Optimality
Theoretic account. In Larry M. Hyman & Charles Kisseberth (eds.), Theoretical
Aspects of Bantu Tone. Stanford: CLSI. 331-364.
Schouten, Bert and Peeters, Wim 1996. The Middle High German vowel shift, measured
acoustically in Dutch and Belgian Limburg: Diphthongization of short vowels.
Zeitschrift für Dialektologie und Linguistik 63, 30-48.
Xu, Yi & Sun, Xuejing (2002) Maximum speed of pitch change and how it may relate to
speech. The Journal of the Acoustical Society of America 111, 1399-1413.
Yip, Moira (2000). Tone. Cambridge: Cambridge University Press.
1
In order to prevent the LiH in form (2a) from being replaced with LiL, an appeal to
positional faithfulness is needed in order to preserve the final H-tone (cf. Gussenhoven 2000).
24
- 25 -
2
We
will
not
comment
on
the
interactions
CONTEXT*SPEAKER
and
CONTEXT*NUMBER*SPEAKER, since they do not highlight any new facts.
3
A longer duration for Accent 2 was however found in isolated pronunciations of 36 words
with declarative and interrogative intonation in unpublished research by Rachel Fournier. In
those data, from NG, the voiced part of Accent II words was 43 ms longer than in Accent I
words, while questions were 26 ms longer than statements.
25