THE ACOUSTIC AND PERCEPTUAL CORRELATES OF EMPHASIS

THE ACOUSTIC AND PERCEPTUAL CORRELATES OF EMPHASIS IN
URBAN JORDANIAN ARABIC
BY
Mohammad Al-Masri
M.A. Yarmouk University, Jordan, 1998
B.A. Yarmouk University, Jordan 1994
Submitted to the graduate degree program in Linguistics and the Graduate Faculty of
the University of Kansas in partial fulfillment of the requirements for the degree of
Doctor of Philosophy
_______________________________
Chairperson
Committee members
_______________________________
_______________________________
_______________________________
_______________________________
Date defended: _______________________________
The Dissertation Committee for Mohammad Al-Masri certifies that this is the
approved version of the following dissertation:
THE ACOUSTIC AND PERCEPTUAL CORRELATES OF EMPHASIS IN
URBAN JORDANIAN ARABIC
_______________________________
Chairperson
Committee members:
_______________________________
_______________________________
_______________________________
_______________________________
Date approved: _______________________________
ii
Abstract
Acoustic and perceptual correlates of emphasis, a secondary articulation in the
posterior vocal tract, in Urban Jordanian Arabic were studied. CVC monosyllables
and CV.CVC bisyllables with emphatic and plain target consonants in word-initial,
word-medial and word-final positions were examined. Spectral measurements on the
target vowels at vowel onset, midpoint and offset revealed a significant increase in F1
and F3 and a decrease in F2 in emphatic compared to plain environments. Emphasis
was observed more in the environment of high and low front vowels than high back
vowels, anticipatory emphasis spread was more salient than perseveratory emphasis
spread, and males showed more emphasis than females. Spectral moment measures of
the target consonants themselves revealed inconsistent or no effects of emphasis.
Results from a perception experiment with cross-spliced monosyllables showed that
perception of emphasis follows the vowel, not the consonant, in the emphatic
syllable, thus confirming the results of the acoustic experiments.
iii
Acknowledgements
Pursing a PhD degree in Linguistics would have been impossible without the
guidance and assistance of my supervisor, Prof. Allard Jongman. I would like to
express my sincere gratitude and appreciation for his help during this long, yet
enriching journey. His continued support, everlasting patience, scholarly spirit and
the many revisions of this dissertation made it possible for this humble work to come
to life. I would also like to thank many other great people: Prof. Sara Rosen for her
kind and supportive spirit, Prof. Joan Sereno for the invaluable remarks on parts of
this dissertation, and Prof. Jie Zhang for his helpful remarks before the dissertation
defense. I am also grateful for all the wonderful faculty members in the Linguistics
Department, especially Prof. Fiona McLaughlin for her faith in me. Special thanks go
to Alex Hornbrook and Corinna Johnson for their dedication and support.
I would also like to thank all my colleagues at KUPPL: Sanae Akaba, Sonja
Combest, Wendy Herd, Yuwen Lai, Kazumi Maniwa, Ariane Rhone, Eva Rodriguez,
Travis Wade and Wael Zuraiq, Many thanks also go to my friends: Osama
Abdulghafer, Khalid Abu Abbas, Abdullah Jaradat, Maria del Carmen Parafita Couto
and Sabri Shboul for the great time and discussions we had.
I would like to extend my appreciation to the great people in the Department
of African and African-American Studies, especially Prof. Naima Boussofara for the
great opportunity she gave me and for being a great mentor, and Prof. Peter
Ukpokodu and Lisa Hall for their continued support.
iv
My love and appreciation go to my great wife, Leena for her unequivocal
support and kind spirit during this experience. And to my little son, Saleh, for being
kind and nice. I am also grateful for the prayers of my wonderful family, especially
my parents.
This research was partially supported by a National Science Foundation grant
(# 0518969). Many thanks for their support.
v
Table of Contents
Abstract ………………………………………………………………………………iii
Acknowledgements …………………………………………………………………..iv
Table of Contents ………………………………………………………………...…..vi
Chapter One: Introduction and literature review ………………………..……………1
1.1.Introduction .…………….……………..……………….…………………1
1.2.Emphasis .………………………………..….…….………………………2
1.3.Urban Jordanian Arabic (UJA) ………………………..……...…….…….6
1.4. The articulatory and acoustic correlates of emphasis ………...………….8
1.5. The perceptual correlates of emphasis ………….……………...……….20
1.6. Statement of the problem …………………….……………...………….21
1.7. Rationale of the study …………………….…………..……………….. 22
1.8. Objectives of the study ……………….……………………...………….23
Chapter Two: Acoustic study – the monosyllables .…………………………………24
2.1. Method ………………………………..…………..……………….……26
2.1.1. Subjects………………………………………………………..26
2.1.2. Materials and Procedure……………………………...………26
2.1.3. Measurements………………………………..………………..27
2.2. Analysis ………………………………………………………………….29
2.3. Results ………………………….………………………………………..30
2.3.1. Duration………………………………...……………………..31
2.3.2. Formant frequency…………………………………………….33
2.3.2.1. F1……………………………...……………………33
2.3.2.2. F2………………………………………..………….39
2.3.2.3. F3 ……………………………………….………….49
2.3.3. Spectral moments………………..……………………………..55
2.3.3.1. Target consonants ………………………………….55
2.3.3.2. /b/ consonant …………………...…………………..56
2.3.4. Locus equations ………………………..………………………57
2.4. Results Summary ………………………………..……………………….61
2.5. Discussion ……………………………………...……………………..….62
Chapter Three: Acoustic Study – the bisyllables….………..…………………..……68
3. 1. Method …………………………………………………………………..69
3.1.1. Subjects………………………………….…...………………….69
3.1.2. Materials and Procedure…..…….….……………...…………...70
3.1.3. Measurements ……...……………………………………………71
3.2. Analysis ……………………………………………………..……………74
3.3. Results …………………………….……………………...……………….75
3.3.1. Duration …………………………….……………………..……..75
3.3.2. Formant Frequency ………………………..……………....……..76
3.3.2.1. F1……………………………………………………77
3.3.2.1.1. target consonant word-initial …….…….…78
3.3.2.1.2. target consonant word-medial ……………82
vi
3.3.2.1.3. target consonant word-final ………………90
3.3.2.2. F2 …………………………………………………...93
3.3.2.2.1 target consonant word-initial ……………...93
3.3.2.2.2. target consonant word-medial…………..…97
3.3.2.2.3. target consonant word-final……………...103
3.3.2.3. F3 ……………………………………………….…105
3.3.2.3.1 target consonant word-initial …………….105
3.3.2.3.2. target consonant word-medial…………....107
3.3.2.3.3. target consonant word-final……………...109
3.4. Results Summary …………………………………………..………...…..111
3.5. Discussion…………………………………………………….…………..113
Chapter Four: The perception study ……………………………….………………117
4. 1. Method …………………………………………..……………..………120
4.1.1. Subjects…………...…………………..………….…………….120
4.1.2. Materials and Procedure………..................……………..…… 120
4.2. Analysis ……………….………………….……………………….……..123
4.3. Results …...………………………………...…………………………….124
4.3.1. Original words……………...….…………...………………. 124
4.3.1.1. Summary ……..………………..……...…………….125
4.3.2. Changing target consonant …..…………..…………...……..126
4.3.2.1. Summary ………...………………………………….131
4.3.3. Changing target vowel……………..........……………………….132
4.3.3.1. Summary ………………………..…….…………… 136
4.4. Discussion ……………………….…………………….…………………137
Chapter Five: Summary, discussion and conclusion ………..…………..…………143
5.1. Summary of results……………………………………...………………..144
5.1.1. Acoustic study: the monosyllables..…….……….………………..144
5.1.2. Acoustic study: the bisyllables…………………..………………..145
5.1.3. The perception study…………...…………………….……………146
5.2. Discussion…………………………………..………………...…………..146
5.3. Conclusion ..…… ………………...………………..…………………….149
5.4. Follow-up … ..…………………………………………..………………..150
References ………….…………...………………...……………..…………………152
Appendices...………………………………………………………………………..157
vii
Chapter One
Introduction and literature review
1.1. Introduction
Emphasis is a feature of Semitic languages under which some consonants are produced
with a secondary articulation in the back part of the oral cavity. Emphasis has been
studied extensively in many dialects of Arabic, mostly through the contrast between
plain and emphatic segments. It has been dealt with as a phenomenon pertaining to
pharyngealization and/or uvularization. Both pharyngeals and uvulars are produced in
the back part of the oral cavity with slightly varying places of articulation. Many
sounds in Semitic languages are primarily produced in this region, and secondary
articulation involving this region is also characteristic of Semitic languages.
Research on emphasis dates back to the era of classical Arab linguists in the eighth
century AD. They described emphasis in terms of production with scattered attempts to
describe the vocal tract configuration during the production of emphatics. A large
number of the studies that investigated emphatics were either articulatory, focusing on
describing the configurations of the articulators involved in the production of
emphatics, or phonological, concentrating on aspects such as the feature RTR
(Retracted Tongue Root), emphasis domain, emphasis spread, and emphasis blockers.
Some of the more recent studies on emphasis focused on sociolinguistic aspects of
emphasis pertaining in particular to gender distinctions in the production of emphasis.
1
A considerably increasing number of studies have been examining the acoustic
correlates of emphasis. However, thorough investigations of the perceptual mapping of
these correlates have been very scarce. Despite the substantial literature on emphasis,
many aspects of the phenomenon are still murky due to a number of reasons such as
variations from one language/dialect to another, the design of the experiments
conducted, the number and dialectal background of the subjects that participate in these
studies, limitations on the consonants and vowels used in the data, to mention a few.
While some of the Semitic languages have lost the emphasis distinction, Arabic still
retains the full set of emphatics reconstructed from Proto-Semitic at a number of places
of articulation (Versteegh, 2001; Watson, 2002). All dialects of Arabic have minimal
pairs of coronal obstruents where the difference is only in terms of emphatic vs. plain.
The present study uses data obtained from Urban Jordanian Arabic (UJA). It presents
an investigation of the acoustic and perceptual correlates of emphasis in UJA.
1.2. Emphasis
Different definitions of emphasis agree that it involves a posterior articulation. Studies
that used different dialects highlighted certain aspects of the definition of emphasis.
Among these, Walter Lehn provides a satisfactory definition of emphasis that
encompasses several aspects of the phenomenon. He says that:
Emphasis is the cooccurrence of the first and one or more others of the following
articulatory features: (1) slight retraction, lateral spreading, and concavity of the tongue
and raising of its back (more or less to what has been called velarization), (2) faucal
and pharyngeal constriction (pharyngealization), and (3) slight lip protrusion or
rounding (labialization), and (4) increased tension of the entire oral and pharyngeal
2
musculature resulting in the emphatics being noticeably more fortis than the plain
segments.
(Lehn, 1963:30-1)
Lehn’s definition accounts for more than one type of emphasis at the level of
articulation.
Lehn
justifiably
makes
a
distinction
between
emphasis
and
pharyngealization proper. Though most emphatics are pharyngeal, it has been found
that some emphatics are labialized in addition to their pharyngealization (Watson,
1999). Other definitions of emphasis are more restricted to pharyngealization. Some
Arab grammarians refer to emphasis as ½itbãq (literally, 'covering') and define it as
“spreading and raising of the tongue” (Lehn, 1963: 29). Others use the word ½isti¿laa½
(literally, 'elevation') and define it as “elevation of the dorsum,” ibid. Delattre (1971:
129) describes the production of pharyngeals as one in which “the root of the tongue
assumes the shape of a bulge and is drawn toward the vertical back wall of the pharynx
to form a stricture. This radical bulge generally divides the vocal tract into 2 cavities:
one below extending from the stricture to the glottis, the other above extending from
the stricture to the lip.” This provides a detailed description of the mechanism of the
production of pharyngeal sounds but it does not adequately account for the production
of emphatics as it involves another secondary yet crucial articulation. Kahn (1975: 39)
defines emphasis as a “secondary pharyngeal articulation of certain consonants, usually
stops and fricatives.” She adds that the articulation of emphasis involves the organs
engaged in the production of a given sound in addition to a secondary pharyngeal
articulation. This means that any consonant in Arabic, more precisely in Cairene
Arabic, the dialect she studies, can be emphatic. Though obstruents are the most
3
common emphatics in Arabic, some studies conducted on Cairene Arabic show that
emphasis can be a property of laterals and rhotics, too (Cf. Ferguson, 1956). This is also
supported by later reports from other studies conducted on other dialects of Arabic.
Zavadovsky (1981) cited in Zemánek (1996) reports that in the Mauritanian dialect
each consonant can have its emphatic and non-emphatic counterpart.
McCarthy (1994: 38) states that emphasis studied in different dialect regions in the
Arab world is always characterized by a "constriction in the upper pharynx". He
distinguishes between these emphatics and pharyngealized consonants claiming that
while the former ones are purely emphatic, the latter ones should be called uvularized –
affected by another set of back segments, i.e., uvulars: /q, ¼, Ë/.
Davis (1995: 465) defines emphasis, which corresponds, as he posits, to
pharyngealization, as the phenomenon of producing sounds “with a primary articulation
at the dental/alveolar region and with a secondary articulation that involves the
constriction of the upper pharynx.” He also provides an account for bilabial emphasis
in Arabic, which adds bilabials to the class of possible emphatics, not confining it to the
dental/ alveolar place of articulation. Emphasis at the labial region was later reported by
Watson (1999: 289) in a dialect of Yemeni Arabic. She says that “emphasis has two
articulatory correlates: pharyngealization and labialization,” and reports some examples
from Yemeni Arabic though there is a clear lack of minimal pairs at the labial place of
articulation. The bilabial consonant mentioned here is the bilabial nasal /m/. This
4
segment does not have an intrinsically emphatic counterpart. There is a set of minimal
pairs reported in the literature where /m/ is emphatic in one case and plain in the other –
/ma:lak/ vs. /m¿a:lak/, 'your (masc. 2nd person) money, and 'what's wrong with you
(masc. 2nd person)?' respectively. Another segment that shows similar behavior is /l/ in
sentences like /walla:hu/ and /wal¿l¿ahu/, 'he appointed him' and 'by God,' respectively.
These occurrences are not common in Arabic, and therefore I will not discuss them in
detail.
One of the most detailed studies of emphasis has been conducted by Card (1983). She
discusses emphasis from the phonological and phonetic perspectives in addition to
describing the articulation of emphasis. Rather than giving a brief definition of
emphasis, she provides a more detailed description of this phenomenon that I will leave
for the next section. More recent investigations on emphasis were provided by Laufer
and Baer (1988) who studied emphasis in Hebrew and Arabic, Zawaydeh (1999) who
studied gutturals in Arabic, Watson (2002) who found evidence for emphasis in uvulars
and bilabials, and Khattab, Al-Tamimi, and Heselwood (2006) who studied the gender
differences in the use of /t/ and /t¿/ in Jordanian Arabic. Their studies also fit better in
the next section. In summary, all previous definitions stress that the production of
emphatics involves a set of features that engage the pharynx in a mechanism much
similar to the production of pharyngeals.
5
1.3. Urban Jordanian Arabic (UJA)
One of the interesting debates in the literature is concerned with the linguistic situation
in Arabic. It is common to engage in discussions about the relation between Modern
Standard Arabic and the different dialects, and also the relation between these dialects
themselves and the different divisions of dialectal regions, hence the need to give a
clear idea about the dialect of Arabic from which the data for this study were collected.
The question of ‘what dialect of Arabic are you studying?’ has for a long time been of
interest to linguists. The linguistic situation within the Arab world is easy to describe:
each Arab community speaks its own dialect and understands a host of other Arabic
dialects including MSA, and to a lesser extent Classical Arabic (CA). Due to the wide
distribution of Egyptian movies and sitcoms in the Arab world, Egyptian Arabic seems
to be the most widespread dialect of the Arab world. Within each country, there are
distinct sub-dialects. Differences between these sub-dialects are more often than not
related to dissimilarity in the consonantal rather than the vocalic inventory. These
distinctions are usually hard to detect. For example, it is hard for a non-Moroccan
speaker of Arabic to distinguish different Moroccan dialects, to which he/she will
collectively refer as ‘Moroccan Arabic’. Similarly, Jordanian Arabic (JA) is commonly
spoken by Jordanians; however, there are different sub-dialects within what is
commonly referred to as JA that are hardly known to non-Jordanians. One of the
relatively vivid investigations of JA which addresses the linguistic situation in Jordan is
Suleiman (1985). He provides an investigation of the linguistic varieties, the
classical/colloquial dichotomy referred to as diglossia, that dominate the scene in
6
Jordanian Arabic, focusing on the linguistic practices of the educated groups in Jordan.
He collected data from 40 students at Yarmouk University in Jordan by means of
personal interviews and questionnaires. He found that the linguistic situation in Jordan
is best described as triglossic consisting of Classical Arabic, Colloquial Arabic and
MSA (p. 93). He divides JA into three dialectal regions: the ‘Madani,’ represented in
the urban centers, the ‘Fallahi,’ spoken in rural areas, and the ‘Bedouin,’ spoken by the
non-sedentary nomads, (p 13). These dialectal regions of JA differed in the consonantal
inventory. A more recent description divides JA into four sub-dialects: Urban, Rural,
Bedouin and Ghorani, (Zuraiq and Zhang, 2006). UJA is spoken by people living in the
major cities, including the capital city, the Rural dialect is spoken by villagers, mostly
from the northern villages of Jordan, the Bedouin dialect is spoken by the Bedouins in
the north east, middle and south of Jordan. And finally, the Ghorani dialect is spoken
by farmers and people living in the Jordan valley. As mentioned earlier, these dialects
differ in their consonantal inventories while the vowels are essentially the same. The
consonant inventory of UJA is shown in Table 1.
7
Table 1. Consonant inventory of Urban Jordanian Arabic (Zuraiq and Zhang, 2006)
ð s
ð¿ s¿
ß
k
g
x
¥
Glottal
}
Pharyngeal
Affricate
Approximant
Glide
f
Velar
Fricative
d
d¿
n
r
z
Palatal
m
t
t¿
Palatoalveolar
Nasal
Trill
Alveolar
b
Interdental
Labiodental
Labial
Plosive
½
š
¿ h
d¹
l
w
j
1.4. The articulatory and acoustic correlates of emphasis
As mentioned earlier, emphasis has been mostly studied from a phonological
perspective. However, many of the phonological observations might very well map on
to the acoustics of emphasis. Sibawayh, cited in Semaan (1968), described the
articulation of emphatic sounds as early as the eighth century AD. Sibawayh uses the
term ½itbãq, literally 'covering,' to refer to emphasis. He says that emphatic sounds are
produced by “the part of the tongue which is the place of their utterance being closely
covered in their utterance by what is opposite to it of the palate” Semaan (1968: 45). He
adds that in the production of emphatics “you raise it [the tongue] towards the palate.”
ibid. He also says that if it was not for emphasis, then /t¿/ would be /d/, /s¿/ would be /s/,
/z¿/ 1 would be /ð/ and /d¿/ would disappear from the language, (ibid p. 46). In other
words, Sibawayh suggests that the emphasis distinction creates minimal pairs differing
1
Semaan uses this phonetic symbol to refer to the emphatic voiced dental fricative.
8
in the feature [+emphatic] or [-emphatic]. This description pairs the /t¿/ with the /d/
rather than the /t/, where it should be, having in mind that the emphatic/ plain minimal
pairs agree in place and manner of articulation but differ only in emphasis. This account
also draws heavily on the physical observation of the speech organs that are brought
together in the production of emphatics. It should be noticed here that a clear account of
the status of these emphatics remains unavailable. Though Classical Arabic (CA), the
variety of Arabic spoken during the early Islamic era, retains the same orthography as
Modern Standard Arabic (MSA), it has undergone many changes at the phonetic level.
It is due to these and other changes that we now have MSA. A simple match of the
orthographic symbols to the same phonetic realizations of MSA is likely misleading.
Another account in the medieval literature on emphatics is sketched by Card (1983). In
addition to her account of Sibawayh’s study, much similar to what Semaan has
presented, she discussed Ibn Sina (980 – 1037) – known in the western literature as
Avicenna. Ibn Sina was a physician and his account of these sounds is much affected
by his scientific background. As an example, as discussed by Card, (1983: 9), he
describes the production of the sounds /t¿/, /t/, /d/, saying that they have the same place
of articulation. “For /t¿/ the air is restricted by the larger part of the tongue tip and the
two sides of the tongue. A depression is formed in the center of the tongue, which
resonates as the air is forcibly driven out. The /t/ has similar articulation but only the
tongue tip restricts the air.” As for the sound /d/, Ibn Sina states that, “there is no
covering of the palate,” and “the air is not strongly restricted,” hinting probably at the
9
energy of voicing in the vocal folds. This, I think, is an insightful description of these
three phonemes in Arabic much valued at a time when very few tools, if any, were
available to provide more sophisticated descriptions. For other descriptions of emphatic
as well as plain sounds of Arabic, Ibn Sina uses terminology such as ‘air bubbles’ and
‘yielding membranes,’ ibid, which are not as linguistic as those used by Sibawayh, yet
his observations are still linguistically valid.
These two accounts that have been presented give us an idea about emphasis but the
descriptions of the sounds might not be totally valid to account for the same sounds that
are now present in different dialects of Arabic. Sibawayh’s and Ibn Sina’s descriptions
cover the era from the eighth to the tenth century A.D. and possibly for a short time
after that. However, in light of the inevitable change that affects many sounds of the
world's languages over time, descriptions of certain sounds have to differ by extents
compatible with the degree of change.
Modern studies of emphasis have changed their focus from descriptions solely based on
observation to more detailed and accurate explanations made possible by available
techniques. Marcáis, as discussed by Card (1983: 13) investigated the articulation of
emphatics in the 1940s making use of palatograms and radioscopy of the vocal tract. He
found that the articulation of emphasis involves “muscular tension and retraction of the
prominent articulating organs.” “The tongue root approaches the back of the pharynx,
with the result that the back of the tongue forms a ‘collapsed plateau’, dipping away
10
from the palate.” The feature that holds for all descriptions of the production of
emphatics so far is that they all involve upper pharyngeal constriction. A distinction has
to be made here between pharyngeals and pharyngealized segments. The term
pharyngeals refers to the sounds whose primary articulator is the pharynx whereas the
term pharyngealized, used to describe emphatics, means that the pharynx is the
secondary articulator for sounds articulated primarily with other speech organs. Ghazeli
(1977) shows the distinction in the production of pharyngeals and pharyngealized
segments. Using cinefluographic films, he shows that for pharyngeals such as /¿/ and
/š/ the greatest constriction was located below the epiglottis, while the greatest
constriction for emphatics occurs in the upper pharynx. This suggests that the
production of these two types of sounds is unrelated – giving no legitimacy for using
the two terms interchangeably as some linguists do.
Lehn (1968: 31) has noticed other features of emphasis. In his phonological study, he
mentions that features of emphasis do not hold for speakers of different dialects to the
same degree. He observed that in Cairene Arabic, emphasis is more characteristic of
men than women and that effects of emphasis on adjacent segments, i.e. emphasis
spread, are also not similar. He does not, however, provide experimental support to
validate his statements that are built on articulatory impressions. If it is true that women
emphasize less than men do, we still need to know if this occurs in all environments
and in all dialects. These observations are mostly impressionistic.
Another
characteristic of emphasis that has been frequently observed and has experimental
11
support is that emphatics affect adjacent vowels by dragging them to the low back
position, hence lowering their F2 frequencies. With this in mind, Lehn’s observation
that “all plain consonants occur before plain vowels” ibid is straightforward. To that he
adds that emphasis never occurs as the only feature of any segment in any environment;
“its minimum domain is CV”. Lehn is the first one to establish that the minimum
emphasis domain is the syllable. He states that the minimum emphasis domain is the
syllable and that the maximum domain, later referred to as emphasis spread, is the
utterance. Further, he says that emphasis domain is calculated at the syllable level,
which means that a whole syllable is either emphatic or plain, (p. 37). For this last
argument, Lehn cites examples such as /t¿i:n/ vs. /ti:n/, 'mud' and 'figs', respectively;
and /z¿u:r/ vs. /zu:r/, 'perjury' and 'visit' (imperative), respectively. For these pairs,
subsequent studies (Card 1983, Davis 1995, and Watson 1999) report that the long
vowels block emphasis spread and are phonologically opaque to emphasis. However,
vowels blocking emphasis do not violate the minimum emphasis domain requirement.
In light of the fact that the present study is devoted to the acoustics of emphasis and
perception of emphasis, it remains unknown whether phonological opacity applies to
phonetic correlates.
Kahn (1975) compared the production of emphatics among males and females. She
found that emphatics in Cairene Arabic do not lower F2 frequency values to the same
degree for male and female speakers. Instead, the extent of F2 lowering for adjacent
segments, an indication of emphasis, is greater for male speakers than for female
12
speakers. This suggests that male speakers ‘emphasize’ more than female speakers,
seemingly an acoustic replication of Lehn’s (1968) phonological observation. Kahn
also tested a number of non-Arab speakers of American English who had been trained
by an Arab male speaker to produce emphatics and found that there was no significant
gender difference in their articulation of emphatic sounds. She concluded that “among
Arabs, the magnitude difference in the formant frequencies between men and women
was found to be much greater than anatomically predicted by Fant and the direction of
sex-related formant differences was opposite to Fant’s predictions.”
(p. 38). The
finding that differences in F2 frequencies between emphatic and plain segments were
smaller for native female than male speakers of Arabic (p.42) should be attributed to
sociolinguistic factors. If these patterns were due to anatomical properties, we would
expect both American and Cairene female speakers to emphasize less than males. The
fact that this was not the case suggests that both male and female speakers can
emphasize by similar degrees but don’t for sociolinguistic reasons: softening harsh
sounds, avoiding course sounds in favor of more ‘feminine’ sounds. That is, female
speakers intentionally emphasize less than males. Similar findings have been reported
in later research: Royal (1985: 155 - 157) on Gamaliya, a dialect of Egyptian Arabic,
and Haeri (1996: 70 - 74) also on Cairene Arabic. However, results contrary to the ones
mentioned above have also been reported. Al-Masri & Jongman (2004) found that
females had a greater degree of F2 lowering in emphatic segments, meaning that they
‘emphasized’ more than male speakers did.
13
Davis (1995) examined two dialects of Palestinian Arabic: a northern dialect and a
southern dialect. His analysis is phonologically motivated. He considers applications of
Grounded Phonology to account for emphasis spread in these two dialects. Davis
reports that emphasis spread behaves asymmetrically: leftward emphasis is not blocked
for the entire word but rightward emphasis is always blocked by [+high, –back] vowels
(p. 468). This goes against what Lehn suggested, namely that the vowels /i:/ and /u:/
always block emphasis spread, and it actually excludes the vowel /u:/ from being
opaque.
Davis' findings support the assumption that different dialects show asymmetries in the
characteristics of emphasis spread with respect to the so-called opaque segments that
block emphasis spread, and the direction of spreading. In more recent phonological
accounts put forward by Davis (1995) and Watson (1999), emphasis is realized by the
spread of the feature Retracted Tongue Root [RTR] to adjacent segments. This, I think,
maps on to the phonetic parameter of low F2. This mapping, however, might very well
be non-linear. In other words, the same distance of spread of the feature [RTR] in the
underlying representation might not correspond to a comparable lowering of F2 values
of the same segments in the same directions.
Card (1983) divides her investigation of emphasis into two parts: phonetic and
phonological. Having established a relatively clear account of the phonological
attributes of emphasis, I shall focus on the phonetic part of her study. Card studied
14
emphasis in Palestinian Arabic and found evidence for the lowering effect of emphasis
on F2 frequency for segments in emphatic environments compared to higher
frequencies for corresponding segments in plain environments. In addition to this bythen clear characteristic of emphasis, she found that emphasis spreads phonetically
rightward and leftward in the whole word, (p.49). Furthermore, she found that if the
emphatic segment is word-initial, then emphasis is likely to spread to an adjacent word.
Secondary emphatics, i.e. those that acquire emphasis effects from neighboring
segments through spread, do not have F2 frequencies as low as those that have
segments with primary emphasis or as high as F2 frequencies of plain segments. Card
noticed that emphasis spread is blocked in the presence of the vowels /i:/ and /u:/. The
presence of these vowels in a word eliminated the main effect of emphasis, F2 drop.
Interestingly, she reports that for one of the stimuli, /t¿i:r/, meaning ‘fly!’, she found a
significant F2 drop despite the occurrence of a ‘blocking’ segment (p. 79).
Royal (1985) explored the sociolinguistic factors crucial in linguistic choice. She
particularly investigated this choice in terms of the degree pharyngealization. Royal
observed a general tendency among Cairene women to have markedly less
pharyngealization than Cairene men. But with a closer look at her results, it is evident
that this difference is ascribed to conventional choice rather than anatomical constraints.
She found that older Gamalyian women, women of Gamalyia – a suburb of Cairo,
display stronger pharyngealization than older Gamalyian men (Royal, 1985:165). This
led Royal to question any anatomical constraints on the vocal tract configurations for
men and women as far as their capability of pharyngealization is regarded. Another
15
observation is that she noticed variability in the degree of pharyngealization among
Gamalyian subjects in response to the sex of the listener. In other words, she found that
the degree of emphasis varied depending on the gender of the addressee.
These
findings contradict Kahn’s (1975). Similar findings, that is, differences ascribed to
social norms rather than anatomical facts, were reported in the other studies mentioned
above.
Another study was conducted by Laufer and Baer (1988) on emphasis in Hebrew and
Arabic. They collected 300 minutes of video recordings and simultaneous audio
recordings for nine Hebrew and Arabic speakers using a fiberscope in the upper
pharynx. Their analysis shows that all the emphatic and pharyngeal sounds were made
with qualitatively the same pharyngeal constriction but that the pharyngeal
constrictions were more extreme for pharyngeal sounds than for oral emphatics. This
provides more evidence for the fact that emphatics and pharyngeals should be
distinguished, cf. McCarthy (1994).
Zawaydeh (1999) studied the phonology and phonetics of gutturals in a number of
Arabic dialects. She distinguishes two types of gutturals but reports that they behave
similarly. These two types of sounds are uvulars: /q, ¼, Ë/ and emphatics: /d¿, t¿, ð¿, s¿/.
Most of her work is devoted to studying the uvular sounds in terms of their
phonological and phonetic behavior. Zawaydeh supports McCarthy’s (1994) argument
that emphatics are uvularized sounds, neither velarized nor pharyngealized. Previous
studies (Ali and Daniloff 1972; Al-Ani 1978) showed that emphatics can be
16
pronounced with a secondary articulation either in the pharynx or at the velum.
Zemánek (1996) reports much similar findings.
Zawaydeh presents results on the acoustic characteristics of gutturals. She posits that
gutturals can be distinguished as one group consisting of emphatics, uvulars,
pharyngeals and laryngeals. Her findings indicate that gutturals show similar but not
identical acoustic behavior. For example, she reports that while pharyngeals are
characterized by a raising of F1 for adjacent vowels, emphatics show a strong effect of
F2 lowering for adjacent vowels. While effects of emphasis and uvularization can be
blocked, the segments that cause blocking are not similar for both classes. She also
distinguishes pharyngeals /š/ and /¿/ from emphatics in that pharyngeals do not cause
F2 drop for adjacent vowels but only a significant raising for F1. Further, her findings
show that F2 lowering spreads to adjacent syllables if it is triggered by emphatics not
by pharyngeals. This is evident in UJA in that low vowels after emphatics are
pronounced with a back allophone that does not appear after pharyngeals (p. 38 – 9).
Zawaydeh also questions whether the spread of emphasis and uvularization is
categorical or gradient. In other words, whether the F2 lowering or F2 raising of the
vowel in the target syllable and F2 of adjacent vowels drop gradually or categorically.
Her findings show that the drop is gradual but some discrepancies are also reported.
More recently, Watson (2002) provides an acoustic account of emphasis. She says that
“the oral emphatics are typically marked by a compact acoustic spectrum through
17
lowering of the upper frequency formant (principally F2) due to an enlarged mouth
cavity, and a raising of F1 due to a reduced pharyngeal cavity.” (p. 270). She does not
mention the effects of opaque vowels or the domain of emphasis.
Khattab, Al-Tamimi, and Heselwood (2006) investigated the gender differences in the
production of /t/ and /t¿/ in Jordanian Arabic. 10 speakers of JA (5 males and 3 females
were from the northern part of Jordan, 2 females were from the capital city of Amman)
produced 4 pairs of /t/ and /t¿/ in word-initial positions with the vowels /i/ and /a/. They
report lower F2 and higher F1 for vowels in emphatic environments compared to their
plain counterparts regardless of gender, (p. 151). They also report longer VOT delays in
/t¿/ than in /t/ only for females. They conclude that females generally have a markedly
less emphatic speech compared to males. In other words, VOT latencies were more
pronounced in male speech. However, this result, as they later explain (p. 155) is
compromised by the fact that the two female speakers that were from Amman are the
ones that show less emphasis. This is in line with the findings of Royal (1985) on
Gamalyian Arabic.
Another acoustic measure pertaining to emphasis is the locus equation. Locus equations
are ‘straight line regression fits to data points formed by plotting F2 transitions along
the y axis and their corresponding midvowel nuclei along the x axis,’ Sussman,
McCafrrey, and Matthews (1991). They provide information about the place of
articulation for the consonants, and also about the degree of coarticulation. Slopes
generated by plotting the F2 Onset by F2 Vowel indicate the degree of coarticulation.
18
Yeou (1997) investigated locus equations for a set of emphatic consonants: /d¿/, /t¿/,
/ð¿/, /s¿/ in Modern Standard Arabic spoken by native speakers of the Moroccan dialect.
He found that locus equations reflected some place distinctions. He also reports that the
pharyngealized set of consonants used in the study, which is the same set of primary
emphatics used in the present study, emerged as a distinct class, ‘having the flattest
locus equation slopes of all consonants.’ This points to minimal coarticulation between
the consonants and the vowels.
Another measurement is the spectral moments (center of gravity, standard deviation,
skewness and kurtosis). Norlin (1987) measured spectral moments for Egyptian
sibilants /s/ and /s¿/ in word-initial positions. He used FFT spectra created from a 26.5
ms sample taken after the first third of the sibilant, (p. 16). The target consonants he
used /s/ and /s¿/ were used in one vocalic environment /a:/.Norlin found that /s¿/ had a
significantly lower center of gravity (5974 Hz) compared to /s/ (6345 Hz), (p. 94).
Jongman, Wayland, and Wang, (2000) found that a greater positive skewness indicates
a concentration of energy in the lower frequencies.
Clearly, back articulation in several dialects of Arabic has triggered much research to
find out its phonological, articulatory, acoustic and, to a much smaller extent,
perceptual correlates. Many of these studies provide us with findings that help draw a
better picture of this back articulation in Arabic. Now we know more about the
mechanism of back articulation, spread of certain classes such as emphatics and
uvulars, blocking segments, directionality of spread and the importance of locus
19
equations. Despite these valuable findings, many of the studies in this respect suffer
from certain limitations. Some are too general to the point that they deal with a
relatively great number of back sounds despite the fact that the articulators of these
sounds are different. As suggested by Perturbation theory (Chiba and Kajiyama,1942),
not all back sounds have similar acoustic effects. Other limitations that are common
include using data from different dialect regions, combining a limited number of
subjects, and using data from the researcher himself/herself.
1.5. The perceptual correlates of emphasis
Perception of emphasis has received much less attention than the acoustics of emphasis.
Obrecht (1968) conducted a study on the perception of emphasis in Lebanese Arabic.
He synthesized a /t¿i:-ti:/ continuum in which he varied F2 in 120 Hz steps from 1080
Hz to 1800 Hz; F3 was fixed at 3000 Hz and F1 was flat for all the experiments.
Obrecht found that the perception of emphatics was categorical. The turning point
between emphatic and plain occurred at an F2 around 1560 Hz. Stimuli higher were
perceived as plain and stimuli lower were perceived as emphatic. He also found that the
perception of emphasis depended largely on vowel quality. In a replication of Obrecht’s
experiments for the /s¿i:-si:/ distinction in Moroccan Arabic, Yeou (1995) found similar
results. He also found that the F1 transition lacks perceptual cues for the plain/
emphatic distinction. Ali and Daniloff (1974) on Iraqi Arabic found that the vocalic
portion of the plain/ emphatic words included sufficient cues for listeners to detect the
distinction.
20
Other studies that report on the perception of emphasis viewed the results in
sociolinguistic terms related to gender, social selection, and existing linguistic norms
(Kahn 1975; Alwan 1983; Wahba 1993; and Al-Wer 2000b). Kahn (1975) states that
emphatics were equally perceived by native speakers of Arabic regardless of their
gender.
Due to the apparent scarcity of studies on the perception of emphasis, our
understanding of the characteristics of emphasis remains wanting. The previous studies
either focused on the emphatic/plain consonant, not on the vowel, or used the whole
word as the unit of perception. While focusing on the consonant gives us a good insight
into the nature of the target obstruents involved in emphasis, using the whole word falls
short of accounting for the source of the difference. Neither of them accurately
investigates the characteristics of the vowel adjacent to the target consonants.
1.6. Statement of the problem
The present study examines the acoustic and perceptual correlates of emphasis in UJA.
Many of the previous studies provided phonological accounts of emphasis including
emphasis domain, emphasis spread, directionality of emphasis spread, and opaque
vowels that delimit the domain of emphasis spread. Although several recent studies
explored the acoustic characteristics of emphasis, many aspects of the acoustic signal
21
for emphatic segments are still unclear. The relevance of the target consonants and
vowels and their effects on the acoustics and perception of emphasis is still debatable.
The present study focuses on the characteristics of the consonants by measuring their
duration, spectral moments and locus equations; and also on the vowels by measuring
their durations, F1, F2, and F3 at onset, temporal midpoint and offset of the vowel. The
present study also tests whether the findings on the acoustics of emphasis map on to
perception. That is, whether the perception of emphasis is realized most on the target
consonant or on the target vowel, with reference to some characteristics of the
consonants (manner) and the vowels (quality).
1.7. Rationale of the study
The current study examines the acoustics and perception of emphasis in UJA to provide
a better understanding of this phenomenon in Arabic linguistics. It aims to further
previous research on this matter and add deeper insights into the nature of emphasis.
Despite the increasing number of acoustic studies conducted on emphasis, several
factors contribute to the need for the present study. Much of the previous research had
limitations on the number of subjects and the inconsistency of their dialects, the nature
of the data used, the limited number of measurements, and in some cases the lack of
statistical analyses. With regard to the perception of emphasis, research is very scarce
to begin with. The limited number of studies on the perception of emphasis leaves this
aspect of emphasis virtually unexplored.
22
1.8. Objectives of the study
The present study aims at first better clarifying the nature of emphasis in UJA. While
many of the previous studies investigated emphasis in different dialects of Arabic, the
present one helps complete the picture by using data from a less commonly studied
dialect of Arabic. This should be helpful in enriching the different fields of Arabic
linguistics and possibly paving the way for much needed future research on Arabic.
One of the basic contributions of this study is to provide a better understanding of
emphasis through a systematic approach that enables the author to report on the
characteristics of the plain and emphatic segments by means of the measurements
conducted. The relatively fine control on the consonant types, using the different
vowels of UJA, using a relatively large and equal number of male and female subjects
and large number of observations should allow for reliable results. In addition to
contributing to the research on Arabic linguistics, a better understanding of the
characteristics of emphasis at the levels of acoustics and perception will definitely be
helpful for pedagogical purposes.
23
Chapter Two
Acoustic Study – the monosyllables
In this chapter, I present the first part of the acoustic study which deals with the
monosyllables. This part aims to introduce the basic findings on the acoustic correlates
of emphasis in UJA. Specifically, it aims to answer the following questions:
1.
What are the characteristics of the plain/emphatic consonants?
2.
Does manner of articulation of the consonants affect emphasis?
3.
Are the effects of emphasis equally attested for different vowels?
4.
How does emphasis spread within the syllable?
5.
Where is emphasis realized most?
6.
Does gender affect the degree of emphasis?
In accordance with perturbation theory (Chiba and Kajiyama, 1942), see also (Kent and
Read, 24:1992), it is expected that the presence of a constriction in the back part of the
vocal tract triggers F2 lowering and may raise F1 and F3. Many previous studies on
emphasis unanimously report F2 lowering of the vowel in the same syllable with the
target consonant (Al-Ani, 1970, 1978; Kahn, 1975; Ghazeli, 1977; Giannini & Pettorio,
1982; Card, 1983; Bukshaisha, 1985; Laufer & Baer, 1988; Haselwood, 1992;
Zawaydeh, 1999; Al-Masri & Jongman, 2004; Khattab, Al-Tamimi & Heselwood, 2006
among others). Fewer studies report F1 raising (Klatt & Stevens, 1969; Al-Ani, 1970,
1978; Ghazeli, 1977; Giannini & Pettorio, 1982; Alwan, 1986; Butcher & Ahmad, 1987;
24
Laufer & Baer, 1988). And finally, Card (1983) reports no differences between F3
values in plain and emphatic environments. In terms of durations, Obrecht (1968)
hinted that F2 consonant-vowel transitions were longer in emphatic positions compared
to their plain counterparts. In Egyptian Arabic, Norlin (1987:76) found that there was a
slight tendency for plain vowels to be longer than emphatic ones, especially for long
vowels. Alioua (1995), however, reported contrasting results, namely that vowels
following emphatic consonants had slightly longer durations than their plain
counterparts. And finally, Norlin (1987) found lower center of gravity for word-initial
emphatic fricatives compared to plain ones. Based on previous literature reviewed in
Chapter 1, I make the following hypotheses:
1.
Emphatic consonants will be slightly longer than their plain counterparts.
2.
Vowel duration is not affected by emphasis.
3.
Emphatic consonants will have a lower center of gravity than their plain
counterparts. They are also expected to have a greater positive skewness due to the
concentration of energy in the lower frequencies.
4.
Locus equations will show flatter slopes for emphatic consonants compared to
their plain counterparts.
5.
F2 will be significantly lower for the vowels adjacent to the emphatic consonant.
6.
F2 lowering will be strongest closest to the emphatic consonant.
7.
The presence of an emphatic consonant will raise F1 and F3 in adjacent positions.
8.
Gender does not affect emphasis; F2 lowering is expected to be similar for males
and females.
25
2.1. Method
2.1.1. Subjects
Eight adult speakers (four males and four females) of UJA aged 20 to 39 years old
(average age was 28.5 years) participated in the study. Three males, who were PhD
students at the University of Kansas, and their spouses were living on the KU campus;
the remaining male and female speakers lived in Fayetteville, AR. All of the subjects
spoke the urban dialect of JA. Duration of stay in the US at the time of recording
ranged from 3 months to 4 years (average duration of stay was 2.85 years). None of the
subjects reported any known speech or hearing impairments.
2.1.2. Materials and Procedure
A list of 96 target CVC stimuli was prepared. The wordlist consisted of 48 pairs of
words where the target consonant (the plain or emphatic consonant) was either wordinitial or word-final. Each of the UJA vowels was used with each of the target
consonants. Thus, four pairs of target consonants: /t/ - /t¿/, /d/ - /d¿/, /ð/ - /ð¿/, and /s/ /s¿/were used word-initially and word-finally along with the 6 vowels of UJA: /i:/, /i/,
/a:/, /a/, /u:/, and /u/ - yielding 48 words with plain consonants and 48 words with
emphatic consonants. The consonant /b/ was used to fill up the remaining C slot. Thus,
the study material consisted of CVb and bVC stimuli, see Appendix I. This yielded 48
nonwords (17 plain and 31 emphatic) that were all phonotactically acceptable in UJA.
Subjects read the target words in a carrier sentence:/'½iš.ki target word ka.'ma:n
'mar.rah/ [Say target word again]. Subjects read 3 times from a wordlist that included
26
12 pages, each having one column. Each column consisted of 12 sentences – the first
and last 2 sentences were fillers and did not include target words. Since Arabic short
vowels are not written in full letters, diacritics were added to illustrate these vowels
where they occurred. If subjects made a mistake while reading, they were instructed to
resume reading two lines before the line in which the mistake was made. Speakers read
the wordlist three times. Recordings were carried out in an anechoic chamber using a
Digital Master DAT recorder (Fostex D-5) and a unidirectional microphone
(ElectroVoice RE-20). Recordings were digitized at a 22050 Hz sampling rate using
Praat speech analysis software (Boersma and Weenink 2005). Each subject was
recorded individually and each recording session took approximately 12 minutes.
Measurements taken from the three repetitions were averaged for each subject and the
average was used for statistical analysis.
2.1.3. Measurements
For each consonant, duration was measured and spectral moments (center of gravity,
standard deviation, skewness and kurtosis) were calculated. Spectral moments were
calculated from FFT. All other measurements were taken from wide-band spectrograms.
For consonants in word-initial position, duration was measured from the offset of the
last segment of the carrier sentence, the vowel /i/, until the offset of the burst for stops
or the end of the frication duration for fricatives. For word-final stops, duration was
measured from the offset of the preceding vowel in the target word until the offset of
the stop burst. For word-final fricatives, the duration was that of the frication. For
fricatives, spectral moments were calculated with a 30-ms window centered around the
27
temporal midpoint of the fricative. For stop consonants, spectral moments were
calculated at the onset of the burst. As a result, window size for stop consonants varied
depending on the duration of the burst, which ranged from 10 to 46 ms. The burst
duration was longer, on average, for word-final stops compared to word-initial stops. In
both positions, the average burst duration was longer for voiceless stops compared to
voiced stops. In very few cases, the burst was very hard to locate. In these cases, the
burst was estimated using auditory and visual cues and a window of 10 to 15 ms,
depending on the duration of the burst, was placed over the estimated onset to calculate
the spectral moment. For the target vowels, duration was measured and F1, F2 and F3
were measured at onset, midpoint and offset of the vowel. The vowel onset is coded as
the point in the vowel closest to the onset of the target word and the vowel offset is the
point in the vowel closest to the offset of the target word regardless of the position of
the target consonant. In other words, in words with final target consonants, the
frequency measurements at the vowel offset resemble the point closest to the consonant
that triggers emphasis. Locus equations were also calculated for each consonant by
fitting a regression line to F2 values measured at the onset and the temporal midpoint of
each vowel where the target consonant was word-initial, or measured at the offset and
temporal midpoint of the vowel where the target consonant was word-final. Figure 1
shows an example of where the measurements were taken.
28
Spectral moments for fricatives
30ms
F3 onset
fricative duration F2 onset
F1 onset
F1, F2, F3 mid
F1, F2, F3 offset
burst: spectral
moments for
/b/ duration stops
Figure 1: Illustration of the nature and locations of acoustic measurements
2.2. Analysis
To analyze the data, different ANOVA tests were conducted on all measurements.
Bonferroni post hoc tests were also used. Twelve dependent variables and eight
independent variables were used, see Table 1 below.
29
Table 1: Dependent and independent variables used in the study
No.
1
Dependent variables
Target consonant duration
No.
1
2
Duration of /b/
2
3
4
Target consonant center of
gravity
Target consonant standard
deviation
Independent variables
Consonant-type (plain, emphatic)
Vowel quality (high front, low front,
high back)
3
Vowel length (long, short)
4
Target consonant position (initial, final)
5
Target consonant skewness
5
6
Target consonant kurtosis
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
Vowel duration
Vowel F1 onset
Vowel F1 midpoint
Vowel F1 offset
Vowel F2 onset
Vowel F2 midpoint
Vowel F2 offset
Vowel F3 onset
Vowel F3 midpoint
Vowel F3 offset
Center of gravity for /b/
Standard deviation for /b/
Skewness for /b/
Kurtosis for /b/
7
8
Target consonant voicing (voiced,
voiceless)
Target consonant manner (stop,
fricative)
Word type (real word, nonword)
Gender (male, female)
2.3. Results
In this section, I present the results of the acoustic study for the monosyllables. I will
present the results on duration, formant frequencies, spectral moments and locus
equations. Since the target consonant appears either word-initially or word-finally, I
will divide the results accordingly. In all analyses, interactions between Consonant-type
30
and Gender never showed any significant differences. Except for one analysis, the same
applies for Word-type.
2.3.1. Duration
The results on target consonant durations in word-initial and word-final positions show
that emphatic consonants were consistently longer than their plain counterparts, [F (1,
766) = 4.747, p = 0.030]. This effect, however, seems to result from word-final target
consonants. Word-final emphatics are significantly longer than their plain counterparts,
[F (1, 382) = 5.681, p = 0.018], whereas word-initial plain and emphatic consonants are
not different in terms of duration, [F (1, 382) = 0.287, p = 0.592]. See Figure 2. The
presence of an asterisk (*) indicates significant differences.
140
Duration (ms)
120
*
100
* 102
96
110
Plain_all data
Emphatic_all data
101
92
94
Plain initial
Emphatic initial
Plain final
Emphatic final
80
60
Figure 2: Target consonant durations
31
When the target consonants are further divided by manner, results show that this
durational difference applies only to word-final stops. While word-final fricatives were
not significantly different, [F (1, 190) = 0.665, p = 0.416], word-final emphatic stops
were significantly longer than their plain counterparts, [F (1, 190) = 5.894, p = 0.016],
see Figure 3 below.
140
Duration (ms)
120
112
*
100
104
108
Plain stop
Emphatic stop
98
Plain fricative
Emphatic fricative
80
60
Figure 3: Target consonant duration in word-final positions by manner
No other significant main effects or interactions were found.
While the duration measurements show significant differences for the target consonants,
vowel durations were not affected by emphasis. Mean vowel duration was 132 ms for
vowels in plain environments and 134 ms for vowels in emphatic environments. No
significant main effects or interactions were found.
32
All target words either started or ended with /b/. An investigation of durational
differences of this consonant in the presence or absence of an emphatic segment yielded
no significant differences. For all tokens, /b/ duration was 84 ms and 85 ms in plain and
emphatic environments, respectively. Similar results for duration are maintained in
words with initial emphatics: 88 ms in plain environments and 89 in emphatic
environments. The same results are also obtained in words with final emphatics: /b/
duration is 80 ms in words with plain consonants and 81 ms in words with emphatic
consonants. No significant main effects or interactions were found.
2.3.2. Formant frequency
In this section, I report on the three vowel formant frequencies, F1, F2 and F3 in three
positions: vowel onset, midpoint and offset.
2.3.2.1. F1
Overall, vowels in emphatic environments have a higher F1 onset compared to vowels
in plain environments. Including measurements from all data, F1 onset is 397 Hz in
emphatic positions and 369 Hz in plain positions. This difference is significant, [F (1,
766) = 26.402, p < 0.001]. This difference holds for words where the emphatic is wordinitial, F1 onset is 403 Hz in emphatic environments and 364 Hz in plain environments,
[F (1, 382) = 25.196, p < 0.001]. The same comparison also yields significance in
33
word-final emphatics, F1 onset is 390 Hz for vowels in emphatic positions, and 372 Hz
for vowels in plain positions: [F (1, 382) = 5.194, p = 0.023], see Figure 4 below.
Frequency (Hz)
500
400
*
397
369
403
*
364
*
390
372
Plain_all data
Emphatic_all data
Plain initial
Emphatic initial
Plain final
300
Emphatic final
200
Figure 4: F1 at vowel onset
A Consonant-type by Position interaction reveals a trend. Though it does not show
significance, [F (1, 767) = 3.599, p = 0.058], there is a tendency for F1 at vowel onset
to rise by a larger extent in words with initial target consonants compared to words with
final target consonants: 364 Hz to 403 Hz for words with initial target consonants
compared to 372 Hz to 390 Hz for words with final target consonants; magnitudes of
rise are 39 Hz and 18 Hz, respectively. No other significant main effects or interactions
were found.
F1 at midpoint also shows significant differences. Across all data, vowels in the same
syllable with the emphatic target consonants have a significantly higher F1 at midpoint
34
than vowels in the same syllable with plain target consonants: 508 Hz and 480 Hz,
respectively, [F (1, 766) = 5.448, p = 0.020]. However, when the data are divided by
position, this difference is no longer significant. F1 midpoint for vowels in words with
initial emphatics is 509 Hz, and 483 Hz for their plain counterparts, [F (1, 382) = 2.315,
p = 0.129]. In words with emphatics in final position, F1 midpoint is 507 Hz in
emphatic environments, and 477 Hz in plain environments, [F (1, 382) = 3.160, p =
0.076]. See Figure 5 below.
600
Frequency (Hz)
500
508
*
480
509
483
507
477
Plain_all data
Emphatic_all data
Plain initial
400
Emphatic initial
Plain final
Emphatic final
300
200
Figure 5: F1 at vowel midpoint
No other significant main effects or interactions were found.
At offset positions, F1 also shows significant differences. F1 offset is higher in
emphatic positions than in their plain counterparts, 409 Hz and 381 Hz, respectively: [F
35
(1, 766) = 19.768, p < 0.001]. This difference is lost in words with initial target
consonants but is maintained in word-final target consonants. F1 offset is 409 Hz in
emphatic environments and 395 Hz in plain environments in words with initial
emphatics. This difference is not significant, [F (1, 382) = 2.071, p = 0.151]. In words
where the target consonant is word-final, F1 offset is 409 Hz in emphatic environments
and 366 Hz in their plain counterparts, [F (1, 382) = 26.181, p < 0.001]. See Figure 6.
Frequency (Hz)
500
400
409
*
381
*
395
409
409
*
366
Plain_all data
Emphatic_all data
Plain initial
Emphatic initial
Plain final
300
Emphatic final
200
Figure 6: F1 at vowel offset
A Consonant-type by Position interaction shows that while words with final target
consonants have a significantly higher F1 offset in emphatic environments (409 Hz)
compared to plain environments (366 Hz), words with initial target consonants lack this
distinction for F1 offset in emphatic positions (409 Hz) and in their plain counterparts
36
(395 Hz) [F (1, 767) = 5.365, p < 0.021]. See Figure 7 below. No other significant main
effects or interactions were found.
Frequency (Hz)
500
400
Word-initial
Word-final
300
200
Plain
Emphatic
Figure 7: Consonant-type by Position interaction for F1 Offset
Since the F1 measurements were taken at three positions, it is worthwhile to look at the
effects of emphasis across the three points of the vowel: onset, midpoint and offset. In
words with initial target consonants, F1 is significantly higher in emphatic
environments. This difference is significant only at the onset of the vowel – the point
closest to the consonant that triggers emphasis. The effect diminishes as the
measurement is taken further away from the target consonant, see Figure 8. Arrows
stand for the direction of significant differences; absence of the arrows indicates
absence of significant differences.
37
Similarly, in words with final target consonants, vowel F1 at the point closest to the
target consonant (F1 offset in this case) is significantly higher in the emphatic
environment. The effect diminishes at the vowel midpoint, though it shows a trend, and
is significantly higher at the point further away from the target consonant, (F1 onset in
this case). See Figure 9.
Frequency (Hz)
600
509
483
500
Plain
Emphatic
400
409
395
403
364
300
Onset
Mid
Offset
Figure 8: Vowel F1 in words with initial target consonants
38
Frequency (Hz)
600
507
500
477
Plain
Emphatic
400
409
390
372
366
300
Onset
Mid
Offset
Figure 9: Vowel F1 in words with final target consonants
2.3.2.2. F2
Measurements for F2 across all data show that F2 is consistently lower in emphatic
than in plain environments. F2 measured at the onset of the vowel in the same syllable
with the target consonant shows significant lowering in the emphatic environment.
Average F2 onset is 1332 Hz in emphatic environments and 1704 Hz in plain
environments. This difference is significant, [F (1, 766) = 212.738, p < 0.001]. This
difference is significant in both word-initial and word-final emphatics. In words with
initial target consonants, F2 onset for vowels is 1332 Hz in emphatic environments and
1838 Hz in plain environments. This effect is significant, [F (1, 382) = 342.174, p <
0.001]. In words with final target consonants, F2 onset is 1341 Hz and 1569 Hz in
emphatic and plain environments, respectively. This is also significant, [F (1, 382) =
31.746, p < 0.001]. See Figure 10.
39
2000
1838
Frequency (Hz)
1800
1704
Plain_all data
Emphatic_all data
* 1569
1600
*
*
1400
1332
1322
1341
Plain initial
Emphatic initial
Plain final
Emphatic final
1200
1000
Figure 10: F2 at vowel onset
An interaction of Consonant-type by Position reveals significant differences [F (1, 767)
= 34.161, p < 0.001]. The drop in F2 onset is significantly greater in words with initial
emphatics (516 Hz) compared to words with final emphatics (228 Hz). Figure 11 shows
this result.
40
2000
1800
1600
Word initial
1400
Word final
1200
1000
Plain
Emphatic
Figure 11: Consonant-type by Position for vowel F2 onset
In addition, a Consonant-type by Vowel Quality interaction reveals significant
differences, [F (1, 767) = 15.544, p < 0.001]. While the effect of emphasis is similar for
the front vowels /i:/ and /i/ and /a:/ and /a/, the F2 decrease for the back vowels /u:/ and
/u/ is much smaller. Table 2 below shows these results. The same results are obtained
for words with initial target consonants, [F (1, 383) = 13.043, p < 0.001], and for words
with final target consonants, [F (1, 383) = 13.097, p < 0.001]. See Table 3 below. No
other significant main effects or interactions were found.
Table 2: F2 onset by vowel quality
High front
Low front
High back
Plain (Hz)
2069
1666
1376
Emphatic(Hz)
1627
1221
1146
41
Plain-emphatic
442
445
230
% difference
21
27
17
Table 3: F2 onset by vowel quality in word-initial and word-final target consonants
High front
Low front
High back
Plain (Hz)
Initial
Final
2131
2007
1769
1564
1616
1137
Emphatic (Hz)
Initial
Final
1486
1769
1240
1202
1241
1052
% difference
Initial
Final
30
12
30
23
23
7
Vowel F2 at midpoint shows a significant drop in emphatic environments. Vowels in
emphatic environments have an F2 midpoint value of 1428 Hz compared to 1678 Hz
for their plain counterparts, [F (1, 766) = 19.768, p < 0.001]. In words with initial target
consonants, F2 midpoint is still lower in emphatic environments, 1427 Hz compared to
their plain counterparts, 1680 Hz, [F (1, 382) = 26.784, p < 0.001]. The same effect is
also detected for words with final target consonants. F2 midpoint in words with final
target consonants is 1428 Hz in emphatic positions and 1676 Hz in plain positions, [F
(1, 382) = 21.837, p < 0.001]. See Figure 12 below.
42
2000
1800
Frequency (Hz)
1678
1600
1680
*
1676
Plain_all data
*
*
Emphatic_all data
Plain initial
1428
1427
1428
1400
Emphatic initial
Plain final
Emphatic final
1200
1000
Figure 12: F2 at vowel midpoint
No other significant main effects or interactions were found.
A significant Consonant-type by Vowel Length interaction reveals differences between
vowels in the two different conditions. Independent of emphasis, long vowels (164 ms)
are longer than short vowels (102 ms). This difference is significant: [F (1, 766) =
784.735, p < 0.001]. In short vowels, F2 midpoint is 1648 Hz in plain environments and
1321 Hz in emphatic environments, a drop of 327 Hz. In long vowels, F2 midpoint is
1708 for vowels in plain environment and 1534 Hz in emphatic environments, a drop of
174 Hz. This difference in lowering is significant: [F (1, 767) = 4.621, p = 0.032]. See
Figure 13 below.
43
2000
Short vowels
Long vowels
Frequency (Hz)
1800
1708
1648
1600
1534
1400
1321
1200
1000
Plain
Emphatic
Figure 13: Consonant-type by Vowel length for vowel F2 midpoint
However, this difference in F2 midpoint is not attested for words with initial target
consonants, [F (1, 383) = 2.468, p = 0.117], nor for words with final target consonants,
[F (1, 383) = 2.156, p = 0.143].
F2 midpoint is affected by vowel quality. Overall, low front vowels show the highest
degree of lowering for F2 midpoint between plain and emphatic environments,
followed by high front vowels. The least degree of lowering is detected for high back
vowels. These differences are significant, [F (1, 767) = 14.770, p < 0.001].Table 4
below shows these values.
44
Table 4: F2 midpoint by vowel quality
High front
Low front
High back
Plain (Hz)
2168
1669
1196
Emphatic (Hz)
1959
1254
1070
Plain - emphatic
209
415
126
% difference
10
25
11
The same effect is attested for words with initial target consonants, [F (1, 383) = 7.220,
p < 0.001], and also for words with final target consonants, [F (1, 383) = 9.159, p <
0.001]. Table 5 below shows these results.
Table 5: F2 midpoint by vowel quality in word-initial and word-final target consonants
High front
Low front
High back
Plain (Hz)
Initial
Final
2131
2206
1668
1670
1240
1151
Emphatic (Hz)
Initial
Final
1969
1949
1256
1253
1056
1083
% difference
Initial
Final
8
12
25
25
15
6
F2 offset is also significantly lower in emphatic environments. Across all data, F2
offset drops from 1615 Hz in plain environments to 1302 in emphatic ones, [F (1, 766)
= 164.548, p < 0.001]. F2 offset values demonstrate the same effects in words with
initial target consonants and words with final target consonants. In word-initial
positions, F2 offset is 1486 Hz in plain environments and 1321 Hz in their emphatic
counterparts, F (1, 382) = 19.613, p < 0.001]. In words with final target consonants, F2
offset is 1744 Hz in plain environments and 1282 Hz in their emphatic counterparts, F
(1, 382) = 258.281, p < 0.001]. Figure 14 shows these results.
45
2000
1744
Frequency (Hz)
1800
Plain_all data
1615
Emphatic_all data
1600
*
1400
1302
1486 *
1321
Plain initial
*
1282
Emphatic initial
Plain final
Emphatic final
1200
1000
Figure 14: F2 at vowel offset
An interaction between Consonant-type and Position of the target consonant reveals
significant results. F2 offset in words with final target consonants drops by 462 Hz,
(1744 in plain environments and 1282 in emphatic ones). In words with initial target
consonants, F2 offset drops only by 165 Hz, (1486 Hz in plain environments compared
to 1321 Hz in their emphatic counterparts). This difference proves significant: [F (1,
767) = 39.683, p < 0.001]. Another interaction between Consonant-type and Vowel
Quality reveals significant differences for F2 offset. High front vowels have the largest
F2 offset drop, followed by low front vowels and finally by high back vowels. These
differences are significant, [F (2, 767) = 9.223, p < 0.001]. Table 6 shows these results.
46
Table 6: F2 offset by vowel quality
High front
Low front
High back
Plain (Hz)
1948
1551
1346
Emphatic (Hz)
1569
1191
1145
Plain - emphatic
379
360
201
% difference
19
23
15
The same effects hold for words with initial emphatics where low front vowels show
the largest drop, followed by high front vowels and then high back vowels. These
differences are significant, [F (2, 383) = 8.981, p < 0.001]. In words with final
emphatics, high front vowels drop by the largest amount, followed by low front vowels
and finally by high back vowels, [F (2, 383) = 7.587, p < 0.001]. Table 7 below shows
these findings.
Table 7: F2 offset by vowel quality in word-initial and word-final target consonants
High front
Low front
High back
Plain (Hz)
Initial
Final
1875
2021
1448
1654
1135
1558
Emphatic (Hz)
Initial
Final
1702
1437
1168
1214
1092
1196
% difference
Initial
Final
9
39
19
27
4
23
No other significant main effects or interactions were found.
Plotting the results for F2 with regard to the point at which the measurements were
taken reveals interesting observations. The closer the measurement is taken to the
emphatic consonant, the larger the F2 drop in emphatic environments. In Figure 15, this
F2 drop is greatest at vowel onset, the point closest to the target consonant, and
47
decreases as the measurement point moves away from the target consonant. The mirror
image is observed in Figure 16 since the target consonant is word-final.
2000
Frequency (Hz)
1800
1838
1680
Plain
1600
Emphatic
1486
1427
1400
1332
1321
1200
Onset
Mid
Offset
Figure 15: Vowel F2 in words with initial target consonants
2000
Frequency (Hz)
1800
1744
1676
1600
Plain
1569
Emphatic
1428
1400
1341
1282
1200
Onset
Mid
Offset
Figure 16: Vowel F2 in words with final target consonants
48
2.3.2.3. F3
Overall, F3 in emphatic environments rises in all three positions: onset, midpoint and
offset, compared to plain ones. Across all data, F3 onset is 2811 Hz in plain
environments compared to 2880 Hz in emphatic environments. This difference is
significant, [F (1, 766) = 11.469, p = 0.001]. The same difference is attested for words
with initial target consonants: F3 onset is 2895 Hz in plain environments and 3005 Hz
in emphatic environments, [F (1, 382) = 15.892 p < 0.001]. However, F3 onset does not
show significant differences in words with final target positions. It is 2727 Hz in plain
environments compared to 2756 Hz in emphatic environments, [F (1, 382) = 1.233, p =
0.268]. Figure 17 shows these values.
3200
3000
* 3005
*
2880
Frequency (Hz)
2811
2800
2895
2727 2756
Plain_all data
Emphatic_all data
Plain initial
2600
Emphatic initial
Plain final
2400
Emphatic final
2200
2000
Figure 17: F3 at vowel onset
49
An interaction between Consonant-type and Word-type reveals significant differences
for F3 onset. In real words, F3 onset is higher in emphatic positions (2928 Hz) than in
plain positions (2810 Hz) by 118 Hz. In nonwords, this difference in F3 onset is only
28 Hz: 2840 Hz for F3 onset in emphatic nonwords compared to 2812 Hz in their plain
counterparts. This difference is significant. See Figure 18.
3200
Frequency (Hz)
3000
Word
2800
Nonword
2600
2400
Plain
Emphatic
Figure 18: Consonant-type by Word-type for vowel F3 onset
No other significant main effects or interactions were found.
F3 at vowel midpoint is higher in emphatic environments. When all data are included,
vowels in emphatic positions have an F3 midpoint of 2937 Hz compared to 2870 Hz in
plain positions; [F (1, 766) = 8.610, p = 0.003]. In words with initial target consonants,
F3 midpoint is still higher in emphatic environments, 2945 Hz, compared to 2869 Hz in
their plain counterparts, [F (1, 382) = 5.373, p = 0.021]. In words with final target
50
consonants, F3 midpoint is 2929 Hz in emphatic environments and 2870 Hz in their
plain counterparts. Though F3 midpoint is still higher in emphatic positions, this effect
is not significant, [F (1, 382) = 3.331, p = 0.069]. Figure 19 shows these results.
3200
Frequency (Hz)
3000
* 2937
2870
* 2945
2929
2869
2870
Plain_all data
2800
Emphatic_all data
Plain initial
2600
Emphatic initial
Plain final
2400
Emphatic final
2200
2000
Figure 19: F3 at vowel midpoint
A Consonant-type by Vowel Quality interaction for F3 midpoint proves significant: low
front vowels have a higher F3 midpoint in emphatic positions than in plain ones while
high front vowels have a lower F3 midpoint in emphatic positions compared to plain
ones, [F (2, 767) = 5.367, p = 0.005]. However, F3 midpoint in high back vowels does
not show significant differences. Table 8 below shows F3 midpoint values. The minus
symbol represents the direction of difference – namely that the value is larger in
emphatic positions.
51
Table 8: F3 midpoint by vowel quality
High front
Low front
High back
Plain (Hz)
2971
2821
2816
Emphatic (Hz)
2958
2871
2981
Plain - emphatic
13
-50
-165
% difference
0.5
-1
-1
When data are split by position of the target consonant, this interaction is not observed
for words with initial target consonants. There is no significant interaction between
consonant-type and vowel quality for F3 midpoint in words with initial target
consonants, [F (1, 383) = 1.249, p = 0.288]. The magnitude of drop for F3 midpoint is
significantly different for high front vowels compared to low front vowels, where F3
rises in emphatic positions, whereas high back vowels do not show significant
differences. Table 9 shows these values.
Table 9: F3 midpoint by vowel quality in word-initial and word-final target consonants
High front
Low front
High back
Plain (Hz)
Initial
Final
2953
2990
2825
2818
2830
2801
Emphatic (Hz)
Initial
Final
2968
2948
2897
2845
2970
2992
% difference
Initial
Final
-1
0.2
-1
-1
-1
-1
No other significant main effects or interactions were found.
Across all data, F3 offset is higher in emphatic positions. F3 offset is 2917 Hz for
vowels in emphatic positions, and 2838 Hz for vowels in plain positions, [F (1, 766) =
12.467, p < 0.001]. For words with initial target consonants, F3 offset is 2812 Hz in
emphatic positions, and 2752 Hz in plain positions. This effect shows a trend but is not
52
significant, [F (1, 3.652) = 8.610, p = 0.057]. For words with word-final target
consonants, F3 offset is 3021 Hz in emphatic positions compared to 2923 Hz in their
plain counterparts. This difference is significant, [F (1, 382) = 11.535, p = 0.001].
Figure 20 plots these results.
3200
3000
*
* 3021
2923
2917
Frequency (Hz)
2838
2800
2752
2812
Plain_all data
Emphatic_all data
Plain initial
2600
Emphatic initial
Plain final
2400
Emphatic final
2200
2000
Figure 20: F3 at vowel offset
No other significant main effects or interactions were found.
When looking at the change in F3 measured at different positions of the vowel, results
are compatible with what is reported for F1 and F2 above. The closer the measurement
is taken to the consonant that triggers emphasis, the greater the rise. As the
53
measurement is taken further away, this rise either declines on disappears. See Figure
21 and Figure 22.
Frequency (Hz)
3200
3000
3005
2895
2945
Plain
2869
Emphatic
2812
2800
2752
2600
Onset
Mid
Offset
Figure 21: Vowel F3 in words with initial target consonants
Frequency (Hz)
3200
3021
3000
2929
2923
2870
Emphatic
2800
2756
2727
2600
Onset
Plain
Mid
Offset
Figure 22: Vowel F3 in words with final target consonants
54
2.3.3. Spectral moments
2.3.3.1. Target consonants
Overall, none of the spectral moments for plain and emphatic consonants show
significant differences. Center of gravity was 6840 for plain consonants and 6740 for
emphatic consonants, [F (1, 766) = 0.999, p = 0.318]. The same results are obtained for
word-initial emphatics – center of gravity is not different between plain and emphatic
consonants. A Consonant by Manner interaction reveals significant differences. In
word-final target consonants, plain stops have a higher center of gravity (6793 Hz)
compared to their emphatic counterparts (6485 Hz), whereas plain fricatives have a
lower center of gravity (7906 Hz) compared to their emphatic counterparts (7945 Hz).
This difference is significant, [F (1, 383) = 4.082, p = 0.044]. See Figure 23 below. No
other significant main effects or interactions were found.
8000
7945
7906
Frequency (Hz)
7600
7200
6800
Stop
Fricative
6793
6485
6400
6000
Plain
Emphatic
Figure 23: Consonant-type by Manner of articulation for Center of gravity in words with final
target consonants
55
For standard deviation, no significant main effects or interactions were found. Mean
variance was 2330 Hz for plain consonants and 2359 Hz for emphatic consonants, [F (1,
766) = 0.255, p = 0.613].
No significant main effects or interactions were found for skewness. Plain consonants
had a skewness value of 3.464 and -0.4711 for their emphatic counterparts, [F (1, 766)
= 1.000, p = 0.318].
Finally, for kurtosis, no significant main effects or interactions were found. Plain target
consonants have a kurtosis value of 0.3696 and 0.5755 for their emphatic counterparts,
[F (1, 766) = 2.214, p = 0.137].
2.3.3.2. /b/ consonant
Overall, spectral moments for /b/ yielded no significant main effects or interactions.
Table 10 shows these results.
Table 10: Spectral moments for /b/
Measurement
Center of gravity (Hz)
Standard deviation (Hz)
Skewness
Kurtosis
Plain
5418
3258
0.0067
-0.9954
Emphatic
5405
3277
0.0035
-0.9946
Similar results are obtained for words with initial target consonants and words with
final target consonants as well, see Table 11.
56
Table 11: Spectral moments for /b/ in word-initial and word-final target consonants
Measurement
Center of gravity (Hz)
Standard deviation (Hz)
Skewness
Kurtosis
Plain
Initial
5922
3029
-0.1431
-0.9582
Final
4941
3488
0.1566
-1.0326
Emphatic
Initial
Final
5952
4859
3035
3520
-0.1764
0.1834
-0.9580
-1.0289
2.3.4. Locus equations
Locus equations were calculated for each target consonant. A linear regression was
conducted with F2 onset as the dependent variable and F2 vowel as the independent
variable to obtain the slope and the y-intercept values. A one-way ANOVA shows that
emphatic target consonants have a lower mean slope (0.464) than their plain
counterparts (0.640). This difference is significant, [F (1, 14) = 5.586, p = 0.033]. This
effect is attested for words with initial target consonants: slope value is 0.310 for
emphatic consonants, and 0.576 for their plain counterparts, [F (1, 6) = 11.223, p =
0.015]. The same effect is also evident for words with final target consonant: slope
value is 0.618 for emphatic consonants and 0.708 for their plain counterparts, [F (1, 6)
= 7.153, p = 0.037]. See Figure 24. Figure 25 shows the scatter plots for all the data.
57
2
Plain_all data
Slope
Emphatic_all data
Plain initial
1
*
0.64
*
0.576
*
0.708
0.618
0.464
Emphatic initial
Plain final
Emphatic final
0.31
0
Figure 24: Slope values
58
1:plain; 2:emphatic
plain
emphatic
3000.00
F2 Onset (Hz)
2500.00
2000.00
1500.00
1000.00
R Sq Linear = 0.652
500.00
R Sq Linear = 0.585
0.00
0.00
1000.00
2000.00
3000.00
F2 Midpoint (Hz)
Figure 25: scatter plot, all data
Analysis of y-intercept values did not reveal significant differences. For all data, mean
y-intercept was 630 Hz for plain and 668 Hz for emphatic consonants, [F (1, 14) =
0.075,
p = 0.788]. The same results were obtained for words with initial target
consonants. y-intercept value was 872 Hz for plain consonants, and 878 Hz for their
emphatic counterparts, [F (1, 6) = 0.002 , p = 0.962]. Similarly, words with final target
consonants show no difference in y-intercept values. Plain consonants have a y-
59
intercept of 388 Hz and their emphatic counterparts have a y-intercept of 457 Hz, [F (1,
6) = 3.252, p = 0.121]. Figure 26 below shows these results.
1000
872 878
Frequency (Hz)
800
Plain_all data
630
668
Emphatic_all data
Plain initial
600
Emphatic initial
457
400
388
200
Figure 26: y-intercept value
60
Plain final
Emphatic final
2.4. Results Summary
Tables 12 and Table 13 summarize the findings of the acoustic study. This arrow (y)
represents an increase in emphatic positions, the other one (z) represents a decrease in
the emphatic position, and an arrow with a (t) represents a trend in the direction of the
arrow. The number of arrows in one box indicates the magnitude of the raise or drop.
Variables that did not show significant differences are excluded from these two tables.
Table 12: Main findings
C duration
F1 onset
F1 midpoint
F1 offset
F2 onset
F2 midpoint
F2 offset
F3 onset
F3 midpoint
F3 offset
All
y
y
y
y
z
z
z
y
y
y
Initial
y
z
z
z
y
y
ty
61
Final
y
y
ty
y
z
z
z
y
Table 13: Summary of interactions: from left to right: I: Initial; F: Final; S: Stop; F: Fricative;
VD: Voiced; VS: Voiceless; S: Short; L: Long; W: Word; N: Nonword; M: Male; F: Female.
Position
I
C duration
Manner
F
S
y
y
F
Voicing
VD VS
V length
S
L
V quality
/i/
/i:/
/a/
/a:/
/u/
/u:/
W N
M
F
ty
F1 midpoint
F2 onset
Gender
y
/b/ duration
F1 onset
Wordtype
ty
z
z
F2 midpoint
F2 offset
z
zz
zz
z
zz
zz
z
z
zz
z
zz
z
y
F3 onset
z
F3 midpoint
C gravity
y2
y
z3
2.5. Discussion
The present results show that emphatic stops in word-final position are consistently
longer than their plain counterparts. Preliminary results (Al-Masri and Jongman, 2004)
indicated an effect of position of the target consonant – namely, that word-initial target
emphatics tend to have longer durations than their plain counterparts. However, this
tendency might have been influenced by the fact that it was obtained from data with
2
3
Only in word-final position
Only in word-final position
62
different word lengths. The results obtained here suggest that stops are more sensitive
to emphasis than fricatives in terms of duration. This might be due to the fact that, in
general, fricative durations are longer than stop durations, which means that stops are
more likely capable of showing slight durational differences compared to fricatives.
Besides, this supports Zawaydeh’s (1999) finding about the directionality of emphasis
in UJA, namely that it is more prominent leftward than rightward. The lack of such
findings for the other segments, the vowel and the /b/ consonant, might be due to the
fact that durational differences in emphasis are not likely to spread in either direction.
While durational differences are observed on the target consonants under certain
conditions, they are not expected to spread to adjacent segments. Ultimately, back
articulation, which is characteristic of emphasis, is not closely related to duration.
As for formant frequencies, different results are reported. The overall means do not
seem to be incongruent with previous findings, (Obrecht, 1968; Alosh 1987; Zawaydeh
1999). However, the present results are more detailed since all formant frequencies
were measured at three positions of the vowel: onset, midpoint and offset. F1 onset is
consistently higher in emphatic than in plain positions. There is a trend for F1 to rise by
a larger amount in words with initial target consonants compared to words with final
consonants. F1 at the steady state of the vowel is still higher in emphatic positions
compared to plain ones for all data. This difference is lost for word-initial emphatics
and is only obtained as a trend for word-final emphatics. The lack of significance for F1
rise in the steady state of the vowel for word-initial target consonants, and the presence
63
of a slight trend for words with final target consonants support the previous findings in
this study for the prominence of leftward spread of emphasis within monosyllables. For
F1 at offset positions, a mirror image of F1 onset is presented. Overall, F1 offset is
higher in emphatic positions. Also, the magnitude of rise for F1 offset is clearer for
word-final target consonants. From the findings for F1 onset and F1 offset, we can
conclude that emphasis is gradient within the monosyllable: the closer the vowel and
the segment that triggers emphasis, the more pronounced the rise in F1.
Most of the previous studies report on F2. F2 is lower in emphatic positions than in
plain ones at the three measurements points. F2 onset is significantly lower for all data
and when the target consonant is word-initial and word-final as well. The difference
between F2 onset values in emphatic positions is significantly more pronounced in
word-initial than in word-final positions. Another detailed finding is that the drop is not
similar for different vowels. It is evident that the drop is larger for high front and low
front vowels compared to high back vowels. Seemingly, there is more room for this
drop in front vowels than in back vowels. Additional interesting results are observed for
F2 midpoint. Consistently, F2 midpoint is lower in emphatic environments.
Interestingly, no interaction with position of the target consonant is observed here now
that the measurements are taken equidistant from the consonant that triggers emphasis.
Interactions with vowel quality are also more salient here. Low front vowels show the
largest F2 midpoint drop, followed by high front vowels and finally high back vowels.
As for vowel duration, in line with previous findings, (Card, 1983; Zawaydeh, 1999),
64
short vowels show a larger F2 midpoint drop in emphatic positions than their long
counterparts. This also points to the gradient nature of emphasis spread.
F2 offset shows a significant drop in emphatic than in plain positions. This drop is
reported for all data and for words with initial and final target consonants as well. As
expected, a mirror effect is obtained here: F2 offset drop is larger in words with final
target consonants compared to words with initial consonants due to the fact that the
temporal interval between the emphatic segment and the vowel offset is shorter in
offset positions than in initial positions. High front vowels here show a larger F2 offset
drop than low front vowels, which, in turn, show a larger drop compared to high back
vowels. Evidently, emphatic consonants exert more influence on front vowels
compared to back vowels. This may be because back vowels already have lower F2 and
higher F1 values compared to front vowels. Therefore, there is less room for each
formant to change under emphasis compared to the room available for front vowels.
F3 is higher in emphatic than in plain positions. F3 onset is higher in emphatic
positions when all data are included in the measurements. This effect carries on to
words with initial target consonants but not to words with final target consonants. There
is an unexpected interaction with word-type where F3 onset rises by a larger degree in
words compared to nonwords. F3 midpoint is also higher in emphatic positions than in
plain ones for all data and also for words with initial target consonants. However, a
puzzling interaction with vowel quality is observed for F3 midpoint. While emphasis
65
causes F3 midpoint to rise, it does so for low front vowels and high back vowels. While
the difference is significant in low front vowels, it is not significant in high back vowels.
Surprisingly, emphasis causes F3 midpoint to decrease in high front vowels. F3 does
not rise at the midpoint position for words with final target consonants. In offset
positions, F3 rises in emphatic positions for all data, for words with final target
consonants, and shows a trend for words with initial target consonants. This effect is in
harmony with the findings on the directionality of emphasis spread within the
monosyllables.
While durations and formants show a host of significant distinctions in the acoustic
signal between plain and emphatic segments and environments, spectral moments fail
to show similar differences. The only significant finding that can be reported is the
interaction of manner with the center of gravity for the target consonants: while
emphatic stops have a higher center of gravity than their plain counterparts, fricatives
have a lower center of gravity compared to their plain counterparts. All other spectral
measurements on the target consonant and on the /b/ consonant as well fail to yield
significant differences.
As for the findings on locus equations, emphatic consonants have flatter slopes
compared to their plain counterparts. This finding it true for stops and fricatives and
also regardless of position of the target consonant. This suggests that emphatics are
more stable in terms of their resistance to effects of adjacent vowels. This is in line with
66
what Yeou (1997) found for the same set of consonants in Modern Standard Arabic
spoken by native speakers of the Moroccan dialect. He found that the same set of
emphatics emerged as a distinct class with the flattest slope values, which indicates
minimal coarticulation between the consonants and the vowels. Using data from
Cairene Arabic produced by three male speakers, Sussman, Hoemeke and Ahmed,
(1993) found that /d/ and /d¿/ had similar slopes but significantly differed in their yintercept values. They show that y-intercept was 1307 Hz for /d/ and 933 Hz for /d¿/.
The present study found no significant differences in slope values or in y-intercept
values for /d/ and /d¿/. While slope turns out to be a factor in defining emphatics as a
separate class of sounds, y-intercept fails to yield significant differences between plain
and emphatic consonants. Emphatics consistently had a higher y-intercept compared to
their plain counterparts, but this difference was not significant.
67
Chapter Three
Acoustic Study – the bisyllables
This chapter presents the acoustic findings on the bisyllables. In Chapter 2, analyses on
the monosyllables provided a first idea of emphasis spread by comparing words with
the target consonant in initial and final position. To get a better understanding of the
nature and domain/extent of spread, investigation of longer words will give us an idea
about emphasis spread to adjacent syllables. This chapter provides a first look at this
issue. Since spectral measures for the consonants in monosyllables did not really reveal
differences between plain and emphatic tokens, analysis of bisyllabic words will focus
on spectral measures of the vowels only. More precisely, this chapter aims to answer
the following questions:
1. Are the results obtained for monosyllables identical to those obtained for the syllable
which has the target consonant (the target syllable)?
2. Does emphasis spread beyond the target syllable?
3. Is emphasis spread attested for duration (for consonants and vowels) and/or vowel
formant frequency (F1, F2, F3)?
4. Does spreading to the right and left of the target syllable pattern similarly?
5. What is the effect of vowel quality on emphasis spread?
6. Is emphasis spread influenced by gender of the speaker?
68
Despite several limitations, see Section 1.7. above, previous research addresses some
of these questions. For example, Card (1983) found that emphasis, as an effect of F2
lowering, spreads beyond the target syllable unless in the presence of a ‘blocking
vowel,’ which she defines as one of the three long vowels /i, a, u/. Davis (1995) found
that in Palestinian Arabic, leftward emphasis was not blocked whereas rightward
emphasis was blocked by [+ high, - back] vowels. Based on these and similar previous
findings, see Section 1.4. above – and in accordance with the present findings on the
monosyllables, see Chapter Two, I can make the following hypotheses:
1. Emphatic consonants will be slightly longer than their plain counterparts only in
target syllables. Durational differences will disappear in adjacent syllables.
2. Vowel duration will not be affected by emphasis, neither in target syllables nor in
adjacent ones.
3.Vowel F1 and F3 values will rise, especially when measurements are taken closest to
the vowel in the same syllable with the emphatic target consonant. Vowel F2 will
decrease in the target vowel and in adjacent vowels as well
4. No gender interactions will be obtained.
5. No word/nonword differences will be obtained.
3. 1. Method
3.1.1. Subjects
Eight adult speakers (four males and four females) of UJA aged 16 to 26 years old
(average age was 20.6 years) participated in the study. Five of them (four males and
69
one female) were recruited from the Yarmouk University, Irbid, Jordan. The three other
females were recruited from the local community of Irbid: two of them were high
school students and one was a university graduate. All of the subjects spoke UJA. None
of them ever left Jordan. Four of the university students were majoring in religious
studies, one was majoring in psychology and one had a degree in English language and
literature. None of the subjects reported any known speech or hearing impairments.
They were paid $10 for their participation
3.1.2. Materials and Procedure
A long list of target words was prepared as part of a larger project. The wordlist in
question consisted of 80 target CV.CVC pairs (plain-emphatic). It included the four
emphatic consonants and their plain counterparts along with the six UJA vowels. It was
designed as follows: 24 pairs with the target consonant in word-initial position, and the
adjacent syllable was /-bat/ - yielding pairs such as /sabat/-/s¿abat/, /sibat/-/s¿ibat/,
/subat/-/s¿ubat/ etc.; another 24 pairs with the target consonant in a word-final position
and the adjacent syllable was /ta-/ - yielding pairs such as /tabas/-/tabas¿/, /tabis/-/tabis¿/,
/tabus/-/tabus¿/, etc.; and 32 pairs with the target consonant in word-medial position. In
this subset, all the target consonants were used, the filler consonant was /k/ and only
two UJA vowel pairs were used: long and short /a/ and /i/ - yielding pairs such as
/kasak/-/kas¿ak/, /kasik/-/kas¿ik/, /kasaak/-/kas¿aak/, etc. Thus, the study material
consisted of CV.bat, ta.bVC, and kV.CVk stimuli, see Appendix II. This design yielded
many nonwords that were all phonotactically acceptable in UJA. Subjects read the
70
target words in a carrier sentence: /'½iš.ki target word ka.'maan 'mar.rah/ [Say target
word again]. The wordlist included 12 pages, each having one column, and each
column consisted of 12 sentences – the first and last two sentences were fillers and did
not include target words. The best two of these three repetitions were used for analysis.
Choice was made based on the correct completion of the wordlist with the minimum
amount of mistakes. Since Arabic short vowels are not written in full letters, diacritics
were added to illustrate these vowels where they occurred. Speakers read the wordlist
three times. Recordings were carried out in a sound-proofed booth using a portable
solid-state recorder (Marantz PMD 671) and a unidirectional microphone (ElectroVoice
N/D767a). Each subject was recorded individually and each recording session took
approximately 70 minutes. Recordings were digitized at a 22050 Hz sampling rate.
Using Praat speech analysis software (Boersma and Weenink 2005), a script was
written to carry out automatic segmentation of the stimuli in order to isolate the target
words. Due to the relatively long duration of the recording sessions, some subjects
occasionally made mistakes in reading parts of the wordlist. In these instances, they
were instructed to repeat two lines before and after the line in which they made these
occasional mistakes. During the segmentation process through which the target sound
files were obtained, these lines which included possible mistakes were disposed of.
3.1.3. Measurements
The same method in Section 2.1.3 above was adopted here. However, slight changes
were applied due to the huge amount of data used in this section. The measurement
71
procedure this time was automatic. The cursors were placed by hand, using a waveform
and a spectrogram, at a number of locations in the speech signal. For stop consonants,
cursors were placed at the offset of the previous segment (the beginning of closure), at
the release of the burst, and at the onset of the following vowel. For fricatives, cursors
were placed at the onset and offset of the frication noise. For vowels, cursors were
placed at the start of F1 and the offset of F2. When the burst was very hard to locate, it
was estimated using auditory and visual cues. For the frequency values of the target
vowels, F1, F2 and F3 were measured at onset, midpoint and offset of the vowel. While
the cursors for the onset and offset points where easy to locate, the cursor for the vowel
midpoint was automatically derived. A script was written which took the measurements
for each segment and wrote them to an Excel file. Figure 1 below shows the placement
of the cursor at different points for an example target word: /sabat/
72
Figure 1: Sample measurements conducted for one stimulus word: /sabat/: (a) represents the
initial consonant; (b) the first vowel; (c) the closure of the second consonant; (d) the second
consonant burst; (e) the second vowel; (f) the closure of the third consonant; (g) the third
consonant burst; (h) the third consonant post-burst release.
For consonants in medial positions, the cursors where placed at the offset of the
previous vowel, where the three first formants ceased to line up, and at the onset of the
following vowel. For word-final consonants, the cursors were placed at the offset of the
previous vowel and at the beginning of the burst before the onset of the next stop
consonant of the carrier sentence. During this stage, different cues were utilized using
both the waveform and the spectrogram. Vowel formant frequency measurements were
based on LPC analysis using a 20-ms window.
73
3.2. Analysis
Different ANOVA tests were conducted on all measurements to analyze the data and
obtain the main findings. Multivariate analyses for different dependent variables were
also conducted to test interactions between the degree of emphasis and other
independent variables. Bonferroni post hoc tests were used for independent variables
with more than two levels, namely for vowel quality. Due to the fact that this chapter
deals with bisyllables, several dependent and independent variables were used, see
Table 1 below. The measurements mentioned in this table were made for each segment
in the word. Dependent variables are mentioned here in groups, that is; DV 1 refers to
durations of each consonant, and so on.
Table 1: Dependent and independent variables used in the study
No.
1
Dependent variables
Consonant duration
No.
1
2
Vowel duration
2
3
Word duration
Vowel F1 (onset, midpoint,
offset)
Vowel F2 (onset, midpoint,
offset)
Vowel F3 (onset, midpoint,
offset)
3
Independent variables
Consonant-type (plain, emphatic)
Vowel quality (high front, low front,
high back)
Adjacent vowel quality
4
Vowel length (long, short)
5
Adjacent vowel length
4
5
6
6
7
8
9
74
Target consonant position (initial,
medial, final)
Target consonant voicing (voiced,
voiceless)
Target consonant manner (stop,
fricative)
Speaker gender (male, female)
3.3. Results
In this section, I will report the main effects and the interactions for consonant and
vowel durations and for vowel formant frequencies. While all main effects and
interactions will be tested, I will focus on reporting those effects that show significant
differences between plain and emphatic environments.
3.3.1. Duration
Plain and emphatic target consonant durations were exactly equal (when rounded) at 93
ms, [F (1, 1782) = 0.118, p = 0.731]. A similar result was obtained for target
consonants in word-initial position where plain target consonants were 89 ms and
emphatic ones were 90 ms, [F (1, 374) = 0.036, p = 0.849]. The same also held for
target consonants in word-medial position where plain and emphatic target consonants
were 89 ms, [F (1, 1014) = 0.085, p = 0.771]. Word-final target consonant durations
were also not different. Plain target consonants in word-final position were 106 ms and
their emphatic counterparts were 111 ms, [F (1, 390) = 0.637, p = 0.425].
Except for one condition, other consonants in the stimuli did not show any significant
differences. That is to say, emphasis spread did not show on consonants in adjacent
positions. The only consonants which showed significant durational differences were
initial consonants in words where the target consonant was in word-final position. In
these stimuli, initial consonants in words with final plain targets were longer than their
counterparts in words with final emphatic targets, 83 ms and 78 ms, respectively. This
75
difference is significant, [F (1, 390) = 7.473, p = 0.007]. No interactions with target
consonant duration in word-initial positions were obtained.
As for vowels, measurements were taken in relation to the three possible target
consonant positions. Similar to target consonants, results on vowel durations did not
show significant differences. No significant durational differences were established for
vowels when the target consonant was in word-initial or word-final positions. However,
vowel durational differences were established when the target consonant was in word
medial position. Vowels in the same syllable with the emphatic consonant where the
target consonant was in word-medial position were significantly longer than their
counterparts in plain environments, 122 ms, and 114 ms, respectively. This difference
is significant: [F (1, 1014) = 4.779, p = 0.029]. There were no significant interactions
between target consonant manner, vowel length, and vowel quality.
3.3.2. Formant Frequency
Results on vowel formant frequencies are reported in this section in target and in
adjacent positions for F1, F2, and F3 at vowel onset, midpoint and offset. Main findings
will be reported first, then interactions between the degree of emphasis and other
independent variables. Where statistical analyses fail to show significant interactions,
results will not be reported.
76
3.3.2.1. F1
Since the target consonants appear in three different positions: word-initial, wordmedial and word-final, the results will be reported accordingly.
3.3.2.1.1. F1 target consonant word-initial
Under this condition, F1 for vowels in the same syllable with the target consonant, e.g.
[u] in [s¿ubat], was measured at three points in the vowel: onset, midpoint and offset.
Results show significant differences. In onset positions, F1 for vowels in plain
environments was 406 Hz, and 480 Hz for vowels in emphatic environments, [F (1,
374) = 52.364, p < 0.001]. The same significance was captured at vowel midpoint
positions, [F (1, 374) = 8.15, p =0.004], and at vowel offset positions, [F (1, 374) =
7.524, p = 0006]. Figure 2 shows these results:
77
600
508
480
Frequency (Hz)
500
*
*
465
474
454
*
406
Plain
400
Emphatic
300
200
Onset
Mid
Offset
Figure 2: F1 values in word-initial positions at onset, midpoint and offset positions for the
vowel in the same syllable with the target consonant.
A Consonant-type by Gender interaction shows that in emphatic environments, F1 at
onset position rises by 97 Hz for males and by 51 Hz for females. This difference is
significant,
[F (2, 372) = 5.19, p = 0.023]. Figure 3 shows these results.
78
600
Frequency (Hz)
500
Male
400
Female
300
200
Plain
Emphatic
Figure 3: Consonant-type by Gender interaction for F1 Onset
A similar result is obtained for vowel F1 at offset positions; a Consonant-type by
Gender interaction is obtained where vowel F1 in emphatic environments rises by 52
Hz for males and by 8 Hz for females. This difference is significant, [F (2, 372) = 4.42,
p = 0.036]; see Figure 4. No other interactions were obtained.
79
600
Frequency(Hz)
500
Male
Female
400
300
200
Plain
Emphatic
Figure 4: Consonant-type by Gender interaction for F1 Offset
Examining adjacency, e.g. [a] in [s¿ubat], F1 for the vowel in the right-adjacent syllable
was measured at three points: onset, midpoint and offset.
Except when F1 is measured in onset position, the differences shown here proved
insignificant. In onset position, F1 was significantly higher in emphatic environments,
[F (1, 374) = 15. 503, p < 0.001]. No significant results were obtained for F1
measurements for vowels in adjacent positions when these measurements were taken at
midpoint, [F (1, 374) = 1.245, p =0.265], or at offset, [F (1, 374) = 0.158, p =0.691].
Figure 5 plots these results:
80
700
611
Frequency (Hz)
600
*
620
561
542
539
532
Plain
500
Emphatic
400
300
Onset
Mid
Offset
Figure 5: F1 values in word-initial positions at onset, midpoint and offset positions for the
vowel right-adjacent to the syllable with the target consonant.
A significant Consonant-type by Gender interaction is obtained for the vowel in the
right-adjacent position when the measurement is taken at onset position: F1 in emphatic
environments rises by 29 Hz for males and only by 10 Hz for females. This difference
is significant, [F (2, 372) = 4.79, p = 0.029]; see Figure 6. No other significant
interactions were obtained.
81
600
Frequency(Hz)
500
Male
Female
400
300
200
Plain
Emphatic
Figure 6: Consonant-type by Gender interaction for F1 Onset for the vowel in the right-adjacent
position.
3.3.2.1.2. F1 target consonant word-medial
The stimuli used for this condition had only variations of two vowels: /i/ and /a/. For
vowels in word medial positions in the same syllable with the target consonant, e.g. [i]
in [kat¿ik], F1 was measured at three points in the vowel: onset, midpoint and offset.
The results show that F1 values at onset and mid positions were significantly higher in
emphatic environments compared to their plain counterparts, [F (1, 1014) = 68.897, p <
0.001], and [F (1, 1014) = 7.882, p = 0.005], respectively. F1 values at offset position
82
for the target vowel did not show significant differences, [F (1, 1014) = 2.300, p =
0.130]. Figure 7 below shows these results.
700
Frequency (Hz)
600
500
496
*
* 542
513
Plain
475
455
440
Emphatic
400
300
Onset
Mid
Offset
Figure 7: F1 values at onset, midpoint and offset positions for the vowel in the same syllable
with the word-medial target consonant.
No Consonant-type by Vowel Quality interactions were obtained for the target vowels
themselves, i.e. the vowels in the same syllable with the target consonant. However,
some interactions for the adjacent vowel were found. A Consonant-type by Vowel
Quality interaction was obtained for F1 Onset of the vowel to the left of the target one.
When the target vowel was /a/, F1 Onset for the left adjacent vowel rose by 71 Hz in
emphatic position compared to only 46 Hz when the target vowel was /i/. This
83
difference is significant, [F (1, 1011) = 8.05, p = 0.005]. Figure 8 below displays these
results.
800
Frequency (Hz)
700
600
/i/
500
/a/
400
300
200
Plain
Emphatic
Figure 8: Consonant-type by Vowel Quality interaction for F1 Onset of the vowel to the left
of the target one.
Another interaction, a Consonant-type by Adjacent Vowel Quality was obtained for F1
Onset, e.g. for /a/ in [kat¿ik]. Results show that in emphatic environments, F1 Onset
rises by (44 Hz) for /a/ and by 22 Hz for /i/. This difference is significant, [F (1, 1011)
= 17.77, p < 0.001]. Figure 9 below plots these results.
84
800
Frequency (Hz)
700
600
/i/
500
/a/
400
300
200
Plain
Emphatic
Figure 9: Consonant-type by Adjacent Vowel Quality interaction for F1 Onset of the vowel to
the left to the target one.
A similar result was obtained when the measurement was taken at the vowel midpoint.
A Consonant-type by Adjacent Vowel Quality was obtained for F1 Mid, e.g. for /a/ in
[kat¿ik]. Results show that in emphatic environments, F1 Mid rises by 73 Hz for /a/ and
by 43 Hz for /i/. This difference is significant, [F (1, 1011) = 8.49, p = 0.004]. Figure
10 below plots these results.
85
800
Frequency (Hz)
700
600
/i/
500
/a/
400
300
200
Plain
Emphatic
Figure 10: Consonant-type by Adjacent Vowel Quality interaction for F1 Mid of the vowel to
the left of the target one.
Similarly, a Consonant-type by Adjacent Vowel Quality interaction was obtained for
F1 Offset, e.g. for /a/ in [kat¿ik]. Results show that in emphatic environments, F1 Offset
rises by 82 Hz for /a/ and by 56 Hz for /i/. This difference is significant, [F (1, 1011) =
3.88, p = 0.049]. Figure 11 below plots these results.
86
800
Frequency (Hz)
700
600
/i/
500
/a/
400
300
200
Plain
Emphatic
Figure 11: Consonant-type by Adjacent Vowel Quality interaction for F1 Offset of the vowel
to the left of the target one.
Additionally, a Consonant-type by Gender interaction was also obtained for F1 Onset
of the vowel left to the target one. F1 Onset for males rises by 74 Hz in emphatic
environments compared to only 36 Hz for females. This difference is significant, [F (1,
1011) = 8.43, p = 0.004]. Figure 12 below displays this result.
87
600
Frequecy(Hz)
500
Male
400
Female
300
200
Plain
Emphatic
Figure 12: Consonant-type by Gender interaction for F1 Onset of the vowel to the left of the
target one.
When F1 measurements were taken at the middle of the left adjacent vowel, similar
results were obtained. A Consonant-type by Vowel Quality interaction was obtained for
F1 Mid of the vowel to the left of the target one. When the target vowel was /a/, F1 Mid
for the left adjacent vowel rose by 47 Hz in emphatic position compared to only 24 Hz
when the target vowel was /i/. This difference is significant, [F (1, 1011) = 3.97, p =
0.046]. Figure 13 below displays these results.
88
800
Frequency (Hz)
700
600
/i/
500
/a/
400
300
200
Plain
Emphatic
Figure 13: Consonant-type by Vowel Quality interaction for F1 Mid of the vowel to the left of
the target one.
Interestingly, when examining leftward adjacency, e.g. [a] in [kat¿ik], F1 shows a
significant rise when measurements were taken at onset, mid and offset positions. At
onset positions, F1 was significantly higher in emphatic environments compared to its
counterpart in plain environments, [F (1, 1014) = 47.631, p < 0.001]. The same applied
to F1 values when the measurement was taken at midpoint, [F (1, 1014) = 43.878, p <
0.001] and at offset, [F (1, 1014) = 87.760, p < 0.001]. Figure 14 shows these results.
89
700
Frequency (Hz)
600
513
400
485
*
451
500
*
384
Plain
Emphatic
*
418
414
300
Onset
Mid
Offset
Figure 14: F1 values in word-medial positions at onset, midpoint and offset positions for the
vowel left-adjacent to the syllable with the target consonant.
3.3.2.1.3. target consonant word-final
For vowels in word final positions in the same syllable with the target consonant, e.g. [i]
in [tabis¿], F1 was measured at three points in the vowel: onset, midpoint and offset.
Results show that target vowel F1 was significantly higher in emphatic environment
compared to its plain counterpart at two positions: onset and offset of the vowel.
Differences obtained when the measurements were taken at vowel midpoint were not
significant. For onset positions, [F (1, 389) = 5.137, p = 0.024]; for mid vowel positions,
[F (1, 389) = 2.975, p = 0.085]; and for vowel offset, [F (1, 389) = 20.402, p < 0.001].
Figure 15 below shows these results.
90
700
Frequency (Hz)
600
537
*
500
513
505
490
*
468
451
Plain
Emphatic
400
300
Onset
Mid
Offset
Figure 15: F1 values in word-final positions at onset, midpoint and offset positions for the
vowel in the same syllable with the target consonant.
F1 for the vowel in the left adjacent positions, e. g. [a] in [tabis¿], was significantly
higher in emphatic environments compared to plain ones when measurements were
taken at three positions: onset, [F (1, 389) = 72.503, p = 0.00], vowel midpoint, [F (1,
389) = 40.933, p = 0.00], and vowel offset, [F (1, 389) = 44.162, p = 0.00]. Figure 16
below shows these results.
91
700
Frequency (Hz)
600
*
540
*
500
580
*
522
534
497
485
Plain
Emphatic
400
300
Onset
Mid
Offset
Figure 16: F1 values in word-final positions at onset, midpoint and offset positions for the
vowel left-adjacent to the syllable with the target consonant.
A significant Consonant-type by Gender interaction was obtained for F1 of the left
adjacent vowel when the measurement was taken at the offset. F1 in emphatic
environments rises for males by 79 Hz compared to females 30 Hz. This difference is
significant, [F (2, 387) = 4.17, p = 0.042]; see Figure 17 below.
92
600
Frequecy(Hz)
500
Male
400
Female
300
200
Plain
Emphatic
Figure 17: Consonant-type by Gender interaction for F1 Offset for the vowel in the left adjacent
position.
3.3.2.2. F2
Since the target consonants appear in three different positions: word-initial,
word-medial and word-final, the results will be reported accordingly.
3.3.2.2.1. target consonant word-initial
For vowels in the same position with the target consonant, e.g. [u] in [s¿ubat], F2 was
measured at three points: onset, midpoint and offset. Where target consonants appear in
word-initial positions, the results show that F2 values for vowels in emphatic
environments drop significantly. However, the drop is only significant when F2 is
93
measured at onset position. F2 in vowel onset position is significantly lower in
emphatic environments compared to plain ones, [F (1, 374) = 32.261, p <0.001]. When
measured at vowel midpoint, differences were not significant, [F (1, 374) = 2.789, p =
0.096]; and similarly for vowel offset, [F (1, 374) = 0.862, p = 0.354]. Figure 18 below
shows the results:
2000
1983
1875
*
1800
1767
1701
1702
Frequency (Hz)
1646
1600
Plain
Emphatic
1400
1200
1000
Onset
Mid
Offset
Figure 18: F2 values in word-initial positions at onset, midpoint and offset positions for the
vowel in the same syllable with the target consonant.
A Consonant-type by Vowel Quality interaction was obtained for F2 in word-initial
positions. In emphatic environments, F2 for the vowel in word-initial target positions
dropped by 437 Hz for /i/, 421 Hz for /a/ and only by 32 Hz for /u/. This difference is
significant, [F(2, 370) = 14.21, p<0.001]. Figure 19 shows these results:
94
2600
2400
Frequency (Hz)
2200
2000
Plain
1800
Emphatic
1600
1400
1200
1000
/i/
/a/
/u/
Figure 19: Consonant-type by Vowel Quality interaction for F2 Onset in target positions.
A Consonant-type by Gender interaction is also obtained for F2 in word-initial
positions. In emphatic environments, F2 drops by 458 Hz for females and by 105 Hz
for males, [F (2, 372) =13.31, p<0.001], Figure 20 below shows these results:
95
2600
2400
Frequency(Hz)
2200
2000
Plain
1800
Emphatic
1600
1400
1200
1000
Male
Female
Figure 20: Consonant-type by Gender interaction for F2 Onset in target positions.
Results show that F2 for vowels in right adjacent positions, e.g., [a] in [s¿ubat], was
significantly lower in emphatic environments compared to plain ones. In onset position,
the F2 drop for vowels in emphatic positions was significant, [F (1, 374) = 32.512, p <
0.001]. A similar result was obtained when the measurement was taken at midpoint, [F
(1, 374) = 50.212, p < 0.001], and when the measurement was taken at vowel offset, [F
(1, 374) = 13.453, p < 0.001]. Figure 21 plots these results:
96
2000
1800
1749
Frequency (Hz)
1641
1600
*
1522
1496
1400
*
1670
Plain
*
Emphatic
1371
1200
1000
Onset
Mid
Offset
Figure 21: F2 values at onset, midpoint and offset positions for the vowel right-adjacent to the
syllable with the word-initial target consonant.
3.3.2.2.2. target consonant word-medial
For vowels in word medial positions in the same syllable with the target consonant, e.g.
[i] in [kat¿ik], results show that F2 drops significantly in emphatic positions when
measurements were taken at vowel onset, [F (1, 1014) = 340.498, p < 0.001]; vowel
midpoint, [F (1, 1014) = 104.776, p < 0.001]; and vowel offset, [F (1, 1014) = 70.116, p
< 0.001]. Figure 22 below shows these results.
97
2400
2267
2200
*
2058
2103
Frequency (Hz)
2052
2000
*
Plain
1823
1800
Emphatic
*
1626
1600
1400
Onset
Mid
Offset
Figure 22: F2 values at onset, midpoint and offset positions for the vowel in the same syllable
with the word-medial target consonant.
A Consonant-type by Vowel Quality interaction is obtained where F2 in emphatic
environments drops by 349 Hz for /a/ and only by 92 Hz for /i/, [F(1, 1011) =65.42,
p<0.001]. Figure 23 plots these results:
98
2600
2400
Frequency(Hz)
2200
2000
Plain
1800
Emphatic
1600
1400
1200
1000
/i/
/a/
Figure 23: Consonant-type by Vowel Quality interaction for F2 Mid in target positions.
A Consonant-type by Gender interaction is also obtained where F2 for females drops in
emphatic environments by 264 Hz, and by 156 Hz for males. This is a significant
difference, [F(1, 1011) =4.74 , p=0.030]. Figure 24 shows the results:
99
2600
2400
Frequecy(Hz)
2200
2000
Male
1800
Female
1600
1400
1200
1000
Plain
Emphatic
Figure 24: Consonant-type by Gender interaction for F2 Mid in target positions.
A Consonant-type by Adjacent Vowel Quality interaction is also obtained for F2 in
word-medial positions for the vowels adjacent to the target one where F2 drops by 408
Hz for the vowel /a/ and by 113 Hz for the vowel /i/. This is a significant difference,
[F(1, 1011) =92.94, p<0.001]. Figure 25 below shows the results:
100
2600
2400
Frequency (Hz)
2200
2000
Plain
1800
Emphatic
1600
1400
1200
1000
/i/
/a/
Figure 25: Consonant-type by Adjacent Vowel Quality interaction for F2 Onset in adjacent
positions.
In adjacent positions, a Consonant-type by Gender interaction is also obtained where F2
in emphatic environments drops by 323 Hz for females and by 204 Hz for males. This
difference proves significant, [F (1, 1011) =11.00 , p=0.001]. See Figure 26.
101
2600
2400
Frequecy(Hz)
2200
2000
Male
1800
Female
1600
1400
1200
1000
Plain
Emphatic
Figure 26: Consonant-type by Gender interaction for F2 Onset in adjacent positions.
For vowels in left adjacent positions, e.g., [a] in [kat¿ik], F2 drops significantly in
emphatic positions when measurements were taken at vowel onset, [F (1, 1014) =
166.839, p < 0.001]; vowel midpoint, [F (1, 1014) = 207.395, p < 0.001]; and vowel
offset, [F (1, 1014) = 464.616, p < 0.001]. Figure 27 below shows these results.
102
2400
2385
2231
Frequency (Hz)
2200
*
2121
2127
*
2000
Plain
1880
Emphatic
1800
*
1662
1600
1400
Onset
Mid
Offset
Figure 27: F2 values at onset, midpoint and offset positions for the vowel left-adjacent to the
syllable with the word-medial target consonant.
3.3.2.2.3. target consonant word-final
For vowels in word final positions in the same syllable with the target consonant, e.g. [i]
in [tabis¿], F2 was measured at three points in the vowel: onset, midpoint and offset. At
all measurement points, F2 drops significantly in emphatic environments compared to
the plain counterparts. At onset positions, [F (1, 389) = 7.389, p = 0.007]; at mid
positions, [F (1, 389) = 8.105, p = 0.005]; and at offset positions, [F (1, 389) = 29.169,
p < 0.001]. Figure 28 shows these results.
103
2000
1822
1800
1736
Frequency (Hz)
1666
1600
*
*
1508
1559
*
1595
Plain
Emphatic
1400
1200
1000
Onset
Mid
Offset
Figure 28: F2 values in word-final positions at onset, midpoint and offset positions for the
vowel in the same syllable with the target consonant.
F2 for the vowel in the left-adjacent positions, e.g. [a] in [tabis¿], was significantly
lower in emphatic environments compared to plain ones. At onset positions, [F (1, 389)
= 100.231, p < 0.001]; at mid positions, [F (1, 389) = 84.499, p < 0.001]; and at offset
positions, [F (1, 389) = 55.099, p < 0.001]. Figure 29 shows these results.
104
2000
1883
*
1717
Frequency (Hz)
1800
1811
*
1622
1600
1652
*
1452
Plain
Emphatic
1400
1200
1000
Onset
Mid
Offset
Figure 29: F2 values at onset, midpoint and offset positions for the vowel left-adjacent to the
syllable with the word-final target consonant.
3.3.2.3. F3
Since the target consonants appear in three different positions: word-initial,
word-medial and word-final, the results will be reported accordingly.
3.3.2.3.1. target consonant word-initial
For vowels in the same position with the target consonant, e.g. [u] in [s¿ubat], F3 was
measured at three points: onset, midpoint and offset. The results show that F3 is higher
for vowels in emphatic environments. This increase is significant regardless of the
position in the vowel at which the measurement was taken. In onset position, F3 rises
105
significantly in emphatic environments, [F (1, 374) = 18.819, p < 0.001]. When the
measurement is taken at vowel midpoint, a similar result is revealed, [F (1, 374) =
4.732, p = 0.030]. And the same is true for measurements taken at vowel offset
positions, [F (1, 374) = 16.143, p < 0.001]. Figure 30 plots these results:
3200
*
3000
3094
2990
* 3087
3027
2991
*
Frequency (Hz)
2883
2800
Plain
2600
Emphatic
2400
2200
2000
Onset
Mid
Offset
Figure 30: F3 values in word-initial positions at onset, midpoint and offset positions for the
vowel in the same syllable with the target consonant.
For F3 measurements for vowels in right adjacent positions, e.g. [a] in [s¿ubat], the
results show that F3 was significantly higher in emphatic environments compared to
plain ones only when the measurements were taken at onset and midpoint positions, not
at offset positions.
106
Vowel F3 in onset position is significantly higher in emphatic compared to plain
environments, [F (1, 374) = 13.328, p < 0.001]. A similar result is obtained for vowel
F3 at midpoint, [F (1, 374) = 8.376, p = 0.004]. However, no significant difference was
obtained for vowel F3 in offset position, [F (1, 374) = 2.972, p = 0.086]. Figure 31 plots
these results:
3200
2956
3000
Frequency (Hz)
*
2800
*
2723
2803
3000
2857
2799
Plain
2600
Emphatic
2400
2200
2000
Onset
Mid
Offset
Figure 31: F3 values at onset, midpoint and offset positions for the vowel right adjacent to the
syllable with the target consonant.
3.3.2.3.2. target consonant word-medial
For vowels in word medial positions in the same syllable with the target consonant, e.g.
[i] in [kat¿ik], results show that F3 rises significantly in emphatic environments when
measurements are taken at onset and offset positions but not at vowel midpoint. At
107
onset positions, [F (1, 1014) = 32.793, p < 0.001]; at vowel midpoint, [F (1, 1014) =
2.400, p = 0.122]; and at vowel offset, [F (1, 1014) = 4.267, p = 0.039]. Figure 32
below plots these results.
3200
Frequency (Hz)
3000
* 3008
2927
2932 2956
* 2975
2940
2800
Plain
2600
Emphatic
2400
2200
2000
Onset
Mid
Offset
Figure 32: F3 values at onset, midpoint and offset positions for the vowel in the same syllable
with the word-medial target consonant.
Surprisingly, for vowels in left adjacent positions, e.g., [a] in [kat¿ik], F3 drops
significantly in emphatic environments when measurements were taken at onset and
midpoint positions, but expectedly rises when measurements were taken at offset
positions. F3 onset and midpoint for the left adjacent vowel were significantly lower in
emphatic environments, [F (1, 1014) = 27.801, p < 0.001], and [F (1, 1014) = 6.439, p
= 0.011], respectively. F3 offset for the left adjacent vowel was significantly higher in
108
emphatic environments, [F (1, 1014) = 21.142, p < 0.001]. Figure 33 below shows
these results.
3200
3028
Frequency (Hz)
3000
*
2944
2952
*
2914
* 2998
2932
2800
Plain
2600
Emphatic
2400
2200
2000
Onset
Mid
Offset
Figure 33: F3 values at onset, midpoint and offset positions for the vowel left-adjacent to the
syllable with the target consonant.
3.3.2.3.3. target consonant word-final
For vowels in word final positions in the same syllable with the target consonant, e.g. [i]
in [tabis¿], F3 was measured at three points in the vowel: onset, midpoint and offset. At
all measurement points, vowel F3 significantly rises in emphatic environments
compared to plain counterparts. F3 is higher in emphatic environments when
measurements were taken at onset positions, [F (1, 389) = 15.857, p = 0.00]; at
109
midpoint, [F (1, 389) = 12.830, p = 0.00]; and at offset, [F (1, 389) = 56.965, p = 0.00].
Figure 34 below displays these results.
3139
3200
*
3000
2972
*
3074
2975
*
2958
Frequency (Hz)
2858
2800
Plain
2600
Emphatic
2400
2200
2000
Onset
Mid
Offset
Figure 34: F3 values at onset, midpoint and offset positions for the vowel in the same syllable
with the word-final target consonant.
F3 for the vowel in the left-adjacent positions, e.g. [a] in [tabis¿], was significantly
higher in emphatic environments compared to plain ones. However, this result was
obtained when measurements were taken at vowel midpoint and offset positions but not
at vowel onset positions. At onset positions, vowel F3 did not show significant
differences, [F (1, 389) = 3.249, p = 0.072]. At vowel midpoint, F3 was significantly
higher in emphatic environments, [F (1, 389) = 6.061, p = 0.014], and similar results
were obtained when measurements were conducted at vowel offset positions, [F (1, 389)
= 14.876, p = 0.00]. Figure 35 below shows these results.
110
3200
Frequency (Hz)
3000
2891
2928
*
2839
2892
2800
2849
*
2760
Plain
2600
Emphatic
2400
2200
2000
Onset
Mid
Offset
Figure 35: F3 values at onset, midpoint and offset positions for the vowel left-adjacent to the
syllable with the target consonant.
3.4. Results Summary
Tables 2, 3, and 4 below summarize the main results on the bisyllables. Since segment
durations did not show any consistent effects, we will focus on spectral measures here
and in the subsequent discussion. This arrow (y) represents an increase in formant
frequency in emphatic position, the other one (z) represents a decrease in formant
frequency in emphatic position. The numbers in each table under Difference show the
magnitude of the increase or decrease (in Hz).Wherever numbers are shown in
parentheses, the difference was not significant.
111
Table 3: Emphasis in words with initial target consonants.
sa.bat
F1 onset
F1 mid
F1 offset
F2 onset
F2 mid
F2 offset
F3 onset
F3 mid
F3 offset
Vowel 1 Difference (Hz)
74
y
43
y
20
y
282
z
(108)
z
(56)
z
104
y
60
y
108
y
Vowel 2
y
y
z
z
z
z
y
y
y
Difference
19
(9)
(7)
125
119
79
80
58
(44)
Vowel 2
F1 onset
y
F1 mid
y
F1 offset
y
F2 onset
z
F2 mid
z
F2 offset
z
F3 onset
y
F3 mid
y
F3 offset
y
Difference
56
29
(20)
426
280
209
81
(24)
35
Vowel 1
F1 onset
y
F1 mid
y
F1 offset
y
F2 onset
z
F2 mid
z
F2 offset
z
F3 onset
y
F3 mid
y
F3 offset
y
Difference
37
40
37
166
189
200
(37)
53
89
F1 onset
F1 mid
F1 offset
F2 onset
F2 mid
F2 offset
F3 onset
F3 mid
F3 offset
Table 4: Emphasis in words with medial target consonants.
ka.sak
F1 onset
F1 mid
F1 offset
F2 onset
F2 mid
F2 offset
F3 onset
F3 mid
F3 offset
Vowel 1
y
y
y
z
z
z
z
z
y
Difference (Hz)
34
62
71
264
351
465
84
38
66
Table 5: Emphasis in words with final target consonants.
ta.bas
F1 onset
F1 mid
F1 offset
F2 onset
F2 mid
F2 offset
F3 onset
F3 mid
F3 offset
Vowel 2
y
y
y
z
z
z
y
y
y
Difference (Hz)
22
(24)
54
158
177
227
114
99
181
112
3.5. Discussion
In line with previous studies and similar to the present results for the monosyllables, the
main findings indicate a significant increase for vowel F1 and F3 values in emphatic
target positions and a significant decrease for vowel F2. However, results are more
complex for bisyllables. In words with initial target consonants, e.g. [s¿a.bat], F1
significantly rises for the target vowel in all measurement positions but this difference
fades away in the right-adjacent vowel when measurements are taken at midpoint and
offset positions. For words with a medial target consonant, e.g., [ka.s¿ak], F1 values for
the vowel in the same syllable with the target consonant rises at onset and midpoint but
no significant differences were obtained for target vowel offset. Interestingly, for the
vowel in the left-adjacent position, i.e. crossing a syllable, F1 rises at the three
measurement points. As for words with target consonants in final positions, i.e.
[ta.bas¿], F1 rises at all three measurement points for the target vowel and continues to
rise for F1 onset and offset positions, not for F1 midpoint. These results suggest that
for the most part, F1 increases in emphatic positions when the vowel is in the same
syllable with the target consonant. The same effect seems to spread leftwards more
easily than rightward. Card (1983) mentions leftward emphasis spread – as observed
through F2 lowering – being more attested compared to rightward emphasis. However,
her results are based on monosyllables. In the present study, leftward emphasis spread
crosses the syllable boundary compared to rightward emphasis spread, which seems to
be confined to the target syllable.
113
F2 shows a significant drop in emphatic environments. In words with initial target
consonants, F2 drops significantly at vowel onset but the drop is insignificant at vowel
midpoint and offset. The drop is attested in vowels right-adjacent to the target
consonant at all measurement points. This only occurs in words with initial target
consonants. In words with medial and final target consonants, F2 drops significantly in
emphatic positions at the three measurement points and for the two vowels: the target
and the left-adjacent ones. In accordance with our previous observations, F2 drops in
the target positions and left of the syllable with the target vowel. This gives more
evidence that leftward emphasis spread is more prominent in UJA.
F3 shows a significant increase in emphatic environments. In words with initial targets,
vowels in the same syllable show a significant increase. This effect spreads to the
vowel in the right-adjacent syllable when measurements are taken at vowel onset and
midpoint, not at vowel offset. In words with medial target consonants, F3 rises in target
positions when the measurements are taken at vowel onset and offset but not at vowel
midpoint. For the vowel left of the target syllable, F3 surprisingly drops at vowel onset
and midpoint but rises significantly at vowel offset. In words with final target
consonants, F3 rises in the vowel in the same syllable with the target consonant when
the measurements are taken at vowel midpoint and offset, but not at vowel onset.
However, F3 significantly increases in left-adjacent vowels at the three measurement
points.
Despite only a few odd findings in the results, mentioned above, there is a clear effect
of emphasis on vowels whereby F1 and F3 increase while F2 drops in emphatic
114
positions; and where leftward emphasis spread – as an effect of F1/F3 increase and F2
drop – is more obvious compared to rightward emphasis spread such that the former
spans across the syllable boundary. This shows that anticipatory coarticulation is more
common than preservatory coarticulation. Emphasis spread is asymmetrical in nature
and "pharyngealization (like velarization) is anchored on the onset phase of the primary
articulation and for that reason tends to spread in an anticipatory manner, affecting the
formants of preceding segment(s) more than the formants of following segment(s),"
Watson (1999: 293). See also Laver 1994; Davis 1995; Ladefoged and Maddieson 1996
for similar discussions on the prevalence of anticipatory coarticulation. Watson,
supported by Ladefoged and Maddieson (1996), ascribes this asymmetrical nature of
emphasis spread to secondary articulation, (1999: 298). She explains that in emphasis,
"the pharynx narrows prior to the hold phase of the primary articulation; thus,
pharyngealization is anchored more on the onset of the primary articulation, which
results in the anticipatory nature of spread of pharyngealization".
The analyses in this chapter give room to a wider range of interactions that can enable
us to better understand the acoustic correlates of emphasis. A number of independent
variables were tested, see Table 1 above. Among the different variables that were
examined, two seem to influence the degree of emphasis and/or emphasis spread:
gender of the speaker and vowel quality. As an effect of emphasis, F1 values increase
more in males than in females whereas F2 drops significantly more in females
compared to males; see Figures 3, 4, 6, 12, and 17 above.
115
With regard to vowel quality, it is rather obvious that the vowels that are transparent to
emphasis, i.e., do not seem to block emphasis spread are /a/ and /i/, where results show
that sometimes emphasis as an effect of F1 increase is significantly more evident in /a/
compared to /i/. However, the high back vowel /u/ seems to block emphasis spread.
This result holds for both the target vowels and the vowels in adjacent positions. Since
emphasis involves a clear secondary back articulation, it is not surprising that it affects
front and low vowels more than back vowels. This is what the data on bisyllables
confirm.
The acoustic findings on the bisyllables substantiate earlier findings on the
monosyllables and give a better idea of emphasis spread and its extent/domain. It is fair
to claim that most effects of emphasis can be observed in the vowels within the target
or adjacent syllables – despite a general feeling among native speakers of UJA, and
MSA as well, that the target consonant triggers the emphatic perception of the
consonant. These acoustic findings will be tested in the next chapter to verify, from the
point of view of the listener, whether the vowels or the consonants carry the effects of
emphasis.
116
Chapter Four
The perception study
The main findings of the acoustic study for the monosyllables showed that emphatic
stops in word-final positions were consistently longer than their plain counterparts.
While this was the only significant difference in terms of duration, the frequency
measurements on the vowels provided deeper and more vivid insights into the nature of
emphasis. Vowel F1 and F3 rose or tended to rise while F2 was consistently lower in
emphatic positions compared to their plain counterparts. However, different
interactions showed that some independent variables affected the degrees of increase or
decrease of formant frequency such as vowel quality, position of the target consonant
and vowel length. In the present chapter, I investigate whether some of the findings of
the acoustic study map on to perception. Namely, whether the difference in formant
frequencies between plain and emphatic obstruents and its variability due to vowel
quality affect perception. Of course, investigating every aspect of the difference
between plain and emphatic consonants and vowels in their neighborhood requires
extensive efforts and experimental designs to account for every interaction between the
consonant-type (plain and emphatic) and all other independent variables. This is
beyond the scope of this chapter. For the purposes of this study, I included two
independent variables: manner of articulation: a stop /t-t¿/ and a fricative /s-s¿/, and
vowel quality: low front /a/, and high back /u/.
117
While high front vowels /i-i:/ and low front vowels /a-a:/ maintain similar effects on
emphasis, cf. Section 2.3.2.2., high back vowels /u-u:/ show different effects. For low
front vowels in word-initial position, F2 onset drops by 529 Hz in emphatic positions
compared to their plain counterparts, whereas F2 onset for high back vowels drops only
by 375 Hz. F2 midpoint drops by 412 Hz for low front vowels and only by 184 Hz for
high back vowels, both in word-initial position. And finally, F2 offset drops by 280 Hz
for low front vowels and only by 42 Hz for high back vowels in the same position. F3
midpoint rises in emphatic positions for low front vowels but it does not show
significant differences for high back vowels. No other interactions were found for /a-a:/
or /u-u:/ with consonant-type that affected F1 or F3 measurements. All measurements
on duration and spectral moments for /b/ in plain and emphatic positions yielded no
differences.
Given these results, it is crucial to test the effects of the target consonants and target
vowels (that is, those vowels in the same syllable with the target consonant) on the
perception of emphasis. To investigate this issue, the present experimental design
utilizes splicing the segments that make up the target word and rearranging them such
that the one segment that is crucial to emphasis can be singled out. The first
requirement for this design is to establish whether subjects detect emphasis in spliced
target words. That is, to examine whether subjects can detect the plain or emphatic
target consonants in words that have been spliced and reassembled. This provides the
118
grounds for arguing that splicing per se does not affect perception. The following are
the questions this experiment attempts to address:
1. Is the perception of emphasis primarily driven by the target consonant or by the
target vowel?
2. Is the perception of emphasis affected by the manner of articulation of the emphatic
consonant?
3. Is the perception of emphasis affected by vowel quality?
To answer these questions, an experimental design is adopted to find out what
segment is crucial to the perception of emphasis and under what conditions, the
reaction times to the different conditions, and the rating of the subjects with regard to
their confidence level in their answers. Based on the findings of the acoustic study on
monosyllables and aided by the literature on emphasis, I make the following
hypotheses:
1. The perception of emphasis is not affected by the presence or absence of the
emphatic target consonant.
2. The perception of emphasis is governed by the target vowels.
3. Vowel quality affects the perception of emphasis: emphasis is more clearly
perceivable in low front vowels than in high back vowels.
119
4. 1. Method
4.1.1. Subjects
20 participants, 16 males and 4 females, whose native dialect was UJA took part in the
experiment. 16 of them were recruited from Kansas (Kansas City, Lawrence and
Manhattan) and 4 were recruited from Arkansas (Fayetteville). All of them were
college-age students, employees or their spouses. None of them reported having any
hearing or speech impairments.
4.1.2. Materials and Procedure
Four pairs of plain and emphatic words produced by the same subject were selected
from the sound files used in the acoustic study on the monosyllables: /tab-t¿ab/, /tubt¿ub/ /sab-s¿ab/, and /sub-s¿ub/. Table 1 shows the duration and formant frequency
measurements for each of these sounds.
120
Table 1: The duration and formant frequency measurements for the perception stimuli
Word
TC duration
Vowel duration
/b/ duration
V_F1 onset
V_F1 midpoint
V_F1 offset
V_F2 onset
V_F2 midpoint
V_F2 offset
V_F3 onset
V_F3 midpoint
V_F3 offset
sab
s¿ab
sub
s¿ub
tab
t¿ab
tub
t¿ub
161
145
126
127
114
149
151
139
117
134
120
107
109
126
136
100
105
90
106
126
126
108
126
109
368
443
354
374
372
477
379
434
473
505
404
434
480
525
416
411
390
301
309
328
345
385
214
345
1457
1117
1387
1308
1568
1248
1425
1833
1432
1157
1230
1527
1455
1146
1232
1037
1252
1102
1029
1140
1316
1026
936
1363
2464
2405
2551
2920
2714
2649
2745
3105
2385
2538
2378
3262
2377
2579
2447
2486
2246
2352
2291
2390
3113
2281
2466
3277
Each of these words was cut into its three segments, and then spliced back together
such that every word that the subjects heard had been spliced. This condition ensures
that none of the words used in the experiment is presented intact. Each segment was
defined in accordance with the procedure used in the acoustic study on the
monosyllables. Specifically, the same criteria adopted for the duration measurements of
the stops, the fricatives and the vowels in Chapter 2 were also adopted for these stimuli,
cf. Section 2.1.3. Two other conditions were applied to each sound file. For each pair,
the plain and the emphatic target consonants were cut and swapped. This created two
stimuli: the plain word with the emphatic target consonant, and the emphatic word with
the plain target consonant. Then, the same process is repeated for the vowels: for each
pair, the vowels in the plain and the emphatic environments were cut and swapped,
yielding two other stimuli: the plain word with the vowel from the emphatic word and
the emphatic word with the vowel from the plain word. The same procedure was used
121
for the four pairs of target words. Every condition is applied to a copy of the original
‘intact’ target word. Each condition was repeated 10 times for a total of 240 sound files.
Table 2 below shows the conditions used to create the stimuli. Letters in upper case
represent the segments that are emphatic; letters in lower case represent the segments
that are plain.
Table 2: The conditions used in the perception study
No.
1
2
3
4
5
6
7
8
9
10
11
12
Condition
original words spliced
change target Stop
change target Vowel
original words spliced
change target Stop
change target Vowel
original words spliced
change target Fricative
change target Vowel
original words spliced
change target Fricative
change target Vowel
Plain
tab
T¿ab
tAb
tub
T¿ub
tUb
sab
S¿ab
sAb
sub
S¿ub
sUb
Emphatic
T¿AB
tAB
T¿aB
T¿UB
tUB
T¿uB
S¿AB
sAB
S¿aB
S¿UB
sUB
S¿uB
The experiment was conducted using the Superlab software (Version 2). A picture file
containing the text “How confident are you of your answer” written in Modern
Standard Arabic orthography was created in Microsoft Paint. The sound files were used
to determine what target consonant the subjects heard and also to collect reaction time
data; the picture files were used to measure confidence ratings for each sound file.
Subjects heard the sound files on a Sony MDR-7502 Dynamic Stereo Headphone and
used a laptop with defined keyboard buttons to record their responses.
122
Each subject took a practice session of 24 trials, followed by the experimental set of
240 trials. No feedback was given during the practice session. Each trial consisted of a
sound file and a picture file. Trials were randomized by Superlab, and a 1000 ms ISI
separated each event. Subjects were tested individually and were instructed that they
would hear a list of monosyllabic words starting with one of the target consonants: /t, t¿,
s, s¿/. They were asked to push a response button that represented the initial consonant
of the word that they heard. Reaction time was measured from the onset of the sound
file until the subject pushed the response button. Once they responded to the sound file,
a picture file would appear asking them how confident they were of their answers. They
were asked to push a response button indicating their confidence level at a scale from 1
(least confident), to 5 (most confident). Therefore, every time the subjects heard a
sound file, they responded to it, and then the picture file with the question “How
confident are you of your answer” followed. Average time for completing the
experiment was around 16 minutes per subject.
4.2. Analysis
For each subject, responses to the sound file, reaction time and confidence ratings were
collected. These were the dependent variables. For the sound file responses and the
confidence ratings, a simple frequency count was used. Different ANOVA’s were used
to compare reaction times among the different conditions. The results will be presented
for stops first, then for fricatives. Though it was expected that subjects would respond
with a plain or an emphatic segment of the same manner, some subjects responded with
123
a different manner. In other words, where a certain condition was applied to /t/ or /t¿ /,
responses were expected to be confined to these two stops. However, in some of these
conditions, a few subjects responded with an /s/ or /s¿ /.
4.3. Results
In this section, I report the responses to the three conditions applied to the sound files,
the reaction times, and the confidence ratings.
4.3.1. Original words
Under this condition, the original sound files were cut into their segments and spliced
back together. Subjects detected the correct target consonants at rates ranging from
99% correct responses for /T¿UB/ and /S¿UB/ to 100% correct responses for /tab/, /sab/,
/S¿AB/ and /sub/. /T¿AB/ and /tub/ fell in the middle with 99.5% correct responses.
Despite the fact that the results go in the expected direction for this condition, the
reaction time data in each case tell a slightly different story. A one-way ANOVA on
reaction times shows that subjects responded significantly faster to /t/ in /tab/ (992 ms)
compared to /t¿/ in /T¿AB/ (1151 ms), [F (1, 398) = 13.639, p < 0.001]. This difference
is not detected for stops in words with high back vowels. Subjects detected /t/ in /tub/
with a mean RT of 1058 ms, and /t¿/ in /T¿UB/ with a mean RT of 1123 ms. This is not
a significant difference, [F (1, 398) = 2.835, p = 0.092]. While subjects responded faster
to /t/ than /t¿/ in the low front vowel context, mean RT for /s/ in /sab/ is 1059 ms and
124
986 ms for /s¿/ in /S¿AB/. This difference is significant, [F (1, 398) = 5.188, p = 0.023].
No significant difference is obtained in RT for /s/ in /sub/ (1098 ms) and /s¿/ in /S¿UB/
(1124 ms), [F (1, 398) = 0.389, p = 0.532].
The confidence rating data show that subjects were highly confident in most of their
answers. Table 3 shows the mean confidence for each condition.
Table 3: Mean confidence in responses for spliced words
Condition
tab
T¿AB
tub
T¿UB
sab
S¿AB
sub
S¿UB
total
Confidence rating
4.9
4.7
4.8
4.7
4.9
4.9
4.9
4.7
4.8
4.3.1.1. Summary
The results for the first condition show that subjects detected the target consonants with
a high degree of accuracy ranging from 99 – 100%. Reaction time data show that
subjects detected plain stops faster than emphatic stops in the environment of low front
vowels and they detected emphatic fricatives faster than plain fricatives in the same
vocalic environment. In the environment of high back vowels, no significant
differences were found. And finally, subjects expressed a high confidence rating of
their answers ranging from 4.7 to 4.9.
125
4.3.2. Changing target consonant
To investigate whether the target consonant is crucial in the perception of words with
plain and emphatic consonants, the target consonants were swapped in this condition.
Subjects were presented with one of two types of words: a plain word with the
emphatic target consonant, or an emphatic word with the plain target consonant. If the
target consonant is the key, then perception of emphatic words is expected to follow the
emphatic consonant regardless of the vocalic environment. This condition was used for
fricatives and stops in the context of the low front vowel /a/ and the high back vowel
/u/. In condition 2 in Table 2 /T¿ab/ and /tAB/ the emphatic target consonant was placed
in the plain word and the plain target consonant was spliced in the emphatic word. In
/T¿ab/, 94.5% reported hearing /t/ and only 5.5% reported hearing /t¿/. In /tAB/, 66%
reported a /t¿/ and 34% reported a /t/. The same condition was applied to words with the
high back vowel, /u/. In /T¿ub/ 52.5% responded with a /t/ and 42.5% responded with a
/t¿/. 5% responded that they heard /s¿/. In /tUB/, 65.5% reported that they heard /t/ and
29.5% reported that they heard /t¿/. Similar to the case in /T¿ub/, 5% reported hearing
/s/.
Figure 1 shows these results. Letters in upper case represent the emphatic
segments.
126
100
94.5
% responses
80
66
60
65.5
52.5
%t
42.5
34
40
%t¿
29.5
20
5.5
0
T¿ab
tAB
T¿ub
tUB
Figure 1: percent /t/ and /t¿/ responses for words where target consonants were swapped
The same conditions in fricatives yielded similar results. For /S¿ab/, 93.5% reported
hearing an /s/ and only 1.5% said they heard an /s¿/. For /sAB/, 95% reported hearing
an /s¿/ and no one reported hearing an /s/. 2% reported hearing /t/ and 3% reported
hearing /t¿/. In the vowel /u/ environment, detection of plain and emphatic target
consonants is affected. In /S¿ub/, only 24% reported hearing /s/ and 76% reported
hearing /s¿/. In /sUB/, 53.5% reported hearing /s/ and 46.5% reported hearing /s¿/.
Figure 2 shows these results.
127
100
95
93.5
76
% responses
80
53.5
46.5
60
%s
%s¿
40
24
20
1.5
0
0
S¿ab
sAB
S¿ub
sUB
Figure 2: percent /s/ and /s¿/ responses for words where target consonants were swapped
In terms of reaction time, subjects responded significantly faster to detecting the plain
stops (1094 ms) than their emphatic counterparts (1379 ms) in the context of low front
vowels ([F (1, 398) = 15.796, p < 0.001]). With high back vowels, subjects detected
plain stops faster than their emphatic counterparts, (1255 ms) and (1299 ms),
respectively, but this difference was not significant, [F (1, 398) = 0.383, p = 0.536]. See
Figure 3.
128
1400
1379
1299
1300
1255
*
RT (ms)
1200
1094
1100
1000
900
800
T¿ab
tAB
T¿ub
tUB
Figure 3: Mean reaction time for /t/ and /t ¿/ when target consonants are swapped
For fricatives, similar results are obtained in the context of low front vowels but
opposite results are obtained in the context of high back vowels. Subjects detected /s¿/
with a mean RT of 986 ms and /s/ with a mean RT of 1059. This difference is
significant, [F (1, 398) = 15.796, p < 0.001]. This is not detected in the context of the
high back vowel. Subjects’ mean RT for detection of /s/ was 1098 ms and 1125 ms for
/s¿/. This difference is not significant, [F (1, 398) = 0.390, p = 0.533]. Figure 4 shows
these results.
129
1200
1125
1098
RT (ms)
1100
1000
1059
986
*
900
800
S¿ab
sAB
S¿ub
sUB
Figure 4: Mean reaction time for /s/ and /s ¿/ when target consonants are swapped
The confidence rating data show that subjects were largely confident of their responses.
Confidence ratings ranged from 4.5 for swapping the target stop to 4.9 for swapping the
low front vowel and the emphatic fricative. Table 4 shows these ratings for the different
conditions of changing the target consonant.
Table 4: Mean confidence for responses to changing the target consonant
Condition
tAB
T¿ab
tUB
T¿ub
sAB
S¿ab
sUB
S¿ub
total
Confidence rating
4.6
4.6
4.5
4.5
4.9
4.9
4.7
4.6
4.7
130
4.3.2.1. Summary
For /tab/ and /t¿ab/, when the emphatic target consonant was spliced in /tab/, 94.5%
subjects reported a plain /t/, and only 5.5% reported an emphatic /t¿/. However, in the
mirror image condition: the plain target consonant in /t¿ab/, only 66% subjects reported
an emphatic /t¿/ and 34% reported a plain /t/. When this manipulation is applied to /t/
and /t¿/ in /tub/ and /t¿ub/, that is, when the emphatic target consonant from /t¿ub/ was
spliced in /tub/, only 52.5% reported a plain /t/ and 42.5% reported /t¿/. In the mirror
image condition: /t/ in /t¿ub/, 65.5% reported a plain /t/ and 29.5% reported an emphatic
/t¿/. In fricatives, for /sab/ and /s¿ab/, when the emphatic target consonant was spliced
in /sab/, 93.5% subjects reported a plain /s/, and only 1.5% reported an emphatic /s¿/. In
the mirror image condition: the plain target consonant in /s¿ab/, 95% subjects reported
an emphatic /s¿/ and no one reported a plain /s/. When this manipulation is applied to /s/
and /s¿/ in /sub/ and /s¿ub/, that is, when the emphatic target consonant from /s¿ub/ was
spliced in the /sub/, only 24% reported a plain /s/ and 76% reported /s¿/. In the mirror
image condition: /s/ in /s¿ub/, 53.5% reported a plain /s/ and 46.5% reported an
emphatic /s¿/.
With regard to reaction time, subjects detected the plain stops faster than the emphatic
ones in the context of low front vowels. No significant differences were obtained in the
context of high back vowels. For fricatives, subjects detected the emphatic fricatives
faster than their plain counterparts in the context of low front vowels. No significant
differences were obtained in the context of high back vowels.
131
Finally, confidence ratings are still high under all of the manipulations in this condition.
A total of 4.7 confidence rating is scored for the condition of changing the target
consonant.
4.3.3. Changing target vowel
In this condition, the vowel in the same syllable with the target consonant, also referred
to as the target vowel, was cut and swapped. Therefore, the conditions resulting from
this manipulation are the plain word with the vowel from its emphatic counterpart and
the emphatic word with the vowel from the plain one. The perception of emphasis is
expected to follow the target vowel in these conditions – in line with the findings from
the acoustic study. While this finding is not expected to be affected by the manner of
the target consonant, the quality of the target vowel is expected to have an effect on the
perception of emphasis. It is expected that emphasis is perceived more clearly in the
presence of low front vowels than in the presence of high back vowels.
For / tAb /, only 6.5% responses were /t/ and an overwhelming 93.5% responses were
/t¿/. For / T¿aB /, subjects reported hearing 97.5% /t/ and only 2.5% /t¿/. For /tUb/, 69%
responses reported hearing a /t/ and only 26% reported hearing /t¿/. When the plain /u/
is spliced in the emphatic word, /T¿uB/, 53% reported hearing a /t/ and 41.5 reported
hearing a /t¿/. See Figure 5.
132
100
97.5
93.5
% responses
80
69
60
53
%t
41
%t¿
40
26
20
2.5
6.5
0
T¿aB
tAb
T¿uB
tUb
Figure 5: percent /t/ and /t¿/ responses for words where target vowels were swapped
The results obtained for fricatives are similar. In /sAb/, 91.5% responses were /s¿/ and
only 3% were /s/. In /S¿aB/, only 3% responses were /s¿/ and 92% were /s/. In the
environment of high back vowels, responses are split for /sUb/: 50% /s/ and 50 % /s¿/
but in /S¿uB/, 95.5% responses were /s/ and only 4.5% responses were /s¿/. Figure 6
shows these findings.
133
100
92
91.5
95.5
% responses
80
60
50 50
%s
%s¿
40
20
3
4.5
3
0
S¿aB
sAb
S¿uB
sUb
Figure 6: percent /s/ and /s¿/ responses for words where target vowels were swapped
The reaction times between these conditions did not show any significant differences.
Subjects detected /t¿/ in /T¿aB/ with a mean RT of 1083 ms and 1097 ms for /t/ in /tAb/.
Also, similar reaction times are obtained in the context of high back vowels: 1255 ms
for /t¿/ in /T¿uB/ and 1289 ms for /t/ in /tUb/. Figure 7 shows these results.
1400
1300
1255
1289
RT (ms)
1200
1100
1083
1097
T¿aB
tAb
1000
900
800
T¿uB
tUb
Figure 7: Mean reaction time for /t/ and /t ¿/ when target vowels are swapped
134
For stops, the reaction time data do not show significant differences. In the context of
low front vowels, mean reaction time for /s¿/ in /S¿aB/ was 1079 ms and 1058 ms for /s/
in /sAb/. In the context of high back vowels, subjects detected /s/ in /sUb/ faster than
they detected /s¿/ in /S¿uB/: 1259 and 1311 ms, respectively. See Figure 8.
1400
1311
1259
1300
RT (ms)
1200
1100
1079
1058
1000
900
800
S¿aB
sAb
S¿uB
sUb
Figure 8: Mean reaction time for /s/ and /s ¿/ when target vowels are swapped
The confidence ratings for the sound files in this condition were also very high. Similar
to the previous condition where the target consonant was spliced, subjects responded
with a high confidence level to sound files under this condition. Table 5 shows these
results.
135
Table 5: Mean confidence for responses to changing the target vowel
Condition
tAb
T¿aB
tUb
T¿uB
sAb
S¿aB
sUb
S¿uB
total
Confidence rating
4.9
4.7
4.5
4.6
4.8
4.9
4.6
4.7
4.7
4.3.3.1. Summary
In stops, when the plain vowel /a/ is spliced in the emphatic word /t¿ab/, 97.5% subjects
reported hearing a plain /t/, and only 2.5% reported the hearing the emphatic /t¿/. When
the emphatic vowel /a/ is spliced in the plain word /tab/, 93.5% reported an emphatic
/t¿/ and only 6.5% reported hearing /t/. The same conditions for /t¿ub/ and /tub/ yielded
different results. When the plain vowel /u/ is spliced in the emphatic word /t¿ub/, 53%
reported a plain /t/ and 41% reported an emphatic /t¿/. When the emphatic vowel /u/ is
spliced in the plain word /tub/, 69% reported a plain /t/ and only 26% reported an
emphatic /t¿/. In fricatives, when the plain vowel /a/ is spliced in the emphatic word
/s¿ab/, 92% subjects reported hearing a plain /s/, and only 3% reported the hearing the
emphatic /s¿/. When the emphatic vowel /a/ is spliced in the plain word /sab/, 91.5%
reported an emphatic /s¿/ and only 4.5% reported hearing /s/. The same conditions for
/s¿ub/ and /sub/ yielded similar results in one manipulation and different results in
another. When the plain vowel /u/ is spliced in the emphatic word /s¿ub/, 95.5%
reported a plain /s/ and 4.5% reported an emphatic /s¿/. However, when the emphatic
136
vowel /u/ is spliced in the plain word /sub/, subjects were equally split between /s/ and
/s¿/ reporting 50% each.
Reaction time data for this condition did not yield any significant differences. Subjects
reacted faster to emphatic stops than to their plain counterparts but slower to emphatic
fricatives than to their plain counterparts. None of these differences was significant.
The same pattern in terms of confidence rating is obtained here. The lowest confidence
ratings were obtained for swapping the target high back vowel in the word with the stop
consonant, 4.5. The highest ratings were obtained for swapping the target low front
vowel and the target emphatic fricative, 4.9.
4.4. Discussion
The results for the original target words show that they were detected at high accuracy
rates ranging from 99% to 100% expected responses. This finding suggests that splicing
the word into its segments and reassembling these segments does not affect the
detection rates considerably, which enables us to eventually conclude that different
responses to the other two conditions would not be due to splicing as opposed to
swapping the target consonants and vowels. Detection percentages for spliced words
did not seem to have been influenced by vowel quality. Subjects responded with similar
detection rates to words that had low front vowels and high back vowels. The
significant differences in reaction times, however, seem to be harder to explain.
137
Subjects detected plain stops faster than emphatic stops and plain fricatives slower than
emphatic fricatives – both in the context of low front vowels. This difference might not
be related to emphasis per se. It could be related to a possibility that /tab/ and /s¿ab/ are
more commonly frequent in the language than /t¿ab/ and /sab/, which seems to hold
water, from an impressionistic perspective, given the lack of word frequency charts for
UJA.
The difference in reaction times reported for words with low front vowels is absent in
high back contexts. This discrepancy in reaction times is compensated by the results on
the confidence ratings for these words. The confidence ratings obtained for this
condition ranged from a low of 4.7 to a high of 4.9, which indicates that subjects were
highly confident on their answers.
With regard to changing the target consonant, a few interesting findings can be
reported. When the emphatic stop is spliced into the plain word, a vast majority of the
responses, 94.5%, recognized a plain stop. The mirror image, though at a lesser
magnitude, is obtained when the plain stop is spliced in the emphatic word: 66%
reported hearing an emphatic stop. This suggests that the target consonant does not
carry the effects of emphasis in the environment of low front vowels. That is,
perception of emphasis does not depend on the properties of the emphatic stop. The
same results are fostered by the findings on the fricatives. In conditions identical to the
ones mentioned here, subjects reported hearing a plain fricative when an emphatic
138
fricative was spliced in a plain word, and reported hearing an emphatic fricative when a
plain one was spliced in an emphatic word. Therefore, we can state that regardless of
the manner of articulation for the target consonant, the perception of emphasis is not
driven by the consonant in the environment of low front vowels.
Contrary to the findings on manner of articulation with regard to emphasis perception,
vowel quality seems to be critical in influencing the perception of emphasis. As
mentioned in the acoustic study, see Chapter 2, the mean difference in the second
formant frequency between low front vowels in plain and emphatic environments is
larger than that in high back vowels. This difference maps on to perception. Based on
the frequency of target consonant detections, subjects found it harder to decide whether
they heard a plain or an emphatic /t/ when these two stops were swapped in / T¿ub/ and
/Tub/. However, when the plain stop was spliced into the emphatic word, 65.5%
responses reported a plain /t/ compared 29.5% /t ¿/ responses. On the other hand,
subjects showed preference for /s¿/ (76%) when the emphatic fricative was spliced in
the plain word in the environment of high back vowels. These results show that the
high back vowels tend to impede the clear effects of emphasis because the magnitude
of F2 lowering in emphatic environments is smaller compared to that in low front
vowels. The findings on the reaction time data support the overriding influences of low
front vowels compared to the lesser effects of high back vowels. In stops and fricatives,
differences in reaction times to plain and emphatic segments were significant only in
the environment of low front vowels. They were, however, contradictory. While
subjects detected plain stops faster than emphatic stops, they detected emphatic
139
fricatives faster than plain fricatives. No significant differences were found in the
context of high back vowels. Despite these findings on reaction time, subjects thought
they were confident of their answers. The lowest end on the confidence ratings scale
was at 4.5 and the highest was at 4.9. Both of these ratings are substantially indicative
of high confidence.
In the third condition, swapping the target vowels, the perception results support the
overriding effects of the vowels in emphasis. When the vowel from the plain word is
spliced into the emphatic word, subjects detected the plain stop /t/ with a very high
percentage. When the vowel from the emphatic word was spliced into the plain word,
subjects detected the emphatic stop /t¿/. These results were also true for the fricatives.
In both cases for stops and fricatives, these results are obtained clearly in the context of
low front vowels. In the context of high back vowels, subjects’ responses were close to
chance level. However, the results for the vowels show some differences from those
obtained for the stops. When the plain vowel was spliced into the emphatic word,
95.5% responses reported a plain /s/, meaning that the high back vowel /u/ did
influence subjects’ perception. The mirror image of this manipulation, splicing the
emphatic vowel in the plain word, yielded perfect chance level responses split between
the plain and emphatic fricative detections. In terms of reaction time data, no significant
differences have been found under the different manipulations in this condition. One
finding is that reaction times were generally shorter for words that had the low front
vowel compared to those that had the high back vowel. As for the confidence ratings,
140
the results follow the pattern in the first two conditions. Subjects respond with high
confidence ratings averaging 4.7.
While the results are neat and tidy in the environment of the low front vowel /a/, the
change in vowel quality to /u/ does not yield consistent results. In the environment of
the vowel /a/, wherever the vowel is spliced, subjects’ perception is driven by it. So, if
the vowel comes from an emphatic word, the perception of the target consonant is
largely /t¿/ or /s¿/, and vice versa. In the context of the high back vowel /u/, these results
do not form a pattern. Sometimes subjects report hearing a plain consonant by high
percentages and sometimes their perception is close to or at a chance level. The
intrinsic characteristics of the high back vowel, namely that the difference between F1
and F2 is too small compared to that in /a/, leaves less room for F2 to decline and F1 to
rise in the emphatic environments. This leads to inconsistent results in the perception of
emphasis under the conditions presented in this chapter. Another consistent result that
supports this finding is that reaction times to words with low front vowel were
consistently shorter compared to these with high back vowels.
The results of this study show that perception of emphasis is not driven by the target
consonant. Although emphasis is triggered by the existence of an emphatic segment,
which is always an obstruent, its effects are observed on the vowel. The low front
vowel is more susceptible to emphasis than the high back vowel. While emphasis is
clearly realized on the low front vowel, it is not as salient in the context of high back
141
vowels. This is due to the fact that there is more room for F1 rising and F2 lowering in
the context of low front vowels than in the context of high back vowels.
142
Chapter Five
Summary, discussion and conclusion
The present study examined the acoustic and perceptual correlates of emphasis in
Urban Jordanian Arabic. The study consisted of three main experiments: two acoustic
experiments: one on monosyllabic words, and another on bisyllabic words, and a
perception experiment. In the acoustic experiment on monosyllables, four pairs of plain
and emphatic consonants were used along with the six vowels of UJA to create a list of
48 minimal pairs of monosyllabic words of the CVC structure. The target consonants
were either word-initial or word-final. The duration of the plain and emphatic
consonants was measured, and their spectral moments and locus equations were
calculated. The three formant frequencies of the vowel, F1, F2, and F3 were measured
at onset, temporal midpoint and offset of the vowel. As for the acoustic experiment on
bisyllables, the wordlist consisted of 80 target CV.CVC pairs where the target
consonant was word-initial, word-medial or word-final. The perception of emphasis
was also investigated by means of splicing and swapping the target consonants and
vowels. Four pairs of words were selected from the stimuli used in the acoustic
experiment. The perception experiment was used to test whether the results in the
acoustic study are perceptually valid.
143
5.1. Summary of results
5.1.1. Acoustic study: the monosyllables
The results for the acoustic study show that emphatic consonants are longer than their
plain counterparts. This difference is, however, limited to fricatives in word-final
positions. Word-initial and word-final stops do not show significant differences in plain
and emphatic positions. Additionally, fricatives in word-initial positions are not
significantly different. Target vowel duration was not significantly different in plain
and emphatic environments. Only one of the four spectral moments measured showed
significant differences; the center of gravity is significantly higher for emphatic stops
and lower for emphatic fricatives in word-final positions. Locus equations showed that
emphatic consonants had a lower mean slope compared to their plain counterparts
while y-intercept values showed no significant differences.
As expected, vowel F1 was significantly higher in emphatic environments. However,
this effect was strongest closest to the target consonant. There was often a tendency for
this effect to fade as the measurement was taken further away from the target
consonant. F2 was consistently lower in emphatic environments. The results for F3
were similar to those attested for F1: F3 increased in emphatic environments but this
increase faded away as the measurement was taken further away from the target
consonant.
Several
interactions
were
observed
for
the
formant
frequency
measurements. The formant frequency values were often affected by vowel quality.
High front and low front vowels were less resistant to emphasis effects compared to
144
high back vowels. In other words, where the formant frequency was expected to
increase or decrease, it did so by a greater extent in the environment of high and low
front vowels and by a lesser extent in the environment of high back vowels. Sporadic
interactions were found for position of the target consonant, manner of articulation,
vowel length and word type. No interactions were observed for voicing and gender.
5.1.2. Acoustic study: the bisyllables
Measurements on consonant and vowel durations and spectral moments for consonants
did not reveal consistent differences. Measurements on the bisyllables focused on
spectral measurements for vowels. For most of the data, vowel F1 and F3 significantly
increase but vowel F2 significantly decreases in emphatic environments. Formant
values usually show these results when the measurements are taken for the target
vowels: that is, vowels in the same syllable with the target consonants. As the
measurements are taken outside of the target vowel, effects of emphasis tend to fade
away. This is clearer when the measurements are taken on the left rather than right of
the target vowel, thus showing more prominence for anticipatory emphasis spread.
Furthermore, the effects of emphasis are clearer in the environments of high and low
front vowels compared to high back vowels. This was also attested for monosyllables.
Finally, measurements on vowel formant frequency show greater plain – emphatic
differences for males than females, thus suggesting that emphasis is more salient in
male speech.
145
5.1.3. The perception study
As for the perception study, the two types of consonants, stop and fricative, were
included along with two types of vowel quality: low front and high back. Three
conditions were used in this study: the original words spliced, the target consonant
swapped, and the target vowel swapped. Results for the first condition show that
subjects responded with a high degree of accuracy. That is, they identified each target
consonant as expected with an accuracy rate ranging from 99 – 100%. In the second
condition, results showed that subjects identified the consonant based on the vocalic
environment and not on the target consonant itself. If the target consonant was plain but
spliced in an emphatic vocalic environment, subjects reported an emphatic consonant,
and vice versa. The mirror image was detected for the third condition – splicing the
target vowel. If the vowel came from the emphatic word and was spliced into the plain
environment, subjects detected an emphatic target consonant, and vice versa. These
results, however, were influenced by vowel quality. If the vowel was low front, it
carried the effects of emphasis. If the vowel was high back, in most cases, responses
were close to a chance level.
5.2. Discussion
The results of the present study are in line with previous research on emphasis in the
areas where similar measurements and procedures have been adopted. Previous
findings for the duration of the target consonant showed that emphatic consonants were
146
significantly longer than plain ones, see Obrecht (1968). In the present study, the target
vowel durations were not significantly different. There have been two accounts in the
literature with contradicting findings for target vowel durations. The acoustic study of
the bisyllables shows that durational differences are not consistently influenced by the
presence of emphatic consonants. It seems that consonant and vowel duration are not a
stable cue for emphasis, especially in the case of short vowels. The allophonic changes
in the Arabic vowels seem to be easier to capture in quality rather than quantity. The
investigations regarding the spectral moments did not yield significant differences.
Apart from the findings on the center of gravity, which showed significant differences
in monosyllables, none of the other moments turned out to show significant differences.
While Norlin (1987) found significant lowering in the center of gravity for word-initial
emphatic sibilants, the present study reports such findings in the same direction only for
word-final fricatives and in the opposite direction for word-final stops. None of the
moments was significantly different in word-initial position. It should be noted here
that the measurements for spectral moments were characterized by a wide degree of
variability depending on the point at which the moments were calculated. This
variability was inevitable due to the differences in burst duration for stops. Based on
findings for the monosyllables, and due to the unclear effect of consonantal spectral
moments, moments were not reported in the study on the bisyllables.
As reported by almost all studies that investigated the acoustics of emphasis, vowel F2
turns out to be a very salient cue for emphasis. Vowel F2 drops in emphatic positions
147
compared to their plain counterparts. This is true for vowels in monosyllables and in
bisyllables as well. In monosyllables, Vowel F2 in words with initial target consonants
drops by 516 Hz, 253 Hz, and 165 Hz at onset, temporal midpoint and offset positions,
respectively. For words with final target consonants, F2 drops by 228 Hz, 248 Hz and
462 Hz at onset, temporal midpoint and offset positions, respectively. It should be
noted here that we are looking at the mirror image in word-initial and word-final
positions. While F2 onset is closest to the target consonant in words with initial
emphatics, F2 offset is closest to the target consonant in words with final emphatics.
This points to the gradient nature of emphasis spread within the same word, (cf.
Zawaydeh 1999; Al-Masri and Jongman 2004). F1 and F3 are higher in emphatic
positions compared to their plain counterparts. While the gradient effect is evident in
the degree of drop in F2 in emphatic positions, it is similarly evident in the results for
F1 and F3 in that the increase in emphatic positions diminishes as the measurement is
taken further away from the target consonant. Additionally, the extent of emphasis was
not similar for all vowels. It is worth noting that emphasis is realized more in the
environment of high front and low front vowels compared to high back vowels where
the relatively compact spectrum for /u/ limits the degree of formant increase or
decrease. This leads us to predict that high back vowels are more likely to block
emphasis spread compared to high front and low front vowels.
148
In terms of locus equations, the set of emphatic consonants in UJA patterned like it did
in other dialects of Arabic (Yeou, 1997) with flatter slopes compared to their plain
counterparts.
In bisyllables, the results pattern similarly. In addition, measurements on the bisyllables
give us a clearer idea about emphasis spread. It shows that emphasis clearly crosses the
left syllable boundary but is confined in the right direction to the target syllable.
Further, measurements on the bisyllables show a clear influence of vowel quality,
whereby the high back vowel /u/ seems to be a stronger blocker compared to the high
front and high back vowels. Finally, the magnitude of the plain-emphatic difference
suggests a greater tendency among males for pronouncing emphasis compared to
females.
The third experiment illustrates the perceptual consequences of the findings in the
acoustic study. The most salient findings on the acoustics of emphasis carried over to
perception, indicating that the target vowel drives the perception of emphasis. In
addition to this result, the more intricate findings on vowel quality materialized in
perception. Perception of emphasis was carried by low front vowels but not by high
back vowels. Despite the differences in the experimental design, these findings
replicate reports in the literature (Obrecht, 1968).
5.3. Conclusion
The findings reported in this study are significant in several ways. Experimental
research on Arabic has been limited compared to other languages. It is only recently
149
that more similar studies are reported. The previous literature on Arabic, especially the
literature written in Arabic, has been mostly descriptive. The present study lends itself
to the experimental domain. The findings reported in the present study should be
incorporated in the pedagogical fields, especially now that textbooks on Arabic are in
demand. In the absence of the emphasis distinction in most world languages, it is
crucial that the methods adopted in presenting emphasis as a phonemic distinctive
feature in Arabic be relevant to the tools available for the second language learner.
Rather than focusing on the differences between the consonants, which is how most
native speakers of Arabic think they perceive emphasis, the differences in the vowels
should be highlighted.
5.4. Follow-up
This study is basically one step on a long way. Several aspects of emphasis are still
wanting. While the acoustic correlates of monosyllables and bisyllables have been
systematically presented, analysis of emphasis spread beyond the neighboring syllable
is still lacking. Studying the acoustics of trisyllables should complete the picture. It
should allow us to understand the extent and directionality of emphasis spread, possible
consonantal and vocalic segments that block emphasis spread or limit it to its minimum
domain. Moreover, the investigation of the possible role of amplitude and spectral
moments and locus equations for consonants in adjacent syllables is warranted.
Studying the effects of these factors on the perception of emphasis is yet another
possible area of investigation. In addition to the investigation of emphasis in
150
trisyllables, geminate and adjacent emphatics have not been discussed in the literature
so far. A follow-up study using these data would possibly complete the picture.
This is the first step in a series of more acoustic and perceptual experiments to cover all
the unexplored characteristics of the emphatic signal. Conducting these experiments
should allow for much needed contributions to the field of Arabic linguistics.
151
References
Al-Ani, S. (1970) Arabic Phonology. The Hague: Mouton.
Ali, L. & Daniloff, R. (1974) A contrastive cinefluorographic investigation of the
articulation of the emphatic – nonemphatic cognate consonants, Studia Linguistica, 26,
81 – 105.
Alioua, A. (1995). L'effet des consonnes d'arriere et des emphatiques sur la nature
acoustique des voyelles longues de l'Arabe litteral moderne. Unpublished doctoral
dissertation, Universite Laval, Montreal, Canada.
Al-Masri, M. and Jongman, A. (2004) Acoustic correlates of emphasis in Jordanian
Arabic: preliminary results. In Augustine Agwuele et al, (ed.)
Proceedings of the
2003 Texas Linguistics Society Conference. Somerville, MA: Cascadilla Proceedings
Project, 96-106.
Alosh, M. (1987) The perception and acquisition of pharyngealized fricatives by
American learners of Arabic and implications for teaching Arabic phonology. PhD
Dissertation, Ohio State University.
Alwan, A. (1983) Acoustic and perceptual correlates of pharyngeal and uvular
consonants. PhD Dissertation, MIT.
Alwan, A. (1989) Perceptual cues for place of articulation for the voiced pharyngeal
and uvular consonants, Journal of the acoustic society of America,
86, 549 – 556.
Bani Yasin, R and Owens, J. (1987) The phonology of a Northern Jordanian Arabic
dialect, ZDMG, 137, 291 – 331.
152
Card, E. (1983) A Phonetic and Phonological Study of Arabic Emphatics. PhD
Dissertation, Cornell University, University Microfilms International, Ann Arbor:
Michigan.
Davis, S. (1995) Emphasis in Grounded Phonology, Linguistic Inquiry, 26, 465 – 498.
Chiba, T. & Kajiyama, M. (1942) The vowel: Its nature and structure. Phonetic Society
of Japan: Tokyo
Delattre, P. (1971) Pharyngeal Features in the Consonants of Arabic, German, Spanish,
French and American English, Phonetica. 23, 129 – 155.
El-Halees, Y. (1985) The role of F1 in the place-of-articulation distinction in Arabic,
Journal of Phonetics, 13, 287 – 298.
El-Halees, Y. (1987) F2 and Back Articulation in Arabic, Journal of the College of
Arts, King Saud University, 14, 21 – 44.
Ferguson, C. (1956) The Emphatic l in Arabic, Language. 32, 446 – 452.
Ghowail, Thanaa Ibrahim (1987) The acoustic phonetic study of two pharyngeals /š, ¿/
and two laryngeals /½,h/ in Arabic. PhD Dissertation, Indiana University.
Giannini, A. and Pettorino, M. (1982) The emphatic consonants in Arabic. Istituto
Universitario Orientale: Napoli.
Herzallah R. (1990) Aspects of Palestinian Arabic Phonology: A non-linear approach.
PhD Dissertation, Cornell University.
Jongman, A., Wayland, R., & Wong, S. (2000) Acoustic characteristics of English
fricatives, Journal of the Acoustical Society of America. 108, 1252 – 1263.
153
Kahn, M. (1975) Arabic Emphasis: The Evidence for Cultural Determinants of
Phonetic Sex-Typing, Phonetica, 31, 38 – 50.
Kent, R. and Read, Ch. (1992) The Acoustic Analysis of Speech. Singular Publishing
Group: California.
Khattab, G., Al-Tamimi, F. & Heselwood, B. (2006) Acoustic and Auditory differences
in the /t/-/t/ opposition in male and female speakers of Jordanian Arabic. In Boudelaa,
S. (ed.) Perspectives on Arabic Linguistics XVI. John Benjamins Publishing Company:
Amsterdam/Philadelphia
Ladefoged, P., & Maddieson, I. 1996. The Sounds of the World’s Languages. Oxford:
Blackwell
Laufer, A. and Baer, T. (1988) The emphatic and pharyngeal sounds in Hebrew and in
Arabic. Language and Speech. 31, 181 – 205.
Laver, John. 1994. Principles of phonetics. Cambridge: Cambridge University Press.
Lehn, W. (1963) Emphasis in Cairo Arabic, Language 39, 129 – 139.
McCarthy, J. (1994) The phonetics and phonology of Semitic pharyngeals. In Norlin,
K. (1985) Pharyngealization in Cairo Arabic, Working papers, 28, Lund University:
Sweden.
Norlin, K. (1987) A phonetic study of emphasis and vowels in Egyptian Arabic. PhD
dissertation. Lund University: Sweden
Obrecht, D. (1968) Effects of the 2nd formant on the perception of velarized consonants
in Arabic. The Hague, Mouton, Paris.
154
Obrecht, D (1961) Effects of the second formant on the perception of velarized
consonants in Arabic. PhD Dissertation, University of Pennsylvania.
Royal, A. (1985) Male/female pharyngealization patterns in Cairo Arabic: a
sociolinguistic study of two neighborhoods. PhD Dissertation, University of Texas at
Austin.
Semaan, K. (1968) Linguistics in the Middle Ages: Phonetic Studies in Early Islam.
Leiden: E. J. Brill.
Suleiman, S. (1985) Jordanian Arabic between Diglossia and Bilingualism: Linguistic
analysis. John Benjamins Publishing Company: Amsterdam/Philadelphia
Sussman, H. et al. (1991) An investigation of locus equations as a source of relational
invariance for stop place categorization, Journal of the Acoustical Society of America,
90, 1309 – 1325.
Versteegh, K. (2001) The Arabic Language. Edinburgh University Press.
Wahba, K. (1993) A sociolinguistic treatment of the feature of emphasis in Egypt. PhD
Dissertation, Alexandria University: Egypt.
Watson, J. (1999) Remarks and Replies: the Directionality of Emphasis in Arabic.
Linguistic Inquiry, 30, 289 – 300.
Watson, J. (2002) The Phonology and Morphology of Arabic. Oxford UP.
Yeou, M. (1995) Trading relations between cues for the pharyngealized/non
pharyngealized contrast, Proceedings of the 13th ICPhSc, KTH and Stockholm
University, Stockholm, 464 – 467.
155
Yeou, M. (1997) Locus equations and the degree of coarticulation of Arabic
consonants. Phonetica, 54, 187 – 202.
Younes, M. (1994) On emphasis and /r/ in Arabic. In Hole, C. and Eid, M. (eds.)
Perspectives on Arabic linguistics, Vol. VI. Amsterdam: John Benjamins.
Zawaydeh, B. (1999) The Phonetics and Phonology of Gutturals in Arabic. PhD
Dissertation, Indiana University.
Zemánek, P. (1996) The origins of Pharyngealization in Semitic. Praha: Enigma
corporation, Ltd.
Zuraiq, W., & Zhang, J. (2006). Phonological assimilation in Urban Jordanian Arabic.
Kansas Working Papers in Linguistics, 28, 33-64.
156
Appendices
Appendix I: The acoustic study: the monosyllables:
Monosyllables: target word-initial
No
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
Plain
sab
sib
sub
sa:b
si:b
su:b
tab
tib
tub
ta:b
ti:b
tu:b
dab
dib
dub
da:b
di:b
du:b
ðab
ðub
ðib
ða:b
ðu:b
ði:b
Gloss
He cursed
He cursed
You curse
He left
You leave
Nonword
He perished
Nonword
You repent
He repented
Nonword
You repent
He fell
Nonword
Bear
It melted
Bear
You melt
Nonword
You protect
Nonword
It melted
You melt
Wolf
Emphatic
s¿ab
s¿ib
s¿ub
s¿a:b
s¿i:b
s¿u:b
t¿ab
t¿ib
t¿ub
t¿a:b
t¿i:b
t¿u:b
d¿ab
d¿ib
d¿ub
d¿a:b
d¿i:b
d¿u:b
ð¿ab
ð¿ub
ð¿ib
ð¿a:b
ð¿u:b
ð¿i:b
157
Gloss
He poured
Nonword
You pour
He scored
You score
Nonword
It touched
You recover
Nonword
He recovered
You recover
Brick
Nonword
Nonword
You keep
Nonword
Nonword
Nonword
Nonword
Nonword
Nonword
Nonword
Nonword
Nonword
Structure
CVC
Monosyllables: target word-final
No
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
Plain
bas
bis
bus
ba:s
bi:s
bu:s
bat
bit
but
ba:t
bi:t
bu:t
bad
bid
bud
ba:d
bi:d
bu:d
bað
bið
buð
ba:ð
bi:ð
bu:ð
Gloss
Enough!
Cat
Nonword
He kissed
Nonword
You kiss
He decided
You decide
You decide
He slept
You sleep
Shoes
Big stone
Nonword
Nonword
It ended
You kill; end
Nonword
Nonword
Nonword
Nonword
Nonword
Nonword
Nonword
Emphatic
bas¿
bis¿
bus¿
ba:s¿
bi:s¿
bu:s¿
bat¿
bit¿
but¿
ba:t¿
bi:t¿
bu:t¿
bad¿
bid¿
bud¿
ba:d¿
bi:d¿
bu:d¿
bað¿
bið¿
buð¿
ba:ð¿
bi:ð¿
bu:ð¿
158
Gloss
He saw
Nonword
You see
Bus
Nonword
Nonword
Nonword
Nonword
Nonword
Armpit
shoes
Nonword
Nonword
Nonword
Nonword
It laid eggs
White (pl)
Nonword
Nonword
Nonword
Nonword
Nonword
Nonword
Nonword
Structure
CVC
Appendix II: The acoustic study: the bisyllables:
Rightward spread
Bisyllables: (target C word-initial)
No
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
Plain
sabat
sibat
subat
saabat
siibat
suubat
thabat
thibat
thubat
thaabat
thiibat
thuubat
tabat
tibat
tubat
taabat
tiibat
tuubat
dabat
dibat
dubat
daabat
diibat
duubat
Gloss
he selpt
nonword
nonword
she left you
nonword
nonword
nonword
nonword
nonword
it melted
nonword
nonword
nonword
nonword
nonword
she repented
nonword
nonword
nonword
nonword
nonword
it melted
nonword
nonword
Emphatic
sabat
sibat
subat
saabat
siibat
suubat
thabat
thibat
thubat
thaabat
thiibat
thuubat
tabat
tibat
tubat
taabat
tiibat
tuubat
dabat
dibat
dubat
daabat
diibat
duubat
159
Gloss
nonword
nonword
nonword
she touched you
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
she recovered
kindness
his brick
nonword
nonword
nonword
nonword
nonword
nonword
Structure
CV.CVC
Leftward spread
Bisyllables: (target C word-final)
No
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
Plain
tabas
tabis
tabus
tabaas
tabiis
tabuus
tabath
tabith
tabuth
tabaath
tabiith
tabuuth
tabat
tabit
tabut
tabaat
tabiit
tabuut
tabad
tabid
tabud
tabaad
tabiid
tabuud
Gloss
nonword
nonword
nonword
nonword
nonword
she kisses
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
she sleeps
she sleeps
nonword
nonword
nonword
nonword
nonword
nonword
nonword
Emphatic
tabas
tabis
tabus
tabaas
tabiis
tabuus
tabath
tabith
tabuth
tabaath
tabiith
tabuuth
tabat
tabit
tabut
tabaat
tabiit
tabuut
tabad
tabid
tabud
tabaad
tabiid
tabuud
160
Gloss
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
it lays eggs
nonword
Structure
CV.CVC
Bidirectional spread
Bisyllables: (target C word-medial)
No
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
Plain
kasak
kasik
kasaak
kasiik
kisak
kisik
kisaak
kisiik
kathak
kathik
kathaak
kathiik
kithak
kithik
kithaak
kithiik
katak
katik
kataak
katiik
kitak
kitik
kitaak
kitiik
kadak
kadik
kadaak
kadiik
kidak
kidik
kidaak
kidiik
Gloss
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
Emphatic
kasak
kasik
kasaak
kasiik
kisak
kisik
kisaak
kisiik
kathak
kathik
kathaak
kathiik
kithak
kithik
kithaak
kithiik
katak
katik
kataak
katiik
kitak
kitik
kitaak
kitiik
kadak
kadik
kadaak
kadiik
kidak
kidik
kidaak
kidiik
161
Gloss
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
nonword
Structure
CV.CVC