ISSP2014Proceedings Weirich Simpson Final
ISSP2014Proceedings Weirich Simpson Final
net/publication/261376774
CITATIONS READS
8 2,224
2 authors:
All content following this page was uploaded by Melanie Weirich on 07 April 2014.
For several languages it has been found that female speakers Articulatory recordings of 5 male and 4 female German speak-
have a larger acoustic vowel space than male speakers (e.g. ers were made at Potsdam University with the NDI-Wave
Diehl et al. (1996) for American English; Whiteside (2001) for system. The speakers were between 23 and 43 years old and
British English; Weirich & Simpson (2014) for German). revealed no known speech or hearing impairments. All of
Different reasons have been held responsible for this including them came from the Eastern Central German dialect area but
behavioral and physiological factors. Despite having a larger showed very little dialectal influence based on the auditory
acoustic vowel space, females have been found to have a impression of the authors. Altogether, six coils were attached
smaller articulatory space (Simpson 2001, 2002). Furthermore, to the tongue, the lips, and the lower jaw. Four coils, one
Simpson (1998) found sex-specific differences in correlations above the upper incisors, one at the bridge of the nose and two
between formant values and duration. While male speakers behind the participants’ ears served as reference coils which
showed the expected significant relationship in read speech could be used to compensate for head movements. The bite
between duration and e.g. F1 of /aː oː ɔ ʊ/, female speakers plane was established using three sensors attached to a set
often did not exhibit this relationship. From this and the mis- square which a speaker held between his/her teeth.
match between articulatory and acoustic vowel spaces the The speech material presented here is twofold and part of a
question arises whether there might be sex-specific differences larger corpus comprising 20 different target words in varying
in terms of undershoot. In other words, do females reach their accent conditions.
articulatory targets more often than males? The first set of data used in the current study includes the three
The phenomenon of undershoot has been found cross- point vowels /aː uː iː/ contained in the vowel sequences in the
linguistically in many studies investigating the role of accent abbreviation IAA, AUU and BII. The articulatory positions of
and lexical stress (Öhmann 1967, Fowler 1981, Rietveld & the vowels were expected to be extreme and only minimally
Koopmans-van Beinum 1987, de Jong 1995, Palethorpe et al. effected by coarticulatory influences due to their temporally
1999, Harrington et al. 2000, Cho 2004). Vowel undershoot privileged occurrence in the abbreviations. Each target word
patterns have been explained both in terms of paradigmatic was repeated 10-12 times and embedded in a carrier sentence,
enhancement and by varying degrees of coarticulation. With e.g. Sie fuhren letzte Woche zur IAA ganz schnell (‘They went
respect to paradigmatic enhancement, more peripheral vowels to the IAA very fast last week’).
are supposed to be found in stressed or accented positions, The second set of data comprises the sequence /gV/ with V
whereas in unstressed or unaccented positions vowel centrali- being /iː ɪ eː ɛ aː a oː ɔ uː ʊ/ in the name GVbi embedded in the
zation is expected. Following the coarticulatory explanation, carrier sentence Ich sah GVbi an (‘I looked at GVbi’). For the
undershoot in unstressed or unaccented positions is a product /gV/-material, three different accent conditions were recorded.
of a higher degree of coarticulation related to contextual reduc- First, the participants were asked to read the sentences pre-
tion. Vowel reduction due to increased coarticulation is ex- sented to them from a screen (neutral condition, n). Second,
plained by target undershoot reflecting the shorter durations in speakers produced the name in response to questions from the
unstressed/unaccented positions. Mooshammer & Geng (2008) experimenter. For the accented condition (a) the experimenter
examined acoustic and articulatory vowel reduction patterns of asked 1) Sahst du /gVbi/ oder /gVbi/ an? (‘Did you look at
tense and lax vowels in German in stressed and unstressed /gVbi/ or /gVbi/?’), eliciting in the reply an accentuation of the
syllables. They found a higher degree of coarticulation in name. For the unaccented condition (u) the experimenter asked
unstressed vowels than in stressed vowels. Thus, a relationship 2) Siehst du oder sahst du /gVbi/ an? (‘Do you or did you look
at /gVbi/?’), eliciting in the reply an accentuation of the verb. mm² vs. 93 mm²). Moreover, the dimensions of the space
Each vowel was again repeated 10-12 times in each accent differ between the sexes: while the male speakers reveal a
condition. The sentences were recorded in a randomized order larger vertical than horizontal expansion (around 1.3 times
with additional filler sentences in between. larger), the relationship between vertical and horizontal expan-
sion for the females is ca. 1 (or even less). A significant differ-
2.2. Articulatory labeling and analyses ence between males and females was found for the Euclidean
distance (ED) between /iː/ and /aː/ with a larger distance for
All articulatory labeling was done with the help of mview
males (Welch two-sample t-test, t = -2.7, df = 5.9, p < .05).
(software written by M. Tiede). Here, we will concentrate on
males females
the horizontal and vertical position of the backmost lingual
sensor (tongue dorsum). First, the articulatory position at the
5 10
5 10
acoustic midpoint of the double vowel sequences were meas-
TBy_trans0
TBy_trans0
ured (cf. Figure 1, left graph).This provided a notional extreme
0
articulatory vowel space for each speaker (the IAU-polygon).
To compare the articulatory positions across speakers, the data
-10
-10
was translated with the midpoint set to the origin (0/0). The
vowel space size of the IAU-polygon and the Euclidean Dis- -15 -5 5 15 -15 -5 5 15
tances (EDs) between the vowels were calculated. Also, the TBx_trans0 TBx_trans0
EDs from the midpoint of the polygon to each vowel were
measured for each speaker. Second, the start and end of the Figure 2: Vowel spaces for male (left) and female (right)
opening gesture /gV/ was labeled for each speaker, stimulus speakers centered around 0/0
and accent condition oriented on the tangential velocity of the
tongue dorsum sensor (cf. Figure 1, right graph). The end When we look at the articulatory positions of the different
points of the gestures (which correspond to the vowels) were vowels, we can see that the sexes differ particularly in /aː/ and
used to parameterize the polygon area spanned by the vowels /iː/, while the positions for /uː/ overlap (Figure 3).
contained in the /gV/-sequences and the EDs from the mid-
point. Third, each speaker’s /gV/-polygon was normalized by
expressing it as a percentage of the temporally privileged IAU-
5
TB_y (mm)
vowel space. The percentages were calculated for the polygon
0
size and the EDs between vowels and midpoint.
For statistical analyses, Welch two sample t-tests and linear
-5
and the random factors speaker and repetition. Likelihood ratio Figure 3: Vowel spaces of the /IAU/-polygon for all male
tests were used for model comparisons with different factors (blue) and female (red) speakers
included to find the model with the best fit to the data.
This is mirrored in the statistical results. For both vertical and
horizontal tongue position as dependent variable, LMMs
revealed the best fit to the data with the interaction term
Sex*Vowel included. For the horizontal tongue position,
significant differences between males and females were found
for /aː/, for the vertical tongue position males and females
differ significantly in /aː/ and /iː/ (see Table 1). Thus, the
female vowel space is smaller in terms of a higher and more
fronted position for /aː/ and a lower position for /iː/.
f m
Polygon tense (mm²)
5
60
TB_y (mm)
TB_y (mm)
5
40
-5
-5
f m f m f m 0
-15
-15
n a u n a u
15
15
TB_y (mm)
TB_y (mm)
0 5
0 5
the EDs from the midpoint of the vowel space to each vowel
were used as dependent variable for the statistical analysis
-10
(and not the overall polygon size). Model comparisons showed -10
that the LMM with the interaction terms Sex*Accent and 30 40 50 60 30 40 50 60
Sex*Vowel as fixed factors revealed the best fit to the data
TB_x (mm) TB_x (mm)
(random factors included were speaker and repetition). Table 2
Figure 5: Tongue dorsum positions during the IAU (extreme)
gives the summary statistics for the fixed effects. To summa-
rize, there is a significant difference between males and fe- vowel space (red) and /gV/-sequence of two male (M1, M2)
and two female (F1, F2) speakers
males in the EDs only for the vowel /aː/. Second, males show
a significant difference between the accented and unaccented
Analogous to Figure 4 above, Figure 6 shows the polygon
condition, while females do not. Since there was no three-way
interaction of Accent, Vowel and Sex, this sex-specific differ- spanned by the tense vowels of the /gV/-sequence separated by
sex and accent condition (left graph). However, this time the
ence in the effect of accent seems to hold for all vowels.
values are expressed in percent of the IAU-polygon (calculat-
ed for each speaker and accent condition separately). Interest-
Table 2: Summary statistics for factors Sex, Vowel and Accent,
dependent variable is absolute ED between midpoint and all ingly, the males reveal smaller values than the females in
every accent condition, but especially in the unaccented one.
tense vowels (in mm) (default level: vowel /aː/ in accented
condition for females, observations:1104, rep:12, speakers:9) The right graph shows the EDs from the midpoints to the tense
vowels /iː aː uː/ expressed as percent of the EDs measured in
Estimate Std.Error t-value pMCMC
the IAU-polygon. Here again, the males differ between the
(Intercept) 5.1 0.6 9.1 <.001
Accent [a vs. n] 0.5 0.2 2.7 <.001 accented and unaccented condition, while the females do not.
Accent [a vs. u] -0.1 0.2 -0.3 0.8
ED MP-tense vowels (%)
f m
Sex [f vs. m] 2.3 0.7 3.1 <.01
Polygon tense (%)