AUC Philologica (Acta Universitatis Carolinae Philologica) is an academic journal published by Charles University. It publishes scholarly articles in a large number of disciplines (English, German, Greek and Latin, Oriental, Romance and Slavonic studies, as well as in phonetics and translation studies), both on linguistic and on literary and cultural topics. Apart from articles it publishes reviews of new academic books or special issues of academic journals.
The journal is indexed in CEEOL, DOAJ, EBSCO, and ERIH PLUS.
AUC PHILOLOGICA, Vol 2017 No 3 (2017), 35–44
Temporal variability of fundamental frequency contours
Robin Hruška, Tomáš Bořil
DOI: https://doi.org/10.14712/24646830.2017.31
published online: 01. 09. 2017
abstract
Intonation is one of the means of performing a speech style. Thus, observing pitch variation in an utterance may be a clue to identifying speech style. We design a cumulative slope (CS) index based upon the amount of pitch variation in a measured F0 contour and the duration of that contour. The more pitch changes there are and the greater their frequency range is, the greater the CS index is. This is confirmed by an experiment we conduct: the CS index of utterances with expressive intonation is higher than that of utterances with neutral intonation, and for utterances with neutral intonation the CS index is higher than for utterances with monotonous or flat intonation. However, as there is a great variability between speakers, the CS index as defined currently, cannot be used to universally differentiate between the styles. Results obtained using automatic voice activity detection (VAD) are close to those obtained with manual VAD and thus the extraction of CS index can be reliably automatized.
keywords: fundamental frequency; melody of speech; stylization; variability of pitch contours
references (14)
1. Bates, D., Maechler, M., Bolker, B. & Walker, S. (2015). Fitting Linear Mixed-Effects Models Using lme4. Journal of Statistical Software, 67(1), 1–48. CrossRef
2. Boersma, P. & Weenink, D. (2016). Praat: doing phonetics by computer [Computer program]. Version 6.0.14, retrieved 11 February 2016 from http://www.praat.org.
3. Bořil, T. & Skarnitzl, R. (2016). Tools rPraat and mPraat. In: P. Sojka, A. Horák, I. Kopeček & K. Pala (Eds.), Text, Speech, and Dialogue, 367–374. Berlin: Springer International Publishing. CrossRef
4. Fujisaki, H. & Ohno, S. (1997). Comparison and assessment of models in the study of fundamental frequency contours of speech. In: INT – 1997, 131–134.
5. Hermes, D.J. (2006). Stylization of pitch contours. In: S. Sudhoff et al. (Eds.), Methods in empirical prosody research, 29–61. Berlin: Walter de Gruyter. CrossRef
6. Hirst, D. & Espesser, R. (1993). Automatic modelling of fundamental frequency using a quadratic spline function. In: Travaux de l'Institut de Phonétique d'Aix, Vol. 15, 75–85.
7. Hruška, R. (2016). Properties of fundamental frequency contours in segmental contexts. Unpublished bachelor thesis. Prague: Institute of Phonetics, Faculty of Arts, Charles University.
8. Laan, G. P. M. (1996). The Contribution of Intonation, Segmental Durations, and Spectral Features to the Perception of a Spontaneous and a Read Speaking Style. Amsterdam: Institute of Phonetic Sciences, University of Amsterdam.
9. Mertens, P. & d'Alessandro, C. (1995). Pitch contour stylization using a tonal perception model. In: Proceedings of 13th International Congress of Phonetic Sciences, Vol. 4, 228–231.
10. Mertens, P. (2004). The Prosogram: Semi-Automatic Transcription of Prosody based on a Tonal Perception Model. In: B. Bel & I. Marlien (Eds.), Proceedings of Speech Prosody 2004, Nara (Japan): ISCA.
11. Palková, Z. (1997). Modelling intonation in Czech: Neutral vs. marked TTS F0-patterns. In: Intonation: Theory, Models, and Applications. Athens, Greece.
12. R Core Team (2017). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org.
13. Sohn, J., Kim, N. S. & Sung, W. (1999). A statistical model-based voice activity detection. IEEE Signal Processing Lett., 6 (1), 1–3. CrossRef
14. Volín, J. & Bořil, T. (2013). General and speaker-specific properties of F0 contours in short utterances. AUC Philologia 1/2014, Phonetica Pragensia XIII, 9–20.
Temporal variability of fundamental frequency contours is licensed under a Creative Commons Attribution 4.0 International License.
230 x 157 mm
periodicity: 3 x per year
print price: 150 czk
ISSN: 0567-8269
E-ISSN: 2464-6830