AUC Philologica (Acta Universitatis Carolinae Philologica) je akademický časopis publikující jak lingvistické, tak literárně historické a teoretické studie. Nedílnou součástí časopisu jsou i recenze odborných knih a zprávy z akademického prostředí.
Časopis je indexován v databázích CEEOL, DOAJ, EBSCO a ERIH PLUS.
AUC PHILOLOGICA, Vol 2022 No 1 (2022), 83–95
Intra- and inter-speaker variability of vowel space using three different formant extraction methods
Alžběta Houzar, Radek Skarnitzl
DOI: https://doi.org/10.14712/24646830.2022.30
zveřejněno: 17. 01. 2023
Abstract
Individual speakers’ voices display various unique patterns, one of the most prominent of which is vowel articulation. This study focuses on vowel space properties of 15 Czech speakers in read and spontaneous speech, comparing outputs of three formant extraction methods, measuring formants: (1) in the vowels’ temporal midpoints, (2) as their mean from the vowels’ middle thirds, and (3) in the vowels’ articulatory targets. The results show extensive variability across speakers, but also great within-speaker variability between the two speech styles, with spontaneous speech manifesting more centralised vowel pronunciation than read utterances. The first two measurement methods did not yield systematically different results, while formant values extracted from acoustically defined articulatory targets lead to noticeably larger vowel spaces. The results suggest that care should be taken when interpreting formant values obtained by different methods.
klíčová slova: vowel space area; vowel formants; intra-speaker variability; inter-speaker variability; Czech
reference (22)
1. Boersma, P., & Weenink, D. (2015). Praat: doing phonetics by computer (Version 6.0). Retrieved from http://www.praat.org
2. Cavalcanti, J. C., Eriksson, A., & Barbosa, P. A. (2021). Acoustic analysis of vowel formant frequencies in genetically-related and non-genetically related speakers with implications for forensic speaker comparison. Plos One, 16(2), e0246645. CrossRef
3. Fletcher, A. R., McAuliffe, M. J., Lansford, K. L., & Liss, J. M. (2015). The relationship between speech segment duration and vowel centralization in a group of older speakers. Journal of the Acoustical Society of America, 138(4), 2132-2139. CrossRef
4. Fletcher, A. R., McAuliffe, M. J., Lansford, K. L., & Liss, J. M. (2017). Assessing vowel centralization in dysarthria: A comparison of methods. Journal of Speech, Language, and Hearing Research, 60(2), 341-354. CrossRef
5. Fuchs, S. (2017). Changes and challenges in explaining speech variation: A brief review. Available at: https://www.researchgate.net/publication/320991961_Changes_and_challenges_in_explaining_ speech_variation_A_brief_review.
6. Jacewicz, E., Fox, R. A., & Salmons, J. (2007). Vowel space areas across dialects and gender. In Proceedings of the 16th ICPhS, 1465-1468.
7. Machač, P., & Skarnitzl, R. (2009). Principles of phonetic segmentation. Epocha.
8. Nolan, F., & Grigoras, C. (2005). A case for formant analysis in forensic speaker identification. International Journal of Speech, Language and the Law, 12(2), 143-173. CrossRef
9. Pettinato, M., Tuomainen, O., Granlund, S., & Hazan, V. (2016). Vowel space area in later childhood and adolescence: Effects of age, sex and ease of communication. Journal of Phonetics, 54, 1-14. CrossRef
10. Pollák, P., Volín, J., & Skarnitzl, R. (2007). HMM-Based Phonetic Segmentation in Praat Environment. Proceedings of SPECOM 2007, 537-541. MSLU.
11. R Core Team (2021). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Available at: https://www.R-project.org/.
12. Reetz, H., & Jongman, A. (2009). Phonetics: Transcription, production, acoustics, and perception. Blackwell.
13. Rose, P. (2015). Forensic voice comparison with monophthongal formant trajectories - a likelihood ratio-based discrimination of "schwa" vowel acoustics in a close social group of young Australian females. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4819-4823. CrossRef
14. Simpson, A., & Ericsdotter, C. (2007). Sex-specific differences in f0 and vowel space. In Proceedings of the 16th ICPhS, 933-936.
15. Skarnitzl, R., Vaňková, J., & Bořil, T. (2015). Optimizing the extraction of vowel formants. In: Niebuhr, O. & Skarnitzl, R. (Eds.), Tackling the complexity in speech, 165-182. Charles University, Faculty of Arts.
16. Skarnitzl, R., & Vaňková, J. (2017). Fundamental frequency statistics for male speakers of Common Czech. Acta Universitatis Carolinae - Philologica, 3, 7-17. CrossRef
17. Skarnitzl, R., & Volín, J. (2012). Referenční hodnoty vokalických formantů pro mladé dospělé mluvčí standardní češtiny. Akustické listy, 18, 7-11.
18. Story, B. H., & Bunton, K. (2017). Vowel space density as an indicator of speech performance. Journal of the Acoustical Society of America, 141(5), EL458-EL464. CrossRef
19. Šimáčková, Š., Podlipský, V. J., & Chládková, K. (2012). Czech spoken in Bohemia and Moravia. Journal of the International Phonetic Association, 42(2), 225-232. CrossRef
20. Tykalová, T., Škrabal, D., Bořil, T., Čmejla, R., Volín, J., & Rusz, J. (2021). Effect of Ageing on Acoustic Characteristics of Voice Pitch and Formants in Czech Vowels. Journal of Voice, 35(6), 931.e21-931.e33. CrossRef
21. Weirich, M., & Simpson, A. (2013). Investigating the relationship between average speaker fundamental frequency and acoustic vowel space size. Journal of the Acoustical Society of America, 134(4), 2965-2974. CrossRef
22. Wickham H (2016). ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York. Available at: https://ggplot2.tidyverse.org/. CrossRef
Intra- and inter-speaker variability of vowel space using three different formant extraction methods is licensed under a Creative Commons Attribution 4.0 International License.
230 x 157 mm
vychází: 3 x ročně
cena tištěného čísla: 150 Kč
ISSN: 0567-8269
E-ISSN: 2464-6830