AUC Philologica (Acta Universitatis Carolinae Philologica) is an academic journal published by Charles University. It publishes scholarly articles across a range of disciplines (English, German, Greek and Latin, Oriental, Romance and Slavonic studies, as well as phonetics and translation studies), on linguistic as well as literary and cultural topics. In addition to articles, it publishes reviews of new academic books and of special issues of academic journals.
The journal is indexed in CEEOL, DOAJ, EBSCO, and ERIH PLUS.
AUC PHILOLOGICA, Vol 2017 No 3 (2017), 7–17
Fundamental frequency statistics for male speakers of Common Czech
Radek Skarnitzl, Jitka Vaňková
DOI: https://doi.org/10.14712/24646830.2017.29
published online: 01. 09. 2017
abstract
In speaker identification, a forensic phonetician’s task often involves comparing the voices of two or more speakers and assessing their similarity, but also their typicality. For the latter, it is necessary to have background information about the relevant speaker population. This paper introduces the database of Common Czech, which was compiled as a reference database, and presents the first set of statistics pertaining to fundamental frequency (F0). The population statistics are computed from a reading task and from spontaneous speech. The results confirm the superiority of the F0 baseline over mean or median values when assessing typicality and demonstrate, in many speakers, a narrower intonation range in spontaneous speech than in reading. The role of F0 in speaker comparison is also discussed.
keywords: fundamental frequency; forensic phonetics; speaker identification; Czech
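As a rough illustration of the per-speaker statistics named in the abstract, the sketch below computes an F0 mean, median, baseline and intonation range from a list of frame-wise F0 values. It is not the authors' procedure: taking the baseline as the 7.64th percentile and the range as the 10th-to-90th-percentile span in semitones are assumptions made for this example, and the function names and sample values are hypothetical.

```python
import math
import statistics


def percentile(values, p):
    """Linear-interpolation percentile (p in 0..100) of a non-empty sequence."""
    ordered = sorted(values)
    if len(ordered) == 1:
        return ordered[0]
    rank = (len(ordered) - 1) * p / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def semitones(f_high, f_low):
    """Distance between two frequencies in semitones (12 per octave)."""
    return 12.0 * math.log2(f_high / f_low)


def f0_summary(f0_hz, baseline_pct=7.64):
    """Summarise one speaker's F0 track given in Hz.

    Frames with a value of 0 are treated as unvoiced and skipped.
    baseline_pct is an assumed definition of the F0 baseline as a low
    percentile; the range uses an assumed 10th-90th percentile span.
    """
    voiced = [f for f in f0_hz if f > 0]
    if not voiced:
        raise ValueError("no voiced frames in the F0 track")
    return {
        "mean_hz": statistics.fmean(voiced),
        "median_hz": statistics.median(voiced),
        "baseline_hz": percentile(voiced, baseline_pct),
        "range_st": semitones(percentile(voiced, 90), percentile(voiced, 10)),
    }


# Hypothetical F0 track: one unvoiced frame and one high excursion.
summary = f0_summary([0, 100, 110, 120, 130, 200])
for name, value in summary.items():
    print(f"{name}: {value:.2f}")
```

In this made-up track the single high value pulls the mean well above the median, while the low-percentile baseline stays close to the bottom of the speaker's range. Running the same function separately on a reading recording and a spontaneous recording would allow the two range values to be compared.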
Fundamental frequency statistics for male speakers of Common Czech is licensed under a Creative Commons Attribution 4.0 International License.
230 x 157 mm
periodicity: 3 x per year
print price: 150 czk
ISSN: 0567-8269
E-ISSN: 2464-6830