Remember Me
Or use your Academic/Social account:


Or use your Academic/Social account:


You have just completed your registration at OpenAire.

Before you can login to the site, you will need to activate your account. An e-mail will be sent to you with the proper instructions.


Please note that this site is currently undergoing Beta testing.
Any new content you create is not guaranteed to be present to the final version of the site upon release.

Thank you for your patience,
OpenAire Dev Team.

Close This Message


Verify Password:
Verify E-mail:
*All Fields Are Required.
Please Verify You Are Human:
fbtwitterlinkedinvimeoflicker grey 14rssslideshare1
Proutskova, Polina; Rhodes, Christophe; Wiggins, Geraint; Crawford, Tim (2012)
Languages: English
Types: Unknown
This paper presents a new reference dataset of sustained, sung vowels with attached labels indicating the phonation mode. The dataset is intended for training computational models for automated phonation mode detection.\ud \ud Four phonation modes are distinguished by Johan Sundberg: breathy, neutral, flow (or resonant) and pressed. The presented dataset consists of ca. 700 recordings of nine vowels from several languages, sung at various pitches in various phonation modes. The recorded sounds were produced by one female singer under controlled conditions, following recommendations by voice acoustics researchers.\ud \ud While datasets on phonation modes in speech exist, such resources for singing are not available. Our dataset closes this gap and offers researchers in various disciplines a reference and a training set. It will be made available online under Creative Commons license. Also, the format of the dataset is extensible. Further content additions and future support for the dataset are planned.
  • The results below are discovered through our pilot algorithms. Let us know how we are doing!

    • Borch, D. Z. and Sundberg, J. (2011). Some phonatory and resonatory characteristics of the rock, pop, soul, and swedish dance band styles of singing. J Voice, 25(5):532-7.
    • Drugman, T., Bozkurt, B., and Dutoit, T. (2012). A comparative study of glottal source estimation techniques. Computer Speech and Language, 26:20-34.
    • Drugman, T., Dubuisson, T., Moinet, A., D'Alessandro, N., and Dutoit, T. (2008). Glottal source estimation robustness. In Proc. of the IEEE International Conference on Signal Processing and Multimedia Applications (SIGMAP08).
    • Fritzell, B. (1992). Inverse filtering. Journal of Voice, 6(2):111-114.
    • Froeschels, E. (1943). Hygiene of the voice. Arch Otolaryngol., 38(2):122-130.
    • Gudnason, J., Mark R.P. Thomas, D. P. E., and Naylor, P. A. (2012). Data-driven voice source waveform analysis and synthesis. Speech Communication, 54:199-211.
    • Howard, D. M. (2010). Electrolaryngographically revealed aspects of the voice source in singing. Logopedics Phoniatrics Vocology, 35(2):81-89.
    • [8] (2009). Guidelines on the Production and Preservation of Digital Audio Objects: Standards, Recommended Practices and Strategies (IASA-TC 04). IASA (International Association for Sound- and Audiovisual Archives) Technical Committee, 2 edition.
    • [9] Lehto, L., Airas, M., Björkner, E., Sundberg, J., and Alku, P. (2007). Comparison of two inverse filtering methods in parameterization of the glottal closing phase characteristics in different phonation types. J Voice, 21(2):138-50.
    • [10] Lomax, A. (1977). Cantometrics: A Method of Musical Anthropology (audio-cassettes and handbook). Berkeley: University of California Media Extension Center.
    • [11] Orr, R., Cranen, B., de Jong, F., d'Alessandro, C., and Scherer, K. (2003). An investigation of the parameters derived from the inverse filtering of flow and microphone signals. In Voice Quality: Functions, Analysis and Synthesis (VOQUAL '03). Taalwetenschap Otorhinolaryngology.
    • [12] Pulakka, H. (2005). Analysis of human voice production using inverse filtering, high-speed imaging, and electroglottography. Master's thesis, HELSINKI UNIVERSITY OF TECHNOLOGY, Department of Computer Science and Engineering.
    • [13] Ramig, L. O. and Verdolini, K. (1998). Journal of speech, language, and hearing research. Journal of Speech, Language, and Hearing Research, 41:101- 116.
    • [14] Rothenberg, M. (1973). A new inverse-filtering technique for deriving the glottal air flow waveform during voicing. The Journal of the Acoustical Society of America, 53:1632-1645.
    • [15] Sundberg, J. (1987). The science of the singing voice. Illinois University Press.
    • [16] Sundberg, J., Thalén, M., Alku, P., and Vilkman, E. (2004). Estimating perceived phonatory pressedness in singing from flow glottograms. J Voice, 18(1):56-62.
    • [17] Svec, J. G. and Granqvist, S. (2010). Guidelines for selecting microphones for human voice production research. American Journal of Speech-Language Pathology, 19:356-368.
    • [18] Thalén, M. and Sundberg, J. (2001). Describing different styles of singing: a comparison of a female singer's voice source in "classical", "pop", "jazz" and "blues". Logoped Phoniatr Vocol, 26(2):82-93.
    • [19] Walker, J. and Murphy, P. (2007). A review of glottal waveform analysis. In PROGRESS IN NONLINEAR SPEECH PROCESSING, volume 4391 of Lecture Notes in Computer Science, pages 1-21. Springer.
  • No related research data.
  • No similar publications.

Share - Bookmark

Cite this article