Remember Me
Or use your Academic/Social account:


Or use your Academic/Social account:


You have just completed your registration at OpenAire.

Before you can login to the site, you will need to activate your account. An e-mail will be sent to you with the proper instructions.


Please note that this site is currently undergoing Beta testing.
Any new content you create is not guaranteed to be present to the final version of the site upon release.

Thank you for your patience,
OpenAire Dev Team.

Close This Message


Verify Password:
Verify E-mail:
*All Fields Are Required.
Please Verify You Are Human:
fbtwitterlinkedinvimeoflicker grey 14rssslideshare1
Benetos, E.; Dixon, S. (2011)
Publisher: IEEE
Languages: English
Types: Unknown
Subjects: M1, QA76

Classified by OpenAIRE into

In this paper, an approach for polyphonic music transcription based on joint multiple-F0 estimation and note onset/offset detection is proposed. For preprocessing, the resonator time-frequency image of the input music signal is extracted and noise suppression is performed. A pitch salience function is extracted for each frame along with tuning and inharmonicity parameters. For onset detection, late fusion is employed by combining a novel spectral flux-based feature which incorporates pitch tuning information and a novel salience function-based descriptor. For each segment defined by two onsets, an overlapping partial treatment procedure is used and a pitch set score function is proposed. A note offset detection procedure is also proposed using HMMs trained on MIDI data. The system was trained on piano chords and tested on classic and jazz recordings from the RWC database. Improved transcription results are reported compared to state-of-the-art approaches.
  • The results below are discovered through our pilot algorithms. Let us know how we are doing!

    • Etot 40.3% 38.8%
    • [1] A. Klapuri and M. Davy, Eds., Signal Processing Methods for Music Transcription, Springer-Verlag, New York, 2nd edition, 2006.
    • [2] R. Zhou, Feature extraction of musical content for automatic music transcription, Ph.D. thesis, E´ cole Polytechnique Fe´de´rale de Lausanne, Oct. 2006.
    • [3] C. Yeh, A. Ro¨bel, and X. Rodet, “Multiple fundamental frequency estimation and polyphony inference of polyphonic music signals,” IEEE Trans. Audio, Speech, and Language Processing, vol. 18, no. 6, pp. 1116-1126, Aug. 2010.
    • [4] E. Benetos and S. Dixon, “Multiple-F0 estimation of piano sounds exploiting spectral structure and temporal evolution,” in ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, Sept. 2010, pp. 13-18.
    • [5] J. P. Bello, L. Daudet, S. Abdallah, C. Duxbury, M. Davies, and M. Sandler, “A tutorial on onset detection of music signals,” IEEE Trans. Audio, Speech, and Language Processing, vol. 13, no. 5, pp. 1035-1047, Sept. 2005.
    • [6] A. Holzapfel and Y. Stylianou, “Three dimensions of pitched instrument onset detection,” IEEE Trans. Audio, Speech, and Language Processing, vol. 18, no. 6, pp. 1517-1527, 2010.
    • [7] M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, “RWC music database: music genre database and musical instrument sound database,” in Int. Conf. Music Information Retrieval, Oct. 2003.
    • [8] M. Varewyck and J.-P. Martens, “Assessment of state-of-theart meter analysis systems with an extended meter description model,” in 8th Int. Conf. Music Information Retrieval, 2007.
    • [9] V. Emiya, R. Badeau, and B. David, “Multipitch estimation of piano sounds using a new probabilistic spectral smoothness principle,” IEEE Trans. Audio, Speech, and Language Processing, vol. 18, no. 6, pp. 1643-1654, Aug. 2010.
    • [10] D. Schwarz and X. Rodet, “Spectral envelope estimation and representation for sound analysis-synthesis,” in Int. Computer Music Conf., Oct. 1999.
    • [11] A. Pertusa and J. M. In˜esta, “Multiple fundamental frequency estimation using Gaussian smoothness,” in IEEE Int. Conf. Acoustics, Speech, and Signal Processing, 2008, pp. 105-108.
    • [12] J. A. Nelder and R. Mead, “A simplex method for function minimization,” Computer Journal, vol. 7, pp. 308-313, 1965.
    • [13] F.J. Can˜adas-Quesada, N. Ruiz-Reyes, P. Vera Candeas, J. J. Carabias-Orti, and S. Maldonado, “A multiple-F0 estimation approach based on Gaussian spectral modelling for polyphonic music transcription,” J. New Music Research, vol. 39, no. 1, pp. 93-107, Apr. 2010.
    • [14] H. Kameoka, T. Nishimoto, and S. Sagayama, “A multipitch analyzer based on harmonic temporal structured clustering,” IEEE Trans. Audio, Speech, and Language Processing, vol. 15, no. 3, pp. 982-994, Mar. 2007.
    • [15] S. Saito, H. Kameoka, K. Takahashi, T. Nishimoto, and S. Sagayama, “Specmurt analysis of polyphonic music signals,” IEEE Trans. Audio, Speech, and Language Processing, vol. 16, no. 3, pp. 639-650, Mar. 2008.
  • No related research data.
  • No similar publications.

Share - Bookmark

Download from

Cite this article