Kendrick, P; Jackson, IR; Li, FF; Fazenda, BM; Cox, TJ
Publisher: Audio Engineering Society
Languages: English
Types: Article
Subjects: media_dig_tech_and_creative_econ
For field recordings and user-generated content recorded on phones, tablets, and other mobile devices, nonlinear distortions caused by clipping and limiting at pre-amplification stages, and by dynamic range control (DRC), are common causes of poor audio quality. A single-ended method to detect these distortions and predict perceived degradation in speech, music, and soundscapes has been developed. This was done by training an ensemble of decision trees. During training, both clean and distorted audio were available, so the perceived quality could be gauged using HASQI (Hearing Aid Sound Quality Index). The new single-ended method can predict HASQI from distorted samples to within ±0.19 (95% confidence interval) on a quality scale from 0.0 to 1.0. The method also has potential for estimating HASQI when other types of degradation are present. Subsequent perceptual tests validated the method for music and soundscapes. The single-ended method could estimate the average mean opinion score for perceived audio quality, on a scale from 0 to 1, to within ±0.33.
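As a rough illustration of what "single-ended" means here, the sketch below hard-clips a test tone and then computes a feature from the distorted signal alone, with no clean reference. It is not the method described in the abstract, which trains an ensemble of decision trees to predict HASQI; the function names `hard_clip` and `clipped_fraction`, the 0.5 clipping threshold, and the tolerance value are all assumptions chosen for the example.

```python
import math

def hard_clip(samples, threshold):
    """Limit every sample to the range [-threshold, +threshold]."""
    return [max(-threshold, min(threshold, s)) for s in samples]

def clipped_fraction(samples, tolerance=1e-3):
    """Fraction of samples lying within `tolerance` (relative) of the peak level.

    Hypothetical single-ended feature: it uses only the signal under test,
    so it can be computed when no clean reference recording exists.
    """
    peak = max(abs(s) for s in samples)
    if peak == 0.0:
        return 0.0  # silent input: nothing can be clipped
    near_peak = sum(1 for s in samples if abs(s) >= peak * (1.0 - tolerance))
    return near_peak / len(samples)

# One second of a 440 Hz tone sampled at 8 kHz (illustrative values only).
rate = 8000
clean = [math.sin(2 * math.pi * 440 * n / rate) for n in range(rate)]
distorted = hard_clip(clean, 0.5)

clean_score = clipped_fraction(clean)
distorted_score = clipped_fraction(distorted)
print(f"clean: {clean_score:.3f}  clipped: {distorted_score:.3f}")
```

A clipped signal spends far more time pinned at its peak level than an undistorted one, so the feature rises sharply after clipping. In a learning-based approach such as the one the abstract outlines, features of this general kind would be inputs to a trained model rather than a quality score in themselves.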