Publication:
Objective measurement of temporally localized distortions

dc.contributor.advisor Sen, Deep en_US
dc.contributor.author Lu, Wenliang en_US
dc.date.accessioned 2022-03-21T10:47:40Z
dc.date.available 2022-03-21T10:47:40Z
dc.date.issued 2012 en_US
dc.description.abstract Evaluating speech quality in an objective manner has been the “Holy Grail” of digital speech processing over the last 50 years. The assessment of speech quality using objective measures through the use of computational algorithms, provides increased efficiency and reliability compared to its subjective counterpart. However, existing methods such as SNR, LSD, BSD, PESQ and their variants, either lack sufficient accuracy, or fail to handle a comprehensive range of scenarios. One of the intrinsically problematic issues with these methods, is their reliance on a uni-dimensional quality classification schema. Recent advances in speech quality research have converged on the notion that speech quality is a multi-dimensional space. Research by Voiers, Sen and Hall, have all shown that speech quality can be adequately described using three orthogonal dimensions, whose axes correspond to temporally-localised distortions, frequency-localised distortions, and distortions which are not attributed to the first two. This thesis explores the prediction of temporally-localised distortions, which have been shown to contribute to 55% of the variance of the overall quality. Various features extracted from spectrograms, psychoacoustic masking models and non-linear cochlear models, are explored for the development of a robust representation for temporally-localised distortions. Features extracted from the non-linear cochlear model are shown to yield the best results, achieving correlation coefficients higher than 0.9 with respect to subjective scores. en_US
dc.identifier.uri http://hdl.handle.net/1959.4/51682
dc.language English
dc.language.iso EN en_US
dc.publisher UNSW, Sydney en_US
dc.rights CC BY-NC-ND 3.0 en_US
dc.rights.uri https://creativecommons.org/licenses/by-nc-nd/3.0/au/ en_US
dc.subject.other Temporally localized distortion en_US
dc.subject.other Objective measurement of speech quality en_US
dc.subject.other Cochlear model en_US
dc.subject.other Formant en_US
dc.title Objective measurement of temporally localized distortions en_US
dc.type Thesis en_US
dcterms.accessRights open access
dcterms.rightsHolder Lu, Wenliang
dspace.entity.type Publication en_US
unsw.accessRights.uri https://purl.org/coar/access_right/c_abf2
unsw.identifier.doi https://doi.org/10.26190/unsworks/15302
unsw.relation.faculty Engineering
unsw.relation.originalPublicationAffiliation Lu, Wenliang, Electrical Engineering & Telecommunications, Faculty of Engineering, UNSW en_US
unsw.relation.originalPublicationAffiliation Sen, Deep, Electrical Engineering & Telecommunications, Faculty of Engineering, UNSW en_US
unsw.relation.school School of Electrical Engineering and Telecommunications *
unsw.thesis.degreetype PhD Doctorate en_US
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
whole.pdf
Size:
5.16 MB
Format:
application/pdf
Description:
Resource type