Evaluation of a Fused FM and Cepstral-Based Speaker Recognition System on the NIST 2008 SRE

Download files
Access & Terms of Use
open access
Abstract
In this paper, the fusion of two speaker recognition subsystems, one based on Frequency Modulation (FM) and another on MFCC features, is reported. The motivation for their fusion was to improve the recognition accuracy across different types of channel variations, since the two features are believed to contain complementary information. It was found that the MFCC-based subsystem outperformed the FM-based subsystem on telephone conversations from NIST SRE-06 dataset, while the opposite was true for NIST SRE-08 telephone data. As a result, the FM-based subsystem performed as well as the MFCC-based subsystem and their fusion gave up to 23% relative improvement in terms of EER over the MFCC subsystem alone, when evaluated on the NIST 2008 core condition.
Persistent link to this record
DOI
Additional Link
Author(s)
Nosratighods, M
Thiruvaran, T
Epps, J
Ambikairajah, E
Ma, B.
Li, H.
Supervisor(s)
Creator(s)
Editor(s)
Translator(s)
Curator(s)
Designer(s)
Arranger(s)
Composer(s)
Recordist(s)
Conference Proceedings Editor(s)
Other Contributor(s)
Corporate/Industry Contributor(s)
Publication Year
2009
Resource Type
Conference Paper
Degree Type
UNSW Faculty
Files
download Evaluation of a Fused FM and Cepstral-Based Speaker.pdf 512.89 KB Adobe Portable Document Format
Related dataset(s)