The QUT-NOISE-SRE protocol for the evaluation of noisy speaker recognition

Dean, David B., Kanagasundaram, Ahilan, Ghaemmaghami, Houman, Rahman, Md Hafizur, & Sridharan, Sridha (2015) The QUT-NOISE-SRE protocol for the evaluation of noisy speaker recognition. In Proceedings of the 16th Annual Conference of the International Speech Communication Association, Interspeech 2015, International Speech Communication Association, Dresden, Germany, pp. 3456-3460.

Abstract

The QUT-NOISE-SRE protocol is designed to mix the large QUT-NOISE database, consisting of over 10 hours of background noise, collected across 10 unique locations covering 5 common noise scenarios, with commonly used speaker recognition datasets such as Switchboard, Mixer and the speaker recognition evaluation (SRE) datasets provided by NIST. By allowing common, clean, speech corpora to be mixed with a wide variety of noise conditions, environmental reverberant responses, and signal-to-noise ratios, this protocol provides a solid basis for the development, evaluation and benchmarking of robust speaker recognition algorithms, and is freely available to download alongside the QUT-NOISE database. In this work, we use the QUT-NOISE-SRE protocol to evaluate a state-of-the-art PLDA i-vector speaker recognition system, demonstrating the importance of designing voice-activity-detection front-ends specifically for speaker recognition, rather than aiming for perfect coherence with the true speech/non-speech boundaries.
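
As an illustration of the kind of mixing the abstract describes, the sketch below scales a noise segment so that a clean speech signal and the noise meet a target signal-to-noise ratio, then adds the two. It is a minimal sketch and not the protocol's own code: the function names (`mix_at_snr`, `measured_snr_db`) are hypothetical, signal power is assumed to be the mean squared amplitude over the whole segment, and reverberation is not modelled. The actual protocol may measure speech level differently.

```python
import math
import random


def mean_power(samples):
    """Average power (mean squared amplitude) of a sequence of samples."""
    return sum(s * s for s in samples) / len(samples)


def mix_at_snr(speech, noise, target_snr_db):
    """Scale `noise` to reach `target_snr_db` against `speech`, then add them.

    Hypothetical helper for illustration only; it is not part of the
    QUT-NOISE-SRE tools. Returns (mixed, scaled_noise).
    """
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    segment = noise[:len(speech)]  # assumption: use the start of the noise
    p_speech = mean_power(speech)
    p_noise = mean_power(segment)
    if p_noise == 0:
        raise ValueError("noise segment is silent")
    # Target noise power is p_speech / 10^(SNR/10); the gain is applied to
    # amplitudes, hence the square root.
    gain = math.sqrt(p_speech / (p_noise * 10 ** (target_snr_db / 10)))
    scaled = [gain * n for n in segment]
    mixed = [s + n for s, n in zip(speech, scaled)]
    return mixed, scaled


def measured_snr_db(speech, noise):
    """SNR in dB between a speech signal and a noise signal."""
    return 10 * math.log10(mean_power(speech) / mean_power(noise))


# Synthetic stand-ins for real recordings: a 440 Hz tone as "speech"
# and Gaussian noise as "background", both one second at 8 kHz.
rng = random.Random(0)
speech = [math.sin(2 * math.pi * 440 * t / 8000) for t in range(8000)]
noise = [rng.gauss(0, 1) for _ in range(8000)]

mixed, scaled = mix_at_snr(speech, noise, 5.0)
print(measured_snr_db(speech, scaled))  # close to the 5 dB target
```

Measuring power over the whole utterance keeps the sketch short, but it treats silence between words as part of the speech level. A level measured over active speech only would give a different noise gain for the same nominal SNR.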

ID Code: 85240
Item Type: Conference Paper
Refereed: Yes
Keywords: noisy speaker verification, speech databases, evaluation protocols
ISSN: 1990-9770
Subjects: Australian and New Zealand Standard Research Classification > ENGINEERING (090000) > ELECTRICAL AND ELECTRONIC ENGINEERING (090600) > Signal Processing (090609)
Divisions: Current > Schools > School of Electrical Engineering & Computer Science
Past > QUT Faculties & Divisions > Faculty of Science and Technology
Past > Institutes > Institute for Creative Industries and Innovation
Past > Institutes > Information Security Institute
Copyright Owner: Copyright 2015 [Please consult the author]
Deposited On: 09 Jul 2015 22:29
Last Modified: 24 Sep 2015 23:40
