QUT ePrints

Modelling session variability in text independent speaker verification

Vogt, Robert J., Baker, Brendan J., & Sridharan, Sridha (2005) Modelling session variability in text independent speaker verification. In Eurospeech/Interspeech : Proceedings of the 9th European Conference on Speech Communication and Technology 2005, 4-8 September 2005, Lisbon, Portugal.

Abstract

This paper presents an approach to modelling session variability for GMM-based text-independent speaker verification that incorporates a constrained session variability component in both the training and testing procedures. The proposed technique reduces the data labelling requirements and removes the discrete categorisation needed by techniques such as feature mapping and H-Norm, while providing superior performance. Experiments on Switchboard-II conversational telephony data show improvements over a baseline system of as much as 48% in detection cost with a single training utterance and 68% with multiple training utterances.

Impact and interest:

10 citations in Scopus

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citation counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

Full-text downloads:

1,329 since deposited on 06 Nov 2008
374 in the past twelve months

Full-text downloads displays the total number of times this work’s files (e.g., a PDF) have been downloaded from QUT ePrints, as well as the number of downloads in the previous 365 days. The count includes downloads for all files if a work has more than one.

ID Code: 15490
Item Type: Conference Paper
Subjects: Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING (080100) > Natural Language Processing (080107)
Divisions: Past > QUT Faculties & Divisions > Faculty of Built Environment and Engineering
Past > Institutes > Information Security Institute
Past > Schools > School of Engineering Systems
Copyright Owner: Copyright 2005 International Speech Communication Association (ISCA)
Copyright Statement: Reproduced in accordance with the copyright policy of the publisher.
Deposited On: 06 Nov 2008
Last Modified: 29 Feb 2012 23:13
