Factor analysis modelling for speaker verification with short utterances

(2008) Factor analysis modelling for speaker verification with short utterances. In Brummer, N & du Preez, J (Eds.) Proceedings of Odyssey 2008: The Speaker and Language Recognition Workshop. Institute of Electrical and Electronics Engineers Inc., CD Rom, pp. 1-4.

Description

This paper examines combining relevance MAP and subspace speaker adaptation to train GMM speaker models for speaker verification, with a particular focus on short utterances. The subspace speaker adaptation method models a speaker's GMM mean supervector as the sum of a speaker-independent prior distribution and a speaker-dependent offset constrained to lie within a low-rank subspace, and has been shown to improve accuracy over ordinary relevance MAP when the amount of training data is limited. Testing on NIST SRE data shows that combining the two processes yields speaker models that give modest improvements in verification accuracy in limited-data situations, and that it also improves the performance of the speaker verification system when a larger amount of training data is available.
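The two adaptation processes named in the description can be illustrated with a minimal sketch. It is not the paper's implementation: the function names, the data layout, and the relevance factor of 16 are assumptions made for illustration, and the abstract does not state how the two processes are combined, so only the individual building blocks are shown.

```python
from typing import List

Vector = List[float]


def relevance_map_means(
    prior_means: List[Vector],
    zeroth_stats: List[float],
    first_stats: List[Vector],
    relevance: float = 16.0,  # assumed value, not taken from the paper
) -> List[Vector]:
    """Relevance MAP update of GMM component means (hypothetical helper).

    prior_means[c]  : speaker-independent mean of component c
    zeroth_stats[c] : soft frame count n_c assigned to component c
    first_stats[c]  : responsibility-weighted sum of frames for component c
    """
    adapted = []
    for m_c, n_c, f_c in zip(prior_means, zeroth_stats, first_stats):
        # The more data a component has seen, the more it trusts the data mean.
        alpha = n_c / (n_c + relevance)
        data_mean = [f / n_c for f in f_c] if n_c > 0 else list(m_c)
        adapted.append(
            [alpha * d + (1.0 - alpha) * m for d, m in zip(data_mean, m_c)]
        )
    return adapted


def subspace_supervector(
    prior_supervector: Vector,
    basis: List[Vector],
    factors: Vector,
) -> Vector:
    """Speaker supervector as prior plus a low-rank offset (hypothetical helper).

    basis has one row per supervector dimension and one column per
    subspace dimension, so the offset is the matrix-vector product
    basis @ factors.
    """
    return [
        m + sum(v * y for v, y in zip(row, factors))
        for m, row in zip(prior_supervector, basis)
    ]


if __name__ == "__main__":
    # One 2-D component that has seen 16 soft frames summing to [16, 32].
    print(relevance_map_means([[0.0, 0.0]], [16.0], [[16.0, 32.0]]))
    # A 3-D supervector with a rank-1 speaker subspace.
    print(subspace_supervector([1.0, 2.0, 3.0], [[1.0], [0.0], [2.0]], [0.5]))
```

In the sketch, relevance MAP moves each component mean independently and only as far as its own frame count allows, whereas the subspace offset moves all components together through a small number of speaker factors, which is why the latter is expected to need less data.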

Impact and interest:

32 citations in Scopus

Full-text downloads:

595 since deposited on 25 Feb 2008
44 in the past twelve months

ID Code: 12629
Item Type: Chapter in Book, Report or Conference volume (Conference contribution)
ORCID iD: Sridharan, Sridha: orcid.org/0000-0003-4316-9001
Measurements or Duration: 4 pages
Event Title: Speaker and Language Recognition Workshop
Event Dates: 2008-01-21 - 2008-01-24
Event Location: UNSPECIFIED
Keywords: Factor Analysis, Probabilistic PCA, Speaker Verification
ISBN: 978-0-620-40331-3
Pure ID: 33568465
Divisions: Past > QUT Faculties & Divisions > Faculty of Built Environment and Engineering
Past > QUT Faculties & Divisions > Faculty of Science and Technology
Past > Institutes > Information Security Institute
Past > QUT Faculties & Divisions > Science & Engineering Faculty
Current > Research Centres > Australian Research Centre for Aerospace Automation
Copyright Owner: Consult author(s) regarding copyright matters
Copyright Statement: This work is covered by copyright. Unless the document is being made available under a Creative Commons Licence, you must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a Creative Commons License (or other specified license) then refer to the Licence for details of permitted re-use. It is a condition of access that users recognise and abide by the legal requirements associated with these rights. If you believe that this work infringes copyright please provide details by email to qut.copyright@qut.edu.au
Deposited On: 25 Feb 2008 10:00
Last Modified: 20 Apr 2026 04:31