Improving out-domain PLDA speaker verification using unsupervised inter-dataset variability compensation approach

Kanagasundaram, Ahilan, Dean, David, & Sridharan, Sridha (2015) Improving out-domain PLDA speaker verification using unsupervised inter-dataset variability compensation approach. In Proceedings of 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2015), IEEE, Brisbane, QLD, pp. 4654-4658.

Abstract

Experimental studies have found that training state-of-the-art probabilistic linear discriminant analysis (PLDA) speaker verification systems on out-domain data significantly degrades speaker verification performance because of the mismatch between development data and evaluation data. To overcome this problem, we propose a novel unsupervised inter-dataset variability (IDV) compensation approach to compensate for the dataset mismatch. The IDV-compensated PLDA system achieves over 10% relative improvement in EER values over the out-domain PLDA system by effectively compensating for the mismatch between in-domain and out-domain data.

Impact and interest:

3 citations in Scopus

Full-text downloads:

71 since deposited on 27 Apr 2015
14 in the past twelve months

ID Code: 83774
Item Type: Conference Paper
Refereed: Yes
Keywords: speaker recognition, domain adaptation, PLDA, inter-dataset variability
DOI: 10.1109/ICASSP.2015.7178853
ISBN: 9781467369978
Divisions: Current > Schools > School of Electrical Engineering & Computer Science
Past > QUT Faculties & Divisions > Faculty of Science and Technology
Past > Institutes > Information Security Institute
Copyright Owner: Copyright 2015 IEEE
Copyright Statement: Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.
Deposited On: 27 Apr 2015 22:45
Last Modified: 12 Sep 2015 18:19