Speaker linking using complete-linkage clustering
Ghaemmaghami, Houman, Dean, David, & Sridharan, Sridha (2012) Speaker linking using complete-linkage clustering. In Cox, F, Lin, S, Shaw, J, Yuen, I, Miles, K, Demuth, K, et al. (Eds.) Speech Science and Technology 2012: Proceedings of the 14th Australasian International Conference on Speech Science and Technology. The Australasian Speech Science and Technology Association (ASSTA), Australia, pp. 1-4.
Description
Speaker diarization determines instances of the same speaker within a recording. Extending this task to a collection of recordings for linking together segments spoken by a unique speaker requires speaker linking. In this paper we propose a speaker linking system using linkage clustering and state-of-the-art speaker recognition techniques. We evaluate our approach against two baseline linking systems using agglomerative cluster merging (AC) and agglomerative clustering with model retraining (ACR). We demonstrate that our linking method, using complete-linkage clustering, provides a relative improvement of 20% and 29% in attribution error rate (AER), over the AC and ACR systems, respectively.
Impact and interest:
Citation counts are sourced monthly from Scopus and Web of Science® citation databases.
These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.
Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.
| ID Code: | 57369 | ||
|---|---|---|---|
| Item Type: | Chapter in Book, Report or Conference volume (Conference contribution) | ||
| ORCID iD: |
|
||
| Measurements or Duration: | 4 pages | ||
| Event Title: | Australasian International Conference on Speech Science and Technology | ||
| Event Dates: | 2012-12-03 - 2012-12-06 | ||
| Event Location: | Australia | ||
| Keywords: | agglomerative clustering, complete-linkage, cross-likelihood ratio, joint factor analysis, speaker attribution, speaker diarization, speaker linking | ||
| ISBN: | 1039-0227 | ||
| Pure ID: | 32303820 | ||
| Divisions: | Past > QUT Faculties & Divisions > Science & Engineering Faculty | ||
| Funding: | |||
| Copyright Owner: | Copyright 2012 ASSTA | ||
| Copyright Statement: | This work is covered by copyright. Unless the document is being made available under a Creative Commons Licence, you must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a Creative Commons License (or other specified license) then refer to the Licence for details of permitted re-use. It is a condition of access that users recognise and abide by the legal requirements associated with these rights. If you believe that this work infringes copyright please provide details by email to qut.copyright@qut.edu.au | ||
| Deposited On: | 19 Feb 2013 12:37 | ||
| Last Modified: | 01 Apr 2026 15:52 |
Export: EndNote | Dublin Core | BibTeX
Repository Staff Only: item control page