Bayes Factor based speaker segmentation for speaker diarization
Wang, David, Vogt, Robert J., & Sridharan, Sridha (2010) Bayes Factor based speaker segmentation for speaker diarization. In Proceedings of the 11th Annual Conference of the International Speech Communication Association (Interspeech 2010), International Speech Communication Association, Makuhari Messe International Convention Complex, Chiba, Makuhari, Japan, 1405 -1408.
This paper proposes the use of the Bayes Factor as a distance metric for speaker segmentation within a speaker diarization system. The proposed approach uses a pair of constant sized, sliding windows to compute the value of the Bayes Factor between the adjacent windows over the entire audio. Results obtained on the 2002 Rich Transcription Evaluation dataset show an improved segmentation performance compared to previous approaches reported in literature using the Generalized Likelihood Ratio. When applied in a speaker diarization system, this approach results in a 5.1% relative improvement in the overall Diarization Error Rate compared to the baseline.
Impact and interest:
Citation counts are sourced monthly from and citation databases.
Citations counts from theindexing service can be viewed at the linked Google Scholar™ search.
Repository Staff Only: item control page