Speaker identification using higher order spectral phase features and their effectiveness vis-a-vis Mel-Cepstral features

Chandran, Vinod, Ning, Daryl, & Sridharan, Subramanian (2004) Speaker identification using higher order spectral phase features and their effectiveness vis-a-vis Mel-Cepstral features. In Zhang, D & Jain, A (Eds.) Biometric Authentication First International Conference (ICBA 2004) Proceedings, 15-17 July 2004, Hong Kong, China.

View at publisher


The effectiveness of higher-order spectral (HOS) phase features in speaker recognition is investigated by comparison with Mel Cepstral features on the same speech data. HOS phase features retain phase information from the Fourier spectrum unlikeMel–frequency Cepstral coefficients (MFCC). Gaussian mixture models are constructed from Mel– Cepstral features and HOS features, respectively, for the same data from various speakers in the Switchboard telephone Speech Corpus. Feature clusters, model parameters and classification performance are analyzed. HOS phase features on their own provide a correct identification rate of about 97% on the chosen subset of the corpus. This is the same level of accuracy as provided by MFCCs. Cluster plots and model parameters are compared to show that HOS phase features can provide complementary information to better discriminate between speakers.

Impact and interest:

2 citations in Scopus
Search Google Scholar™

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

ID Code: 24302
Item Type: Conference Paper
Refereed: Yes
DOI: 10.1007/978-3-540-25948-0_84
ISBN: 3-540-22146-8
Subjects: Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING (080100) > Pattern Recognition and Data Mining (080109)
Divisions: Past > QUT Faculties & Divisions > Faculty of Built Environment and Engineering
Deposited On: 17 Jun 2009 14:27
Last Modified: 25 Feb 2013 05:30

Export: EndNote | Dublin Core | BibTeX

Repository Staff Only: item control page