QUT ePrints

An approach to statistical lip modelling for speaker identification via chromatic feature extraction

Wark, T., Sridharan, S., & Chandran, V. (1998) An approach to statistical lip modelling for speaker identification via chromatic feature extraction. In Proceedings of the Fourteenth International Conference on Pattern Recognition, 1998, IEEE, Brisbane, Australia, pp. 123-125.

[img] Published Version (PDF 189kB)
Administrators only | Request a copy from author

View at publisher

Abstract

This paper presents a novel technique for the tracking of moving lips for the purpose of speaker identification. In our system, a model of the lip contour is formed directly from chromatic information in the lip region. Iterative refinement of contour point estimates is not required. Colour features are extracted from the lips via concatenated profiles taken around the lip contour. Reduction of order in lip features is obtained via principal component analysis (PCA) followed by linear discriminant analysis (LDA). Statistical speaker models are built from the lip features based on the Gaussian mixture model (GMM). Identification experiments performed on the M2VTS1 database, show encouraging results

Impact and interest:

7 citations in Web of Science®
Search Google Scholar™

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

ID Code: 45589
Item Type: Conference Paper
Keywords: Gaussian distribution, biometrics (access control), feature extraction, image recognition, speaker recognition, statistical analysis, tracking, GMM, Gaussian mixture model, LDA, M2VTS<sup>1</sup> database, PCA, chromatic feature extraction, concatenated profiles, linear discriminant analysis, lip contour model, principal component analysis, speaker identification, statistical lip modelling, statistical speaker models
DOI: 10.1109/ICPR.1998.711095
ISBN: 0818685123
Divisions: Past > QUT Faculties & Divisions > Faculty of Built Environment and Engineering
Past > Schools > School of Engineering Systems
Copyright Owner: Copyright 1998 IEEE
Copyright Statement: Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
Deposited On: 17 Oct 2011 02:00
Last Modified: 17 Oct 2011 02:24

Export: EndNote | Dublin Core | BibTeX

Repository Staff Only: item control page