An approach to statistical lip modelling for speaker identification via chromatic feature extraction
Wark, T., Sridharan, S., & Chandran, V. (1998) An approach to statistical lip modelling for speaker identification via chromatic feature extraction. In Proceedings of the Fourteenth International Conference on Pattern Recognition, 1998, IEEE, Brisbane, Australia, pp. 123-125.
|Published Version (PDF 189kB) |
Administrators only | Request a copy from author
This paper presents a novel technique for the tracking of moving lips for the purpose of speaker identification. In our system, a model of the lip contour is formed directly from chromatic information in the lip region. Iterative refinement of contour point estimates is not required. Colour features are extracted from the lips via concatenated profiles taken around the lip contour. Reduction of order in lip features is obtained via principal component analysis (PCA) followed by linear discriminant analysis (LDA). Statistical speaker models are built from the lip features based on the Gaussian mixture model (GMM). Identification experiments performed on the M2VTS1 database, show encouraging results
Impact and interest:
Citation countsare sourced monthly fromand citation databases.
These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.
Citations counts from theindexing service can be viewed at the linked Google Scholar™ search.
|Item Type:||Conference Paper|
|Keywords:||Gaussian distribution, biometrics (access control), feature extraction, image recognition, speaker recognition, statistical analysis, tracking, GMM, Gaussian mixture model, LDA, M2VTS<sup>1</sup> database, PCA, chromatic feature extraction, concatenated profiles, linear discriminant analysis, lip contour model, principal component analysis, speaker identification, statistical lip modelling, statistical speaker models|
|Divisions:||Past > QUT Faculties & Divisions > Faculty of Built Environment and Engineering|
Past > Schools > School of Engineering Systems
|Copyright Owner:||Copyright 1998 IEEE|
|Copyright Statement:||Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.|
|Deposited On:||17 Oct 2011 12:00|
|Last Modified:||17 Oct 2011 12:24|
Repository Staff Only: item control page