QUT ePrints

An approach to statistical lip modelling for speaker identification via chromatic feature extraction

Wark, T. , Sridharan, S., & Chandran, V. (1998) An approach to statistical lip modelling for speaker identification via chromatic feature extraction. In Proceedings of the Fourteenth International Conference on Pattern Recognition, 1998, IEEE, Brisbane, Australia, pp. 123-125.

[img] Published Version (PDF 189kB)
Administrators only | Request a copy from author

    View at publisher

    Abstract

    This paper presents a novel technique for the tracking of moving lips for the purpose of speaker identification. In our system, a model of the lip contour is formed directly from chromatic information in the lip region. Iterative refinement of contour point estimates is not required. Colour features are extracted from the lips via concatenated profiles taken around the lip contour. Reduction of order in lip features is obtained via principal component analysis (PCA) followed by linear discriminant analysis (LDA). Statistical speaker models are built from the lip features based on the Gaussian mixture model (GMM). Identification experiments performed on the M2VTS1 database, show encouraging results

    Impact and interest:

    4 citations in Web of Science®
    Search Google Scholar™

    Citation countsare sourced monthly from Scopus and Web of Science® citation databases.

    These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

    Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

    ID Code: 45589
    Item Type: Conference Paper
    Keywords: Gaussian distribution, biometrics (access control), feature extraction, image recognition, speaker recognition, statistical analysis, tracking, GMM, Gaussian mixture model, LDA, M2VTS<sup>1</sup> database, PCA, chromatic feature extraction, concatenated profiles, linear discriminant analysis, lip contour model, principal component analysis, speaker identification, statistical lip modelling, statistical speaker models
    DOI: 10.1109/ICPR.1998.711095
    ISBN: 0818685123
    Divisions: Past > QUT Faculties & Divisions > Faculty of Built Environment and Engineering
    Past > Schools > School of Engineering Systems
    Copyright Owner: Copyright 1998 IEEE
    Copyright Statement: Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
    Deposited On: 17 Oct 2011 12:00
    Last Modified: 17 Oct 2011 12:24

    Export: EndNote | Dublin Core | BibTeX

    Repository Staff Only: item control page