Improved Facial-Feature Detection for AVSP via Unsupervised Clustering and Discriminant Analysis

Lucey, Simon, Sridharan, Sridha, & Chandran, Vinod (2003) Improved Facial-Feature Detection for AVSP via Unsupervised Clustering and Discriminant Analysis. EURASIP Journal on Applied Signal Processing, 2003(3), pp. 264-275.




An integral part of any audio-visual speech processing (AVSP) system is the front-end visual system that detects the facial features (e.g., eyes and mouth) pertinent to the task of visual speech processing. The ability of this front-end system not only to locate a facial feature, but also to give a confidence measure that the feature is present in the image, directly affects the performance of any subsequent processing task such as speech or speaker recognition. With these issues in mind, this paper presents a framework for a facial-feature detection system suitable for use in an AVSP system, but whose basic framework is useful for any application requiring frontal facial-feature detection. A novel approach to facial-feature detection is presented, based on an appearance paradigm. This approach, which combines intraclass unsupervised clustering with discriminant analysis, displays improved detection performance over conventional techniques.

Impact and interest:

10 citations in Scopus
6 citations in Web of Science®


Full-text downloads:

176 since deposited on 11 Oct 2007
7 in the past twelve months


ID Code: 10093
Item Type: Journal Article
Refereed: Yes
Additional Information: The contents of this journal can be freely accessed online via the journal’s web page (see hypertext link).
Keywords: audio-visual speech processing, facial-feature detection, unsupervised clustering, discriminant analysis
DOI: 10.1155/S1110865703209045
ISSN: 1110-8657
Divisions: Past > QUT Faculties & Divisions > Faculty of Built Environment and Engineering
Copyright Owner: Copyright 2003 (The authors)
Deposited On: 11 Oct 2007 00:00
Last Modified: 29 Feb 2012 13:03
