Problems Associated with Current Area-Based Visual Speech Feature Extraction Techniques
Lucey, Patrick J., Dean, David B., & Sridharan, Sridha (2005) Problems Associated with Current Area-Based Visual Speech Feature Extraction Techniques. In International Conference on Auditory-Visual Speech Processing (AVSP), July 24-27, Vancouver Island, British Columia, Canada.
Abstract
Techniques such as principle component analysis (PCA), linear discriminant analysis (LDA) and the discrete cosine transform (DCT) have all been used to good effect in face recognition. As these techniques are able to compactly represent a set of features, researchers have sought to use these methods to extract the visual speech content for audio-visual speech recognition (AVSR). In this paper, we expose the problems of employing such techniques in AVSR by running some visual-only speech recognition experiments. The results of these experiments illustrate that current area-based feature extraction techniques are heavily dependent on the visual front-end, as well as being ineffective in decoupling adequate speech content from a speaker’s mouth. As a potential solution, we introduce the concept of a free-parts representation, which may be able to circumvent many of these problems experienced by current area-based techniques.
Citations:

Citation counts are sourced monthly from Scopus and Web of Science citation databases.
These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science generally from 1980 onwards.
Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.
Full-text downloads:

Full-text downloads displays the total number of times this work’s files (e.g., a PDF) have been downloaded from QUT ePrints as well as the number of downloads in the previous 365 days. The count includes downloads for all files if a work has more than one.
| ID Code: | 12847 |
|---|---|
| Item Type: | Conference Paper |
| Additional URLs: | |
| Divisions: | Past > QUT Faculties & Divisions > Faculty of Built Environment and Engineering Past > Institutes > Information Security Institute |
| Copyright Statement: | Copyright 2005 (please consult author) |
| Deposited On: | 05 Mar 2008 |
| Last Modified: | 02 Feb 2012 19:59 |
Export: EndNote | Dublin Core | BibTeX
Staff only: HERDC collection form
Repository Staff Only: item control page