Problems Associated With Current Area-Based Visual Speech Feature Extraction Techniques
Lucey, Patrick J., Dean, David B., & Sridharan, Sridha (2005) Problems Associated With Current Area-Based Visual Speech Feature Extraction Techniques. In Auditory-Visual Speech Processing International Conference 2005, July 24-27, British Columbia, Canada.
Techniques such as principal component analysis (PCA), linear discriminant analysis (LDA) and the discrete cosine transform (DCT) have all been used to good effect in face recognition. As these techniques are able to compactly represent a set of features, researchers have sought to use these methods to extract the visual speech content for audio-visual speech recognition (AVSR). In this paper, we expose the problems of employing such techniques in AVSR by running some visual-only speech recognition experiments. The results of these experiments illustrate that current area-based feature extraction techniques are heavily dependent on the visual front-end, as well as being ineffective in decoupling adequate speech content from a speaker’s mouth. As a potential solution, we introduce the concept of a free-parts representation, which may be able to circumvent many of the problems experienced by current area-based techniques.
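To make the idea of an area-based feature concrete, the following is a minimal sketch of DCT-based extraction from a mouth region of interest (ROI). It is not the authors' implementation: the helper name `area_features`, the 8x8 ROI size, the choice to keep a 3x3 low-frequency corner, and the synthetic pixel values are all assumptions made for illustration.

```python
import math


def dct_1d(x):
    """Orthonormal DCT-II of a sequence of numbers."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * s)
    return out


def dct_2d(block):
    """Separable 2D DCT: transform each row, then each column."""
    rows = [dct_1d(r) for r in block]
    cols = [dct_1d(list(c)) for c in zip(*rows)]
    # cols is indexed [horizontal freq][vertical freq]; transpose back.
    return [list(r) for r in zip(*cols)]


def area_features(roi, keep=3):
    """Hypothetical helper: keep the keep x keep low-frequency
    coefficients of the ROI as a compact feature vector."""
    coeffs = dct_2d(roi)
    return [coeffs[u][v] for u in range(keep) for v in range(keep)]


# Synthetic 8x8 "mouth ROI" (made-up intensities, not real data).
roi = [[float((r * 8 + c) % 17) for c in range(8)] for r in range(8)]
features = area_features(roi, keep=3)
```

Because the coefficients are computed over the whole ROI, any change in how the front-end locates or crops the mouth changes every feature, which is one way to picture the front-end dependence described in the abstract.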
|Item Type:||Conference Paper|
|Subjects:||Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING (080100) > Natural Language Processing (080107)|
|Divisions:||Past > QUT Faculties & Divisions > Faculty of Built Environment and Engineering|
|Copyright Owner:||Copyright 2005 (please consult author)|
|Deposited On:||19 Oct 2006|
|Last Modified:||09 Jun 2010 12:34|