Robust Feature Extraction Based on Image-based Approach for Visual Speech Recognition

Autor: Sung-Taek Hwang, Seung-You Na, Thanh Trung Pham, Jing-Young Kim, Song-Min Gyu, So-Hee Min
Rok vydání: 2010
Předmět:
Zdroj: Journal of Korean Institute of Intelligent Systems. 20:348-355
ISSN: 1976-9172
Popis: In spite of development in speech recognition technology, speech recognition under noisy environment is still a difficult task. To solve this problem, Researchers has been proposed different methods where they have been used visual information except audio information for visual speech recognition. However, visual information also has visual noises as well as the noises of audio information, and this visual noises cause degradation in visual speech recognition. Therefore, it is one the field of interest how to extract visual features parameter for enhancing visual speech recognition performance. In this paper, we propose a method for visual feature parameter extraction based on image-base approach for enhancing recognition performance of the HMM based visual speech recognizer. For experiments, we have constructed Audio-visual database which is consisted with 105 speackers and each speaker has uttered 62 words. We have applied histogram matching, lip folding, RASTA filtering, Liner Mask, DCT and PCA. The experimental results show that the recognition performance of our proposed method enhanced at about 21% than the baseline method.
Databáze: OpenAIRE