Robust Feature Extraction Based on Image-based Approach for Visual Speech Recognition
Autor: | Sung-Taek Hwang, Seung-You Na, Thanh Trung Pham, Jing-Young Kim, Song-Min Gyu, So-Hee Min |
---|---|
Rok vydání: | 2010 |
Předmět: |
Audio mining
business.industry Computer science Speech recognition Feature extraction ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION Histogram matching Pattern recognition Speaker recognition Speech processing Field (computer science) Discrete cosine transform Artificial intelligence business Hidden Markov model |
Zdroj: | Journal of Korean Institute of Intelligent Systems. 20:348-355 |
ISSN: | 1976-9172 |
Popis: | In spite of development in speech recognition technology, speech recognition under noisy environment is still a difficult task. To solve this problem, Researchers has been proposed different methods where they have been used visual information except audio information for visual speech recognition. However, visual information also has visual noises as well as the noises of audio information, and this visual noises cause degradation in visual speech recognition. Therefore, it is one the field of interest how to extract visual features parameter for enhancing visual speech recognition performance. In this paper, we propose a method for visual feature parameter extraction based on image-base approach for enhancing recognition performance of the HMM based visual speech recognizer. For experiments, we have constructed Audio-visual database which is consisted with 105 speackers and each speaker has uttered 62 words. We have applied histogram matching, lip folding, RASTA filtering, Liner Mask, DCT and PCA. The experimental results show that the recognition performance of our proposed method enhanced at about 21% than the baseline method. |
Databáze: | OpenAIRE |
Externí odkaz: |