Automatic Extraction of Geometric Lip Features with Application to Multi-Modal Speaker Identification

Autor:	I. Arsic, R. Vilagut, Jean-Philippe Thiran
Rok vydání:	2006
Předmět:	business.industry Computer science Speech recognition LTS5 Parallel Computing in Electrical Engineering Feature extraction Pattern recognition Color space Speaker recognition Fuzzy logic Robustness (computer science) Artificial intelligence business Cluster analysis Face detection
Zdroj:	ICME
DOI:	10.1109/icme.2006.262594
Popis:	In this paper we consider the problem of automatic extraction of the geometric lip features for the purposes of multi-modal speaker identification. The use of visual information from the mouth region can be of great importance for improving the speaker identification system performance in noisy conditions. We propose a novel method for automated lip features extraction that utilizes color space transformation and a fuzzy-based c-means clustering technique. Using the obtained visual cues closed-set audio-visual speaker identification experiments are performed on the CUAVE database, [1] showing promising results.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::26261a3878758131e69dbb4134b61ef5 https://doi.org/10.1109/icme.2006.262594 Zobrazit plný text záznamu