Zobrazeno 1 - 3
of 3
pro vyhledávání: '"Lashini, Ali"'
Publikováno v:
In 2022 12th International Conference on Computer and Knowledge Engineering (ICCKE) (pp. 225-230). IEEE
Lip-reading has made impressive progress in recent years, driven by advances in deep learning. Nonetheless, the prerequisite such advances is a suitable dataset. This paper provides a new in-the-wild dataset for Persian word-level lipreading containi
Externí odkaz:
http://arxiv.org/abs/2304.04068
A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset
Autor:
Peymanfard, Javad, Heydarian, Samin, Lashini, Ali, Zeinali, Hossein, Mohammadi, Mohammad Reza, Mozayani, Nasser
In recent years, significant progress has been made in automatic lip reading. But these methods require large-scale datasets that do not exist for many low-resource languages. In this paper, we have presented a new multipurpose audio-visual dataset f
Externí odkaz:
http://arxiv.org/abs/2301.10180
A multi-purpose audio-visual corpus for multi-modal Persian speech recognition: The Arman-AV dataset
Autor:
Peymanfard, Javad a, Heydarian, Samin a, 1, Lashini, Ali a, 1, Zeinali, Hossein b, Mohammadi, Mohammad Reza a, Mozayani, Nasser a, ⁎
Publikováno v:
In Expert Systems With Applications 15 March 2024 238 Part E