Zobrazeno 1 - 10
of 2 492
pro vyhledávání: '"Visual speech"'
Publikováno v:
Complex & Intelligent Systems, Vol 10, Iss 4, Pp 5721-5741 (2024)
Abstract Conformer-based models have proven highly effective in Audio-visual Speech Recognition, integrating auditory and visual inputs to significantly enhance speech recognition accuracy. However, the widely utilized softmax attention mechanism wit
Externí odkaz:
https://doaj.org/article/161fb7353b394aadbfa77731cc1122d9
Publikováno v:
EURASIP Journal on Audio, Speech, and Music Processing, Vol 2024, Iss 1, Pp 1-15 (2024)
Abstract Visual speech recognition (VSR) is a challenging task that has received increasing interest during the last few decades. Current state of the art employs powerful end-to-end architectures based on deep learning which depend on large amounts
Externí odkaz:
https://doaj.org/article/63367b081cf944e18bc66619b622bdba
Publikováno v:
EURASIP Journal on Audio, Speech, and Music Processing, Vol 2023, Iss 1, Pp 1-19 (2023)
Abstract Speakers with dysarthria often struggle to accurately pronounce words and effectively communicate with others. Automatic speech recognition (ASR) is a powerful tool for extracting the content from speakers with dysarthria. However, the narro
Externí odkaz:
https://doaj.org/article/468574ec11904261bfd628a1c09e2885
Publikováno v:
Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki, Vol 23, Iss 4, Pp 767-775 (2023)
Visual speech recognition or automated lip-reading systems actively apply to speech-to-text translation. Video data proves to be useful in multimodal speech recognition systems, particularly when using acoustic data is difficult or not available at
Externí odkaz:
https://doaj.org/article/66375603e608485f93cbe94a01929e68
Publikováno v:
Frontiers in Human Neuroscience, Vol 18 (2024)
Externí odkaz:
https://doaj.org/article/a6c74f88ced7450e9d51e327c2463e86
Publikováno v:
Компьютерная оптика, Vol 47, Iss 2, Pp 287-305 (2023)
Communication refers to a wide range of different behaviors and activities aimed at handing over information. The communication process includes verbal, paraverbal and non-verbal components, conveying the informational part of a message and its emoti
Externí odkaz:
https://doaj.org/article/c169b0a537a146c28320fcedbdb747c8
Publikováno v:
Acoustics, Vol 5, Iss 1, Pp 343-353 (2023)
Visual speech recognition (VSR) is a method of reading speech by noticing the lip actions of the narrators. Visual speech significantly depends on the visual features derived from the image sequences. Visual speech recognition is a stimulating proces
Externí odkaz:
https://doaj.org/article/1ab88d9b29094efa8ffd92b261fa69e0
Publikováno v:
NeuroImage, Vol 282, Iss , Pp 120391- (2023)
There is considerable debate over how visual speech is processed in the absence of sound and whether neural activity supporting lipreading occurs in visual brain areas. Much of the ambiguity stems from a lack of behavioral grounding and neurophysiolo
Externí odkaz:
https://doaj.org/article/88a422e2bc9f4b6788b1de7815107748
Publikováno v:
Компьютерная оптика, Vol 46, Iss 6, Pp 955-962 (2022)
The paper proposes a method of visual analysis for automatic speech recognition of the vehicle driver. Speech recognition in acoustically noisy conditions is one of big challenges of artificial intelligence. The problem of effective automatic lip-rea
Externí odkaz:
https://doaj.org/article/f072be7316aa453f942983362a000386
Publikováno v:
Egyptian Informatics Journal, Vol 23, Iss 4, Pp 1-12 (2022)
Lipreading is the ability to recognize words or sentences from the mouth movements of a speaking person. This process is also known as Visual Speech Recognition (VSR). Lipreading has two main advantages: facilitate communication for people with heari
Externí odkaz:
https://doaj.org/article/1ea3e88a621e4e3dbc538e88a82cdb94