Showing 1 - 10 of 138 for search: '"Prajwal K"'
Author:
Raude, Charles, Prajwal, K R, Momeni, Liliane, Bull, Hannah, Albanie, Samuel, Zisserman, Andrew, Varol, Gül
In this work, our goals are twofold: large-vocabulary continuous sign language recognition (CSLR), and sign language retrieval. To this end, we introduce a multi-task Transformer model, CSLR2, that is able to ingest a signing sequence and output in
External link:
http://arxiv.org/abs/2405.10266
Author:
Purnendu Rout, Varsha Modipalle, Shruthi S Hedge, Nirav Patel, Sravani Uppala, Prajwal K Shetty
Published in:
Journal of Family Medicine and Primary Care, Vol 8, Iss 12, Pp 3821-3825 (2019)
Objective: Tuberculosis (TB) is a fatal infectious disease that primarily affects the pulmonary system and rarely occurs in other body organs including the oral cavity. The aim of this study was to report all patients with primary manifestations of oral
External link:
https://doaj.org/article/f467810c139b43859a0b87fab94466bc
The goal of this work is to detect and recognize sequences of letters signed using fingerspelling in British Sign Language (BSL). Previous fingerspelling recognition methods have not focused on BSL, which has a very different signing alphabet (e.g.,
External link:
http://arxiv.org/abs/2211.08954
Published in:
Journal of Clinical and Diagnostic Research, Vol 10, Iss 8, Pp ZJ10-ZJ11 (2016)
External link:
https://doaj.org/article/4cab0a14ecb444d58f143ee17daf0166
In this work, we address the problem of generating speech from silent lip videos for any speaker in the wild. In stark contrast to previous works, our method (i) is not restricted to a fixed number of speakers, (ii) does not explicitly impose constra
External link:
http://arxiv.org/abs/2209.00642
Recently, sign language researchers have turned to sign language interpreted TV broadcasts, comprising (i) a video of continuous signing and (ii) subtitles corresponding to the audio content, as a readily available and large-scale source of training
External link:
http://arxiv.org/abs/2208.02802
Author:
Ashwinirani SR, Girish Suragimath, Abhijeet R Sande, Prasad Kulkarni, Anand Nimbal, T.Shankar, T.Snigdha Gowd, Prajwal K Shetty
Published in:
Journal of Clinical and Diagnostic Research, Vol 8, Iss 10, Pp ZC40-ZC43 (2014)
Background: The study of lip-print pattern (cheiloscopy) is a scientific method for personal identification and plays a major role in forensic and criminal investigations. Objective: To compare the lip print patterns in Kerala and Maharashtra pop
External link:
https://doaj.org/article/7b3fb1b2f9b3421295ee700d0303016b
In this paper, we consider the task of spotting spoken keywords in silent video sequences -- also known as visual keyword spotting. To this end, we investigate Transformer-based models that ingest two streams, a visual encoding of the video and a pho
External link:
http://arxiv.org/abs/2110.15957
The goal of this paper is to learn strong lip reading models that can recognise speech in silent videos. Most prior works deal with the open-set visual speech recognition problem by adapting existing automatic speech recognition techniques on top of
External link:
http://arxiv.org/abs/2110.07603
In this work, we re-think the task of speech enhancement in unconstrained real-world environments. Current state-of-the-art methods use only the audio stream and are limited in their performance in a wide range of real-world noises. Recent works usin
External link:
http://arxiv.org/abs/2012.10852