Zobrazeno 1 - 10
of 1 582
pro vyhledávání: '"Wasnik A"'
Autor:
Chudasama, Vishal, Sarkar, Hiran, Wasnik, Pankaj, Balasubramanian, Vineeth N, Kalla, Jayateja
Object detection is a critical field in computer vision focusing on accurately identifying and locating specific objects in images or videos. Traditional methods for object detection rely on large labeled training datasets for each object category, w
Externí odkaz:
http://arxiv.org/abs/2408.14249
Audio-visual alignment after dubbing is a challenging research problem. To this end, we propose a novel method, DubWise Multi-modal Large Language Model (LLM)-based Text-to-Speech (TTS), which can control the speech duration of synthesized speech in
Externí odkaz:
http://arxiv.org/abs/2406.08802
Despite the significant advancements in Text-to-Speech (TTS) systems, their full utilization in automatic dubbing remains limited. This task necessitates the extraction of voice identity and emotional style from a reference speech in a source languag
Externí odkaz:
http://arxiv.org/abs/2406.08076
Self-supervised learned (SSL) models such as Wav2vec and HuBERT yield state-of-the-art results on speech-related tasks. Given the effectiveness of such models, it is advantageous to use them in conventional ASR systems. While some approaches suggest
Externí odkaz:
http://arxiv.org/abs/2404.12628
Autor:
Mhaskar, Shivam Ratnakant, Shah, Nirmesh J., Zaki, Mohammadi, Gudmalwar, Ashishkumar P., Wasnik, Pankaj, Shah, Rajiv Ratn
Traditional Automatic Video Dubbing (AVD) pipeline consists of three key modules, namely, Automatic Speech Recognition (ASR), Neural Machine Translation (NMT), and Text-to-Speech (TTS). Within AVD pipelines, isometric-NMT algorithms are employed to r
Externí odkaz:
http://arxiv.org/abs/2403.15469
Deep learning methods have led to significant improvements in the performance on the facial landmark detection (FLD) task. However, detecting landmarks in challenging settings, such as head pose changes, exaggerated expressions, or uneven illuminatio
Externí odkaz:
http://arxiv.org/abs/2402.15044
Semi-supervised object detection (SSOD) has made significant progress with the development of pseudo-label-based end-to-end methods. However, many of these methods face challenges due to class imbalance, which hinders the effectiveness of the pseudo-
Externí odkaz:
http://arxiv.org/abs/2306.02268
Autor:
Biswal, Swoyam, Wasnik, Vaibhav
Publikováno v:
The European Physical Journal E, 46(4), 30 (2023)
Glutamate and glycine are important neurotransmitters in the brain. An action potential prop- agating in the terminal of a presynatic neuron causes the release of glutamate and glycine in the synapse by vesicles fusing with the cell membrane, which t
Externí odkaz:
http://arxiv.org/abs/2305.01230
Autor:
Wasnik, Vaibhav
In literature on stochastic thermodynamics it is stated that for a system connected to multiple thermal reservoirs, the transition rates between two energy levels equals the sum of transition rates corresponding to each thermal bath the system is con
Externí odkaz:
http://arxiv.org/abs/2303.14949
Publikováno v:
Exploration of Immunology, Vol 4, Iss 4, Pp 502-522 (2024)
Keratinocytes play an integral role in the human epidermis, serving as a barrier between the internal and external environment. They are immune-competent cells involved in both innate and adaptive cutaneous immune responses, crucial for maintaining s
Externí odkaz:
https://doaj.org/article/da7cdc8bb1394df59b6e00ae352a00a8