Showing 1 - 10 of 12 for search: '"Gudmalwar, Ashishkumar"'
Audio-visual alignment after dubbing is a challenging research problem. To this end, we propose a novel method, DubWise Multi-modal Large Language Model (LLM)-based Text-to-Speech (TTS), which can control the speech duration of synthesized speech in
External link:
http://arxiv.org/abs/2406.08802
Despite the significant advancements in Text-to-Speech (TTS) systems, their full utilization in automatic dubbing remains limited. This task necessitates the extraction of voice identity and emotional style from a reference speech in a source languag
External link:
http://arxiv.org/abs/2406.08076
Authors:
Mhaskar, Shivam Ratnakant, Shah, Nirmesh J., Zaki, Mohammadi, Gudmalwar, Ashishkumar P., Wasnik, Pankaj, Shah, Rajiv Ratn
Traditional Automatic Video Dubbing (AVD) pipeline consists of three key modules, namely, Automatic Speech Recognition (ASR), Neural Machine Translation (NMT), and Text-to-Speech (TTS). Within AVD pipelines, isometric-NMT algorithms are employed to r
External link:
http://arxiv.org/abs/2403.15469
Published in:
IEEE Transactions on Consumer Electronics. 69:226-235
Academic article
Published in:
2022 IEEE 19th India Council International Conference (INDICON).
Authors:
Rantu Buragohain, R. Aditya Reddy, Yenduri Venkatesh, Gudmalwar Ashishkumar Prabhakar, Ch. V. Rama Rao
Published in:
Communications in Computer and Information Science ISBN: 9783031070044
External links:
https://explore.openaire.eu/search/publication?articleId=doi_________::7aa07fd681a71ea7a765565ce90e9989
https://doi.org/10.1007/978-3-031-07005-1_28
Published in:
Communications in Computer and Information Science ISBN: 9783031070044
External links:
https://explore.openaire.eu/search/publication?articleId=doi_________::ac72140109cc984dacb0e5b885ebd881
https://doi.org/10.1007/978-3-031-07005-1_29
Authors:
Sudesna Manjari Mahanta, Srikant Kumar Beura, Bishnulatpam Pushpa Devi, Gudmalwar Ashishkumar Prabhakar, Prabir Saha
Published in:
ICCCNT
In this paper, an inexact 3:2 compressor has been proposed. The proposed design has been used in radix-4 based 8×8 multiplier for the addition of intermediate partial products. Error parameters have been evaluated using MATLAB and compared with repo
Published in:
International Journal of Speech Technology; Sep 2019, Vol. 22, Issue 3, pp. 521-531 (11 pages)