Výsledky vyhledávání - "Gudmalwar, Ashishkumar"

Report

DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing

Autor: Sahipjohn, Neha, Gudmalwar, Ashishkumar, Shah, Nirmesh, Wasnik, Pankaj, Shah, Rajiv Ratn

Audio-visual alignment after dubbing is a challenging research problem. To this end, we propose a novel method, DubWise Multi-modal Large Language Model (LLM)-based Text-to-Speech (TTS), which can control the speech duration of synthesized speech in

Externí odkaz: http://arxiv.org/abs/2406.08802

Zobrazit plný text záznamu

Report

VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech

Autor: Gudmalwar, Ashishkumar, Shah, Nirmesh, Akarsh, Sai, Wasnik, Pankaj, Shah, Rajiv Ratn

Despite the significant advancements in Text-to-Speech (TTS) systems, their full utilization in automatic dubbing remains limited. This task necessitates the extraction of voice identity and emotional style from a reference speech in a source languag

Externí odkaz: http://arxiv.org/abs/2406.08076

Zobrazit plný text záznamu

Report

Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning

Autor: Mhaskar, Shivam Ratnakant, Shah, Nirmesh J., Zaki, Mohammadi, Gudmalwar, Ashishkumar P., Wasnik, Pankaj, Shah, Rajiv Ratn

Traditional Automatic Video Dubbing (AVD) pipeline consists of three key modules, namely, Automatic Speech Recognition (ASR), Neural Machine Translation (NMT), and Text-to-Speech (TTS). Within AVD pipelines, isometric-NMT algorithms are employed to r

Externí odkaz: http://arxiv.org/abs/2403.15469

Zobrazit plný text záznamu

Multichannel CNN-BLSTM Architecture for Speech Emotion Recognition System by Fusion of Magnitude and Phase Spectral Features Using DCCA for Consumer Applications

Autor: Gudmalwar Ashishkumar Prabhakar, Biplove Basel, Anirban Dutta, Ch. V. Rama Rao

Publikováno v: IEEE Transactions on Consumer Electronics. 69:226-235

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::7fc00c0e21735f4a3df4e2b0826378a2
https://doi.org/10.1109/tce.2023.3236972

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání