Search results - "Tathe, Aniket"

Report

Transcription and translation of videos using fine-tuned XLSR Wav2Vec2 on custom dataset and mBART

Authors: Tathe, Aniket; Kamble, Anand; Kumbharkar, Suyash; Bhandare, Atharva; Mitra, Anirban C.

This research addresses the challenge of training an ASR model for personalized voices with minimal data. Utilizing just 14 minutes of custom audio from a YouTube video, we employ Retrieval-Based Voice Conversion (RVC) to create a custom Common Voice

External link: http://arxiv.org/abs/2403.00212

Report

End to end Hindi to English speech conversion using Bark, mBART and a finetuned XLSR Wav2Vec2

Authors: Tathe, Aniket; Kamble, Anand; Kumbharkar, Suyash; Bhandare, Atharva; Mitra, Anirban C.

Speech has long been a barrier to effective communication and connection, persisting as a challenge in our increasingly interconnected world. This research paper introduces a transformative solution to this persistent obstacle: an end-to-end speech co

External link: http://arxiv.org/abs/2401.06183

Report

Custom Data Augmentation for low resource ASR using Bark and Retrieval-Based Voice Conversion

Authors: Kamble, Anand; Tathe, Aniket; Kumbharkar, Suyash; Bhandare, Atharva; Mitra, Anirban C.

This paper proposes two innovative methodologies to construct customized Common Voice datasets for low-resource languages like Hindi. The first methodology leverages Bark, a transformer-based text-to-audio model developed by Suno, and incorporates Me

External link: http://arxiv.org/abs/2311.14836

Academic article

Object following robot based on AI/ML

Authors: Kamble, Anand; Mitra, Anirban C.; Tathe, Aniket; Kumbharkar, Suyash; Bhandare, Atharva

Published in: Materials Today: Proceedings 2023 72 Part 3:1817-1824
