Zobrazeno 1 - 10
of 56
pro vyhledávání: '"Shah, Priyanshi"'
Reinforcement Learning can be applied to various tasks, and environments. Many of these environments have a similar shared structure, which can be exploited to improve RL performance on other tasks. Transfer learning can be used to take advantage of
Externí odkaz:
http://arxiv.org/abs/2308.00318
In this paper, we present a comprehensive study on semantic segmentation with the Pascal VOC dataset. Here, we have to label each pixel with a class which in turn segments the entire image based on the objects/entities present. To tackle this, we fir
Externí odkaz:
http://arxiv.org/abs/2304.13216
Autor:
Chhimwal, Neeraj, Gupta, Anirudh, Gaur, Rishabh, Chadha, Harveen Singh, Shah, Priyanshi, Dhuriya, Ankur, Raghavan, Vivek
In this paper, we propose a pipeline to find the number of speakers, as well as audios belonging to each of these now identified speakers in a source of audio data where number of speakers or speaker labels are not known a priori. We used this approa
Externí odkaz:
http://arxiv.org/abs/2205.02475
Autor:
Gupta, Anirudh, Chhimwal, Neeraj, Dhuriya, Ankur, Gaur, Rishabh, Shah, Priyanshi, Chadha, Harveen Singh, Raghavan, Vivek
Automatic Speech Recognition (ASR) generates text which is most of the times devoid of any punctuation. Absence of punctuation is text can affect readability. Also, down stream NLP tasks such as sentiment analysis, machine translation, greatly benefi
Externí odkaz:
http://arxiv.org/abs/2203.16825
Autor:
Gupta, Anirudh, Gaur, Rishabh, Dhuriya, Ankur, Chadha, Harveen Singh, Chhimwal, Neeraj, Shah, Priyanshi, Raghavan, Vivek
In the recent years end to end (E2E) automatic speech recognition (ASR) systems have achieved promising results given sufficient resources. Even for languages where not a lot of labelled data is available, state of the art E2E ASR systems can be deve
Externí odkaz:
http://arxiv.org/abs/2203.16823
Autor:
Shah, Priyanshi, Chadha, Harveen Singh, Gupta, Anirudh, Dhuriya, Ankur, Chhimwal, Neeraj, Gaur, Rishabh, Raghavan, Vivek
We propose a new method for the calculation of error rates in Automatic Speech Recognition (ASR). This new metric is for languages that contain half characters and where the same character can be written in different forms. We implement our methodolo
Externí odkaz:
http://arxiv.org/abs/2203.16601
Autor:
Dhuriya, Ankur, Chadha, Harveen Singh, Gupta, Anirudh, Shah, Priyanshi, Chhimwal, Neeraj, Gaur, Rishabh, Raghavan, Vivek
We study the effect of applying a language model (LM) on the output of Automatic Speech Recognition (ASR) systems for Indic languages. We fine-tune wav2vec $2.0$ models for $18$ Indic languages and adjust the results with language models trained on t
Externí odkaz:
http://arxiv.org/abs/2203.16595
Autor:
Chadha, Harveen Singh, Shah, Priyanshi, Dhuriya, Ankur, Chhimwal, Neeraj, Gupta, Anirudh, Raghavan, Vivek
Training multilingual automatic speech recognition (ASR) systems is challenging because acoustic and lexical information is typically language specific. Training multilingual system for Indic languages is even more tougher due to lack of open source
Externí odkaz:
http://arxiv.org/abs/2203.16578
Autor:
Chadha, Harveen Singh, Gupta, Anirudh, Shah, Priyanshi, Chhimwal, Neeraj, Dhuriya, Ankur, Gaur, Rishabh, Raghavan, Vivek
We present Vakyansh, an end to end toolkit for Speech Recognition in Indic languages. India is home to almost 121 languages and around 125 crore speakers. Yet most of the languages are low resource in terms of data and pretrained models. Through Vaky
Externí odkaz:
http://arxiv.org/abs/2203.16512
Autor:
Shah, Priyanshi, Maheshwari, Twinkle, Patel, Deepali, Patel, Zinal, Dikkatwar, Manoj S., Rathod, Mrudangsinh M.
Publikováno v:
In Clinical Epidemiology and Global Health September-October 2024 29