Zobrazeno 1 - 7
of 7
pro vyhledávání: '"Bharadwaj, Shikhar"'
Autor:
Vashishth, Shikhar, Singh, Harman, Bharadwaj, Shikhar, Ganapathy, Sriram, Asawaroengchai, Chulayuth, Audhkhasi, Kartik, Rosenberg, Andrew, Bapna, Ankur, Ramabhadran, Bhuvana
Representing speech as discrete tokens provides a framework for transforming speech into a format that closely resembles text, thus enabling the use of speech as an input to the widely successful large language models (LLMs). Currently, while several
Externí odkaz:
http://arxiv.org/abs/2409.02384
As large language models (LLMs) see increasing adoption across the globe, it is imperative for LLMs to be representative of the linguistic diversity of the world. India is a linguistically diverse country of 1.4 Billion people. To facilitate research
Externí odkaz:
http://arxiv.org/abs/2404.16816
Autor:
Bharadwaj, Shikhar, Ma, Min, Vashishth, Shikhar, Bapna, Ankur, Ganapathy, Sriram, Axelrod, Vera, Dalmia, Siddharth, Han, Wei, Zhang, Yu, van Esch, Daan, Ritchie, Sandy, Talukdar, Partha, Riesa, Jason
Spoken language identification refers to the task of automatically predicting the spoken language in a given utterance. Conventionally, it is modeled as a speech-based language identification task. Prior techniques have been constrained to a single m
Externí odkaz:
http://arxiv.org/abs/2309.10567
In the recent years, speech representation learning is constructed primarily as a self-supervised learning (SSL) task, using the raw audio signal alone, while ignoring the side-information that is often available for a given speech recording. In this
Externí odkaz:
http://arxiv.org/abs/2307.10982
Autor:
Vashishth, Shikhar, Bharadwaj, Shikhar, Ganapathy, Sriram, Bapna, Ankur, Ma, Min, Han, Wei, Axelrod, Vera, Talukdar, Partha
Speech representation learning approaches for non-semantic tasks such as language recognition have either explored supervised embedding extraction methods using a classifier model or self-supervised representation learning approaches using raw data.
Externí odkaz:
http://arxiv.org/abs/2306.04374
Autor:
Sahu, Surya Prakash, Mandal, Madhurima, Bharadwaj, Shikhar, Kanade, Aditya, Maniatis, Petros, Shevade, Shirish
Developers often have questions about semantic aspects of code they are working on, e.g., "Is there a class whose parent classes declare a conflicting attribute?". Answering them requires understanding code semantics such as attributes and inheritanc
Externí odkaz:
http://arxiv.org/abs/2209.08372
In the recent years, speech representation learning is constructed primarily as a self-supervised learning (SSL) task, using the raw audio signal alone, while ignoring the side-information that is often available for a given speech recording. In this
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::584235e12dee30d694115d99b8fce2cf
http://arxiv.org/abs/2307.10982
http://arxiv.org/abs/2307.10982