Výsledky vyhledávání - "Bharadwaj, Shikhar"

Report

STAB: Speech Tokenizer Assessment Benchmark

Autor: Vashishth, Shikhar, Singh, Harman, Bharadwaj, Shikhar, Ganapathy, Sriram, Asawaroengchai, Chulayuth, Audhkhasi, Kartik, Rosenberg, Andrew, Bapna, Ankur, Ramabhadran, Bhuvana

Representing speech as discrete tokens provides a framework for transforming speech into a format that closely resembles text, thus enabling the use of speech as an input to the widely successful large language models (LLMs). Currently, while several

Externí odkaz: http://arxiv.org/abs/2409.02384

Zobrazit plný text záznamu

Report

IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages

Autor: Singh, Harman, Gupta, Nitish, Bharadwaj, Shikhar, Tewari, Dinesh, Talukdar, Partha

As large language models (LLMs) see increasing adoption across the globe, it is imperative for LLMs to be representative of the linguistic diversity of the world. India is a linguistically diverse country of 1.4 Billion people. To facilitate research

Externí odkaz: http://arxiv.org/abs/2404.16816

Zobrazit plný text záznamu

Report

Multimodal Modeling For Spoken Language Identification

Autor: Bharadwaj, Shikhar, Ma, Min, Vashishth, Shikhar, Bapna, Ankur, Ganapathy, Sriram, Axelrod, Vera, Dalmia, Siddharth, Han, Wei, Zhang, Yu, van Esch, Daan, Ritchie, Sandy, Talukdar, Partha, Riesa, Jason

Spoken language identification refers to the task of automatically predicting the spoken language in a given utterance. Conventionally, it is modeled as a speech-based language identification task. Prior techniques have been constrained to a single m

Externí odkaz: http://arxiv.org/abs/2309.10567

Zobrazit plný text záznamu

Report

MASR: Multi-label Aware Speech Representation

Autor: Raj, Anjali, Bharadwaj, Shikhar, Ganapathy, Sriram, Ma, Min, Vashishth, Shikhar

In the recent years, speech representation learning is constructed primarily as a self-supervised learning (SSL) task, using the raw audio signal alone, while ignoring the side-information that is often available for a given speech recording. In this

Externí odkaz: http://arxiv.org/abs/2307.10982

Zobrazit plný text záznamu

Report

Label Aware Speech Representation Learning For Language Identification

Autor: Vashishth, Shikhar, Bharadwaj, Shikhar, Ganapathy, Sriram, Bapna, Ankur, Ma, Min, Han, Wei, Axelrod, Vera, Talukdar, Partha

Speech representation learning approaches for non-semantic tasks such as language recognition have either explored supervised embedding extraction methods using a classifier model or self-supervised representation learning approaches using raw data.

Externí odkaz: http://arxiv.org/abs/2306.04374

Zobrazit plný text záznamu

Report

CodeQueries: A Dataset of Semantic Queries over Code

Autor: Sahu, Surya Prakash, Mandal, Madhurima, Bharadwaj, Shikhar, Kanade, Aditya, Maniatis, Petros, Shevade, Shirish

Developers often have questions about semantic aspects of code they are working on, e.g., "Is there a class whose parent classes declare a conflicting attribute?". Answering them requires understanding code semantics such as attributes and inheritanc

Externí odkaz: http://arxiv.org/abs/2209.08372

Zobrazit plný text záznamu

MASR: Metadata Aware Speech Representation

Autor: Raj, Anjali, Bharadwaj, Shikhar, Ganapathy, Sriram, Ma, Min, Vashishth, Shikhar

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::584235e12dee30d694115d99b8fce2cf
http://arxiv.org/abs/2307.10982

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání