Výsledky vyhledávání

Report

Parameter-efficient Adaptation of Multilingual Multimodal Models for Low-resource ASR

Autor: Gupta, Abhishek, Parulekar, Amruta, Chattopadhyay, Sameep, Jyothi, Preethi

Automatic speech recognition (ASR) for low-resource languages remains a challenge due to the scarcity of labeled training data. Parameter-efficient fine-tuning and text-only adaptation are two popular methods that have been used to address such low-r

Externí odkaz: http://arxiv.org/abs/2410.13445

Zobrazit plný text záznamu

Report

SALSA: Speedy ASR-LLM Synchronous Aggregation

Autor: Mittal, Ashish, Prabhu, Darshan, Sarawagi, Sunita, Jyothi, Preethi

Harnessing pre-trained LLMs to improve ASR systems, particularly for low-resource languages, is now an emerging area of research. Existing methods range from using LLMs for ASR error correction to tightly coupled systems that replace the ASR decoder

Externí odkaz: http://arxiv.org/abs/2408.16542

Zobrazit plný text záznamu

Report

LoFTI: Localization and Factuality Transfer to Indian Locales

Autor: Simon, Sona Elza, Mondal, Soumen Kumar, Singhania, Abhishek, Sen, Sayambhu, Jyothi, Preethi

Large language models (LLMs) encode vast amounts of world knowledge acquired via training on large web-scale datasets crawled from the internet. However, these datasets typically exhibit a geographical bias towards English-speaking Western countries.

Externí odkaz: http://arxiv.org/abs/2407.11833

Zobrazit plný text záznamu

Report

Boosting Zero-Shot Crosslingual Performance using LLM-Based Augmentations with Effective Data Selection

Autor: Fazili, Barah, Agrawal, Ashish Sunil, Jyothi, Preethi

Large language models (LLMs) are very proficient text generators. We leverage this capability of LLMs to generate task-specific data via zero-shot prompting and promote cross-lingual transfer for low-resource target languages. Given task-specific dat

Externí odkaz: http://arxiv.org/abs/2407.10582

Zobrazit plný text záznamu

Report

CharSS: Character-Level Transformer Model for Sanskrit Word Segmentation

Autor: Bhatt, Krishnakant, J, Karthika N, Ramakrishnan, Ganesh, Jyothi, Preethi

Subword tokens in Indian languages inherently carry meaning, and isolating them can enhance NLP tasks, making sub-word segmentation a crucial process. Segmenting Sanskrit and other Indian languages into subtokens is not straightforward, as it may inc

Externí odkaz: http://arxiv.org/abs/2407.06331

Zobrazit plný text záznamu

Report

Improving Self-supervised Pre-training using Accent-Specific Codebooks

Autor: Prabhu, Darshan, Gupta, Abhishek, Nitsure, Omkar, Jyothi, Preethi, Ganapathy, Sriram

Speech accents present a serious challenge to the performance of state-of-the-art end-to-end Automatic Speech Recognition (ASR) systems. Even with self-supervised learning and pre-training of ASR models, accent invariance is seldom achieved. In this

Externí odkaz: http://arxiv.org/abs/2407.03734

Zobrazit plný text záznamu

Report

Multi-Convformer: Extending Conformer with Multiple Convolution Kernels

Autor: Prabhu, Darshan, Peng, Yifan, Jyothi, Preethi, Watanabe, Shinji

Convolutions have become essential in state-of-the-art end-to-end Automatic Speech Recognition~(ASR) systems due to their efficient modelling of local context. Notably, its use in Conformers has led to superior performance compared to vanilla Transfo

Externí odkaz: http://arxiv.org/abs/2407.03718

Zobrazit plný text záznamu

Report

CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving

Autor: Shankar, Bhavani, Jyothi, Preethi, Bhattacharyya, Pushpak

Code-switching is a widely prevalent linguistic phenomenon in multilingual societies like India. Building speech-to-text models for code-switched speech is challenging due to limited availability of datasets. In this work, we focus on the problem of

Externí odkaz: http://arxiv.org/abs/2406.10993

Zobrazit plný text záznamu

Report

ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation

Autor: Song, Yurun, Zhao, Junchen, Harris, Ian G., Jyothi, Sangeetha Abdu

This study introduces an approach to optimize Parameter Efficient Fine Tuning (PEFT) for Pretrained Language Models (PLMs) by implementing a Shared Low Rank Adaptation (ShareLoRA). By strategically deploying ShareLoRA across different layers and adap

Externí odkaz: http://arxiv.org/abs/2406.10785

Zobrazit plný text záznamu

Report

A Little Aggression Goes a Long Way

Autor: Krishnan, Jyothi, Misra, Neeldhara, Nanoti, Saraswati Girish

Aggression is a two-player game of troop placement and attack played on a map (modeled as a graph). Players take turns deploying troops on a territory (a vertex on the graph) until they run out. Once all troops are placed, players take turns attackin

Externí odkaz: http://arxiv.org/abs/2406.05742

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání