Zobrazeno 1 - 10
of 25
pro vyhledávání: '"Sunayana Sitaram"'
Autor:
Anirudh Srinivasan, Gauri Kholkar, Rahul Kejriwal, Tanuja Ganu, Sandipan Dandapat, Sunayana Sitaram, Balakrishnan Santhanam, Somak Aditya, Kalika Bali, Monojit Choudhury
Publikováno v:
Proceedings of the AAAI Conference on Artificial Intelligence. 36:13227-13229
Pre-trained multilingual language models are gaining popularity due to their cross-lingual zero-shot transfer ability, but these models do not perform equally well in all languages. Evaluating task-specific performance of a model in a large number of
Autor:
Chiranjeevi Yarra, Anuj Diwan, Ankita Singh, Preethi Jyothi, Saurabh Vyas, Karthik Sankaranarayanan, Vinit Unni, Sunayana Sitaram, Samarth Bharadwaj, Ashish Mittal, Kalika Bali, Prasanta Kumar Ghosh, Srinivasa Raghavan, Jai Nanavati, Sanket Shah, Vivek Seshadri, Rakesh Vaideeswaran, Raoul Nanavati, Shreya Khare, Akash Rajpuria
Publikováno v:
Interspeech 2021.
Publikováno v:
ACL/IJCNLP (1)
59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021)
59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021)
The analysis of data in which multiple languages are represented has gained popularity among computational linguists in recent years. So far, much of this research focuses mainly on the improvement of computational methods and largely ignores linguis
Publikováno v:
EACL (System Demonstrations)
Code-mixing is common in multilingual communities around the world, and processing it is challenging due to the lack of labeled and unlabeled data. We describe a tool that can automatically generate code-mixed data given parallel data in two language
Publikováno v:
ACL
Code-switching is the use of more than one language in the same conversation or utterance. Recently, multilingual contextual embedding models, trained on multiple monolingual corpora, have shown promising results on cross-lingual and multilingual tas
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::de1e82356acfa3558f6494ce3471c8fc
Using Monolingual Speech Recognition for Spoken Term Detection in Code-switched Hindi-English Speech
Autor:
Sanket Shah, Sunayana Sitaram
Publikováno v:
ICDM Workshops
Code-switching is the alternation of two or more languages in a single utterance or a conversation and is prevalent in multilingual communities all over the world. Spoken Term Detection (STD) is the task of detecting a given word or phrase in audio.
Publikováno v:
Proceedings of the First Workshop on Aggregating and Analysing Crowdsourced Annotations for NLP.
Code-switching refers to the alternation of two or more languages in a conversation or utterance and is common in multilingual communities across the world. Building code-switched speech and natural language processing systems are challenging due to
Publikováno v:
INTERSPEECH
Code-switching or mixing is the use of multiple languages in a single utterance or conversation. Borrowing occurs when a word from a foreign language becomes part of the vocabulary of a language. In multilingual societies, switching/mixing and borrow
Effect of TTS Generated Audio on OOV Detection and Word Error Rate in ASR for Low-resource Languages
Publikováno v:
INTERSPEECH
Out-of-Vocabulary (OOV) detection and recovery is an important aspect of reducing Word Error Rate (WER) in Automatic Speech Recognition (ASR). In this paper, we evaluate the effect of OOV detection and recovery for a low-resource language on WER. We
Autor:
Radhakrishnan Srikanth, Krishna Doss Mohan, Sunayana Sitaram, Kalika Bali, Brij Mohan Lal Srivastava, Rupesh Kumar Mehta, Niranjan S. Nayak, Sandeepkumar Satpal, Pallavi Matani
Publikováno v:
SLTU
India has more than 1500 languages, with 30 of them spoken by more than one million native speakers. Most of them are low-resource and could greatly benefit from speech and language technologies. Building speech recognition support for these low-res