Výsledky vyhledávání - "Arsikere, Harish"

Report

Unified Modeling of Multi-Domain Multi-Device ASR Systems

Autor: Mitra, Soumyajit, Ray, Swayambhu Nath, Padi, Bharat, Sen, Arunasish, Bilgi, Raghavendra, Arsikere, Harish, Ghosh, Shalini, Srinivasamurthy, Ajay, Garimella, Sri

Modern Automatic Speech Recognition (ASR) systems often use a portfolio of domain-specific models in order to get high accuracy for distinct user utterance types across different devices. In this paper, we propose an innovative approach that integrat

Externí odkaz: http://arxiv.org/abs/2205.06655

Zobrazit plný text záznamu

Report

Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End

Autor: Ray, Swayambhu Nath, Wu, Minhua, Raju, Anirudh, Ghahremani, Pegah, Bilgi, Raghavendra, Rao, Milind, Arsikere, Harish, Rastrow, Ariya, Stolcke, Andreas, Droppo, Jasha

Publikováno v: Proc. Interspeech, Sept. 2021, pp. 3455-3459

Comprehending the overall intent of an utterance helps a listener recognize the individual words spoken. Inspired by this fact, we perform a novel study of the impact of explicitly incorporating intent representations as additional information to imp

Externí odkaz: http://arxiv.org/abs/2105.07071

Zobrazit plný text záznamu

Report

REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling

Autor: Hu, Hu, Yang, Xuesong, Raeesy, Zeynab, Guo, Jinxi, Keskin, Gokce, Arsikere, Harish, Rastrow, Ariya, Stolcke, Andreas, Maas, Roland

Accents mismatching is a critical problem for end-to-end ASR. This paper aims to address this problem by building an accent-robust RNN-T system with domain adversarial training (DAT). We unveil the magic behind DAT and provide, for the first time, a

Externí odkaz: http://arxiv.org/abs/2012.07353

Zobrazit plný text záznamu

Report

Knowledge Distillation and Data Selection for Semi-Supervised Learning in CTC Acoustic Models

Autor: Swarup, Prakhar, Chakrabarty, Debmalya, Sapru, Ashtosh, Tulsiani, Hitesh, Arsikere, Harish, Garimella, Sri

Semi-supervised learning (SSL) is an active area of research which aims to utilize unlabelled data in order to improve the accuracy of speech recognition systems. The current study proposes a methodology for integration of two key ideas: 1) SSL using

Externí odkaz: http://arxiv.org/abs/2008.03923

Zobrazit plný text záznamu

Report

Streaming End-to-End Bilingual ASR Systems with Joint Language Identification

Autor: Punjabi, Surabhi, Arsikere, Harish, Raeesy, Zeynab, Chandak, Chander, Bhave, Nikhil, Bansal, Ankish, Müller, Markus, Murillo, Sergio, Rastrow, Ariya, Garimella, Sri, Maas, Roland, Hans, Mat, Mouchtaris, Athanasios, Kunzmann, Siegfried

Multilingual ASR technology simplifies model training and deployment, but its accuracy is known to depend on the availability of language information at runtime. Since language identity is seldom known beforehand in real-world scenarios, it must be i

Externí odkaz: http://arxiv.org/abs/2007.03900

Zobrazit plný text záznamu

Report

Language Model Bootstrapping Using Neural Machine Translation For Conversational Speech Recognition

Autor: Punjabi, Surabhi, Arsikere, Harish, Garimella, Sri

Building conversational speech recognition systems for new languages is constrained by the availability of utterances that capture user-device interactions. Data collection is both expensive and limited by the speed of manual transcription. In order

Externí odkaz: http://arxiv.org/abs/1912.00958

Zobrazit plný text záznamu

Akademický článek

Automatic estimation of the first three subglottal resonances from adults’ speech signals with application to speaker height estimation

Autor: Arsikere, Harish, Leung, Gary K.F., Lulich, Steven M., Alwan, Abeer

Publikováno v: In Speech Communication January 2013 55(1):51-70

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

On the role of subglottal acoustics in height estimation, and speech and speaker recognition

Autor: Arsikere, Harish

Publikováno v: Arsikere, Harish. (2014). On the role of subglottal acoustics in height estimation, and speech and speaker recognition. UCLA: Electrical Engineering 0303. Retrieved from: http://www.escholarship.org/uc/item/2fz2q7s8

The subglottal system comprises the trachea, bronchi and their accompanying airways. Its configuration changes very little compared to that of the supraglottal vocal tract, as a result of which its acoustic properties are relatively more stationary a

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=od_______325::35faaad12f240c1aafe85da9f8c70d07
http://n2t.net/ark:/13030/m5q54xc7

Zobrazit plný text záznamu

Conference

Novel acoustic features for automatic dialog-act tagging.

Autor: Arsikere, Harish, Sen, Arunasish, Prathosh, A. P., Tyagi, Vivek

Publikováno v: 2016 IEEE International Conference on Acoustics, Speech & Signal Processing (ICASSP); 2016, p6105-6109, 5p

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání