Showing 1 - 10 of 40 for search: '"Arsikere, Harish"'
Author:
Mitra, Soumyajit, Ray, Swayambhu Nath, Padi, Bharat, Sen, Arunasish, Bilgi, Raghavendra, Arsikere, Harish, Ghosh, Shalini, Srinivasamurthy, Ajay, Garimella, Sri
Modern Automatic Speech Recognition (ASR) systems often use a portfolio of domain-specific models in order to get high accuracy for distinct user utterance types across different devices. In this paper, we propose an innovative approach that integrat
External link:
http://arxiv.org/abs/2205.06655
Author:
Ray, Swayambhu Nath, Wu, Minhua, Raju, Anirudh, Ghahremani, Pegah, Bilgi, Raghavendra, Rao, Milind, Arsikere, Harish, Rastrow, Ariya, Stolcke, Andreas, Droppo, Jasha
Published in:
Proc. Interspeech, Sept. 2021, pp. 3455-3459
Comprehending the overall intent of an utterance helps a listener recognize the individual words spoken. Inspired by this fact, we perform a novel study of the impact of explicitly incorporating intent representations as additional information to imp
External link:
http://arxiv.org/abs/2105.07071
Author:
Hu, Hu, Yang, Xuesong, Raeesy, Zeynab, Guo, Jinxi, Keskin, Gokce, Arsikere, Harish, Rastrow, Ariya, Stolcke, Andreas, Maas, Roland
Accents mismatching is a critical problem for end-to-end ASR. This paper aims to address this problem by building an accent-robust RNN-T system with domain adversarial training (DAT). We unveil the magic behind DAT and provide, for the first time, a
External link:
http://arxiv.org/abs/2012.07353
Author:
Swarup, Prakhar, Chakrabarty, Debmalya, Sapru, Ashtosh, Tulsiani, Hitesh, Arsikere, Harish, Garimella, Sri
Semi-supervised learning (SSL) is an active area of research which aims to utilize unlabelled data in order to improve the accuracy of speech recognition systems. The current study proposes a methodology for integration of two key ideas: 1) SSL using
External link:
http://arxiv.org/abs/2008.03923
Author:
Punjabi, Surabhi, Arsikere, Harish, Raeesy, Zeynab, Chandak, Chander, Bhave, Nikhil, Bansal, Ankish, Müller, Markus, Murillo, Sergio, Rastrow, Ariya, Garimella, Sri, Maas, Roland, Hans, Mat, Mouchtaris, Athanasios, Kunzmann, Siegfried
Multilingual ASR technology simplifies model training and deployment, but its accuracy is known to depend on the availability of language information at runtime. Since language identity is seldom known beforehand in real-world scenarios, it must be i
External link:
http://arxiv.org/abs/2007.03900
Building conversational speech recognition systems for new languages is constrained by the availability of utterances that capture user-device interactions. Data collection is both expensive and limited by the speed of manual transcription. In order
External link:
http://arxiv.org/abs/1912.00958
Published in:
In Speech Communication January 2013 55(1):51-70
Academic article
Author:
Arsikere, Harish
Published in:
Arsikere, Harish. (2014). On the role of subglottal acoustics in height estimation, and speech and speaker recognition. UCLA: Electrical Engineering 0303. Retrieved from: http://www.escholarship.org/uc/item/2fz2q7s8
The subglottal system comprises the trachea, bronchi and their accompanying airways. Its configuration changes very little compared to that of the supraglottal vocal tract, as a result of which its acoustic properties are relatively more stationary a
External link:
https://explore.openaire.eu/search/publication?articleId=od_______325::35faaad12f240c1aafe85da9f8c70d07
http://n2t.net/ark:/13030/m5q54xc7
Published in:
2016 IEEE International Conference on Acoustics, Speech & Signal Processing (ICASSP); 2016, p6105-6109, 5p