Zobrazeno 1 - 10
of 31
pro vyhledávání: '"Ivan Bulyko"'
Autor:
Rahul Pandey, Roger Ren, Qi Luo, Jing Liu, Ariya Rastrow, Ankur Gandhe, Denis Filimonov, Grant Strimel, Andreas Stolcke, Ivan Bulyko
End-to-End (E2E) automatic speech recognition (ASR) systems used in voice assistants often have difficulties recognizing infrequent words personalized to the user, such as names and places. Rare words often have non-trivial pronunciations, and in suc
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::0f33cfd7f5039f3615dd10a65c9c5d3c
http://arxiv.org/abs/2303.17131
http://arxiv.org/abs/2303.17131
Autor:
Ivan Bulyko
Publikováno v:
Христианское чтение. :111-124
Publikováno v:
Interspeech 2021.
Autor:
Church Arts, Ivan Bulyko
Publikováno v:
Vestnik of Northern (Arctic) Federal University. Series "Humanitarian and Social Sciences". :99-109
Autor:
Aditya Gourav, Ankur Gandhe, Denis Filimonov, Linda Liu, Ariya Rastrow, Ivan Bulyko, Shashank Kalmane, Yile Gu
Publikováno v:
ICASSP
As voice assistants become more ubiquitous, they are increasingly expected to support and perform well on a wide variety of use-cases across different domains. We present a domain-aware rescoring framework suitable for achieving domain-adaptation dur
Autor:
Yile Gu, Gautam Tiwari, Ivan Bulyko, Shashank Kalmane, Linda Liu, Andreas Stolcke, Guitang Lan, Xiangyang Huang, Denis Filimonov, Ankur Gandhe, Ariya Rastrow, Aditya Gourav
Publikováno v:
ICASSP
The recognition of personalized content, such as contact names, remains a challenging problem for end-to-end speech recognition systems. In this work, we demonstrate how first and second-pass rescoring strategies can be leveraged together to improve
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b478fe456c395d84b6ae8a16f4a1ab7a
Autor:
Ariya Rastrow, Ivan Bulyko, Ankur Gandhe, Andreas Stolcke, Scott Novotney, Richard Diehl Martinez
Publikováno v:
ACL/IJCNLP (Findings)
Language modeling (LM) for automatic speech recognition (ASR) does not usually incorporate utterance level contextual information. For some domains like voice assistants, however, additional context, such as the time at which an utterance was spoken,
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5ae87029e06fe835d4cd27f5e1e906b9
Autor:
Chao-Han Huck Yang, Linda Liu, Ankur Gandhe, Yile Gu, Anirudh Raju, Denis Filimonov, Ivan Bulyko
End-to-end automatic speech recognition (ASR) systems are increasingly popular due to their relative architectural simplicity and competitive performance. However, even though the average accuracy of these systems may be high, the performance on rare
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::4b1f046eb5c9be44b06e0e17e81e914e
Publikováno v:
ACM Transactions on Speech and Language Processing. 5:1-25
This article describes a methodology for collecting text from the Web to match a target sublanguage both in style (register) and topic. Unlike other work that estimates n-gram statistics from page counts, the approach here is to select and filter doc
Publikováno v:
Speech Communication. 45:271-288
Speech understanding errors in spoken dialogue systems can be frustrating for users and difficult to recover from in a mixed-initiative spoken dialogue system. Handling such errors requires both detecting error conditions and adjusting the response g