Zobrazeno 1 - 10
of 43
pro vyhledávání: '"Jan Trmal"'
Autor:
Yujun Wang, Wei Zou, Guoguo Chen, Shuaijiang Zhao, Guan-Bo Wang, Mingjie Jin, Yongqing Wang, Wei-Qiang Zhang, Jiayu Du, Shuzhou Chai, Daniel Povey, Zhiyong Yan, Jan Trmal, Shinji Watanabe, Xuchen Yao, Sanjeev Khudanpur, Zhao You, Dan Su, Junbo Zhang, Chao Weng, Xiangang Li
This paper introduces GigaSpeech, an evolving, multi-domain English speech recognition corpus with 10,000 hours of high quality labeled audio suitable for supervised training, and 40,000 hours of total audio suitable for semi-supervised and unsupervi
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ac0c3ecc57cd5952693b5ab446d686f2
Autor:
Maarten Van Segbroeck, Roland Maas, Ksenia Kutsenko, Bjorn Hoffmeister, Cirenia Huerta, Jan Trmal, Xuewen Luo, Ahmed Zaid, Tinh Nguyen, Maurizio Omologo
Publikováno v:
INTERSPEECH
We present a speech data corpus that simulates a "dinner party" scenario taking place in an everyday home environment. The corpus was created by recording multiple groups of four Amazon employee volunteers having a natural conversation in English aro
Autor:
Naoyuki Kanda, David Snyder, Ashish Arora, Bar Ben Yair, Christoph Boeddeker, Neville Ryant, Jan Trmal, Xuankai Chang, Emmanuel Vincent, Daniel Povey, Aswin Shanmugam Subramanian, Shinji Watanabe, Michael I. Mandel, Zhaoheng Ni, Shota Horiguchi, Takuya Yoshioka, Sanjeev Khudanpur, Vimal Manohar, Yusuke Fujita, Desh Raj, Jon Barker
Publikováno v:
CHiME 2020-6th International Workshop on Speech Processing in Everyday Environments
CHiME 2020-6th International Workshop on Speech Processing in Everyday Environments, May 2020, Barcelona / Virtual, Spain
CHiME 2020-6th International Workshop on Speech Processing in Everyday Environments, May 2020, Barcelona / Virtual, Spain
International audience; Following the success of the 1st, 2nd, 3rd, 4th and 5th CHiME challenges we organize the 6th CHiME Speech Separation and Recognition Challenge (CHiME-6). The new challenge revisits the previous CHiME-5 challenge and further co
Autor:
Santiago Pascual, Mirco Ravanelli, Jianyuan Zhong, Jan Trmal, Joao Monteiro, Pawel Swietojanski, Yoshua Bengio
Publikováno v:
ICASSP
Despite the growing interest in unsupervised learning, extracting meaningful knowledge from unlabelled audio remains an open challenge. To take a step in this direction, we recently proposed a problem-agnostic speech encoder (PASE), that combines a c
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::99a19f66f756458a9da225e87dab1244
http://arxiv.org/abs/2001.09239
http://arxiv.org/abs/2001.09239
Publikováno v:
ASRU
Existing acoustic models can be transferred to any language with a pronunciation lexicon (lexicon) that uses the same set of sub-word units as in training. Unfortunately such lexicons are not readily available in many low-resource languages. We bypas
Publikováno v:
GlobalSIP
Unsupervised term discovery is the task of identifying and grouping reoccurring word-like patterns from the untranscribed audio data. It facilitates unsupervised acoustic model training in zero resource setting where no or minimal transcribed speech
Autor:
Desh Raj, Chun Chieh Chang, Jan Trmal, Hossein Hadian, Yiwen Shao, Sanjeev Khudanpur, Babak Rekabdar, Paola Garcia, Bagher BabaAli, David Etter, Shinji Watanabe, Vimal Manohar, Daniel Povey, Ashish Arora
Publikováno v:
ICDAR
Hybrid deep neural network hidden Markov models (DNN-HMM) have achieved impressive results on large vocabulary continuous speech recognition (LVCSR) tasks. However, the recent approaches using DNN-HMM models are not explored much for text recognition
Publikováno v:
SIGMORPHON
We investigate the problem of searching for a lexeme-set in speech by searching for its inflectional variants. Experimental results indicate how lexeme-set search performance changes with the number of hypothesized inflections, while ablation experim
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c3482004361c5eceefb64568acb218de
Publikováno v:
SLT
We present our work on improving the numerator graph for discriminative training using the lattice-free maximum mutual information (MMI) criterion. Specifically, we propose a scheme for creating unconstrained numerator graphs by removing time constra
Publikováno v:
Interspeech 2018-19th Annual Conference of the International Speech Communication Association
Interspeech 2018-19th Annual Conference of the International Speech Communication Association, Sep 2018, Hyderabad, India
INTERSPEECH
Interspeech 2018-19th Annual Conference of the International Speech Communication Association, Sep 2018, Hyderabad, India
INTERSPEECH
International audience; The CHiME challenge series aims to advance robust automatic speech recognition (ASR) technology by promoting research at the interface of speech and language processing, signal processing , and machine learning. This paper int
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::40f3c5f936ca2eea2faffe993aa09f4a
https://hal.inria.fr/hal-01744021/document
https://hal.inria.fr/hal-01744021/document