Showing 1 - 4 of 4 for search: '"Parthasarathy, Partha"'
Neural transducer is now the most popular end-to-end model for speech recognition, due to its naturally streaming ability. However, it is challenging to adapt it with text-only data. Factorized neural transducer (FNT) model was proposed to mitigate t
External link:
http://arxiv.org/abs/2212.01992
Author:
Yoshioka, Takuya, Abramovski, Igor, Aksoylar, Cem, Chen, Zhuo, David, Moshe, Dimitriadis, Dimitrios, Gong, Yifan, Gurvich, Ilya, Huang, Xuedong, Huang, Yan, Hurvitz, Aviv, Jiang, Li, Koubi, Sharon, Krupka, Eyal, Leichter, Ido, Liu, Changliang, Parthasarathy, Partha, Vinnikov, Alon, Wu, Lingfeng, Xiao, Xiong, Xiong, Wayne, Wang, Huaming, Wang, Zhenghao, Zhang, Jun, Zhao, Yong, Zhou, Tianyan
This paper describes a system that generates speaker-annotated transcripts of meetings by using a microphone array and a 360-degree camera. The hallmark of the system is its ability to handle overlapped speech, which has been an unsolved problem in r
External link:
http://arxiv.org/abs/1912.04979
Published in:
2014 IEEE International Conference on Acoustics, Speech & Signal Processing (ICASSP); 2014, p3236-3240, 5p
Academic article