Showing 1 - 10 of 64 for search: '"Michael I. Mandel"'
Author:
Ali Raza Syed, Michael I. Mandel
Published in:
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Author:
Viet Anh Trinh, Michael I. Mandel
Published in:
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 29:312-323
Automatic speech recognition (ASR) has reached human performance on many clean speech corpora, but it remains worse than human listeners in noisy environments. This paper investigates whether this difference in performance might be due to a differenc
Published in:
Perspectives of the ASHA Special Interest Groups. 4:1653-1666
Purpose: The “bubble noise” technique has recently been introduced as a method to identify the regions in time–frequency maps (i.e., spectrograms) of speech that are especially important for listeners in speech recognition. This technique identif
Published in:
2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).
Published in:
INTERSPEECH
Author:
Hussein Ghaly, Michael I. Mandel
Published in:
Speech Prosody 2020.
Author:
Viet Anh Trinh, Michael I. Mandel
Published in:
INTERSPEECH
In this paper, we propose a metric that we call the structured saliency benchmark (SSBM) to evaluate importance maps computed for automatic speech recognizers on individual utterances. These maps indicate time-frequency points of the utterance that a
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6bf6c8f3f1a1282ed205e9fe0f448a3e
http://arxiv.org/abs/2005.10929
Author:
Naoyuki Kanda, David Snyder, Ashish Arora, Bar Ben Yair, Christoph Boeddeker, Neville Ryant, Jan Trmal, Xuankai Chang, Emmanuel Vincent, Daniel Povey, Aswin Shanmugam Subramanian, Shinji Watanabe, Michael I. Mandel, Zhaoheng Ni, Shota Horiguchi, Takuya Yoshioka, Sanjeev Khudanpur, Vimal Manohar, Yusuke Fujita, Desh Raj, Jon Barker
Published in:
CHiME 2020-6th International Workshop on Speech Processing in Everyday Environments
CHiME 2020-6th International Workshop on Speech Processing in Everyday Environments, May 2020, Barcelona / Virtual, Spain
Following the success of the 1st, 2nd, 3rd, 4th and 5th CHiME challenges, we organize the 6th CHiME Speech Separation and Recognition Challenge (CHiME-6). The new challenge revisits the previous CHiME-5 challenge and further co
Author:
Michael I. Mandel, Zhaoheng Ni
Published in:
6th International Workshop on Speech Processing in Everyday Environments (CHiME 2020).
Author:
Michael I. Mandel, Soumi Maiti
Published in:
ICASSP
Traditional speech enhancement systems produce speech with compromised quality. Here we propose to use the high quality speech generation capability of neural vocoders for better quality speech enhancement. We term this parametric resynthesis (PR). I