Showing 1 - 10 of 22
for search: '"Murali Karthick Baskar"'
Published in:
Interspeech 2022.
Masked speech modeling (MSM) methods such as wav2vec2 or w2v-BERT learn representations over speech frames which are randomly masked within an utterance. While these methods improve performance of Automatic Speech Recognition (ASR) systems, they have …
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::950aa5ea6024ec86da39dbb74c4e24d6
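The abstract above describes randomly masking speech frames within an utterance. As an illustration only, the span-masking idea can be sketched as below; the probability and span length loosely follow the published wav2vec2 defaults, and real MSM systems replace masked frames with a learned embedding rather than a constant value (both choices here are assumptions for the sketch):

```python
import numpy as np

def mask_frames(features, mask_prob=0.065, span=10, mask_value=0.0, rng=None):
    """Randomly mask contiguous spans of speech frames, wav2vec2-style.

    features: (T, D) array of frame-level features.
    Each frame starts a masked span of `span` frames with probability
    `mask_prob`; spans may overlap. Masked frames are set to
    `mask_value` (a stand-in for the learned mask embedding used in
    actual MSM models).
    """
    rng = rng or np.random.default_rng()
    num_frames = features.shape[0]
    mask = np.zeros(num_frames, dtype=bool)
    starts = rng.random(num_frames) < mask_prob
    for t in np.nonzero(starts)[0]:
        mask[t:t + span] = True  # slicing clips safely at the end
    masked = features.copy()
    masked[mask] = mask_value
    return masked, mask
```

The model is then trained to predict (a quantized or contextual target for) the original content at the masked positions.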
Author:
Shinji Watanabe, Jan Cernocky, Ramón Fernandez Astudillo, Lukas Burget, Murali Karthick Baskar
Published in:
ICASSP
Self-supervised ASR-TTS models suffer in out-of-domain data conditions. Here we propose an enhanced ASR-TTS (EAT) model that incorporates two main features: 1) The ASR→TTS direction is equipped with a language model reward to penalize the …
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::59ad37018c8778608b4237075aa261f2
http://arxiv.org/abs/2104.07474
Author:
Shinji Watanabe, Yuya Fujita, Murali Karthick Baskar, Toru Taniguchi, Dung Tran, Aswin Shanmugam Subramanian, Xiaofei Wang
Published in:
WASPAA
Speech enhancement systems, which denoise and dereverberate distorted signals, are usually optimized based on signal reconstruction objectives including the maximum likelihood and minimum mean square error. However, emergent end-to-end neural methods …
Published in:
Speech Communication. 92:64-76
In this paper, we propose using deep neural networks (DNN) as a regression model to estimate speaker-normalized features from un-normalized features. We consider three types of speaker-specific feature normalization techniques, viz., feature-space ma…
Author:
Takaaki Hori, Murali Karthick Baskar, Ramón Fernandez Astudillo, Shinji Watanabe, Jan Cernocký, Lukas Burget
Published in:
INTERSPEECH
Sequence-to-sequence automatic speech recognition (ASR) models require large quantities of data to attain high performance. For this reason, there has been a recent surge in interest for unsupervised and semi-supervised training in such models. This …
Author:
Najim Dehak, Hirofumi Inaguma, Takaaki Hori, Murali Karthick Baskar, Shinji Watanabe, Jesús Villalba, Jaejin Cho
Published in:
ICASSP
In this paper, we explore several new schemes to train a seq2seq model to integrate a pre-trained LM. Our proposed fusion methods focus on the memory cell state and the hidden state in the seq2seq decoder long short-term memory (LSTM), and the memory …
Published in:
ICASSP
This work explores better adaptation methods to low-resource languages using an external language model (LM) under the framework of transfer learning. We first build a language-independent ASR system in a unified sequence-to-sequence (S2S) architectu…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::fb4ef0b208410575406d6736f59dde8a
http://arxiv.org/abs/1811.02134
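The entry above concerns combining a seq2seq ASR system with an external language model. One standard, widely used way of doing this is shallow fusion, sketched below; the sketch is illustrative only, the weight 0.3 is a hypothetical value that is tuned per task, and the paper itself may use a different integration scheme:

```python
def rescore_beam(hyps, lm_weight=0.3):
    """Rescore beam-search hypotheses with an external LM (shallow fusion).

    hyps: list of (text, asr_logprob, lm_logprob) tuples.
    The combined score is asr_logprob + lm_weight * lm_logprob;
    hypotheses are returned best-first.
    """
    return sorted(
        hyps,
        key=lambda h: h[1] + lm_weight * h[2],
        reverse=True,
    )
```

For example, a hypothesis with a slightly worse acoustic score but a much better LM score can overtake the acoustically best hypothesis after fusion.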
Author:
Bhargav Pulugundla, Jan Cernocký, Santosh Kesiraju, Murali Karthick Baskar, Ekaterina Egorova, Lukas Burget, Martin Karafiat
Published in:
INTERSPEECH