Zobrazeno 1 - 10
of 15
pro vyhledávání: '"Takanori Ashihara"'
Publikováno v:
IEEE Access, Vol 12, Pp 98835-98855 (2024)
Self-supervised learning (SSL), an unsupervised representation learning technique, has received widespread attention across various modalities. Speech, with its inherent complexity encompassing acoustic (e.g., speaker, phoneme, and paralinguistic cue
Externí odkaz:
https://doaj.org/article/e4b647188a4c4606a0dc899c95f100cf
Autor:
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Hiroshi Sato, Taiga Yamane, Takanori Ashihara, Kohei Matsuura, Takafumi Moriya
Publikováno v:
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Autor:
Takafumi Moriya, Takanori Ashihara, Atsushi Ando, Hiroshi Sato, Tomohiro Tanaka, Kohei Matsuura, Ryo Masumura, Marc Delcroix, Takahiro Shinozaki
Publikováno v:
ICASSP. :8282-8286
Self-supervised learning (SSL) has been dramatically successful not only in monolingual but also in cross-lingual settings. However, since the two settings have been studied individually in general, there has been little research focusing on how effe
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::9607f09fe7d7b77892eb7909b01750c0
Autor:
Tomohiro Tanaka, Ryo Masumura, Hiroshi Sato, Mana Ihori, Kohei Matsuura, Takanori Ashihara, Takafumi Moriya
Publikováno v:
Interspeech 2022.
Autor:
Atsushi Ando, Yumiko Murata, Ryo Masumura, Satoshi Suzuki, Naoki Makishima, Takafumi Moriya, Takanori Ashihara, Hiroshi Sato
Publikováno v:
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Publikováno v:
Interspeech 2021.
Autor:
Takanori Ashihara, Marc Delcroix, Hiroshi Sato, Ryo Masumura, Tsubasa Ochiai, Ando Atsushi, Tomohiro Tanaka, Takafumi Moriya, Taichi Asami
Publikováno v:
Interspeech 2021.
Autor:
Takafumi Moriya, Mana Ihori, Takanori Ashihara, Shota Orihashi, Akihiko Takashima, Naoki Makishima, Tomohiro Tanaka, Ryo Masumura
Publikováno v:
Interspeech 2021.
We propose a cross-modal transformer-based neural correction models that refines the output of an automatic speech recognition (ASR) system so as to exclude ASR errors. Generally, neural correction models are composed of encoder-decoder networks, whi
Autor:
Tomoki Toda, Ryo Masumura, Takanori Ashihara, Hiroshi Sato, Ando Atsushi, Yusuke Ijima, Takafumi Moriya
Publikováno v:
ICASSP
This paper presents a novel speech emotion recognition scheme that can deal with the individuality of emotion perception. Most conventional methods directly model the majority decision of multiple listener’s perceived emotions. However, emotion per