Zobrazeno 1 - 10
of 8 171
pro vyhledávání: '"P. Mimura"'
Autor:
Moriya, Takafumi, Ashihara, Takanori, Mimura, Masato, Sato, Hiroshi, Matsuura, Kohei, Masumura, Ryo, Asami, Taichi
A hybrid autoregressive transducer (HAT) is a variant of neural transducer that models blank and non-blank posterior distributions separately. In this paper, we propose a novel internal acoustic model (IAM) training strategy to enhance HAT-based spee
Externí odkaz:
http://arxiv.org/abs/2409.20313
Autor:
Moriya, Takafumi, Horiguchi, Shota, Delcroix, Marc, Masumura, Ryo, Ashihara, Takanori, Sato, Hiroshi, Matsuura, Kohei, Mimura, Masato
Extending the RNN Transducer (RNNT) to recognize multi-talker speech is essential for wider automatic speech recognition (ASR) applications. Multi-talker RNNT (MT-RNNT) aims to achieve recognition without relying on costly front-end source separation
Externí odkaz:
http://arxiv.org/abs/2409.20301
Autor:
Kamo, Naoyuki, Tawara, Naohiro, Ando, Atsushi, Kano, Takatomo, Sato, Hiroshi, Ikeshita, Rintaro, Moriya, Takafumi, Horiguchi, Shota, Matsuura, Kohei, Ogawa, Atsunori, Plaquet, Alexis, Ashihara, Takanori, Ochiai, Tsubasa, Mimura, Masato, Delcroix, Marc, Nakatani, Tomohiro, Asami, Taichi, Araki, Shoko
We present a distant automatic speech recognition (DASR) system developed for the CHiME-8 DASR track. It consists of a diarization first pipeline. For diarization, we use end-to-end diarization with vector clustering (EEND-VC) followed by target spea
Externí odkaz:
http://arxiv.org/abs/2409.05554
Autor:
Matsuura, Kohei, Ashihara, Takanori, Moriya, Takafumi, Mimura, Masato, Kano, Takatomo, Ogawa, Atsunori, Delcroix, Marc
This paper introduces a novel approach called sentence-wise speech summarization (Sen-SSum), which generates text summaries from a spoken document in a sentence-by-sentence manner. Sen-SSum combines the real-time processing of automatic speech recogn
Externí odkaz:
http://arxiv.org/abs/2408.00205
This study introduces a groundbreaking approach to simultaneous interpretation by directly leveraging the predictive capabilities of Large Language Models (LLMs). We present a novel algorithm that generates real-time translations by predicting speake
Externí odkaz:
http://arxiv.org/abs/2407.14269
Autor:
Sato, Hiroshi, Moriya, Takafumi, Mimura, Masato, Horiguchi, Shota, Ochiai, Tsubasa, Ashihara, Takanori, Ando, Atsushi, Shinayama, Kentaro, Delcroix, Marc
Real-time target speaker extraction (TSE) is intended to extract the desired speaker's voice from the observed mixture of multiple speakers in a streaming manner. Implementing real-time TSE is challenging as the computational complexity must be reduc
Externí odkaz:
http://arxiv.org/abs/2407.01857
Autor:
Mimura, Yoshifumi
A parabolic system of three unknown functions, not expressible as gradient flows, is treated as three coupled gradient flows. For each unknown function, the minimizing movement scheme is used to construct a time-discrete approximate solution. Unlike
Externí odkaz:
http://arxiv.org/abs/2406.14536
Autor:
Aonishi, Toru, Nagasawa, Tatsuya, Koizumi, Toshiyuki, Gunathilaka, Mastiyage Don Sudeera Hasaranga, Mimura, Kazushi, Okada, Masato, Kako, Satoshi, Yamamoto, Yoshihisa
In recent years, quantum Ising machines have drawn a lot of attention, but due to physical implementation constraints, it has been difficult to achieve dense coupling, such as full coupling with sufficient spins to handle practical large-scale applic
Externí odkaz:
http://arxiv.org/abs/2406.05377
Autor:
Kawasaki, Morimichi, Kimura, Mitsuaki, Maruyama, Shuhei, Matsushita, Takahiro, Mimura, Masato
This article provides an expository account of the celebrated duality theorem of Bavard and three its strengthenings. The Bavard duality theorem connects scl (stable commutator length) and quasimorphisms on a group. Calegari extended the framework fr
Externí odkaz:
http://arxiv.org/abs/2406.04319
Autor:
Gunathilaka, Mastiyage Don Sudeera Hasaranga, Inui, Yoshitaka, Kako, Satoshi, Mimura, Kazushi, Okada, Masato, Yamamoto, Yoshihisa, Aonishi, Toru
Coherent Ising Machine (CIM) is a network of optical parametric oscillators that solves combinatorial optimization problems by finding the ground state of an Ising Hamiltonian. As a practical application of CIM, Aonishi et al. proposed a quantum-clas
Externí odkaz:
http://arxiv.org/abs/2405.00366