Zobrazeno 1 - 10
of 339
pro vyhledávání: '"Tomoki Toda"'
Autor:
Mohammad Eshghi, Tomoki Toda
Publikováno v:
IEEE Access, Vol 12, Pp 50137-50153 (2024)
Total laryngectomy (TL) is as a well-established treatment for advanced laryngeal malignancies, entailing the complete removal of the larynx. Speech rehabilitation following TL is crucial for improving the quality of life and facilitating social rein
Externí odkaz:
https://doaj.org/article/84f969f43c0b4915a9957d13cc4f1d3f
Autor:
Haruki Yamashita, Takuma Okamoto, Ryoichi Takashima, Yamato Ohtani, Tetsuya Takiguchi, Tomoki Toda, Hisashi Kawai
Publikováno v:
IEEE Access, Vol 12, Pp 31409-31421 (2024)
Although end-to-end (E2E) text-to-speech (TTS) models with HiFi-GAN-based neural vocoder (e.g. VITS and JETS) can achieve human-like speech quality with fast inference speed, these models still have room to further improve the inference speed with a
Externí odkaz:
https://doaj.org/article/65f4b9e442c0485f987d349b630c164a
Autor:
Keisuke Matsubara, Takuma Okamoto, Ryoichi Takashima, Tetsuya Takiguchi, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai
Publikováno v:
IEEE Access, Vol 9, Pp 94923-94933 (2021)
This paper investigates a real-time neural speech synthesis system on CPUs that can synthesize high-fidelity 48 kHz speech waveforms to cover the entire frequency range audible by human beings. Although most previous studies on 48 kHz speech synthesi
Externí odkaz:
https://doaj.org/article/2b1161e52b18489e9d7ce9bb4a6bde4d
Publikováno v:
IEEE Access, Vol 8, Pp 62094-62106 (2020)
In this paper, we integrate a simple non-parallel voice conversion (VC) system with a WaveNet (WN) vocoder and a proposed collapsed speech suppression technique. The effectiveness of WN as a vocoder for generating high-fidelity speech waveforms on th
Externí odkaz:
https://doaj.org/article/25e27621ae974715a7711beb5a1a26f4
Autor:
Yi-Chiao Wu, Patrick Lumban Tobing, Kazuki Yasuhara, Noriyuki Matsunaga, Yamato Ohtani, Tomoki Toda
Publikováno v:
APSIPA Transactions on Signal and Information Processing, Vol 11, Iss 1 (2022)
Externí odkaz:
https://doaj.org/article/63870b609ef947e59ec27d9dbae3de91
Publikováno v:
IEEE Access, Vol 7, Pp 171114-171125 (2019)
In this paper, we present a novel framework for a voice conversion (VC) system based on a cyclic recurrent neural network (CycleRNN) and a finely tuned WaveNet vocoder. Even though WaveNet is capable of producing natural speech waveforms when fed wit
Externí odkaz:
https://doaj.org/article/e7bfa6675bd5419cb69615d083e9b1db
Publikováno v:
IEEE Access, Vol 7, Pp 168104-168115 (2019)
This paper deals with a multichannel audio source separation problem under underdetermined conditions. Multichannel non-negative matrix factorization (MNMF) is a powerful method for underdetermined audio source separation, which adopts the NMF concep
Externí odkaz:
https://doaj.org/article/87d3296b7d2d414e895f24fbbe506e2d
Autor:
Keisuke Matsubara, Takuma Okamoto, Ryoichi Takashima, Tetsuya Takiguchi, Tomoki Toda, Hisashi Kawai
Publikováno v:
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 31:1902-1915
Publikováno v:
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Autor:
Atsushi Miyashita, Tomoki Toda
Publikováno v:
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).