Výsledky vyhledávání

Akademický článek

An Investigation of Fundamental Frequency Pattern Prediction for Japanese Electrolaryngeal Speech Enhancement Based on Frame-Wise Phoneme Representations

Autor: Mohammad Eshghi, Tomoki Toda

Publikováno v: IEEE Access, Vol 12, Pp 50137-50153 (2024)

Total laryngectomy (TL) is as a well-established treatment for advanced laryngeal malignancies, entailing the complete removal of the larynx. Speech rehabilitation following TL is crucial for improving the quality of life and facilitating social rein

Externí odkaz: https://doaj.org/article/84f969f43c0b4915a9957d13cc4f1d3f

Zobrazit plný text záznamu

Akademický článek

Fast Neural Speech Waveform Generative Models With Fully-Connected Layer-Based Upsampling

Autor: Haruki Yamashita, Takuma Okamoto, Ryoichi Takashima, Yamato Ohtani, Tetsuya Takiguchi, Tomoki Toda, Hisashi Kawai

Publikováno v: IEEE Access, Vol 12, Pp 31409-31421 (2024)

Although end-to-end (E2E) text-to-speech (TTS) models with HiFi-GAN-based neural vocoder (e.g. VITS and JETS) can achieve human-like speech quality with fast inference speed, these models still have room to further improve the inference speed with a

Externí odkaz: https://doaj.org/article/65f4b9e442c0485f987d349b630c164a

Zobrazit plný text záznamu

Akademický článek

Full-Band LPCNet: A Real-Time Neural Vocoder for 48 kHz Audio With a CPU

Autor: Keisuke Matsubara, Takuma Okamoto, Ryoichi Takashima, Tetsuya Takiguchi, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai

Publikováno v: IEEE Access, Vol 9, Pp 94923-94933 (2021)

This paper investigates a real-time neural speech synthesis system on CPUs that can synthesize high-fidelity 48 kHz speech waveforms to cover the entire frequency range audible by human beings. Although most previous studies on 48 kHz speech synthesi

Externí odkaz: https://doaj.org/article/2b1161e52b18489e9d7ce9bb4a6bde4d

Zobrazit plný text záznamu

Akademický článek

Non-Parallel Voice Conversion System With WaveNet Vocoder and Collapsed Speech Suppression

Autor: Yi-Chiao Wu, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Hayashi, Tomoki Toda

Publikováno v: IEEE Access, Vol 8, Pp 62094-62106 (2020)

In this paper, we integrate a simple non-parallel voice conversion (VC) system with a WaveNet (WN) vocoder and a proposed collapsed speech suppression technique. The effectiveness of WN as a vocoder for generating high-fidelity speech waveforms on th

Externí odkaz: https://doaj.org/article/25e27621ae974715a7711beb5a1a26f4

Zobrazit plný text záznamu

Akademický článek

A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System

Autor: Yi-Chiao Wu, Patrick Lumban Tobing, Kazuki Yasuhara, Noriyuki Matsunaga, Yamato Ohtani, Tomoki Toda

Publikováno v: APSIPA Transactions on Signal and Information Processing, Vol 11, Iss 1 (2022)

Externí odkaz: https://doaj.org/article/63870b609ef947e59ec27d9dbae3de91

Zobrazit plný text záznamu

Akademický článek

Voice Conversion With CycleRNN-Based Spectral Mapping and Finely Tuned WaveNet Vocoder

Autor: Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda

Publikováno v: IEEE Access, Vol 7, Pp 171114-171125 (2019)

In this paper, we present a novel framework for a voice conversion (VC) system based on a cyclic recurrent neural network (CycleRNN) and a finely tuned WaveNet vocoder. Even though WaveNet is capable of producing natural speech waveforms when fed wit

Externí odkaz: https://doaj.org/article/e7bfa6675bd5419cb69615d083e9b1db

Zobrazit plný text záznamu

Akademický článek

Underdetermined Source Separation Based on Generalized Multichannel Variational Autoencoder

Autor: Shogo Seki, Hirokazu Kameoka, Li Li, Tomoki Toda, Kazuya Takeda

Publikováno v: IEEE Access, Vol 7, Pp 168104-168115 (2019)

This paper deals with a multichannel audio source separation problem under underdetermined conditions. Multichannel non-negative matrix factorization (MNMF) is a powerful method for underdetermined audio source separation, which adopts the NMF concep

Externí odkaz: https://doaj.org/article/87d3296b7d2d414e895f24fbbe506e2d

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání