Zobrazeno 1 - 10
of 28
pro vyhledávání: '"Nobukatsu Hojo"'
Publikováno v:
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Autor:
Ryo Masumura, Yoshihiro Yamazaki, Saki Mizuno, Naoki Makishima, Mana Ihori, Mihiro Uchida, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Shota Orihashi, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando
Publikováno v:
Interspeech 2022.
Publikováno v:
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 29:656-670
This paper proposes a voice conversion (VC) method based on a sequence-to-sequence (S2S) learning framework, which enables simultaneous conversion of the voice characteristics, pitch contour, and duration of input speech. We previously proposed an S2
Publikováno v:
NTT Technical Review. 18:27-31
Publikováno v:
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 28:2982-2995
We previously proposed a method that allows for nonparallel voice conversion (VC) by using a variant of generative adversarial networks (GANs) called StarGAN. The main features of our method, called StarGAN-VC, are as follows: First, it requires no p
Publikováno v:
ICASSP
Non-parallel voice conversion (VC) is a technique for training voice converters without a parallel corpus. Cycle-consistent adversarial network-based VCs (CycleGAN-VC and CycleGAN-VC2) are widely accepted as benchmark methods. However, owing to their
Publikováno v:
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 27:1432-1443
This paper proposes a non-parallel voice conversion (VC) method using a variant of the conditional variational autoencoder (VAE) called an auxiliary classifier VAE. The proposed method has two key features. First, it adopts fully convolutional archit
This paper proposes architectures that facilitate the extrapolation of emotional expressions in deep neural network (DNN)-based text-to-speech (TTS). In this study, the meaning of "extrapolate emotional expressions" is to borrow emotional expressions
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::67d77820eba73fb1ba2a9618255dab3b
Publikováno v:
INTERSPEECH
Non-parallel voice conversion (VC) is a technique for learning mappings between source and target speeches without using a parallel corpus. Recently, cycle-consistent adversarial network (CycleGAN)-VC and CycleGAN-VC2 have shown promising results reg
Publikováno v:
IEICE Transactions on Information and Systems. :462-472