Výsledky vyhledávání - "Kim, Bongwan"

Report

Intelli-Z: Toward Intelligible Zero-Shot TTS

Autor: Jung, Sunghee, Jang, Won, Yoon, Jaesam, Kim, Bongwan

Although numerous recent studies have suggested new frameworks for zero-shot TTS using large-scale, real-world data, studies that focus on the intelligibility of zero-shot TTS are relatively scarce. Zero-shot TTS demands additional efforts to ensure

Externí odkaz: http://arxiv.org/abs/2401.13921

Zobrazit plný text záznamu

Report

UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

Autor: Jang, Won, Lim, Dan, Yoon, Jaesam, Kim, Bongwan, Kim, Juntae

Most neural vocoders employ band-limited mel-spectrograms to generate waveforms. If full-band spectral features are used as the input, the vocoder can be provided with as much acoustic information as possible. However, in some models employing full-b

Externí odkaz: http://arxiv.org/abs/2106.07889

Zobrazit plný text záznamu

Report

JDI-T: Jointly trained Duration Informed Transformer for Text-To-Speech without Explicit Alignment

Autor: Lim, Dan, Jang, Won, O, Gyeonghwan, Park, Heayoung, Kim, Bongwan, Yoon, Jaesam

We propose Jointly trained Duration Informed Transformer (JDI-T), a feed-forward Transformer with a duration predictor jointly trained without explicit alignments in order to generate an acoustic feature sequence from an input text. In this work, ins

Externí odkaz: http://arxiv.org/abs/2005.07799

Zobrazit plný text záznamu