Výsledky vyhledávání - "Park, Heayoung"

Report

FastFit: Towards Real-Time Iterative Neural Vocoder by Replacing U-Net Encoder With Multiple STFTs

Autor: Jang, Won, Lim, Dan, Park, Heayoung

This paper presents FastFit, a novel neural vocoder architecture that replaces the U-Net encoder with multiple short-time Fourier transforms (STFTs) to achieve faster generation rates without sacrificing sample quality. We replaced each encoder block

Externí odkaz: http://arxiv.org/abs/2305.10823

Zobrazit plný text záznamu

Report

JDI-T: Jointly trained Duration Informed Transformer for Text-To-Speech without Explicit Alignment

Autor: Lim, Dan, Jang, Won, O, Gyeonghwan, Park, Heayoung, Kim, Bongwan, Yoon, Jaesam

We propose Jointly trained Duration Informed Transformer (JDI-T), a feed-forward Transformer with a duration predictor jointly trained without explicit alignments in order to generate an acoustic feature sequence from an input text. In this work, ins

Externí odkaz: http://arxiv.org/abs/2005.07799

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání