Výsledky vyhledávání - "Niu, Zhikang"

Report

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

Autor: Chen, Yushen, Niu, Zhikang, Ma, Ziyang, Deng, Keqi, Wang, Chunhui, Zhao, Jian, Yu, Kai, Chen, Xie

This paper introduces F5-TTS, a fully non-autoregressive text-to-speech system based on flow matching with Diffusion Transformer (DiT). Without requiring complex designs such as duration model, text encoder, and phoneme alignment, the text input is s

Externí odkaz: http://arxiv.org/abs/2410.06885

Zobrazit plný text záznamu

Report

NDVQ: Robust Neural Audio Codec with Normal Distribution-Based Vector Quantization

Autor: Niu, Zhikang, Chen, Sanyuan, Zhou, Long, Ma, Ziyang, Chen, Xie, Liu, Shujie

Built upon vector quantization (VQ), discrete audio codec models have achieved great success in audio compression and auto-regressive audio generation. However, existing models face substantial challenges in perceptual quality and signal distortion,

Externí odkaz: http://arxiv.org/abs/2409.12717

Zobrazit plný text záznamu

Report

VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech

Autor: Du, Chenpeng, Guo, Yiwei, Wang, Hankun, Yang, Yifan, Niu, Zhikang, Wang, Shuai, Zhang, Hui, Chen, Xie, Yu, Kai

Recent TTS models with decoder-only Transformer architecture, such as SPEAR-TTS and VALL-E, achieve impressive naturalness and demonstrate the ability for zero-shot adaptation given a speech prompt. However, such decoder-only TTS models lack monotoni

Externí odkaz: http://arxiv.org/abs/2401.14321

Zobrazit plný text záznamu

Report

Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning

Autor: Yang, Guanrou, Ma, Ziyang, Zheng, Zhisheng, Song, Yakun, Niu, Zhikang, Chen, Xie

Recent years have witnessed significant advancements in self-supervised learning (SSL) methods for speech-processing tasks. Various speech-based SSL models have been developed and present promising performance on a range of downstream tasks including

Externí odkaz: http://arxiv.org/abs/2309.13860

Zobrazit plný text záznamu