Search results

Report

SingMOS: An extensive Open-Source Singing Voice Dataset for MOS Prediction

Authors: Tang, Yuxun; Shi, Jiatong; Wu, Yuning; Jin, Qin

In speech generation tasks, human subjective ratings, usually referred to as the opinion score, are considered the "gold standard" for speech quality evaluation, with the mean opinion score (MOS) serving as the primary evaluation metric. Due to the h

External link: http://arxiv.org/abs/2406.10911

Report

SingOMD: Singing Oriented Multi-resolution Discrete Representation Construction from Speech Models

Authors: Tang, Yuxun; Wu, Yuning; Shi, Jiatong; Jin, Qin

Discrete representation has shown advantages in speech generation tasks, wherein discrete tokens are derived by discretizing hidden features from self-supervised learning (SSL) pre-trained models. However, the direct application of speech SSL models

External link: http://arxiv.org/abs/2406.08905

Report

VISinger2+: End-to-End Singing Voice Synthesis Augmented by Self-Supervised Learning Representation

Authors: Yu, Yifeng; Shi, Jiatong; Wu, Yuning; Watanabe, Shinji

Singing Voice Synthesis (SVS) has witnessed significant advancements with the advent of deep learning techniques. However, a significant challenge in SVS is the scarcity of labeled singing voice data, which limits the effectiveness of supervised lear

External link: http://arxiv.org/abs/2406.08761

Report

TokSing: Singing Voice Synthesis based on Discrete Tokens

Authors: Wu, Yuning; Zhang, Chunlei; Shi, Jiatong; Tang, Yuxun; Yang, Shan; Jin, Qin

Recent advancements in speech synthesis witness significant benefits by leveraging discrete tokens extracted from self-supervised learning (SSL) models. Discrete tokens offer higher storage efficiency and greater operability in intermediate represent

External link: http://arxiv.org/abs/2406.08416

Report

The Interspeech 2024 Challenge on Speech Processing Using Discrete Units

Authors: Chang, Xuankai; Shi, Jiatong; Tian, Jinchuan; Wu, Yuning; Tang, Yuxun; Wu, Yihan; Watanabe, Shinji; Adi, Yossi; Chen, Xie; Jin, Qin

Representing speech and audio signals in discrete units has become a compelling alternative to traditional high-dimensional feature vectors. Numerous studies have highlighted the efficacy of discrete units in various applications such as speech compr

External link: http://arxiv.org/abs/2406.07725

Report

State Space Paradox of Computational Research in Creativity

Authors: Akin, Ömer; Wu, Yuning

This paper explores the paradoxical nature of computational creativity, focusing on the inherent limitations of closed digital systems in emulating the open-ended, dynamic process of human creativity. Through a comprehensive analysis, we delve into t

External link: http://arxiv.org/abs/2404.15303

Report

Towards Human-Centered Construction Robotics: An RL-Driven Companion Robot For Contextually Assisting Carpentry Workers

Authors: Wu, Yuning; Wei, Jiaying; Oh, Jean; Llach, Daniel Cardoso

In the dynamic construction industry, traditional robotic integration has primarily focused on automating specific tasks, often overlooking the complexity and variability of human aspects in construction workflows. This paper introduces a human-cente

External link: http://arxiv.org/abs/2403.19060

Report

Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and ACE-KiSing

Authors: Shi, Jiatong; Lin, Yueqian; Bai, Xinyi; Zhang, Keyi; Wu, Yuning; Tang, Yuxun; Yu, Yifeng; Jin, Qin; Watanabe, Shinji

In singing voice synthesis (SVS), generating singing voices from musical scores faces challenges due to limited data availability. This study proposes a unique strategy to address the data scarcity in SVS. We employ an existing singing voice synthesi

External link: http://arxiv.org/abs/2401.17619

Report

A Systematic Exploration of Joint-training for Singing Voice Synthesis

Authors: Wu, Yuning; Yu, Yifeng; Shi, Jiatong; Qian, Tao; Jin, Qin

There has been a growing interest in using end-to-end acoustic models for singing voice synthesis (SVS). Typically, these models require an additional vocoder to transform the generated acoustic features into the final waveform. However, since the ac

External link: http://arxiv.org/abs/2308.02867
