Zobrazeno 1 - 10
of 1 219
pro vyhledávání: '"Zhu, PengCheng"'
Cell-free massive multiple-input multiple-output (MIMO) systems, leveraging tight cooperation among wireless access points, exhibit remarkable signal enhancement and interference suppression capabilities, demonstrating significant performance advanta
Externí odkaz:
http://arxiv.org/abs/2410.05652
Fixed-dimensional speaker embeddings have become the dominant approach in speaker modeling, typically spanning hundreds to thousands of dimensions. These dimensions are hyperparameters that are not specifically picked, nor are they hierarchically ord
Externí odkaz:
http://arxiv.org/abs/2409.15782
In accented voice conversion or accent conversion, we seek to convert the accent in speech from one another while preserving speaker identity and semantic content. In this study, we formulate a novel method for creating multi-accented speech samples,
Externí odkaz:
http://arxiv.org/abs/2409.09352
This paper introduces Easy One-Step Text-to-Speech (E1 TTS), an efficient non-autoregressive zero-shot text-to-speech system based on denoising diffusion pretraining and distribution matching distillation. The training of E1 TTS is straightforward; i
Externí odkaz:
http://arxiv.org/abs/2409.09351
One of the primary challenges in short packet ultra-reliable and low-latency communications (URLLC) is to achieve reliable channel estimation and data detection while minimizing the impact on latency performance. Given the small packet size in mini-s
Externí odkaz:
http://arxiv.org/abs/2408.14089
Streaming voice conversion has become increasingly popular for its potential in real-time applications. The recently proposed DualVC 2 has achieved robust and high-quality streaming voice conversion with a latency of about 180ms. Nonetheless, the rec
Externí odkaz:
http://arxiv.org/abs/2406.07846
This letter introduces a novel framework for dense Visual Simultaneous Localization and Mapping (VSLAM) based on Gaussian Splatting. Recently, SLAM based on Gaussian Splatting has shown promising results. However, in monocular scenarios, the Gaussian
Externí odkaz:
http://arxiv.org/abs/2405.06241
The receiver design for multi-input multi-output (MIMO) ultra-reliable and low-latency communication (URLLC) systems can be a tough task due to the use of short channel codes and few pilot symbols. Consequently, error propagation can occur in traditi
Externí odkaz:
http://arxiv.org/abs/2404.07721
Accent transfer aims to transfer an accent from a source speaker to synthetic speech in the target speaker's voice. The main challenge is how to effectively disentangle speaker timbre and accent which are entangled in speech. This paper presents a VI
Externí odkaz:
http://arxiv.org/abs/2312.16850
Autor:
Yu, Jingxuan, Zeng, Fan, Li, Jiamin, Liu, Feiyang, Zhu, Pengcheng, Wang, Dongming, You, Xiaohu
This paper investigates how to achieve integrated sensing and communication (ISAC) based on a cell-free radio access network (CF-RAN) architecture with a minimum footprint of communication resources. We propose a new passive sensing scheme. The schem
Externí odkaz:
http://arxiv.org/abs/2311.06003