Showing 1 - 10 of 9,728 results for search: '"Yu, Meng"'
Author:
Wang, Helin, Yu, Meng, Hai, Jiarui, Chen, Chen, Hu, Yuchen, Chen, Rilin, Dehak, Najim, Yu, Dong
In this paper, we introduce SSR-Speech, a neural codec autoregressive model designed for stable, safe, and robust zero-shot text-based speech editing and text-to-speech synthesis. SSR-Speech is built on a Transformer decoder and incorporates classifi
External link:
http://arxiv.org/abs/2409.07556
Spatial audio formats like Ambisonics are playback device layout-agnostic and well-suited for applications such as teleconferencing and virtual reality. Conventional Ambisonic encoding methods often rely on spherical microphone arrays for efficient s
External link:
http://arxiv.org/abs/2409.06954
This paper compared physics-informed neural network (PINN), conventional neural network (NN) and traditional numerical discretization methods on solving differential equations (DEs) through literature investigation and experimental validation. We foc
External link:
http://arxiv.org/abs/2408.11077
The proliferation of deep neural networks has spurred the rapid development of acoustic echo cancellation and noise suppression, and many prior approaches have been proposed that yield promising performance. Nevertheless, they rarely consider the de
External link:
http://arxiv.org/abs/2406.11175
Author:
Shao, Yiwen, Zhang, Shi-Xiong, Xu, Yong, Yu, Meng, Yu, Dong, Povey, Daniel, Khudanpur, Sanjeev
In the field of multi-channel, multi-speaker Automatic Speech Recognition (ASR), the task of discerning and accurately transcribing a target speaker's speech within background noise remains a formidable challenge. Traditional approaches often rely on
External link:
http://arxiv.org/abs/2406.09589
Author:
Pan, Xue-Feng, Hei, Xin-Lei, Yao, Xiao-Yu, Chen, Jia-Qiang, Ren, Yu-Meng, Dong, Xing-Liang, Qiao, Yi-Fan, Li, Peng-Bo
Skyrmion qubits are a new highly promising logic element for quantum information processing. However, their scalability to multiple interacting qubits remains challenging. We propose a hybrid quantum setup with skyrmion qubits strongly coupled to nan
External link:
http://arxiv.org/abs/2404.09390
Image dehazing poses significant challenges in environmental perception. Recent research mainly focuses on deep learning-based methods with a single modality, which may result in severe information loss, especially in dense-haze scenarios. The infrar
External link:
http://arxiv.org/abs/2404.07790
Image restoration is rather challenging in adverse weather conditions, especially when multiple degradations occur simultaneously. Blind image decomposition was proposed to tackle this issue; however, its effectiveness heavily relies on the accurate
External link:
http://arxiv.org/abs/2404.07770
Author:
Gao, Yu-Meng, Zhang, Yue-Jiao, Zhao, Xiao-Lin, Li, Xin-Yu, Wang, Shu-Hui, Jin, Chen-Dong, Zhang, Hu, Lian, Ru-Qian, Wang, Rui-Ning, Gong, Peng-Lai, Wang, Jiang-Long, Shi, Xing-Qiang
The electronic structure evolution of few-layer black phosphorus (BP) under pressure shows a wealth of phenomena, such as the nonmonotonic change of the direct gap at the {\Gamma} point, the layer-number dependence, and the distinct responses to normal
External link:
http://arxiv.org/abs/2403.01149
Audio zooming, a signal processing technique, enables selective focusing and enhancement of sound signals from a specified region while attenuating others. While traditional beamforming and neural beamforming techniques, centered on creating a directional
External link:
http://arxiv.org/abs/2311.13075