Showing 1 - 10 of 3,590 for search: '"Keqi, A."'
Author:
Chen, Yushen, Niu, Zhikang, Ma, Ziyang, Deng, Keqi, Wang, Chunhui, Zhao, Jian, Yu, Kai, Chen, Xie
This paper introduces F5-TTS, a fully non-autoregressive text-to-speech system based on flow matching with Diffusion Transformer (DiT). Without requiring complex designs such as duration model, text encoder, and phoneme alignment, the text input is s
External link:
http://arxiv.org/abs/2410.06885
Author:
Du, Yexing, Ma, Ziyang, Yang, Yifan, Deng, Keqi, Chen, Xie, Yang, Bo, Xiang, Yang, Liu, Ming, Qin, Bing
Speech Language Models (SLMs) have demonstrated impressive performance on speech translation tasks. However, existing research primarily focuses on direct instruction fine-tuning and often overlooks the inherent reasoning capabilities of SLMs. In thi
External link:
http://arxiv.org/abs/2409.19510
Author:
Shang, Tian, Wang, Yuting, Yu, Bochen, Xia, Keqi, Gawryluk, Darek J., Xu, Yang, Zhan, Qingfeng, Zhao, Jianzhou, Shiroka, Toni
Published in:
Phys. Rev. B. 110, 064510 (2024)
The orthorhombic molybdenum carbide superconductor with $T_c$ = 3.2 K was investigated by muon-spin rotation and relaxation ($\mu$SR) measurements and by first-principle calculations. The low-temperature superfluid density, determined by transverse-f
External link:
http://arxiv.org/abs/2409.02380
Author:
Deng, Keqi, Woodland, Philip C.
While the neural transducer is popular for online speech recognition, simultaneous speech translation (SST) requires both streaming and re-ordering capabilities. This paper presents the LS-Transducer-SST, a label-synchronous neural transducer for SST
External link:
http://arxiv.org/abs/2406.04541
Wav2Prompt is proposed which allows straightforward integration between spoken input and a text-based large language model (LLM). Wav2Prompt uses a simple training process with only the same data used to train an automatic speech recognition (ASR) mo
External link:
http://arxiv.org/abs/2406.00522
The 4H-SiC material exhibits good detection performance, but there are still many problems like signal distortion and poor signal quality. The 4H-SiC low gain avalanche detector (LGAD) has been fabricated for the first time to solve these problems, w
External link:
http://arxiv.org/abs/2405.18112
This study investigates the stiffness characteristics of the Sprint Z3 head, also known as 3-PRS Parallel Kinematics Machines, which are among the most extensively researched and viably successful manipulators for precision machining applications. De
External link:
http://arxiv.org/abs/2405.08418
We present a new self-supervised approach, SelfPose3d, for estimating 3d poses of multiple persons from multiple camera views. Unlike current state-of-the-art fully-supervised methods, our approach does not require any 2d or 3d ground-truth poses and
External link:
http://arxiv.org/abs/2404.02041
Theoretical Modeling and Bio-inspired Trajectory Optimization of A Multiple-locomotion Origami Robot
Recent research on mobile robots has focused on increasing their adaptability to unpredictable and unstructured environments using soft materials and structures. However, the determination of key design parameters and control over these compliant rob
External link:
http://arxiv.org/abs/2403.12471
Author:
He, Ye, Li, Xingchen, Xu, Zijun, Qi, Ming, Wang, Congcong, Wang, Chenwei, Lu, Hai, Nie, Xiaojun, Fan, Ruirui, Jing, Hantao, Song, Weiming, Wang, Keqi, Liu, Kai, Liu, Peilian, Li, Hui, Li, Zaiyi, Fu, Chenxi, Zhang, Xiyuan, Kang, Xiaoshen, Li, Zhan, Lu, Weiguo, Xiao, Suyu, Shi, Xin
A high precision beam monitor system based on silicon carbide PIN sensor is designed for China Spallation Neutron Source 1.6 GeV proton beam to monitor the proton beam fluence. The concept design of the beam monitor system is finished together with fr
External link:
http://arxiv.org/abs/2403.09244