Zobrazeno 1 - 10
of 27 665
pro vyhledávání: '"LIU, Rui"'
Medical report generation is a critical task in healthcare that involves the automatic creation of detailed and accurate descriptions from medical images. Traditionally, this task has been approached as a sequence generation problem, relying on visio
Externí odkaz:
http://arxiv.org/abs/2409.00250
Solar filaments can undergo eruptions and result in the formation of coronal mass ejections (CMEs), which could significantly impact planetary space environments. Observations of eruptions involving polar crown filaments, situated in the polar region
Externí odkaz:
http://arxiv.org/abs/2408.15892
Autor:
Liu, Rui-Chen, Sun, C. P.
The frame-dragging phenomenon in gravitational fields is revisited to explore the geometric effects induced by spacetime curvature. We quantize a massless scalar field in the spacetime of a rotating sphere, incorporating the frame-dragging frequency
Externí odkaz:
http://arxiv.org/abs/2408.13016
Automatic Video Dubbing (AVD) aims to take the given script and generate speech that aligns with lip motion and prosody expressiveness. Current AVD models mainly utilize visual information of the current sentence to enhance the prosody of synthesized
Externí odkaz:
http://arxiv.org/abs/2408.11593
Conversational Speech Synthesis (CSS) aims to express a target utterance with the proper speaking style in a user-agent conversation setting. Existing CSS methods employ effective multi-modal context modeling techniques to achieve empathy understandi
Externí odkaz:
http://arxiv.org/abs/2407.21491
Autor:
Liu, Rui, Wang, Wensi
It is under debate whether the magnetic field in the solar atmosphere carries neutralized electric currents; particularly, whether a magnetic flux rope (MFR), which is considered the core structure of coronal mass ejections, carries neutralized elect
Externí odkaz:
http://arxiv.org/abs/2407.15148
Navigation instruction generation, which requires embodied agents to describe the navigation routes, has been of great interest in robotics and human-computer interaction. Existing studies directly map the sequence of 2D perspective observations to r
Externí odkaz:
http://arxiv.org/abs/2407.15087
Early detection and accurate diagnosis can predict the risk of malignant disease transformation, thereby increasing the probability of effective treatment. Identifying mild syndrome with small pathological regions serves as an ominous warning and is
Externí odkaz:
http://arxiv.org/abs/2407.07720
The training of large models, involving fine-tuning, faces the scarcity of high-quality data. Compared to the solutions based on centralized data centers, updating large models in the Internet of Things (IoT) faces challenges in coordinating knowledg
Externí odkaz:
http://arxiv.org/abs/2407.05268
Autor:
Bai, Ye, Chen, Jingping, Chen, Jitong, Chen, Wei, Chen, Zhuo, Ding, Chuang, Dong, Linhao, Dong, Qianqian, Du, Yujiao, Gao, Kepan, Gao, Lu, Guo, Yi, Han, Minglun, Han, Ting, Hu, Wenchao, Hu, Xinying, Hu, Yuxiang, Hua, Deyu, Huang, Lu, Huang, Mingkun, Huang, Youjia, Jin, Jishuo, Kong, Fanliu, Lan, Zongwei, Li, Tianyu, Li, Xiaoyang, Li, Zeyang, Lin, Zehua, Liu, Rui, Liu, Shouda, Lu, Lu, Lu, Yizhou, Ma, Jingting, Ma, Shengtao, Pei, Yulin, Shen, Chen, Tan, Tian, Tian, Xiaogang, Tu, Ming, Wang, Bo, Wang, Hao, Wang, Yuping, Wang, Yuxuan, Xia, Hanzhang, Xia, Rui, Xie, Shuangyi, Xu, Hongmin, Yang, Meng, Zhang, Bihong, Zhang, Jun, Zhang, Wanyi, Zhang, Yang, Zhang, Yawei, Zheng, Yijie, Zou, Ming
Modern automatic speech recognition (ASR) model is required to accurately transcribe diverse speech signals (from different domains, languages, accents, etc) given the specific contextual information in various application scenarios. Classic end-to-e
Externí odkaz:
http://arxiv.org/abs/2407.04675