Showing 1 - 10 of 19,260 for search: '"An, Gaofeng"'
Recently, "textless" speech language models (SLMs) based on speech units have made huge progress in generating naturalistic speech, including non-verbal vocalizations. However, the generated speech samples often lack semantic coherence. In this pape
External link:
http://arxiv.org/abs/2501.00805
Effectively distinguishing the pronunciation correlations between different written texts is a significant issue in linguistic acoustics. Traditionally, such pronunciation correlations are obtained through manually designed pronunciation lexicons. In
External link:
http://arxiv.org/abs/2501.00804
In robotic bimanual teleoperation, multimodal sensory feedback plays a crucial role, providing operators with a more immersive operating experience, reducing cognitive burden, and improving operating efficiency. In this study, we develop an immersive
External link:
http://arxiv.org/abs/2501.00822
Author:
Chen, Gaofeng; Zhang, Yaoduo; Huang, Li; Wang, Pengfei; Zhang, Wenyu; Zeng, Dong; Ma, Jianhua; He, Ji
Supervised deep-learning (SDL) techniques with paired training datasets have been widely studied for X-ray computed tomography (CT) image reconstruction. However, due to the difficulties of obtaining paired training datasets in clinical routine, the
External link:
http://arxiv.org/abs/2501.01456
The performance of automatic speech recognition models often degenerates on domains not covered by the training data. Domain adaptation can address this issue, assuming the availability of the target domain data in the target language. However, such
External link:
http://arxiv.org/abs/2412.11185
Rethinking Comprehensive Benchmark for Chart Understanding: A Perspective from Scientific Literature
Scientific literature charts often contain complex visual elements, including multi-plot figures, flowcharts, and structural diagrams. Evaluating multimodal models using these authentic and intricate charts provides a more accurate assessment of
External link:
http://arxiv.org/abs/2412.12150
Sewing patterns, the essential blueprints for fabric cutting and tailoring, act as a crucial bridge between design concepts and producible garments. However, existing uni-modal sewing pattern generation models struggle to effectively encode complex d
External link:
http://arxiv.org/abs/2412.08603
Author:
Guo, Longfei; Xu, Shaowen; Cui, Qilong; Hu, Qingmin; Li, Ruixue; Xu, Gaofeng; Jia, Fanhao; Li, Yuan
Weyl semimetals, such as $\mathrm{TaIrTe}_{4}$, characterized by their unique band structures and exotic transport phenomena, have become a central focus in modern electronics. Despite extensive research, a systematic understanding of the impact of heterogene
External link:
http://arxiv.org/abs/2411.15517
Author:
Zhao, Hongbo; Fan, Lue; Chen, Yuntao; Wang, Haochen; Yang, Yuran; Jin, Xiaojuan; Zhang, Yixin; Meng, Gaofeng; Zhang, Zhaoxiang
In this paper, we propose OpenSatMap, a fine-grained, high-resolution satellite dataset for large-scale map construction. Map construction is one of the foundations of the transportation industry, such as navigation and autonomous driving. Extracting
External link:
http://arxiv.org/abs/2410.23278
Modeling and producing lifelike images of clothed humans has attracted attention from researchers in different areas for decades, a task made complex by highly articulated and structured content. Rendering algorithms decompose and simulate the imaging pr
External link:
http://arxiv.org/abs/2410.14429