Showing 1 - 10 of 4,824 for search: '"Xu, Hang"'
Authors:
Sun, Jianhan, Lv, Jianfeng, Tian, Shang, Liu, Juntao, Zhang, Zihao, Xu, Hang, Lin, Lin, Huang, Senlin
DC-SRF-II gun, a high-brightness continuous-wave photocathode gun, has greater potential in electron beam irradiation applications. This paper presents the in-vacuum and in-air irradiation dosimetry study of the high repetition rate electron beam fro
External link:
http://arxiv.org/abs/2411.16247
Authors:
Xiang, Kun, Liu, Zhili, Jiang, Zihao, Nie, Yunshuang, Huang, Runhui, Fan, Haoxiang, Li, Hanhui, Huang, Weiran, Zeng, Yihan, Han, Jianhua, Hong, Lanqing, Xu, Hang, Liang, Xiaodan
In this paper, we address the challenging task of multimodal mathematical reasoning by incorporating the ability of "slow thinking" into multimodal large language models (MLLMs). Contrary to existing methods that rely on direct or fast thinking, our
External link:
http://arxiv.org/abs/2411.11930
Recent advancements utilizing large-scale video data for learning video generation models demonstrate significant potential in understanding complex physical dynamics. It suggests the feasibility of leveraging diverse robot trajectory data to develop
External link:
http://arxiv.org/abs/2411.09153
Authors:
Zhang, Kaidong, Ren, Pengzhen, Lin, Bingqian, Lin, Junfan, Ma, Shikui, Xu, Hang, Liang, Xiaodan
Language-guided robotic manipulation is a challenging task that requires an embodied agent to follow abstract user instructions to accomplish various complex manipulation tasks. Previous work trivially fits the data without revealing the relation
External link:
http://arxiv.org/abs/2410.10394
This study leverages deep reinforcement learning (DRL) to train synthetic jet-based flow control strategies for circular and square cylinders. The central aim is to ascertain the optimal jet placements that strike an ideal balance between energy effi
External link:
http://arxiv.org/abs/2410.00424
By leveraging the high dimensional nonlinear mapping capabilities of artificial neural networks in conjunction with the powerful control mechanisms of reinforcement learning, we attain real-time, precise modulation of synthetic jet flow rates over el
External link:
http://arxiv.org/abs/2410.00421
Authors:
Chen, Kai, Gou, Yunhao, Huang, Runhui, Liu, Zhili, Tan, Daxin, Xu, Jing, Wang, Chunwei, Zhu, Yi, Zeng, Yihan, Yang, Kuo, Wang, Dingdong, Xiang, Kun, Li, Haoyuan, Bai, Haoli, Han, Jianhua, Li, Xiaohui, Jin, Weike, Xie, Nian, Zhang, Yu, Kwok, James T., Zhao, Hengshuang, Liang, Xiaodan, Yeung, Dit-Yan, Chen, Xiao, Li, Zhenguo, Zhang, Wei, Liu, Qun, Yao, Jun, Hong, Lanqing, Hou, Lu, Xu, Hang
GPT-4o, an omni-modal model that enables vocal conversations with diverse emotions and tones, marks a milestone for omni-modal foundation models. However, empowering Large Language Models to perceive and generate images, texts, and speeches end-to-en
External link:
http://arxiv.org/abs/2409.18042
Currently, vision encoder models like Vision Transformers (ViTs) typically excel at image recognition tasks but cannot simultaneously support text recognition like human visual recognition. To address this limitation, we propose UNIT, a novel trainin
External link:
http://arxiv.org/abs/2409.04095
Authors:
Wang, Cong, Gu, Jiaxi, Hu, Panwen, Zhao, Haoyu, Guo, Yuanfan, Han, Jianhua, Xu, Hang, Liang, Xiaodan
Following the advancements in text-guided image generation technology exemplified by Stable Diffusion, video generation is gaining increased attention in the academic community. However, relying solely on text guidance for video generation has seriou
External link:
http://arxiv.org/abs/2408.13005
Score Distillation Sampling (SDS) by well-trained 2D diffusion models has shown great promise in text-to-3D generation. However, this paradigm distills view-agnostic 2D image distributions into the rendering distribution of 3D representation for each
External link:
http://arxiv.org/abs/2407.12291