Search results

Report

Dosimetry study of high repetition rate MeV electron beam from a continuous-wave photocathode gun

Authors: Sun, Jianhan, Lv, Jianfeng, Tian, Shang, Liu, Juntao, Zhang, Zihao, Xu, Hang, Lin, Lin, Huang, Senlin

DC-SRF-II gun, a high-brightness continuous-wave photocathode gun, has greater potential in electron beam irradiation applications. This paper presents the in-vacuum and in-air irradiation dosimetry study of the high repetition rate electron beam fro

External link: http://arxiv.org/abs/2411.16247

Report

AtomThink: A Slow Thinking Framework for Multimodal Mathematical Reasoning

Authors: Xiang, Kun, Liu, Zhili, Jiang, Zihao, Nie, Yunshuang, Huang, Runhui, Fan, Haoxiang, Li, Hanhui, Huang, Weiran, Zeng, Yihan, Han, Jianhua, Hong, Lanqing, Xu, Hang, Liang, Xiaodan

In this paper, we address the challenging task of multimodal mathematical reasoning by incorporating the ability of "slow thinking" into multimodal large language models (MLLMs). Contrary to existing methods that rely on direct or fast thinking, our

External link: http://arxiv.org/abs/2411.11930

Report

VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation

Authors: Wen, Youpeng, Lin, Junfan, Zhu, Yi, Han, Jianhua, Xu, Hang, Zhao, Shen, Liang, Xiaodan

Recent advancements utilizing large-scale video data for learning video generation models demonstrate significant potential in understanding complex physical dynamics. It suggests the feasibility of leveraging diverse robot trajectory data to develop

External link: http://arxiv.org/abs/2411.09153

Report

PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation

Authors: Zhang, Kaidong, Ren, Pengzhen, Lin, Bingqian, Lin, Junfan, Ma, Shikui, Xu, Hang, Liang, Xiaodan

Language-guided robotic manipulation is a challenging task that requires an embodied agent to follow abstract user instructions to accomplish various complex manipulation tasks. Previous work trivially fitting the data without revealing the relation

External link: http://arxiv.org/abs/2410.10394

Report

Energy-Efficient Balanced Flow Control Achieved through Optimization of Synthetic Jet Placement Based on Deep Reinforcement Learning

Authors: Jia, Wang, Xu, Hang

This study leverages deep reinforcement learning (DRL) to train synthetic jet-based flow control strategies for circular and square cylinders. The central aim is to ascertain the optimal jet placements that strike an ideal balance between energy effi

External link: http://arxiv.org/abs/2410.00424

Report

Complete vortex shedding suppression in highly slender elliptical cylinders through deep reinforcement learning-driven flow control

Authors: Jia, Wang, Xu, Hang

By leveraging the high dimensional nonlinear mapping capabilities of artificial neural networks in conjunction with the powerful control mechanisms of reinforcement learning, we attain real-time, precise modulation of synthetic jet flow rates over el

External link: http://arxiv.org/abs/2410.00421

Report

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

GPT-4o, an omni-modal model that enables vocal conversations with diverse emotions and tones, marks a milestone for omni-modal foundation models. However, empowering Large Language Models to perceive and generate images, texts, and speeches end-to-en

External link: http://arxiv.org/abs/2409.18042

Report

UNIT: Unifying Image and Text Recognition in One Vision Encoder

Authors: Zhu, Yi, Zhou, Yanpeng, Wang, Chunwei, Cao, Yang, Han, Jianhua, Hou, Lu, Xu, Hang

Currently, vision encoder models like Vision Transformers (ViTs) typically excel at image recognition tasks but cannot simultaneously support text recognition like human visual recognition. To address this limitation, we propose UNIT, a novel trainin

External link: http://arxiv.org/abs/2409.04095

Report

EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation

Authors: Wang, Cong, Gu, Jiaxi, Hu, Panwen, Zhao, Haoyu, Guo, Yuanfan, Han, Jianhua, Xu, Hang, Liang, Xiaodan

Following the advancements in text-guided image generation technology exemplified by Stable Diffusion, video generation is gaining increased attention in the academic community. However, relying solely on text guidance for video generation has seriou

External link: http://arxiv.org/abs/2408.13005

Report

JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation

Authors: Jiang, Chenhan, Zeng, Yihan, Hu, Tianyang, Xu, Songcun, Zhang, Wei, Xu, Hang, Yeung, Dit-Yan

Score Distillation Sampling (SDS) by well-trained 2D diffusion models has shown great promise in text-to-3D generation. However, this paradigm distills view-agnostic 2D image distributions into the rendering distribution of 3D representation for each

External link: http://arxiv.org/abs/2407.12291
