Search results

Report

Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models

Authors: Li, Xinghang; Li, Peiyan; Liu, Minghuan; Wang, Dong; Liu, Jirong; Kang, Bingyi; Ma, Xiao; Kong, Tao; Zhang, Hanbo; Liu, Huaping

Foundation Vision Language Models (VLMs) exhibit strong capabilities in multi-modal representation learning, comprehension, and reasoning. By injecting action components into the VLMs, Vision-Language-Action Models (VLAs) can be naturally formed and

External link: http://arxiv.org/abs/2412.14058

Report

SIDE: Socially Informed Drought Estimation Toward Understanding Societal Impact Dynamics of Environmental Crisis

Authors: Shang, Lanyu; Chen, Bozhang; Liu, Shiwei; Zhang, Yang; Zong, Ruohan; Vora, Anav; Cai, Ximing; Wei, Na; Wang, Dong

Drought has become a critical global threat with significant societal impact. Existing drought monitoring solutions primarily focus on assessing drought severity using quantitative measurements, overlooking the diverse societal impact of drought from

External link: http://arxiv.org/abs/2412.12575

Report

Exploring Enhanced Contextual Information for Video-Level Object Tracking

Authors: Kang, Ben; Chen, Xin; Lai, Simiao; Liu, Yang; Liu, Yi; Wang, Dong

Contextual information at the video level has become increasingly crucial for visual object tracking. However, existing methods typically use only a few tokens to convey this information, which can lead to information loss and limit their ability to

External link: http://arxiv.org/abs/2412.11023

Report

Space Charge-Induced Emittance Growth in the Downstream Section of ERL Injectors

Authors: Chen, Xiuji; Liu, Zipeng; Chen, Si; Gu, Duan; Qian, Houjun; Wang, Dong; Deng, Haixiao

The injector for ERL-FEL has been widely researched. Unlike traditional linacs, the bunch in the injector for ERLs requires additional deflection and matching section at lower energies. It makes the bunch more susceptible to the effects of the Space

External link: http://arxiv.org/abs/2412.05834

Report

The UV Sensitivity of Axion Monodromy Inflation

Authors: Pajer, Enrico; Wang, Dong-Gang; Zhang, Bowei

We revisit axion monodromy inflation in the context of UV-complete theories and uncover a novel sensitivity of cosmological observables to heavy fields with masses far above the Hubble scale, such as the moduli of flux compactifications. By studying

External link: http://arxiv.org/abs/2412.05762

Report

Controlling the Latent Diffusion Model for Generative Image Shadow Removal via Residual Generation

Authors: Li, Xinjie; Zhao, Yang; Wang, Dong; Chen, Yuan; Cao, Li; Liu, Xiaoping

Large-scale generative models have achieved remarkable advancements in various visual tasks, yet their application to shadow removal in images remains challenging. These models often generate diverse, realistic details without adequate focus on fidel

External link: http://arxiv.org/abs/2412.02322

Report

ContextGNN: Beyond Two-Tower Recommendation Systems

Authors: Yuan, Yiwen; Zhang, Zecheng; He, Xinwei; Nitta, Akihiro; Hu, Weihua; Wang, Dong; Shah, Manan; Huang, Shenyang; Stojanovič, Blaž; Krumholz, Alan; Lenssen, Jan Eric; Leskovec, Jure; Fey, Matthias

Recommendation systems predominantly utilize two-tower architectures, which evaluate user-item rankings through the inner product of their respective embeddings. However, one key limitation of two-tower models is that they learn a pair-agnostic repre

External link: http://arxiv.org/abs/2411.19513

Report

Entropic uncertainty and quantum non-classicality of Unruh-Dewitt detectors in relativity

Authors: Zhang, Yu-Kun; Li, Li-Juan; Song, Xue-Ke; Ye, Liu; Wang, Dong

Published in: Physics Letters B 858 (2024) 139063

An accelerating object changes the temperature of the environment around it because of the Unruh thermal effect. In this work, we investigate the impact of Unruh thermal noise on the quantum-memory-assisted entropic

External link: http://arxiv.org/abs/2411.16135

Report

Open-Vocabulary Octree-Graph for 3D Scene Understanding

Authors: Wang, Zhigang; Su, Yifei; Li, Chenhui; Wang, Dong; Huang, Yan; Zhao, Bin; Li, Xuelong

Open-vocabulary 3D scene understanding is indispensable for embodied agents. Recent works leverage pretrained vision-language models (VLMs) for object segmentation and project them to point clouds to build 3D maps. Despite progress, a point cloud is

External link: http://arxiv.org/abs/2411.16253

Report

Improving Transferable Targeted Attacks with Feature Tuning Mixup

Authors: Liang, Kaisheng; Dai, Xuelong; Li, Yanjie; Wang, Dong; Xiao, Bin

Deep neural networks exhibit vulnerability to adversarial examples that can transfer across different models. A particularly challenging problem is developing transferable targeted attacks that can mislead models into predicting specific target class

External link: http://arxiv.org/abs/2411.15553
