Zobrazeno 1 - 10
of 48 723
pro vyhledávání: '"Wang,Dong"'
Autor:
Li, Xinghang, Li, Peiyan, Liu, Minghuan, Wang, Dong, Liu, Jirong, Kang, Bingyi, Ma, Xiao, Kong, Tao, Zhang, Hanbo, Liu, Huaping
Foundation Vision Language Models (VLMs) exhibit strong capabilities in multi-modal representation learning, comprehension, and reasoning. By injecting action components into the VLMs, Vision-Language-Action Models (VLAs) can be naturally formed and
Externí odkaz:
http://arxiv.org/abs/2412.14058
Autor:
Shang, Lanyu, Chen, Bozhang, Liu, Shiwei, Zhang, Yang, Zong, Ruohan, Vora, Anav, Cai, Ximing, Wei, Na, Wang, Dong
Drought has become a critical global threat with significant societal impact. Existing drought monitoring solutions primarily focus on assessing drought severity using quantitative measurements, overlooking the diverse societal impact of drought from
Externí odkaz:
http://arxiv.org/abs/2412.12575
Contextual information at the video level has become increasingly crucial for visual object tracking. However, existing methods typically use only a few tokens to convey this information, which can lead to information loss and limit their ability to
Externí odkaz:
http://arxiv.org/abs/2412.11023
The injector for ERL-FEL has been widely researched. Unlike traditional linacs, the bunch in the injector for ERLs requires additional deflection and matching section at lower energies. It makes the bunch more susceptible to the effects of the Space
Externí odkaz:
http://arxiv.org/abs/2412.05834
We revisit axion monodromy inflation in the context of UV-complete theories and uncover a novel sensitivity of cosmological observables to heavy fields with masses far above the Hubble scale, such as the moduli of flux compactifications. By studying
Externí odkaz:
http://arxiv.org/abs/2412.05762
Large-scale generative models have achieved remarkable advancements in various visual tasks, yet their application to shadow removal in images remains challenging. These models often generate diverse, realistic details without adequate focus on fidel
Externí odkaz:
http://arxiv.org/abs/2412.02322
Autor:
Yuan, Yiwen, Zhang, Zecheng, He, Xinwei, Nitta, Akihiro, Hu, Weihua, Wang, Dong, Shah, Manan, Huang, Shenyang, Stojanovič, Blaž, Krumholz, Alan, Lenssen, Jan Eric, Leskovec, Jure, Fey, Matthias
Recommendation systems predominantly utilize two-tower architectures, which evaluate user-item rankings through the inner product of their respective embeddings. However, one key limitation of two-tower models is that they learn a pair-agnostic repre
Externí odkaz:
http://arxiv.org/abs/2411.19513
Publikováno v:
Physics Letters B 858 (2024) 139063
An object moving with the acceleration will change the temperature of environment around it, because of the presence of the Unruh thermal effect. In this work, we investigate the impact of Unruh thermal noise on the quantum-memory-assisted {entropic}
Externí odkaz:
http://arxiv.org/abs/2411.16135
Open-vocabulary 3D scene understanding is indispensable for embodied agents. Recent works leverage pretrained vision-language models (VLMs) for object segmentation and project them to point clouds to build 3D maps. Despite progress, a point cloud is
Externí odkaz:
http://arxiv.org/abs/2411.16253
Deep neural networks exhibit vulnerability to adversarial examples that can transfer across different models. A particularly challenging problem is developing transferable targeted attacks that can mislead models into predicting specific target class
Externí odkaz:
http://arxiv.org/abs/2411.15553