Zobrazeno 1 - 10
of 35 248
pro vyhledávání: '"YI, LI"'
Egocentric Hand Object Interaction (HOI) videos provide valuable insights into human interactions with the physical world, attracting growing interest from the computer vision and robotics communities. A key task in fully understanding the geometry a
Externí odkaz:
http://arxiv.org/abs/2411.09145
Publikováno v:
NeurIPS 2024
Open-vocabulary 3D object detection (OV-3Det) aims to generalize beyond the limited number of base categories labeled during the training phase. The biggest bottleneck is the scarcity of annotated 3D data, whereas 2D image datasets are abundant and r
Externí odkaz:
http://arxiv.org/abs/2410.24001
Autor:
Liu, Yunze, Yi, Li
Mamba has achieved significant advantages in long-context modeling and autoregressive tasks, but its scalability with large parameters remains a major limitation in vision applications. pretraining is a widely used strategy to enhance backbone model
Externí odkaz:
http://arxiv.org/abs/2410.00871
Autor:
Wu, Yecheng, Zhang, Zhuoyang, Chen, Junyu, Tang, Haotian, Li, Dacheng, Fang, Yunhao, Zhu, Ligeng, Xie, Enze, Yin, Hongxu, Yi, Li, Han, Song, Lu, Yao
VILA-U is a Unified foundation model that integrates Video, Image, Language understanding and generation. Traditional visual language models (VLMs) use separate modules for understanding and generating visual content, which can lead to misalignment a
Externí odkaz:
http://arxiv.org/abs/2409.04429
An important parameter characterising the synchronisation of turbulent flows is the threshold coupling wavenumber. This study investigates the relationship between the threshold coupling wavenumber and the leading Lyapunov vector using large eddy sim
Externí odkaz:
http://arxiv.org/abs/2407.13081
Understanding how humans cooperatively rearrange household objects is critical for VR/AR and human-robot interaction. However, in-depth studies on modeling these behaviors are under-researched due to the lack of relevant datasets. We fill this gap by
Externí odkaz:
http://arxiv.org/abs/2406.19353
Recently, Patel et.al introduced a higher dimensional version of the SYK model with random coupling in Yukawa interaction to find the linear-$T$ resistivity. We test the universality of the mechanism by replacing the scalar with vector field in vario
Externí odkaz:
http://arxiv.org/abs/2406.11170
Human motion synthesis is a fundamental task in computer animation. Despite recent progress in this field utilizing deep learning and motion capture data, existing methods are always limited to specific motion categories, environments, and styles. Th
Externí odkaz:
http://arxiv.org/abs/2406.10740
Autor:
Cong, Xiaoyan, Yang, Haitao, Chen, Liyan, Zhang, Kaifeng, Yi, Li, Bajaj, Chandrajit, Huang, Qixing
This paper presents a novel approach 4DRecons that takes a single camera RGB-D sequence of a dynamic subject as input and outputs a complete textured deforming 3D model over time. 4DRecons encodes the output as a 4D neural implicit surface and presen
Externí odkaz:
http://arxiv.org/abs/2406.10167
The credibility and practicality of a reconstructed hand-object interaction sequence depend largely on its physical plausibility. However, due to high occlusions during hand-object interaction, physical plausibility remains a challenging criterion fo
Externí odkaz:
http://arxiv.org/abs/2405.11481