Výsledky vyhledávání

Report

UniHOI: Learning Fast, Dense and Generalizable 4D Reconstruction for Egocentric Hand Object Interaction Videos

Autor: Yuan, Chengbo, Chen, Geng, Yi, Li, Gao, Yang

Egocentric Hand Object Interaction (HOI) videos provide valuable insights into human interactions with the physical world, attracting growing interest from the computer vision and robotics communities. A key task in fully understanding the geometry a

Externí odkaz: http://arxiv.org/abs/2411.09145

Zobrazit plný text záznamu

Report

ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images

Autor: Yang, Timing, Ju, Yuanliang, Yi, Li

Publikováno v: NeurIPS 2024

Open-vocabulary 3D object detection (OV-3Det) aims to generalize beyond the limited number of base categories labeled during the training phase. The biggest bottleneck is the scarcity of annotated 3D data, whereas 2D image datasets are abundant and r

Externí odkaz: http://arxiv.org/abs/2410.24001

Zobrazit plný text záznamu

Report

MAP: Unleashing Hybrid Mamba-Transformer Vision Backbone's Potential with Masked Autoregressive Pretraining

Autor: Liu, Yunze, Yi, Li

Mamba has achieved significant advantages in long-context modeling and autoregressive tasks, but its scalability with large parameters remains a major limitation in vision applications. pretraining is a widely used strategy to enhance backbone model

Externí odkaz: http://arxiv.org/abs/2410.00871

Zobrazit plný text záznamu

Report

VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Autor: Wu, Yecheng, Zhang, Zhuoyang, Chen, Junyu, Tang, Haotian, Li, Dacheng, Fang, Yunhao, Zhu, Ligeng, Xie, Enze, Yin, Hongxu, Yi, Li, Han, Song, Lu, Yao

VILA-U is a Unified foundation model that integrates Video, Image, Language understanding and generation. Traditional visual language models (VLMs) use separate modules for understanding and generating visual content, which can lead to misalignment a

Externí odkaz: http://arxiv.org/abs/2409.04429

Zobrazit plný text záznamu

Report

Intrinsic relationship between synchronisation thresholds and Lyapunov vectors: evidence from large eddy simulations and shell models

Autor: Jian, Li, Wenwen, Si, Yi, Li, Peng, Xu

An important parameter characterising the synchronisation of turbulent flows is the threshold coupling wavenumber. This study investigates the relationship between the threshold coupling wavenumber and the leading Lyapunov vector using large eddy sim

Externí odkaz: http://arxiv.org/abs/2407.13081

Zobrazit plný text záznamu

Report

CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement

Autor: Zhang, Chengwen, Liu, Yun, Xing, Ruofan, Tang, Bingda, Yi, Li

Understanding how humans cooperatively rearrange household objects is critical for VR/AR and human-robot interaction. However, in-depth studies on modeling these behaviors are under-researched due to the lack of relevant datasets. We fill this gap by

Externí odkaz: http://arxiv.org/abs/2406.19353

Zobrazit plný text záznamu

Report

Linear-T Resistivity from Spatially Random Vector Coupling

Autor: Wang, Yi-Li, Ge, Xian-Hui, Sin, Sang-Jin

Recently, Patel et.al introduced a higher dimensional version of the SYK model with random coupling in Yukawa interaction to find the linear-$T$ resistivity. We test the universality of the mechanism by replacing the scalar with vector field in vario

Externí odkaz: http://arxiv.org/abs/2406.11170

Zobrazit plný text záznamu

Report

FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models

Autor: Zhang, Zhikai, Li, Yitang, Huang, Haofeng, Lin, Mingxian, Yi, Li

Human motion synthesis is a fundamental task in computer animation. Despite recent progress in this field utilizing deep learning and motion capture data, existing methods are always limited to specific motion categories, environments, and styles. Th

Externí odkaz: http://arxiv.org/abs/2406.10740

Zobrazit plný text záznamu

Report

4DRecons: 4D Neural Implicit Deformable Objects Reconstruction from a single RGB-D Camera with Geometrical and Topological Regularizations

Autor: Cong, Xiaoyan, Yang, Haitao, Chen, Liyan, Zhang, Kaifeng, Yi, Li, Bajaj, Chandrajit, Huang, Qixing

This paper presents a novel approach 4DRecons that takes a single camera RGB-D sequence of a dynamic subject as input and outputs a complete textured deforming 3D model over time. 4DRecons encodes the output as a 4D neural implicit surface and presen

Externí odkaz: http://arxiv.org/abs/2406.10167

Zobrazit plný text záznamu

Report

Physics-aware Hand-object Interaction Denoising

Autor: Luo, Haowen, Liu, Yunze, Yi, Li

The credibility and practicality of a reconstructed hand-object interaction sequence depend largely on its physical plausibility. However, due to high occlusions during hand-object interaction, physical plausibility remains a challenging criterion fo

Externí odkaz: http://arxiv.org/abs/2405.11481

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání