Výsledky vyhledávání - "Wang, Xiaolong"

Report

Bunny-VisionPro: Real-Time Bimanual Dexterous Teleoperation for Imitation Learning

Autor: Ding, Runyu, Qin, Yuzhe, Zhu, Jiyue, Jia, Chengzhe, Yang, Shiqi, Yang, Ruihan, Qi, Xiaojuan, Wang, Xiaolong

Teleoperation is a crucial tool for collecting human demonstrations, but controlling robots with bimanual dexterous hands remains a challenge. Existing teleoperation systems struggle to handle the complexity of coordinating two hands for intricate ma

Externí odkaz: http://arxiv.org/abs/2407.03162

Zobrazit plný text záznamu

Report

Image Neural Field Diffusion Models

Autor: Chen, Yinbo, Wang, Oliver, Zhang, Richard, Shechtman, Eli, Wang, Xiaolong, Gharbi, Michael

Diffusion models have shown an impressive ability to model complex data distributions, with several key advantages over GANs, such as stable training, better coverage of the training distribution's modes, and the ability to solve inverse problems wit

Externí odkaz: http://arxiv.org/abs/2406.07480

Zobrazit plný text záznamu

Report

SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model

Autor: Cheng, An-Chieh, Yin, Hongxu, Fu, Yang, Guo, Qiushan, Yang, Ruihan, Kautz, Jan, Wang, Xiaolong, Liu, Sifei

Vision Language Models (VLMs) have demonstrated remarkable performance in 2D vision and language tasks. However, their ability to reason about spatial arrangements remains limited. In this work, we introduce Spatial Region GPT (SpatialRGPT) to enhanc

Externí odkaz: http://arxiv.org/abs/2406.01584

Zobrazit plný text záznamu

Report

Hierarchical World Models as Visual Whole-Body Humanoid Controllers

Autor: Hansen, Nicklas, S V, Jyothir, Sobal, Vlad, LeCun, Yann, Wang, Xiaolong, Su, Hao

Whole-body control for humanoids is challenging due to the high-dimensional nature of the problem, coupled with the inherent instability of a bipedal morphology. Learning from visual observations further exacerbates this difficulty. In this work, we

Externí odkaz: http://arxiv.org/abs/2405.18418

Zobrazit plný text záznamu

Report

A Construct-Optimize Approach to Sparse View Synthesis without Camera Pose

Autor: Jiang, Kaiwen, Fu, Yang, T, Mukund Varma, Belhe, Yash, Wang, Xiaolong, Su, Hao, Ramamoorthi, Ravi

Novel view synthesis from a sparse set of input images is a challenging problem of great practical interest, especially when camera poses are absent or inaccurate. Direct optimization of camera poses and usage of estimated depths in neural radiance f

Externí odkaz: http://arxiv.org/abs/2405.03659

Zobrazit plný text záznamu

Report

Editable Image Elements for Controllable Synthesis

Autor: Mu, Jiteng, Gharbi, Michaël, Zhang, Richard, Shechtman, Eli, Vasconcelos, Nuno, Wang, Xiaolong, Park, Taesung

Diffusion models have made significant advances in text-guided synthesis tasks. However, editing user-provided images remains challenging, as the high dimensional noise input space of diffusion models is not naturally suited for image inversion or sp

Externí odkaz: http://arxiv.org/abs/2404.16029

Zobrazit plný text záznamu

Report

Dynamic Gaussians Mesh: Consistent Mesh Reconstruction from Monocular Videos

Autor: Liu, Isabella, Su, Hao, Wang, Xiaolong

Modern 3D engines and graphics pipelines require mesh as a memory-efficient representation, which allows efficient rendering, geometry processing, texture editing, and many other downstream operations. However, it is still highly difficult to obtain

Externí odkaz: http://arxiv.org/abs/2404.12379

Zobrazit plný text záznamu

Report

Feature Splatting: Language-Driven Physics-Based Scene Synthesis and Editing

Autor: Qiu, Ri-Zhao, Yang, Ge, Zeng, Weijia, Wang, Xiaolong

Scene representations using 3D Gaussian primitives have produced excellent results in modeling the appearance of static and dynamic 3D scenes. Many graphics applications, however, demand the ability to manipulate both the appearance and the physical

Externí odkaz: http://arxiv.org/abs/2404.01223

Zobrazit plný text záznamu

Report

Visual Whole-Body Control for Legged Loco-Manipulation

Autor: Liu, Minghuan, Chen, Zixuan, Cheng, Xuxin, Ji, Yandong, Qiu, Ri-Zhao, Yang, Ruihan, Wang, Xiaolong

We study the problem of mobile manipulation using legged robots equipped with an arm, namely legged loco-manipulation. The robot legs, while usually utilized for mobility, offer an opportunity to amplify the manipulation capabilities by conducting wh

Externí odkaz: http://arxiv.org/abs/2403.16967

Zobrazit plný text záznamu

Report

HOIDiffusion: Generating Realistic 3D Hand-Object Interaction Data

Autor: Zhang, Mengqi, Fu, Yang, Ding, Zheng, Liu, Sifei, Tu, Zhuowen, Wang, Xiaolong

3D hand-object interaction data is scarce due to the hardware constraints in scaling up the data collection process. In this paper, we propose HOIDiffusion for generating realistic and diverse 3D hand-object interaction data. Our model is a condition

Externí odkaz: http://arxiv.org/abs/2403.12011

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání