Showing 1 - 10 of 181 for search: '"Wang, Xiaolong"'
Authors:
Ding, Runyu, Qin, Yuzhe, Zhu, Jiyue, Jia, Chengzhe, Yang, Shiqi, Yang, Ruihan, Qi, Xiaojuan, Wang, Xiaolong
Teleoperation is a crucial tool for collecting human demonstrations, but controlling robots with bimanual dexterous hands remains a challenge. Existing teleoperation systems struggle to handle the complexity of coordinating two hands for intricate ma
External link:
http://arxiv.org/abs/2407.03162
Diffusion models have shown an impressive ability to model complex data distributions, with several key advantages over GANs, such as stable training, better coverage of the training distribution's modes, and the ability to solve inverse problems wit
External link:
http://arxiv.org/abs/2406.07480
Authors:
Cheng, An-Chieh, Yin, Hongxu, Fu, Yang, Guo, Qiushan, Yang, Ruihan, Kautz, Jan, Wang, Xiaolong, Liu, Sifei
Vision Language Models (VLMs) have demonstrated remarkable performance in 2D vision and language tasks. However, their ability to reason about spatial arrangements remains limited. In this work, we introduce Spatial Region GPT (SpatialRGPT) to enhanc
External link:
http://arxiv.org/abs/2406.01584
Whole-body control for humanoids is challenging due to the high-dimensional nature of the problem, coupled with the inherent instability of a bipedal morphology. Learning from visual observations further exacerbates this difficulty. In this work, we
External link:
http://arxiv.org/abs/2405.18418
Authors:
Jiang, Kaiwen, Fu, Yang, T, Mukund Varma, Belhe, Yash, Wang, Xiaolong, Su, Hao, Ramamoorthi, Ravi
Novel view synthesis from a sparse set of input images is a challenging problem of great practical interest, especially when camera poses are absent or inaccurate. Direct optimization of camera poses and usage of estimated depths in neural radiance f
External link:
http://arxiv.org/abs/2405.03659
Authors:
Mu, Jiteng, Gharbi, Michaël, Zhang, Richard, Shechtman, Eli, Vasconcelos, Nuno, Wang, Xiaolong, Park, Taesung
Diffusion models have made significant advances in text-guided synthesis tasks. However, editing user-provided images remains challenging, as the high dimensional noise input space of diffusion models is not naturally suited for image inversion or sp
External link:
http://arxiv.org/abs/2404.16029
Modern 3D engines and graphics pipelines require mesh as a memory-efficient representation, which allows efficient rendering, geometry processing, texture editing, and many other downstream operations. However, it is still highly difficult to obtain
External link:
http://arxiv.org/abs/2404.12379
Scene representations using 3D Gaussian primitives have produced excellent results in modeling the appearance of static and dynamic 3D scenes. Many graphics applications, however, demand the ability to manipulate both the appearance and the physical
External link:
http://arxiv.org/abs/2404.01223
Authors:
Liu, Minghuan, Chen, Zixuan, Cheng, Xuxin, Ji, Yandong, Qiu, Ri-Zhao, Yang, Ruihan, Wang, Xiaolong
We study the problem of mobile manipulation using legged robots equipped with an arm, namely legged loco-manipulation. The robot legs, while usually utilized for mobility, offer an opportunity to amplify the manipulation capabilities by conducting wh
External link:
http://arxiv.org/abs/2403.16967
3D hand-object interaction data is scarce due to the hardware constraints in scaling up the data collection process. In this paper, we propose HOIDiffusion for generating realistic and diverse 3D hand-object interaction data. Our model is a condition
External link:
http://arxiv.org/abs/2403.12011