Výsledky vyhledávání - "Liu Bingbing"

Report

An Efficient Occupancy World Model via Decoupled Dynamic Flow and Image-assisted Training

Autor: Zhang, Haiming, Xue, Ying, Yan, Xu, Zhang, Jiacheng, Qiu, Weichao, Bai, Dongfeng, Liu, Bingbing, Cui, Shuguang, Li, Zhen

The field of autonomous driving is experiencing a surge of interest in world models, which aim to predict potential future scenarios based on historical observations. In this paper, we introduce DFIT-OccWorld, an efficient 3D occupancy world model th

Externí odkaz: http://arxiv.org/abs/2412.13772

Zobrazit plný text záznamu

Report

HUGSIM: A Real-Time, Photo-Realistic and Closed-Loop Simulator for Autonomous Driving

Autor: Zhou, Hongyu, Lin, Longzhong, Wang, Jiabao, Lu, Yichong, Bai, Dongfeng, Liu, Bingbing, Wang, Yue, Geiger, Andreas, Liao, Yiyi

In the past few decades, autonomous driving algorithms have made significant progress in perception, planning, and control. However, evaluating individual components does not fully reflect the performance of entire systems, highlighting the need for

Externí odkaz: http://arxiv.org/abs/2412.01718

Zobrazit plný text záznamu

Report

Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning

Autor: Xu, Tianshuo, Chen, Zhifei, Wu, Leyi, Lu, Hao, Chen, Yuying, Jiang, Lihui, Liu, Bingbing, Chen, Yingcong

Recent numerous video generation models, also known as world models, have demonstrated the ability to generate plausible real-world videos. However, many studies have shown that these models often produce motion results lacking logical or physical co

Externí odkaz: http://arxiv.org/abs/2412.00547

Zobrazit plný text záznamu

Report

D$^2$-World: An Efficient World Model through Decoupled Dynamic Flow

Autor: Zhang, Haiming, Yan, Xu, Xue, Ying, Guo, Zixuan, Cui, Shuguang, Li, Zhen, Liu, Bingbing

This technical report summarizes the second-place solution for the Predictive World Model Challenge held at the CVPR-2024 Workshop on Foundation Models for Autonomous Systems. We introduce D$^2$-World, a novel World model that effectively forecasts f

Externí odkaz: http://arxiv.org/abs/2411.17027

Zobrazit plný text záznamu

Report

UniGaussian: Driving Scene Reconstruction from Multiple Camera Models via Unified Gaussian Representations

Autor: Ren, Yuan, Wu, Guile, Li, Runhao, Yang, Zheyuan, Liu, Yibo, Chen, Xingxin, Cao, Tongtong, Liu, Bingbing

Urban scene reconstruction is crucial for real-world autonomous driving simulators. Although existing methods have achieved photorealistic reconstruction, they mostly focus on pinhole cameras and neglect fisheye cameras. In fact, how to effectively s

Externí odkaz: http://arxiv.org/abs/2411.15355

Zobrazit plný text záznamu

Report

VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving

Autor: Zhang, Haiming, Zhou, Wending, Zhu, Yiyao, Yan, Xu, Gao, Jiantao, Bai, Dongfeng, Cai, Yingjie, Liu, Bingbing, Cui, Shuguang, Li, Zhen

This paper introduces VisionPAD, a novel self-supervised pre-training paradigm designed for vision-centric algorithms in autonomous driving. In contrast to previous approaches that employ neural rendering with explicit depth supervision, VisionPAD ut

Externí odkaz: http://arxiv.org/abs/2411.14716

Zobrazit plný text záznamu

Report

OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction

Autor: Li, Leheng, Qiu, Weichao, Yan, Xu, He, Jing, Zhou, Kaiqiang, Cai, Yingjie, Lian, Qing, Liu, Bingbing, Chen, Ying-Cong

We present OmniBooth, an image generation framework that enables spatial control with instance-level multi-modal customization. For all instances, the multimodal instruction can be described through text prompts or image references. Given a set of us

Externí odkaz: http://arxiv.org/abs/2410.04932

Zobrazit plný text záznamu

Report

SyntheOcc: Synthesize Geometric-Controlled Street View Images through 3D Semantic MPIs

Autor: Li, Leheng, Qiu, Weichao, Cai, Yingjie, Yan, Xu, Lian, Qing, Liu, Bingbing, Chen, Ying-Cong

The advancement of autonomous driving is increasingly reliant on high-quality annotated datasets, especially in the task of 3D occupancy prediction, where the occupancy labels require dense 3D annotation with significant human effort. In this paper,

Externí odkaz: http://arxiv.org/abs/2410.00337

Zobrazit plný text záznamu

Report

Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Autor: He, Jing, Li, Haodong, Yin, Wei, Liang, Yixun, Li, Leheng, Zhou, Kaiqiang, Zhang, Hongbo, Liu, Bingbing, Chen, Ying-Cong

Leveraging the visual priors of pre-trained text-to-image diffusion models offers a promising solution to enhance zero-shot generalization in dense prediction tasks. However, existing methods often uncritically use the original diffusion formulation,

Externí odkaz: http://arxiv.org/abs/2409.18124

Zobrazit plný text záznamu

Report

Efficient Depth-Guided Urban View Synthesis

Autor: Miao, Sheng, Huang, Jiaxin, Bai, Dongfeng, Qiu, Weichao, Liu, Bingbing, Geiger, Andreas, Liao, Yiyi

Recent advances in implicit scene representation enable high-fidelity street view novel view synthesis. However, existing methods optimize a neural radiance field for each scene, relying heavily on dense training images and extensive computation reso

Externí odkaz: http://arxiv.org/abs/2407.12395

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání