Výsledky vyhledávání - "Han-WenCheng"

Report

RAWMamba: Unified sRGB-to-RAW De-rendering With State Space Model

Autor: Chen, Hongjun, Han, Wencheng, Zheng, Huan, Shen, Jianbing

Recent advancements in sRGB-to-RAW de-rendering have increasingly emphasized metadata-driven approaches to reconstruct RAW data from sRGB images, supplemented by partial RAW information. In image-based de-rendering, metadata is commonly obtained thro

Externí odkaz: http://arxiv.org/abs/2411.11717

Zobrazit plný text záznamu

Report

DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation

Autor: Yan, Tianyi, Wu, Dongming, Han, Wencheng, Jiang, Junpeng, Zhou, Xia, Zhan, Kun, Xu, Cheng-zhong, Shen, Jianbing

Autonomous driving evaluation requires simulation environments that closely replicate actual road conditions, including real-world sensory data and responsive feedback loops. However, many existing simulations need to predict waypoints along fixed ro

Externí odkaz: http://arxiv.org/abs/2411.11252

Zobrazit plný text záznamu

Report

Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion

Autor: Wei, Haoran, Han, Wencheng, Dong, Xingping, Shen, Jianbing

Recent diffusion-based Single-image 3D portrait generation methods typically employ 2D diffusion models to provide multi-view knowledge, which is then distilled into 3D representations. However, these methods usually struggle to produce high-fidelity

Externí odkaz: http://arxiv.org/abs/2411.10369

Zobrazit plný text záznamu

Report

ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction

Autor: Chen, Dubing, Fang, Jin, Han, Wencheng, Cheng, Xinjing, Yin, Junbo, Xu, Chenzhong, Khan, Fahad Shahbaz, Shen, Jianbing

Vision-based semantic occupancy and flow prediction plays a crucial role in providing spatiotemporal cues for real-world tasks, such as autonomous driving. Existing methods prioritize higher accuracy to cater to the demands of these tasks. In this wo

Externí odkaz: http://arxiv.org/abs/2411.07725

Zobrazit plný text záznamu

Report

Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution

Autor: Zheng, Huan, Han, Wencheng, Shen, Jianbing

Recovering high-quality depth maps from compressed sources has gained significant attention due to the limitations of consumer-grade depth cameras and the bandwidth restrictions during data transmission. However, current methods still suffer from two

Externí odkaz: http://arxiv.org/abs/2411.03239

Zobrazit plný text záznamu

Report

High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior

Autor: Han, Wencheng, Shen, Jianbing

In the area of self-supervised monocular depth estimation, models that utilize rich-resource inputs, such as high-resolution and multi-frame inputs, typically achieve better performance than models that use ordinary single image input. However, these

Externí odkaz: http://arxiv.org/abs/2408.00361

Zobrazit plný text záznamu

Report

RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception

Autor: Li, Chunliang, Han, Wencheng, Yin, Junbo, Zhao, Sanyuan, Shen, Jianbing

Concurrent processing of multiple autonomous driving 3D perception tasks within the same spatiotemporal scene poses a significant challenge, in particular due to the computational inefficiencies and feature competition between tasks when using tradit

Externí odkaz: http://arxiv.org/abs/2407.10876

Zobrazit plný text záznamu

Report

AdaOcc: Adaptive Forward View Transformation and Flow Modeling for 3D Occupancy and Flow Prediction

Autor: Chen, Dubing, Han, Wencheng, Fang, Jin, Shen, Jianbing

In this technical report, we present our solution for the Vision-Centric 3D Occupancy and Flow Prediction track in the nuScenes Open-Occ Dataset Challenge at CVPR 2024. Our innovative approach involves a dual-stage framework that enhances 3D occupanc

Externí odkaz: http://arxiv.org/abs/2407.01436

Zobrazit plný text záznamu

Report

Bootstrapping Referring Multi-Object Tracking

Autor: Zhang, Yani, Wu, Dongming, Han, Wencheng, Dong, Xingping

Referring multi-object tracking (RMOT) aims at detecting and tracking multiple objects following human instruction represented by a natural language expression. Existing RMOT benchmarks are usually formulated through manual annotations, integrated wi

Externí odkaz: http://arxiv.org/abs/2406.05039

Zobrazit plný text záznamu

Report

DME-Driver: Integrating Human Decision Logic and 3D Scene Perception in Autonomous Driving

Autor: Han, Wencheng, Guo, Dongqian, Xu, Cheng-Zhong, Shen, Jianbing

In the field of autonomous driving, two important features of autonomous driving car systems are the explainability of decision logic and the accuracy of environmental perception. This paper introduces DME-Driver, a new autonomous driving system that

Externí odkaz: http://arxiv.org/abs/2401.03641

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání