Výsledky vyhledávání - "Liu, Zhijian"

Report

Sparse Refinement for Efficient High-Resolution Semantic Segmentation

Autor: Liu, Zhijian, Zhang, Zhuoyang, Khaki, Samir, Yang, Shang, Tang, Haotian, Xu, Chenfeng, Keutzer, Kurt, Han, Song

Semantic segmentation empowers numerous real-world applications, such as autonomous driving and augmented/mixed reality. These applications often operate on high-resolution images (e.g., 8 megapixels) to capture the fine details. However, this comes

Externí odkaz: http://arxiv.org/abs/2407.19014

Zobrazit plný text záznamu

Report

LidarDM: Generative LiDAR Simulation in a Generated World

Autor: Zyrianov, Vlas, Che, Henry, Liu, Zhijian, Wang, Shenlong

We present LidarDM, a novel LiDAR generative model capable of producing realistic, layout-aware, physically plausible, and temporally coherent LiDAR videos. LidarDM stands out with two unprecedented capabilities in LiDAR generative modeling: (i) LiDA

Externí odkaz: http://arxiv.org/abs/2404.02903

Zobrazit plný text záznamu

Report

StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation

Autor: Kodaira, Akio, Xu, Chenfeng, Hazama, Toshiki, Yoshimoto, Takanori, Ohno, Kohei, Mitsuhori, Shogo, Sugano, Soichi, Cho, Hanying, Liu, Zhijian, Keutzer, Kurt

We introduce StreamDiffusion, a real-time diffusion pipeline designed for interactive image generation. Existing diffusion models are adept at creating images from text or image prompts, yet they often fall short in real-time interaction. This limita

Externí odkaz: http://arxiv.org/abs/2312.12491

Zobrazit plný text záznamu

Report

Point Transformer V3: Simpler, Faster, Stronger

Autor: Wu, Xiaoyang, Jiang, Li, Wang, Peng-Shuai, Liu, Zhijian, Liu, Xihui, Qiao, Yu, Ouyang, Wanli, He, Tong, Zhao, Hengshuang

This paper is not motivated to seek innovation within the attention mechanism. Instead, it focuses on overcoming the existing trade-offs between accuracy and efficiency within the context of point cloud processing, leveraging the power of scale. Draw

Externí odkaz: http://arxiv.org/abs/2312.10035

Zobrazit plný text záznamu

Report

TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs

Autor: Tang, Haotian, Yang, Shang, Liu, Zhijian, Hong, Ke, Yu, Zhongming, Li, Xiuyu, Dai, Guohao, Wang, Yu, Han, Song

Sparse convolution plays a pivotal role in emerging workloads, including point cloud processing in AR/VR, autonomous driving, and graph understanding in recommendation systems. Since the computation pattern is sparse and irregular, specialized high-p

Externí odkaz: http://arxiv.org/abs/2311.12862

Zobrazit plný text záznamu

Report

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Autor: Chen, Yukang, Qian, Shengju, Tang, Haotian, Lai, Xin, Liu, Zhijian, Han, Song, Jia, Jiaya

We present LongLoRA, an efficient fine-tuning approach that extends the context sizes of pre-trained large language models (LLMs), with limited computation cost. Typically, training LLMs with long context sizes is computationally expensive, requiring

Externí odkaz: http://arxiv.org/abs/2309.12307

Zobrazit plný text záznamu

Report

MapPrior: Bird's-Eye View Map Layout Estimation with Generative Models

Autor: Zhu, Xiyue, Zyrianov, Vlas, Liu, Zhijian, Wang, Shenlong

Despite tremendous advancements in bird's-eye view (BEV) perception, existing models fall short in generating realistic and coherent semantic map layouts, and they fail to account for uncertainties arising from partial sensor information (such as occ

Externí odkaz: http://arxiv.org/abs/2308.12963

Zobrazit plný text záznamu

Report

CA-CentripetalNet: A novel anchor-free deep learning framework for hardhat wearing detection

Autor: Liu, Zhijian, Cai, Nian, Ouyang, Wensheng, Zhang, Chengbin, Tian, Nili, Wang, Han

Publikováno v: Signal, Image and Video Processing,2023

Automatic hardhat wearing detection can strengthen the safety management in construction sites, which is still challenging due to complicated video surveillance scenes. To deal with the poor generalization of previous deep learning based methods, a n

Externí odkaz: http://arxiv.org/abs/2307.04103

Zobrazit plný text záznamu

Report

SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer

Autor: Chen, Xuanyao, Liu, Zhijian, Tang, Haotian, Yi, Li, Zhao, Hang, Han, Song

High-resolution images enable neural networks to learn richer visual representations. However, this improved performance comes at the cost of growing computational complexity, hindering their usage in latency-sensitive applications. As not all pixels

Externí odkaz: http://arxiv.org/abs/2303.17605

Zobrazit plný text záznamu

Report

FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer

Autor: Liu, Zhijian, Yang, Xinyu, Tang, Haotian, Yang, Shang, Han, Song

Transformer, as an alternative to CNN, has been proven effective in many modalities (e.g., texts and images). For 3D point cloud transformers, existing efforts focus primarily on pushing their accuracy to the state-of-the-art level. However, their la

Externí odkaz: http://arxiv.org/abs/2301.08739

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání