Search results - "Lu, Zhuqiang"

Report

B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens

Authors: Lu, Zhuqiang; Yin, Zhenfei; He, Mengwei; Wang, Zhihui; Liu, Zicheng; Wang, Zhiyong; Hu, Kun

Recently, Vision Large Language Models (VLLMs) integrated with vision encoders have shown promising performance in vision understanding. The key of VLLMs is to encode visual content into sequences of visual tokens, enabling VLLMs to simultaneously pr

External link: http://arxiv.org/abs/2412.09919

Report

Autoregressive Omni-Aware Outpainting for Open-Vocabulary 360-Degree Image Generation

Authors: Lu, Zhuqiang; Hu, Kun; Wang, Chaoyue; Bai, Lei; Wang, Zhiyong

A 360-degree (omni-directional) image provides an all-encompassing spherical view of a scene. Recently, there has been an increasing interest in synthesising 360-degree images from conventional narrow field of view (NFoV) images captured by digital c

External link: http://arxiv.org/abs/2309.03467
