Search results - "Ming, Tianshi"

Report

DAMRO: Dive into the Attention Mechanism of LVLM to Reduce Object Hallucination

Authors: Gong, Xuan; Ming, Tianshi; Wang, Xinpeng; Wei, Zhihua

Despite the great success of Large Vision-Language Models (LVLMs), they inevitably suffer from hallucination. As we know, both the visual encoder and the Large Language Model (LLM) decoder in LVLMs are Transformer-based, allowing the model to extract

External link: http://arxiv.org/abs/2410.04514