Výsledky vyhledávání - "Ouyang, Xuecheng"

Report

Imp: Highly Capable Large Multimodal Models for Mobile Devices

Autor: Shao, Zhenwei, Yu, Zhou, Yu, Jun, Ouyang, Xuecheng, Zheng, Lihao, Gai, Zhenbiao, Wang, Mingyang, Ding, Jiajun

By harnessing the capabilities of large language models (LLMs), recent large multimodal models (LMMs) have shown remarkable versatility in open-world multimodal understanding. Nevertheless, they are usually parameter-heavy and computation-intensive,

Externí odkaz: http://arxiv.org/abs/2405.12107

Zobrazit plný text záznamu

Report

Prophet: Prompting Large Language Models with Complementary Answer Heuristics for Knowledge-based Visual Question Answering

Autor: Yu, Zhou, Ouyang, Xuecheng, Shao, Zhenwei, Wang, Meng, Yu, Jun

Knowledge-based visual question answering (VQA) requires external knowledge beyond the image to answer the question. Early studies retrieve required knowledge from explicit knowledge bases (KBs), which often introduces irrelevant information to the q

Externí odkaz: http://arxiv.org/abs/2303.01903

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání