Výsledky vyhledávání

Report

ClawMachine: Fetching Visual Tokens as An Entity for Referring and Grounding

Autor: Ma, Tianren, Xie, Lingxi, Tian, Yunjie, Yang, Boyu, Zhang, Yuan, Doermann, David, Ye, Qixiang

An essential topic for multimodal large language models (MLLMs) is aligning vision and language concepts at a finer level. In particular, we devote efforts to encoding visual referential information for tasks such as referring and grounding. Existing

Externí odkaz: http://arxiv.org/abs/2406.11327

Zobrazit plný text záznamu

Report

Artemis: Towards Referential Understanding in Complex Videos

Autor: Qiu, Jihao, Zhang, Yuan, Tang, Xi, Xie, Lingxi, Ma, Tianren, Yan, Pengyu, Doermann, David, Ye, Qixiang, Tian, Yunjie

Videos carry rich visual information including object description, action, interaction, etc., but the existing multimodal large language models (MLLMs) fell short in referential understanding scenarios such as video-based referring. In this paper, we

Externí odkaz: http://arxiv.org/abs/2406.00258

Zobrazit plný text záznamu

Report

ChatterBox: Multi-round Multimodal Referring and Grounding

Autor: Tian, Yunjie, Ma, Tianren, Xie, Lingxi, Qiu, Jihao, Tang, Xi, Zhang, Yuan, Jiao, Jianbin, Tian, Qi, Ye, Qixiang

In this study, we establish a baseline for a new task named multimodal multi-round referring and grounding (MRG), opening up a promising direction for instance-level multimodal dialogues. We present a new benchmark and an efficient vision-language mo

Externí odkaz: http://arxiv.org/abs/2401.13307

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Contents list.

Publikováno v: Catalysis Science & Technology; 3/21/2021, Vol. 11 Issue 6, p1985-1993, 9p

Zobrazit plný text záznamu

Elektronická kniha

Intelligent Data Engineering and Automated Learning – IDEAL 2017 : 18th International Conference, Guilin, China, October 30 – November 1, 2017, Proceedings

Autor: Hujun Yin, Yang Gao, Songcan Chen, Yimin Wen, Guoyong Cai, Tianlong Gu, Junping Du, Antonio J. Tallón-Ballesteros, Minling Zhang

This book constitutes the refereed proceedings of the 18th International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2017, held in Guilin, China, in October/November 2017.The 65 full papers presented were carefully review

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání