Showing 1 - 5 of 5 for the search: '"Ma, Zhiyin"'
While large multi-modal models (LMM) have shown notable progress in multi-modal tasks, their capabilities in tasks involving dense textual content remain to be fully explored. Dense text, which carries important information, is often found in documents …
External link:
http://arxiv.org/abs/2405.06706
We present TextMonkey, a large multimodal model (LMM) tailored for text-centric tasks. Our approach introduces enhancements across several dimensions: by adopting Shifted Window Attention with zero-initialization, we achieve cross-window connectivity … (see the sketch after this entry's external link).
External link:
http://arxiv.org/abs/2403.04473
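The snippet above names Shifted Window Attention with zero-initialization as the source of cross-window connectivity. Below is a minimal PyTorch sketch of that general idea, assuming a pre-norm residual block whose output projection is zero-initialized so the block acts as an identity at the start of training; the class name, shapes, and window handling are illustrative assumptions, not TextMonkey's released code.

import torch
import torch.nn as nn

class WindowAttnBlock(nn.Module):
    """Window attention with optional shift; zero-initialized output projection."""
    def __init__(self, dim, num_heads, window_size, shift=0):
        super().__init__()
        self.window_size = window_size
        self.shift = shift                      # shift > 0 connects neighbouring windows
        self.norm = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.proj = nn.Linear(dim, dim)
        nn.init.zeros_(self.proj.weight)        # zero-init: the block is a no-op at step 0,
        nn.init.zeros_(self.proj.bias)          # so it does not disturb pretrained features

    def forward(self, x):                       # x: (B, H, W, C) feature map
        B, H, W, C = x.shape
        if self.shift:
            x = torch.roll(x, shifts=(-self.shift, -self.shift), dims=(1, 2))
        ws = self.window_size
        # partition into non-overlapping ws x ws windows (H and W must be multiples of ws)
        win = x.view(B, H // ws, ws, W // ws, ws, C)
        win = win.permute(0, 1, 3, 2, 4, 5).reshape(-1, ws * ws, C)
        h = self.norm(win)
        h, _ = self.attn(h, h, h)
        win = win + self.proj(h)                # residual branch starts at zero
        # reverse the window partition
        win = win.view(B, H // ws, W // ws, ws, ws, C)
        x = win.permute(0, 1, 3, 2, 4, 5).reshape(B, H, W, C)
        if self.shift:
            x = torch.roll(x, shifts=(self.shift, self.shift), dims=(1, 2))
        return x

# Usage (assumed shapes): WindowAttnBlock(256, 8, window_size=7, shift=3)(torch.rand(1, 28, 28, 256))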
Authors:
Li, Zhang; Yang, Biao; Liu, Qiang; Ma, Zhiyin; Zhang, Shuo; Yang, Jingxu; Sun, Yabo; Liu, Yuliang; Bai, Xiang
Large Multimodal Models (LMMs) have shown promise in vision-language tasks but struggle with high-resolution input and detailed scene understanding. Addressing these challenges, we introduce Monkey to enhance LMM capabilities. Firstly, Monkey processes … (see the tiling sketch after this entry's external link).
External link:
http://arxiv.org/abs/2311.06607
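The Monkey abstract above is cut off before it explains how high-resolution input is handled; a common approach, and an assumption here rather than the paper's actual pipeline, is to split the image into encoder-sized tiles plus a resized global view so each piece fits the vision encoder's native resolution. A minimal sketch, with the 448-pixel tile size and the tile_image name chosen for illustration:

import torch
import torch.nn.functional as F

def tile_image(img: torch.Tensor, tile: int = 448) -> torch.Tensor:
    """img: (C, H, W). Returns (N + 1, C, tile, tile): N local tiles plus one global view."""
    C, H, W = img.shape
    # pad on the bottom/right so H and W become multiples of the tile size
    pad_h = (tile - H % tile) % tile
    pad_w = (tile - W % tile) % tile
    img = F.pad(img, (0, pad_w, 0, pad_h))
    # non-overlapping tiles via unfold: (C, nH, nW, tile, tile) -> (nH * nW, C, tile, tile)
    tiles = img.unfold(1, tile, tile).unfold(2, tile, tile)
    tiles = tiles.permute(1, 2, 0, 3, 4).reshape(-1, C, tile, tile)
    # a downscaled view of the whole image keeps global context alongside the tiles
    global_view = F.interpolate(img.unsqueeze(0), size=(tile, tile),
                                mode="bilinear", align_corners=False)
    return torch.cat([tiles, global_view], dim=0)

# Usage: tile_image(torch.rand(3, 896, 1344)).shape -> torch.Size([7, 3, 448, 448])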
Academic article (logging in is required to view this result).
Authors:
Ma, Fangyuan; Dai, Shujuan (shujuandai@163.com); Tao, Dongping; Tao, Youjun (2576101555@qq.com); Ma, Zhiyin
Published in:
Energy Sources Part A: Recovery, Utilization & Environmental Effects. 2021, Vol. 43 Issue 10, p1151-1161. 11p.