Showing 1 - 10 of 18,808 for search: '"Ma, Yan"'
ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation
Previous open-source large multimodal models (LMMs) have faced several limitations: (1) they often lack native integration, requiring adapters to align visual representations with pre-trained large language models (LLMs); (2) many are restricted to s
External link:
http://arxiv.org/abs/2407.06135
Author:
Ma, Yubo, Zang, Yuhang, Chen, Liangyu, Chen, Meiqi, Jiao, Yizhu, Li, Xinze, Lu, Xinyuan, Liu, Ziyu, Ma, Yan, Dong, Xiaoyi, Zhang, Pan, Pan, Liangming, Jiang, Yu-Gang, Wang, Jiaqi, Cao, Yixin, Sun, Aixin
Understanding documents with rich layouts and multi-modal components is a long-standing and practical task. Recent Large Vision-Language Models (LVLMs) have made remarkable strides in various tasks, particularly in single-page document understanding
External link:
http://arxiv.org/abs/2407.01523
Author:
Huang, Zhen, Wang, Zengzhi, Xia, Shijie, Li, Xuefeng, Zou, Haoyang, Xu, Ruijie, Fan, Run-Ze, Ye, Lyumanshan, Chern, Ethan, Ye, Yixin, Zhang, Yikai, Yang, Yuqing, Wu, Ting, Wang, Binjie, Sun, Shichao, Xiao, Yang, Li, Yiyuan, Zhou, Fan, Chern, Steffi, Qin, Yiwei, Ma, Yan, Su, Jiadi, Liu, Yixiu, Zheng, Yuxiang, Zhang, Shaoting, Lin, Dahua, Qiao, Yu, Liu, Pengfei
The evolution of Artificial Intelligence (AI) has been significantly accelerated by advancements in Large Language Models (LLMs) and Large Multimodal Models (LMMs), gradually showcasing potential cognitive reasoning abilities in problem-solving and s
External link:
http://arxiv.org/abs/2406.12753
A story premise succinctly defines a story's main idea, foundation, and trajectory. It serves as the initial trigger in automatic story generation. Existing sources of story premises are limited by a lack of diversity, uneven quality, and high costs
External link:
http://arxiv.org/abs/2406.05690
In this article, we present the package Blade as the first implementation of the block-triangular form improved Feynman integral reduction method. The block-triangular form has orders of magnitude fewer equations compared to the plain integration-by-
External link:
http://arxiv.org/abs/2405.14621
We propose to measure the energy correlator in quarkonium production, which tracks the energy deposited in the calorimeter $\chi$-angular distance away from the identified quarkonium. The observable eliminates the need for jets while sustaining the p
External link:
http://arxiv.org/abs/2405.10056
Author:
Ma Yan, Pierre Haski
“Heartbreakingly inspirational.” (AsianWeek) Ma Yan's heart-wrenching, honest diary chronicles her struggle to escape hardship through her persistent, sometimes desperate, attempts to continue her schooling. In a drought-stricken corner of rural Ch
The next Point of Interest (POI) recommendation aims to recommend the next POI for users at a specific time. As users' check-in records can be viewed as a long sequence, methods based on Recurrent Neural Networks (RNNs) have recently shown good appli
External link:
http://arxiv.org/abs/2404.00367