Showing 1 - 10 of 113 for search: '"Ma, Yan"'
ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation
Previous open-source large multimodal models (LMMs) have faced several limitations: (1) they often lack native integration, requiring adapters to align visual representations with pre-trained large language models (LLMs); (2) many are restricted to s
External link:
http://arxiv.org/abs/2407.06135
Authors:
Ma, Yubo, Zang, Yuhang, Chen, Liangyu, Chen, Meiqi, Jiao, Yizhu, Li, Xinze, Lu, Xinyuan, Liu, Ziyu, Ma, Yan, Dong, Xiaoyi, Zhang, Pan, Pan, Liangming, Jiang, Yu-Gang, Wang, Jiaqi, Cao, Yixin, Sun, Aixin
Understanding documents with rich layouts and multi-modal components is a long-standing and practical task. Recent Large Vision-Language Models (LVLMs) have made remarkable strides in various tasks, particularly in single-page document understanding
External link:
http://arxiv.org/abs/2407.01523
Authors:
Huang, Zhen, Wang, Zengzhi, Xia, Shijie, Li, Xuefeng, Zou, Haoyang, Xu, Ruijie, Fan, Run-Ze, Ye, Lyumanshan, Chern, Ethan, Ye, Yixin, Zhang, Yikai, Yang, Yuqing, Wu, Ting, Wang, Binjie, Sun, Shichao, Xiao, Yang, Li, Yiyuan, Zhou, Fan, Chern, Steffi, Qin, Yiwei, Ma, Yan, Su, Jiadi, Liu, Yixiu, Zheng, Yuxiang, Zhang, Shaoting, Lin, Dahua, Qiao, Yu, Liu, Pengfei
The evolution of Artificial Intelligence (AI) has been significantly accelerated by advancements in Large Language Models (LLMs) and Large Multimodal Models (LMMs), gradually showcasing potential cognitive reasoning abilities in problem-solving and s
External link:
http://arxiv.org/abs/2406.12753
A story premise succinctly defines a story's main idea, foundation, and trajectory. It serves as the initial trigger in automatic story generation. Existing sources of story premises are limited by a lack of diversity, uneven quality, and high costs
External link:
http://arxiv.org/abs/2406.05690
In this article, we present the package Blade as the first implementation of the block-triangular form improved Feynman integral reduction method. The block-triangular form has orders of magnitude fewer equations compared to the plain integration-by-
External link:
http://arxiv.org/abs/2405.14621
We propose to measure the energy correlator in quarkonium production, which tracks the energy deposited in the calorimeter $\chi$-angular distance away from the identified quarkonium. The observable eliminates the need for jets while sustaining the p
External link:
http://arxiv.org/abs/2405.10056
The next Point of Interest (POI) recommendation aims to recommend the next POI for users at a specific time. As users' check-in records can be viewed as a long sequence, methods based on Recurrent Neural Networks (RNNs) have recently shown good appli
External link:
http://arxiv.org/abs/2404.00367
Authors:
Wei, Dehui, Zhang, Jiao, Li, Haozhe, Xue, Zhichen, Peng, Yajie, Pang, Xiaofei, Han, Rui, Ma, Yan, Li, Jialin
As ByteDance's business expands, the substantial infrastructure expenses associated with centralized Content Delivery Network (CDN) networks have rendered content distribution costs prohibitively high. In response, we embarked on exploring a peer-to-
External link:
http://arxiv.org/abs/2401.15839
Published in:
JHEP 06 (2024) 216
We study the mass spectra of hidden-charm tetraquark systems with quantum numbers $(I^G)J^P=(1^+)1^+$ using QCD sum rules. The analysis incorporates the complete next-to-leading order (NLO) contribution to the perturbative QCD part of the operator pr
External link:
http://arxiv.org/abs/2312.14224
Published in:
Phys.Rev.Lett. 132 (2024) 231802
We present the results for the complete next-to-leading order electroweak corrections to $pp \to HH$ at the Large Hadron Collider, focusing on the dominant gluon-gluon fusion process. While the corrections at the total cross-section level are approxi
External link:
http://arxiv.org/abs/2311.16963