Showing 1 - 10 of 7,289 for search: '"Zhixuan An"'
In this paper, we introduce FAMMA, an open-source benchmark for financial multilingual multimodal question answering (QA). Our benchmark aims to evaluate the abilities of multimodal large language models (MLLMs) in answering questions that require ad
External link:
http://arxiv.org/abs/2410.04526
Authors:
Wei, Zhenyu, Xu, Zhixuan, Guo, Jingxiang, Hou, Yiwen, Gao, Chongkai, Cai, Zhehao, Luo, Jiayu, Shao, Lin
Dexterous grasping is a fundamental yet challenging skill in robotic manipulation, requiring precise interaction between robotic hands and objects. In this paper, we present D(R,O) Grasp, a novel framework that models the interaction between the robo
External link:
http://arxiv.org/abs/2410.01702
Large language models are typically fine-tuned to align with human preferences, but tuning large models is computationally intensive and complex. In this work, we introduce $\textit{Integrated Value Guidance}$ (IVG), a method that uses implicit and e
External link:
http://arxiv.org/abs/2409.17819
This paper introduces GateAttentionPose, an innovative approach that enhances the UniRepLKNet architecture for pose estimation tasks. We present two key contributions: the Agent Attention module and the Gate-Enhanced Feedforward Block (GEFB). The Age
External link:
http://arxiv.org/abs/2409.07798
Pose estimation is a crucial task in computer vision, with wide applications in autonomous driving, human motion capture, and virtual reality. However, existing methods still face challenges in achieving high accuracy, particularly in complex scenes.
External link:
http://arxiv.org/abs/2409.07752
Authors:
Chen, Yuqi, Li, Yifan, Zhou, Kyrie Zhixuan, Fu, Xiaokang, Liu, Lingbo, Bao, Shuming, Sui, Daniel, Zhang, Luyao
In the digital era, blockchain technology, cryptocurrencies, and non-fungible tokens (NFTs) have transformed financial and decentralized systems. However, existing research often neglects the spatiotemporal variations in public sentiment toward these
External link:
http://arxiv.org/abs/2409.00843
The accuracy and efficiency of a coarse-grained (CG) force field are pivotal for high-precision molecular simulations of large systems with complex molecules. We present an automated mapping and optimization framework for molecular simulation (AMOFMS
External link:
http://arxiv.org/abs/2408.06609
Graph clustering is a fundamental and challenging learning task, which is conventionally approached by grouping similar vertices based on edge structure and feature similarity. In contrast to previous methods, in this paper, we investigate how multi-v
External link:
http://arxiv.org/abs/2408.06029
Polynomial reconstruction on Cartesian grids is fundamental in many scientific and engineering applications, yet it is still an open problem how to construct for a finite subset $K$ of $\mathbb{Z}^{\textsf{D}}$ a lattice $\mathcal{T}\subset K$ so tha
External link:
http://arxiv.org/abs/2408.03814
Authors:
Wang, Shiyu, Chu, Zhixuan, Sun, Yinbo, Liu, Yu, Guo, Yuliang, Chen, Yang, Jian, Huiyang, Ma, Lintao, Lu, Xingyu, Zhou, Jun
Accurate workload forecasting is critical for efficient resource management in cloud computing systems, enabling effective scheduling and autoscaling. Despite recent advances with transformer-based forecasting models, challenges remain due to the non
External link:
http://arxiv.org/abs/2407.19697