Zobrazeno 1 - 10
of 1 360
pro vyhledávání: '"Zhang, Jiarui"'
Autor:
Zhang, Jiarui
In modern commercial systems, including Recommendation, Ranking, and E-Commerce platforms, there is a trend towards improving customer experiences by incorporating Personalization context as input into Large Language Models (LLMs). However, LLMs ofte
Externí odkaz:
http://arxiv.org/abs/2409.13093
Autor:
Fan, Zhen, Dai, Peng, Su, Zhuo, Gao, Xu, Lv, Zheng, Zhang, Jiarui, Du, Tianyuan, Wang, Guidong, Zhang, Yang
Egocentric human pose estimation (HPE) using wearable sensors is essential for VR/AR applications. Most methods rely solely on either egocentric-view images or sparse Inertial Measurement Unit (IMU) signals, leading to inaccuracies due to self-occlus
Externí odkaz:
http://arxiv.org/abs/2408.17168
With the development of autonomous driving technology, there are increasing demands for vehicle control, and MPC has become a widely researched topic in both industry and academia. Existing MPC control methods based on vehicle kinematics or dynamics
Externí odkaz:
http://arxiv.org/abs/2407.08401
Stance detection classifies stance relations (namely, Favor, Against, or Neither) between comments and targets. Pretrained language models (PLMs) are widely used to mine the stance relation to improve the performance of stance detection through pretr
Externí odkaz:
http://arxiv.org/abs/2405.10991
Autor:
Jiang, Yifan, Zhang, Jiarui, Sun, Kexuan, Sourati, Zhivar, Ahrabian, Kian, Ma, Kaixin, Ilievski, Filip, Pujara, Jay
While multi-modal large language models (MLLMs) have shown significant progress on many popular visual reasoning benchmarks, whether they possess abstract visual reasoning abilities remains an open question. Similar to the Sudoku puzzles, abstract vi
Externí odkaz:
http://arxiv.org/abs/2404.13591
Multimodal Large Language Models (MLLMs) have recently shown remarkable perceptual capability in answering visual questions, however, little is known about the limits of their perception. In particular, while prior works have provided anecdotal evide
Externí odkaz:
http://arxiv.org/abs/2402.07384
Autor:
Ahrabian, Kian, Sourati, Zhivar, Sun, Kexuan, Zhang, Jiarui, Jiang, Yifan, Morstatter, Fred, Pujara, Jay
While large language models (LLMs) are still being adopted to new domains and utilized in novel applications, we are experiencing an influx of the new generation of foundation models, namely multi-modal large language models (MLLMs). These models int
Externí odkaz:
http://arxiv.org/abs/2401.12117
Passive non-line-of-sight (NLOS) imaging has witnessed rapid development in recent years, due to its ability to image objects that are out of sight. The light transport condition plays an important role in this task since changing the conditions will
Externí odkaz:
http://arxiv.org/abs/2312.16014
Large language models (LLMs) can be used as accessible and intelligent chatbots by constructing natural language queries and directly inputting the prompt into the large language model. However, different prompt' constructions often lead to uncertain
Externí odkaz:
http://arxiv.org/abs/2312.08027