Výsledky vyhledávání - "Zhang, Jiarui"

Report

Guided Profile Generation Improves Personalization with LLMs

Autor: Zhang, Jiarui

In modern commercial systems, including Recommendation, Ranking, and E-Commerce platforms, there is a trend towards improving customer experiences by incorporating Personalization context as input into Large Language Models (LLMs). However, LLMs ofte

Externí odkaz: http://arxiv.org/abs/2409.13093

Zobrazit plný text záznamu

Report

EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs

Autor: Fan, Zhen, Dai, Peng, Su, Zhuo, Gao, Xu, Lv, Zheng, Zhang, Jiarui, Du, Tianyuan, Wang, Guidong, Zhang, Yang

Egocentric human pose estimation (HPE) using wearable sensors is essential for VR/AR applications. Most methods rely solely on either egocentric-view images or sparse Inertial Measurement Unit (IMU) signals, leading to inaccuracies due to self-occlus

Externí odkaz: http://arxiv.org/abs/2408.17168

Zobrazit plný text záznamu

Report

Application of Data-Driven Model Predictive Control for Autonomous Vehicle Steering

Autor: Zhang, Jiarui, Kong, Aijing, Tang, Yu, Lv, Zhichao, Guo, Lulu, Hang, Peng

With the development of autonomous driving technology, there are increasing demands for vehicle control, and MPC has become a widely researched topic in both industry and academia. Existing MPC control methods based on vehicle kinematics or dynamics

Externí odkaz: http://arxiv.org/abs/2407.08401

Zobrazit plný text záznamu

Report

Relative Counterfactual Contrastive Learning for Mitigating Pretrained Stance Bias in Stance Detection

Autor: Zhang, Jiarui, Wu, Shaojuan, Zhang, Xiaowang, Feng, Zhiyong

Stance detection classifies stance relations (namely, Favor, Against, or Neither) between comments and targets. Pretrained language models (PLMs) are widely used to mine the stance relation to improve the performance of stance detection through pretr

Externí odkaz: http://arxiv.org/abs/2405.10991

Zobrazit plný text záznamu

Report

MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning

Autor: Jiang, Yifan, Zhang, Jiarui, Sun, Kexuan, Sourati, Zhivar, Ahrabian, Kian, Ma, Kaixin, Ilievski, Filip, Pujara, Jay

While multi-modal large language models (MLLMs) have shown significant progress on many popular visual reasoning benchmarks, whether they possess abstract visual reasoning abilities remains an open question. Similar to the Sudoku puzzles, abstract vi

Externí odkaz: http://arxiv.org/abs/2404.13591

Zobrazit plný text záznamu

Report

Exploring Perceptual Limitation of Multimodal Large Language Models

Autor: Zhang, Jiarui, Hu, Jinyi, Khayatkhoei, Mahyar, Ilievski, Filip, Sun, Maosong

Multimodal Large Language Models (MLLMs) have recently shown remarkable perceptual capability in answering visual questions, however, little is known about the limits of their perception. In particular, while prior works have provided anecdotal evide

Externí odkaz: http://arxiv.org/abs/2402.07384

Zobrazit plný text záznamu

Report

The Curious Case of Nonverbal Abstract Reasoning with Multi-Modal Large Language Models

Autor: Ahrabian, Kian, Sourati, Zhivar, Sun, Kexuan, Zhang, Jiarui, Jiang, Yifan, Morstatter, Fred, Pujara, Jay

While large language models (LLMs) are still being adopted to new domains and utilized in novel applications, we are experiencing an influx of the new generation of foundation models, namely multi-modal large language models (MLLMs). These models int

Externí odkaz: http://arxiv.org/abs/2401.12117

Zobrazit plný text záznamu

Report

Passive Non-Line-of-Sight Imaging with Light Transport Modulation

Autor: Zhang, Jiarui, Geng, Ruixu, Du, Xiaolong, Chen, Yan, Li, Houqiang, Hu, Yang

Passive non-line-of-sight (NLOS) imaging has witnessed rapid development in recent years, due to its ability to image objects that are out of sight. The light transport condition plays an important role in this task since changing the conditions will

Externí odkaz: http://arxiv.org/abs/2312.16014

Zobrazit plný text záznamu

Report

Helping Language Models Learn More: Multi-dimensional Task Prompt for Few-shot Tuning

Autor: Weng, Jinta, Zhang, Jiarui, Hu, Yue, Fa, Daidong, Xuand, Xiaofeng, Huang, Heyan

Large language models (LLMs) can be used as accessible and intelligent chatbots by constructing natural language queries and directly inputting the prompt into the large language model. However, different prompt' constructions often lead to uncertain

Externí odkaz: http://arxiv.org/abs/2312.08027

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání