Zobrazeno 1 - 10
of 15 949
pro vyhledávání: '"An, Haiyun"'
Autor:
He, Jinghan, Zhu, Kuan, Guo, Haiyun, Fang, Junfeng, Hua, Zhenglin, Jia, Yuheng, Tang, Ming, Chua, Tat-Seng, Wang, Jinqiao
Large vision-language models (LVLMs) have made substantial progress in integrating large language models (LLMs) with visual inputs, enabling advanced multimodal reasoning. Despite their success, a persistent challenge is hallucination-where generated
Externí odkaz:
http://arxiv.org/abs/2412.13949
Lane detection plays an important role in autonomous driving perception systems. As deep learning algorithms gain popularity, monocular lane detection methods based on them have demonstrated superior performance and emerged as a key research directio
Externí odkaz:
http://arxiv.org/abs/2411.16316
Out-of-distribution (OOD) detection is crucial for ensuring reliable deployment of machine learning models. Recent advancements focus on utilizing easily accessible auxiliary outliers (e.g., data from the web or other datasets) in training. However,
Externí odkaz:
http://arxiv.org/abs/2411.14049
Continual learning (CL) is crucial for language models to dynamically adapt to the evolving real-world demands. To mitigate the catastrophic forgetting problem in CL, data replay has been proven a simple and effective strategy, and the subsequent dat
Externí odkaz:
http://arxiv.org/abs/2411.06171
This paper presents a novel application of large language models (LLMs) to enhance user comprehension of privacy policies through an interactive dialogue agent. We demonstrate that LLMs significantly outperform traditional models in tasks like Data P
Externí odkaz:
http://arxiv.org/abs/2410.11906
Are Multi-modal Large Language Models (MLLMs) stochastic parrots? Do they genuinely understand? This paper aims to explore the core cognitive abilities that human intelligence builds upon to perceive, comprehend, and reason in MLLMs. To this end, we
Externí odkaz:
http://arxiv.org/abs/2410.10855
Large Language Models (LLMs) boosts human efficiency but also poses misuse risks, with watermarking serving as a reliable method to differentiate AI-generated content from human-created text. In this work, we propose a novel theoretical framework for
Externí odkaz:
http://arxiv.org/abs/2410.02890
Conservation is a critical milestone of cognitive development considered to be supported by both the understanding of quantitative concepts and the reversibility of mental operations. To assess whether this critical component of human intelligence ha
Externí odkaz:
http://arxiv.org/abs/2410.00332
Knowing others' intentions and taking others' perspectives are two core components of human intelligence that are typically considered to be instantiations of theory-of-mind. Infiltrating machines with these abilities is an important step towards bui
Externí odkaz:
http://arxiv.org/abs/2410.00324
Mechanical reasoning is a fundamental ability that sets human intelligence apart from other animal intelligence. Mechanical reasoning allows us to design tools, build bridges and canals, and construct houses which set the foundation of human civiliza
Externí odkaz:
http://arxiv.org/abs/2410.00318