Search results

Report

Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation

Authors: Li, Jinyuan; Li, Ziyan; Li, Han; Yu, Jianfei; Xia, Rui; Sun, Di; Pan, Gang

Grounded Multimodal Named Entity Recognition (GMNER) task aims to identify named entities, entity types and their corresponding visual regions. GMNER task exhibits two challenging attributes: 1) The tenuous correlation between images and text on soci

External link: http://arxiv.org/abs/2406.07268

Report

SemEval-2024 Task 3: Multimodal Emotion Cause Analysis in Conversations

Authors: Wang, Fanfan; Ma, Heqing; Yu, Jianfei; Xia, Rui; Cambria, Erik

Published in: Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)

The ability to understand emotions is an essential component of human-like artificial intelligence, as emotions greatly influence human cognition, decision making, and social interactions. In addition to emotion recognition in conversations, the task

External link: http://arxiv.org/abs/2405.13049

Report

DPPA: Pruning Method for Large Language Model to Model Merging

Authors: Zhu, Yaochen; Xia, Rui; Zhang, Jiajun

Model merging is to combine fine-tuned models derived from multiple domains, with the intent of enhancing the model's proficiency across various domains. The principal concern is the resolution of parameter conflicts. A substantial amount of existing

External link: http://arxiv.org/abs/2403.02799

Report

VCD: Knowledge Base Guided Visual Commonsense Discovery in Images

Authors: Shen, Xiangqing; Song, Yurun; Wu, Siwei; Xia, Rui

Visual commonsense contains knowledge about object properties, relationships, and behaviors in visual data. Discovering visual commonsense can provide a more comprehensive and richer understanding of images, and enhance the reasoning and decision-mak

External link: http://arxiv.org/abs/2402.17213

Report

Generative AI for Math: Part I -- MathPile: A Billion-Token-Scale Pretraining Corpus for Math

Authors: Wang, Zengzhi; Xia, Rui; Liu, Pengfei

High-quality, large-scale corpora are the cornerstone of building foundation models. In this work, we introduce MathPile, a diverse and high-quality math-centric corpus comprising about 9.5 billion tokens. Throughout its creation, we adhered

External link: http://arxiv.org/abs/2312.17120

Report

In-Context Learning for Knowledge Base Question Answering for Unmanned Systems based on Large Language Models

Authors: Chen, Yunlong; Zhang, Yaming; Yu, Jianfei; Yang, Li; Xia, Rui

Knowledge Base Question Answering (KBQA) aims to answer factoid questions based on knowledge bases. However, generating the most appropriate knowledge base query code based on Natural Language Questions (NLQ) poses a significant challenge in KBQA. In

External link: http://arxiv.org/abs/2311.02956

Report

A New Dialogue Response Generation Agent for Large Language Models by Asking Questions to Detect User's Intentions

Authors: Wu, Siwei; Shen, Xiangqing; Xia, Rui

Large Language Models (LLMs), such as ChatGPT, have recently been applied to various NLP tasks due to their open-domain generation capabilities. However, there are two issues with applying LLMs to dialogue tasks. 1. During the dialogue process, users m

External link: http://arxiv.org/abs/2310.03293

Report

Ask Again, Then Fail: Large Language Models' Vacillations in Judgment

Authors: Xie, Qiming; Wang, Zengzhi; Feng, Yi; Xia, Rui

We observe that current conversational language models often waver in their judgments when faced with follow-up questions, even if the original judgment was correct. This wavering presents a significant challenge for generating reliable responses and

External link: http://arxiv.org/abs/2310.02174

Report

On-the-Fly SfM: What you capture is What you get

Authors: Zhan, Zongqian; Xia, Rui; Yu, Yifei; Xu, Yibo; Wang, Xin

Over the last decades, ample achievements have been made on Structure from motion (SfM). However, the vast majority of them basically work in an offline manner, i.e., images are firstly captured and then fed together into a SfM pipeline for obtaining

External link: http://arxiv.org/abs/2309.11883
