Showing 1 - 10 of 1,288
for search: '"Vu, Thuy"'
The performance of large language models (LLMs) in natural language processing (NLP) tasks is significantly influenced by the quality and diversity of data used for supervised fine-tuning (SFT). Current data selection methods often focus solely on qu…
External link:
http://arxiv.org/abs/2410.12458
Author:
Huynh, Tuan-Luc, Vu, Thuy-Trang, Wang, Weiqing, Wei, Yinwei, Le, Trung, Gasevic, Dragan, Li, Yuan-Fang, Do, Thanh-Toan
Differentiable Search Index (DSI) utilizes Pre-trained Language Models (PLMs) for efficient document retrieval without relying on external indexes. However, DSI needs full re-training to handle updates in dynamic corpora, causing significant computat…
External link:
http://arxiv.org/abs/2406.12593
Recent studies have shown that maintaining a consistent response style by human experts and enhancing data quality in training sets can significantly improve the performance of fine-tuned Large Language Models (LLMs) while reducing the number of trai…
External link:
http://arxiv.org/abs/2406.10882
Recent advancements in multimodal large language models (MLLMs) have made significant progress in integrating information across various modalities, yet real-world applications in educational and scientific domains remain challenging. This paper intr…
External link:
http://arxiv.org/abs/2406.10880
Large language models (LLMs) are typically fine-tuned on diverse and extensive datasets sourced from various origins to develop a comprehensive range of skills, such as writing, reasoning, chatting, coding, and more. Each skill has unique characteris…
External link:
http://arxiv.org/abs/2406.08811
Author:
Nguyen, Minh-Vuong, Luo, Linhao, Shiri, Fatemeh, Phung, Dinh, Li, Yuan-Fang, Vu, Thuy-Trang, Haffari, Gholamreza
Large language models (LLMs) demonstrate strong reasoning abilities when prompted to generate chain-of-thought (CoT) explanations alongside answers. However, previous research on evaluating LLMs has solely focused on answer accuracy, neglecting the c…
External link:
http://arxiv.org/abs/2402.11199
Simultaneous machine translation (SimulMT) presents a challenging trade-off between translation quality and latency. Recent studies have shown that LLMs can achieve good performance in SimulMT tasks. However, this often comes at the expense of high i…
External link:
http://arxiv.org/abs/2402.10552
Large language models (LLMs) are not amenable to frequent re-training, due to the high training costs arising from their massive scale. However, updates are necessary to endow LLMs with new skills and keep them up-to-date with rapidly evolving human know…
External link:
http://arxiv.org/abs/2402.01364
Large language models (LLMs) have significantly advanced various natural language processing (NLP) tasks. Recent research indicates that moderately-sized LLMs often outperform larger ones after task-specific fine-tuning. This study focuses on adaptin…
External link:
http://arxiv.org/abs/2401.06468
Published in:
Journal of Work-Applied Management, 2024, Vol. 16, Issue 2, pp. 303-315.
External link:
http://www.emeraldinsight.com/doi/10.1108/JWAM-11-2023-0121