Search results

Report

Comparable Demonstrations are Important in In-Context Learning: A Novel Perspective on Demonstration Selection

Authors: Fan, Caoyun; Tian, Jidong; Li, Yitian; He, Hao; Jin, Yaohui

In-Context Learning (ICL) is an important paradigm for adapting Large Language Models (LLMs) to downstream tasks through a few demonstrations. Despite the great success of ICL, the limitation of the demonstration number may lead to demonstration bias

External link: http://arxiv.org/abs/2312.07476

Report

Can Large Language Models Serve as Rational Players in Game Theory? A Systematic Analysis

Authors: Fan, Caoyun; Chen, Jindou; Jin, Yaohui; He, Hao

Game theory, as an analytical tool, is frequently utilized to analyze human behavior in social science research. With the high alignment between the behavior of Large Language Models (LLMs) and humans, a promising research direction is to employ LLMs

External link: http://arxiv.org/abs/2312.05488

Report

Chain-of-Thought Tuning: Masked Language Models can also Think Step By Step in Natural Language Understanding

Authors: Fan, Caoyun; Tian, Jidong; Li, Yitian; Chen, Wenqing; He, Hao; Jin, Yaohui

Chain-of-Thought (CoT) is a technique that guides Large Language Models (LLMs) to decompose complex tasks into multi-step reasoning through intermediate steps in natural language form. Briefly, CoT enables LLMs to think step by step. However, althoug

External link: http://arxiv.org/abs/2310.11721

Report

Accurate Use of Label Dependency in Multi-Label Text Classification Through the Lens of Causality

Authors: Fan, Caoyun; Chen, Wenqing; Tian, Jidong; Li, Yitian; He, Hao; Jin, Yaohui

Multi-Label Text Classification (MLTC) aims to assign the most relevant labels to each given text. Existing methods demonstrate that label dependency can help to improve the model's performance. However, the introduction of label dependency may cause

External link: http://arxiv.org/abs/2310.07588

Report

Unlock the Potential of Counterfactually-Augmented Data in Out-Of-Distribution Generalization

Authors: Fan, Caoyun; Chen, Wenqing; Tian, Jidong; Li, Yitian; He, Hao; Jin, Yaohui

Counterfactually-Augmented Data (CAD) -- minimal editing of sentences to flip the corresponding labels -- has the potential to improve the Out-Of-Distribution (OOD) generalization capability of language models, as CAD induces language models to explo

External link: http://arxiv.org/abs/2310.06666

Report

MaxGNR: A Dynamic Weight Strategy via Maximizing Gradient-to-Noise Ratio for Multi-Task Learning

Authors: Fan, Caoyun; Chen, Wenqing; Tian, Jidong; Li, Yitian; He, Hao; Jin, Yaohui

When modeling related tasks in computer vision, Multi-Task Learning (MTL) can outperform Single-Task Learning (STL) due to its ability to capture intrinsic relatedness among tasks. However, MTL may encounter the insufficient training problem, i.e., s

External link: http://arxiv.org/abs/2302.09352

Report

Improving the Out-Of-Distribution Generalization Capability of Language Models: Counterfactually-Augmented Data is not Enough

Authors: Fan, Caoyun; Chen, Wenqing; Tian, Jidong; Li, Yitian; He, Hao; Jin, Yaohui

Counterfactually-Augmented Data (CAD) has the potential to improve language models' Out-Of-Distribution (OOD) generalization capability, as CAD induces language models to exploit causal features and exclude spurious correlations. However, the empiric

External link: http://arxiv.org/abs/2302.09345

Report

Dependent Multi-Task Learning with Causal Intervention for Image Captioning

Authors: Chen, Wenqing; Tian, Jidong; Fan, Caoyun; He, Hao; Jin, Yaohui

Recent work for image captioning mainly followed an extract-then-generate paradigm, pre-extracting a sequence of object-based features and then formulating image captioning as a single sequence-to-sequence task. Although promising, we observed two pr

External link: http://arxiv.org/abs/2105.08573

Academic article

Unlock the Potential of Counterfactually-Augmented Data in Out-Of-Distribution Generalization

Authors: Fan, Caoyun; Chen, Wenqing; Tian, Jidong; Li, Yitian; He, Hao; Jin, Yaohui

Published in: Expert Systems With Applications, 15 March 2024, 238, Part C
