Výsledky vyhledávání - "Tian, Jidong"

Report

Hypothesis Testing Prompting Improves Deductive Reasoning in Large Language Models

Autor: Li, Yitian, Tian, Jidong, He, Hao, Jin, Yaohui

Combining different forms of prompts with pre-trained large language models has yielded remarkable results on reasoning tasks (e.g. Chain-of-Thought prompting). However, along with testing on more complex reasoning, these methods also expose problems

Externí odkaz: http://arxiv.org/abs/2405.06707

Zobrazit plný text záznamu

Report

Logical Negation Augmenting and Debiasing for Prompt-based Methods

Autor: Li, Yitian, Tian, Jidong, He, Hao, Jin, Yaohui

Prompt-based methods have gained increasing attention on NLP and shown validity on many downstream tasks. Many works have focused on mining these methods' potential for knowledge extraction, but few explore their ability to make logical reasoning. In

Externí odkaz: http://arxiv.org/abs/2405.04872

Zobrazit plný text záznamu

Report

Look Before You Leap: Problem Elaboration Prompting Improves Mathematical Reasoning in Large Language Models

Autor: Liao, Haoran, Tian, Jidong, Hu, Shaohua, He, Hao, Jin, Yaohui

Large language models (LLMs) still grapple with complex tasks like mathematical reasoning. Despite significant efforts invested in improving prefix prompts or reasoning process, the crucial role of problem context might have been neglected. Accurate

Externí odkaz: http://arxiv.org/abs/2402.15764

Zobrazit plný text záznamu

Report

Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent

Autor: Liao, Haoran, Du, Qinyi, Hu, Shaohua, He, Hao, Xu, Yanyan, Tian, Jidong, Jin, Yaohui

Large language models (LLMs) face challenges in solving complex mathematical problems that require comprehensive capacities to parse the statements, associate domain knowledge, perform compound logical reasoning, and integrate the intermediate ration

Externí odkaz: http://arxiv.org/abs/2312.08926

Zobrazit plný text záznamu

Report

Comparable Demonstrations are Important in In-Context Learning: A Novel Perspective on Demonstration Selection

Autor: Fan, Caoyun, Tian, Jidong, Li, Yitian, He, Hao, Jin, Yaohui

In-Context Learning (ICL) is an important paradigm for adapting Large Language Models (LLMs) to downstream tasks through a few demonstrations. Despite the great success of ICL, the limitation of the demonstration number may lead to demonstration bias

Externí odkaz: http://arxiv.org/abs/2312.07476

Zobrazit plný text záznamu

Report

Chain-of-Thought Tuning: Masked Language Models can also Think Step By Step in Natural Language Understanding

Autor: Fan, Caoyun, Tian, Jidong, Li, Yitian, Chen, Wenqing, He, Hao, Jin, Yaohui

Chain-of-Thought (CoT) is a technique that guides Large Language Models (LLMs) to decompose complex tasks into multi-step reasoning through intermediate steps in natural language form. Briefly, CoT enables LLMs to think step by step. However, althoug

Externí odkaz: http://arxiv.org/abs/2310.11721

Zobrazit plný text záznamu

Report

Accurate Use of Label Dependency in Multi-Label Text Classification Through the Lens of Causality

Autor: Fan, Caoyun, Chen, Wenqing, Tian, Jidong, Li, Yitian, He, Hao, Jin, Yaohui

Multi-Label Text Classification (MLTC) aims to assign the most relevant labels to each given text. Existing methods demonstrate that label dependency can help to improve the model's performance. However, the introduction of label dependency may cause

Externí odkaz: http://arxiv.org/abs/2310.07588

Zobrazit plný text záznamu

Report

Unlock the Potential of Counterfactually-Augmented Data in Out-Of-Distribution Generalization

Autor: Fan, Caoyun, Chen, Wenqing, Tian, Jidong, Li, Yitian, He, Hao, Jin, Yaohui

Counterfactually-Augmented Data (CAD) -- minimal editing of sentences to flip the corresponding labels -- has the potential to improve the Out-Of-Distribution (OOD) generalization capability of language models, as CAD induces language models to explo

Externí odkaz: http://arxiv.org/abs/2310.06666

Zobrazit plný text záznamu

Report

MaxGNR: A Dynamic Weight Strategy via Maximizing Gradient-to-Noise Ratio for Multi-Task Learning

Autor: Fan, Caoyun, Chen, Wenqing, Tian, Jidong, Li, Yitian, He, Hao, Jin, Yaohui

When modeling related tasks in computer vision, Multi-Task Learning (MTL) can outperform Single-Task Learning (STL) due to its ability to capture intrinsic relatedness among tasks. However, MTL may encounter the insufficient training problem, i.e., s

Externí odkaz: http://arxiv.org/abs/2302.09352

Zobrazit plný text záznamu

Report

Improving the Out-Of-Distribution Generalization Capability of Language Models: Counterfactually-Augmented Data is not Enough

Autor: Fan, Caoyun, Chen, Wenqing, Tian, Jidong, Li, Yitian, He, Hao, Jin, Yaohui

Counterfactually-Augmented Data (CAD) has the potential to improve language models' Out-Of-Distribution (OOD) generalization capability, as CAD induces language models to exploit causal features and exclude spurious correlations. However, the empiric

Externí odkaz: http://arxiv.org/abs/2302.09345

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání