Showing 1 - 10 of 137 results for query: '"Song, Linfeng"'
There is a growing trend of teaching large language models (LLMs) to solve mathematical problems through coding. Existing studies primarily focus on prompting powerful, closed-source models to generate seed training data followed by in-domain data…
External link:
http://arxiv.org/abs/2408.15565
Author:
Zhang, Yuheng, Yu, Dian, Peng, Baolin, Song, Linfeng, Tian, Ye, Huo, Mingyue, Jiang, Nan, Mi, Haitao, Yu, Dong
Reinforcement Learning with Human Feedback (RLHF) has achieved great success in aligning large language models (LLMs) with human preferences. Prevalent RLHF approaches are reward-based, following the Bradley-Terry (BT) model assumption, which may not…
External link:
http://arxiv.org/abs/2407.00617
Author:
Wang, Ante, Song, Linfeng, Tian, Ye, Peng, Baolin, Yu, Dian, Mi, Haitao, Su, Jinsong, Yu, Dong
Recent research suggests that tree search algorithms (e.g. Monte Carlo Tree Search) can dramatically boost LLM performance on complex mathematical reasoning tasks. However, they often require more than 10 times the computational resources of greedy…
External link:
http://arxiv.org/abs/2407.00320
Despite the impressive capabilities of Large Language Models (LLMs) on various tasks, they still struggle with scenarios that involve complex reasoning and planning. Recent work proposed advanced prompting techniques and the necessity of fine-tuning…
External link:
http://arxiv.org/abs/2404.12253
Large language models (LLMs) exhibit impressive natural language capabilities but suffer from hallucination -- generating content ungrounded in the realities of training data. Recent work has focused on decoding techniques to improve factuality…
External link:
http://arxiv.org/abs/2404.09338
Published in:
Transactions of the Association for Computational Linguistics, Vol 7, Pp 19-31 (2019)
It is intuitive that semantic representations can be useful for machine translation, mainly because they can help in enforcing meaning preservation and handling data sparsity (many sentences correspond to one meaning) of machine translation models…
External link:
https://doaj.org/article/86d54844cecd4a84ace6a747a8e54b02
Author:
Wang, Ante, Song, Linfeng, Tian, Ye, Peng, Baolin, Jin, Lifeng, Mi, Haitao, Su, Jinsong, Yu, Dong
Calibration, which establishes the correlation between accuracy and model confidence, is important for LLM development. We design three off-the-shelf calibration methods based on self-consistency (Wang et al., 2022) for math reasoning tasks…
External link:
http://arxiv.org/abs/2403.09849
Knowledge-based, open-domain dialogue generation aims to build chit-chat systems that talk to humans using mined support knowledge. Many types and sources of knowledge have previously been shown to be useful as support knowledge. Even in the era of…
External link:
http://arxiv.org/abs/2403.03496
Author:
Huang, Jianheng, Cui, Leyang, Wang, Ante, Yang, Chengyi, Liao, Xinting, Song, Linfeng, Yao, Junfeng, Su, Jinsong
Large language models (LLMs) suffer from catastrophic forgetting during continual learning. Conventional rehearsal-based methods rely on previous training data to retain the model's ability, which may not be feasible in real-world applications…
External link:
http://arxiv.org/abs/2403.01244
The most common training pipeline for large language models includes pretraining, finetuning and aligning phases, with their respective resulting models, such as the pretrained model and the finetuned model. Finetuned and aligned models show improved…
External link:
http://arxiv.org/abs/2402.17982