Výsledky vyhledávání

Report

SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement

Autor: Antoniades, Antonis, Örwall, Albert, Zhang, Kexun, Xie, Yuxi, Goyal, Anirudh, Wang, William

Software engineers operating in complex and dynamic environments must continuously adapt to evolving requirements, learn iteratively from experience, and reconsider their approaches based on new insights. However, current large language model (LLM)-b

Externí odkaz: http://arxiv.org/abs/2410.20285

Zobrazit plný text záznamu

Report

COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement

Autor: Xie, Yuxi, Goyal, Anirudh, Wu, Xiaobao, Yin, Xunjian, Xu, Xiao, Kan, Min-Yen, Pan, Liangming, Wang, William Yang

Iterative refinement has emerged as an effective paradigm for enhancing the capabilities of large language models (LLMs) on complex tasks. However, existing approaches typically implement iterative refinement at the application or prompting level, re

Externí odkaz: http://arxiv.org/abs/2410.09675

Zobrazit plný text záznamu

Report

MVP-Bench: Can Large Vision--Language Models Conduct Multi-level Visual Perception Like Humans?

Autor: Li, Guanzhen, Xie, Yuxi, Kan, Min-Yen

Humans perform visual perception at multiple levels, including low-level object recognition and high-level semantic interpretation such as behavior understanding. Subtle differences in low-level details can lead to substantial changes in high-level p

Externí odkaz: http://arxiv.org/abs/2410.04345

Zobrazit plný text záznamu

Report

Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models

Autor: Liu, Hongfu, Xie, Yuxi, Wang, Ye, Shieh, Michael

Language Language Models (LLMs) face safety concerns due to potential misuse by malicious users. Recent red-teaming efforts have identified adversarial suffixes capable of jailbreaking LLMs using the gradient-based search algorithm Greedy Coordinate

Externí odkaz: http://arxiv.org/abs/2408.14866

Zobrazit plný text záznamu

Report

Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning

Autor: Xie, Yuxi, Goyal, Anirudh, Zheng, Wenyue, Kan, Min-Yen, Lillicrap, Timothy P., Kawaguchi, Kenji, Shieh, Michael

We introduce an approach aimed at enhancing the reasoning capabilities of Large Language Models (LLMs) through an iterative preference learning process inspired by the successful strategy employed by AlphaZero. Our work leverages Monte Carlo Tree Sea

Externí odkaz: http://arxiv.org/abs/2405.00451

Zobrazit plný text záznamu

Report

Prompt Optimization via Adversarial In-Context Learning

Autor: Do, Xuan Long, Zhao, Yiran, Brown, Hannah, Xie, Yuxi, Zhao, James Xu, Chen, Nancy F., Kawaguchi, Kenji, Shieh, Michael, He, Junxian

We propose a new method, Adversarial In-Context Learning (adv-ICL), to optimize prompt for in-context learning (ICL) by employing one LLM as a generator, another as a discriminator, and a third as a prompt modifier. As in traditional adversarial lear

Externí odkaz: http://arxiv.org/abs/2312.02614

Zobrazit plný text záznamu

Report

InstructCoder: Instruction Tuning Large Language Models for Code Editing

Autor: Li, Kaixin, Hu, Qisheng, Zhao, Xu, Chen, Hui, Xie, Yuxi, Liu, Tiedong, Xie, Qizhe, He, Junxian

Code editing encompasses a variety of pragmatic tasks that developers deal with daily. Despite its relevance and practical usefulness, automatic code editing remains an underexplored area in the evolution of deep learning models, partly due to data s

Externí odkaz: http://arxiv.org/abs/2310.20329

Zobrazit plný text záznamu

Report

ECHo: A Visio-Linguistic Dataset for Event Causality Inference via Human-Centric Reasoning

Autor: Xie, Yuxi, Li, Guanzhen, Kan, Min-Yen

We introduce ECHo (Event Causality Inference via Human-Centric Reasoning), a diagnostic dataset of event causality inference grounded in visio-linguistic social scenarios. ECHo employs real-world human-centric deductive information building on a tele

Externí odkaz: http://arxiv.org/abs/2305.14740

Zobrazit plný text záznamu

Report

Automatic Model Selection with Large Language Models for Reasoning

Autor: Zhao, James Xu, Xie, Yuxi, Kawaguchi, Kenji, He, Junxian, Xie, Michael Qizhe

Chain-of-Thought (CoT) and Program-Aided Language Models (PAL) represent two distinct reasoning methods, each with its own strengths. CoT employs natural language, offering flexibility and interpretability, while PAL utilizes programming language, yi

Externí odkaz: http://arxiv.org/abs/2305.14333

Zobrazit plný text záznamu

Report

Self-Evaluation Guided Beam Search for Reasoning

Autor: Xie, Yuxi, Kawaguchi, Kenji, Zhao, Yiran, Zhao, Xu, Kan, Min-Yen, He, Junxian, Xie, Qizhe

Breaking down a problem into intermediate steps has demonstrated impressive performance in Large Language Model (LLM) reasoning. However, the growth of the reasoning chain introduces uncertainty and error accumulation, making it challenging to elicit

Externí odkaz: http://arxiv.org/abs/2305.00633

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání