Výsledky vyhledávání

Report

On the Empirical Complexity of Reasoning and Planning in LLMs

Autor: Kang, Liwei, Zhao, Zirui, Hsu, David, Lee, Wee Sun

Chain-of-thought (CoT), tree-of-thought (ToT), and related techniques work surprisingly well in practice for some complex reasoning tasks with Large Language Models (LLMs), but why? This work seeks the underlying reasons by conducting experimental ca

Externí odkaz: http://arxiv.org/abs/2404.11041

Zobrazit plný text záznamu

Report

Differentiable Parsing and Visual Grounding of Natural Language Instructions for Object Placement

Autor: Zhao, Zirui, Lee, Wee Sun, Hsu, David

We present a new method, PARsing And visual GrOuNding (ParaGon), for grounding natural language in object placement tasks. Natural language generally describes objects and spatial relations with compositionality and ambiguity, two major obstacles to

Externí odkaz: http://arxiv.org/abs/2210.00215

Zobrazit plný text záznamu

Report

Active Learning for Risk-Sensitive Inverse Reinforcement Learning

Autor: Chen, Rui, Wang, Wenshuo, Zhao, Zirui, Zhao, Ding

One typical assumption in inverse reinforcement learning (IRL) is that human experts act to optimize the expected utility of a stochastic cost with a fixed distribution. This assumption deviates from actual human behaviors under ambiguity. Risk-sensi

Externí odkaz: http://arxiv.org/abs/1909.07843

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání