Showing 1 - 3 of 3 for search: '"Zhao, Zirui"'
Chain-of-thought (CoT), tree-of-thought (ToT), and related techniques work surprisingly well in practice for some complex reasoning tasks with Large Language Models (LLMs), but why? This work seeks the underlying reasons by conducting experimental ca
External link:
http://arxiv.org/abs/2404.11041
We present a new method, PARsing And visual GrOuNding (ParaGon), for grounding natural language in object placement tasks. Natural language generally describes objects and spatial relations with compositionality and ambiguity, two major obstacles to
External link:
http://arxiv.org/abs/2210.00215
One typical assumption in inverse reinforcement learning (IRL) is that human experts act to optimize the expected utility of a stochastic cost with a fixed distribution. This assumption deviates from actual human behaviors under ambiguity. Risk-sensi
External link:
http://arxiv.org/abs/1909.07843