Search results - "Heess, Nicolas"

Report

Preference Optimization as Probabilistic Inference

Authors: Abdolmaleki, Abbas; Piot, Bilal; Shahriari, Bobak; Springenberg, Jost Tobias; Hertweck, Tim; Joshi, Rishabh; Oh, Junhyuk; Bloesch, Michael; Lampe, Thomas; Heess, Nicolas; Buchli, Jonas; Riedmiller, Martin

Existing preference optimization methods are mainly designed for directly learning from human feedback with the assumption that paired examples (preferred vs. dis-preferred) are available. In contrast, we propose a method that can leverage unpaired p

External link: http://arxiv.org/abs/2410.04166

Report

DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots

Authors: Bauza, Maria; Chen, Jose Enrique; Dalibard, Valentin; Gileadi, Nimrod; Hafner, Roland; Martins, Murilo F.; Moore, Joss; Pevceviciute, Rugile; Laurens, Antoine; Rao, Dushyant; Zambelli, Martina; Riedmiller, Martin; Scholz, Jon; Bousmalis, Konstantinos; Nori, Francesco; Heess, Nicolas

We present DemoStart, a novel auto-curriculum reinforcement learning method capable of learning complex manipulation behaviors on an arm equipped with a three-fingered robotic hand, from only a sparse reward and a handful of demonstrations in simulat

External link: http://arxiv.org/abs/2409.06613

Report

A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning

Authors: Khetarpal, Khimya; Guo, Zhaohan Daniel; Pires, Bernardo Avila; Tang, Yunhao; Lyle, Clare; Rowland, Mark; Heess, Nicolas; Borsa, Diana; Guez, Arthur; Dabney, Will

Learning a good representation is a crucial challenge for Reinforcement Learning (RL) agents. Self-predictive learning provides means to jointly learn a latent representation and dynamics model by bootstrapping from future latent representations (BYO

External link: http://arxiv.org/abs/2406.02035

Report

Deep Dive into Model-free Reinforcement Learning for Biological and Robotic Systems: Theory and Practice

Authors: Jiao, Yusheng; Ling, Feng; Heydari, Sina; Heess, Nicolas; Merel, Josh; Kanso, Eva

Animals and robots exist in a physical world and must coordinate their bodies to achieve behavioral objectives. With recent developments in deep reinforcement learning, it is now possible for scientists and engineers to obtain sensorimotor strategies

External link: http://arxiv.org/abs/2405.11457

Report

Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning

Authors: Tirumala, Dhruva; Wulfmeier, Markus; Moran, Ben; Huang, Sandy; Humplik, Jan; Lever, Guy; Haarnoja, Tuomas; Hasenclever, Leonard; Byravan, Arunkumar; Batchelor, Nathan; Sreendra, Neil; Patel, Kushal; Gwira, Marlon; Nori, Francesco; Riedmiller, Martin; Heess, Nicolas

We apply multi-agent deep reinforcement learning (RL) to train end-to-end robot soccer policies with fully onboard computation and sensing via egocentric RGB vision. This setting reflects many challenges of real-world robotics, including active perce

External link: http://arxiv.org/abs/2405.02425

Report

The Probabilities Also Matter: A More Faithful Metric for Faithfulness of Free-Text Explanations in Large Language Models

Authors: Siegel, Noah Y.; Camburu, Oana-Maria; Heess, Nicolas; Perez-Ortiz, Maria

In order to oversee advanced AI systems, it is important to understand their underlying decision-making process. When prompted, large language models (LLMs) can provide natural language explanations or reasoning traces that sound plausible and receiv

External link: http://arxiv.org/abs/2404.03189

Report

Genie: Generative Interactive Environments

We introduce Genie, the first generative interactive environment trained in an unsupervised manner from unlabelled Internet videos. The model can be prompted to generate an endless variety of action-controllable virtual worlds described through text,

External link: http://arxiv.org/abs/2402.15391

Report

Learning to Learn Faster from Human Feedback with Language Model Predictive Control

Large language models (LLMs) have been shown to exhibit a wide range of capabilities, such as writing robot code from language commands -- enabling non-experts to direct robot behaviors, modify them based on feedback, or compose them to perform new t

External link: http://arxiv.org/abs/2402.11450

Report

NfgTransformer: Equivariant Representation Learning for Normal-form Games

Authors: Liu, Siqi; Marris, Luke; Piliouras, Georgios; Gemp, Ian; Heess, Nicolas

Normal-form games (NFGs) are the fundamental model of strategic interaction. We study their representation using neural networks. We describe the inherent equivariance of NFGs -- any permutation of strategies describes an equivalent game -- as well a

External link: http://arxiv.org/abs/2402.08393
