Showing 1 - 10 of 82
for the search: '"Wulfmeier, Markus"'
Author:
Wulfmeier, Markus, Bloesch, Michael, Vieillard, Nino, Ahuja, Arun, Bornschein, Jorg, Huang, Sandy, Sokolov, Artem, Barnes, Matt, Desjardins, Guillaume, Bewley, Alex, Bechtle, Sarah Maria Elisabeth, Springenberg, Jost Tobias, Momchev, Nikola, Bachem, Olivier, Geist, Matthieu, Riedmiller, Martin
The majority of language model training builds on imitation learning. It covers pretraining, supervised fine-tuning, and affects the starting conditions for reinforcement learning from human feedback (RLHF). The simplicity and scalability of maximum …
External link:
http://arxiv.org/abs/2409.01369
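The objective this abstract refers to can be made concrete. The sketch below (illustrative, pure Python; the function name and toy inputs are not from the paper) computes the average negative log-likelihood of target tokens, which is exactly what maximum-likelihood imitation — pretraining and supervised fine-tuning alike — minimizes over demonstration data:

```python
import math

def nll_loss(logits, target_ids):
    """Average negative log-likelihood (cross-entropy) of target tokens.

    logits: list of per-token score vectors, one row per position.
    target_ids: the demonstrated token index at each position.
    """
    total = 0.0
    for row, t in zip(logits, target_ids):
        m = max(row)  # shift by the max for numerical stability
        log_z = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_z - row[t]  # -log p(target | context)
    return total / len(target_ids)

# Uniform logits over a 4-token vocabulary give a loss of ln(4).
print(nll_loss([[0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0]], [1, 3]))
```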
Author:
Tirumala, Dhruva, Wulfmeier, Markus, Moran, Ben, Huang, Sandy, Humplik, Jan, Lever, Guy, Haarnoja, Tuomas, Hasenclever, Leonard, Byravan, Arunkumar, Batchelor, Nathan, Sreendra, Neil, Patel, Kushal, Gwira, Marlon, Nori, Francesco, Riedmiller, Martin, Heess, Nicolas
We apply multi-agent deep reinforcement learning (RL) to train end-to-end robot soccer policies with fully onboard computation and sensing via egocentric RGB vision. This setting reflects many challenges of real-world robotics, including active perce…
External link:
http://arxiv.org/abs/2405.02425
Recent reinforcement learning approaches have shown surprisingly strong capabilities of bang-bang policies for solving continuous control benchmarks. The underlying coarse action space discretizations often yield favourable exploration characteristic…
External link:
http://arxiv.org/abs/2404.04253
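The coarse discretization the abstract mentions is easy to illustrate: restricting each continuous action dimension to its two bounds yields 2^d extreme ("bang-bang") actions. A minimal sketch (illustrative names; not code from the paper):

```python
from itertools import product

def bang_bang_actions(low, high):
    """Enumerate the 2^d extreme ('bang-bang') actions of a box action space.

    low, high: per-dimension bounds of the continuous action space.
    Discretizing each dimension to its two bounds turns a continuous
    control problem into a small discrete-action one.
    """
    # zip pairs each dimension's (low, high); product takes every combination.
    return [list(a) for a in product(*zip(low, high))]

# A 2-D torque space [-1, 1]^2 yields four extreme actions.
print(bang_bang_actions([-1.0, -1.0], [1.0, 1.0]))
```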
Author:
Bhardwaj, Mohak, Lampe, Thomas, Neunert, Michael, Romano, Francesco, Abdolmaleki, Abbas, Byravan, Arunkumar, Wulfmeier, Markus, Riedmiller, Martin, Buchli, Jonas
Recent advances in real-world applications of reinforcement learning (RL) have relied on the ability to accurately simulate systems at scale. However, domains such as fluid dynamical systems exhibit complex dynamic phenomena that are hard to simulate…
External link:
http://arxiv.org/abs/2402.06102
Author:
Lampe, Thomas, Abdolmaleki, Abbas, Bechtle, Sarah, Huang, Sandy H., Springenberg, Jost Tobias, Bloesch, Michael, Groth, Oliver, Hafner, Roland, Hertweck, Tim, Neunert, Michael, Wulfmeier, Markus, Zhang, Jingwei, Nori, Francesco, Heess, Nicolas, Riedmiller, Martin
Reinforcement learning solely from an agent's self-generated data is often believed to be infeasible for learning on real robots, due to the amount of data needed. However, if done right, agents learning from real data can be surprisingly efficient t…
External link:
http://arxiv.org/abs/2312.11374
Contemporary artificial intelligence systems exhibit rapidly growing abilities accompanied by the growth of required resources, expansive datasets and corresponding investments into computing infrastructure. Although earlier successes predominantly f…
External link:
http://arxiv.org/abs/2312.01939
Author:
Pinneri, Cristina, Bechtle, Sarah, Wulfmeier, Markus, Byravan, Arunkumar, Zhang, Jingwei, Whitney, William F., Riedmiller, Martin
We present a novel approach to address the challenge of generalization in offline reinforcement learning (RL), where the agent learns from a fixed dataset without any additional interaction with the environment. Specifically, we aim to improve the ag…
External link:
http://arxiv.org/abs/2309.07578
Author:
Gürtler, Nico, Widmaier, Felix, Sancaktar, Cansu, Blaes, Sebastian, Kolev, Pavel, Bauer, Stefan, Wüthrich, Manuel, Wulfmeier, Markus, Riedmiller, Martin, Allshire, Arthur, Wang, Qiang, McCarthy, Robert, Kim, Hangyeol, Baek, Jongchan, Kwon, Wookyong, Qian, Shanliang, Toshimitsu, Yasunori, Michelis, Mike Yan, Kazemipour, Amirhossein, Raayatsanati, Arman, Zheng, Hehui, Cangan, Barnabas Gavin, Schölkopf, Bernhard, Martius, Georg
Experimentation on real robots is demanding in terms of time and costs. For this reason, a large part of the reinforcement learning (RL) community uses simulators to develop and benchmark algorithms. However, insights gained in simulation do not nece…
External link:
http://arxiv.org/abs/2308.07741
Author:
Di Palo, Norman, Byravan, Arunkumar, Hasenclever, Leonard, Wulfmeier, Markus, Heess, Nicolas, Riedmiller, Martin
Language Models and Vision Language Models have recently demonstrated unprecedented capabilities in terms of understanding human intentions, reasoning, scene understanding, and planning-like behaviour, in text form, among many others. In this work, w…
External link:
http://arxiv.org/abs/2307.09668
Author:
Barnes, Matt, Abueg, Matthew, Lange, Oliver F., Deeds, Matt, Trader, Jason, Molitor, Denali, Wulfmeier, Markus, O'Banion, Shawn
Inverse reinforcement learning (IRL) offers a powerful and general framework for learning humans' latent preferences in route recommendation, yet no approach has successfully addressed planetary-scale problems with hundreds of millions of states and…
External link:
http://arxiv.org/abs/2305.11290
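The core update in many IRL formulations with a linear reward r(s) = θ·φ(s) is feature matching: move θ toward the gap between expert and current-policy expected feature counts. A minimal sketch of one such step (illustrative, pure Python; not the paper's planetary-scale method, whose contribution is making updates like this tractable at scale):

```python
def irl_gradient_step(theta, expert_features, policy_features, lr=0.1):
    """One ascent step for a linear reward r(s) = theta . phi(s).

    expert_features / policy_features: lists of feature vectors phi(s)
    visited by the expert and by the current policy, respectively.
    The gradient is the difference of their mean feature counts.
    """
    d = len(theta)

    def mean(rows, i):
        return sum(r[i] for r in rows) / len(rows)

    return [theta[i] + lr * (mean(expert_features, i) - mean(policy_features, i))
            for i in range(d)]

# Experts visit feature-0 states more than the current policy does,
# so the reward weight on feature 0 increases.
print(irl_gradient_step([0.0, 0.0], [[1.0, 0.0], [1.0, 0.0]],
                        [[0.0, 0.0], [0.0, 0.0]], lr=0.5))
```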