Výsledky vyhledávání - "Buffet, Olivier"

Report

How to Exhibit More Predictable Behaviors

Autor: Lepers, Salomé, Lemonnier, Sophie, Thomas, Vincent, Buffet, Olivier

This paper looks at predictability problems, i.e., wherein an agent must choose its strategy in order to optimize the predictions that an external observer could make. We address these problems while taking into account uncertainties on the environme

Externí odkaz: http://arxiv.org/abs/2404.11296

Zobrazit plný text záznamu

Report

Solving Hierarchical Information-Sharing Dec-POMDPs: An Extensive-Form Game Approach

Autor: Peralez, Johan, Delage, Aurélien, Buffet, Olivier, Dibangoye, Jilles S.

A recent theory shows that a multi-player decentralized partially observable Markov decision process can be transformed into an equivalent single-player game, enabling the application of \citeauthor{bellman}'s principle of optimality to solve the sin

Externí odkaz: http://arxiv.org/abs/2402.02954

Zobrazit plný text záznamu

Report

Monte-Carlo Search for an Equilibrium in Dec-POMDPs

Autor: You, Yang, Thomas, Vincent, Colas, Francis, Buffet, Olivier

Decentralized partially observable Markov decision processes (Dec-POMDPs) formalize the problem of designing individual controllers for a group of collaborative agents under stochastic dynamics and partial observability. Seeking a global optimum is d

Externí odkaz: http://arxiv.org/abs/2305.11811

Zobrazit plný text záznamu

Report

Robust Robot Planning for Human-Robot Collaboration

Autor: You, Yang, Thomas, Vincent, Colas, Francis, Alami, Rachid, Buffet, Olivier

In human-robot collaboration, the objectives of the human are often unknown to the robot. Moreover, even assuming a known objective, the human behavior is also uncertain. In order to plan a robust robot behavior, a key preliminary question is then: H

Externí odkaz: http://arxiv.org/abs/2302.13916

Zobrazit plný text záznamu

Report

HSVI can solve zero-sum Partially Observable Stochastic Games

Autor: Delage, Aurélien, Buffet, Olivier, Dibangoye, Jilles S., Saffidine, Abdallah

State-of-the-art methods for solving 2-player zero-sum imperfect information games rely on linear programming or regret minimization, though not on dynamic programming (DP) or heuristic search (HS), while the latter are often at the core of state-of-

Externí odkaz: http://arxiv.org/abs/2210.14640

Zobrazit plný text záznamu

Report

HSVI for zs-POSGs using Concavity, Convexity and Lipschitz Properties

Autor: Delage, Aurélien, Buffet, Olivier, Dibangoye, Jilles

Dynamic programming and heuristic search are at the core of state-of-the-art solvers for sequential decision-making problems. In partially observable or collaborative settings (\eg, POMDPs and Dec-POMDPs), this requires introducing an appropriate sta

Externí odkaz: http://arxiv.org/abs/2110.14529

Zobrazit plný text záznamu

Report

Solving infinite-horizon Dec-POMDPs using Finite State Controllers within JESP

Autor: You, Yang, Thomas, Vincent, Colas, Francis, Buffet, Olivier

This paper looks at solving collaborative planning problems formalized as Decentralized POMDPs (Dec-POMDPs) by searching for Nash equilibria, i.e., situations where each agent's policy is a best response to the other agents' (fixed) policies. While t

Externí odkaz: http://arxiv.org/abs/2109.08755

Zobrazit plný text záznamu

Report

Monte Carlo Information-Oriented Planning

Autor: Thomas, Vincent, Hutin, Gérémy, Buffet, Olivier

In this article, we discuss how to solve information-gathering problems expressed as rho-POMDPs, an extension of Partially Observable Markov Decision Processes (POMDPs) whose reward rho depends on the belief state. Point-based approaches used for sol

Externí odkaz: http://arxiv.org/abs/2103.11345

Zobrazit plný text záznamu

Report

On Bellman's Optimality Principle for zs-POSGs

Autor: Buffet, Olivier, Dibangoye, Jilles, Delage, Aurélien, Saffidine, Abdallah, Thomas, Vincent

Many non-trivial sequential decision-making problems are efficiently solved by relying on Bellman's optimality principle, i.e., exploiting the fact that sub-problems are nested recursively within the original problem. Here we show how it can apply to

Externí odkaz: http://arxiv.org/abs/2006.16395

Zobrazit plný text záznamu

Report

Reinforcement Learning

Autor: Buffet, Olivier, Pietquin, Olivier, Weng, Paul

Reinforcement learning (RL) is a general framework for adaptive control, which has proven to be efficient in many domains, e.g., board games, video games or autonomous vehicles. In such problems, an agent faces a sequential decision-making problem wh

Externí odkaz: http://arxiv.org/abs/2005.14419

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání