Výsledky vyhledávání - "Bertsekas, Dimitri P."

Report

Superior Computer Chess with Model Predictive Control, Reinforcement Learning, and Rollout

Autor: Gundawar, Atharva, Li, Yuchao, Bertsekas, Dimitri

In this paper we apply model predictive control (MPC), rollout, and reinforcement learning (RL) methodologies to computer chess. We introduce a new architecture for move selection, within which available chess engines are used as components. One engi

Externí odkaz: http://arxiv.org/abs/2409.06477

Zobrazit plný text záznamu

Report

Model Predictive Control and Reinforcement Learning: A Unified Framework Based on Dynamic Programming

Autor: Bertsekas, Dimitri P.

In this paper we describe a new conceptual framework that connects approximate Dynamic Programming (DP), Model Predictive Control (MPC), and Reinforcement Learning (RL). This framework centers around two algorithms, which are designed largely indepen

Externí odkaz: http://arxiv.org/abs/2406.00592

Zobrazit plný text záznamu

Report

An Approximate Dynamic Programming Framework for Occlusion-Robust Multi-Object Tracking

Autor: Musunuru, Pratyusha, Li, Yuchao, Weber, Jamison, Bertsekas, Dimitri

In this work, we consider data association problems involving multi-object tracking (MOT). In particular, we address the challenges arising from object occlusions. We propose a framework called approximate dynamic programming track (ADPTrack), which

Externí odkaz: http://arxiv.org/abs/2405.15137

Zobrazit plný text záznamu

Report

Most Likely Sequence Generation for $n$-Grams, Transformers, HMMs, and Markov Chains, by Using Rollout Algorithms

Autor: Li, Yuchao, Bertsekas, Dimitri

In this paper we consider a transformer with an $n$-gram structure, such as the one underlying ChatGPT. The transformer provides next word probabilities, which can be used to generate word sequences. We consider methods for computing word sequences t

Externí odkaz: http://arxiv.org/abs/2403.15465

Zobrazit plný text záznamu

Report

Approximate Multiagent Reinforcement Learning for On-Demand Urban Mobility Problem on a Large Map (extended version)

Autor: Garces, Daniel, Bhattacharya, Sushmita, Bertsekas, Dimitri, Gil, Stephanie

In this paper, we focus on the autonomous multiagent taxi routing problem for a large urban environment where the location and number of future ride requests are unknown a-priori, but can be estimated by an empirical distribution. Recent theory has s

Externí odkaz: http://arxiv.org/abs/2311.01534

Zobrazit plný text záznamu

Report

New Auction Algorithms for the Assignment Problem and Extensions

Autor: Bertsekas, Dimitri

We consider the classical linear assignment problem, and we introduce new auction algorithms for its optimal and suboptimal solution. The algorithms are founded on duality theory, and are related to ideas of competitive bidding by persons for objects

Externí odkaz: http://arxiv.org/abs/2310.03159

Zobrazit plný text záznamu

Report

Distributed Online Rollout for Multivehicle Routing in Unmapped Environments

Autor: Weber, Jamison W., Giriyan, Dhanush R., Parkar, Devendra R., Bertsekas, Dimitri P., Richa, Andréa W.

In this work we consider a generalization of the well-known multivehicle routing problem: given a network, a set of agents occupying a subset of its nodes, and a set of tasks, we seek a minimum cost sequence of movements subject to the constraint tha

Externí odkaz: http://arxiv.org/abs/2305.15596

Zobrazit plný text záznamu

Report

Rollout Algorithms and Approximate Dynamic Programming for Bayesian Optimization and Sequential Estimation

Autor: Bertsekas, Dimitri

We provide a unifying approximate dynamic programming framework that applies to a broad variety of problems involving sequential estimation. We consider first the construction of surrogate cost functions for the purposes of optimization, and we focus

Externí odkaz: http://arxiv.org/abs/2212.07998

Zobrazit plný text záznamu

Report

Multiagent Reinforcement Learning for Autonomous Routing and Pickup Problem with Adaptation to Variable Demand

Autor: Garces, Daniel, Bhattacharya, Sushmita, Gil, Stephanie, Bertsekas, Dimitri

We derive a learning framework to generate routing/pickup policies for a fleet of autonomous vehicles tasked with servicing stochastically appearing requests on a city map. We focus on policies that 1) give rise to coordination amongst the vehicles,

Externí odkaz: http://arxiv.org/abs/2211.14983

Zobrazit plný text záznamu

Report

Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach

Autor: Bhambri, Siddhant, Bhattacharjee, Amrita, Bertsekas, Dimitri

In this paper we address the solution of the popular Wordle puzzle, using new reinforcement learning methods, which apply more generally to adaptive control of dynamic systems and to classes of Partially Observable Markov Decision Process (POMDP) pro

Externí odkaz: http://arxiv.org/abs/2211.10298

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání