Výsledky vyhledávání

Report

Semi-Markovian Planning to Coordinate Aerial and Maritime Medical Evacuation Platforms

Autor: Al-Husseini, Mahdi, Wray, Kyle H., Kochenderfer, Mykel J.

The transfer of patients between two aircraft using an underway watercraft increases medical evacuation reach and flexibility in maritime environments. The selection of any one of multiple underway watercraft for patient exchange is complicated by pa

Externí odkaz: http://arxiv.org/abs/2410.04523

Zobrazit plný text záznamu

Report

Rao-Blackwellized POMDP Planning

Autor: Lee, Jiho, Ahmed, Nisar R., Wray, Kyle H., Sunberg, Zachary N.

Partially Observable Markov Decision Processes (POMDPs) provide a structured framework for decision-making under uncertainty, but their application requires efficient belief updates. Sequential Importance Resampling Particle Filters (SIRPF), also kno

Externí odkaz: http://arxiv.org/abs/2409.16392

Zobrazit plný text záznamu

Report

Watercraft as Overwater Ambulance Exchange Points to Enhance Aeromedical Evacuation

Autor: Al-Husseini, Mahdi, Wray, Kyle H., Kochenderfer, Mykel J.

Ambulance exchange points are preidentified sites where patients are transferred between evacuation platforms while en route to enhanced medical care. We propose a new capability for maritime medical evacuation, which involves co-opting underway wate

Externí odkaz: http://arxiv.org/abs/2408.13847

Zobrazit plný text záznamu

Report

Hierarchical Framework for Optimizing Wildfire Surveillance and Suppression using Human-Autonomous Teaming

Autor: Al-Husseini, Mahdi, Wray, Kyle, Kochenderfer, Mykel

The integration of manned and unmanned aircraft can help improve wildfire response. Wildfire containment failures occur when resources available to first responders, who execute the initial stages of wildfire management referred to as the initial att

Externí odkaz: http://arxiv.org/abs/2406.17189

Zobrazit plný text záznamu

Report

Entropy-regularized Point-based Value Iteration

Autor: Delecki, Harrison, Vazquez-Chanlatte, Marcell, Yel, Esen, Wray, Kyle, Arnon, Tomer, Witwicki, Stefan, Kochenderfer, Mykel J.

Model-based planners for partially observable problems must accommodate both model uncertainty during planning and goal uncertainty during objective inference. However, model-based planners may be brittle under these types of uncertainty because they

Externí odkaz: http://arxiv.org/abs/2402.09388

Zobrazit plný text záznamu

Report

Decision Making in Non-Stationary Environments with Policy-Augmented Search

Autor: Pettet, Ava, Zhang, Yunuo, Luo, Baiting, Wray, Kyle, Baier, Hendrik, Laszka, Aron, Dubey, Abhishek, Mukhopadhyay, Ayan

Sequential decision-making under uncertainty is present in many important problems. Two popular approaches for tackling such problems are reinforcement learning and online search (e.g., Monte Carlo tree search). While the former learns a policy by in

Externí odkaz: http://arxiv.org/abs/2401.03197

Zobrazit plný text záznamu

Report

Constrained Hierarchical Monte Carlo Belief-State Planning

Autor: Jamgochian, Arec, Buurmeijer, Hugo, Wray, Kyle H., Corso, Anthony, Kochenderfer, Mykel J.

Optimal plans in Constrained Partially Observable Markov Decision Processes (CPOMDPs) maximize reward objectives while satisfying hard cost constraints, generalizing safe planning under state and transition uncertainty. Unfortunately, online CPOMDP p

Externí odkaz: http://arxiv.org/abs/2310.20054

Zobrazit plný text záznamu

Report

Active teacher selection for reinforcement learning from human feedback

Autor: Freedman, Rachel, Svegliato, Justin, Wray, Kyle, Russell, Stuart

Reinforcement learning from human feedback (RLHF) enables machine learning systems to learn objectives from human feedback. A core limitation of these systems is their assumption that all feedback comes from a single human teacher, despite querying a

Externí odkaz: http://arxiv.org/abs/2310.15288

Zobrazit plný text záznamu

Report

Experience Filter: Using Past Experiences on Unseen Tasks or Environments

Autor: Yildiz, Anil, Yel, Esen, Corso, Anthony L., Wray, Kyle H., Witwicki, Stefan J., Kochenderfer, Mykel J.

One of the bottlenecks of training autonomous vehicle (AV) agents is the variability of training environments. Since learning optimal policies for unseen environments is often very costly and requires substantial data collection, it becomes computati

Externí odkaz: http://arxiv.org/abs/2305.18633

Zobrazit plný text záznamu

Report

Multi-Objective Policy Gradients with Topological Constraints

Autor: Wray, Kyle Hollins, Tiomkin, Stas, Kochenderfer, Mykel J., Abbeel, Pieter

Multi-objective optimization models that encode ordered sequential constraints provide a solution to model various challenging problems including encoding preferences, modeling a curriculum, and enforcing measures of safety. A recently developed theo

Externí odkaz: http://arxiv.org/abs/2209.07096

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání