Showing 1 - 10
of 3 473
for search: '"NEARY, P."'
A run of the deferred acceptance (DA) algorithm may contain proposals that are sure to be rejected. We introduce the accelerated deferred acceptance algorithm that proceeds in a similar manner to DA but with sure-to-be rejected proposals ruled out. A
External link:
http://arxiv.org/abs/2409.06865
We propose and demonstrate a compositional framework for training and verifying reinforcement learning (RL) systems within a multifidelity sim-to-real pipeline, in order to deploy reliable and adaptable RL policies on physical hardware. By decomposin
External link:
http://arxiv.org/abs/2312.01249
In order to coordinate, players in a game must first identify a target pattern of behaviour. In this paper we investigate the difficulty of identifying prominent outcomes in two kinds of binary action coordination problems in social networks: pure coo
External link:
http://arxiv.org/abs/2311.03195
Author:
Wongpiromsarn, Tichakorn; Ghasemi, Mahsa; Cubuktepe, Murat; Bakirtzis, Georgios; Carr, Steven; Karabag, Mustafa O.; Neary, Cyrus; Gohari, Parham; Topcu, Ufuk
Formal methods refer to rigorous, mathematical approaches to system development and have played a key role in establishing the correctness of safety-critical systems. The main building blocks of formal methods are models and specifications, which are
External link:
http://arxiv.org/abs/2311.01258
Author:
Mark D. Rodefeld, MD, Timothy Conover, PhD, Richard Figliola, PhD, Mike Neary, MS, Guruprasad Giridharan, PhD, Artem Ivashchenko, MEng, Edward M. Bennett, PhD
Published in:
JTCVS Open, Vol 21, Pp 257-266 (2024)
Objective: After Fontan palliation, patients with single-ventricle physiology are committed to chronic circulatory inefficiency for the duration of their lives. This is due in large part to the lack of a subpulmonary ventricle. A low-pressure rise ca
External link:
https://doaj.org/article/2dd496fbebdc42f4b217f9ef3b236088
We propose a framework for verifiable and compositional reinforcement learning (RL) in which a collection of RL subsystems, each of which learns to accomplish a separate subtask, are composed to achieve an overall task. The framework consists of a hi
External link:
http://arxiv.org/abs/2309.06420
Recently developed pretrained models can encode rich world knowledge expressed in multiple modalities, such as text and images. However, the outputs of these models cannot be integrated into algorithms to solve sequential decision-making tasks. We de
External link:
http://arxiv.org/abs/2308.05295
We introduce and study a new optimization problem on digraphs, termed Maximum Weighted Digraph Partition (MWDP) problem. We prove three complexity dichotomies for MWDP: on arbitrary digraphs, on oriented digraphs, and on symmetric digraphs. We demons
External link:
http://arxiv.org/abs/2307.01109
We present a framework and algorithms to learn controlled dynamics models using neural stochastic differential equations (SDEs) -- SDEs whose drift and diffusion terms are both parametrized by neural networks. We construct the drift term to leverage
External link:
http://arxiv.org/abs/2306.06335
We investigate the difficulty of finding economically efficient solutions to coordination problems on graphs. Our work focuses on two forms of coordination problem: pure-coordination games and anti-coordination games. We consider three objectives in
External link:
http://arxiv.org/abs/2305.07124