Search results - "Jordan P. M."

Report

Two-dimensional simulations of disks in close binaries: Simulating outburst cycles in cataclysmic variables

Authors: Jordan, Lucas M.; Wehner, Dennis; Kuiper, Rolf

Previous simulations of cataclysmic variables studied either the quiescence, or the outburst state in multiple dimensions or they simulated complete outburst cycles in one dimension using simplified models for the gravitational torques. We self-consi

External link: http://arxiv.org/abs/2407.16610

Report

Position: Benchmarking is Limited in Reinforcement Learning Research

Authors: Jordan, Scott M.; White, Adam; da Silva, Bruno Castro; White, Martha; Thomas, Philip S.

Novel reinforcement learning algorithms, or improvements on existing ones, are commonly justified by evaluating their performance on benchmark environments and are compared to an ever-changing set of standard algorithms. However, despite numerous cal

External link: http://arxiv.org/abs/2406.16241

Report

A New View on Planning in Online Reinforcement Learning

Authors: Roice, Kevin; Panahi, Parham Mohammad; Jordan, Scott M.; White, Adam; White, Martha

This paper investigates a new approach to model-based reinforcement learning using background planning: mixing (approximate) dynamic programming updates and model-free updates, similar to the Dyna architecture. Background planning with learned models

External link: http://arxiv.org/abs/2406.01562

Report

On the two-dimensional Brillouin flow

Authors: Revolinsky, Ryan A.; Swenson, Christopher J.; Jordan, Nicholas M.; Lau, Y. Y.; Gilgenbach, Ronald M.

The Brillouin flow is a rectilinear, sheared electron fluid flow in a crossed electric field (E) and magnetic field (B), in the E x B direction with zero flow velocity and zero electric field at the surface with which the flow is in contact. It is br

External link: http://arxiv.org/abs/2404.11047

Report

Enhancing interferometry using weak value amplification with real weak values

Authors: Huang, Jing-Hui; Jordan, Kyle M.; Dada, Adetunmise C.; Hu, Xiang-Yun; Lundeen, Jeff S.

We introduce an ultra-sensitive interferometric protocol that combines weak value amplification (WVA) with traditional interferometry. This WVA+interferometry protocol uses weak value amplification of the relative delay between two paths to enhance t

External link: http://arxiv.org/abs/2404.00531

Report

FargoCPT: A 2D Multi-Physics Code for Simulating the Interaction of Disks with Stars, Planets and Particles

Authors: Rometsch, Thomas; Jordan, Lucas M.; Moldenhauer, Tobias W.; Wehner, Dennis; Restrepo, Steven Rendon; Müller, Tobias W. A.; Picogna, Giovanni; Kley, Wilhelm; Dullemond, Cornelis P.

Context: Planet-disk interactions play a crucial role in the understanding of planet formation and disk evolution. There are multiple numerical tools available to simulate these interactions, including the often-used FARGO code and its variants. Many

External link: http://arxiv.org/abs/2401.16203

Report

From Past to Future: Rethinking Eligibility Traces

Authors: Gupta, Dhawal; Jordan, Scott M.; Chaudhari, Shreyas; Liu, Bo; Thomas, Philip S.; da Silva, Bruno Castro

In this paper, we introduce a fresh perspective on the challenges of credit assignment and policy evaluation. First, we delve into the nuances of eligibility traces and explore instances where their updates may result in unexpected credit assignment

External link: http://arxiv.org/abs/2312.12972

Report

Behavior Alignment via Reward Function Optimization

Authors: Gupta, Dhawal; Chandak, Yash; Jordan, Scott M.; Thomas, Philip S.; da Silva, Bruno Castro

Designing reward functions for efficiently guiding reinforcement learning (RL) agents toward specific behaviors is a complex task. This is challenging since it requires the identification of reward structures that are not sparse and that avoid inadve

External link: http://arxiv.org/abs/2310.19007

Report

Acoustic singular surfaces in an exponential class of inhomogeneous gases: A new numerical approach based on Krylov subspace spectral methodologies

Authors: Rester, Bailey; Lambers, James V.; Jordan, Pedro M.

We investigate the propagation of acoustic singular surfaces, specifically, linear shock waves and nonlinear acceleration waves, in a class of inhomogeneous gases whose ambient mass density varies exponentially. Employing the mathematical tools of si

External link: http://arxiv.org/abs/2306.04611

Report

Coagent Networks: Generalized and Scaled

Authors: Kostas, James E.; Jordan, Scott M.; Chandak, Yash; Theocharous, Georgios; Gupta, Dhawal; White, Martha; da Silva, Bruno Castro; Thomas, Philip S.

Coagent networks for reinforcement learning (RL) [Thomas and Barto, 2011] provide a powerful and flexible framework for deriving principled learning rules for arbitrary stochastic neural networks. The coagent framework offers an alternative to backpr

External link: http://arxiv.org/abs/2305.09838
