Showing 1 - 9 of 9 for search: '"Midgley, Laurence"'
Author:
Zhang, Fengzhe, He, Jiajun, Midgley, Laurence I., Antorán, Javier, Hernández-Lobato, José Miguel
Diffusion models have shown promising potential for advancing Boltzmann Generators. However, two critical challenges persist: (1) inherent errors in samples due to model imperfections, and (2) the requirement of hundreds of functional evaluations (NF…
External link:
http://arxiv.org/abs/2409.07323
Author:
Midgley, Laurence I., Stimper, Vincent, Antorán, Javier, Mathieu, Emile, Schölkopf, Bernhard, Hernández-Lobato, José Miguel
Coupling normalizing flows allow for fast sampling and density evaluation, making them the tool of choice for probabilistic modeling of physical systems. However, the standard coupling architecture precludes endowing flows that operate on the Cartesi…
External link:
http://arxiv.org/abs/2308.10364
Author:
Bonnet, Clément, Luo, Daniel, Byrne, Donal, Surana, Shikha, Abramowitz, Sasha, Duckworth, Paul, Coyette, Vincent, Midgley, Laurence I., Tegegn, Elshadai, Kalloniatis, Tristan, Mahjoub, Omayma, Macfarlane, Matthew, Smit, Andries P., Grinsztajn, Nathan, Boige, Raphael, Waters, Cemlyn N., Mimouni, Mohamed A., Sob, Ulrich A. Mbou, de Kock, Ruan, Singh, Siddarth, Furelos-Blanco, Daniel, Le, Victor, Pretorius, Arnu, Laterre, Alexandre
Open-source reinforcement learning (RL) environments have played a crucial role in driving progress in the development of AI algorithms. In modern RL research, there is a need for simulated environments that are performant, scalable, and modular to e…
External link:
http://arxiv.org/abs/2306.09884
Meta-gradient Reinforcement Learning (RL) allows agents to self-tune their hyper-parameters in an online fashion during training. In this paper, we identify a bias in the meta-gradient of current meta-gradient RL approaches. This bias comes from usin…
External link:
http://arxiv.org/abs/2211.10550
This paper shows the implementation of reinforcement learning (RL) in commercial flowsheet simulator software (Aspen Plus V12) for designing and optimising a distillation sequence. The aim of the SAC agent was to separate a hydrocarbon mixture in its…
External link:
http://arxiv.org/abs/2211.04327
Author:
Midgley, Laurence Illing, Stimper, Vincent, Simm, Gregor N. C., Schölkopf, Bernhard, Hernández-Lobato, José Miguel
Normalizing flows are tractable density models that can approximate complicated target distributions, e.g. Boltzmann distributions of physical systems. However, current methods for training flows either suffer from mode-seeking behavior, use samples…
External link:
http://arxiv.org/abs/2208.01893
Author:
Midgley, Laurence Illing, Stimper, Vincent, Simm, Gregor N. C., Hernández-Lobato, José Miguel
Normalizing flows are flexible, parameterized distributions that can be used to approximate expectations from intractable distributions via importance sampling. However, current flow-based approaches are limited on challenging targets where they eith…
External link:
http://arxiv.org/abs/2111.11510
Author:
Midgley, Laurence Illing
This paper demonstrates the application of reinforcement learning (RL) to process synthesis by presenting Distillation Gym, a set of RL environments in which an RL agent is tasked with designing a distillation train, given a user defined multi-compon…
External link:
http://arxiv.org/abs/2009.13265
Author:
Midgley, Laurence, Thomson, Michael
This thesis demonstrates, for the first time, that reinforcement learning (RL) can be applied to chemical engineering process synthesis (sequencing and design of unit operations to generate a process flowsheet). Two case studies were used, with simpl…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ac5dc7616768c0289be81e49a644d4d9