Výsledky vyhledávání - "Shpilman, Aleksei"

Report

Data-Driven Short-Term Daily Operational Sea Ice Regional Forecasting

Autor: Grigoryev, Timofey, Verezemskaya, Polina, Krinitskiy, Mikhail, Anikin, Nikita, Gavrikov, Alexander, Trofimov, Ilya, Balabin, Nikita, Shpilman, Aleksei, Eremchenko, Andrei, Gulev, Sergey, Burnaev, Evgeny, Vanovskiy, Vladimir

Global warming made the Arctic available for marine operations and created demand for reliable operational sea ice forecasts to make them safe. While ocean-ice numerical models are highly computationally intensive, relatively lightweight ML-based met

Externí odkaz: http://arxiv.org/abs/2210.08877

Zobrazit plný text záznamu

Report

Scalable Multi-Agent Model-Based Reinforcement Learning

Autor: Egorov, Vladimir, Shpilman, Aleksei

Recent Multi-Agent Reinforcement Learning (MARL) literature has been largely focused on Centralized Training with Decentralized Execution (CTDE) paradigm. CTDE has been a dominant approach for both cooperative and mixed environments due to its capabi

Externí odkaz: http://arxiv.org/abs/2205.15023

Zobrazit plný text záznamu

Report

Traffic4cast at NeurIPS 2021 -- Temporal and Spatial Few-Shot Transfer Learning in Gridded Geo-Spatial Processes

The IARAI Traffic4cast competitions at NeurIPS 2019 and 2020 showed that neural networks can successfully predict future traffic conditions 1 hour into the future on simply aggregated GPS probe data in time and space bins. We thus reinterpreted the c

Externí odkaz: http://arxiv.org/abs/2203.17070

Zobrazit plný text záznamu

Report

Self-Imitation Learning from Demonstrations

Autor: Pshikhachev, Georgiy, Ivanov, Dmitry, Egorov, Vladimir, Shpilman, Aleksei

Despite the numerous breakthroughs achieved with Reinforcement Learning (RL), solving environments with sparse rewards remains a challenging task that requires sophisticated exploration. Learning from Demonstrations (LfD) remedies this issue by guidi

Externí odkaz: http://arxiv.org/abs/2203.10905

Zobrazit plný text záznamu

Report

Improving State-of-the-Art in One-Class Classification by Leveraging Unlabeled Data

Autor: Bagirov, Farid, Ivanov, Dmitry, Shpilman, Aleksei

When dealing with binary classification of data with only one labeled class data scientists employ two main approaches, namely One-Class (OC) classification and Positive Unlabeled (PU) learning. The former only learns from labeled positive data, wher

Externí odkaz: http://arxiv.org/abs/2203.07206

Zobrazit plný text záznamu

Report

MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned

Reinforcement learning competitions advance the field by providing appropriate scope and support to develop solutions toward a specific problem. To promote the development of more broadly applicable methods, organizers need to enforce the use of gene

Externí odkaz: http://arxiv.org/abs/2202.10583

Zobrazit plný text záznamu

Report

Maximum Entropy Model-based Reinforcement Learning

Autor: Svidchenko, Oleg, Shpilman, Aleksei

Recent advances in reinforcement learning have demonstrated its ability to solve hard agent-environment interaction tasks on a super-human level. However, the application of reinforcement learning methods to practical and real-world tasks is currentl

Externí odkaz: http://arxiv.org/abs/2112.01195

Zobrazit plný text záznamu

Report

Simple End-to-end Deep Learning Model for CDR-H3 Loop Structure Prediction

Autor: Zenkova, Natalia, Sedykh, Ekaterina, Shugaeva, Tatiana, Strashko, Vladislav, Ermak, Timofei, Shpilman, Aleksei

Predicting a structure of an antibody from its sequence is important since it allows for a better design process of synthetic antibodies that play a vital role in the health industry. Most of the structure of an antibody is conservative. The most var

Externí odkaz: http://arxiv.org/abs/2111.10656

Zobrazit plný text záznamu

Report

Solving Traffic4Cast Competition with U-Net and Temporal Domain Adaptation

Autor: Konyakhin, Vsevolod, Lukashina, Nina, Shpilman, Aleksei

In this technical report, we present our solution to the Traffic4Cast 2021 Core Challenge, in which participants were asked to develop algorithms for predicting a traffic state 60 minutes ahead, based on the information from the previous hour, in 4 d

Externí odkaz: http://arxiv.org/abs/2111.03421

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání