Zobrazeno 1 - 2
of 2
pro vyhledávání: '"Ramos, Joao A. Candido"'
In this paper, we introduce MAAD, a novel, sample-efficient on-policy algorithm for Imitation Learning from Observations. MAAD utilizes a surrogate reward signal, which can be derived from various sources such as adversarial games, trajectory matchin
Externí odkaz:
http://arxiv.org/abs/2306.09805
In this work, we want to learn to model the dynamics of similar yet distinct groups of interacting objects. These groups follow some common physical laws that exhibit specificities that are captured through some vectorial description. We develop a mo
Externí odkaz:
http://arxiv.org/abs/2106.11083