Zobrazeno 1 - 8
of 8
pro vyhledávání: '"Rati Devidze"'
Publikováno v:
IJCAI
Machine teaching studies the interaction between a teacher and a student/learner where the teacher selects training examples for the learner to learn a specific task. The typical assumption is that the teacher has perfect knowledge of the task---this
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::145e2057539c283c6036fca35e380cba
Publikováno v:
IJCAI
We study the problem of inverse reinforcement learning (IRL) with the added twist that the learner is assisted by a helpful teacher. More formally, we tackle the following algorithmic question: How could a teacher provide an informative sequence of d
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::3d120d3f6dfa56e02a9f837445ca14ec
http://arxiv.org/abs/1905.11867
http://arxiv.org/abs/1905.11867
Autor:
CINÀ, ANTONIO EMANUELE1 antonioemanuele.cina@unive.it, GROSSE, KATHRIN2 kathrin.grosse@unica.it, DEMONTIS, AMBRA3 ambra.demontis@unica.it, VASCON, SEBASTIANO4 sebastiano.vascon@unive.it, ZELLINGER, WERNER5 werner.zellinger@scch.at, MOSER, BERNHARD A.5 bernhard.moser@scch.at, OPREA, ALINA6 a.oprea@northeastern.edu, BIGGIO, BATTISTA3,7 battista.biggio@unica.it, PELILLO, MARCELLO1 pelillo@unive.it, ROLI, FABIO7,8 fabio.roli@unige.it
Publikováno v:
ACM Computing Surveys. 2023 Suppl13s, Vol. 55, p1-39. 39p.
Publikováno v:
ACM Computing Surveys; Jun2024, Vol. 56 Issue 6, p1-39, 39p
Publikováno v:
ACM Transactions on Software Engineering & Methodology; May2024, Vol. 33 Issue 4, p1-31, 31p
Autor:
ZHIBO WANG1,2 zhibowang@zju.edu.cn, JINGJING MA1 jingjingma@whu.edu.cn, XUE WANG1 shannonwang@whu.edu.cn, JIAHUI HU2 jiahuihu@zju.edu.cn, ZHAN QIN2 qinzhan@zju.edu.cn, KUI REN2 kuiren@zju.edu.cn
Publikováno v:
ACM Computing Surveys. Jul2023, Vol. 55 Issue 7, p1-36. 36p.
Autor:
A.M. Metelli
In recent decades, Reinforcement Learning (RL) has emerged as an effective approach to address complex control tasks. In a Markov Decision Process (MDP), the framework typically used, the environment is assumed to be a fixed entity that cannot be alt
The multi-volume set LNAI 14169 until 14175 constitutes the refereed proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2023, which took place in Turin, Italy, in September 2023.The 196 papers w