Zobrazeno 1 - 10
of 12
pro vyhledávání: '"Czarnecki WM"'
Neural information processing systems foundation. All rights reserved. Most deep reinforcement learning algorithms are data inefficient in complex and rich environments, limiting their applicability to many scenarios. One direction for improving data
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::8a58aca0985ad5d6f2398021287f8f35
https://ora.ox.ac.uk/objects/uuid:0cfdde8d-8b0b-440a-97b7-7d2a185d1ad6
https://ora.ox.ac.uk/objects/uuid:0cfdde8d-8b0b-440a-97b7-7d2a185d1ad6
Autor:
Galashov, A, Jayakumar, SM, Hasenclever, L, Tirumala, D, Schwarz, J, Desjardins, G, Czarnecki, WM, Teh, YW, Pascanu, R, Heess, N
Many real world tasks exhibit rich structure that is repeated across different parts of the state space or in time. In this work we study the possibility of leveraging such repeated structure to speed up and regularize learning. We start from the KL
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::bee681c2a811d4a8ce0e8cac42a6add3
http://arxiv.org/abs/1905.01240
http://arxiv.org/abs/1905.01240
Autor:
Liu S; DeepMind, London, UK., Lever G; DeepMind, London, UK., Wang Z; DeepMind, London, UK., Merel J; DeepMind, London, UK., Eslami SMA; DeepMind, London, UK., Hennes D; DeepMind, London, UK., Czarnecki WM; DeepMind, London, UK., Tassa Y; DeepMind, London, UK., Omidshafiei S; DeepMind, London, UK., Abdolmaleki A; DeepMind, London, UK., Siegel NY; DeepMind, London, UK., Hasenclever L; DeepMind, London, UK., Marris L; DeepMind, London, UK., Tunyasuvunakool S; DeepMind, London, UK., Song HF; DeepMind, London, UK., Wulfmeier M; DeepMind, London, UK., Muller P; DeepMind, London, UK., Haarnoja T; DeepMind, London, UK., Tracey B; DeepMind, London, UK., Tuyls K; DeepMind, London, UK., Graepel T; DeepMind, London, UK., Heess N; DeepMind, London, UK.
Publikováno v:
Science robotics [Sci Robot] 2022 Aug 31; Vol. 7 (69), pp. eabo0235. Date of Electronic Publication: 2022 Aug 31.
Autor:
Omidshafiei S; DeepMind, Paris, France. somidshafiei@google.com., Tuyls K; DeepMind, Paris, France., Czarnecki WM; DeepMind, London, UK., Santos FC; INESC-ID and Instituto Superior Técnico, Universidade de Lisboa, Lisboa, Portugal., Rowland M; DeepMind, London, UK., Connor J; DeepMind, London, UK., Hennes D; DeepMind, Paris, France., Muller P; DeepMind, Paris, France., Pérolat J; DeepMind, Paris, France., Vylder B; DeepMind, Paris, France., Gruslys A; DeepMind, London, UK., Munos R; DeepMind, Paris, France.
Publikováno v:
Nature communications [Nat Commun] 2020 Nov 05; Vol. 11 (1), pp. 5603. Date of Electronic Publication: 2020 Nov 05.
Autor:
Vinyals O; DeepMind, London, UK. vinyals@google.com., Babuschkin I; DeepMind, London, UK., Czarnecki WM; DeepMind, London, UK., Mathieu M; DeepMind, London, UK., Dudzik A; DeepMind, London, UK., Chung J; DeepMind, London, UK., Choi DH; DeepMind, London, UK., Powell R; DeepMind, London, UK., Ewalds T; DeepMind, London, UK., Georgiev P; DeepMind, London, UK., Oh J; DeepMind, London, UK., Horgan D; DeepMind, London, UK., Kroiss M; DeepMind, London, UK., Danihelka I; DeepMind, London, UK., Huang A; DeepMind, London, UK., Sifre L; DeepMind, London, UK., Cai T; DeepMind, London, UK., Agapiou JP; DeepMind, London, UK., Jaderberg M; DeepMind, London, UK., Vezhnevets AS; DeepMind, London, UK., Leblond R; DeepMind, London, UK., Pohlen T; DeepMind, London, UK., Dalibard V; DeepMind, London, UK., Budden D; DeepMind, London, UK., Sulsky Y; DeepMind, London, UK., Molloy J; DeepMind, London, UK., Paine TL; DeepMind, London, UK., Gulcehre C; DeepMind, London, UK., Wang Z; DeepMind, London, UK., Pfaff T; DeepMind, London, UK., Wu Y; DeepMind, London, UK., Ring R; DeepMind, London, UK., Yogatama D; DeepMind, London, UK., Wünsch D; Team Liquid, Utrecht, Netherlands., McKinney K; DeepMind, London, UK., Smith O; DeepMind, London, UK., Schaul T; DeepMind, London, UK., Lillicrap T; DeepMind, London, UK., Kavukcuoglu K; DeepMind, London, UK., Hassabis D; DeepMind, London, UK., Apps C; DeepMind, London, UK., Silver D; DeepMind, London, UK. davidsilver@google.com.
Publikováno v:
Nature [Nature] 2019 Nov; Vol. 575 (7782), pp. 350-354. Date of Electronic Publication: 2019 Oct 30.
Autor:
Omidshafiei S; DeepMind, Paris, France., Papadimitriou C; Columbia University, New York, USA., Piliouras G; Singapore University of Technology and Design, Singapore, Singapore., Tuyls K; DeepMind, Paris, France. karltuyls@google.com., Rowland M; DeepMind, London, UK., Lespiau JB; DeepMind, Paris, France., Czarnecki WM; DeepMind, London, UK., Lanctot M; DeepMind, Edmonton, Canada., Perolat J; DeepMind, London, UK., Munos R; DeepMind, Paris, France.
Publikováno v:
Scientific reports [Sci Rep] 2019 Jul 09; Vol. 9 (1), pp. 9937. Date of Electronic Publication: 2019 Jul 09.
Autor:
Jaderberg M; DeepMind, London, UK. lejlot@google.com jaderberg@google.com., Czarnecki WM; DeepMind, London, UK. lejlot@google.com jaderberg@google.com., Dunning I; DeepMind, London, UK., Marris L; DeepMind, London, UK., Lever G; DeepMind, London, UK., Castañeda AG; DeepMind, London, UK., Beattie C; DeepMind, London, UK., Rabinowitz NC; DeepMind, London, UK., Morcos AS; DeepMind, London, UK., Ruderman A; DeepMind, London, UK., Sonnerat N; DeepMind, London, UK., Green T; DeepMind, London, UK., Deason L; DeepMind, London, UK., Leibo JZ; DeepMind, London, UK., Silver D; DeepMind, London, UK., Hassabis D; DeepMind, London, UK., Kavukcuoglu K; DeepMind, London, UK., Graepel T; DeepMind, London, UK.
Publikováno v:
Science (New York, N.Y.) [Science] 2019 May 31; Vol. 364 (6443), pp. 859-865.
Autor:
Podlewska S; Department of Medicinal Chemistry, Institute of Pharmacology, Polish Academy of Sciences , Smętna 12, 31-343 Kraków, Poland., Czarnecki WM; Faculty of Mathematics and Computer Science, Jagiellonian University , 30-348 Kraków, Poland., Kafel R; Department of Medicinal Chemistry, Institute of Pharmacology, Polish Academy of Sciences , Smętna 12, 31-343 Kraków, Poland., Bojarski AJ; Department of Medicinal Chemistry, Institute of Pharmacology, Polish Academy of Sciences , Smętna 12, 31-343 Kraków, Poland.
Publikováno v:
Journal of chemical information and modeling [J Chem Inf Model] 2017 Feb 27; Vol. 57 (2), pp. 133-147. Date of Electronic Publication: 2017 Feb 15.
Autor:
Leśniak D; Jagiellonian University, Faculty of Mathematics and Computer Science, 6 S. Lojasiewicza Street, Kraków 30-048, Poland., Jastrzębski S; Jagiellonian University, Faculty of Mathematics and Computer Science, 6 S. Lojasiewicza Street, Kraków 30-048, Poland., Podlewska S; Institute of Pharmacology, Polish Academy of Sciences, Department of Medicinal Chemistry, 12 Smętna Street, Kraków 31-343, Poland., Czarnecki WM; Jagiellonian University, Faculty of Mathematics and Computer Science, 6 S. Lojasiewicza Street, Kraków 30-048, Poland., Bojarski AJ; Institute of Pharmacology, Polish Academy of Sciences, Department of Medicinal Chemistry, 12 Smętna Street, Kraków 31-343, Poland. Electronic address: bojarski@if-pan.krakow.pl.
Publikováno v:
Bioorganic & medicinal chemistry letters [Bioorg Med Chem Lett] 2017 Feb 01; Vol. 27 (3), pp. 626-631. Date of Electronic Publication: 2016 Dec 02.
Autor:
Czarnecki WM; Faculty of Mathematics and Computer Science, Jagiellonian University, Lojasiewicza 6, 30-348 Krakow, Poland. wojciech.czarnecki@uj.edu.pl., Podlewska S; Institute of Pharmacology, Polish Academy of Sciences, Smetna 12, 31-343 Krakow, Poland. smusz@if-pan.krakow.pl.; Faculty of Chemistry, Jagiellonian University, Ingardena 3, 30-060 Krakow, Poland. smusz@if-pan.krakow.pl., Bojarski AJ; Institute of Pharmacology, Polish Academy of Sciences, Smetna 12, 31-343 Krakow, Poland. bojarski@if-pan.krakow.pl.
Publikováno v:
Molecules (Basel, Switzerland) [Molecules] 2015 Nov 09; Vol. 20 (11), pp. 20107-17. Date of Electronic Publication: 2015 Nov 09.