Zobrazeno 1 - 10
of 485
pro vyhledávání: '"Czarnecki, WM"'
Neural information processing systems foundation. All rights reserved. Most deep reinforcement learning algorithms are data inefficient in complex and rich environments, limiting their applicability to many scenarios. One direction for improving data
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::8a58aca0985ad5d6f2398021287f8f35
https://ora.ox.ac.uk/objects/uuid:0cfdde8d-8b0b-440a-97b7-7d2a185d1ad6
https://ora.ox.ac.uk/objects/uuid:0cfdde8d-8b0b-440a-97b7-7d2a185d1ad6
Autor:
Galashov, A, Jayakumar, SM, Hasenclever, L, Tirumala, D, Schwarz, J, Desjardins, G, Czarnecki, WM, Teh, YW, Pascanu, R, Heess, N
Many real world tasks exhibit rich structure that is repeated across different parts of the state space or in time. In this work we study the possibility of leveraging such repeated structure to speed up and regularize learning. We start from the KL
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::bee681c2a811d4a8ce0e8cac42a6add3
http://arxiv.org/abs/1905.01240
http://arxiv.org/abs/1905.01240
Autor:
Liu S; DeepMind, London, UK., Lever G; DeepMind, London, UK., Wang Z; DeepMind, London, UK., Merel J; DeepMind, London, UK., Eslami SMA; DeepMind, London, UK., Hennes D; DeepMind, London, UK., Czarnecki WM; DeepMind, London, UK., Tassa Y; DeepMind, London, UK., Omidshafiei S; DeepMind, London, UK., Abdolmaleki A; DeepMind, London, UK., Siegel NY; DeepMind, London, UK., Hasenclever L; DeepMind, London, UK., Marris L; DeepMind, London, UK., Tunyasuvunakool S; DeepMind, London, UK., Song HF; DeepMind, London, UK., Wulfmeier M; DeepMind, London, UK., Muller P; DeepMind, London, UK., Haarnoja T; DeepMind, London, UK., Tracey B; DeepMind, London, UK., Tuyls K; DeepMind, London, UK., Graepel T; DeepMind, London, UK., Heess N; DeepMind, London, UK.
Publikováno v:
Science robotics [Sci Robot] 2022 Aug 31; Vol. 7 (69), pp. eabo0235. Date of Electronic Publication: 2022 Aug 31.
Autor:
Omidshafiei S; DeepMind, Paris, France. somidshafiei@google.com., Tuyls K; DeepMind, Paris, France., Czarnecki WM; DeepMind, London, UK., Santos FC; INESC-ID and Instituto Superior Técnico, Universidade de Lisboa, Lisboa, Portugal., Rowland M; DeepMind, London, UK., Connor J; DeepMind, London, UK., Hennes D; DeepMind, Paris, France., Muller P; DeepMind, Paris, France., Pérolat J; DeepMind, Paris, France., Vylder B; DeepMind, Paris, France., Gruslys A; DeepMind, London, UK., Munos R; DeepMind, Paris, France.
Publikováno v:
Nature communications [Nat Commun] 2020 Nov 05; Vol. 11 (1), pp. 5603. Date of Electronic Publication: 2020 Nov 05.
Autor:
Moskovitz, Ted1,2 (AUTHOR) ted@gatsby.ucl.ac.uk, Miller, Kevin J.2,3 (AUTHOR), Sahani, Maneesh1 (AUTHOR), Botvinick, Matthew M.1,2 (AUTHOR)
Publikováno v:
PLoS Computational Biology. 10/18/2024, Vol. 20 Issue 10, p1-22. 22p.
Autor:
Menke, Janosch1 (AUTHOR) janosch.menke.research@proton.me, Nahal, Yasmine2 (AUTHOR), Bjerrum, Esben Jannik3 (AUTHOR), Kabeshov, Mikhail4 (AUTHOR), Kaski, Samuel2,5 (AUTHOR), Engkvist, Ola1,4 (AUTHOR)
Publikováno v:
Journal of Cheminformatics. 8/14/2024, Vol. 16 Issue 1, p1-9. 9p.
Autor:
Grohs, Philipp1,2,3 (AUTHOR) philipp.grohs@univie.ac.at, Voigtlaender, Felix1,4,5 (AUTHOR)
Publikováno v:
Foundations of Computational Mathematics. Aug2024, Vol. 24 Issue 4, p1085-1143. 59p.
Autor:
Vinyals O; DeepMind, London, UK. vinyals@google.com., Babuschkin I; DeepMind, London, UK., Czarnecki WM; DeepMind, London, UK., Mathieu M; DeepMind, London, UK., Dudzik A; DeepMind, London, UK., Chung J; DeepMind, London, UK., Choi DH; DeepMind, London, UK., Powell R; DeepMind, London, UK., Ewalds T; DeepMind, London, UK., Georgiev P; DeepMind, London, UK., Oh J; DeepMind, London, UK., Horgan D; DeepMind, London, UK., Kroiss M; DeepMind, London, UK., Danihelka I; DeepMind, London, UK., Huang A; DeepMind, London, UK., Sifre L; DeepMind, London, UK., Cai T; DeepMind, London, UK., Agapiou JP; DeepMind, London, UK., Jaderberg M; DeepMind, London, UK., Vezhnevets AS; DeepMind, London, UK., Leblond R; DeepMind, London, UK., Pohlen T; DeepMind, London, UK., Dalibard V; DeepMind, London, UK., Budden D; DeepMind, London, UK., Sulsky Y; DeepMind, London, UK., Molloy J; DeepMind, London, UK., Paine TL; DeepMind, London, UK., Gulcehre C; DeepMind, London, UK., Wang Z; DeepMind, London, UK., Pfaff T; DeepMind, London, UK., Wu Y; DeepMind, London, UK., Ring R; DeepMind, London, UK., Yogatama D; DeepMind, London, UK., Wünsch D; Team Liquid, Utrecht, Netherlands., McKinney K; DeepMind, London, UK., Smith O; DeepMind, London, UK., Schaul T; DeepMind, London, UK., Lillicrap T; DeepMind, London, UK., Kavukcuoglu K; DeepMind, London, UK., Hassabis D; DeepMind, London, UK., Apps C; DeepMind, London, UK., Silver D; DeepMind, London, UK. davidsilver@google.com.
Publikováno v:
Nature [Nature] 2019 Nov; Vol. 575 (7782), pp. 350-354. Date of Electronic Publication: 2019 Oct 30.
Autor:
Omidshafiei S; DeepMind, Paris, France., Papadimitriou C; Columbia University, New York, USA., Piliouras G; Singapore University of Technology and Design, Singapore, Singapore., Tuyls K; DeepMind, Paris, France. karltuyls@google.com., Rowland M; DeepMind, London, UK., Lespiau JB; DeepMind, Paris, France., Czarnecki WM; DeepMind, London, UK., Lanctot M; DeepMind, Edmonton, Canada., Perolat J; DeepMind, London, UK., Munos R; DeepMind, Paris, France.
Publikováno v:
Scientific reports [Sci Rep] 2019 Jul 09; Vol. 9 (1), pp. 9937. Date of Electronic Publication: 2019 Jul 09.
Autor:
Jaderberg M; DeepMind, London, UK. lejlot@google.com jaderberg@google.com., Czarnecki WM; DeepMind, London, UK. lejlot@google.com jaderberg@google.com., Dunning I; DeepMind, London, UK., Marris L; DeepMind, London, UK., Lever G; DeepMind, London, UK., Castañeda AG; DeepMind, London, UK., Beattie C; DeepMind, London, UK., Rabinowitz NC; DeepMind, London, UK., Morcos AS; DeepMind, London, UK., Ruderman A; DeepMind, London, UK., Sonnerat N; DeepMind, London, UK., Green T; DeepMind, London, UK., Deason L; DeepMind, London, UK., Leibo JZ; DeepMind, London, UK., Silver D; DeepMind, London, UK., Hassabis D; DeepMind, London, UK., Kavukcuoglu K; DeepMind, London, UK., Graepel T; DeepMind, London, UK.
Publikováno v:
Science (New York, N.Y.) [Science] 2019 May 31; Vol. 364 (6443), pp. 859-865.