Learning with Delayed Rewards-A Case Study on Inverse Defect Design in 2D Materials.
Autor: | Banik S; Center for Nanoscale Materials, Argonne National Laboratory, Lemont, Illinois 60439, United States.; Department of Mechanical and Industrial Engineering, University of Illinois, Chicago, Illinois 60607, United States., Loeffler TD; Center for Nanoscale Materials, Argonne National Laboratory, Lemont, Illinois 60439, United States.; Department of Mechanical and Industrial Engineering, University of Illinois, Chicago, Illinois 60607, United States., Batra R; Center for Nanoscale Materials, Argonne National Laboratory, Lemont, Illinois 60439, United States., Singh H; Research and Development, Sentient Science Corporation, West Lafayette, Indiana 47906, United States., Cherukara MJ; Advanced Photon Source, Argonne National Laboratory, Lemont, Illinois 60439, United States., Sankaranarayanan SKRS; Center for Nanoscale Materials, Argonne National Laboratory, Lemont, Illinois 60439, United States.; Department of Mechanical and Industrial Engineering, University of Illinois, Chicago, Illinois 60607, United States. |
---|---|
Jazyk: | angličtina |
Zdroj: | ACS applied materials & interfaces [ACS Appl Mater Interfaces] 2021 Aug 04; Vol. 13 (30), pp. 36455-36464. Date of Electronic Publication: 2021 Jul 21. |
DOI: | 10.1021/acsami.1c07545 |
Abstrakt: | Defect dynamics in materials are of central importance to a broad range of technologies from catalysis to energy storage systems to microelectronics. Material functionality depends strongly on the nature and organization of defects-their arrangements often involve intermediate or transient states that present a high barrier for transformation. The lack of knowledge of these intermediate states and the presence of this energy barrier presents a serious challenge for inverse defect design, especially for gradient-based approaches. Here, we present a reinforcement learning (RL) [Monte Carlo Tree Search (MCTS)] based on delayed rewards that allow for efficient search of the defect configurational space and allows us to identify optimal defect arrangements in low-dimensional materials. Using a representative case of two-dimensional MoS |
Databáze: | MEDLINE |
Externí odkaz: |