Physics-informed reinforcement learning optimization of nuclear assembly design
Autor: | Uuganbayar Otgonbaatar, Koroush Shirvan, Majdi I. Radaideh, Benoit Forget, Isaac Wolverton, Joshua Joseph, Nicholas Roy, James J. Tusar |
---|---|
Přispěvatelé: | Massachusetts Institute of Technology. Department of Nuclear Science and Engineering, MIT Intelligence Initiative |
Rok vydání: | 2021 |
Předmět: |
Nuclear and High Energy Physics
Decision support system Mathematical optimization Mechanical Engineering Q-learning Cost reduction Nuclear Energy and Engineering Safety assurance Simulated annealing Fuel efficiency Reinforcement learning General Materials Science Stochastic optimization Safety Risk Reliability and Quality Waste Management and Disposal |
Zdroj: | Prof. Roy |
ISSN: | 0029-5493 |
DOI: | 10.1016/j.nucengdes.2020.110966 |
Popis: | Optimization of nuclear fuel assemblies if performed effectively, will lead to fuel efficiency improvement, cost reduction, and safety assurance. However, assembly optimization involves solving high-dimensional and computationally expensive combinatorial problems. As such, fuel designers’ expert judgement has commonly prevailed over the use of stochastic optimization (SO) algorithms such as genetic algorithms and simulated annealing. To improve the state-of-art, we explore a class of artificial intelligence (AI) algorithms, namely, reinforcement learning (RL) in this work. We propose a physics-informed AI optimization methodology by establishing a connection through reward shaping between RL and the tactics fuel designers follow in practice by moving fuel rods in the assembly to meet specific constraints and objectives. The methodology utilizes RL algorithms, deep Q learning and proximal policy optimization, and compares their performance to SO algorithms. The methodology is applied on two boiling water reactor assemblies of low-dimensional ( ∼ 2 × 10 6 combinations) and high-dimensional ( ∼ 10 31 combinations) natures. The results demonstrate that RL is more effective than SO in solving high dimensional problems, i.e., 10 × 10 assembly, through embedding expert knowledge in form of game rules and effectively exploring the search space. For a given computational resources and timeframe relevant to fuel designers, RL algorithms outperformed SO through finding more feasible patterns, 4–5 times more than SO, and through increasing search speed, as indicated by the RL outstanding computational efficiency. The results of this work clearly demonstrate RL effectiveness as another decision support tool for nuclear fuel assembly optimization. |
Databáze: | OpenAIRE |
Externí odkaz: |