Achieving performance portability in Gaussian basis set density functional theory on accelerator based architectures in NWChemEx
Autor: | Wibe A. de Jong, Douglas W. Doerfler, David B. Williams-Young, Abhishek Bagusetty, Hubertus J. J. van Dam, Theresa L. Windus, Chao Yang, Álvaro Vázquez-Mayagoitia |
---|---|
Rok vydání: | 2021 |
Předmět: |
Scheme (programming language)
Computer Networks and Communications Computer science Accelerator Theoretical Computer Science Set (abstract data type) Software portability Software Artificial Intelligence computer.programming_language business.industry Design pattern Performance portability Computer Graphics and Computer-Aided Design Numerical integration Range (mathematics) Graphics processing unit Computer engineering Hardware and Architecture Key (cryptography) Density functional theory Cognitive Sciences business Distributed Computing computer |
Popis: | The numerical integration of the exchange–correlation (XC) potential is one of the primary computational bottlenecks in Gaussian basis set Kohn–Sham density functional theory (KS-DFT). To achieve optimal performance and accuracy, care must be taken in this numerical integration to preserve local sparsity as to allow for near linear weak scaling with system size. This leads to an integration scheme with several performance critical kernels which must be hand optimized for each architecture of interest. As the set of available accelerator hardware goes more diverse, a key challenge for developers of KS-DFT software is to maintain performance portability across a wide range of computational architectures. In this work, we examine a modular software design pattern which decouples the implementation details of performance critical kernels from the expression of high-level algorithmic workflows in a device-agnostic language such as C++; thus allowing for developers to target existing and emerging accelerator hardware within a single code base. We consider the efficacy of such a design pattern in the numerical integration of the XC potential by demonstrating its ability to achieve performance portability across a set of accelerator architectures which are representative of those on current and future U.S. Department of Energy Leadership Computing Facilities. |
Databáze: | OpenAIRE |
Externí odkaz: |