An Initial Assessment of NVSHMEM for High Performance Computing

Autor:	Chris J. Newburn, Neena Imam, Akhil Langer, Sreeram Potluri, Chung-Hsing Hsu
Rok vydání:	2020
Předmět:	business.industry Computer science Distributed computing Concurrency Programming complexity Scalability Programming paradigm Usability Partitioned global address space Solver business Supercomputer
Zdroj:	IPDPS Workshops
DOI:	10.1109/ipdpsw50202.2020.00104
Popis:	High Performance Computing has been a driving force behind important tasks such as scientific discovery and deep learning. It tends to achieve performance through greater concurrency and heterogeneity, where the underlying complexity of richer topologies is managed through software abstraction.In this paper, we present our initial assessment of NVSHMEM, an experimental programming library that supports the Partitioned Global Address Space programming model for NVIDIA GPU clusters. NVSHMEM offers several concrete advantages. One is that it reduces overheads and software complexity by allowing communication and computation to be interleaved vs. separating them into different phases. Another is that it implements the OpenSHMEM specification to provide efficient finegrained one-sided communication, streamlining away overheads due to tag matching, wildcards, and unexpected messages which have compounding effect with increasing concurrency. It also offers ease of use by abstracting away low-level configuration operations that are required to enable low-overhead communication and direct loads and stores across processes.We evaluated NVSHMEM in terms of usability, functionality, and scalability by running two math kernels, matrix multiplication and Jacobi solver, on the 27,648-GPU Summit supercomputer. Our exercise of NVSHMEM at scale contributed to making NVSHMEM more robust and preparing it for production release.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::d0eaaf7c81ff67ae16029099704ade31 https://doi.org/10.1109/ipdpsw50202.2020.00104 Zobrazit plný text záznamu