Performance Optimization of the HPCG Benchmark on the Sunway TaihuLight Supercomputer
Authors: | Fangfang Liu, Chao Yang, Yulong Ao, Lijuan Jiang, Qiao Sun, Wanwang Yin |
---|---|
Year of publication: | 2018 |
Subject: | Hardware architecture; Distributed computing; Computer science; Memory bandwidth; Numerical & computational mathematics; Engineering and technology; Parallel computing; Supercomputer; Natural sciences; Data mapping; Hardware and Architecture; Electrical engineering, electronic engineering, information engineering; Key (cryptography); Benchmark (computing); Mathematics; Software; Information Systems; Sunway TaihuLight; Block (data storage) |
Source: | ACM Transactions on Architecture and Code Optimization. 15:1-20 |
ISSN: | 1544-3973 1544-3566 |
DOI: | 10.1145/3182177 |
Description: | In this article, we present key techniques for optimizing HPCG on Sunway TaihuLight and demonstrate how to achieve high performance in memory-bound applications by exploiting specific characteristics of the hardware architecture. In particular, we use a block multicoloring approach for parallelization and propose methods such as requirement-based data mapping and a customized gather collective to increase the effective memory bandwidth. Experiments indicate that the optimized HPCG code can sustain 77% of the theoretical memory bandwidth and scale to the full system of more than 10 million cores, with an aggregate performance of 480.8 Tflop/s and a weak scaling efficiency of 87.3%. |
Database: | OpenAIRE |
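
The description names a block multicoloring approach but gives no details, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a structured 3D grid with a 27-point stencil (as in HPCG), cubic blocks of edge `b`, and eight colors taken from the parity of the block coordinates. The names `block_color`, `color_blocks`, and `blocks_adjacent` are hypothetical.

```python
from itertools import product

def block_color(bx, by, bz):
    """Color in 0..7 taken from the parity of the block coordinates (assumed scheme)."""
    return (bx % 2) + 2 * (by % 2) + 4 * (bz % 2)

def color_blocks(nx, ny, nz, b):
    """Group the blocks of an nx*ny*nz grid (block edge b) by color."""
    groups = {c: [] for c in range(8)}
    # Ceiling division: number of blocks along each axis.
    nbx, nby, nbz = (-(-n // b) for n in (nx, ny, nz))
    for bx, by, bz in product(range(nbx), range(nby), range(nbz)):
        groups[block_color(bx, by, bz)].append((bx, by, bz))
    return groups

def blocks_adjacent(p, q):
    """True if two distinct blocks touch by a face, an edge, or a corner."""
    return p != q and all(abs(a - c) <= 1 for a, c in zip(p, q))

# Example: an 8x8x8 grid split into 2x2x2 blocks gives 4x4x4 = 64 blocks.
groups = color_blocks(8, 8, 8, 2)
```

Under these assumptions, two blocks of the same color differ by an even amount in every block coordinate, so they never touch. A Gauss-Seidel-style sweep can therefore process all blocks of one color in parallel and move through the colors one after another, while points inside a block are still visited sequentially.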