Performance Optimisation of Smoothed Particle Hydrodynamics Algorithms for Multi/Many-Core Architectures
Autor: | Nicolay Hammer, Luigi Iapichino, Vasileios Karakasis, Fabio Baruffa |
---|---|
Rok vydání: | 2016 |
Předmět: |
FOS: Computer and information sciences
Xeon Computer science FOS: Physical sciences 010103 numerical & computational mathematics Ivy Bridge Computational Physics (physics.comp-ph) 01 natural sciences Smoothed-particle hydrodynamics Software portability Computer Science - Distributed Parallel and Cluster Computing 0103 physical sciences Scalability Code (cryptography) Distributed Parallel and Cluster Computing (cs.DC) 0101 mathematics Astrophysics - Instrumentation and Methods for Astrophysics Physics - Computational Physics 010303 astronomy & astrophysics Algorithm Instrumentation and Methods for Astrophysics (astro-ph.IM) Xeon Phi |
DOI: | 10.48550/arxiv.1612.06090 |
Popis: | We describe a strategy for code modernisation of Gadget, a widely used community code for computational astrophysics. The focus of this work is on node-level performance optimisation, targeting current multi/many-core IntelR architectures. We identify and isolate a sample code kernel, which is representative of a typical Smoothed Particle Hydrodynamics (SPH) algorithm. The code modifications include threading parallelism optimisation, change of the data layout into Structure of Arrays (SoA), auto-vectorisation and algorithmic improvements in the particle sorting. We obtain shorter execution time and improved threading scalability both on Intel XeonR ($2.6 \times$ on Ivy Bridge) and Xeon PhiTM ($13.7 \times$ on Knights Corner) systems. First few tests of the optimised code result in $19.1 \times$ faster execution on second generation Xeon Phi (Knights Landing), thus demonstrating the portability of the devised optimisation solutions to upcoming architectures. Comment: 8 pages, 2 columns, 4 figures, accepted as paper at HPCS Proceedings 2017, IEEE XPLORE |
Databáze: | OpenAIRE |
Externí odkaz: |