Performance Optimisation of Smoothed Particle Hydrodynamics Algorithms for Multi/Many-Core Architectures

Authors:	Nicolay Hammer, Luigi Iapichino, Vasileios Karakasis, Fabio Baruffa
Year of publication:	2016
Subject:	FOS: Computer and information sciences; Xeon; Computer science; FOS: Physical sciences; 010103 numerical & computational mathematics; Ivy Bridge; Computational Physics (physics.comp-ph); 01 natural sciences; Smoothed-particle hydrodynamics; Software portability; Computer Science - Distributed, Parallel, and Cluster Computing; 0103 physical sciences; Scalability; Code (cryptography); Distributed, Parallel, and Cluster Computing (cs.DC); 0101 mathematics; Astrophysics - Instrumentation and Methods for Astrophysics; Physics - Computational Physics; 010303 astronomy & astrophysics; Algorithm; Instrumentation and Methods for Astrophysics (astro-ph.IM); Xeon Phi
DOI:	10.48550/arxiv.1612.06090
Description:	We describe a strategy for code modernisation of Gadget, a widely used community code for computational astrophysics. The focus of this work is on node-level performance optimisation, targeting current multi/many-core Intel® architectures. We identify and isolate a sample code kernel that is representative of a typical Smoothed Particle Hydrodynamics (SPH) algorithm. The code modifications include threading parallelism optimisation, a change of the data layout to Structure of Arrays (SoA), auto-vectorisation, and algorithmic improvements in the particle sorting. We obtain shorter execution time and improved threading scalability on both Intel Xeon® ($2.6 \times$ on Ivy Bridge) and Xeon Phi™ ($13.7 \times$ on Knights Corner) systems. Initial tests of the optimised code show $19.1 \times$ faster execution on the second-generation Xeon Phi (Knights Landing), demonstrating the portability of the devised optimisations to upcoming architectures. Comment: 8 pages, 2 columns, 4 figures, accepted as a paper at HPCS Proceedings 2017, IEEE Xplore
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ddb0f2304bbd4f0d461edf1580a791b6