Machine Learning to Design an Auto-tuning System for the Best Compressed Format Detection for Parallel Sparse Computations

Autor:	Olfa Hamdi-Larbi, Ichrak Mehrez, Thomas Dufaud
Přispěvatelé:	Université de Tunis El Manar (UTM), Taibah University, Laboratoire d'Informatique Parallélisme Réseaux Algorithmes Distribués (LI-PaRAD), Université de Versailles Saint-Quentin-en-Yvelines (UVSQ), Maison de la Simulation (MDLS), Université de Versailles Saint-Quentin-en-Yvelines (UVSQ)-Institut National de Recherche en Informatique et en Automatique (Inria)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)
Jazyk:	angličtina
Rok vydání:	2021
Předmět:	Smart system Computer science Computation Process (computing) 02 engineering and technology 01 natural sciences Theoretical Computer Science Auto tuning 010104 statistics & probability Computer engineering Work (electrical) Hardware and Architecture 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing 0101 mathematics [INFO.INFO-DC]Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC] Software Sparse matrix
Zdroj:	Parallel Processing Letters Parallel Processing Letters, World Scientific Publishing, 2021, ⟨10.1142/S0129626421500195⟩ Parallel Processing Letters, 2021, 31 (04), pp.2150019. ⟨10.1142/S0129626421500195⟩
ISSN:	0129-6264 1793-642X
Popis:	International audience; Many applications in scientific computing process very large sparse matrices on parallel architectures. The presented work in this paper is a part of a project where our general aim is to develop an auto-tuner system for the selection of the best matrix compression format in the context of high-performance computing. The target smart system can automatically select the best compression format for a given sparse matrix, a numerical method processing this matrix, a parallel programming model and a target architecture. Hence, this paper describes the design and implementation of the proposed concept. We consider a case study consisting of a numerical method reduced to the sparse matrix vector product (SpMV), some compression formats, the data parallel as a programming model and, a distributed multi-core platform as a target architecture. This study allows extracting a set of important novel metrics and parameters which are relative to the considered programming model. Our metrics are used as input to a machine-learning algorithm to predict the best matrix compression format. An experimental study targeting a distributed multi-core platform and processing random and real-world matrices shows that our system can improve in average up to 7% the accuracy of the machine learning.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::0f1cfb5f2f3585dc8aaff7064334497a https://hal.archives-ouvertes.fr/hal-03429364 Zobrazit plný text záznamu