Autor: |
Rémy, Adrien, Baboulin, Marc, Sosonkina, Masha, Rozoy, Brigitte |
Přispěvatelé: |
Laboratoire de Recherche en Informatique (LRI), Université Paris-Sud - Paris 11 (UP11)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), Performance Optimization by Software Transformation and Algorithms & Librairies Enhancement (POSTALE), Université Paris-Sud - Paris 11 (UP11)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-Université Paris-Sud - Paris 11 (UP11)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-Inria Saclay - Ile de France, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria), Old Dominion University [Norfolk] (ODU), INRIA |
Jazyk: |
angličtina |
Rok vydání: |
2014 |
Předmět: |
|
Zdroj: |
[Research Report] RR-8497, INRIA. 2014 |
Popis: |
We study the impact of non-uniform memory accesses (NUMA) on the solution of dense general linear systems using an LU factorization algorithm. In particular we illustrate how an appropriate placement of the threads and memory on a NUMA architecture can improve the performance of the panel factorization and consequently accelerate the global LU factorization. We apply these placement strategies and present performance results for a hybrid multicore/GPU LU algorithm as it is implemented in the public domain library MAGMA. |
Databáze: |
OpenAIRE |
Externí odkaz: |
|