Elemental

Authors: Bryan Marker, Nichols A. Romero, Jack Poulson, Robert A. van de Geijn, Jeff R. Hammond
Year of publication: 2013
Subject:
Source: ACM Transactions on Mathematical Software. 39:1-24
ISSN: 1557-7295, 0098-3500
DOI: 10.1145/2427023.2427030
Description: Parallelizing dense matrix computations to distributed memory architectures is a well-studied subject and generally considered to be among the best understood domains of parallel computing. Two packages, developed in the mid 1990s, still enjoy regular use: ScaLAPACK and PLAPACK. With the advent of many-core architectures, which may very well take the shape of distributed memory architectures within a single processor, these packages must be revisited since the traditional MPI-based approaches will likely need to be extended. Thus, this is a good time to review lessons learned since the introduction of these two packages and to propose a simple yet effective alternative. Preliminary performance results show the new solution achieves competitive, if not superior, performance on large clusters.
Database: OpenAIRE