Scalability of parallel finite element algorithms on multi-core platforms
Autor: | A.K. Novikov, N. S. Nedozhogin, S.P. Kopysov, V.N. Rychkov |
---|---|
Rok vydání: | 2016 |
Předmět: |
Multi-core processor
Speedup Computer science 010103 numerical & computational mathematics 02 engineering and technology Parallel computing Shared mesh 01 natural sciences 020204 information systems Scalability 0202 electrical engineering electronic engineering information engineering Multiplication Polygon mesh 0101 mathematics Algorithm Access time Block (data storage) |
Zdroj: | IOP Conference Series: Materials Science and Engineering. 158:012055 |
ISSN: | 1757-899X 1757-8981 |
DOI: | 10.1088/1757-899x/158/1/012055 |
Popis: | The speedup of element-by-element FEM algorithms depends not only on peak processor performance but also on access time to shared mesh data. Eliminating memory boundness would significantly speed up unstructured mesh computations on hybrid multi-core architectures, where the gap between processor and memory performance continues to grow. The speedup can be achieved by ordering unknowns so that only those elements are processed in parallel which do not have common nodes. Therefore, memory conflicts are minimized. FEM assembly is performed with respect to the ordering, which defines how to compose vectors. Mesh can be partitioned into disjoint subdomains by using different layer-by-layer schemes. In this work, we evaluated several partitioning schemes (block, odd, even, and their modifications) on multi-core platforms, using Gunther's Universal Law of Computational Scalability. We performed numerical experiments with element-by-element matrix-vector multiplication on unstructured meshes on multi-core processors accelerated by MIC and GPU. With ordering, we achieved 5-times speedup on CPU, 40-times speedup on MIC, and 200- times speedup on GPU. |
Databáze: | OpenAIRE |
Externí odkaz: |