Efficient parallelization of multilevel fast multipole algorithm for electromagnetic simulation on many-core SW26010 processor
Autor: | Ming-Lin Yang, Wei-Jia He, Wu Wang, Xin-Qing Sheng |
---|---|
Rok vydání: | 2020 |
Předmět: |
020203 distributed computing
Speedup Computer science Multiple buffering 02 engineering and technology Parallel computing SW26010 Data structure Theoretical Computer Science Tree (data structure) Hardware and Architecture Parallel programming model 0202 electrical engineering electronic engineering information engineering Central processing unit Algorithm Software Information Systems Scratchpad memory |
Zdroj: | The Journal of Supercomputing. 77:1502-1516 |
ISSN: | 1573-0484 0920-8542 |
DOI: | 10.1007/s11227-020-03308-9 |
Popis: | A many-core parallel approach of the multilevel fast multipole algorithm (MLFMA) based on the Athread parallel programming model is presented on the homegrown many-core SW26010 CPU of China. In the proposed many-core implementation of MLFMA, the data access efficiency is improved by using data structures based on the structure of array. The adaptive workload distribution strategies are adopted on different MLFMA tree levels to ensure full utilization of computing capability and the scratchpad memory. A double buffering scheme is specially designed to make communication overlapped computation. The resulting Athread-based many-core implementation of the MLFMA is capable of solving real-life problems with over one million unknowns with a remarkable speedup. The capability and efficiency of the proposed method are analyzed through the examples of computing scattering by spheres and a practical aerocraft. Numerical results show that with the proposed parallel scheme, the total speedup ratios from 6.4 to 8.0 can be achieved, compared with the CPU master core. |
Databáze: | OpenAIRE |
Externí odkaz: |