Design and Implementation of Multithreaded Reproducible DGEMV for Phytium Processor

Author: CHEN Lei, TANG Tao, QI Hai-jun, JIANG Hao, HE Kang
Language: Chinese
Publication year: 2022
Subject:
Source: Jisuanji kexue, Vol 49, Iss 10, Pp 27-35 (2022)
Document type: article
ISSN: 1002-137X
DOI: 10.11896/jsjkx.220100125
Description: In high-performance computing, the accumulation of rounding errors when solving large-scale, long-running, and ill-conditioned problems can invalidate the results. Reproducible results help developers debug programs and check their correctness, so the reproducibility of an algorithm's numerical results becomes very important. Building on the OpenBLAS framework and combining Demmel's reproducible method in ReproBLAS with the multilayer block technique proposed by Castaldo, this paper designs a reproducible multithreaded DGEMV algorithm for the Phytium processor, using rounding error analysis and error-free transformation. Numerical experiments show that the output of the algorithm is identical to that of ReproBLAS, which verifies its reproducibility. The algorithm is up to 2x faster than its ReproBLAS counterpart. Compared with the DGEMV function of OzBLAS proposed by Mukunoki, it runs at least 20x faster with a single thread and 9x faster with multiple threads. Theoretical analysis and numerical experiments show that the improved algorithm is accurate, valid, and efficient.
Database: Directory of Open Access Journals