Improving Structured Grid-Based Sparse Matrix-Vector Multiplication and Gauss–Seidel Iteration on GPDSP

Autor:	Yang Wang, Jie Liu, Xiaoxiong Zhu, Qingyang Zhang, Shengguo Li, Qinglin Wang
Jazyk:	angličtina
Rok vydání:	2023
Předmět:	GPDSP SpMV Gauss–Seidel iteration parrallel algorithm Technology Engineering (General). Civil engineering (General) TA1-2040 Biology (General) QH301-705.5 Physics QC1-999 Chemistry QD1-999
Zdroj:	Applied Sciences, Vol 13, Iss 15, p 8952 (2023)
Druh dokumentu:	article
ISSN:	2076-3417
DOI:	10.3390/app13158952
Popis:	Structured grid-based sparse matrix-vector multiplication and Gauss–Seidel iterations are very important kernel functions in scientific and engineering computations, both of which are memory intensive and bandwidth-limited. GPDSP is a general purpose digital signal processor, which is a very significant embedded processor that has been introduced into high-performance computing. In this paper, we designed various optimization methods, which included a blocking method to improve data locality and increase memory access efficiency, a multicolor reordering method to develop Gauss–Seidel fine-grained parallelism, a data partitioning method designed for GPDSP memory structures, and a double buffering method to overlap computation and access memory on structured grid-based SpMV and Gauss–Seidel iterations for GPDSP. At last, we combined the above optimization methods to design a multicore vectorization algorithm. We tested the matrices generated with structured grids of different sizes on the GPDSP platform and obtained speedups of up to 41× and 47× compared to the unoptimized SpMV and Gauss–Seidel iterations, with maximum bandwidth efficiencies of 72% and 81%, respectively. The experiment results show that our algorithms could fully utilize the external memory bandwidth. We also implemented the commonly used mixed precision algorithm on the GPDSP and obtained speedups of 1.60× and 1.45× for the SpMV and Gauss–Seidel iterations, respectively.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/c2e6e6c2a9154ed19cddb5368fefdf68 Zobrazit plný text záznamu View record in DOAJ