Developing a New Storage Format and a Warp-Based SpMV Kernel for Configuration Interaction Sparse Matrices on the GPU
Author: | Hassan Reza, Mohammed Mahmoud, Mark R. Hoffmann |
---|---|
Language: | English |
Publication year: | 2018 |
Subject: | General Computer Science; Computer science; Graphics processing unit; GPU; CUDA; engineering and technology; lcsh:QA75.5-76.95; Theoretical Computer Science; Computational science; Matrix (mathematics); SpMV; electrical engineering, electronic engineering, information engineering; Linear system; Eigenvalues and eigenvectors; Sparse matrix; distributed computing; Applied Mathematics; networking & telecommunications; CI; kernel; Modeling and Simulation; Kernel (statistics); Computer Science::Mathematical Software; Multiplication; lcsh:Electronic computers. Computer science |
Source: | Computation, Vol 6, Iss 3, p 45 (2018) |
ISSN: | 2079-3197 |
DOI: | 10.3390/computation6030045 |
Description: | Sparse matrix-vector multiplication (SpMV) is used to solve linear systems and eigenvalue problems of widely varying scale that arise in numerous scientific applications. One such application is Configuration Interaction (CI), a linear method for solving the nonrelativistic Schrödinger equation for quantum chemical multi-electron systems that can treat the ground state as well as multiple excited states. In this paper, we develop a hybrid approach for CI sparse matrices. The proposed model includes a newly developed hybrid format for storing CI sparse matrices on the Graphics Processing Unit (GPU), together with an SpMV kernel, written in C on the Compute Unified Device Architecture (CUDA) platform, that multiplies a CI matrix stored in the proposed format by a vector. The proposed SpMV kernel is a vector kernel that uses the warp approach. We evaluated the model on two primary factors, memory usage and performance. The proposed kernel was compared with the cuSPARSE library and the CSR5 (Compressed Sparse Row 5) format and outperformed both. |
Database: | OpenAIRE |
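The description mentions a "vector kernel that uses the warp approach" without showing what that means. As a rough illustration only, the sketch below emulates the generic warp-per-row idea serially in plain C on a standard CSR matrix. It is not the paper's hybrid format or the authors' kernel: the function name `spmv_csr_warp_emulated`, the CSR layout, and the use of `double` are all assumptions made for this example. On a real GPU the 32 lanes would run concurrently and the reduction step would typically use warp shuffle intrinsics such as `__shfl_down_sync`.

```c
#include <assert.h>
#include <stddef.h>

#define WARP_SIZE 32  /* lanes per warp on NVIDIA GPUs */

/* Hypothetical helper (not from the paper): computes y = A * x for a
 * CSR matrix with n_rows rows. row_ptr has n_rows + 1 entries; col_idx
 * and val hold the column index and value of each stored nonzero. */
void spmv_csr_warp_emulated(int n_rows, const int *row_ptr,
                            const int *col_idx, const double *val,
                            const double *x, double *y)
{
    assert(n_rows >= 0);
    assert(row_ptr != NULL && y != NULL);

    for (int row = 0; row < n_rows; ++row) {   /* one emulated warp per row */
        double partial[WARP_SIZE] = {0.0};

        /* Each lane visits the row's nonzeros with stride WARP_SIZE, so
         * neighbouring lanes touch neighbouring entries of val/col_idx. */
        for (int lane = 0; lane < WARP_SIZE; ++lane) {
            for (int k = row_ptr[row] + lane; k < row_ptr[row + 1]; k += WARP_SIZE) {
                partial[lane] += val[k] * x[col_idx[k]];
            }
        }

        /* Tree reduction of the per-lane partial sums into lane 0. */
        for (int offset = WARP_SIZE / 2; offset > 0; offset /= 2) {
            for (int lane = 0; lane < offset; ++lane) {
                partial[lane] += partial[lane + offset];
            }
        }

        y[row] = partial[0];
    }
}
```

Assigning a whole warp to each row tends to pay off when rows hold many nonzeros, because adjacent lanes then read adjacent matrix entries. For very short rows most lanes sit idle, which is one common motivation for hybrid storage formats.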