High Performance Matrix Multiplication on General Purpose Graphics Processing Units

Autor: Jessica Brazelton, Miguel Cabral, Fan Wu
Rok vydání: 2010
Předmět:
Zdroj: 2010 International Conference on Computational Intelligence and Software Engineering.
DOI: 10.1109/cise.2010.5677044
Popis: In recent years, there has been significant interest from both academia and industry in applying commodity graphics processing units (GPUs) toward general computing problems. The nVidia CUDA programming model provides a straightforward means of describing inherently parallel computations. In this paper, we present our GPU-based matrix multiplication with high performance on General Purpose Graphics Processing Unit (GPGPUs). We implemented our algorithm using nVidia CUDA API and compared its performance with an optimized CPU-implementation on a high-end AMD Opteron Dual Core CPU. Our experimental results show that a significant performance improvement over CPU-based algorithm and the maximum observed speedups are about 100 times.
Databáze: OpenAIRE