Data Mining Using Graphics Processing Units.

Autor: Böhm, Christian, Noll, Robert, Plant, Claudia, Wackersreuther, Bianca, Zherdin, Andrew
Zdroj: Transactions on Large-scale Data- & Knowledge-centered Systems I; 2009, p63-90, 28p
Abstrakt: During the last few years, Graphics Processing Units (GPU) have evolved from simple devices for the display signal preparation into powerful coprocessors that do not only support typical computer graphics tasks such as rendering of 3D scenarios but can also be used for general numeric and symbolic computation tasks such as simulation and optimization. As major advantage, GPUs provide extremely high parallelism (with several hundred simple programmable processors) combined with a high bandwidth in memory transfer at low cost. In this paper, we propose several algorithms for computationally expensive data mining tasks like similarity search and clustering which are designed for the highly parallel environment of a GPU. We define a multidimensional index structure which is particularly suited to support similarity queries under the restricted programming model of a GPU, and define a similarity join method. Moreover, we define highly parallel algorithms for density-based and partitioning clustering. In an extensive experimental evaluation, we demonstrate the superiority of our algorithms running on GPU over their conventional counterparts in CPU. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index