CuWide: Towards Efficient Flow-Based Training for Sparse Wide Models on GPUs

Autor:	Lele Yu, Xupeng Miao, Zhi Yang, Lingxiao Ma, Jiawei Jiang, Bin Cui, Yingxia Shao
Rok vydání:	2022
Předmět:	Schema (genetic algorithms) High memory Computational Theory and Mathematics Memory hierarchy Computer science Property (programming) Computation Bandwidth (signal processing) Locality of reference Parallel computing Performance improvement Computer Science Applications Information Systems
Zdroj:	IEEE Transactions on Knowledge and Data Engineering. 34:4119-4132
ISSN:	2326-3865 1041-4347
DOI:	10.1109/tkde.2020.3038109
Popis:	Wide models such as generalized linear models and factorization-based models have been extensively used in various predictive applications, e.g., recommendation systems. Due to the memory bounded property of the models, the performance improvement on CPU is reaching the limitation. GPU is known to have many computation units and high memory bandwidth, and becomes a promising platform for training machine learning models. However, the GPU training for the wide models is far from optimal due to the sparsity and irregularity in wide models. The existing GPU-based wide models are even slower than the ones using CPU. The classical training schema of the wide models does not optimized for the GPU architecture, which suffers from large amount of random memory accesses and redundant read/write of intermediate values. In this paper, we propose an efficient GPU-training framework for the large-scale wide models, named cuWide. To fully benefit from the memory hierarchy of GPU, cuWide applies a new flow-based schema for training, which leverages the spatial and temporal locality of wide models to drastically reduce the amount of communication with GPU global memory. We show that cuWide can be up to more than 20° faster than the state-of-the-art GPU solutions and multi-core CPU solutions.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::3d6407e7713e52cf1edaf2c313f11069 https://doi.org/10.1109/tkde.2020.3038109 Zobrazit plný text záznamu