A general purpose contention manager for software transactions on the GPU

Autor:	Qi Shen, Albert Y. Zomaya, Gary Ushaw, Graham Morgan, Richard Davison, Rajiv Ranjan, Craig Sharp
Rok vydání:	2020
Předmět:	Computer Networks and Communications Computer science Graphics processing unit Transactional memory 020206 networking & telecommunications 02 engineering and technology Parallel computing Thread (computing) Theoretical Computer Science Rendering (computer graphics) Artificial Intelligence Hardware and Architecture Server Scalability 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Central processing unit General-purpose computing on graphics processing units Software
Zdroj:	Journal of Parallel and Distributed Computing. 139:1-17
ISSN:	0743-7315
DOI:	10.1016/j.jpdc.2019.12.018
Popis:	The Graphics Processing Unit (GPU) is now used extensively for general purpose GPU programming (GPGPU), allowing for greater exploitation of the multi-core model across many application domains. This is particularly true in cloud/edge/fog computing, where multiple GPU enabled servers support many different end user services. This move away from the naturally parallel domain of graphics can incur significant performance issues. Unlike the CPU, code that is hindered from execution due to blocking/waiting on the GPU can affect thousands of threads, rendering the advantages of a GPU irrelevant and reducing a highly parallel environment down to a serial one in the worst case. In this paper we present a solution that minimises blocking/waiting in GPGPU computing using a contention manager that offsets memory conflicts across threads through thread re-ordering. We consider conflicts of memory not only to avoid corruption (standard for transactional memory) but also in the semantic layer of application logic (e.g., enforcing ordering to ensure money drawn from bank account occurs after all deposits). We demonstrate how our approach is successful across a number of industry benchmarks and compare our approach to the only other related solution. We also demonstrate that our approach is scalable in terms of thread numbers (a key requirement on the GPU). We believe this is the first work of its kind demonstrating a generalised conflict and semantic contention manager suitable for the scale of parallel execution found on a GPU.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::955458b6b1f47721c0345016bb77629d https://doi.org/10.1016/j.jpdc.2019.12.018 Zobrazit plný text záznamu Full Text from ScienceDirect