Algorithm for automatic loop parallelization for graphics processing units
Autor: | O.G. Beketov, А.Yu. Doroshenko, Olena Yatsenko |
---|---|
Rok vydání: | 2017 |
Předmět: |
Statement (computer science)
For loop Loop optimization Multi-core processor 021103 operations research Computer science 0211 other engineering and technologies Graphics processing unit Event loop 02 engineering and technology Parallel computing 01 natural sciences 010104 statistics & probability Rewriting 0101 mathematics Graphics |
Zdroj: | PROBLEMS IN PROGRAMMING. :028-036 |
ISSN: | 1727-4907 |
DOI: | 10.15407/pp2017.04.028 |
Popis: | Parallelization of loop operators is a long standing problem of parallel programming. The widespread use of graphics processing units for computational tasks has resulted in the new statement of the mentioned problem for this class of multicore systems. The purpose of this work is to improve the mechanism of transformation of cyclic operators for loop parallelization for execution on a graphics processing unit. Software tool for computation optimization that allows to parallelize cyclic operators semi‑automatically was developed. Data bufferization synchronized with main loop execution was implemented, and the software tool using the rewriting rules system TermWare was built and integrated with the toolkit for design and synthesis of programs IDS. The developed system was tested using heterogeneous multicore cluster. The advantages of the developed system in comparison with well-known parallelization system Par4All consist in processing speed and the possibility of processing of data amounts exceeding the amount of memory of a graphics processing unit, and also the ability to use several graphics processing units simultaneously. The developed system was applied for parallelization of a serial loop, which is the part of a numerical weather forecasting program. Problems in programming 2017; 4: 028-036 |
Databáze: | OpenAIRE |
Externí odkaz: |