Partitioning GPUs for Improved Scalability
Autor: | David Black-Schaffer, Johan Janzen, Andra Hugo |
---|---|
Rok vydání: | 2016 |
Předmět: |
Computer science
Parallel computing 01 natural sciences 010101 applied mathematics Task (computing) 0103 physical sciences Scalability Parallelism (grammar) Overall performance 0101 mathematics General-purpose computing on graphics processing units 010303 astronomy & astrophysics Throughput (business) Cholesky decomposition Block (data storage) |
Zdroj: | SBAC-PAD |
DOI: | 10.1109/sbac-pad.2016.14 |
Popis: | To port applications to GPUs, developers need to express computational tasks as highly parallel executions with tens of thousands of threads to fill the GPU's compute resources. However, while this will fill the GPU's resources, it does not necessarily deliver the best efficiency, as the task may scale poorly when run with sufficient parallelism to fill the GPU. In this work we investigate how we can improve throughput by co-scheduling poorly-scaling tasks on sub-partitions of the GPU to increase utilization efficiency. We first investigate the scalability of typical HPC tasks on GPUs, and then use this insight to improve throughput by extending the StarPU framework to co-schedule tasks on the GPU. We demonstrate that co-scheduling poorly-scaling GPU tasks accelerates the execution of the critical tasks of a Cholesky Factorization and improves the overall performance of the application by 9% across a wide range of block sizes. |
Databáze: | OpenAIRE |
Externí odkaz: |