Domain-Specific Optimization and Generation of High-Performance GPU Code for Stencil Computations.

Autor: Rawat, Prashant Singh, Vaidya, Miheer, Sukumaran-Rajam, Aravind, Ravishankar, Mahesh, Grover, Vinod, Rountev, Atanas, Pouchet, Louis-Noel, Sadayappan, P.
Předmět:
Zdroj: Proceedings of the IEEE; Nov2018, Vol. 106 Issue 11, p1902-1920, 19p
Abstrakt: Stencil computations arise in a number of computational domains. They exhibit significant data parallelism and are thus well suited for execution on graphical processing units (GPUs), but can be memory-bandwidth limited unless temporal locality is utilized via tiling. This paper describes how effective tiled code can be generated for GPUs from a domain-specific language (DSL) for stencils. Experimental results demonstrate the benefits of such a domain-specific optimization approach over state-of-the-art general-purpose compiler optimizations. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index