Search results - "Prashant Singh Rawat"

Authors: Aravind Sukumaran-Rajam, P. Sadayappan, Atanas Rountev, Louis-Noël Pouchet, Prashant Singh Rawat, Miheer Vaidya

Published in: IPDPS

Stencil computations are often the compute-intensive kernel in many scientific applications. With the increasing demand for computational accuracy, and the emergence of massively data-parallel high-bandwidth architectures like GPUs, stencils have ste

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::61883daaa84a71ee9eb5b68a44a3ca48
https://doi.org/10.1109/ipdps.2019.00073

Efficient Cache Simulation for Affine Computations

Authors: P. Sadayappan, Sriram Krishnamoorthy, Wenlei Bao, Martin Kong, Louis-Noël Pouchet, Prashant Singh Rawat

Published in: Languages and Compilers for Parallel Computing ISBN: 9783030352240
LCPC

Trace-based cache simulation is a common technique in design space exploration. In this paper, we develop an efficient strategy to simulate cache behavior for affine computations. Our framework exploits the regularity of polyhedral programs to implem

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::b0bfd409b035b40aa83dde3aa28db772
https://doi.org/10.1007/978-3-030-35225-7_6

POSTER: Statement Reordering to Alleviate Register Pressure for Stencils on GPUs

Authors: Aravind Sukumaran-Rajam, Fabrice Rastello, P. Sadayappan, Atanas Rountev, Prashant Singh Rawat, Louis-Noël Pouchet

Published in: PACT

Compute-intensive GPU architectures allow the use of high-order 3D stencils for better computational accuracy. These stencils are usually compute-bound. While current state-of-the-art register allocators are satisfactory for most applications, they a

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::e8d437a413f7dc3e9f77007b7c522b03
https://doi.org/10.1109/pact.2017.40

Performance modeling for GPUs using abstract kernel emulation

Authors: Louis-Noël Pouchet, Prashant Singh Rawat, Jinsung Kim, Fabrice Rastello, Changwan Hong, Sriram Krishnamoorthy, P. Sadayappan, Aravind Sukumaran-Rajam

Published in: PPOPP

Performance modeling of GPU kernels is a significant challenge. In this paper, we develop a novel approach to performance modeling for GPUs through abstract kernel emulation along with latency/gap modeling of resources. Experimental results on all be

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::0ee1b2bc9f4c84d9c6b60b9553ef0b78
https://doi.org/10.1145/3200691.3178524

Resource Conscious Reuse-Driven Tiling for GPUs

Authors: Vinod Grover, Atanas Rountev, Louis-Noël Pouchet, P. Sadayappan, Changwan Hong, Prashant Singh Rawat, Mahesh Ravishankar

Published in: PACT

Computations involving successive application of 3D stencil operators are widely used in many application domains, such as image processing, computational electromagnetics, seismic processing, and climate modeling. Enhancement of temporal and spatial

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::dd3485de87a570278f84ebd7a51e4724
https://doi.org/10.1145/2967938.2967967

Effective resource management for enhancing performance of 2D and 3D stencils on GPUs

Authors: Vinod Grover, Mahesh Ravishankar, P. Sadayappan, Louis-Noël Pouchet, Prashant Singh Rawat, Changwan Hong

Published in: GPGPU@PPoPP

GPUs are an attractive target for data parallel stencil computations prevalent in scientific computing and image processing applications. Many tiling schemes, such as overlapped tiling and split tiling, have been proposed in the past to improve the perfo

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::255b9b270bb80a0ec810b5dde7fba491
https://doi.org/10.1145/2884045.2884047

SDSLc

Authors: Prashant Singh Rawat, Kevin Stock, Thomas Henretty, Martin Kong, J. Ramanujam, Justin Holewinski, P. Sadayappan, Atanas Rountev, Louis-Noël Pouchet

Published in: WOLFHPC@SC

Stencil computations are at the core of applications in a number of scientific computing domains. We describe a domain-specific language for regular stencil computations that allows specification of the computations in a concise manner. We describe a

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::62f4c8112e6518218f2c724e0833785b
https://doi.org/10.1145/2830018.2830025

Liveness-Based Pointer Analysis

Authors: Prashant Singh Rawat, Alan Mycroft, Uday P. Khedker

Published in: Static Analysis ISBN: 9783642331244
SAS

Precise flow- and context-sensitive pointer analysis (FCPA) is generally considered prohibitively expensive for large programs; most tools relax one or both of the requirements for scalability. We argue that precise FCPA has been over-harshly judged-

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::ea5bc3df51a5726bbc9f431cd25c569d
https://doi.org/10.1007/978-3-642-33125-1_19

GPU Code Optimization using Abstract Kernel Emulation and Sensitivity Analysis

Authors: Jinsung Kim, Prashant Singh Rawat, P. Sadayappan, Louis-Noël Pouchet, Aravind Sukumaran-Rajam, Changwan Hong, Sriram Krishnamoorthy, Fabrice Rastello

Published in: BASE-Bielefeld Academic Search Engine
PLDI 2018-39th ACM SIGPLAN Conference on Programming Language Design and Implementation, Jun 2018, Philadelphia, United States. pp.736-751, ⟨10.1145/3192366.3192397⟩
PLDI

In this paper, we develop an approach to GPU kernel optimization by focusing on identification of bottleneck resources and determining optimization parameters that can alleviate the bottleneck. Performance modeling for GPUs is

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::31a604e327aa30bbd8f6e52385be9159
https://hal.inria.fr/hal-01955475
