Showing 1 - 10 of 11 for search: '"Prashant Singh Rawat"'
Author:
Prashant Singh Rawat, Atanas Rountev, Mahesh Ravishankar, P. Sadayappan, Vinod Grover, Aravind Sukumaran-Rajam, Miheer Vaidya, Louis-Noël Pouchet
Published in:
Proceedings of the IEEE. 106:1902-1920
Stencil computations arise in a number of computational domains. They exhibit significant data parallelism and are thus well suited for execution on graphical processing units (GPUs), but can be memory-bandwidth limited unless temporal locality is ut
Author:
Aravind Sukumaran-Rajam, P. Sadayappan, Atanas Rountev, Louis-Noël Pouchet, Prashant Singh Rawat, Miheer Vaidya
Published in:
IPDPS
Stencil computations are often the compute-intensive kernel in many scientific applications. With the increasing demand for computational accuracy, and the emergence of massively data-parallel high-bandwidth architectures like GPUs, stencils have ste
Author:
P. Sadayappan, Sriram Krishnamoorthy, Wenlei Bao, Martin Kong, Louis-Noël Pouchet, Prashant Singh Rawat
Published in:
Languages and Compilers for Parallel Computing ISBN: 9783030352240
LCPC
Trace-based cache simulation is a common technique in design space exploration. In this paper, we develop an efficient strategy to simulate cache behavior for affine computations. Our framework exploits the regularity of polyhedral programs to implem
External link:
https://explore.openaire.eu/search/publication?articleId=doi_________::b0bfd409b035b40aa83dde3aa28db772
https://doi.org/10.1007/978-3-030-35225-7_6
Author:
Aravind Sukumaran-Rajam, Fabrice Rastello, P. Sadayappan, Atanas Rountev, Prashant Singh Rawat, Louis-Noël Pouchet
Published in:
PACT
Compute-intensive GPU architectures allow the use of high-order 3D stencils for better computational accuracy. These stencils are usually compute-bound. While current state-of-the-art register allocators are satisfactory for most applications, they a
Author:
Louis-Noël Pouchet, Prashant Singh Rawat, Jinsung Kim, Fabrice Rastello, Changwan Hong, Sriram Krishnamoorthy, P. Sadayappan, Aravind Sukumaran-Rajam
Published in:
PPOPP
Performance modeling of GPU kernels is a significant challenge. In this paper, we develop a novel approach to performance modeling for GPUs through abstract kernel emulation along with latency/gap modeling of resources. Experimental results on all be
Author:
Vinod Grover, Atanas Rountev, Louis-Noël Pouchet, P. Sadayappan, Changwan Hong, Prashant Singh Rawat, Mahesh Ravishankar
Published in:
PACT
Computations involving successive application of 3D stencil operators are widely used in many application domains, such as image processing, computational electromagnetics, seismic processing, and climate modeling. Enhancement of temporal and spatial
Author:
Vinod Grover, Mahesh Ravishankar, P. Sadayappan, Louis-Noël Pouchet, Prashant Singh Rawat, Changwan Hong
Published in:
GPGPU@PPoPP
GPUs are an attractive target for data-parallel stencil computations prevalent in scientific computing and image processing applications. Many tiling schemes, such as overlapped tiling and split tiling, have been proposed in the past to improve the perfo
Author:
Prashant Singh Rawat, Kevin Stock, Thomas Henretty, Martin Kong, J. Ramanujam, Justin Holewinski, P. Sadayappan, Atanas Rountev, Louis-Noël Pouchet
Published in:
WOLFHPC@SC
Stencil computations are at the core of applications in a number of scientific computing domains. We describe a domain-specific language for regular stencil computations that allows specification of the computations in a concise manner. We describe a
Published in:
Static Analysis ISBN: 9783642331244
SAS
Precise flow- and context-sensitive pointer analysis (FCPA) is generally considered prohibitively expensive for large programs; most tools relax one or both of the requirements for scalability. We argue that precise FCPA has been over-harshly judged-
External link:
https://explore.openaire.eu/search/publication?articleId=doi_________::ea5bc3df51a5726bbc9f431cd25c569d
https://doi.org/10.1007/978-3-642-33125-1_19
Author:
Jinsung Kim, Prashant Singh Rawat, P. Sadayappan, Louis-Noël Pouchet, Aravind Sukumaran-Rajam, Changwan Hong, Sriram Krishnamoorthy, Fabrice Rastello
Published in:
BASE-Bielefeld Academic Search Engine
PLDI 2018-39th ACM SIGPLAN Conference on Programming Language Design and Implementation
PLDI 2018-39th ACM SIGPLAN Conference on Programming Language Design and Implementation, Jun 2018, Philadelphia, United States. pp.736-751, ⟨10.1145/3192366.3192397⟩
PLDI
In this paper, we develop an approach to GPU kernel optimization by focusing on identification of bottleneck resources and determining optimization parameters that can alleviate the bottleneck. Performance modeling for GPUs is
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::31a604e327aa30bbd8f6e52385be9159
https://hal.inria.fr/hal-01955475