Zobrazeno 1 - 10
of 36
pro vyhledávání: '"Carlo Bertolli"'
Autor:
Carlo Bertolli, Kevin O'Brien, Steven A. Wright, Stephen A. Jarvis, David Truby, Gheorghe-Teodor Bercea
With the diversification of HPC architectures beyond traditional CPU-based clusters, a number of new frameworks for performance portability across architectures have arisen. One way of implementing such frameworks is to use C++ templates and lambda e
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c26358b99dc650fba3bf95d988e05f06
Autor:
Carlo Bertolli, Alexey Bataev, Gheorghe-Teodor Bercea, John Kevin Patrick O'Brien, Alexandre E. Eichenberger
Publikováno v:
IBM Journal of Research and Development. 64:12:1-12:23
Programming models that use a higher level of abstraction to express parallelism can target both CPUs and any attached devices, alleviating the maintainability and portability concerns facing today's heterogenous systems. This article describes the d
Publikováno v:
2018 IEEE/ACM 5th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC).
Domain Specific Languages or Active Library frameworks have recently emerged as an important method for gaining performance portability, where an application can be efficiently executed on a wide range of HPC architectures without significant manual
Autor:
Carlo Bertolli, Hyojin Sung, Gheorghe-Teodor Bercea, Kevin O'Brien, Tian Jin, Georgios Rokos, Samuel Antao, Zehra Sura, Alexandre E. Eichenberger, Alexey Bataev, Tong Chen, Arpith C. Jacob
Publikováno v:
HiPC
Graphics Processing Units (GPUs) are increasingly used to accelerate portions of general-purpose applications. Higher level language extensions have been proposed to help non-experts bridge the gap between a host and the GPU's threading model. Recent
Autor:
Arpith C. Jacob, Tong Chen, Hyojin Sung, Alexey Bataev, Gheorghe-Teodor Bercea, Georgios Rokos, Carlo Bertolli, Alexandre E. Eichenberger, Kevin O'Brien
Publikováno v:
LLVM-HPC@SC
OpenMP is a shared memory programming model which supports the offloading of target regions to accelerators such as NVIDIA GPUs. The implementation in Clang/LLVM aims to deliver a generic GPU compilation toolchain that supports both the native CUDA C
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ec1469d07f332726914122594cfae2e8
Publikováno v:
Scaling OpenMP for Exascale Performance and Portability ISBN: 9783319655772
IWOMP
IWOMP
High Performance Computing is steadily embracing heterogeneous systems for supporting a wide variety of workloads. Currently there are two main sources of heterogeneity in compute nodes: (a) different compute elements such as multicore CPUs, GPUs, FP
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::60cea353d65de8468603401793c9697f
https://doi.org/10.1007/978-3-319-65578-9_1
https://doi.org/10.1007/978-3-319-65578-9_1
Autor:
Matt Martineau, Simon McIntosh-Smith, Carlo Bertolli, Arpith C. Jacob, Samuel F. Antao, Alexandre Eichenberger, Gheorghe-Teodor Bercea, Tong Chen, Tian Jin, Kevin O'Brien, Georgios Rokos, Hyojin Sung, Zehra Sura
Publikováno v:
2016 7th International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS).
Autor:
David Poliakoff, Samuel Antao, David F. Richards, Carlo Bertolli, Holger E. Jones, Ian Karlin, Arpith C. Jacob, Adam Kunen, Thomas R. W. Scogland, Bronis R. de Supinski, Jim Glosli, Gheorghe-Teodor Bercea, Alexandre E. Eichenberger, Erik W. Draeger
Publikováno v:
OpenMP: Memory, Devices, and Tasks ISBN: 9783319455495
IWOMP
IWOMP
Many application developers need code that runs efficiently on multiple architectures, but cannot afford to maintain architecturally specific codes. With the addition of target directives to support offload accelerators, OpenMP now has the machinery
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::bb6352dfe749f7ff93882a683a7fad3d
https://doi.org/10.1007/978-3-319-45550-1_20
https://doi.org/10.1007/978-3-319-45550-1_20
Publikováno v:
International Journal of Advanced Pervasive and Ubiquitous Computing. 2:24-38
Several complex and time-critical applications require the existence of novel distributed, heterogeneous and dynamic platforms composed of a variety of fixed and mobile processing nodes and networks. Such platforms, that can be called Pervasive Mobil
Autor:
Zehra Sura, Gheorghe-Teodor Bercea, Kevin O'Brien, Hyojin Sung, Arpith C. Jacob, Alexandre E. Eichenberger, Carlo Bertolli, Georgios Rokos, Samuel Antao, David Appelhans, Tong Chen
Publikováno v:
PMBS@SC
OpenMP provides high-level parallel abstractions for programing heterogeneous systems based on acceleration technology. Active areas of research are looking to characterise the performance that can be expected from even the simplest combinations of d