Zobrazeno 1 - 10
of 14
pro vyhledávání: '"Ettore Tiotto"'
Autor:
Victor Perez, Ettore Tiotto, Whitney Tsang, Arnamoy Bhattacharyya, Lukas Sommer, Victor Lomüller, Jefferson Le Quellec, James Brodman
Publikováno v:
International Workshop on OpenCL.
Publikováno v:
ACM Transactions on Architecture and Code Optimization. 16:1-26
Iteration Point Difference Analysis is a new static analysis framework that can be used to determine the memory coalescing characteristics of parallel loops that target GPU offloading and to ascertain safety and profitability of loop transformations
Publikováno v:
IPDPS Workshops
Automating the device selection in heterogeneous computing platforms requires the modelling of performance both on CPUs and on accelerators. This work argues for the use of a hybrid analytical performance modelling approach is a practical way to buil
Publikováno v:
Recercat. Dipósit de la Recerca de Catalunya
instname
UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)
instname
UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)
We improve performance of fine-grain UPC applications by orders of magnitude.We introduce a novel shared-data localization transformation.We present a thorough performance analysis and evaluation.We show that reducing run-time calls is crucial for pe
Publikováno v:
IBM Journal of Research and Development. 64:14:1-14:11
Ability to efficiently offload computational workloads to graphic processing units (GPUs) is critical for the success of hybrid CPU–GPU architectures, such as the Summit and Sierra supercomputing systems. OpenMP 4.5 is a high-level programming mode
Publikováno v:
ICPP Workshops
In modern supercomputers, nodes are connected by networking hardware capable of up to 40 Gb/s. Data compression could allow for even higher effective bandwidth. However, data compression for such systems requires a unique tradeoff between the compres
Publikováno v:
2016 Third Workshop on Accelerator Programming Using Directives (WACCPD).
Publikováno v:
Languages and Compilers for Parallel Computing ISBN: 9783319174723
LCPC
LCPC
Partitioned Global Address Space (PGAS) languages are a popular alternative when building applications to run on large scale parallel machines. Unified Parallel C (UPC) is a well known PGAS language that is available on most high performance computin
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::64414713a93b40b45ac48b46d8b43bcc
https://doi.org/10.1007/978-3-319-17473-0_13
https://doi.org/10.1007/978-3-319-17473-0_13
Publikováno v:
International Journal of High Performance Computing and Networking. 1:1
Publikováno v:
SBAC-PAD
UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)
Recercat. Dipósit de la Recerca de Catalunya
Universitat Jaume I
UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)
Recercat. Dipósit de la Recerca de Catalunya
Universitat Jaume I
Programs written in Partitioned Global Address Space (PGAS) languages can access any location of the entire address space via standard read/write operations. However, the compiler have to create the communication mechanisms and the runtime system to