Výsledky vyhledávání - "Mohammed Sourouri"

Memory Bandwidth Contention: Communication vs Computation Tradeoffs in Supercomputers with Multicore Architectures

Autor: Johannes Langguth, Mohammed Sourouri, Xing Cai

Publikováno v: ICPADS

We study the problem of contention for memory bandwidth between computation and communication in supercomputers that feature multicore CPUs. The problem arises when communication and computation are overlapped, and both operations compete for the sam

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::e1a41a7e69dd419d1a9605569f89ed0b
https://doi.org/10.1109/padsw.2018.8644601

Zobrazit plný text záznamu

Towards fine-grained dynamic tuning of HPC applications on modern multi-core architectures

Autor: Nico Reissmann, Mohammed Sourouri, Per Gunnar Kjeldsberg, Johannes Langguth, Espen Birger Raknes, Daniel Hackenberg, Robert Schöne

Publikováno v: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on -SC '17
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on-SC 17
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
SC

There is a consensus that exascale systems should operate within a power envelope of 20MW. Consequently, energy conservation is still considered as the most crucial constraint if such systems are to be realized. So far, most research on this topic ha

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d4f6623a0f13a7d1914865a76dcc7ba0

Zobrazit plný text záznamu

On the performance and energy efficiency of the PGAS programming model on multicore architectures

Autor: Phuong Hoai Ha, Johannes Langguth, Xing Cai, Mohammed Sourouri, Jérémie Lagravière

Publikováno v: 2016 International Conference on High Performance Computing & Simulation (HPCS)

Using large-scale multicore systems to get the maximum performance and energy efficiency with manageable programmability is a major challenge. The partitioned global address space (PGAS) programming model enhances programmability by providing a globa

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ba55b24de9a400bf5df1177a1333311e
https://doi.org/10.1109/hpcsim.2016.7568416

Zobrazit plný text záznamu

A New Parallel 3D Front Propagation Algorithm for Fast Simulation of Geological folds

Autor: Mohammed Sourouri, Xing Cai, Tor Gillberg

Publikováno v: ICCS

We present a novel method for 3D anisotropic front propagation and apply it to the simulation of geological folding. The new iterative algorithm has a simple structure and abundant parallelism, and is easily adapted to multithreaded architectures usi

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::3c818e96baebb9b993b84b69ca348214

Zobrazit plný text záznamu

CPU+GPU Programming of Stencil Computations for Resource-Efficient Use of GPU Clusters

Autor: Johannes Langguth, Xing Cai, Mohammed Sourouri, Scott B. Baden, Filippo Spiga

Publikováno v: CSE

On modern GPU clusters, the role of the CPUs is often restricted to controlling the GPUs and handling MPI communication. The unused computing power of the CPUs, however, can be considerable for computations whose performance is bounded by memory traf

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::a0b47532c075b418f1e3ab7d68962673
https://doi.org/10.1109/cse.2015.33

Zobrazit plný text záznamu

Scalable Heterogeneous CPU-GPU Computations for Unstructured Tetrahedral Meshes

Autor: Mohammed Sourouri, Johannes Langguth, Xing Cai, Glenn T. Lines, Scott B. Baden

Publikováno v: Langguth, J; Sourouri, M; Lines, GT; Baden, SB; & Cai, X. (2015). Scalable Heterogeneous CPU-GPU Computations for Unstructured Tetrahedral Meshes. IEEE Micro, 35(4), 6-15. doi: 10.1109/MM.2015.70. UC San Diego: Retrieved from: http://www.escholarship.org/uc/item/70x7h2mk
IEEE Micro, vol 35, iss 4

© 1981-2012 IEEE. A recent trend in modern high-performance computing environments is the introduction of powerful, energy-efficient hardware accelerators such as GPUs and Xeon Phi coprocessors. These specialized computing devices coexist with CPUs

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::8531ef43ed231247a933338f0167d767
http://www.escholarship.org/uc/item/70x7h2mk

Zobrazit plný text záznamu

Effective multi-GPU communication using multiple CUDA streams and threads

Autor: Mohammed Sourouri, Xing Cai, Tor Gillberg, Scott B. Baden

Publikováno v: ICPADS

In the context of multiple GPUs that share the same PCIe bus, we propose a new communication scheme that leads to a more effective overlap of communication and computation. Multiple CUDA streams and OpenMP threads are adopted so that data can simulta

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::5e1c8a1ee13b182fe768c061abd9bd3c
https://doi.org/10.1109/padsw.2014.7097919

Zobrazit plný text záznamu

The READEX formalism for automatic tuning for energy efficiency

Publikováno v: Computing

Energy efficiency is an important aspect of future exascale systems, mainly due to rising energy cost. Although High performance computing (HPC) applications are compute centric, they still exhibit varying computational characteristics in different r

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::e4e44c957c907025b3985b8c0ebb4c53

Zobrazit plný text záznamu

Parallel solutions of static Hamilton-Jacobi equations for simulations of geological folds

Autor: Are Magnus Bruaset, Mohammed Sourouri, Øyvind Hjelle, Tor Gillberg

Publikováno v: Journal of Mathematics in Industry. 4(1):10

Two new algorithms for numerical solution of static Hamilton-Jacobi equations are presented. These algorithms are designed to work efficiently on different parallel computing architectures, and numerical results for multicore CPU and GPU implementati

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::eb38148966bd397662d27a0ea7877b4c

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání