Výsledky vyhledávání - "Abdelhalim Amer"

Analyzing the Performance Trade-Off in Implementing User-Level Threads

Autor: Kenjiro Taura, Pavan Balaji, Shintaro Iwasaki, Abdelhalim Amer

Publikováno v: IEEE Transactions on Parallel and Distributed Systems. 31:1859-1877

User-level threads have been widely adopted as a means of achieving lightweight concurrent execution without the costs of OS-level threads. Nevertheless, the costs of managing user-level threads represent a performance barrier that dictates how fine

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::5fbe085ac70f75c1da997ab3bfeb527e
https://doi.org/10.1109/tpds.2020.2976057

Zobrazit plný text záznamu

Efficient Abortable-locking Protocol for Multi-level NUMA Systems

Autor: Xu Liu, Milind Chabbi, Abdelhalim Amer

Publikováno v: ACM Transactions on Parallel Computing. 7:1-32

The popularity of Non-Uniform Memory Access (NUMA) architectures has led to numerous locality-preserving hierarchical lock designs, such as HCLH, HMCS, and cohort locks. Locality-preserving locks trade fairness for higher throughput. Hence, some inst

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::2e8ea1ee413316fda73fc7d4d89a383f
https://doi.org/10.1145/3399728

Zobrazit plný text záznamu

Lock Contention Management in Multithreaded MPI

Autor: Yanjie Wei, Jeff R. Hammond, Milind Chabbi, Huiwei Lu, Satoshi Matsuoka, Pavan Balaji, Abdelhalim Amer

Publikováno v: ACM Transactions on Parallel Computing. 5:1-21

In this article, we investigate contention management in lock-based thread-safe MPI libraries. Specifically, we make two assumptions: (1) locks are the only form of synchronization when protecting communication paths; and (2) contention occurs, and t

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::ba5528db09ed9faae2fb4cd60bdf2388
https://doi.org/10.1145/3275443

Zobrazit plný text záznamu

BOLT: Optimizing OpenMP Parallel Regions with User-Level Threads

Autor: Shintaro Iwasaki, Sangmin Seo, Kenjiro Taura, Pavan Balaji, Abdelhalim Amer

Publikováno v: PACT

OpenMP is widely used by a number of applications, computational libraries, and runtime systems. As a result, multiple levels of the software stack use OpenMP independently of one another, often leading to nested parallel regions. Although exploiting

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::dd5eb506ff8001b9d19696e7b84bb0b0
https://doi.org/10.1109/pact.2019.00011

Zobrazit plný text záznamu

Software combining to mitigate multithreaded MPI contention

Autor: Shintaro Iwasaki, Chongxiao Cao, Charles J. Archer, Hajime Fujita, Yanfei Guo, Pavan Balaji, Min Si, Kenjiro Taura, Jeff R. Hammond, Kenneth Raffenetti, Sagar Thapaliya, María Jesús Garzarán, Mikhail Shiryaev, Michael Chuvelev, Abdelhalim Amer, Michael Alan Blocksome

Publikováno v: ICS

Efforts to mitigate lock contention from concurrent threaded accesses to MPI have reduced contention through fine-grained locking, avoided locking altogether by offloading communication to dedicated threads, or alleviated negative side effects from c

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::8a069d36086dcc42303270b301dd1576
https://doi.org/10.1145/3330345.3330378

Zobrazit plný text záznamu

Lessons Learned from Analyzing Dynamic Promotion for User-Level Threading

Autor: Shintaro Iwasaki, Abdelhalim Amer, Kenjiro Taura, Pavan Balaji

Publikováno v: SC18: International Conference for High Performance Computing, Networking, Storage and Analysis.

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::bc2f5288c20f1a81b6cf931b5a7df60d
https://doi.org/10.1109/sc.2018.00026

Zobrazit plný text záznamu

Argobots: A Lightweight Low-Level Threading and Tasking Framework

Publikováno v: IEEE Transactions on Parallel and Distributed Systems
IEEE Transactions on Parallel and Distributed Systems, 2018, 29 (3), pp.512-526. ⟨10.1109/TPDS.2017.2766062⟩
Repositori Universitat Jaume I
Universitat Jaume I
IEEE Transactions on Parallel and Distributed Systems, Institute of Electrical and Electronics Engineers, 2018, 29 (3), pp.512-526. ⟨10.1109/TPDS.2017.2766062⟩

International audience; In the past few decades, a number of user-level threading and tasking models have been proposed in the literature to address the shortcomings of OS-level threads, primarily with respect to cost and flexibility. Current state-o

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::bfe58a89003fa42a3cf17b1bdeec0255
https://inria.hal.science/hal-01887586

Zobrazit plný text záznamu

Why is MPI so slow?

Publikováno v: SC

This paper provides an in-depth analysis of the software overheads in the MPI performance-critical path and exposes mandatory performance overheads that are unavoidable based on the MPI-3.1 specification. We first present a highly optimized implement

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::239bcee1cca243a59b43e4761829cc19
https://doi.org/10.1145/3126908.3126963

Zobrazit plný text záznamu

Advanced Thread Synchronization for Multithreaded MPI Implementations

Autor: Sangmin Seo, Hoang-Vu Dang, Pavan Balaji, Abdelhalim Amer

Publikováno v: CCGrid

Concurrent multithreaded access to the Message Passing Interface (MPI) is gaining importance to support emerging hybrid MPI applications. The interoperability between threads and MPI, however, is complex and renders efficient implementations nontrivi

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::ed17a6b06737e1883fdd0d2c9bbdf703
https://doi.org/10.1109/ccgrid.2017.65

Zobrazit plný text záznamu

An Efficient Abortable-locking Protocol for Multi-level NUMA Systems

Autor: Shasha Wen, Milind Chabbi, Xu Liu, Abdelhalim Amer

Publikováno v: PPOPP

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::0cebfeb012830393b7eae1e1d915bfeb
https://doi.org/10.1145/3018743.3018768

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání