Showing 1 - 10 of 30 for search: '"Marc Gamell"'
Authors:
Philip E. Davis, Shaohua Duan, Keita Teranishi, Pradeep Subedi, Manish Parashar, Hemanth Kolla, Marc Gamell
Published in:
ACM Transactions on Parallel Computing. 7:1-29
The dramatic increase in the scale of current and planned high-end HPC systems is leading to new challenges, such as the growing costs of data movement and IO, and the reduced mean time between failures (MTBF) of system components. In-situ workflows, i.
Authors:
Manish Parashar, Hemanth Kolla, Michael A. Heroux, Jacqueline H. Chen, Keita Teranishi, Marc Gamell, Jackson R. Mayo
Published in:
IEEE Transactions on Parallel and Distributed Systems. 28:2881-2895
Obtaining multi-process hard failure resilience at the application level is a key challenge that must be overcome before the promise of exascale can be fully realized. Previous work has shown that online global recovery can dramatically reduce the ov
Authors:
Hemanth Kolla, Michael A. Heroux, Jacqueline H. Chen, Marc Gamell, Jackson R. Mayo, Manish Parashar, Keita Teranishi
Published in:
SIAM Journal on Scientific Computing. 39:S347-S378
In order to achieve exascale systems, application resilience needs to be addressed. Some programming models, such as task-DAG (directed acyclic graphs) architectures, currently embed resilience features whereas traditional SPMD (single program, multi
Authors:
Marc Gamell Balmana, Rashid Kaleem, Alexander Sannikov, María Jesús Garzarán, Dmitry Durnov, Akhil Langer, Surabhi Jain
Published in:
SC
Collective operations are used in MPI programs to express common communication patterns, collective computations, or synchronization. In many collectives, such as MPI_Allreduce, the intra-node component of the collective lies on the critical path, as
Authors:
Manish Parashar, Pradeep Subedi, Keita Teranishi, Hemanth Kolla, Philip E. Davis, Shaohua Duan, Marc Gamell
Published in:
IPDPS
The dramatic increase in the scale of current and planned high-end HPC systems is leading to new challenges, such as the growing costs of data movement and IO, and the reduced mean times between failures (MTBF) of system components. In-situ workflows, i
Authors:
Manish Parashar, Keita Teranishi, Daniel S. Katz, Rob F. Van der Wijngaart, Marc Gamell, Michael A. Heroux, Timothy G. Mattson
Published in:
ICPP Workshops
Exascale systems promise the potential for computation at unprecedented scales and resolutions, but achieving exascale by the end of this decade presents significant challenges. A key challenge is due to the very large number of cores and components and
Authors:
Ivan Rodero, Manish Parashar, Marc Gamell, Hariharasudhan Viswanathan, Dario Pompili, Eun Kyung Lee
Published in:
Journal of Grid Computing. 10:447-473
Virtualized datacenters and clouds are being increasingly considered for traditional High-Performance Computing (HPC) workloads that have typically targeted Grids and conventional HPC platforms. However, maximizing energy efficiency and utilization o
Authors:
Keita Teranishi, Hemanth Kolla, Michael A. Heroux, Marc Gamell, Jacqueline H. Chen, Manish Parashar, Jackson R. Mayo
Published in:
SC
Application resilience is a key challenge that has to be addressed to realize the exascale vision. Online recovery, even when it involves all processes, can dramatically reduce the overhead of failures as compared to the more traditional approach whe
Authors:
George Bosilca, Manish Parashar, Keita Teranishi, Thomas Herault, Marc Gamell, Jack Dongarra, Aurelien Bouteiller
Published in:
SC
The ability to consistently handle faults in a distributed environment requires, among a small set of basic routines, an agreement algorithm allowing surviving entities to reach a consensual decision between a bounded set of volatile resources. This