Zobrazeno 1 - 10
of 32
pro vyhledávání: '"Joachim Protze"'
Autor:
Alexander Huck, Sebastian Kreutzer, Joachim Protze, Jan-Patrick Lehr, Christian Bischof, Christian Terboven, Matthias S. Muller
Publikováno v:
IT Professional. 24:45-51
Publikováno v:
2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness).
Autor:
Tim Jammer, Alexander Hück, Jan-Patrick Lehr, Joachim Protze, Simon Schwitanski, Christian Bischof
Publikováno v:
EuroMPI/USA'22: 29th European MPI Users' Group Meeting.
Publikováno v:
Euro-Par 2022: Parallel Processing ISBN: 9783031125966
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::dd50ec5da6eff9d342da77c47ecc42c9
https://doi.org/10.1007/978-3-031-12597-3_5
https://doi.org/10.1007/978-3-031-12597-3_5
Autor:
Miwako Tsuji, Matthias S. Müller, Serge G. Petiton, Christian Terboven, Thomas Dufaud, Nahid Emad, Hitoshi Murai, Mitsuhisa Sato, Joachim Protze, Taisuke Boku
Publikováno v:
XcalableMP PGAS Programming Language
XcalableMP PGAS Programming Language, Springer Singapore, pp.219-243, 2021, ⟨10.1007/978-981-15-7683-6_9⟩
XcalableMP PGAS Programming Language ISBN: 9789811576829
XcalableMP PGAS Programming Language, Springer Singapore, pp.219-243, 2021, ⟨10.1007/978-981-15-7683-6_9⟩
XcalableMP PGAS Programming Language ISBN: 9789811576829
This chapter describes a multi-SPMD (mSPMD) programming model and a set of software and libraries to support the mSPMD programming model. The mSPMD programming model has been proposed to realize scalable applications on huge and hierarchical systems.
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1244300ad6e427c9addc87d0f19b3a4f
https://hal.archives-ouvertes.fr/hal-03429306
https://hal.archives-ouvertes.fr/hal-03429306
Pruners : Providing reproducibility for uncovering non-deterministic errors in runs on supercomputers
Autor:
Martin Schulz, Ganesh Gopalakrishnan, Michael Bentley, Zvonimir Rakamarić, Christopher M. Chambreau, Gregory L. Lee, Ignacio Laguna, Simone Atzeni, Dong H. Ahn, Geof Sawaya, Kento Sato, Joachim Protze
Publikováno v:
The International Journal of High Performance Computing Applications. 33:777-783
Large scientific simulations must be able to achieve the full-system potential of supercomputers. When they tap into high-performance features, however, a phenomenon known as non-determinism may be introduced in their program execution, which signifi
Publikováno v:
IPDPS
From OpenMP 4.0 onwards, programmers can offload code regions to accelerators by using the target offloading feature. However, incorrect usage of target offloading constructs may incur data mapping issues. A data mapping issue occurs when the host fa
Autor:
Denis Barthou, Van Man Nguyen, Emmanuelle Saillard, Joachim Protze, Patrick Carribault, Matthias S. Müller, Julien Jaeger, Marc-André Hermanns
Publikováno v:
Parallel Computing
Parallel Computing, 2022, 109, pp.102859. ⟨10.1016/j.parco.2021.102859⟩
Parallel Computing, 2022, 109, pp.102859. ⟨10.1016/j.parco.2021.102859⟩
When aiming for large-scale parallel computing, waiting time due to network latency, synchronization, and load imbalance are the primary opponents of high parallel efficiency. A common approach to hide latency with computation is the use of non-block
Autor:
Joachim Protze, Christian Terboven, Alexander Hück, Christian Bischof, Matthias S. Müller, Jan-Patrick Lehr
Publikováno v:
CORRECTNESS@SC
2020 IEEE/ACM 4th International Workshop on Software Correctness for HPC Applications (Correctness)
2020 IEEE/ACM 4th International Workshop on Software Correctness for HPC Applications (Correctness)
Algorithmic Differentiation (AD) is a set of techniques to calculate derivatives of a computer program. In C++, AD typically requires (i) a type change of the built-in double, and (ii) a replacement of all MPI calls with AD-specific implementations.
Publikováno v:
EuroMPI
27th European MPI Users' Group Meeting
27th European MPI Users' Group Meeting
When aiming for large scale parallel computing, waiting time due to network latency, synchronization, and load imbalance are the primary opponents of high parallel efficiency. A common approach to hide latency with computation is the use of non-block