Showing 1 - 10 of 94 for search: '"Gengbin Zheng"'
Published in:
EuroMPI
Triggered operations and counting events or counters are building blocks used by communication libraries, such as MPI, to offload collective operations to the Host Fabric Interface (HFI) or Network Interface Card (NIC). Triggered operations can be us
Published in:
IPDPS Workshops
Many-core architectures such as the Intel® Xeon Phi™ provide dozens of cores and hundreds of hardware threads. For these machines, a basic MPI implementation is inefficient, as it does not take advantage of the shared data across the ranks on the s
Published in:
IEEE Transactions on Parallel and Distributed Systems. 26:2061-2074
Supercomputers have seen an exponential increase in their size in the last two decades. Such a high growth rate is expected to take us to exascale in the timeframe 2018-2022. But, to bring a productive exascale environment about, it is necessary to f
Authors:
Alexander Sannikov, Sangmin Seo, Yanfei Guo, Ken Raffenetti, Paul Fischer, Tomislav Janjusic, Thilina Rathnayake, Michael Alan Blocksome, Jithin Jose, Matthew Otten, Hajime Fujita, Sergey Oblomov, Sayantan Sur, Masamichi Takagi, Pavan Balaji, Masayuki Hatanaka, Misun Min, Abdelhalim Amer, Paul Coffman, Wesley Bland, Akhil Langer, Michael Chuvelev, Dmitry Durnov, Charles J. Archer, Min Si, Lena Oden, Gengbin Zheng, Xin Zhao
Published in:
SC
This paper provides an in-depth analysis of the software overheads in the MPI performance-critical path and exposes mandatory performance overheads that are unavoidable based on the MPI-3.1 specification. We first present a highly optimized implement
Authors:
María Jesús Garzarán, Gengbin Zheng, Terry Wilmarth, Jeongnim Kim, James Cownie, Rubasri Kalidas, Taru Doodi, Amrita Mathuriya, Jonathan Peyton
Published in:
Scaling OpenMP for Exascale Performance and Portability ISBN: 9783319655772
IWOMP
The OpenMP (The OpenMP name is a registered trademark of the OpenMP Architecture Review Board.) application programming interface provides a simple way for programmers to write parallel programs that are portable between machines and vendors. Program
External link:
https://explore.openaire.eu/search/publication?articleId=doi_________::bb58ec5c28bafe39d990290bf6a0e8e9
https://doi.org/10.1007/978-3-319-65578-9_19
Authors:
Gengbin Zheng (gzheng@cs.uiuc.edu), Terry Wilmarth (wilmarth@cs.uiuc.edu), Praveen Jagadishprasad (jagadish@cs.uiuc.edu), Laxmikant V. Kalé (kale@cs.uiuc.edu)
Published in:
International Journal of Parallel Programming. Jun 2005, Vol. 33, Issue 2/3, pp. 183-207. 25 pp.
Authors:
Gengbin Zheng, Laxmikant V. Kale
Published in:
Parallel Science and Engineering Applications
External link:
https://explore.openaire.eu/search/publication?articleId=doi_________::5c8c90ac9a25fab7f2157f8630a71c29
https://doi.org/10.1201/b16251-12
Published in:
Parallel Processing Letters. 21:319-338
State space search problems abound in the artificial intelligence, planning and optimization literature. Solving such problems is generally NP-hard, so that a brute-force approach to state space search must be employed. Given the exponential amount o
Published in:
The International Journal of High Performance Computing Applications. 25:371-385
Large parallel machines with hundreds of thousands of processors are becoming more prevalent. Ensuring good load balance is critical for scaling certain classes of parallel applications on even thousands of processors. Centralized load balancing algo
Authors:
Hao Yu, James C. Phillips, Abhinav Bhatele, Gengbin Zheng, Eric Bohm, Chao Huang, Laxmikant V. Kale, Sameer Kumar
Published in:
IBM Journal of Research and Development. 52:177-188
NAMD (nanoscale molecular dynamics) is a production molecular dynamics (MD) application for biomolecular simulations that include assemblages of proteins, cell membranes, and water molecules. In a biomolecular simulation, the problem size is fixed an