Showing 1 - 10 of 207 for search: '"Pavan Balaji"'
Published in:
IEEE Transactions on Parallel and Distributed Systems. 34:123-140
Authors:
Hui Zhou, Martin Berzins, Damodar Sahasrabudhe, Rohit Zambre, Aparna Chandramowlishwaran, Pavan Balaji
Published in:
IEEE Transactions on Parallel and Distributed Systems. 32:3038-3052
Supercomputing applications are increasingly adopting the MPI+threads programming model over the traditional “MPI everywhere” approach to better handle the disproportionate increase in the number of cores compared with other on-node resources. In
Published in:
IEEE Transactions on Parallel and Distributed Systems. 31:2734-2748
Data analytics has become an integral part of large-scale scientific computing. Among various data analytics frameworks, MapReduce has gained the most traction. Although some efforts have been made to enable efficient MapReduce for supercomputing sys
Published in:
IEEE Transactions on Parallel and Distributed Systems. 31:1859-1877
User-level threads have been widely adopted as a means of achieving lightweight concurrent execution without the costs of OS-level threads. Nevertheless, the costs of managing user-level threads represent a performance barrier that dictates how fine
Published in:
OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Exascale and Smart Networks ISBN: 9783031048876
External links:
https://explore.openaire.eu/search/publication?articleId=doi_________::032a353f99bcf70249c18ccf5d7edb27
https://doi.org/10.1007/978-3-031-04888-3_3
Published in:
CLUSTER
MPI provides nonblocking point-to-point and one-sided communication models to help applications achieve communication and computation overlap. These models provide the opportunity for MPI to offload data transfer to low level network hardware while t
Published in:
Interdisciplinary Sciences: Computational Life Sciences. 12:99-108
Counting the abundance of all the distinct kmers in biological sequence data is a fundamental step in bioinformatics. These applications include de novo genome assembly, error correction, etc. With the development of sequencing technology, the sequen
Published in:
ACM Transactions on Parallel Computing. 6:1-34
Scalable deep neural network training has been gaining prominence because of the increasing importance of deep learning in a multitude of scientific and commercial domains. Consequently, a number of researchers have investigated techniques to optimiz
Published in:
PPoPP
Many-to-many mapping models for user- to kernel-level threads (or "M:N threads") have been extensively studied for decades as a lightweight substitute for current Pthreads implementations that provide a simple one-to-one mapping ("1:1 threads"). M:N
Published in:
HPCC/DSS/SmartCity
SW26010 is a heterogeneous many-core CPU used in the Sunway TaihuLight supercomputer, which ranked fourth in the June 2020 Top500 list. A wide variety of applications have been tuned on SW26010, but only a few studies focus on system-level modelin