Zobrazeno 1 - 10
of 29
pro vyhledávání: '"Abid M. Malik"'
Publikováno v:
Proceedings of the 14th International Workshop on Programming Models and Applications for Multicores and Manycores.
Publikováno v:
Proceedings of the 19th ACM International Conference on Computing Frontiers.
Publikováno v:
2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW).
Autor:
Wenbin Lu, Baodi Shan, Eric Raut, Jie Meng, Mauricio Araya-Polo, Johannes Doerfert, Abid M. Malik, Barbara Chapman
Publikováno v:
OpenMP in a Modern World: From Multi-device Support to Meta Programming ISBN: 9783031159213
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::92024c866bcf8dc3d2e038146387cc81
https://doi.org/10.1007/978-3-031-15922-0_2
https://doi.org/10.1007/978-3-031-15922-0_2
Publikováno v:
Sustainable Computing: Informatics and Systems. 23:1-12
Energy consumption of an application has gained the same importance as execution time in current HPC systems, primarily due to the massive energy consumption of these systems and consequently substantial energy bills. This fact forced software develo
Publikováno v:
IPDPS Workshops
Recently, edge computing has received considerable attention as a promising means to provide Deep Learning (DL) based services. However, due to the limited computation capability of the data processing units (such as CPUs, GPUs, and specialized accel
Publikováno v:
2020 IEEE/ACM 6th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC) and Workshop on Hierarchical Parallelism for Exascale Computing (HiPar).
OpenMP 5.0 introduces many new directives to meet the demand of emerging high performance computing systems. Among these new directives, the metadirective and declare variant directives are important to control the execution behavior of a given appli
Publikováno v:
OpenMP: Portable Multi-Level Parallelism on Modern Systems ISBN: 9783030581435
IWOMP
IWOMP
In the high performance computing sector, researchers and application developers expend considerable effort to port their applications to GPU-based clusters in order to take advantage of the massive parallelism and energy efficiency of a GPU. Unfortu
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::de54212e7c3e3e46d63a3e0f0ff3d791
https://doi.org/10.1007/978-3-030-58144-2_18
https://doi.org/10.1007/978-3-030-58144-2_18
Publikováno v:
OpenMP: Portable Multi-Level Parallelism on Modern Systems ISBN: 9783030581435
IWOMP
IWOMP
Many modern supercomputers such as ORNL’s Summit, LLNL’s Sierra, and LBL’s upcoming Perlmutter offer or will offer multiple, e.g., 4 to 8, GPUs per node for running computational science and engineering applications. One should expect an applic
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::7defd7223aea6cf82a01354929239702
https://doi.org/10.1007/978-3-030-58144-2_19
https://doi.org/10.1007/978-3-030-58144-2_19
Publikováno v:
2018 New York Scientific Data Summit (NYSDS).
Long training times for building a high accuracy deep neural networks (DNNs) is impeding research for new DNN architectures. For example, time for training GoogleNet with the ImageNet dataset on a single Nvidia K20 GPU almost takes 25 days. Therefore