Showing 1 - 10 of 18 for search: '"Mikhail G. Kurnosov"'
Author:
K. E. Kramarenko, E. N. Peryshkova, Kirill V. Pavsky, Alexandr V. Efimov, Mikhail G. Kurnosov, A. Yu. Polyakov
Published in:
Автометрия. 57:119-128
Published in:
Vestnik Tomskogo gosudarstvennogo universiteta. Upravlenie, vychislitel'naya tekhnika i informatika. :93-101
Published in:
2021 Ural Symposium on Biomedical Engineering, Radioelectronics and Information Technology (USBEREIT).
This paper investigates algorithms for barrier synchronization in MPI applications on HPC clusters of NUMA machines. We consider the case in which all MPI processes that need to be synchronized reside on the same multi-socket NUMA machine. In part
Published in:
Communications in Computer and Information Science ISBN: 9783030646158
RuSCDays
The MPI_Bcast collective communication operation is used by many scientific applications and tends to limit overall parallel application scalability. This paper investigates the design and optimization of the broadcast operation for NUMA nodes running GNU/Linux.
External link:
https://explore.openaire.eu/search/publication?articleId=doi_________::a99e2a55c8cb6d25d6196b2a4a5e6167
https://doi.org/10.1007/978-3-030-64616-5_41
Published in:
2019 15th International Asian School-Seminar Optimization Problems of Complex Systems (OPCS).
A theoretical and experimental analysis of MPI_Bcast algorithms is presented. The optimal tree degrees and segment sizes for the pipelined versions of the algorithms are obtained. The algorithms were investigated according to their implementation in the Open MPI l
Published in:
2018 XIV International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE).
Solving the self-diagnosis problem for distributed computer systems consists in determining the fault-free and faulty nodes of the system from a given syndrome. This problem can be reduced to a classification problem, which can be efficiently solve
Published in:
2018 XIV International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE).
The interconnection networks of modern high-performance distributed computer systems are now deeply hierarchical. In such systems, the communication time between processors depends on their placement in the computer system. In large-scale NUMA/SMP computer clus
Published in:
2018 XIV International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE).
This work proposes an implementation of a scalable concurrent pool based on diffraction trees. The developed pool localizes accesses to shared variables to maximize its throughput. The proposed approaches increase the throughput at high and
Published in:
2018 3rd Russian-Pacific Conference on Computer Technology and Applications (RPC).
This paper evaluates how well the modern compilers Intel C/C++, GCC C/C++, LLVM/Clang, and PGI C/C++ auto-vectorize loops. We use the Extended Test Suite for Vectorizing Compilers (ETSVC) as a benchmark. We estimate time, energy, power and speedup by runn
Published in:
2017 IEEE II International Conference on Control in Technical Systems (CTS).
This paper presents heuristic algorithms for optimizing communications in parallel PGAS programs and minimizing their execution time. This is achieved by taking into account the hierarchical structure of computer systems during reduction. Develop