Combining reduction with synchronization barrier on multi‐core processors.

Autor: Mohamed El Maarouf, Aboul‐Karim, Giraud, Luc, Guermouche, Abdou, Guignon, Thomas
Předmět:
Zdroj: Concurrency & Computation: Practice & Experience; Jan2023, Vol. 35 Issue 1, p1-16, 16p
Abstrakt: Summary: With the rise of multi‐core processors with a large number of cores, the need for shared memory reduction that performs efficiently on a large number of cores is more pressing. Efficient shared memory reduction on these multi‐core processors will help share memory programs be more efficient. In this article, we propose a reduction method combined with a barrier method that uses SIMD read/write instructions to combine barrier signaling and reduction value to minimize memory/cache traffic between cores, thereby reducing barrier latency. We compare different barriers and reduction methods on three multi‐core processors and show that the proposed combining barrier/reduction methods are 4 and 3.5 times faster than respectively GCC 11.1 and Intel 21.2 OpenMP 4.5 reduction. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index