Efficient irregular wavefront propagation algorithms on Intel ® Xeon Phi ™ .

Autor: Gomes JM; University of Brasília, Brasília, DF, Brazil., Teodoro G; University of Brasília, Brasília, DF, Brazil., de Melo A; University of Brasília, Brasília, DF, Brazil., Kong J; Emory University, Atlanta, GA, USA., Kurc T; Stony Brook University., Saltz JH; Stony Brook University.
Jazyk: angličtina
Zdroj: Proceedings. Symposium on Computer Architecture and High Performance Computing [Proc Symp Comput Archit High Perform Comput] 2015 Oct; Vol. 2015, pp. 25-32.
DOI: 10.1109/SBAC-PAD.2015.13
Abstrakt: We investigate the execution of the Irregular Wavefront Propagation Pattern (IWPP), a fundamental computing structure used in several image analysis operations, on the Intel ® Xeon Phi co-processor. An efficient implementation of IWPP on the Xeon Phi is a challenging problem because of IWPP's irregularity and the use of atomic instructions in the original IWPP algorithm to resolve race conditions. On the Xeon Phi, the use of SIMD and vectorization instructions is critical to attain high performance. However, SIMD atomic instructions are not supported. Therefore, we propose a new IWPP algorithm that can take advantage of the supported SIMD instruction set. We also evaluate an alternate storage container (priority queue) to track active elements in the wavefront in an effort to improve the parallel algorithm efficiency. The new IWPP algorithm is evaluated with Morphological Reconstruction and Imfill operations as use cases. Our results show performance improvements of up to 5.63 × on top of the original IWPP due to vectorization. Moreover, the new IWPP achieves speedups of 45.7 × and 1.62 × , respectively, as compared to efficient CPU and GPU implementations.
Databáze: MEDLINE