HI-FFT: Heterogeneous Parallel In-Place Algorithm for Large-Scale 2D-FFT
Autor: | Duksu Kim, Homin Kang, Jaehong Lee |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2021 |
Předmět: |
General Computer Science
Computer science Computation Fast Fourier transform General Engineering Parallel algorithm GPU in-place Dynamic priority scheduling Parallel computing Matrix decomposition TK1-9971 Memory management parallel In-place algorithm General Materials Science heterogeneous Electrical engineering. Electronics. Nuclear engineering CPU Heterogeneous network 2D-FFT |
Zdroj: | IEEE Access, Vol 9, Pp 120261-120273 (2021) |
ISSN: | 2169-3536 |
Popis: | Fast Fourier Transform (FFT) is a fundamental operation for 2D data in various applications. To accelerate large-scale 2D-FFT computation, we propose a Heterogeneous parallel In-place 2D-FFT algorithm, HI-FFT. Our novel work decomposition method makes it possible to run our parallel algorithm on the original data (i.e., in-place), unlike prior parallel algorithms that require additional memory space (i.e., out-of-place) to guarantee independence among sub-tasks. Our work decomposition method also removes the duplicated operations on the out-of-place approaches. Using our decomposition method, we introduced an in-place heterogeneous parallel algorithm that utilizes both multi-core CPU and GPU simultaneously. To maximize the utilization efficiency of the computing resources, we also propose a priority-based dynamic scheduling method. We compared the performance of seven different 2D-FFT algorithms, including ours, for large-scale 2D-FFT problems whose sizes varied from 20K2 to 120K2. As a result, we found that our method achieved up to 2.92 and 4.42 times higher performance than the conventional homogeneous parallel algorithms based on the state-of-the-art CPU and GPU libraries, respectively. Also, our method showed up to 2.27 times higher performance than the prior heterogeneous algorithms while requiring two times less memory space. To check the benefit of our HI-FFT on an actual application, we applied it to a CGH (Computer Generated Holography) process. We found that it successfully reduces the hologram generation time. These results demonstrate the advantage of our approach for large-scale 2D-FFT computation. |
Databáze: | OpenAIRE |
Externí odkaz: |