A study of the loops control for reconfigurable computing with OpenCL in the LABS local search problem
Autor: | Agnieszka Dąbrowska-Boruch, Pawel Grzegorz Russek, Ernest Jamro, Kazimierz Wiatr |
---|---|
Rok vydání: | 2019 |
Předmět: |
business.industry
Computer science Autocorrelation Control (management) Binary number 020206 networking & telecommunications 02 engineering and technology Parallel computing Reconfigurable computing Theoretical Computer Science Hardware and Architecture 0202 electrical engineering electronic engineering information engineering Memetic algorithm 020201 artificial intelligence & image processing Local search (optimization) business Gradient descent Field-programmable gate array Software |
Zdroj: | The International Journal of High Performance Computing Applications. 34:103-114 |
ISSN: | 1741-2846 1094-3420 |
DOI: | 10.1177/1094342019868515 |
Popis: | In this article, we study the steepest descent local search (SDLS) algorithm that is used as the improvement step in the memetic algorithms for the search of low autocorrelation binary sequences (LABS). We address the method of reconfigurable computing, as the algorithm is of the field programmable gate array (FPGA) type as it features the integer operations, bit-wise data representation, regular execution flow, and huge computational complexity. It contains four levels of nested loops, but its direct parallel implementation as a custom processor leads to typical problems because the loops expose a dynamic range and too many iterations. This inhibits a simple parallel data path that is typically produced by the method of the loop unrolling. We have examined the four architectures that mitigate the found obstacles, and we provide the results of their implementation. The solutions take advantages of the loop pipelining, reordering of the loops, and dynamic reconfiguration. The recently available development tool was involved in our study as we have used the OpenCL (OCL) platform for FPGAs to draw practical conclusions. The given proposals are characterized by their performance and capacity for a problem size. Consequently, the speed/size trade-off is highlighted, as an FPGA size is a design constraint. The performance of the FPGA-based solutions is compared to the CPU speed, and the maximum reported speed-up is 750. Readers can further develop and/or use the presented OCL solutions for efficient LABS discovery as we provide the corresponding software repository. |
Databáze: | OpenAIRE |
Externí odkaz: |