Energy proportional computing with OpenCL on a FPGA-based overlay architecture
Autor: | Awais Hussain Sani, Jose L Nunez-Yanez |
---|---|
Rok vydání: | 2016 |
Předmět: |
Energy profiles
Computer science Application portability Parallel algorithm Open CL 02 engineering and technology Parallel computing Overlay computer.software_genre 01 natural sciences Improved design productivity Software Programming flow 0103 physical sciences 0202 electrical engineering electronic engineering information engineering Energy proportional computing Field-programmable gate array 010302 applied physics business.industry Energy consumption 020202 computer hardware & architecture Kernel Embedded system Hardware acceleration Compiler Hardware accelerator Soft processors business computer |
Zdroj: | NORCAS Sani, A H & Nunez-Yanez, J L 2016, Energy proportional computing with OpenCL on a FPGA-based overlay architecture . in Proceedings of the 2nd IEEE NORCAS Conference (NORCAS 2016) ., 7792905, Institute of Electrical and Electronics Engineers (IEEE), 2nd IEEE Nordic Circuits and Systems Conference, NORCAS 2016, Copenhagen, Denmark, 1/11/16 . https://doi.org/10.1109/NORCHIP.2016.7792905 |
DOI: | 10.1109/norchip.2016.7792905 |
Popis: | This paper proposes an architecture inspired by ARM big.LITTLE that combines a hardened host with a cluster of soft processors of different complexities, performance and energy profiles. This coarse-grained FPGA overlay architecture results in a hardware accelerator that offers software like programmability, fast compilation, improved design productivity and application portability. A programming flow based on OpenCL is introduced to allow application programmers to implement parallel algorithms at higher level of abstractions. Current OpenCL tools for FPGAS suffer from long compilation times and limited compiler support. Minor changes to the algorithm normally mean full implementation cycles that can take several hours to complete. The proposed architecture allows changes to the application at run-time with cross-compilation done in the host during program execution. To compensate for the loss of performance compared with custom logic the FPGA cluster supports adaptive voltage scaling that enables higher clock frequencies and better adaptation to the program load. Experimental results demonstrates 70% improvement in computational time and 80% reduction in energy consumption by computing OpenCL kernel on different clusters and various operating voltages and frequencies. |
Databáze: | OpenAIRE |
Externí odkaz: |