AUKE
Autor: | Paul H. J. Kelly, Sajad Saeedi, Thomas Debrunner |
---|---|
Přispěvatelé: | Engineering & Physical Science Research Council (E |
Rok vydání: | 2018 |
Předmět: |
010302 applied physics
Computer science 0803 Computer Software 02 engineering and technology Frame rate 01 natural sciences Processor array 020202 computer hardware & architecture Convolution Computational science Kernel (image processing) Hardware and Architecture 0103 physical sciences Digital image processing 0202 electrical engineering electronic engineering information engineering Code generation SIMD Pose Software Information Systems |
Zdroj: | ACM Transactions on Architecture and Code Optimization. 15:1-26 |
ISSN: | 1544-3973 1544-3566 |
Popis: | Focal-plane Sensor-Processor Arrays (FPSPs) are new imaging devices with parallel Single Instruction Multiple Data (SIMD) computational capabilities built into every pixel. Compared to traditional imaging devices, FPSPs allow for massive pixel-parallel execution of image processing algorithms. This enables the application of certain algorithms at extreme frame rates (>10,000 frames per second). By performing some early-stage processing in-situ, systems incorporating FPSPs can consume less power compared to conventional approaches using standard digital cameras. In this article, we explore code generation for an FPSP whose 256 × 256 processors operate on analogue signal data, leading to further opportunities for power reduction—and additional code synthesis challenges. While rudimentary image processing algorithms have been demonstrated on FPSPs before, progress with higher-level computer vision algorithms has been sparse due to the unique architecture and limits of the devices. This article presents a code generator for convolution filters for the SCAMP-5 FPSP, with applications in many high-level tasks such as convolutional neural networks, pose estimation, and so on. The SCAMP-5 FPSP has no effective multiply operator. Convolutions have to be implemented through sequences of more primitive operations such as additions, subtractions, and multiplications/divisions by two. We present a code generation algorithm to optimise convolutions by identifying common factors in the different weights and by determining an optimised pattern of pixel-to-pixel data movements to exploit them. We present evaluation in terms of both speed and energy consumption for a suite of well-known convolution filters. Furthermore, an application of the method is shown by the implementation of a Viola-Jones face detection algorithm. |
Databáze: | OpenAIRE |
Externí odkaz: |