Fast Neuromimetic Object Recognition using FPGA Outperforms GPU Implementations
Autor: | Garrick Orchard, Ralph Etienne-Cummings, Jacob G. Martin, R. Jacob Vogelstein |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2015 |
Předmět: |
FOS: Computer and information sciences
Computer Networks and Communications Computer science business.industry 3D single-object recognition Computer Vision and Pattern Recognition (cs.CV) ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION Cognitive neuroscience of visual object recognition Computer Science - Computer Vision and Pattern Recognition Recognition Psychology Models Biological Pattern Recognition Automated Computer Science Applications Pattern Recognition Visual Artificial Intelligence Pattern recognition (psychology) Humans Computer Simulation Computer vision Artificial intelligence Field-programmable gate array business Algorithms Software |
Popis: | Recognition of objects in still images has traditionally been regarded as a difficult computational problem. Although modern automated methods for visual object recognition have achieved steadily increasing recognition accuracy, even the most advanced computational vision approaches are unable to obtain performance equal to that of humans. This has led to the creation of many biologically-inspired models of visual object recognition, among them the HMAX model. HMAX is traditionally known to achieve high accuracy in visual object recognition tasks at the expense of significant computational complexity. Increasing complexity, in turn, increases computation time, reducing the number of images that can be processed per unit time. In this paper we describe how the computationally intensive, biologically inspired HMAX model for visual object recognition can be modified for implementation on a commercial Field Programmable Gate Array, specifically the Xilinx Virtex 6 ML605 evaluation board with XC6VLX240T FPGA. We show that with minor modifications to the traditional HMAX model we can perform recognition on images of size 128x128 pixels at a rate of 190 images per second with a less than 1% loss in recognition accuracy in both binary and multi-class visual object recognition tasks. 14 pages, 8 figures, 5 tables |
Databáze: | OpenAIRE |
Externí odkaz: |