Monocular Depth Perception on Microcontrollers for Edge Applications
Autor: | Fabio Tosi, Matteo Poggi, Valentino Peluso, Antonio Cipolletta, Filippo Aleotti, Stefano Mattoccia, Andrea Calimera |
---|---|
Přispěvatelé: | Peluso, Valentino, Cipolletta, Antonio, Calimera, Andrea, Poggi, Matteo, Tosi, Fabio, Aleotti, Filippo, Mattoccia, Stefano |
Jazyk: | angličtina |
Rok vydání: | 2022 |
Předmět: |
IoT
Monocular Computer vision depth estimation deep learning optimization methods edge computing IoT micro-controllers Computer science business.industry Deep learning ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION deep learning micro-controllers Convolutional neural network Field (computer science) ARM architecture edge computing Media Technology depth estimation optimization methods Computer vision Artificial intelligence Electrical and Electronic Engineering business Depth perception Wireless sensor network Edge computing |
Popis: | Depth estimation is crucial in several computer vision applications, and a recent trend in this field aims at inferring such a cue from a single camera. Unfortunately, despite the compelling results achieved, state-of-the-art monocular depth estimation methods are computationally demanding, thus precluding their practical deployment in several application contexts characterized by low-power constraints. Therefore, in this paper, we propose a lightweight Convolutional Neural Network based on a shallow pyramidal architecture, referred to as μPyD-Net, enabling monocular depth estimation on microcontrollers. The network is trained in a peculiar self-supervised manner leveraging proxy labels obtained through a traditional stereo algorithm. Moreover, we propose optimization strategies aimed at performing computations with quantized 8-bit data and map the high-level description of the network to low-level layers optimized for the target microcontroller architecture. Exhaustive experimental results on standard datasets and an in-depth evaluation with a device belonging to the popular Arm Cortex-M family confirm that obtaining sufficiently accurate monocular depth estimation on microcontrollers is feasible. To the best of our knowledge, our proposal is the first one enabling such remarkable achievement, paving the way for the deployment of monocular depth cues onto the tiny end-nodes of distributed sensor networks. |
Databáze: | OpenAIRE |
Externí odkaz: |