Hierarchical Object Detection with Deep Reinforcement Learning

Autor:	Bellver, Míriam, Giró Nieto, Xavier\|\|\|0000-0002-9935-5332, Marqués Acosta, Fernando\|\|\|0000-0001-8311-1168, Torres Viñals, Jordi\|\|\|0000-0003-1963-7418
Přispěvatelé:	Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors, Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo, Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
Jazyk:	angličtina
Rok vydání:	2016
Předmět:	FOS: Computer and information sciences Information storage and retrieval systems Image processing--Digital techniques Image segmentation Informàtica::Intel·ligència artificial::Aprenentatge automàtic [Àrees temàtiques de la UPC] Enginyeria de la telecomunicació::Telemàtica i xarxes d'ordinadors [Àrees temàtiques de la UPC] Object detection Computer Vision and Pattern Recognition (cs.CV) Computer Science - Computer Vision and Pattern Recognition So imatge i multimèdia::Creació multimèdia::Imatge digital [Àrees temàtiques de la UPC] Enginyeria de la telecomunicació::Processament del senyal::Processament de la imatge i del senyal vídeo [Àrees temàtiques de la UPC] Imatges -- Processament Machine Learning (cs.LG) Imatges -- Segmentació Neural networks (Computer science) Imatges -- Processament -- Tècniques digitals Computer Science - Learning Image processing Informació -- Sistemes d'emmagatzematge i recuperació Machine learning Aprenentatge automàtic Informàtica::Sistemes d'informació::Emmagatzematge i recuperació de la informació [Àrees temàtiques de la UPC] Reinforcement learning Xarxes neuronals (Informàtica) Computer vision
Zdroj:	Recercat. Dipósit de la Recerca de Catalunya instname UPCommons. Portal del coneixement obert de la UPC Universitat Politècnica de Catalunya (UPC)
Popis:	We present a method for performing hierarchical object detection in images guided by a deep reinforcement learning agent. The key idea is to focus on those parts of the image that contain richer information and zoom on them. We train an intelligent agent that, given an image window, is capable of deciding where to focus the attention among five different predefined region candidates (smaller windows). This procedure is iterated providing a hierarchical image analysis.We compare two different candidate proposal strategies to guide the object search: with and without overlap. Moreover, our work compares two different strategies to extract features from a convolutional neural network for each region proposal: a first one that computes new feature maps for each region proposal, and a second one that computes the feature maps for the whole image to later generate crops for each region proposal. Experiments indicate better results for the overlapping candidate proposal strategy and a loss of performance for the cropped image features due to the loss of spatial resolution. We argue that, while this loss seems unavoidable when working with large amounts of object candidates, the much more reduced amount of region proposals generated by our reinforcement learning agent allows considering to extract features for each location without sharing convolutional computation among regions. Deep Reinforcement Learning Workshop (NIPS 2016). Project page at https://imatge-upc.github.io/detection-2016-nipsws/
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1c3aa0293e03f388ca24a19a20095539 http://arxiv.org/abs/1611.03718 Zobrazit plný text záznamu