Study of Deep Learning Models for Class-Agnostic Counting
Autor: | Hilazo Aguilera, David |
---|---|
Přispěvatelé: | Universitat Politècnica de Catalunya. Departament de Ciències de la Computació, Béjar Alonso, Javier |
Jazyk: | angličtina |
Rok vydání: | 2021 |
Předmět: |
Xarxa neuronal convolucional
Recompte d'objectes Deep learning Convolutional Neural Network Visió artificial Autoencoders Neural networks (Computer science) Informàtica [Àrees temàtiques de la UPC] Machine learning Object counting Aprenentatge automàtic Xarxes neuronals (Informàtica) Computer vision Aprenentatge profund |
Zdroj: | UPCommons. Portal del coneixement obert de la UPC Universitat Politècnica de Catalunya (UPC) |
Popis: | In the field of object counting using computer vision techniques, multiple research projects developed highly performant models for counting specific classes. Nonetheless, none has achieved a deep learning model able to count any object independently of the class and very few have worked in that direction. We believe that achieving this class-agnostic counting model is possible by using deep learning methodologies such as Convolutional Neural Networks. We have developed a set of Convolutional neural networks following different counting strategies to validate our hypothesis and achieve a class-agnostic counting model, as well as with the purpose of proposing the best strategy to achieve a model with these characteristics. The developed models, try several ways of approaching the counting task using common CNN architectures that work well in other environments and by analyzing the counting task from a human point of view while trying to mimic our image processing behavior. Furthermore, three datasets have been defined following different counting approaches for training and evaluating the developed models, even in unseen data to evaluate their generalization capabilities without requiring to be trained on it. This master thesis proposes what we believe is the best approach for achieving class-agnostic counting as well as a new model improving the current state of the art in class-agnostic counting. We believe that the best approach is by developing a regression-based model with a spatial density output which splits the several tasks that take part in the counting problem into smaller and, if possible, supervised modules specialized in each sub-task. Moreover, this project offers a set of datasets that can be used in future works, fomenting this way the further research of a true class-agnostic counting model which doesn't require to be trained in unseen instances. |
Databáze: | OpenAIRE |
Externí odkaz: |