A Study on the Cardinality of Ordered Average Pooling in Visual Recognition
Authors: | Miguel Pagola, Javier Fernández, Edurne Barrenechea, Humberto Bustince, Juan I. Forcen |
---|---|
Year of publication: | 2017 |
Subject: | Contextual image classification; Computer science; Feature vector; Pooling; Software engineering; Pattern recognition; Engineering and technology; Convolutional neural network; Cardinality; Discriminative model; Bag-of-words model; Electrical engineering, electronic engineering, information engineering; Image scaling; Artificial intelligence & image processing; Artificial intelligence |
Source: | Pattern Recognition and Image Analysis (IbPRIA), ISBN: 9783319588377 |
DOI: | 10.1007/978-3-319-58838-4_48 |
Description: | Bag-of-Words methods can be robust to image scaling, translation, and occlusion. An important step in this methodology, as in other visual recognition systems such as Convolutional Neural Networks, is spatial pooling, in which the descriptors of neighbouring elements are combined into a local or global feature vector. The combined vector must retain relevant information while discarding irrelevant and confusing details. Maximum and average are the most common aggregation functions used in the pooling step. In this work we present a study of the cardinality of ordered average pooling, i.e. the number of ordered elements to aggregate so that the pooled vector keeps the relevant information without losing discriminative power for classification. We provide an extensive evaluation showing that, for different cardinality values, ordered average pooling can outperform both simple average pooling and maximum pooling when dealing with small dictionary sizes. |
Database: | OpenAIRE |
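
The pooling operation described in the abstract can be illustrated with a minimal sketch. It assumes that ordered average pooling with cardinality k means sorting the pooled values of each feature dimension in descending order and averaging the k largest; this reading is an interpretation of the abstract, not the authors' published code. The function name `ordered_average_pool` and the sample array `codes` are hypothetical. Under this assumption, k = 1 reduces to maximum pooling and k equal to the number of pooled elements reduces to average pooling.

```python
import numpy as np


def ordered_average_pool(descriptors, k):
    """Average the k largest values in each feature dimension.

    descriptors: array of shape (n_elements, n_features), one row per
    pooled element (e.g. the coded descriptors of one spatial region).
    k: cardinality, i.e. how many ordered elements are aggregated.
    NOTE: illustrative assumption, not the paper's reference code.
    """
    x = np.asarray(descriptors, dtype=float)
    n_elements = x.shape[0]
    if not 1 <= k <= n_elements:
        raise ValueError("k must be between 1 and the number of pooled elements")
    # Sort every column in descending order and keep its first k rows.
    top_k = -np.sort(-x, axis=0)[:k]
    return top_k.mean(axis=0)


# Hypothetical coded descriptors: 4 neighbouring elements, 2 dictionary words.
codes = np.array([
    [0.9, 0.0],
    [0.1, 0.4],
    [0.5, 0.2],
    [0.3, 0.6],
])

max_pooled = ordered_average_pool(codes, k=1)   # same as codes.max(axis=0)
mid_pooled = ordered_average_pool(codes, k=2)   # mean of the two largest values
avg_pooled = ordered_average_pool(codes, k=4)   # same as codes.mean(axis=0)
```

Varying `k` between these two extremes is the cardinality choice that the study evaluates.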