Object Recognition under Multifarious Conditions: A Reliability Analysis and A Feature Similarity-based Performance Estimation
Autor: | Ghassan AlRegib, Dogancan Temel, Jinsol Lee |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2019 |
Předmět: |
FOS: Computer and information sciences
I.2 Similarity (geometry) I.4 I.5 Computer science Computer Vision and Pattern Recognition (cs.CV) Computer Science - Computer Vision and Pattern Recognition 02 engineering and technology Variation (game tree) FOS: Electrical engineering electronic engineering information engineering 0202 electrical engineering electronic engineering information engineering Feature (machine learning) Overhead (computing) Computer vision Reliability (statistics) business.industry Deep learning Image and Video Processing (eess.IV) Cognitive neuroscience of visual object recognition 020206 networking & telecommunications Electrical Engineering and Systems Science - Image and Video Processing 020201 artificial intelligence & image processing Artificial intelligence Focus (optics) business |
Zdroj: | ICIP |
Popis: | In this paper, we investigate the reliability of online recognition platforms, Amazon Rekognition and Microsoft Azure, with respect to changes in background, acquisition device, and object orientation. We focus on platforms that are commonly used by the public to better understand their real-world performances. To assess the variation in recognition performance, we perform a controlled experiment by changing the acquisition conditions one at a time. We use three smartphones, one DSLR, and one webcam to capture side views and overhead views of objects in a living room, an office, and photo studio setups. Moreover, we introduce a framework to estimate the recognition performance with respect to backgrounds and orientations. In this framework, we utilize both handcrafted features based on color, texture, and shape characteristics and data-driven features obtained from deep neural networks. Experimental results show that deep learning-based image representations can estimate the recognition performance variation with a Spearman's rank-order correlation of 0.94 under multifarious acquisition conditions. 5 pages, 3 figures, 1 table |
Databáze: | OpenAIRE |
Externí odkaz: |