Autor: |
Tahir A; Research Center for Modeling and Simulation, National University of Sciences and Technology, Islamabad 64000, Pakistan., Munawar HS; School of Built Environment, University of New South Wales, Kensington, Sydney, NSW 2052, Australia., Akram J; Department of Computer Science, Superior University, Lahore 54700, Pakistan.; School of Computer Science, The University of Sydney, Camperdown, Sydney, NSW 2006, Australia., Adil M; Research Center for Modeling and Simulation, National University of Sciences and Technology, Islamabad 64000, Pakistan., Ali S; School of Engineering, Deakin University, Geelong, VIC 3216, Australia., Kouzani AZ; School of Engineering, Deakin University, Geelong, VIC 3216, Australia., Mahmud MAP; School of Engineering, Deakin University, Geelong, VIC 3216, Australia. |
Abstrakt: |
Object detection is a vital step in satellite imagery-based computer vision applications such as precision agriculture, urban planning and defense applications. In satellite imagery, object detection is a very complicated task due to various reasons including low pixel resolution of objects and detection of small objects in the large scale (a single satellite image taken by Digital Globe comprises over 240 million pixels) satellite images. Object detection in satellite images has many challenges such as class variations, multiple objects pose, high variance in object size, illumination and a dense background. This study aims to compare the performance of existing deep learning algorithms for object detection in satellite imagery. We created the dataset of satellite imagery to perform object detection using convolutional neural network-based frameworks such as faster RCNN (faster region-based convolutional neural network), YOLO (you only look once), SSD (single-shot detector) and SIMRDWN (satellite imagery multiscale rapid detection with windowed networks). In addition to that, we also performed an analysis of these approaches in terms of accuracy and speed using the developed dataset of satellite imagery. The results showed that SIMRDWN has an accuracy of 97% on high-resolution images, while Faster RCNN has an accuracy of 95.31% on the standard resolution (1000 × 600). YOLOv3 has an accuracy of 94.20% on standard resolution (416 × 416) while on the other hand SSD has an accuracy of 84.61% on standard resolution (300 × 300). When it comes to speed and efficiency, YOLO is the obvious leader. In real-time surveillance, SIMRDWN fails. When YOLO takes 170 to 190 milliseconds to perform a task, SIMRDWN takes 5 to 103 milliseconds. |