Combining transformer and CNN for object detection in UAV imagery

Autor: Willy Fitra Hendria, Quang Thinh Phan, Fikriansyah Adzaka, Cheol Jeong
Jazyk: angličtina
Rok vydání: 2023
Předmět:
Zdroj: ICT Express, Vol 9, Iss 2, Pp 258-263 (2023)
Druh dokumentu: article
ISSN: 2405-9595
DOI: 10.1016/j.icte.2021.12.006
Popis: Combining multiple models is a well-known technique to improve predictive performance in challenging tasks such as object detection in UAV imagery. In this paper, we propose fusion of transformer-based and convolutional neural network-based (CNN) models with two approaches. First, we ensemble Swin Transformer and DetectoRS with ResNet backbone, and conduct performance comparison on four typical methods for combining predictions of multiple object detection models. Second, we design a hybrid architecture by combining Swin Transformer backbone with a neck of DetectoRS. We show that the fusion of the transformer and the CNN-based models performs better compared to the respective baseline model.
Databáze: Directory of Open Access Journals