Authors: |
Willy Fitra Hendria, Quang Thinh Phan, Fikriansyah Adzaka, Cheol Jeong |
Language: |
English |
Year of publication: |
2023 |
Subject: |
|
Source: |
ICT Express, Vol 9, Iss 2, Pp 258-263 (2023) |
Document type: |
article |
ISSN: |
2405-9595 |
DOI: |
10.1016/j.icte.2021.12.006 |
Description: |
Combining multiple models is a well-known technique for improving predictive performance in challenging tasks such as object detection in UAV imagery. In this paper, we propose two approaches to fusing transformer-based and convolutional neural network (CNN)-based models. First, we ensemble Swin Transformer and DetectoRS with a ResNet backbone, and compare the performance of four typical methods for combining the predictions of multiple object detection models. Second, we design a hybrid architecture that combines a Swin Transformer backbone with the neck of DetectoRS. We show that fusing the transformer-based and CNN-based models outperforms the respective baseline models. |
Database: |
Directory of Open Access Journals |
External link: |
|
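
The first approach in the abstract, combining the predictions of several detectors, might be sketched as follows. The record does not name the four combination methods, so this sketch assumes one common choice: pooling all boxes and applying greedy per-class non-maximum suppression. The `Detection` tuple layout, the `ensemble_nms` helper, the 0.5 IoU threshold, and the sample boxes are all hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

# A detection is (x1, y1, x2, y2, score, label); this layout is an assumption.
Detection = Tuple[float, float, float, float, float, int]


def iou(a: Detection, b: Detection) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def ensemble_nms(per_model: List[List[Detection]], iou_thr: float = 0.5) -> List[Detection]:
    """Pool detections from all models, then apply greedy per-class NMS."""
    pooled = sorted(
        (d for dets in per_model for d in dets),
        key=lambda d: d[4],
        reverse=True,
    )
    kept: List[Detection] = []
    for det in pooled:
        # Keep a box only if no higher-scoring kept box of the same class overlaps it.
        if all(k[5] != det[5] or iou(k, det) < iou_thr for k in kept):
            kept.append(det)
    return kept


# Hypothetical outputs of two detectors on the same image.
model_a = [(10.0, 10.0, 50.0, 50.0, 0.90, 0)]
model_b = [(12.0, 11.0, 52.0, 49.0, 0.80, 0), (100.0, 100.0, 140.0, 150.0, 0.70, 1)]
fused = ensemble_nms([model_a, model_b])
```

Here the two overlapping class-0 boxes collapse into the higher-scoring one, while the class-1 box found by only one model is retained. Other combination schemes, such as ones that average overlapping boxes instead of discarding them, would replace the suppression step.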