Fine-grained ship image classification and detection based on a vision transformer and multi-grain feature vector FPN model

Autor:	Fengxiang Wang, Deying Yu, Liang Huang, Yalun Zhang, Yongbing Chen, Zhiguo Wang
Jazyk:	angličtina
Rok vydání:	2024
Předmět:	Deep learning image classification ship detection remote-sensing images transformer Mathematical geography. Cartography GA1-1776 Geodesy QB275-343
Zdroj:	Geo-spatial Information Science, Pp 1-22 (2024)
Druh dokumentu:	article
ISSN:	10095020 1993-5153 1009-5020 86203231
DOI:	10.1080/10095020.2024.2331552
Popis:	ABSTRACTIn naval and civilian domains, meticulous ship classification and detection are paramount. Nevertheless, predominant research has gravitated toward leveraging Convolutional Neural Network (CNN)-centered methodologies, often overlooking the diverse granularity inherent in ship samples. In our pursuit to holistically extract features from ship images across varying granularities, we present a transformative architecture: the Vision Transformer and Multi-Grain Feature Vector Feature Pyramid Network (ViT-MGFV-FPN). This model synergistically melds the merits of MGFV-FPN with an augmented Vision Transformer (ViT) for a comprehensive image feature extraction. To cater to the extraction of broader image features whilst sidestepping the innate quadratic complexity of traditional ViT, we unveil an enhanced version christened the Global Swin Transformer. Concurrently, the MGFV-FPN is orchestrated to harness the prowess of CNNs in distilling intricate ship attributes. Rigorous empirical evaluations underscore our model’s superiority in juxtaposition with extant CNN and transformer-based paradigms for nuanced ship categorization.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/4db8620323154f98a4389a54e739e757 Zobrazit plný text záznamu View record in DOAJ