YOLO-DCTI: Small Object Detection in Remote Sensing Base on Contextual Transformer Enhancement

Authors: Lingtong Min, Ziman Fan, Qinyi Lv, Mohamed Reda, Linghao Shen, Binglu Wang
Language: English
Publication year: 2023
Subject:
Source: Remote Sensing, Vol 15, Iss 16, p 3970 (2023)
Document type: article
ISSN: 15163970; 2072-4292
DOI: 10.3390/rs15163970
Abstract: Object detection for remote sensing is a fundamental task in image processing of remote sensing; as one of the core components, small or tiny object detection plays an important role. Despite the considerable advancements achieved in small object detection with the integration of CNN and transformer networks, there remains untapped potential for enhancing the extraction and utilization of information associated with small objects. Particularly within transformer structures, this potential arises from the disregard of the complex and intertwined interplay between spatial context information and channel information during the global modeling of pixel-level information within small objects. As a result, valuable information is prone to being obfuscated and annihilated. To mitigate this limitation, we propose an innovative framework, YOLO-DCTI, that capitalizes on the Contextual Transformer (CoT) framework for the detection of small or tiny objects. Specifically, within CoT, we seamlessly incorporate global residuals and local fusion mechanisms throughout the entire input-to-output pipeline. This integration facilitates a profound investigation into the network’s intrinsic representations at deeper levels and fosters the fusion of spatial contextual attributes with channel characteristics. Moreover, we propose an improved decoupled contextual transformer detection head structure, denoted as DCTI, to effectively resolve the feature conflicts that ensue from the concurrent classification and regression tasks. The experimental results on the DOTA, VisDrone, and NWPU VHR-10 datasets show that, on the powerful real-time detection network YOLOv7, the speed and accuracy of tiny-target detection are better balanced.
Database: Directory of Open Access Journals