Toward Versatile Small Object Detection with Temporal-YOLOv8

Autor:	Martin C. van Leeuwen, Ella P. Fokkinga, Wyke Huizinga, Jan Baan, Friso G. Heslinga
Jazyk:	angličtina
Rok vydání:	2024
Předmět:	small object detection YOLO temporal object detection Chemical technology TP1-1185
Zdroj:	Sensors, Vol 24, Iss 22, p 7387 (2024)
Druh dokumentu:	article
ISSN:	1424-8220
DOI:	10.3390/s24227387
Popis:	Deep learning has become the preferred method for automated object detection, but the accurate detection of small objects remains a challenge due to the lack of distinctive appearance features. Most deep learning-based detectors do not exploit the temporal information that is available in video, even though this context is often essential when the signal-to-noise ratio is low. In addition, model development choices, such as the loss function, are typically designed around medium-sized objects. Moreover, most datasets that are acquired for the development of small object detectors are task-specific and lack diversity, and the smallest objects are often not well annotated. In this study, we address the aforementioned challenges and create a deep learning-based pipeline for versatile small object detection. With an in-house dataset consisting of civilian and military objects, we achieve a substantial improvement in YOLOv8 (baseline mAP = 0.465) by leveraging the temporal context in video and data augmentations specifically tailored to small objects (mAP = 0.839). We also show the benefit of having a carefully curated dataset in comparison with public datasets and find that a model trained on a diverse dataset outperforms environment-specific models. Our findings indicate that small objects can be detected accurately in a wide range of environments while leveraging the speed of the YOLO architecture.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/72dd396dc23445c28507d84cbb2ed2f5 Zobrazit plný text záznamu View record in DOAJ Plný text ve formátu PDF Plný text ve formátu HTML
Nepřihlášeným uživatelům se plný text nezobrazuje	K zobrazení výsledku je třeba se přihlásit.