A transformer-based mask R-CNN for tomato detection and segmentation
Autor: | Chong Wang, Gongping Yang, Yuwen Huang, Yikun Liu, Yan Zhang |
---|---|
Rok vydání: | 2023 |
Předmět: | |
Zdroj: | Journal of Intelligent & Fuzzy Systems. 44:8585-8595 |
ISSN: | 1875-8967 1064-1246 |
DOI: | 10.3233/jifs-222954 |
Popis: | Fruit detection is essential for harvesting robot platforms. However, complicated environmental attributes such as illumination variation and occlusion have made fruit detection a challenging task. In this study, a Transformer-based mask region-based convolution neural network (R-CNN) model for tomato detection and segmentation is proposed to address these difficulties. Swin Transformer is used as the backbone network for better feature extraction. Multi-scale training techniques are shown to yield significant performance gains. Apart from accurately detecting and segmenting tomatoes, the method effectively identifies tomato cultivars (normal-size and cherry tomatoes) and tomato maturity stages (fully-ripened, half-ripened, and green). Compared with existing work, the method has the best detection and segmentation performance for these tomatoes, with mean average precision (mAP) results of 89.4% and 89.2%, respectively. |
Databáze: | OpenAIRE |
Externí odkaz: | |
Nepřihlášeným uživatelům se plný text nezobrazuje | K zobrazení výsledku je třeba se přihlásit. |