RailTrack-DaViT: A Vision Transformer-Based Approach for Automated Railway Track Defect Detection

Autor:	Aniwat Phaphuangwittayakul, Napat Harnpornchai, Fangli Ying, Jinming Zhang
Jazyk:	angličtina
Rok vydání:	2024
Předmět:	railway track inspection vision transformer computer vision transportation safety public transportation monitoring Photography TR1-1050 Computer applications to medicine. Medical informatics R858-859.7 Electronic computers. Computer science QA75.5-76.95
Zdroj:	Journal of Imaging, Vol 10, Iss 8, p 192 (2024)
Druh dokumentu:	article
ISSN:	2313-433X
DOI:	10.3390/jimaging10080192
Popis:	Railway track defects pose significant safety risks and can lead to accidents, economic losses, and loss of life. Traditional manual inspection methods are either time-consuming, costly, or prone to human error. This paper proposes RailTrack-DaViT, a novel vision transformer-based approach for railway track defect classification. By leveraging the Dual Attention Vision Transformer (DaViT) architecture, RailTrack-DaViT effectively captures both global and local information, enabling accurate defect detection. The model is trained and evaluated on multiple datasets including rail, fastener and fishplate, multi-faults, and ThaiRailTrack. A comprehensive analysis of the model’s performance is provided including confusion matrices, training visualizations, and classification metrics. RailTrack-DaViT demonstrates superior performance compared to state-of-the-art CNN-based methods, achieving the highest accuracies: 96.9% on the rail dataset, 98.9% on the fastener and fishplate dataset, and 98.8% on the multi-faults dataset. Moreover, RailTrack-DaViT outperforms baselines on the ThaiRailTrack dataset with 99.2% accuracy, quickly adapts to unseen images, and shows better model stability during fine-tuning. This capability can significantly reduce time consumption when applying the model to novel datasets in practical applications.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/d40ce6fd3b3844409abab534341f98ab Zobrazit plný text záznamu View record in DOAJ