Understanding How Image Quality Affects Transformer Neural Networks

Autor:	Domonkos Varga
Jazyk:	angličtina
Rok vydání:	2024
Předmět:	transformer models image classification noise sensitivity computer vision Applied mathematics. Quantitative methods T57-57.97
Zdroj:	Signals, Vol 5, Iss 3, Pp 562-579 (2024)
Druh dokumentu:	article
ISSN:	2624-6120
DOI:	10.3390/signals5030031
Popis:	Deep learning models, particularly transformer architectures, have revolutionized various computer vision tasks, including image classification. However, their performance under different types and levels of noise remains a crucial area of investigation. In this study, we explore the noise sensitivity of prominent transformer models trained on the ImageNet dataset. We systematically evaluate 22 transformer variants, ranging from state-of-the-art large-scale models to compact versions tailored for mobile applications, under five common types of image distortions. Our findings reveal diverse sensitivities across different transformer architectures, with notable variations in performance observed under additive Gaussian noise, multiplicative Gaussian noise, Gaussian blur, salt-and-pepper noise, and JPEG compression. Interestingly, we observe a consistent robustness of transformer models to JPEG compression, with top-5 accuracies exhibiting higher resilience to noise compared to top-1 accuracies. Furthermore, our analysis highlights the vulnerability of mobile-oriented transformer variants to various noise types, underscoring the importance of noise robustness considerations in model design and deployment for real-world applications. These insights contribute to a deeper understanding of transformer model behavior under noisy conditions and have implications for improving the robustness and reliability of deep learning systems in practical scenarios.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/7e05115dddb54439b0000bb6471df338 Zobrazit plný text záznamu View record in DOAJ