A Comprehensive Survey of Transformers for Computer Vision

Autor: Sonain Jamil, Md. Jalil Piran, Oh-Jin Kwon
Jazyk: angličtina
Rok vydání: 2023
Předmět:
Zdroj: Drones, Vol 7, Iss 5, p 287 (2023)
Druh dokumentu: article
ISSN: 2504-446X
DOI: 10.3390/drones7050287
Popis: As a special type of transformer, vision transformers (ViTs) can be used for various computer vision (CV) applications. Convolutional neural networks (CNNs) have several potential problems that can be resolved with ViTs. For image coding tasks such as compression, super-resolution, segmentation, and denoising, different variants of ViTs are used. In our survey, we determined the many CV applications to which ViTs are applicable. CV applications reviewed included image classification, object detection, image segmentation, image compression, image super-resolution, image denoising, anomaly detection, and drone imagery. We reviewed the state of the-art and compiled a list of available models and discussed the pros and cons of each model.
Databáze: Directory of Open Access Journals