Subjective Scoring Framework for VQA Models in Autonomous Driving

Autor:	Kaavya Rekanar, Abbirah Ahmed, Reenu Mohandas, Ganesh Sistu, Ciaran Eising, Martin Hayes
Jazyk:	angličtina
Rok vydání:	2024
Předmět:	Semantic analysis scoring framework subjective assessment VQA models Electrical engineering. Electronics. Nuclear engineering TK1-9971
Zdroj:	IEEE Access, Vol 12, Pp 141306-141323 (2024)
Druh dokumentu:	article
ISSN:	2169-3536
DOI:	10.1109/ACCESS.2024.3404349
Popis:	The development of vision and language transformer models has paved the way for Visual Question Answering (VQA) models and related research. There are metrics to assess the general accuracy of VQA models but subjective assessment of the answers generated by the models is necessary to gain an in-depth understanding and a framework for subjective assessment is required. This work develops a novel scoring system based on the subjectivity of the question and analyses the answers provided by the model using multiple types of natural language processing models (bert-base-uncased, nli-distilBERT-base, all-mpnet-base-v2 and GPT-2) and sentence similarity benchmark metrics (Cosine Similarity). A case study detailing the use of the proposed subjective scoring framework on three prominent VQA models- ViLT, ViLBERT, and LXMERT using an automotive dataset is also presented. The framework proposed aids in analyzing the shortcomings of the discussed VQA models from a driving perspective and the results achieved help determine which model would work best when fine-tuned on a driving-specific VQA dataset.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/4c799dda1adf4dd59b2d459f269b5cc1 Zobrazit plný text záznamu View record in DOAJ