ERVQA: A Dataset to Benchmark the Readiness of Large Vision Language Models in Hospital Environments
Autor: | Ray, Sourjyadip, Gupta, Kushal, Kundu, Soumi, Kasat, Payal Arvind, Aditya, Somak, Goyal, Pawan |
---|---|
Rok vydání: | 2024 |
Předmět: | |
Druh dokumentu: | Working Paper |
Popis: | The global shortage of healthcare workers has demanded the development of smart healthcare assistants, which can help monitor and alert healthcare workers when necessary. We examine the healthcare knowledge of existing Large Vision Language Models (LVLMs) via the Visual Question Answering (VQA) task in hospital settings through expert annotated open-ended questions. We introduce the Emergency Room Visual Question Answering (ERVQA) dataset, consisting of Comment: Accepted at EMNLP 2024 |
Databáze: | arXiv |
Externí odkaz: |