Search results - "Bezrukov, Oleksandr"

Report

Addressing Blind Guessing: Calibration of Selection Bias in Multiple-Choice Question Answering by Video Language Models

Authors: Loginova, Olga; Bezrukov, Oleksandr; Kravets, Alexey

Evaluating Video Language Models (VLMs) is a challenging task. Due to its transparency, Multiple-Choice Question Answering (MCQA) is widely used to measure the performance of these models through accuracy. However, existing MCQA benchmarks fail to ca

External link: http://arxiv.org/abs/2410.14248
