Výsledky vyhledávání - "Miranda, Imanol"

Report

BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval

Autor: Miranda, Imanol, Salaberria, Ander, Agirre, Eneko, Azkune, Gorka

Existing Vision-Language Compositionality (VLC) benchmarks like SugarCrepe are formulated as image-to-text retrieval problems, where, given an image, the models need to select between the correct textual description and a synthetic hard negative text

Externí odkaz: http://arxiv.org/abs/2406.09952

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání