Výsledky vyhledávání - "Mongkhounsavath, Alana"

Report

How to Determine the Preferred Image Distribution of a Black-Box Vision-Language Model?

Autor: Taghanaki, Saeid Asgari, Lambourne, Joseph, Mongkhounsavath, Alana

Large foundation models have revolutionized the field, yet challenges remain in optimizing multi-modal models for specialized visual tasks. We propose a novel, generalizable methodology to identify preferred image distributions for black-box Vision-L

Externí odkaz: http://arxiv.org/abs/2409.02253

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání