Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Mongkhounsavath, Alana"'
Large foundation models have revolutionized the field, yet challenges remain in optimizing multi-modal models for specialized visual tasks. We propose a novel, generalizable methodology to identify preferred image distributions for black-box Vision-L
Externí odkaz:
http://arxiv.org/abs/2409.02253