Search results - "Mihailov, Serghei"

Report

Multi-Modal Adapter for Vision-Language Models

Authors: Seputis, Dominykas; Mihailov, Serghei; Chatterjee, Soham; Xiao, Zehao

Large pre-trained vision-language models, such as CLIP, have demonstrated state-of-the-art performance across a wide range of image classification tasks, without requiring retraining. Few-shot CLIP is competitive with existing specialized architectur

External link: http://arxiv.org/abs/2409.02958
