Showing 1 - 1 of 1 for search: '"Mihailov, Serghei"'
Large pre-trained vision-language models, such as CLIP, have demonstrated state-of-the-art performance across a wide range of image classification tasks, without requiring retraining. Few-shot CLIP is competitive with existing specialized architectur
External link:
http://arxiv.org/abs/2409.02958