Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Kusumba, Abhiram"'
Autor:
Patel, Maitreya, Kusumba, Abhiram, Cheng, Sheng, Kim, Changhoon, Gokhale, Tejas, Baral, Chitta, Yang, Yezhou
Contrastive Language-Image Pretraining (CLIP) models maximize the mutual information between text and visual modalities to learn representations. This makes the nature of the training data a significant factor in the efficacy of CLIP for downstream t
Externí odkaz:
http://arxiv.org/abs/2411.02545