Showing 1 - 1 of 1 for search: '"Srinivasulu, Vishnuvardhan Pogunulu"'
Multi-label Recognition (MLR) involves the identification of multiple objects within an image. To address the additional complexity of this problem, recent works have leveraged information from vision-language models (VLMs) trained on large text-imag
External link:
http://arxiv.org/abs/2404.16193