Showing 1 - 2 of 2 for search: '"Sivakumar, Anushka"'
Author:
Wang, Zhecan, Liu, Junzhang, Tang, Chia-Wei, Alomari, Hani, Sivakumar, Anushka, Sun, Rui, Li, Wenhao, Atabuzzaman, Md., Ayyubi, Hammad, You, Haoxuan, Ishmam, Alvi, Chang, Kai-Wei, Chang, Shih-Fu, Thomas, Chris
Existing vision-language understanding benchmarks largely consist of images of objects in their usual contexts. As a consequence, recent multimodal large language models can perform well with only a shallow visual understanding by relying on backgrou
External link:
http://arxiv.org/abs/2409.12953
Published in:
International Journal of Information Technology; August 2023, Vol. 15 Issue: 6 p3381-3390, 10p