Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Malkershin, Egor"'
In this work, we present a conceptually simple yet powerful baseline for the multimodal dialog task, an S3 model, that achieves near state-of-the-art results on two compelling leaderboards: MMMU and AI Journey Contest 2023. The system is based on a p
Externí odkaz:
http://arxiv.org/abs/2406.18305