Zobrazeno 1 - 4
of 4
pro vyhledávání: '"Yao, Kelu"'
Recently, the remarkable success of ChatGPT has sparked a renewed wave of interest in artificial intelligence (AI), and the advancements in visual language models (VLMs) have pushed this enthusiasm to new heights. Differring from previous AI approach
Externí odkaz:
http://arxiv.org/abs/2410.17283
Autor:
Wei, Guoting, Yuan, Xia, Liu, Yu, Shang, Zhenhao, Yao, Kelu, Li, Chao, Yan, Qingsen, Zhao, Chunxia, Zhang, Haokui, Xiao, Rong
Aerial object detection has been a hot topic for many years due to its wide application requirements. However, most existing approaches can only handle predefined categories, which limits their applicability for the open scenarios in real-world. In t
Externí odkaz:
http://arxiv.org/abs/2408.12246
Compositional reasoning capabilities are usually considered as fundamental skills to characterize human perception. Recent studies show that current Vision Language Models (VLMs) surprisingly lack sufficient knowledge with respect to such capabilitie
Externí odkaz:
http://arxiv.org/abs/2405.17201
Publikováno v:
In Expert Systems With Applications 15 March 2025 265