Zobrazeno 1 - 4
of 4
pro vyhledávání: '"Yin, Shukang"'
Autor:
Yin, Shukang, Fu, Chaoyou, Zhao, Sirui, Xu, Tong, Wang, Hao, Sui, Dianbo, Shen, Yunhang, Li, Ke, Sun, Xing, Chen, Enhong
Hallucination is a big shadow hanging over the rapidly evolving Multimodal Large Language Models (MLLMs), referring to the phenomenon that the generated text is inconsistent with the image content. In order to mitigate hallucinations, existing studie
Externí odkaz:
http://arxiv.org/abs/2310.16045
Recently, Multimodal Large Language Model (MLLM) represented by GPT-4V has been a new rising research hotspot, which uses powerful Large Language Models (LLMs) as a brain to perform multimodal tasks. The surprising emergent capabilities of MLLM, such
Externí odkaz:
http://arxiv.org/abs/2306.13549
Automatic Micro-Expression (ME) spotting in long videos is a crucial step in ME analysis but also a challenging task due to the short duration and low intensity of MEs. When solving this problem, previous works generally lack in considering the struc
Externí odkaz:
http://arxiv.org/abs/2303.09114
Publikováno v:
ACM Transactions on Multimedia Computing, Communications & Applications; Oct2024, Vol. 20 Issue 10, p1-21, 21p