Zobrazeno 1 - 10
of 785
pro vyhledávání: '"ZHANG Mengxi"'
Autor:
Yao, Huanjin, Wu, Wenhao, Yang, Taojiannan, Song, YuXin, Zhang, Mengxi, Feng, Haocheng, Sun, Yifan, Li, Zhiheng, Ouyang, Wanli, Wang, Jingdong
Do we fully leverage the potential of visual encoder in Multimodal Large Language Models (MLLMs)? The recent outstanding performance of MLLMs in multimodal understanding has garnered broad attention from both academia and industry. In the current MLL
Externí odkaz:
http://arxiv.org/abs/2405.13800
Autor:
Zhang, Mengxi, Wu, Wenhao, Lu, Yu, Song, Yuxin, Rong, Kang, Yao, Huanjin, Zhao, Jianbo, Liu, Fanglong, Sun, Yifan, Feng, Haocheng, Wang, Jingdong
Current multimodal Large Language Models (MLLMs) suffer from ``hallucination'', occasionally generating responses that are not grounded in the input images. To tackle this challenge, one promising path is to utilize reinforcement learning from human
Externí odkaz:
http://arxiv.org/abs/2405.11165
Referring image segmentation (RIS) aims to locate the particular region corresponding to the language expression. Existing methods incorporate features from different modalities in a \emph{bottom-up} manner. This design may get some unnecessary image
Externí odkaz:
http://arxiv.org/abs/2405.10707
Diffusion models, known for their powerful generative capabilities, play a crucial role in addressing real-world super-resolution challenges. However, these models often focus on improving local textures while neglecting the impacts of global degrada
Externí odkaz:
http://arxiv.org/abs/2404.00661
Referring image segmentation (RIS) aims to segment a particular region based on a language expression prompt. Existing methods incorporate linguistic features into visual features and obtain multi-modal features for mask decoding. However, these meth
Externí odkaz:
http://arxiv.org/abs/2311.15727
This paper does not present a novel method. Instead, it delves into an essential, yet must-know baseline in light of the latest advancements in Generative Artificial Intelligence (GenAI): the utilization of GPT-4 for visual understanding. Our study c
Externí odkaz:
http://arxiv.org/abs/2311.15732
Radiotherapy is one of the primary treatment methods for tumors, but the organ movement caused by respiration limits its accuracy. Recently, 3D imaging from a single X-ray projection has received extensive attention as a promising approach to address
Externí odkaz:
http://arxiv.org/abs/2310.08080
Autor:
Zhang, Mengxi1 (AUTHOR) zhangmengxi@nudt.edu.cn, Chen, Honghui1 (AUTHOR) chenhonghui@nudt.edu.cn
Publikováno v:
Mathematics (2227-7390). Sep2024, Vol. 12 Issue 17, p2636. 14p.
Autor:
Liu, Yiming, Zhang, Mengxi, Zhang, Weiqin, Jiang, Bo, Hou, Bo, Liu, Dan, Chen, Jie, Lian, Heqing
Magnetic resonance imaging plays an essential role in clinical diagnosis by acquiring the structural information of biological tissue. Recently, many multi-contrast MRI super-resolution networks achieve good effects. However, most studies ignore the
Externí odkaz:
http://arxiv.org/abs/2210.03460
Autor:
Deng, Genhua, Li, Wenwei, He, Yinpeng, Zhang, Kailai, Wang, Xinyue, Cui, Jinyang, Li, Mingchao, Zhang, Mengxi
Publikováno v:
In Construction and Building Materials 22 November 2024 452