Search results - "Guo, Xuechen"

Report

LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound

Authors: Guo, Xuechen; Chai, Wenhao; Li, Shi-Yan; Wang, Gaoang

Multimodal Large Language Model (MLLM) has recently garnered attention as a prominent research focus. By harnessing powerful LLM, it facilitates a transition of conversational generative AI from unimodal text to performing multimodal tasks. This boom

External link: http://arxiv.org/abs/2410.15074

Report

Sam-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning

Authors: Zhang, Zhenyu; Wang, Benlu; Liang, Weijie; Li, Yizhi; Guo, Xuechen; Wang, Guanhong; Li, Shiyan; Wang, Gaoang

With the development of multimodality and large language models, the deep learning-based technique for medical image captioning holds the potential to offer valuable diagnostic recommendations. However, current generic text and image pre-trained mode

External link: http://arxiv.org/abs/2311.01004

Report

Blind Inpainting with Object-aware Discrimination for Artificial Marker Removal

Authors: Guo, Xuechen; Hu, Wenhao; Ni, Chiming; Chai, Wenhao; Li, Shiyan; Wang, Gaoang

Medical images often incorporate doctor-added markers that can hinder AI-based diagnosis. This issue highlights the need of inpainting techniques to restore the corrupted visual contents. However, existing methods require manual mask annotation as in

External link: http://arxiv.org/abs/2303.15124
