Showing 1 - 10 of 4,559 for search: '"Zhang, XiaoBo"'
Multimodal Question Answering (MMQA) is crucial as it enables comprehensive understanding and accurate responses by integrating insights from diverse data representations such as tables, charts, and text. Most existing studies in MMQA only focus o
External link:
http://arxiv.org/abs/2410.21414
Author:
Tian, Weiwei, Huang, Xinyu, Cheng, Tianhao, He, Wen, Fang, Jinwu, Feng, Rui, Geng, Daoying, Zhang, Xiaobo
Pediatric pneumonia is the leading cause of death among children under five years worldwide, imposing a substantial burden on affected families. Currently, there are three significant hurdles in diagnosing and treating pediatric pneumonia. Firstly, p
External link:
http://arxiv.org/abs/2409.02608
Solving partial differential equations (PDEs) effectively necessitates a multi-scale approach, particularly critical in high-dimensional scenarios characterized by increasing grid points or resolution. Traditional methods often fail to capture the de
External link:
http://arxiv.org/abs/2406.04822
Author:
Li, Qingqiu, Yan, Xiaohan, Xu, Jilan, Yuan, Runtian, Zhang, Yuejie, Feng, Rui, Shen, Quanli, Zhang, Xiaobo, Wang, Shujun
Learning medical visual representations through vision-language pre-training has made remarkable progress. Despite the promising performance, it still faces challenges, i.e., local alignment lacks interpretability and clinical relevance, and the i
External link:
http://arxiv.org/abs/2403.09294
Author:
Ma, Zhe, Dong, Jianfeng, Ji, Shouling, Liu, Zhenguang, Zhang, Xuhong, Wang, Zonghui, He, Sifeng, Qian, Feng, Zhang, Xiaobo, Yang, Lei
Visual retrieval aims to search for the most relevant visual items, e.g., images and videos, from a candidate gallery with a given query item. Accuracy and efficiency are two competing objectives in retrieval tasks. Instead of crafting a new method p
External link:
http://arxiv.org/abs/2312.09716
Author:
Zhao, Bowen, Ji, Changkai, Zhang, Yuejie, He, Wen, Wang, Yingwen, Wang, Qing, Feng, Rui, Zhang, Xiaobo
With the Generative Pre-trained Transformer 3.5 (GPT-3.5) exhibiting remarkable reasoning and comprehension abilities in Natural Language Processing (NLP), most Question Answering (QA) research has primarily centered around general QA tasks based on
External link:
http://arxiv.org/abs/2312.11521
Author:
Li, Qingqiu, Xu, Jilan, Yuan, Runtian, Chen, Mohan, Zhang, Yuejie, Feng, Rui, Zhang, Xiaobo, Gao, Shang
Automatic generation of radiology reports holds crucial clinical value, as it can alleviate substantial workload on radiologists and remind less experienced ones of potential anomalies. Despite the remarkable performance of various image captioning m
External link:
http://arxiv.org/abs/2311.00399
Published in:
Shipin Kexue, Vol 45, Iss 22, Pp 269-279 (2024)
With the acceleration of industrialization, heavy metal pollution has become increasingly severe, posing a significant threat to human health. In recent years, various biosensors have been widely used for heavy metal detection. Among them, electroche
External link:
https://doaj.org/article/46a5bf97737b405ab6815b3151bbc4b7
Author:
Jiang, Chen, Huang, Kaiming, He, Sifeng, Yang, Xudong, Zhang, Wei, Zhang, Xiaobo, Cheng, Yuan, Yang, Lei, Wang, Qing, Xu, Furong, Pan, Tan, Chu, Wei
With the explosive growth of web videos in recent years, large-scale Content-Based Video Retrieval (CBVR) becomes increasingly essential in video filtering, recommendation, and copyright protection. Segment-level CBVR (S-CBVR) locates the start and e
External link:
http://arxiv.org/abs/2309.11091