Zobrazeno 1 - 10
of 594
pro vyhledávání: '"Li, Haoxin"'
Knowledge-based Visual Question-answering (K-VQA) often requires the use of background knowledge beyond the image. However, we discover that a single knowledge generation strategy is often insufficient for all K-VQA questions. To this end, we propose
Externí odkaz:
http://arxiv.org/abs/2406.12746
Autor:
Zhang, Kaiyan, Zeng, Sihang, Hua, Ermo, Ding, Ning, Chen, Zhang-Ren, Ma, Zhiyuan, Li, Haoxin, Cui, Ganqu, Qi, Biqing, Zhu, Xuekai, Lv, Xingtai, Jinfang, Hu, Liu, Zhiyuan, Zhou, Bowen
Large Language Models (LLMs) have demonstrated remarkable capabilities across various domains and are moving towards more specialized areas. Recent advanced proprietary models such as GPT-4 and Gemini have achieved significant advancements in biomedi
Externí odkaz:
http://arxiv.org/abs/2406.03949
The task of multimodal relation extraction has attracted significant research attention, but progress is constrained by the scarcity of available training data. One natural thought is to extend existing datasets with cross-modal generative models. In
Externí odkaz:
http://arxiv.org/abs/2312.03025
Generative retrieval (Wang et al., 2022; Tay et al., 2022) is a popular approach for end-to-end document retrieval that directly generates document identifiers given an input query. We introduce summarization-based document IDs, in which each documen
Externí odkaz:
http://arxiv.org/abs/2311.08593
Large-scale language model pretraining is a very successful form of self-supervised learning in natural language processing, but it is increasingly expensive to perform as the models and pretraining corpora have become larger over time. We propose Na
Externí odkaz:
http://arxiv.org/abs/2301.04761
Mitigating and Evaluating Static Bias of Action Representations in the Background and the Foreground
In video action recognition, shortcut static features can interfere with the learning of motion features, resulting in poor out-of-distribution (OOD) generalization. The video background is clearly a source of static bias, but the video foreground, s
Externí odkaz:
http://arxiv.org/abs/2211.12883
Publikováno v:
In Journal of Environmental Management November 2024 370
Autor:
Li, Haoxin, Guo, Jingpeng, Wang, Yadong, Wang, Weiyan, Jia, Qi, Wan, Huawei, Li, Frank Yonghong
Publikováno v:
In Catena November 2024 246
Autor:
Wang, Yadong, Cheng, Jianwei, Wuji, Siguleng, Li, Haoxin, Wang, Yanan, Guo, Jingpeng, Liu, Xinmin, Yonghong Li, Frank
Publikováno v:
In Ecological Indicators October 2024 167
Autor:
Jia, Qi, Gao, Xiaotian, Jiang, Zhaolin, Li, Haoxin, Guo, Jingpeng, Lu, Xueyan, Yonghong Li, Frank
Publikováno v:
In Ecological Indicators September 2024 166