Showing 1 - 10 of 764 results for search: '"He, YuJie"'
Document visual question answering (DocVQA) pipelines that answer questions from documents have broad applications. Existing methods focus on handling single-page documents with multi-modal language models (MLMs), or rely on text-based retrieval-augm
External link:
http://arxiv.org/abs/2411.04952
Efficient extraction of spectral sequences and geospatial information has always been a hot topic in hyperspectral image classification. In terms of spectral sequence feature capture, RNN and Transformer have become mainstream classification framewor
External link:
http://arxiv.org/abs/2407.08255
To completely understand a document, the use of textual information is not enough. Understanding visual cues, such as layouts and charts, is also required. While the current state-of-the-art approaches for document understanding (both OCR-based and O
External link:
http://arxiv.org/abs/2406.10085
RGB-D tracking significantly improves the accuracy of object tracking. However, its dependency on real depth inputs and the complexity involved in multi-modal fusion limit its applicability across various scenarios. The utilization of depth informati
External link:
http://arxiv.org/abs/2405.14195
In the context of changing travel behaviors and the expanding user base of Geographic Information System (GIS) services, conventional centralized architectures responsible for handling shortest distance queries are facing increasing challenges, such
External link:
http://arxiv.org/abs/2403.11246
Published in:
Physical Review E (2024), 110, 024210
This research investigates the impact of dynamic, time-varying interactions on cooperative behaviour in social dilemmas. Traditional research has focused on deterministic rules governing pairwise interactions, yet the impact of interaction frequency
External link:
http://arxiv.org/abs/2401.11782
Authors:
Gupta, Vivek, Kandoi, Pranshu, Vora, Mahek Bhavesh, Zhang, Shuo, He, Yujie, Reinanda, Ridho, Srikumar, Vivek
Semi-structured data, such as Infobox tables, often include temporal information about entities, either implicitly or explicitly. Can current NLP systems reason about such information in semi-structured tables? To tackle this question, we introduce t
External link:
http://arxiv.org/abs/2311.08002
Anchor-based detectors have been continuously developed for object detection. However, the individual anchor box makes it difficult to predict the boundary's offset accurately. Instead of taking each bounding box as a closed individual, we consider u
External link:
http://arxiv.org/abs/2310.05666
Authors:
Paez-Granados, Diego, He, Yujie, Gonon, David, Jia, Dan, Leibe, Bastian, Suzuki, Kenji, Billard, Aude
Published in:
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2022)
Autonomous navigation in highly populated areas remains a challenging task for robots because of the difficulty in guaranteeing safe interactions with pedestrians in unstructured situations. In this work, we present a crowd navigation control framewo
External link:
http://arxiv.org/abs/2208.02121
Authors:
Wang, Bin, Xu, Hang, Liu, Yu, Zhou, Kaiping, Li, Xinyu, Kong, Deyang, Chen, Jinmei, He, Yujie, Ji, Rong
Published in:
Water Research, 15 November 2024, 266