Showing 1 - 10 of 6,548 results for search: '"Wei, Meng"'
Object-oriented embodied navigation aims to locate specific objects, defined by category or depicted in images. Existing methods often struggle to generalize to open vocabulary goals without extensive training data. While recent advances in Vision-La
External link:
http://arxiv.org/abs/2407.09016
Weakly supervised learning has recently achieved considerable success in reducing annotation costs and label noise. Unfortunately, existing weakly supervised learning methods are short of ability in generating reliable labels via pre-trained vision-l
External link:
http://arxiv.org/abs/2405.15228
In multi-label classification, each training instance is associated with multiple class labels simultaneously. Unfortunately, collecting the fully precise class labels for each training instance is time- and labor-consuming for real-world application
External link:
http://arxiv.org/abs/2403.16482
Long-tailed data is prevalent in real-world classification tasks and heavily relies on supervised information, which makes the annotation process exceptionally labor-intensive and time-consuming. Unfortunately, despite being a common approach to miti
External link:
http://arxiv.org/abs/2403.16469
Radiology report generation (RRG) has attracted significant attention due to its potential to reduce the workload of radiologists. Current RRG approaches are still unsatisfactory against clinical standards. This paper introduces a novel RRG method, \
External link:
http://arxiv.org/abs/2403.06728
Segmenting and recognizing diverse object parts is a crucial ability in applications spanning various computer vision and robotic tasks. While significant progress has been made in object-level Open-Vocabulary Semantic Segmentation (OVSS), i.e., segm
External link:
http://arxiv.org/abs/2310.05107
Masked AutoEncoder (MAE) has revolutionized the field of self-supervised learning with its simple yet effective masking and reconstruction strategies. However, despite achieving state-of-the-art performance across various downstream vision tasks, the
External link:
http://arxiv.org/abs/2310.01994