Showing 1 - 10 of 329 for search: '"Wu, Dongming"'
Referring multi-object tracking (RMOT) aims at detecting and tracking multiple objects following human instruction represented by a natural language expression. Existing RMOT benchmarks are usually formulated through manual annotations, integrated wi
External link:
http://arxiv.org/abs/2406.05039
Author:
Bai, Yifan, Wu, Dongming, Liu, Yingfei, Jia, Fan, Mao, Weixin, Zhang, Ziheng, Zhao, Yucheng, Shen, Jianbing, Wei, Xing, Wang, Tiancai, Zhang, Xiangyu
Rapid advancements in Autonomous Driving (AD) tasks turned a significant shift toward end-to-end fashion, particularly in the utilization of vision-language models (VLMs) that integrate robust logical reasoning and cognitive abilities to enable compr
External link:
http://arxiv.org/abs/2405.18361
Author:
Yu, En, Zhao, Liang, Wei, Yana, Yang, Jinrong, Wu, Dongming, Kong, Lingyu, Wei, Haoran, Wang, Tiancai, Ge, Zheng, Zhang, Xiangyu, Tao, Wenbing
Humans possess the remarkable ability to foresee the future to a certain extent based on present observations, a skill we term as foresight minds. However, this capability remains largely underexplored within existing Multimodal Large Language Model
External link:
http://arxiv.org/abs/2312.00589
Topology reasoning aims to comprehensively understand road scenes and present drivable routes in autonomous driving. It requires detecting road centerlines (lane) and traffic elements, further reasoning their topology relationship, i.e., lane-lane to
External link:
http://arxiv.org/abs/2310.06753
A new trend in the computer vision community is to capture objects of interest following flexible human command represented by a natural language prompt. However, the progress of using language prompts in driving scenarios is stuck in a bottleneck du
External link:
http://arxiv.org/abs/2309.04379
Referring video object segmentation (RVOS) aims at segmenting an object in a video following human instruction. Current state-of-the-art methods fall into an offline pattern, in which each clip independently interacts with text embedding for cross-mo
External link:
http://arxiv.org/abs/2307.09356
Aspect-based-sentiment-analysis (ABSA) is a fine-grained sentiment evaluation task, which analyzes the emotional polarity of the evaluation aspects. Generally, the emotional polarity of an aspect exists in the corresponding opinion expression, whose
External link:
http://arxiv.org/abs/2306.11260
Author:
Wu, Dongming, Jia, Fan, Chang, Jiahao, Li, Zhuoling, Sun, Jianjian, Han, Chunrui, Li, Shuailin, Liu, Yingfei, Ge, Zheng, Wang, Tiancai
We present the 1st-place solution of OpenLane Topology in Autonomous Driving Challenge. Considering that topology reasoning is based on centerline detection and traffic element detection, we develop a multi-stage framework for high performance. Speci
External link:
http://arxiv.org/abs/2306.09590
Existing referring understanding tasks tend to involve the detection of a single text-referred object. In this paper, we propose a new and general referring understanding task, termed referring multi-object tracking (RMOT). Its core idea is to employ
External link:
http://arxiv.org/abs/2303.03366
Author:
Wu, Dongming
This dissertation examines the vital role of bronze in the political, economic, and cultural interactions in the southern borderlands during the period of the Zhou dynasty (1045-256 BCE) in present-day Hubei province, China. It shows how the bronze e