Zobrazeno 1 - 10
of 502
pro vyhledávání: '"Zimmermann, Roger"'
Autor:
Liu, Xu, Liu, Juncheng, Woo, Gerald, Aksu, Taha, Liang, Yuxuan, Zimmermann, Roger, Liu, Chenghao, Savarese, Silvio, Xiong, Caiming, Sahoo, Doyen
Time series foundation models have demonstrated impressive performance as zero-shot forecasters. However, achieving effectively unified training on time series remains an open challenge. Existing approaches introduce some level of model specializatio
Externí odkaz:
http://arxiv.org/abs/2410.10469
Achieving precise medical image segmentation is vital for effective treatment planning and accurate disease diagnosis. Traditional fully-supervised deep learning methods, though highly precise, are heavily reliant on large volumes of labeled data, wh
Externí odkaz:
http://arxiv.org/abs/2410.10287
In the realm of video dialog response generation, the understanding of video content and the temporal nuances of conversation history are paramount. While a segment of current research leans heavily on large-scale pretrained visual-language models an
Externí odkaz:
http://arxiv.org/abs/2410.05767
In autonomous driving, deep models have shown remarkable performance across various visual perception tasks with the demand of high-quality and huge-diversity training datasets. Such datasets are expected to cover various driving scenarios with adver
Externí odkaz:
http://arxiv.org/abs/2407.15661
Autor:
Ji, Wei, Liu, Xiangyan, Sun, Yingfei, Deng, Jiajun, Qin, You, Nuwanna, Ammar, Qiu, Mengyao, Wei, Lina, Zimmermann, Roger
Detecting visual content on language expression has become an emerging topic in the community. However, in the video domain, the existing setting, i.e., spatial-temporal video grounding (STVG), is formulated to only detect one pre-existing object in
Externí odkaz:
http://arxiv.org/abs/2407.05610
Autor:
Wu, Sifan, Liu, Zhenguang, Zhang, Beibei, Zimmermann, Roger, Ba, Zhongjie, Zhang, Xiaosong, Ren, Kui
Human motion copy is an intriguing yet challenging task in artificial intelligence and computer vision, which strives to generate a fake video of a target person performing the motion of a source person. The problem is inherently challenging due to t
Externí odkaz:
http://arxiv.org/abs/2406.16601
Autor:
Hu, Wenmiao, Zhang, Yichen, Liang, Yuxuan, Han, Xianjing, Yin, Yifang, Kruppa, Hannes, Ng, See-Kiong, Zimmermann, Roger
Publikováno v:
Proceedings of the 31st ACM International Conference on Multimedia (2023) 56-66
Satellite-based street-view information extraction by cross-view matching refers to a task that extracts the location and orientation information of a given street-view image query by using one or multiple geo-referenced satellite images. Recent work
Externí odkaz:
http://arxiv.org/abs/2406.13409
Autor:
Zhang, Huaiwu, Xia, Yutong, Zhong, Siru, Wang, Kun, Tong, Zekun, Wen, Qingsong, Zimmermann, Roger, Liang, Yuxuan
The increasing number of vehicles highlights the need for efficient parking space management. Predicting real-time Parking Availability (PA) can help mitigate traffic congestion and the corresponding social problems, which is a pressing issue in dens
Externí odkaz:
http://arxiv.org/abs/2405.18910
Autor:
Ji, Wei, Li, Li, Lv, Zheqi, Zhang, Wenqiao, Li, Mengze, Wan, Zhen, Lei, Wenqiang, Zimmermann, Roger
In our increasingly interconnected world, where intelligent devices continually amass copious personalized multi-modal data, a pressing need arises to deliver high-quality, personalized device-aware services. However, this endeavor presents a multifa
Externí odkaz:
http://arxiv.org/abs/2406.01601
Autor:
Hu, Junfeng, Liu, Xu, Fan, Zhencheng, Yin, Yifang, Xiang, Shili, Ramasamy, Savitha, Zimmermann, Roger
Spatio-temporal graph neural networks have demonstrated efficacy in capturing complex dependencies for urban computing tasks such as forecasting and kriging. However, their performance is constrained by the reliance on extensive data for training on
Externí odkaz:
http://arxiv.org/abs/2405.12452