Zobrazeno 1 - 10
of 429
pro vyhledávání: '"Zimmermann, Roger"'
In autonomous driving, deep models have shown remarkable performance across various visual perception tasks with the demand of high-quality and huge-diversity training datasets. Such datasets are expected to cover various driving scenarios with adver
Externí odkaz:
http://arxiv.org/abs/2407.15661
Autor:
Ji, Wei, Liu, Xiangyan, Sun, Yingfei, Deng, Jiajun, Qin, You, Nuwanna, Ammar, Qiu, Mengyao, Wei, Lina, Zimmermann, Roger
Detecting visual content on language expression has become an emerging topic in the community. However, in the video domain, the existing setting, i.e., spatial-temporal video grounding (STVG), is formulated to only detect one pre-existing object in
Externí odkaz:
http://arxiv.org/abs/2407.05610
Autor:
Wu, Sifan, Liu, Zhenguang, Zhang, Beibei, Zimmermann, Roger, Ba, Zhongjie, Zhang, Xiaosong, Ren, Kui
Human motion copy is an intriguing yet challenging task in artificial intelligence and computer vision, which strives to generate a fake video of a target person performing the motion of a source person. The problem is inherently challenging due to t
Externí odkaz:
http://arxiv.org/abs/2406.16601
Autor:
Hu, Wenmiao, Zhang, Yichen, Liang, Yuxuan, Han, Xianjing, Yin, Yifang, Kruppa, Hannes, Ng, See-Kiong, Zimmermann, Roger
Publikováno v:
Proceedings of the 31st ACM International Conference on Multimedia (2023) 56-66
Satellite-based street-view information extraction by cross-view matching refers to a task that extracts the location and orientation information of a given street-view image query by using one or multiple geo-referenced satellite images. Recent work
Externí odkaz:
http://arxiv.org/abs/2406.13409
Autor:
Zhang, Huaiwu, Xia, Yutong, Zhong, Siru, Wang, Kun, Tong, Zekun, Wen, Qingsong, Zimmermann, Roger, Liang, Yuxuan
The increasing number of vehicles highlights the need for efficient parking space management. Predicting real-time Parking Availability (PA) can help mitigate traffic congestion and the corresponding social problems, which is a pressing issue in dens
Externí odkaz:
http://arxiv.org/abs/2405.18910
Autor:
Ji, Wei, Li, Li, Lv, Zheqi, Zhang, Wenqiao, Li, Mengze, Wan, Zhen, Lei, Wenqiang, Zimmermann, Roger
In our increasingly interconnected world, where intelligent devices continually amass copious personalized multi-modal data, a pressing need arises to deliver high-quality, personalized device-aware services. However, this endeavor presents a multifa
Externí odkaz:
http://arxiv.org/abs/2406.01601
Autor:
Hu, Junfeng, Liu, Xu, Fan, Zhencheng, Yin, Yifang, Xiang, Shili, Ramasamy, Savitha, Zimmermann, Roger
Spatio-temporal graph neural networks have demonstrated efficacy in capturing complex dependencies for urban computing tasks such as forecasting and kriging. However, their performance is constrained by the reliance on extensive data for training on
Externí odkaz:
http://arxiv.org/abs/2405.12452
Photographing optoelectronic displays often introduces unwanted moir\'e patterns due to analog signal interference between the pixel grids of the display and the camera sensor arrays. This work identifies two problems that are largely ignored by exis
Externí odkaz:
http://arxiv.org/abs/2404.18155
Autor:
Anand, Avinash, Kapuriya, Janak, Kirtani, Chhavi, Singh, Apoorv, Saraf, Jay, Lal, Naman, Kumar, Jatin, Shivam, Adarsh Raj, Verma, Astha, Shah, Rajiv Ratn, Zimmermann, Roger
Recent advancements in LLMs have shown their significant potential in tasks like text summarization and generation. Yet, they often encounter difficulty while solving complex physics problems that require arithmetic calculation and a good understandi
Externí odkaz:
http://arxiv.org/abs/2404.12926
Product bundling has been a prevailing marketing strategy that is beneficial in the online shopping scenario. Effective product bundling methods depend on high-quality item representations, which need to capture both the individual items' semantics a
Externí odkaz:
http://arxiv.org/abs/2404.01735