Výsledky vyhledávání - "Zimmermann, Roger"

Report

Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts

Autor: Liu, Xu, Liu, Juncheng, Woo, Gerald, Aksu, Taha, Liang, Yuxuan, Zimmermann, Roger, Liu, Chenghao, Savarese, Silvio, Xiong, Caiming, Sahoo, Doyen

Time series foundation models have demonstrated impressive performance as zero-shot forecasters. However, achieving effectively unified training on time series remains an open challenge. Existing approaches introduce some level of model specializatio

Externí odkaz: http://arxiv.org/abs/2410.10469

Zobrazit plný text záznamu

Report

Manifold-Aware Local Feature Modeling for Semi-Supervised Medical Image Segmentation

Autor: Shen, Sicheng, Cao, Jinming, Yin, Yifang, Zimmermann, Roger

Achieving precise medical image segmentation is vital for effective treatment planning and accurate disease diagnosis. Traditional fully-supervised deep learning methods, though highly precise, are heavily reliant on large volumes of labeled data, wh

Externí odkaz: http://arxiv.org/abs/2410.10287

Zobrazit plný text záznamu

Report

Grounding is All You Need? Dual Temporal Grounding for Video Dialog

Autor: Qin, You, Ji, Wei, Lan, Xinze, Fei, Hao, Yang, Xun, Guo, Dan, Zimmermann, Roger, Liao, Lizi

In the realm of video dialog response generation, the understanding of video content and the temporal nuances of conversation history are paramount. While a segment of current research leans heavily on large-scale pretrained visual-language models an

Externí odkaz: http://arxiv.org/abs/2410.05767

Zobrazit plný text záznamu

Report

DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving

Autor: Tu, Jiahang, Ji, Wei, Zhao, Hanbin, Zhang, Chao, Zimmermann, Roger, Qian, Hui

In autonomous driving, deep models have shown remarkable performance across various visual perception tasks with the demand of high-quality and huge-diversity training datasets. Such datasets are expected to cover various driving scenarios with adver

Externí odkaz: http://arxiv.org/abs/2407.15661

Zobrazit plný text záznamu

Report

Described Spatial-Temporal Video Detection

Autor: Ji, Wei, Liu, Xiangyan, Sun, Yingfei, Deng, Jiajun, Qin, You, Nuwanna, Ammar, Qiu, Mengyao, Wei, Lina, Zimmermann, Roger

Detecting visual content on language expression has become an emerging topic in the community. However, in the video domain, the existing setting, i.e., spatial-temporal video grounding (STVG), is formulated to only detect one pre-existing object in

Externí odkaz: http://arxiv.org/abs/2407.05610

Zobrazit plný text záznamu

Report

Do As I Do: Pose Guided Human Motion Copy

Autor: Wu, Sifan, Liu, Zhenguang, Zhang, Beibei, Zimmermann, Roger, Ba, Zhongjie, Zhang, Xiaosong, Ren, Kui

Human motion copy is an intriguing yet challenging task in artificial intelligence and computer vision, which strives to generate a fake video of a target person performing the motion of a source person. The problem is inherently challenging due to t

Externí odkaz: http://arxiv.org/abs/2406.16601

Zobrazit plný text záznamu

Report

PetalView: Fine-grained Location and Orientation Extraction of Street-view Images via Cross-view Local Search with Supplementary Materials

Autor: Hu, Wenmiao, Zhang, Yichen, Liang, Yuxuan, Han, Xianjing, Yin, Yifang, Kruppa, Hannes, Ng, See-Kiong, Zimmermann, Roger

Publikováno v: Proceedings of the 31st ACM International Conference on Multimedia (2023) 56-66

Satellite-based street-view information extraction by cross-view matching refers to a task that extracts the location and orientation information of a given street-view image query by using one or multiple geo-referenced satellite images. Recent work

Externí odkaz: http://arxiv.org/abs/2406.13409

Zobrazit plný text záznamu

Report

Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach

Autor: Zhang, Huaiwu, Xia, Yutong, Zhong, Siru, Wang, Kun, Tong, Zekun, Wen, Qingsong, Zimmermann, Roger, Liang, Yuxuan

The increasing number of vehicles highlights the need for efficient parking space management. Predicting real-time Parking Availability (PA) can help mitigate traffic congestion and the corresponding social problems, which is a pressing issue in dens

Externí odkaz: http://arxiv.org/abs/2405.18910

Zobrazit plný text záznamu

Report

Backpropagation-Free Multi-modal On-Device Model Adaptation via Cloud-Device Collaboration

Autor: Ji, Wei, Li, Li, Lv, Zheqi, Zhang, Wenqiao, Li, Mengze, Wan, Zhen, Lei, Wenqiang, Zimmermann, Roger

In our increasingly interconnected world, where intelligent devices continually amass copious personalized multi-modal data, a pressing need arises to deliver high-quality, personalized device-aware services. However, this endeavor presents a multifa

Externí odkaz: http://arxiv.org/abs/2406.01601

Zobrazit plný text záznamu

Report

Prompt-Enhanced Spatio-Temporal Graph Transfer Learning

Autor: Hu, Junfeng, Liu, Xu, Fan, Zhencheng, Yin, Yifang, Xiang, Shili, Ramasamy, Savitha, Zimmermann, Roger

Spatio-temporal graph neural networks have demonstrated efficacy in capturing complex dependencies for urban computing tasks such as forecasting and kriging. However, their performance is constrained by the reliance on extensive data for training on

Externí odkaz: http://arxiv.org/abs/2405.12452

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání