Výsledky vyhledávání

Report

Exploiting the Semantic Knowledge of Pre-trained Text-Encoders for Continual Learning

Autor: Yu, Lu, Tao, Zhe, Yao, Hantao, Van de Weijer, Joost, Xu, Changsheng

Deep neural networks (DNNs) excel on fixed datasets but struggle with incremental and shifting data in real-world scenarios. Continual learning addresses this challenge by allowing models to learn from new data while retaining previously learned know

Externí odkaz: http://arxiv.org/abs/2408.01076

Zobrazit plný text záznamu

Report

Prompt-based Visual Alignment for Zero-shot Policy Transfer

Autor: Gao, Haihan, Zhang, Rui, Yi, Qi, Yao, Hantao, Li, Haochen, Guo, Jiaming, Peng, Shaohui, Gao, Yunkai, Wang, QiCheng, Hu, Xing, Wen, Yuanbo, Zhang, Zihao, Du, Zidong, Li, Ling, Guo, Qi, Chen, Yunji

Overfitting in RL has become one of the main obstacles to applications in reinforcement learning(RL). Existing methods do not provide explicit semantic constrain for the feature extractor, hindering the agent from learning a unified cross-domain repr

Externí odkaz: http://arxiv.org/abs/2406.03250

Zobrazit plný text záznamu

Report

Cluster-Aware Similarity Diffusion for Instance Retrieval

Autor: Luo, Jifei, Yao, Hantao, Xu, Changsheng

Diffusion-based re-ranking is a common method used for retrieving instances by performing similarity propagation in a nearest neighbor graph. However, existing techniques that construct the affinity graph based on pairwise instances can lead to the p

Externí odkaz: http://arxiv.org/abs/2406.02343

Zobrazit plný text záznamu

Report

SEP: Self-Enhanced Prompt Tuning for Visual-Language Model

Autor: Yao, Hantao, Zhang, Rui, Yu, Lu, Xu, Changsheng

Prompt tuning based on Context Optimization (CoOp) effectively adapts visual-language models (VLMs) to downstream tasks by inferring additional learnable prompt tokens. However, these tokens are less discriminative as they are independent of the pre-

Externí odkaz: http://arxiv.org/abs/2405.15549

Zobrazit plný text záznamu

Report

Hierarchical Prompts for Rehearsal-free Continual Learning

Autor: Zuo, Yukun, Yao, Hantao, Yu, Lu, Zhuang, Liansheng, Xu, Changsheng

Continual learning endeavors to equip the model with the capability to integrate current task knowledge while mitigating the forgetting of past task knowledge. Inspired by prompt tuning, prompt-based methods maintain a frozen backbone and train with

Externí odkaz: http://arxiv.org/abs/2401.11544

Zobrazit plný text záznamu

Report

Hierarchical Augmentation and Distillation for Class Incremental Audio-Visual Video Recognition

Autor: Zuo, Yukun, Yao, Hantao, Zhuang, Liansheng, Xu, Changsheng

Audio-visual video recognition (AVVR) aims to integrate audio and visual clues to categorize videos accurately. While existing methods train AVVR models using provided datasets and achieve satisfactory results, they struggle to retain historical clas

Externí odkaz: http://arxiv.org/abs/2401.06287

Zobrazit plný text záznamu

Report

TCP:Textual-based Class-aware Prompt tuning for Visual-Language Model

Autor: Yao, Hantao, Zhang, Rui, Xu, Changsheng

Prompt tuning represents a valuable technique for adapting pre-trained visual-language models (VLM) to various downstream tasks. Recent advancements in CoOp-based methods propose a set of learnable domain-shared or image-conditional textual tokens to

Externí odkaz: http://arxiv.org/abs/2311.18231

Zobrazit plný text záznamu

Report

Learning Domain-Aware Detection Head with Prompt Tuning

Autor: Li, Haochen, Zhang, Rui, Yao, Hantao, Song, Xinkai, Hao, Yifan, Zhao, Yongwei, Li, Ling, Chen, Yunji

Domain adaptive object detection (DAOD) aims to generalize detectors trained on an annotated source domain to an unlabelled target domain. However, existing methods focus on reducing the domain bias of the detection backbone by inferring a discrimina

Externí odkaz: http://arxiv.org/abs/2306.05718

Zobrazit plný text záznamu

Report

Camera-Incremental Object Re-Identification with Identity Knowledge Evolution

Autor: Yao, Hantao, Yu, Lu, Luo, Jifei, Xu, Changsheng

Object Re-identification (ReID) aims to retrieve the probe object from many gallery images with the ReID model inferred based on a stationary camera-free dataset by associating and collecting the identities across all camera views. When deploying the

Externí odkaz: http://arxiv.org/abs/2305.15909

Zobrazit plný text záznamu

Report

Visual-Language Prompt Tuning with Knowledge-guided Context Optimization

Autor: Yao, Hantao, Zhang, Rui, Xu, Changsheng

Prompt tuning is an effective way to adapt the pre-trained visual-language model (VLM) to the downstream task using task-related textual tokens. Representative CoOp-based work combines the learnable textual tokens with the class tokens to obtain spec

Externí odkaz: http://arxiv.org/abs/2303.13283

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání