Showing 1 - 10 of 42 for search: '"Huang, Linjiang"'
Author:
He, Runze, Ma, Kai, Huang, Linjiang, Huang, Shaofei, Gao, Jialin, Wei, Xiaoming, Dai, Jiao, Han, Jizhong, Liu, Si
Introducing user-specified visual concepts in image editing is highly practical as these concepts convey the user's intent more precisely than text-based descriptions. We propose FreeEdit, a novel approach for achieving such reference-based image edi
External link:
http://arxiv.org/abs/2409.18071
Author:
Huang, Linjiang, Fang, Rongyao, Zhang, Aiping, Song, Guanglu, Liu, Si, Liu, Yu, Li, Hongsheng
In this study, we delve into the generation of high-resolution images from pre-trained diffusion models, addressing persistent challenges, such as repetitive patterns and structural distortions, that emerge when models are applied beyond their traine
External link:
http://arxiv.org/abs/2403.12963
Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels
The task of weakly supervised temporal action localization aims to generate temporal boundaries for actions of interest while also classifying the action category. Pseudo-label-based methods, which serve as an effective solution, h
External link:
http://arxiv.org/abs/2304.07978
In this paper, we present a novel training scheme, namely Teach-DETR, to learn better DETR-based detectors from versatile teacher detectors. We show that the predicted boxes from teacher detectors are an effective medium to transfer knowledge of teacher
External link:
http://arxiv.org/abs/2211.11953
Weakly supervised temporal action localization aims to localize temporal boundaries of actions and simultaneously identify their categories with only video-level category labels. Many existing methods seek to generate pseudo labels for bridging the d
External link:
http://arxiv.org/abs/2203.02925
As a challenging task of high-level video understanding, weakly supervised temporal action localization has been attracting increasing attention. With only video annotations, most existing methods seek to handle this task with a localization-by-class
External link:
http://arxiv.org/abs/2108.06524
Text-based video segmentation aims to segment an actor in video sequences by specifying the actor and its performing action with a textual query. Previous methods fail to explicitly align the video content with the textual query in a fine-grained man
External link:
http://arxiv.org/abs/2011.00786
Academic article
Published in:
In Pattern Recognition August 2019 92:165-176