Zobrazeno 1 - 10
of 60
pro vyhledávání: '"Zhang, Aixi"'
Facial parts swapping aims to selectively transfer regions of interest from the source image onto the target image while maintaining the rest of the target image unchanged. Most studies on face swapping designed specifically for full-face swapping, a
Externí odkaz:
http://arxiv.org/abs/2410.22771
Autor:
Zhou, Jingkai, Wang, Benzhi, Chen, Weihua, Bai, Jingqi, Li, Dongyang, Zhang, Aixi, Xu, Hao, Yang, Mingyang, Wang, Fan
Controllable character animation is an emerging task that generates character videos controlled by pose sequences from given character images. Although character consistency has made significant progress via reference UNet, another crucial factor, po
Externí odkaz:
http://arxiv.org/abs/2409.06202
When hearing music, it is natural for people to dance to its rhythm. Automatic dance generation, however, is a challenging task due to the physical constraints of human motion and rhythmic alignment with target music. Conventional autoregressive meth
Externí odkaz:
http://arxiv.org/abs/2308.02915
Autor:
Zhuo, Le, Wang, Zhaokai, Wang, Baisen, Liao, Yue, Bao, Chenxi, Peng, Stanley, Han, Songhao, Zhang, Aixi, Fang, Fei, Liu, Si
Music is essential when editing videos, but selecting music manually is difficult and time-consuming. Thus, we seek to automatically generate background music tracks given video input. This is a challenging task since it requires music-video datasets
Externí odkaz:
http://arxiv.org/abs/2211.11248
The task of Human-Object Interaction~(HOI) detection could be divided into two core problems, i.e., human-object association and interaction understanding. In this paper, we reveal and address the disadvantages of the conventional query-driven HOI de
Externí odkaz:
http://arxiv.org/abs/2203.13954
Two-stage methods have dominated Human-Object Interaction (HOI) detection for several years. Recently, one-stage HOI detection methods have become popular. In this paper, we aim to explore the essential pros and cons of two-stage and one-stage method
Externí odkaz:
http://arxiv.org/abs/2108.05077
Recently proposed fine-grained 3D visual grounding is an essential and challenging task, whose goal is to identify the 3D object referred by a natural language sentence from other distractive objects of the same category. Existing works usually adopt
Externí odkaz:
http://arxiv.org/abs/2108.02388
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Publikováno v:
Proceedings of the National Academy of Sciences of the United States of America, 2020 Sep . 117(39), 24076-24081.
Externí odkaz:
https://www.jstor.org/stable/26969511
Publikováno v:
IEEE Transactions on Pattern Analysis and Machine Intelligence; October 2024, Vol. 46 Issue: 10 p6826-6841, 16p