Výsledky vyhledávání

Report

Taming Rectified Flow for Inversion and Editing

Autor: Wang, Jiangshan, Pu, Junfu, Qi, Zhongang, Guo, Jiayi, Ma, Yue, Huang, Nisha, Chen, Yuxin, Li, Xiu, Shan, Ying

Rectified-flow-based diffusion transformers, such as FLUX and OpenSora, have demonstrated exceptional performance in the field of image and video generation. Despite their robust generative capabilities, these models often suffer from inaccurate inve

Externí odkaz: http://arxiv.org/abs/2411.04746

Zobrazit plný text záznamu

Report

SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses

Autor: Tan, Chaolei, Lin, Zihang, Pu, Junfu, Qi, Zhongang, Pei, Wei-Yi, Qu, Zhi, Wang, Yexin, Shan, Ying, Zheng, Wei-Shi, Hu, Jian-Fang

Video grounding is a fundamental problem in multimodal content understanding, aiming to localize specific natural language queries in an untrimmed video. However, current video grounding datasets merely focus on simple events and are either limited t

Externí odkaz: http://arxiv.org/abs/2408.01669

Zobrazit plný text záznamu

Report

How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?

Autor: Chen, Yuxin, Ma, Zongyang, Zhang, Ziqi, Qi, Zhongang, Yuan, Chunfeng, Li, Bing, Pu, Junfu, Shan, Ying, Qi, Xiaojuan, Hu, Weiming

Dominant dual-encoder models enable efficient image-text retrieval but suffer from limited accuracy while the cross-encoder models offer higher accuracy at the expense of efficiency. Distilling cross-modality matching knowledge from cross-encoder to

Externí odkaz: http://arxiv.org/abs/2407.07479

Zobrazit plný text záznamu

Report

Music-driven Dance Regeneration with Controllable Key Pose Constraints

Autor: Pu, Junfu, Shan, Ying

In this paper, we propose a novel framework for music-driven dance motion synthesis with controllable key pose constraint. In contrast to methods that generate dance motion sequences only based on music without any other controllable conditions, this

Externí odkaz: http://arxiv.org/abs/2207.03682

Zobrazit plný text záznamu

Report

Learning Music-Dance Representations through Explicit-Implicit Rhythm Synchronization

Autor: Yu, Jiashuo, Pu, Junfu, Cheng, Ying, Feng, Rui, Shan, Ying

Although audio-visual representation has been proved to be applicable in many downstream tasks, the representation of dancing videos, which is more specific and always accompanied by music with complex auditory contents, remains challenging and uninv

Externí odkaz: http://arxiv.org/abs/2207.03190

Zobrazit plný text záznamu

Report

Improving Sign Language Translation with Monolingual Data by Sign Back-Translation

Autor: Zhou, Hao, Zhou, Wengang, Qi, Weizhen, Pu, Junfu, Li, Houqiang

Despite existing pioneering works on sign language translation (SLT), there is a non-trivial obstacle, i.e., the limited quantity of parallel sign-text data. To tackle this parallel data bottleneck, we propose a sign back-translation (SignBT) approac

Externí odkaz: http://arxiv.org/abs/2105.12397

Zobrazit plný text záznamu

Report

Boosting Continuous Sign Language Recognition via Cross Modality Augmentation

Autor: Pu, Junfu, Zhou, Wengang, Hu, Hezhen, Li, Houqiang

Continuous sign language recognition (SLR) deals with unaligned video-text pair and uses the word error rate (WER), i.e., edit distance, as the main evaluation metric. Since it is not differentiable, we usually instead optimize the learning model wit

Externí odkaz: http://arxiv.org/abs/2010.05264

Zobrazit plný text záznamu

Report

Global-local Enhancement Network for NMFs-aware Sign Language Recognition

Autor: Hu, Hezhen, Zhou, Wengang, Pu, Junfu, Li, Houqiang

Sign language recognition (SLR) is a challenging problem, involving complex manual features, i.e., hand gestures, and fine-grained non-manual features (NMFs), i.e., facial expression, mouth shapes, etc. Although manual features are dominant, non-manu

Externí odkaz: http://arxiv.org/abs/2008.10428

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání