Zobrazeno 1 - 10
of 16 936
pro vyhledávání: '"Jia, Jia"'
Capturing and maintaining geometric interactions among different body parts is crucial for successful motion retargeting in skinned characters. Existing approaches often overlook body geometries or add a geometry correction stage after skeletal motio
Externí odkaz:
http://arxiv.org/abs/2410.20986
Recent advancements in diffusion models trained on large-scale data have enabled the generation of indistinguishable human-level images, yet they often produce harmful content misaligned with human values, e.g., social bias, and offensive content. De
Externí odkaz:
http://arxiv.org/abs/2410.12700
Autor:
Chen, Houlun, Wang, Xin, Chen, Hong, Zhang, Zeyang, Feng, Wei, Huang, Bin, Jia, Jia, Zhu, Wenwu
Existing Video Corpus Moment Retrieval (VCMR) is limited to coarse-grained understanding, which hinders precise video moment localization when given fine-grained queries. In this paper, we propose a more challenging fine-grained VCMR benchmark requir
Externí odkaz:
http://arxiv.org/abs/2410.08593
Synthesizing camera movements from music and dance is highly challenging due to the contradicting requirements and complexities of dance cinematography. Unlike human movements, which are always continuous, dance camera movements involve both continuo
Externí odkaz:
http://arxiv.org/abs/2409.14925
Autor:
Zhou, Yixuan, Qin, Xiaoyu, Jin, Zeyu, Zhou, Shuoyi, Lei, Shun, Zhou, Songtao, Wu, Zhiyong, Jia, Jia
Recent AIGC systems possess the capability to generate digital multimedia content based on human language instructions, such as text, image and video. However, when it comes to speech, existing methods related to human instruction-to-speech generatio
Externí odkaz:
http://arxiv.org/abs/2408.15676
Autor:
Jin, Zeyu, Jia, Jia, Wang, Qixin, Li, Kehan, Zhou, Shuoyi, Zhou, Songtao, Qin, Xiaoyu, Wu, Zhiyong
Speech-language multi-modal learning presents a significant challenge due to the fine nuanced information inherent in speech styles. Therefore, a large-scale dataset providing elaborate comprehension of speech style is urgently needed to facilitate i
Externí odkaz:
http://arxiv.org/abs/2408.13608
Autor:
Huang, Shuo, Sun, Shikun, Wang, Zixuan, Qin, Xiaoyu, Xiong, Yanmin, Zhang, Yuan, Wan, Pengfei, Zhang, Di, Jia, Jia
Recently, text-to-3D generation has attracted significant attention, resulting in notable performance enhancements. Previous methods utilize end-to-end 3D generation models to initialize 3D Gaussians, multi-view diffusion models to enforce multi-view
Externí odkaz:
http://arxiv.org/abs/2407.13976
Autor:
Li, Bin, Pei, Jiayan, Xiao, Feiyang, Zhao, Yifan, Zhang, Zhixing, Liu, Diwei, He, HengXu, Jia, Jia
In the mobile internet era, the Online Food Ordering Service (OFOS) emerges as an integral component of inclusive finance owing to the convenience it brings to people. OFOS platforms offer dynamic allocation incentives to users and merchants through
Externí odkaz:
http://arxiv.org/abs/2406.14132
In this paper we build bridges between moduli theory of sheaf stable pairs on one hand and birational geometry on the other hand. We will in particular treat moduli of sheaf stable pairs on smooth projective curves in detail and present some calculat
Externí odkaz:
http://arxiv.org/abs/2406.00230
Autor:
Wang, Zixuan, Jia, Jia, Sun, Shikun, Wu, Haozhe, Han, Rong, Li, Zhenyu, Tang, Di, Zhou, Jiaqing, Luo, Jiebo
Choreographers determine what the dances look like, while cameramen determine the final presentation of dances. Recently, various methods and datasets have showcased the feasibility of dance synthesis. However, camera movement synthesis with music an
Externí odkaz:
http://arxiv.org/abs/2403.13667