Výsledky vyhledávání

Report

Skinned Motion Retargeting with Dense Geometric Interaction Perception

Autor: Ye, Zijie, Liu, Jia-Wei, Jia, Jia, Sun, Shikun, Shou, Mike Zheng

Capturing and maintaining geometric interactions among different body parts is crucial for successful motion retargeting in skinned characters. Existing approaches often overlook body geometries or add a geometry correction stage after skeletal motio

Externí odkaz: http://arxiv.org/abs/2410.20986

Zobrazit plný text záznamu

Report

Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization

Autor: Wang, Xingqi, Yi, Xiaoyuan, Xie, Xing, Jia, Jia

Recent advancements in diffusion models trained on large-scale data have enabled the generation of indistinguishable human-level images, yet they often produce harmful content misaligned with human values, e.g., social bias, and offensive content. De

Externí odkaz: http://arxiv.org/abs/2410.12700

Zobrazit plný text záznamu

Report

VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding

Autor: Chen, Houlun, Wang, Xin, Chen, Hong, Zhang, Zeyang, Feng, Wei, Huang, Bin, Jia, Jia, Zhu, Wenwu

Existing Video Corpus Moment Retrieval (VCMR) is limited to coarse-grained understanding, which hinders precise video moment localization when given fine-grained queries. In this paper, we propose a more challenging fine-grained VCMR benchmark requir

Externí odkaz: http://arxiv.org/abs/2410.08593

Zobrazit plný text záznamu

Report

DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis

Autor: Wang, Zixuan, Li, Jiayi, Qin, Xiaoyu, Sun, Shikun, Zhou, Songtao, Jia, Jia, Luo, Jiebo

Synthesizing camera movements from music and dance is highly challenging due to the contradicting requirements and complexities of dance cinematography. Unlike human movements, which are always continuous, dance camera movements involve both continuo

Externí odkaz: http://arxiv.org/abs/2409.14925

Zobrazit plný text záznamu

Report

VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling

Autor: Zhou, Yixuan, Qin, Xiaoyu, Jin, Zeyu, Zhou, Shuoyi, Lei, Shun, Zhou, Songtao, Wu, Zhiyong, Jia, Jia

Recent AIGC systems possess the capability to generate digital multimedia content based on human language instructions, such as text, image and video. However, when it comes to speech, existing methods related to human instruction-to-speech generatio

Externí odkaz: http://arxiv.org/abs/2408.15676

Zobrazit plný text záznamu

Report

SpeechCraft: A Fine-grained Expressive Speech Dataset with Natural Language Description

Autor: Jin, Zeyu, Jia, Jia, Wang, Qixin, Li, Kehan, Zhou, Shuoyi, Zhou, Songtao, Qin, Xiaoyu, Wu, Zhiyong

Speech-language multi-modal learning presents a significant challenge due to the fine nuanced information inherent in speech styles. Therefore, a large-scale dataset providing elaborate comprehension of speech style is urgently needed to facilitate i

Externí odkaz: http://arxiv.org/abs/2408.13608

Zobrazit plný text záznamu

Report

PlacidDreamer: Advancing Harmony in Text-to-3D Generation

Autor: Huang, Shuo, Sun, Shikun, Wang, Zixuan, Qin, Xiaoyu, Xiong, Yanmin, Zhang, Yuan, Wan, Pengfei, Zhang, Di, Jia, Jia

Recently, text-to-3D generation has attracted significant attention, resulting in notable performance enhancements. Previous methods utilize end-to-end 3D generation models to initialize 3D Gaussians, multi-view diffusion models to enforce multi-view

Externí odkaz: http://arxiv.org/abs/2407.13976

Zobrazit plný text záznamu

Report

Enhancing Monotonic Modeling with Spatio-Temporal Adaptive Awareness in Diverse Marketing

Autor: Li, Bin, Pei, Jiayan, Xiao, Feiyang, Zhao, Yifan, Zhang, Zhixing, Liu, Diwei, He, HengXu, Jia, Jia

In the mobile internet era, the Online Food Ordering Service (OFOS) emerges as an integral component of inclusive finance owing to the convenience it brings to people. OFOS platforms offer dynamic allocation incentives to users and merchants through

Externí odkaz: http://arxiv.org/abs/2406.14132

Zobrazit plný text záznamu

Report

Sheaf stable pairs, Quot-schemes, and birational geometry

Autor: Birkar, Caucher, Jia, Jia, Sheshmani, Artan

In this paper we build bridges between moduli theory of sheaf stable pairs on one hand and birational geometry on the other hand. We will in particular treat moduli of sheaf stable pairs on smooth projective curves in detail and present some calculat

Externí odkaz: http://arxiv.org/abs/2406.00230

Zobrazit plný text záznamu

Report

DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance

Autor: Wang, Zixuan, Jia, Jia, Sun, Shikun, Wu, Haozhe, Han, Rong, Li, Zhenyu, Tang, Di, Zhou, Jiaqing, Luo, Jiebo

Choreographers determine what the dances look like, while cameramen determine the final presentation of dances. Recently, various methods and datasets have showcased the feasibility of dance synthesis. However, camera movement synthesis with music an

Externí odkaz: http://arxiv.org/abs/2403.13667

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání