Zobrazeno 1 - 10
of 236 995
pro vyhledávání: '"computer science - computer vision and pattern recognition"'
Recent multi-frame lifting methods have dominated the 3D human pose estimation. However, previous methods ignore the intricate dependence within the 2D pose sequence and learn single temporal correlation. To alleviate this limitation, we propose TCPF
Externí odkaz:
http://arxiv.org/abs/2501.01770
Publikováno v:
IEEE Transactions on Circuits and Systems for Video Technology, 2024
Most facial expression recognition (FER) models are trained on large-scale expression data with centralized learning. Unfortunately, collecting a large amount of centralized expression data is difficult in practice due to privacy concerns of facial i
Externí odkaz:
http://arxiv.org/abs/2501.01816
Autor:
Liu, Huaize, Sun, Wenzhang, Di, Donglin, Sun, Shibo, Yang, Jiahui, Zou, Changqing, Bao, Hujun
The generation of talking avatars has achieved significant advancements in precise audio synchronization. However, crafting lifelike talking head videos requires capturing a broad spectrum of emotions and subtle facial expressions. Current methods fa
Externí odkaz:
http://arxiv.org/abs/2501.01808
Significant progress has been made in talking-face video generation research; however, precise lip-audio synchronization and high visual quality remain challenging in editing lip shapes based on input audio. This paper introduces JoyGen, a novel two-
Externí odkaz:
http://arxiv.org/abs/2501.01798
Loop closure detection in large-scale and long-term missions can be computationally demanding due to the need to identify, verify, and process numerous candidate pairs to establish edge connections for the pose graph optimization. Keyframe sampling m
Externí odkaz:
http://arxiv.org/abs/2501.01791
This paper presents a powerful framework to customize video creations by incorporating multiple specific identity (ID) photos, with video diffusion Transformers, referred to as \texttt{Ingredients}. Generally, our method consists of three primary mod
Externí odkaz:
http://arxiv.org/abs/2501.01790
6-Degree of Freedom (6DoF) motion estimation with a combination of visual and inertial sensors is a growing area with numerous real-world applications. However, precise calibration of the time offset between these two sensor types is a prerequisite f
Externí odkaz:
http://arxiv.org/abs/2501.01788
Cloud gaming is an advanced form of Internet service that necessitates local terminals to decode within limited resources and time latency. Super-Resolution (SR) techniques are often employed on these terminals as an efficient way to reduce the requi
Externí odkaz:
http://arxiv.org/abs/2501.01773
Autor:
Jin, Er, Feng, Qihui, Mou, Yongli, Decker, Stefan, Lakemeyer, Gerhard, Simons, Oliver, Stegmaier, Johannes
Logical image understanding involves interpreting and reasoning about the relationships and consistency within an image's visual content. This capability is essential in applications such as industrial inspection, where logical anomaly detection is c
Externí odkaz:
http://arxiv.org/abs/2501.01767
LiDAR scenes constitute a fundamental source for several autonomous driving applications. Despite the existence of several datasets, scenes from adverse weather conditions are rarely available. This limits the robustness of downstream machine learnin
Externí odkaz:
http://arxiv.org/abs/2501.01761