Zobrazeno 1 - 10
of 449
pro vyhledávání: '"Guo, Xiaohu"'
Autor:
Li, Hongbo, Zhu, Haikuan, Zhong, Sikai, Wang, Ningna, Lin, Cheng, Guo, Xiaohu, Xin, Shiqing, Wang, Wenping, Hua, Jing, Zhong, Zichun
This paper introduces a new learning-based method, NASM, for anisotropic surface meshing. Our key idea is to propose a graph neural network to embed an input mesh into a high-dimensional (high-d) Euclidean embedding space to preserve curvature-based
Externí odkaz:
http://arxiv.org/abs/2410.23109
Audio-driven talking video generation has advanced significantly, but existing methods often depend on video-to-video translation techniques and traditional generative networks like GANs and they typically generate taking heads and co-speech gestures
Externí odkaz:
http://arxiv.org/abs/2409.07649
Dynamic reconstruction of deformable tissues in endoscopic video is a key technology for robot-assisted surgery. Recent reconstruction methods based on neural radiance fields (NeRFs) have achieved remarkable results in the reconstruction of surgical
Externí odkaz:
http://arxiv.org/abs/2407.05023
We introduce a data capture system and a new dataset named HO-Cap that can be used to study 3D reconstruction and pose tracking of hands and objects in videos. The capture system uses multiple RGB-D cameras and a HoloLens headset for data collection,
Externí odkaz:
http://arxiv.org/abs/2406.06843
Autor:
Xu, Rui, Liu, Longdu, Wang, Ningna, Chen, Shuangmin, Xin, Shiqing, Guo, Xiaohu, Zhong, Zichun, Komura, Taku, Wang, Wenping, Tu, Changhe
In mesh simplification, common requirements like accuracy, triangle quality, and feature alignment are often considered as a trade-off. Existing algorithms concentrate on just one or a few specific aspects of these requirements. For example, the well
Externí odkaz:
http://arxiv.org/abs/2404.15661
This paper addresses the issue of active speaker detection (ASD) in noisy environments and formulates a robust active speaker detection (rASD) problem. Existing ASD approaches leverage both audio and visual modalities, but non-speech sounds in the su
Externí odkaz:
http://arxiv.org/abs/2403.19002
We present a novel topology-preserving 3D medial axis computation framework based on volumetric restricted power diagram (RPD), while preserving the medial features and geometric convergence simultaneously, for both 3D CAD and organic shapes. The vol
Externí odkaz:
http://arxiv.org/abs/2403.18761
With the popularity of monocular videos generated by video sharing and live broadcasting applications, reconstructing and editing dynamic scenes in stationary monocular cameras has become a special but anticipated technology. In contrast to scene rec
Externí odkaz:
http://arxiv.org/abs/2402.00740
Autor:
Zhang, Chenxu, Wang, Chao, Zhang, Jianfeng, Xu, Hongyi, Song, Guoxian, Xie, You, Luo, Linjie, Tian, Yapeng, Guo, Xiaohu, Feng, Jiashi
The generation of emotional talking faces from a single portrait image remains a significant challenge. The simultaneous achievement of expressive emotional talking and accurate lip-sync is particularly difficult, as expressiveness is often compromis
Externí odkaz:
http://arxiv.org/abs/2312.13578
Recently, deep learning-based tooth segmentation methods have been limited by the expensive and time-consuming processes of data collection and labeling. Achieving high-precision segmentation with limited datasets is critical. A viable solution to th
Externí odkaz:
http://arxiv.org/abs/2310.14489