Search results

Report

NASM: Neural Anisotropic Surface Meshing

Authors: Li, Hongbo; Zhu, Haikuan; Zhong, Sikai; Wang, Ningna; Lin, Cheng; Guo, Xiaohu; Xin, Shiqing; Wang, Wenping; Hua, Jing; Zhong, Zichun

This paper introduces a new learning-based method, NASM, for anisotropic surface meshing. Our key idea is to propose a graph neural network to embed an input mesh into a high-dimensional (high-d) Euclidean embedding space to preserve curvature-based

External link: http://arxiv.org/abs/2410.23109

Report

DiffTED: One-shot Audio-driven TED Talk Video Generation with Diffusion-based Co-speech Gestures

Authors: Hogue, Steven; Zhang, Chenxu; Daruger, Hamza; Tian, Yapeng; Guo, Xiaohu

Audio-driven talking video generation has advanced significantly, but existing methods often depend on video-to-video translation techniques and traditional generative networks like GANs, and they typically generate talking heads and co-speech gestures

External link: http://arxiv.org/abs/2409.07649

Report

SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction

Authors: Xie, Weixing; Yao, Junfeng; Cao, Xianpeng; Lin, Qiqin; Tang, Zerui; Dong, Xiao; Guo, Xiaohu

Dynamic reconstruction of deformable tissues in endoscopic video is a key technology for robot-assisted surgery. Recent reconstruction methods based on neural radiance fields (NeRFs) have achieved remarkable results in the reconstruction of surgical

External link: http://arxiv.org/abs/2407.05023

Report

HO-Cap: A Capture System and Dataset for 3D Reconstruction and Pose Tracking of Hand-Object Interaction

Authors: Wang, Jikai; Zhang, Qifan; Chao, Yu-Wei; Wen, Bowen; Guo, Xiaohu; Xiang, Yu

We introduce a data capture system and a new dataset named HO-Cap that can be used to study 3D reconstruction and pose tracking of hands and objects in videos. The capture system uses multiple RGB-D cameras and a HoloLens headset for data collection,

External link: http://arxiv.org/abs/2406.06843

Report

CWF: Consolidating Weak Features in High-quality Mesh Simplification

Authors: Xu, Rui; Liu, Longdu; Wang, Ningna; Chen, Shuangmin; Xin, Shiqing; Guo, Xiaohu; Zhong, Zichun; Komura, Taku; Wang, Wenping; Tu, Changhe

In mesh simplification, common requirements like accuracy, triangle quality, and feature alignment are often considered as a trade-off. Existing algorithms concentrate on just one or a few specific aspects of these requirements. For example, the well

External link: http://arxiv.org/abs/2404.15661

Report

Robust Active Speaker Detection in Noisy Environments

Authors: Vasireddy, Siva Sai Nagender; Zhang, Chenxu; Guo, Xiaohu; Tian, Yapeng

This paper addresses the issue of active speaker detection (ASD) in noisy environments and formulates a robust active speaker detection (rASD) problem. Existing ASD approaches leverage both audio and visual modalities, but non-speech sounds in the su

External link: http://arxiv.org/abs/2403.19002

Report

MATTopo: Topology-preserving Medial Axis Transform with Restricted Power Diagram

Authors: Wang, Ningna; Huang, Hui; Song, Shibo; Wang, Bin; Wang, Wenping; Guo, Xiaohu

We present a novel topology-preserving 3D medial axis computation framework based on volumetric restricted power diagram (RPD), while preserving the medial features and geometric convergence simultaneously, for both 3D CAD and organic shapes. The vol

External link: http://arxiv.org/abs/2403.18761

Report

DRSM: efficient neural 4d decomposition for dynamic reconstruction in stationary monocular cameras

Authors: Xie, Weixing; Dong, Xiao; Yang, Yong; Lin, Qiqin; Chen, Jingze; Yao, Junfeng; Guo, Xiaohu

With the popularity of monocular videos generated by video sharing and live broadcasting applications, reconstructing and editing dynamic scenes in stationary monocular cameras has become a special but anticipated technology. In contrast to scene rec

External link: http://arxiv.org/abs/2402.00740

Report

DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation

Authors: Zhang, Chenxu; Wang, Chao; Zhang, Jianfeng; Xu, Hongyi; Song, Guoxian; Xie, You; Luo, Linjie; Tian, Yapeng; Guo, Xiaohu; Feng, Jiashi

The generation of emotional talking faces from a single portrait image remains a significant challenge. The simultaneous achievement of expressive emotional talking and accurate lip-sync is particularly difficult, as expressiveness is often compromis

External link: http://arxiv.org/abs/2312.13578

Report

MSFormer: A Skeleton-multiview Fusion Method For Tooth Instance Segmentation

Authors: Li, Yuan; Liu, Huan; Tao, Yubo; He, Xiangyang; Li, Haifeng; Guo, Xiaohu; Lin, Hai

Recently, deep learning-based tooth segmentation methods have been limited by the expensive and time-consuming processes of data collection and labeling. Achieving high-precision segmentation with limited datasets is critical. A viable solution to th

External link: http://arxiv.org/abs/2310.14489
