Zobrazeno 1 - 10
of 808
pro vyhledávání: '"Wang, Zhisheng"'
In this study, we propose AniPortrait, a novel framework for generating high-quality animation driven by audio and a reference portrait image. Our methodology is divided into two stages. Initially, we extract 3D intermediate representations from audi
Externí odkaz:
http://arxiv.org/abs/2403.17694
Neural radiance fields (NeRFs) are promising 3D representations for scenes, objects, and humans. However, most existing methods require multi-view inputs and per-scene training, which limits their real-life applications. Moreover, current methods foc
Externí odkaz:
http://arxiv.org/abs/2401.00979
Current parametric models have made notable progress in 3D hand pose and shape estimation. However, due to the fixed hand topology and complex hand poses, current models are hard to generate meshes that are aligned with the image well. To tackle this
Externí odkaz:
http://arxiv.org/abs/2312.15916
Recently, linear computed tomography (LCT) systems have actively attracted attention. To weaken projection truncation and image the region of interest (ROI) for LCT, the backprojection filtration (BPF) algorithm is an effective solution. However, in
Externí odkaz:
http://arxiv.org/abs/2309.11858
The objective of stylized speech-driven facial animation is to create animations that encapsulate specific emotional expressions. Existing methods often depend on pre-established emotional labels or facial expression templates, which may limit the ne
Externí odkaz:
http://arxiv.org/abs/2308.14448
Dancing with music is always an essential human art form to express emotion. Due to the high temporal-spacial complexity, long-term 3D realist dance generation synchronized with music is challenging. Existing methods suffer from the freezing problem
Externí odkaz:
http://arxiv.org/abs/2308.11945
This paper is to investigate the high-quality analytical reconstructions of multiple source-translation computed tomography (mSTCT) under an extended field of view (FOV). Under the larger FOVs, the previously proposed backprojection filtration (BPF)
Externí odkaz:
http://arxiv.org/abs/2305.19767
Autor:
Wang, Zhisheng, Yu, Haijun, Huang, Yixing, Wang, Shunli, Ni, Song, Li, Zongfeng, Liu, Fenglin, Cui, Junning
Micro-computed tomography (micro-CT) is a widely used state-of-the-art instrument employed to study the morphological structures of objects in various fields. However, its small field-of-view (FOV) cannot meet the pressing demand for imaging relative
Externí odkaz:
http://arxiv.org/abs/2305.18878
Learning accent from crowd-sourced data is a feasible way to achieve a target speaker TTS system that can synthesize accent speech. To this end, there are two challenging problems to be solved. First, direct use of the poor acoustic quality crowd-sou
Externí odkaz:
http://arxiv.org/abs/2210.17305
Publikováno v:
Tourism Review, 2023, Vol. 78, Issue 6, pp. 1387-1413.
Externí odkaz:
http://www.emeraldinsight.com/doi/10.1108/TR-09-2022-0465