Zobrazeno 1 - 10
of 17 934
pro vyhledávání: '"Kim, A. Joo"'
We introduce a novel approach for high-resolution talking head generation from a single image and audio input. Prior methods using explicit face models, like 3D morphable models (3DMM) and facial landmarks, often fall short in generating high-fidelit
Externí odkaz:
http://arxiv.org/abs/2412.04000
Existing 4D Gaussian methods for dynamic scene reconstruction offer high visual fidelity and fast rendering. However, these methods suffer from excessive memory and storage demands, which limits their practical deployment. This paper proposes a 4D an
Externí odkaz:
http://arxiv.org/abs/2411.17044
Autor:
Kim, Yoon-Joo
Let $\pi : X \to B$ be a projective Lagrangian fibration of a smooth symplectic variety $X$ to a smooth variety $B$. Denote the complement of the discriminant locus by $B_0 = B \setminus \operatorname{Disc}(\pi)$, its preimage by $X_0 = \pi^{-1}(B_0)
Externí odkaz:
http://arxiv.org/abs/2410.21193
Autor:
Hur, Youngmi, Kim, Sung Joo
In this paper, we present a new method for designing wavelet filter banks for any dilation matrices and in any dimension. Our approach utilizes extended Laplacian pyramid matrices to achieve this flexibility. By generalizing recent tight wavelet fram
Externí odkaz:
http://arxiv.org/abs/2409.14242
We propose a new framework for creating and easily manipulating 3D models of arbitrary objects using casually captured videos. Our core ingredient is a novel hierarchy deformation model, which captures motions of objects with a tree-structured bones.
Externí odkaz:
http://arxiv.org/abs/2408.00351
In recent times, the need for effective super-resolution (SR) techniques has surged, especially for large-scale images ranging 2K to 8K resolutions. For DNN-based SISR, decomposing images into overlapping patches is typically necessary due to computa
Externí odkaz:
http://arxiv.org/abs/2407.21448
This paper aims to facilitate more practical NLOS imaging by reducing the number of samplings and scan areas. To this end, we introduce a phasor-based enhancement network that is capable of predicting clean and full measurements from noisy partial ob
Externí odkaz:
http://arxiv.org/abs/2407.18574
Online Temporal Action Localization (On-TAL) is a critical task that aims to instantaneously identify action instances in untrimmed streaming videos as soon as an action concludes -- a major leap from frame-based Online Action Detection (OAD). Yet, t
Externí odkaz:
http://arxiv.org/abs/2407.12987
The vocabulary size in temporal action localization (TAL) is limited by the scarcity of large-scale annotated datasets. To overcome this, recent works integrate vision-language models (VLMs), such as CLIP, for open-vocabulary TAL (OV-TAL). However, d
Externí odkaz:
http://arxiv.org/abs/2407.07024
Autor:
Lee, Dong Soo, Kim, Hyun Joo, Huh, Youngmin, Kang, Yeon Koo, Whi, Wonseok, Lee, Hyekyoung, Kang, Hyejin
Voxel hierarchy on dynamic brain graphs is produced by k core percolation on functional dynamic amplitude correlation of resting-state fMRI. Directed graphs and their afferent/efferent capacities are produced by Markov modeling of the universal cover
Externí odkaz:
http://arxiv.org/abs/2406.08140