Search results

Report

IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation

Authors: Yang, Sejong; Oh, Seoung Wug; Zhou, Yang; Kim, Seon Joo

We introduce a novel approach for high-resolution talking head generation from a single image and audio input. Prior methods using explicit face models, like 3D morphable models (3DMM) and facial landmarks, often fall short in generating high-fidelit

External link: http://arxiv.org/abs/2412.04000

Report

4D Scaffold Gaussian Splatting for Memory Efficient Dynamic Scene Reconstruction

Authors: Cho, Woong Oh; Cho, In; Kim, Seoha; Bae, Jeongmin; Uh, Youngjung; Kim, Seon Joo

Existing 4D Gaussian methods for dynamic scene reconstruction offer high visual fidelity and fast rendering. However, these methods suffer from excessive memory and storage demands, which limits their practical deployment. This paper proposes a 4D an

External link: http://arxiv.org/abs/2411.17044

Report

The Néron model of a higher-dimensional Lagrangian fibration

Author: Kim, Yoon-Joo

Let $\pi : X \to B$ be a projective Lagrangian fibration of a smooth symplectic variety $X$ to a smooth variety $B$. Denote the complement of the discriminant locus by $B_0 = B \setminus \operatorname{Disc}(\pi)$, its preimage by $X_0 = \pi^{-1}(B_0)

External link: http://arxiv.org/abs/2410.21193

Report

Design of wavelet filter banks for any dilation using Extended Laplacian Pyramid Matrices

Authors: Hur, Youngmi; Kim, Sung Joo

In this paper, we present a new method for designing wavelet filter banks for any dilation matrices and in any dimension. Our approach utilizes extended Laplacian pyramid matrices to achieve this flexibility. By generalizing recent tight wavelet fram

External link: http://arxiv.org/abs/2409.14242

Report

Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos

Authors: Jeon, Subin; Cho, In; Kim, Minsu; Cho, Woong Oh; Kim, Seon Joo

We propose a new framework for creating and easily manipulating 3D models of arbitrary objects using casually captured videos. Our core ingredient is a novel hierarchy deformation model, which captures motions of objects with tree-structured bones.

External link: http://arxiv.org/abs/2408.00351

Report

Accelerating Image Super-Resolution Networks with Pixel-Level Classification

Authors: Jeong, Jinho; Kim, Jinwoo; Jo, Younghyun; Kim, Seon Joo

In recent times, the need for effective super-resolution (SR) techniques has surged, especially for large-scale images ranging from 2K to 8K resolution. For DNN-based SISR, decomposing images into overlapping patches is typically necessary due to computa

External link: http://arxiv.org/abs/2407.21448

Report

Learning to Enhance Aperture Phasor Field for Non-Line-of-Sight Imaging

Authors: Cho, In; Shim, Hyunbo; Kim, Seon Joo

This paper aims to facilitate more practical NLOS imaging by reducing the number of samplings and scan areas. To this end, we introduce a phasor-based enhancement network that is capable of predicting clean and full measurements from noisy partial ob

External link: http://arxiv.org/abs/2407.18574

Report

ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos

Authors: Kang, Hyolim; Hyun, Jeongseok; An, Joungbin; Yu, Youngjae; Kim, Seon Joo

Online Temporal Action Localization (On-TAL) is a critical task that aims to instantaneously identify action instances in untrimmed streaming videos as soon as an action concludes -- a major leap from frame-based Online Action Detection (OAD). Yet, t

External link: http://arxiv.org/abs/2407.12987

Report

Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization

Authors: Hyun, Jeongseok; Han, Su Ho; Kang, Hyolim; Lee, Joon-Young; Kim, Seon Joo

The vocabulary size in temporal action localization (TAL) is limited by the scarcity of large-scale annotated datasets. To overcome this, recent works integrate vision-language models (VLMs), such as CLIP, for open-vocabulary TAL (OV-TAL). However, d

External link: http://arxiv.org/abs/2407.07024

Report

Functional voxel hierarchy and afferent capacity revealed mental state transition on dynamic correlation resting-state fMRI

Authors: Lee, Dong Soo; Kim, Hyun Joo; Huh, Youngmin; Kang, Yeon Koo; Whi, Wonseok; Lee, Hyekyoung; Kang, Hyejin

Voxel hierarchy on dynamic brain graphs is produced by k core percolation on functional dynamic amplitude correlation of resting-state fMRI. Directed graphs and their afferent/efferent capacities are produced by Markov modeling of the universal cover

External link: http://arxiv.org/abs/2406.08140
