Author:
Takeshi Haga, Hiroshi Kera, Kazuhiko Kawamoto
Language:
English
Year of publication:
2023
Subject:

Source:
Sensors, Vol 23, Iss 5, p 2515 (2023)
Document type:
article
ISSN:
1424-8220
DOI:
10.3390/s23052515
Description:
In this paper, we propose a sequential variational autoencoder for video disentanglement, which is a representation learning method that can be used to separately extract static and dynamic features from videos. Building sequential variational autoencoders with a two-stream architecture induces inductive bias for video disentanglement. However, our preliminary experiment demonstrated that the two-stream architecture is insufficient for video disentanglement because static features frequently contain dynamic features. Additionally, we found that dynamic features are not discriminative in the latent space. To address these problems, we introduced an adversarial classifier using supervised learning into the two-stream architecture. The strong inductive bias through supervision separates dynamic features from static features and yields discriminative representations of the dynamic features. Through a comparison with other sequential variational autoencoders, we qualitatively and quantitatively demonstrate the effectiveness of the proposed method on the Sprites and MUG datasets.
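As an illustration of the design described in the abstract, the following is a minimal PyTorch-style sketch of a two-stream sequential VAE: the static stream infers one latent per video, the dynamic stream infers one latent per frame, an adversarial classifier tries to read the action label out of the static latent (and the encoder is trained to fool it), and a supervised classifier makes the dynamic latents discriminative. All module names, layer sizes, the feature-vector input format, and the entropy-style adversarial term below are assumptions for illustration, not the authors' implementation.

```python
# Illustrative sketch only; videos are assumed to be given as per-frame
# feature vectors of shape (B, T, D), not raw images.
import torch
import torch.nn as nn

class TwoStreamSeqVAE(nn.Module):
    def __init__(self, d_in=256, d_static=64, d_dynamic=32, n_actions=9):
        super().__init__()
        # Static stream: summarize the whole video, infer one latent.
        self.static_enc = nn.GRU(d_in, 128, batch_first=True)
        self.static_mu = nn.Linear(128, d_static)
        self.static_logvar = nn.Linear(128, d_static)
        # Dynamic stream: recurrent encoder, one latent per frame.
        self.dyn_enc = nn.GRU(d_in, 128, batch_first=True)
        self.dyn_mu = nn.Linear(128, d_dynamic)
        self.dyn_logvar = nn.Linear(128, d_dynamic)
        # Decoder reconstructs each frame from [static ; dynamic_t].
        self.dec = nn.Sequential(
            nn.Linear(d_static + d_dynamic, 128), nn.ReLU(),
            nn.Linear(128, d_in))
        # Adversarial classifier on the static latent (to be fooled).
        self.adv_cls = nn.Linear(d_static, n_actions)
        # Supervised classifier on the dynamic latents.
        self.dyn_cls = nn.Linear(d_dynamic, n_actions)

    @staticmethod
    def reparam(mu, logvar):
        # Standard VAE reparameterization trick.
        return mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)

    def forward(self, x):                       # x: (B, T, D)
        B, T, _ = x.shape
        h_s, _ = self.static_enc(x)             # (B, T, 128)
        h_s = h_s[:, -1]                        # last state summarizes video
        z_s = self.reparam(self.static_mu(h_s), self.static_logvar(h_s))
        h_d, _ = self.dyn_enc(x)                # (B, T, 128)
        z_d = self.reparam(self.dyn_mu(h_d), self.dyn_logvar(h_d))
        z = torch.cat([z_s.unsqueeze(1).expand(-1, T, -1), z_d], dim=-1)
        return self.dec(z), z_s, z_d

# Usage sketch (loss terms only; KL terms omitted for brevity).
model = TwoStreamSeqVAE()
x = torch.randn(4, 16, 256)                     # 4 videos, 16 frames
y = torch.randint(0, 9, (4,))                   # action labels
x_rec, z_s, z_d = model(x)
rec = nn.functional.mse_loss(x_rec, x)
# Supervision: dynamic latents should predict the action label.
sup = nn.functional.cross_entropy(model.dyn_cls(z_d.mean(1)), y)
# Adversary: classifier trains on detached static latents ...
cls_loss = nn.functional.cross_entropy(model.adv_cls(z_s.detach()), y)
# ... while the encoder is pushed toward a uniform (uninformative)
# prediction, stripping dynamic information from the static latent.
logp = nn.functional.log_softmax(model.adv_cls(z_s), dim=-1)
adv = -logp.mean()
```

One design point worth noting: detaching the static latent in the classifier loss while applying the uniformity term to the non-detached logits is one common way to realize the adversarial game without a gradient-reversal layer; the paper may use a different formulation.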
Database:
Directory of Open Access Journals
External link: