Discovering Interpretable Latent Space Directions of GANs Beyond Binary Attributes

Autor:	Liangyu Chai, Shengfeng He, Huiting Yang, Shuang Zhao, Qiang Wen, Zixun Sun
Rok vydání:	2021
Předmět:	business.industry Computer science Space (commercial competition) Machine learning computer.software_genre Semantics Image (mathematics) Constraint (information theory) Pattern recognition (psychology) Code (cryptography) Artificial intelligence Noise (video) business Representation (mathematics) computer
Zdroj:	CVPR
DOI:	10.1109/cvpr46437.2021.01200
Popis:	Generative adversarial networks (GANs) learn to map noise latent vectors to high-fidelity image outputs. It is found that the input latent space shows semantic correlations with the output image space. Recent works aim to interpret the latent space and discover meaningful directions that correspond to human interpretable image transformations. However, these methods either rely on explicit scores of attributes (e.g., memorability) or are restricted to binary ones (e.g., gender), which largely limits the applicability of editing tasks, especially for free-form artistic tasks like style/anime editing. In this paper, we propose an adversarial method, AdvStyle, for discovering interpretable directions in the absence of well-labeled scores or binary attributes. In particular, the proposed adversarial method simultaneously optimizes the discovered directions and the attribute assessor using the target attribute data as positive samples, while the generated ones being negative. In this way, arbitrary attributes can be edited by collecting positive data only, and the proposed method learns a controllable representation enabling manipulation of non-binary attributes like anime styles and facial characteristics. Moreover, the proposed learning strategy attenuates the entanglement between attributes, such that multi-attribute manipulation can be easily achieved without any additional constraint. Furthermore, we reveal several interesting semantics with the involuntarily learned negative directions. Extensive experiments on 9 anime attributes and 7 human attributes demonstrate the effectiveness of our adversarial approach qualitatively and quantitatively. Code is available at https://github.com/BERYLSHEEP/AdvStyle.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::d0fbf7186627e5501e0b3845aabec1ca https://doi.org/10.1109/cvpr46437.2021.01200 Zobrazit plný text záznamu