Differentiable samplers for deep latent variable models.

Autor: Doucet, Arnaud, Moulines, Eric, Thin, Achille
Předmět:
Zdroj: Philosophical Transactions of the Royal Society A: Mathematical, Physical & Engineering Sciences; 5/15/2023, Vol. 381 Issue 2247, p1-13, 13p
Abstrakt: Latent variable models are a popular class of models in statistics. Combined with neural networks to improve their expressivity, the resulting deep latent variable models have also found numerous applications in machine learning. A drawback of these models is that their likelihood function is intractable so approximations have to be carried out to perform inference. A standard approach consists of maximizing instead an evidence lower bound (ELBO) obtained based on a variational approximation of the posterior distribution of the latent variables. The standard ELBO can, however, be a very loose bound if the variational family is not rich enough. A generic strategy to tighten such bounds is to rely on an unbiased low-variance Monte Carlo estimate of the evidence. We review here some recent importance sampling, Markov chain Monte Carlo and sequential Monte Carlo strategies that have been proposed to achieve this. This article is part of the theme issue 'Bayesian inference: challenges, perspectives, and prospects'. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index