Zobrazeno 1 - 2
of 2
pro vyhledávání: '"Mama, Rayhane"'
Hierarchical VAEs have emerged in recent years as a reliable option for maximum likelihood estimation. However, instability issues and demanding computational requirements have hindered research progress in the area. We present simple modifications t
Externí odkaz:
http://arxiv.org/abs/2203.13751
In this work we introduce NWT, an expressive speech-to-video model. Unlike approaches that use domain-specific intermediate representations such as pose keypoints, NWT learns its own latent representations, with minimal assumptions about the audio an
Externí odkaz:
http://arxiv.org/abs/2106.04283