Výsledky vyhledávání - "Kadhim, Hashiam"

Report

NWT: Towards natural audio-to-video generation with representation learning

Autor: Mama, Rayhane, Tyndel, Marc S., Kadhim, Hashiam, Clifford, Cole, Thurairatnam, Ragavan

In this work we introduce NWT, an expressive speech-to-video model. Unlike approaches that use domain-specific intermediate representations such as pose keypoints, NWT learns its own latent representations, with minimal assumptions about the audio an

Externí odkaz: http://arxiv.org/abs/2106.04283

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání