Výsledky vyhledávání

Report

Self-Supervised Contrastive Learning for Videos using Differentiable Local Alignment

Autor: Oei, Keyne, Gomaa, Amr, Feit, Anna Maria, Belo, João

Robust frame-wise embeddings are essential to perform video analysis and understanding tasks. We present a self-supervised method for representation learning based on aligning temporal video sequences. Our framework uses a transformer-based encoder t

Externí odkaz: http://arxiv.org/abs/2409.04607

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání