Výsledky vyhledávání - "Ruoxuan Feng"

MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning

Autor: Ruize Xu, Ruoxuan Feng, Shi-Xiong Zhang, Di Hu

Audio-visual learning helps to comprehensively understand the world by fusing practical information from multiple modalities. However, recent studies show that the imbalanced optimization of uni-modal encoders in a joint-learning model is a bottlenec

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6d187a75f9f56aa4e0bac36b64e7e7b1

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání