Listen as you wish: Fusion of audio and text for cross-modal event detection in smart cities
Autor: | Tang, Haoyu, Hu, Yupeng, Wang, Yunxiao, Zhang, Shuaike, Xu, Mingzhu, Zhu, Jihua, Zheng, Qinghai |
---|---|
Zdroj: | In Information Fusion October 2024 110 |
Databáze: | ScienceDirect |
Externí odkaz: |