Literature Review of Cross-Modal Retrieval Research

Authors: CHEN Ning, DUAN Youxiang, SUN Qifeng
Language: Chinese
Publication year: 2021
Subject:
Source: Jisuanji kexue yu tansuo, Vol 15, Iss 8, Pp 1390-1404 (2021)
Document type: article
ISSN: 1673-9418
DOI: 10.3778/j.issn.1673-9418.2101092
Description: With the rapid development of Internet technology and the spread of smart devices, the volume of multimedia data is exploding and its forms are becoming increasingly diverse. Single-modal data retrieval no longer satisfies people's demand for information, and realizing cross-modal retrieval through knowledge collaboration across different modalities has become a research hotspot in recent years. Building on an in-depth analysis of the research background and progress of cross-modal retrieval, and taking common subspace modeling, the key technology of cross-modal retrieval, as the main thread, this paper analyzes three types of cross-modal retrieval methods: traditional statistical analysis, deep learning, and hash learning. It compares these methods in terms of research content, key technology, limitations, applicability, and characteristics, and reports experiments for a more in-depth comparison. Finally, it discusses the open difficulties in cross-modal retrieval, future research directions, mainstream design ideas, and development trends of recent years, to provide a theoretical basis for further research.
Database: Directory of Open Access Journals