Showing 1 - 10 of 1,765 for search: '"Cross-modal Retrieval"'
Published in:
EURASIP Journal on Image and Video Processing, Vol 2024, Iss 1, Pp 1-17 (2024)
As image–text matching (a critical task in the field of computer vision) links cross-modal data, it has captured extensive attention. Most of the existing methods intended for matching images and texts explore the local similarity levels b
External link:
https://doaj.org/article/4f0f584438e34f98a7c74827eeb1443c
Author:
Shen, Wei; Fang, Ming; Wang, Yuxia; Xiao, Jiafeng; Li, Diping; Chen, Huangqun; Xu, Ling; Zhang, Weifeng
Published in:
Knowledge-Based Systems, Vol 309 (30 January 2025)
Published in:
Knowledge-Based Systems, Vol 304 (25 November 2024)
Published in:
Expert Systems With Applications, Vol 264 (10 March 2025)
Published in:
Applied Sciences, Vol 14, Iss 22, p 10384 (2024)
Image–text matching is a fundamental task in the multimodal research field, connecting computer vision and natural language processing by aligning visual content with corresponding textual descriptions. Accurate matching is critical for application
External link:
https://doaj.org/article/7fb041f55248462bac0445e78b9f6b32
Author:
Dongxiao Ren, Weihua Xu
Published in:
Frontiers in Physics, Vol 12 (2024)
Along with the continuous breakthrough and popularization of information network technology, multi-modal data, including texts, images, videos, and audio, is growing rapidly. We can retrieve different modal data to meet our needs, so cross-modal retr
External link:
https://doaj.org/article/f8b029655799436e8add9cbba472e068
Author:
Umair Tariq, Zonghai Hu, Khawaja Tauseef Tasneem, Md Belal Bin Heyat, Muhammad Shahid Iqbal, Kamran Aziz
Published in:
IEEE Access, Vol 12, Pp 162622-162637 (2024)
Zero-shot learning (ZSL) in a multi-model environment presents significant challenges and opportunities for improving cross-modal retrieval and object detection in unseen data. This study introduced a novel embedding approach of vector space clusteri
External link:
https://doaj.org/article/1b58d658c93a4699904a429fc1c54d07
Published in:
IEEE Access, Vol 12, Pp 128559-128569 (2024)
Currently, deep hashing methods for cross-modal retrieval have achieved significant performance. However, label-based pairwise semantic keep correspondence within bounds of tags, while overlooking the connection between the essence of content. To sol
External link:
https://doaj.org/article/3e88c443fb334ff1b569f2a84a739162
Author:
Maojin Sun
Published in:
IEEE Access, Vol 12, Pp 123430-123446 (2024)
To address the challenges of efficient intelligent retrieval and cross-modal analysis brought by the surge in audio-video data, this study proposes an intelligent retrieval method for audio-video content based on deep learning techniques, aimed at im
External link:
https://doaj.org/article/33aa202a7552444fbdfb4a54622c8d5a
Published in:
IEEE Access, Vol 12, Pp 115716-115741 (2024)
With the rapid development of science and technology, all types of mixed media contain large amounts of data. Traditional single multimedia data can no longer satisfy daily requirements. Therefore, the cross-modal retrieval technology has become an u
External link:
https://doaj.org/article/be623345779a4841b9062b1c960e0a3b