Výsledky vyhledávání - "Cross-modal Retrieval"

Akademický článek

A method for image–text matching based on semantic filtering and adaptive adjustment

Autor: Ran Jin, Tengda Hou, Tao Jin, Jie Yuan, Chenjie Du

Publikováno v: EURASIP Journal on Image and Video Processing, Vol 2024, Iss 1, Pp 1-17 (2024)

Abstract As image–text matching (a critical task in the field of computer vision) links cross-modal data, it has captured extensive attention. Most of the existing methods intended for matching images and texts explore the local similarity levels b

Externí odkaz: https://doaj.org/article/4f0f584438e34f98a7c74827eeb1443c

Zobrazit plný text záznamu

Akademický článek

Enhancing visual representation for text-based person searching

Autor: Shen, Wei ^a, Fang, Ming ^a, Wang, Yuxia ^a, Xiao, Jiafeng ^a, Li, Diping ^a, Chen, Huangqun ^a, Xu, Ling ^{b, c}, Zhang, Weifeng ^{d, ⁎}

Publikováno v: In Knowledge-Based Systems 30 January 2025 309

Zobrazit plný text záznamu

Akademický článek

Semi-supervised cross-modal hashing with joint hyperboloid mapping

Autor: Fu, Hao ^{a, b}, Gu, Guanghua ^{a, b, ⁎}, Dou, Yiyang ^{a, b}, Li, Zhuoyi ^{a, b}, Zhao, Yao ^c

Publikováno v: In Knowledge-Based Systems 25 November 2024 304

Zobrazit plný text záznamu

Akademický článek

Bridging asymmetry between image and video: Cross-modality knowledge transfer based on learning from video

Autor: Zhou, Bingxin ¹, Zhou, Jianghao ¹, Chen, Zhongming, Li, Ziqiang, Deng, Long, Ge, Yongxin ^⁎

Publikováno v: In Expert Systems With Applications 10 March 2025 264

Zobrazit plný text záznamu

Akademický článek

Image–Text Matching Model Based on CLIP Bimodal Encoding

Autor: Yihuan Zhu, Honghua Xu, Ailin Du, Bin Wang

Publikováno v: Applied Sciences, Vol 14, Iss 22, p 10384 (2024)

Image–text matching is a fundamental task in the multimodal research field, connecting computer vision and natural language processing by aligning visual content with corresponding textual descriptions. Accurate matching is critical for application

Externí odkaz: https://doaj.org/article/7fb041f55248462bac0445e78b9f6b32

Zobrazit plný text záznamu

Akademický článek

Cross-modal retrieval based on multi-dimensional feature fusion hashing

Autor: Dongxiao Ren, Weihua Xu

Publikováno v: Frontiers in Physics, Vol 12 (2024)

Along with the continuous breakthrough and popularization of information network technology, multi-modal data, including texts, images, videos, and audio, is growing rapidly. We can retrieve different modal data to meet our needs, so cross-modal retr

Externí odkaz: https://doaj.org/article/f8b029655799436e8add9cbba472e068

Zobrazit plný text záznamu

Akademický článek

ClusterE-ZSL: A Novel Cluster-Based Embedding for Enhanced Zero-Shot Learning in Contrastive Pre-Training Cross-Modal Retrieval

Autor: Umair Tariq, Zonghai Hu, Khawaja Tauseef Tasneem, Md Belal Bin Heyat, Muhammad Shahid Iqbal, Kamran Aziz

Publikováno v: IEEE Access, Vol 12, Pp 162622-162637 (2024)

Zero-shot learning (ZSL) in a multi-model environment presents significant challenges and opportunities for improving cross-modal retrieval and object detection in unseen data. This study introduced a novel embedding approach of vector space clusteri

Externí odkaz: https://doaj.org/article/1b58d658c93a4699904a429fc1c54d07

Zobrazit plný text záznamu

Akademický článek

Deep Feature-Based Neighbor Similarity Hashing With Adversarial Learning for Cross-Modal Retrieval

Autor: Kun Li, Yonghui Zhang, Feng Wang, Guoxu Liu, Xianmin Wei

Publikováno v: IEEE Access, Vol 12, Pp 128559-128569 (2024)

Currently, deep hashing methods for cross-modal retrieval have achieved significant performance. However, label-based pairwise semantic keep correspondence within bounds of tags, while overlooking the connection between the essence of content. To sol

Externí odkaz: https://doaj.org/article/3e88c443fb334ff1b569f2a84a739162

Zobrazit plný text záznamu

Akademický článek

An Intelligent Retrieval Method for Audio and Video Content: Deep Learning Technology Based on Artificial Intelligence

Autor: Maojin Sun

Publikováno v: IEEE Access, Vol 12, Pp 123430-123446 (2024)

To address the challenges of efficient intelligent retrieval and cross-modal analysis brought by the surge in audio-video data, this study proposes an intelligent retrieval method for audio-video content based on deep learning techniques, aimed at im

Externí odkaz: https://doaj.org/article/33aa202a7552444fbdfb4a54622c8d5a

Zobrazit plný text záznamu

Akademický článek

Cross-Modal Retrieval: A Review of Methodologies, Datasets, and Future Perspectives

Autor: Zhichao Han, Azreen Bin Azman, Mas Rina Binti Mustaffa, Fatimah Binti Khalid

Publikováno v: IEEE Access, Vol 12, Pp 115716-115741 (2024)

With the rapid development of science and technology, all types of mixed media contain large amounts of data. Traditional single multimedia data can no longer satisfy daily requirements. Therefore, the cross-modal retrieval technology has become an u

Externí odkaz: https://doaj.org/article/be623345779a4841b9062b1c960e0a3b

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání