Attention-enhanced multimodal feature fusion network for clothes-changing person re-identification

Authors: Yongkang Ding, Jiechen Li, Hao Wang, Ziang Liu, Anqi Wang
Language: English
Year of publication: 2024
Source: Complex & Intelligent Systems, Vol 11, Iss 1, Pp 1-15 (2024)
Document type: article
ISSN: 2199-4536, 2198-6053
DOI: 10.1007/s40747-024-01646-2
Description: Abstract: Clothes-Changing Person Re-Identification is a challenging problem in computer vision, primarily due to the appearance variations caused by clothing changes across different camera views. These variations pose significant challenges to traditional person re-identification techniques that rely on clothing features, including the inconsistency of clothing and the difficulty of learning reliable clothing-irrelevant local features. To address this issue, we propose a novel network architecture called the Attention-Enhanced Multimodal Feature Fusion Network (AE-Net). AE-Net effectively mitigates the impact of clothing changes on recognition accuracy by integrating RGB global features, grayscale image features, and clothing-irrelevant features obtained through semantic segmentation. Specifically, global features capture the overall appearance of the person; grayscale image features help eliminate the interference of color in recognition; and clothing-irrelevant features derived from semantic segmentation force the model to learn features independent of the person’s clothing. Additionally, we introduce a multi-scale fusion attention mechanism that further enhances the model’s ability to capture both detailed and global structures, thereby improving recognition accuracy and robustness. Extensive experimental results demonstrate that AE-Net outperforms several state-of-the-art methods on the PRCC and LTCC datasets, particularly in scenarios with significant clothing changes. On the PRCC and LTCC datasets, AE-Net achieves Top-1 accuracy rates of 60.4% and 42.9%, respectively.
Database: Directory of Open Access Journals
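
The abstract describes a three-stream design (RGB global features, grayscale features, and segmentation-derived clothing-irrelevant features) combined through attention-based fusion. Below is a minimal PyTorch sketch of what such a fusion step could look like; the module name MultimodalFusion, the feature dimension, the per-stream linear projections, and the softmax attention form are illustrative assumptions, not the authors' AE-Net implementation.

```python
# Hypothetical sketch of attention-weighted fusion of three feature
# streams, as described in the abstract. All names and dimensions are
# assumptions for illustration, not the paper's actual AE-Net code.
import torch
import torch.nn as nn

class MultimodalFusion(nn.Module):
    def __init__(self, dim: int = 512):
        super().__init__()
        # One learned projection per modality: RGB global, grayscale,
        # and clothing-irrelevant (semantic-segmentation-based) features.
        self.proj = nn.ModuleList([nn.Linear(dim, dim) for _ in range(3)])
        # Scores each projected modality feature for attention weighting.
        self.attn = nn.Linear(dim, 1)

    def forward(self, rgb: torch.Tensor, gray: torch.Tensor,
                cloth_free: torch.Tensor) -> torch.Tensor:
        # Stack the three modality embeddings: (batch, 3, dim).
        feats = torch.stack(
            [p(x) for p, x in zip(self.proj, (rgb, gray, cloth_free))],
            dim=1)
        # Softmax over the modality axis yields per-sample fusion weights.
        weights = torch.softmax(self.attn(feats), dim=1)  # (batch, 3, 1)
        # Weighted sum produces one fused identity embedding per sample.
        return (weights * feats).sum(dim=1)               # (batch, dim)

# Usage: fuse per-image embeddings from three (assumed) backbones.
fusion = MultimodalFusion(dim=512)
rgb, gray, seg = (torch.randn(4, 512) for _ in range(3))
embedding = fusion(rgb, gray, seg)  # shape: (4, 512)
```

The softmax over the modality axis lets the network downweight color-dependent streams when clothing changes make them unreliable, which is one plausible way to realize the fusion behavior the abstract attributes to AE-Net.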