Lightweight multi-stage temporal inference network for video crowd counting

Author:	Wei Gao, Rui Feng, Xiaochun Sheng
Language:	English
Year of publication:	2024
Subject:	crowd counting; crowd density; spatio-temporal dependencies; temporal inference; deep learning; Physics; QC1-999
Source:	Frontiers in Physics, Vol 12 (2024)
Document type:	article
ISSN:	2296-424X
DOI:	10.3389/fphy.2024.1489245
Description:	Crowd density is an important metric for preventing excessive crowding in a particular area, but estimating it remains challenging because of perspective distortion, scale variation, and pedestrian occlusion. Existing studies have attempted to model the spatio-temporal dependencies in videos using LSTMs and 3D CNNs. However, these methods suffer from high computational cost, parameter redundancy, and loss of temporal information, which makes the models hard to converge and limits recognition performance. To address these issues, we propose a lightweight multi-stage temporal inference network (LMSTIN) for video crowd counting. LMSTIN models the spatio-temporal dependencies in video sequences at a fine-grained level, enabling real-time and accurate video crowd counting. The proposed method achieves significant performance improvements on three public crowd counting datasets.
Database:	Directory of Open Access Journals
External link:	https://doaj.org/article/aa95d9318672458dad0edcfec75c6fed