Compressed video ensemble based pseudo-labeling for semi-supervised action recognition

Authors:	Hayato Terao, Wataru Noguchi, Hiroyuki Iizuka, Masahito Yamamoto
Language:	English
Publication year:	2022
Subject:	Action recognition; Compressed video action recognition; Semi-supervised learning; Pseudo labeling; Cybernetics, Q300-390; Electronic computers. Computer science, QA75.5-76.95
Source:	Machine Learning with Applications, Vol 9, Iss , Pp 100336- (2022)
Document type:	article
ISSN:	2666-8270
DOI:	10.1016/j.mlwa.2022.100336
Description:	Some recent studies have focused on deep-learning-based semi-supervised learning for action recognition. However, these methods are difficult to scale up because they take RGB frames as input, and obtaining those frames incurs computational and storage costs. In this paper, we propose a semi-supervised action recognition method that makes training easy to scale up by using features stored in compressed videos. Our method extracts multiple types of input features directly from compressed videos without any decoding and generates artificial labels for unlabeled videos by ensembling the predictions made from these features. In addition to standard supervised training on labeled videos, our models are trained to predict the artificial labels from strongly augmented features of unlabeled compressed videos. We show that our method is more efficient and achieves better classification performance on some widely used datasets than conventional semi-supervised learning methods that use RGB frames.
Database:	Directory of Open Access Journals
External link:	https://doaj.org/article/14092313fd5547c7a8d8f1ebf0581a90
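
The ensemble pseudo-labeling step summarized in the description can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the feature names ("iframe", "motion", "residual"), the use of a plain average of softmax outputs as the ensemble, the confidence threshold of 0.8, and the helper names `ensemble_pseudo_label` and `cross_entropy`. The record does not state how the authors combine predictions or filter labels.

```python
import math
from typing import Dict, List, Optional, Tuple


def softmax(logits: List[float]) -> List[float]:
    """Numerically stable softmax over a list of class logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def ensemble_pseudo_label(
    logits_by_feature: Dict[str, List[float]],
    threshold: float = 0.8,  # assumed value, not from the paper
) -> Tuple[Optional[int], float]:
    """Average per-feature class probabilities and pick an artificial label.

    Returns (label, confidence). The label is None when the averaged
    confidence is below the threshold, so the video is skipped.
    """
    probs = [softmax(logits) for logits in logits_by_feature.values()]
    n_classes = len(probs[0])
    mean = [sum(p[c] for p in probs) / len(probs) for c in range(n_classes)]
    label = max(range(n_classes), key=mean.__getitem__)
    confidence = mean[label]
    return (label if confidence >= threshold else None), confidence


def cross_entropy(logits: List[float], label: int) -> float:
    """Loss of one model's prediction against an artificial label."""
    return -math.log(softmax(logits)[label])


# Predictions on a weakly augmented unlabeled video (hypothetical numbers).
weak = {
    "iframe": [4.0, 0.0, 0.0],
    "motion": [3.0, 0.0, 0.0],
    "residual": [5.0, 0.0, 0.0],
}
label, confidence = ensemble_pseudo_label(weak)

# Each model is then trained on its strongly augmented input against the label.
strong_iframe_logits = [1.0, 0.5, 0.2]  # hypothetical model output
if label is not None:
    loss = cross_entropy(strong_iframe_logits, label)
```

In this sketch, videos on which the feature-specific models disagree receive a low averaged confidence and contribute no unsupervised loss, which is one common way to limit the effect of noisy artificial labels.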