Dynamic Summarization of Videos Based on Descriptors in Space-Time Video Volumes and Sparse Autoencoder

Authors: Jesna Mohan, Madhu S. Nair
Language: English
Publication year: 2018
Subject:
Source: IEEE Access, Vol 6, Pp 59768-59778 (2018)
Document type: article
ISSN: 2169-3536
DOI: 10.1109/ACCESS.2018.2872685
Description: This paper addresses the problem of generating meaningful summaries from unedited user videos. A framework based on spatiotemporal and high-level features is proposed to detect the key shots after segmenting the videos into shots based on motion magnitude. To encode the time-varying characteristics of a video, we explore the local phase quantization feature descriptor from three orthogonal planes (LPQ-TOP). The sparse autoencoder (SAE), an instance of a deep learning strategy, is used to extract high-level features from the LPQ-TOP descriptors so that the shots carrying the key content of a video are represented efficiently. The Chebyshev distance between the feature vectors of consecutive shots is calculated and thresholded, with the mean of the distance scores as the threshold value. The subset of shots with a distance score greater than the threshold value is used to generate the video summary. The method is evaluated on the SumMe dataset. The summaries thus generated are of better quality than those produced by other state-of-the-art techniques. The effectiveness of the method is further evaluated by comparing the generated summaries with the human-created summaries in the ground truth.
Database: Directory of Open Access Journals
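
The key-shot selection step in the description (Chebyshev distance between consecutive shot feature vectors, thresholded at the mean distance) can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the toy feature vectors, and the choice to credit each distance score to the later shot of the pair are all assumptions made for illustration; in the paper the feature vectors would come from the SAE applied to LPQ-TOP descriptors, which is not reproduced here.

```python
from statistics import mean
from typing import List, Sequence


def chebyshev_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the largest absolute coordinate-wise difference between a and b."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    return max(abs(x - y) for x, y in zip(a, b))


def select_key_shots(shot_features: Sequence[Sequence[float]]) -> List[int]:
    """Return indices of shots whose distance score exceeds the mean score.

    shot_features holds one feature vector per shot, in temporal order.
    """
    if len(shot_features) < 2:
        return []  # no consecutive pair, so no distance scores

    # One distance score per pair of consecutive shots (i, i + 1).
    distances = [
        chebyshev_distance(shot_features[i], shot_features[i + 1])
        for i in range(len(shot_features) - 1)
    ]

    # The mean of the distance scores serves as the threshold.
    threshold = mean(distances)

    # Assumption: the score for the pair (i, i + 1) is credited to shot i + 1.
    return [i + 1 for i, d in enumerate(distances) if d > threshold]


# Hypothetical 2-D feature vectors for four shots (placeholders, not real data).
toy_features = [[0.0, 0.0], [0.1, 0.0], [0.9, 0.5], [1.0, 0.5]]
print(select_key_shots(toy_features))  # [2]
```

In the toy input only the transition into the third shot produces a distance above the mean, so only index 2 is kept. How a distance score is mapped to a shot is not stated in the description, so that mapping should be treated as a placeholder.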