A novel feature fusion based framework for efficient shot indexing to massive web videos
Author: | Yuan Dong, Lezi Wang, Shiguo Lian, Wei Liu, Shusheng Cen |
---|---|
Publication year: | 2014 |
Subject: | Structure (mathematical logic); Computer science; Shot (filmmaking); Search engine indexing; Hash function; Pattern recognition; Image (mathematics); Dynamic programming; Feature (computer vision); Data mining; Artificial intelligence; Electrical and Electronic Engineering; Scale (map) |
Source: | Telecommunication Systems. 59:401-413 |
ISSN: | 1572-9451 1018-4864 |
DOI: | 10.1007/s11235-014-9945-9 |
Description: | This study presents an automatic approach to analyzing the structure of large-scale web videos based on visual and acoustic information. In our approach, video streams are macro-segmented by mining duplicate sequences. Both acoustic and visual information are used for mining so as to avoid missing true positives. Unlike TV data, where duplicate clips are quite similar, web videos contain severe visual and acoustic distortions. We therefore present novel visual-acoustic feature schemes to handle these distortions. A shot-based indexing algorithm and several temporal constraints are presented to mine the duplicate sequences: weak geometric verification is combined with direct hashing to achieve high efficiency and superior performance in image-based duplicate sequence detection, and dynamic programming is introduced to recall missed true positives in the audio-based part. Experiments on a dataset of 500 h of content-unknown videos show that the F-measure of duplicate sequence mining for web videos reaches 95% and that, in terms of both efficiency and detection performance, the proposed algorithm outperforms state-of-the-art approaches. |
Database: | OpenAIRE |