Three Dimensional Information Extraction and Applications to Video Analysis
Autor: | Xiuwen Liu, Arturo Donate |
---|---|
Rok vydání: | 2010 |
Předmět: |
business.industry
Computer science ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION Context (language use) Simultaneous localization and mapping computer.software_genre TRECVID Video compression picture types Activity recognition Information extraction Video tracking Structure from motion Computer vision Artificial intelligence business computer |
Zdroj: | Video Search and Mining ISBN: 9783642128998 Video Search and Mining |
DOI: | 10.1007/978-3-642-12900-1_4 |
Popis: | This chapter explores the idea of extracting three dimensional features from a video, and using such features to aid various video analysis and mining tasks. The use of 3D information in video analysis is scarce in the literature due to the inherent difficulties of such a system. When the only input to the system is a video stream with no previous knowledge of the scene or camera (a typical scenario in video analysis), computing an accurate 3D representation becomes a difficult task; however, several recently proposed methods can be applied to solving the problem efficiently, including simultaneous localization and mapping, structure from motion, and 3D reconstruction. These methods are surveyed and presented in the context of video analysis and demonstrated using videos from TRECVID 2005; their limitations are also discussed. Once an accurate 3D representation of a video is obtained, it can be used to increase the performance and accuracy of existing systems for various video analysis and mining tasks. Advantages of utilizing 3D representation are illustrated using several of these tasks, including shot boundary detection, object recognition, content-based video retrieval, as well as human activity recognition. The chapter concludes with a discussion on limitations of existing 3D methods and future research directions. |
Databáze: | OpenAIRE |
Externí odkaz: |