SVAS: Surveillance Video Analysis System
Autor: | Nihan Kesim Cicekli, Karani Kardas |
---|---|
Rok vydání: | 2017 |
Předmět: |
Event (computing)
business.industry Computer science Event tree analysis General Engineering Complex event processing Inference 020207 software engineering 02 engineering and technology Machine learning computer.software_genre Computer Science Applications Artificial Intelligence 0202 electrical engineering electronic engineering information engineering Feature (machine learning) 020201 artificial intelligence & image processing Artificial intelligence Data mining Markov logic network business computer Semantic gap |
Zdroj: | Expert Systems with Applications. 89:343-361 |
ISSN: | 0957-4174 |
DOI: | 10.1016/j.eswa.2017.07.051 |
Popis: | This paper introduces a Surveillance Video Analysis System, called SVAS, for surveillance domain, in which the semantic rules and the definition of event models can be learned or defined by the user for automatic detection and inference of complex video events. In the scope of SVAS, an event model method named Interval-Based Spatio-Temporal Model (IBSTM) is proposed. SVAS can learn action models and event models without any predefined threshold values and generates understandable and manageable IBSTM event models. Hybrid machine learning methods are proposed and used. A set of feature models named Threshold Model, which reflects the spatio-temporal motion analysis of an event, is kept as the first model. As the second model, Bag of Actions (BoA) model is used in order to reduce the search space in the detection phase. Markov Logic Network (MLN) model, which provides understandable and manageable logic predicates for users, is kept as the third model. SVAS has high performance event detection capability due to its interval-based hierarchical manner. It determines related candidate intervals for each main model of IBSTM and uses the related main model when needed rather than using all models as a whole. The main contribution of this study is to fill the semantic gap between humans and video computer systems such that, on the one hand it decreases human intervention through its learning capabilities, but on the other hand it also enables human intervention when necessary through its manageable event model method. The study achieves all of them in the most efficient way through its machine learning methods. The proposed system is applied to different event datasets from CAVIAR, BEHAVE and our synthetic datasets. The experimental results show that our approach improves the event recognition performance and precision as compared to the current state-of-the-art approaches. |
Databáze: | OpenAIRE |
Externí odkaz: |