In-DBMS Sampling-based Sub-trajectory Clustering

Autor: Nikos Pelekis, Panagiotis Tampakis, Marios Vodas, Costas Panagiotakis, Yannis Theodoridis
Rok vydání: 2017
Předmět:
Popis: In this paper, we propose an efficient in-DBMS solution for the problem of sub-trajectory clustering and outlier detection in large moving object datasets. The method relies on a two-phase process: a voting-and-segmentation phase that segments trajectories according to a local density criterion and trajectory similarity criteria, followed by a sampling-and-clustering phase that selects the most representative sub-trajectories to be used as seeds for the clustering process. Our proposal, called S 2 T-Clustering (for Sampling-based Sub-Trajectory Clustering) is novel since it is the first, to our knowledge, that addresses the pure spatiotemporal sub-trajectory clustering and outlier detection problem in a real-world setting (by ‘pure’ we mean that the entire spatiotemporal information of trajectories is taken into consideration). Moreover, our proposal can be efficiently registered as a database query operator in the context of extensible DBMS (namely, PostgreSQL in our current implementation). The effectiveness and the efficiency of the proposed algorithm are experimentally validated over synthetic and real-world trajectory datasets, demonstrating that S 2 T-Clustering outperforms an off-the-shelf in-DBMS solution using PostGIS by several orders of magnitude.
Databáze: OpenAIRE