bitSPADE: A Lattice-Based Sequential Pattern Mining Algorithm Using Bitmap Representation

Autor: Aomar Osmani, Emmanuel Viennet, Sujeevan Aseervatham
Přispěvatelé: Aseervatham, Sujeevan, Laboratoire d'Informatique de Paris-Nord (LIPN), Université Paris 13 (UP13)-Institut Galilée-Université Sorbonne Paris Cité (USPC)-Centre National de la Recherche Scientifique (CNRS)
Jazyk: angličtina
Rok vydání: 2006
Předmět:
Zdroj: ICDM
Proceedings of the Sixth International Conference on Data Mining (ICDM'06)
International Conference on Data Mining (ICDM'06)
International Conference on Data Mining (ICDM'06), 2006, Hong Kong SAR China. pp.6
HAL
Popis: Sequential pattern mining allows to discover temporal relationship between items within a database. The patterns can then be used to generate association rules. When the databases are very large, the execution speed and the memory usage of the mining algorithm become critical parameters. Previous research has focused on either one of the two parameters. In this paper, we present bitSPADE, a novel algorithm that combines the best features of SPAM, one of the fastest algorithm, and SPADE, one of the most memory efficient algorithm. Moreover, we introduce a new pruning strategy that enables bitSPADE to reach high performances. Experimental evaluations showed that bitSPADE ensures an efficient tradeoff between speed and memory usage by outperforming SPADE by both speed and memory usage factors more than 3.4 and SPAM by a memory consumption factor up to more than an order of magnitude.
Databáze: OpenAIRE