An algorithm for fast mining top-rank-k frequent patterns based on Node-list data structure

Autor: Darryl N. Davis, Qian Wang, Yongqiang Cheng, Jiadong Ren
Rok vydání: 2018
Předmět:
Zdroj: Intelligent Automation and Soft Computing. 24:399-404
ISSN: 2326-005X
1079-8587
DOI: 10.1080/10798587.2017.1340135
Popis: Frequent pattern mining usually requires much run time and memory usage. In some applications, only the patterns with top frequency rank are needed. Because of the limited pattern numbers, quality of the results is even more important than time and memory consumption. A Frequent Pattern algorithm for mining Top-rank-K patterns, FP_TopK, is proposed. It is based on a Node-list data structure extracted from FTPP-tree. Each node is with one or more triple sets, which contain supports, preorder and post-order transversal orders for candidate pattern generation and top-rank-k frequent pattern mining. FP_TopK uses the minimal support threshold for pruning strategy to guarantee that each pattern in the top-rank-k table is really frequent and this further improves the efficiency. Experiments are conducted to compare FP_TopK with iNTK and BTK on four datasets. The results show that FP_TopK achieves better performance.
Databáze: OpenAIRE