Comparative Performance of Interestingness Measures to Identify Redundant and Non-informative Rules from Web Usage Data

Autor: Riya Singhal, Vijay Kandal, Dilip Singh Sisodia
Rok vydání: 2018
Předmět:
Zdroj: International Journal of Technology, Vol 9, Iss 1, Pp 201-211 (2018)
ISSN: 2087-2100
2086-9614
DOI: 10.14716/ijtech.v9i1.1510
Popis: Association rules are used to predict frequent web user behaviors from web usage data. These rules are formed using frequent items. The number of association rules increases as the number of frequent items increases and produces several redundant and non-informative rules. In this paper, five interestingness measures, including cosine, lift, leverage, confidence, and conviction with a constant value of support are compared based on the number of redundant and non-informative rules that they produce. Redundant and non-informative rules are a subset of rules present in the top generated rules. The experimental results suggested that leverage produced the least number of redundant rules in the top rules but also produced the least informative rules among all measures. Lift showed the highest number of redundant rules but the most informative rules among all the measures.
Databáze: OpenAIRE