The application and effectiveness of a multi-objective metaheuristic algorithm for partial classification
Autor: | Victor J. Rayward-Smith, Graeme Richards, M.S. Philpott, B. de la Iglesia |
---|---|
Rok vydání: | 2006 |
Předmět: |
Information Systems and Management
General Computer Science Association rule learning business.industry Computer science Management Science and Operations Research computer.software_genre Machine learning Industrial and Manufacturing Engineering Field (computer science) Task (project management) Set (abstract data type) Information extraction Knowledge extraction Modeling and Simulation Artificial intelligence Data mining business Metaheuristic computer Algorithm |
Zdroj: | European Journal of Operational Research. 169:898-917 |
ISSN: | 0377-2217 |
DOI: | 10.1016/j.ejor.2004.08.025 |
Popis: | In this paper, we present an application of multi-objective metaheuristics to the field of data mining. We introduce the data mining task of nugget discovery (also known as partial classification) and show how the multi-objective metaheuristic algorithm NSGA II can be modified to solve this problem. We also present an alternative algorithm for the same task, the ARAC algorithm, which can find all rules that are best according to some measures of interest subject to certain constraints. The ARAC algorithm provides an excellent basis for comparison with the results of the multi-objective metaheuristic algorithm as it can deliver the Pareto optimal front consisting of all partial classification rules that lie in the upper confidence/coverage border, for databases of limited size. We present the results of experiments with various well-known databases for both algorithms. We also discuss how the two methods can be used complementarily for large databases to deliver a set of best rules according to some predefined criteria, providing a powerful tool for knowledge discovery in databases. |
Databáze: | OpenAIRE |
Externí odkaz: |