Opening black box data mining models using sensitivity analysis

Autor: Cortez, Paulo, Embrechts, Mark
Přispěvatelé: Universidade do Minho
Jazyk: angličtina
Rok vydání: 2011
Předmět:
Popis: There are several supervised learning Data Mining (DM) methods, such as Neural Networks (NN), Support Vector Machines (SVM) and ensembles, that often attain high quality predictions, although the obtained models are difficult to inter- pret by humans. In this paper, we open these black box DM models by using a novel visualization approach that is based on a Sensitivity Analysis (SA) method. In particular, we propose a Global SA (GSA), which extends the applicability of previous SA methods (e.g. to classification tasks), and several visualization techniques (e.g. variable effect characteristic curve), for assessing input relevance and effects on the model’s responses. We show the GSA capabilities by conducting several experiments, using a NN ensemble and SVM model, in both synthetic and real-world datasets.
(undefined)
Databáze: OpenAIRE