Visual exploration of an ensemble of classifiers
Autor: | Hélio Lopes, Simone Diniz Junqueira Barbosa, Paula Ceccon Ribeiro, Clarisse Sieckenius de Souza, Guilherme G. Schardong |
---|---|
Rok vydání: | 2019 |
Předmět: |
Computer science
business.industry Dimensionality reduction media_common.quotation_subject General Engineering 020207 software engineering 02 engineering and technology Machine learning computer.software_genre Computer Graphics and Computer-Aided Design Human-Computer Interaction Data set Statistical classification ComputingMethodologies_PATTERNRECOGNITION Voting 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Artificial intelligence business Classifier (UML) computer MNIST database media_common |
Zdroj: | Computers & Graphics. 85:23-41 |
ISSN: | 0097-8493 |
DOI: | 10.1016/j.cag.2019.08.012 |
Popis: | Inspecting the outputs of classification algorithms is becoming progressively difficult due to the increase in both scale and complexity of both the data and the algorithms. This has led to research efforts to develop new techniques to interpret the behavior of these algorithms and to facilitate the understanding of their results. A common classification approach is the “ensemble of classifiers”, where a set of classifiers c ∈ C is trained on the input data set and the final classification is computed by “voting”, i.e., ranking their results. One of the issues with this approach, however, is that instead of having only one classifier to analyze, now there are |C|, each with its characteristics. Thus, there is a demand for methods that provide insights into the results of an ensemble of classifiers and at the same time allow a detailed analysis of each classifier in the ensemble. Our work proposes to draw on dimensionality reduction techniques to provide visual tools to interpret the results of an ensemble of classifiers, while also giving insights into how each classifier contributes to the final results. Our approach also presents a measure of classification uncertainty by highlighting regions where there is a divergence among the classifiers in the ensemble, allowing one to focus their analysis on these regions. We tested our approach using the Digits MNIST and Fashion MNIST data sets. Through the use of maps that provide an overview of a classifier behavior to instance-based visualizations, we show how our approach can assist in the interpretation of why a specific decision (classification) was made. |
Databáze: | OpenAIRE |
Externí odkaz: |