Simpler is better: Lifting interpretability-performance trade-off via automated feature engineering

Autor:	Przemyslaw Biecek, Alicja Gosiewska, Anna Kozak
Rok vydání:	2021
Předmět:	Feature engineering Information Systems and Management Supervisor business.industry Computer science Transparency (human–computer interaction) Machine learning computer.software_genre Abstract machine Management Information Systems Tree (data structure) Arts and Humanities (miscellaneous) Developmental and Educational Psychology Benchmark (computing) Artificial intelligence White box business computer Information Systems Interpretability
Zdroj:	Decision Support Systems. 150:113556
ISSN:	0167-9236
DOI:	10.1016/j.dss.2021.113556
Popis:	Machine learning has proved to generate useful predictive models that can and should support decision makers in many areas. The availability of tools for AutoML makes it possible to quickly create an effective but complex predictive model. However, the complexity of such models is often a major obstacle in applications, especially in terms of high-stake decisions. We are experiencing a growing number of examples where the use of black boxes leads to decisions that are harmful, unfair or simply wrong. In this paper, we show that very often we can simplify complex models without compromising their performance; however, with the benefit of much needed transparency. We propose a framework that uses elastic black boxes as supervisor models to create simpler, less opaque, yet still accurate and interpretable glass box models. The new models were created using newly engineered features extracted with the help of a supervisor model. We supply the analysis using a large-scale benchmark on several tabular data sets from the OpenML database. There are tree main results of this paper: 1) we show that extracting information from complex models may improve the performance of simpler models, 2) we question a common myth that complex predictive models outperform simpler predictive models, 3) we present a real-life application of the proposed method.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::138323a2533f3a6d61837b39155da535 https://doi.org/10.1016/j.dss.2021.113556 Zobrazit plný text záznamu Full Text from ScienceDirect