Optimal Interpretability-Performance Trade-off of Classification Trees with Black-Box Reinforcement Learning

Autor:	Kohler, Hector, Akrour, Riad, Preux, Philippe
Přispěvatelé:	Scool (Scool), Inria Lille - Nord Europe, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 (CRIStAL), Centrale Lille-Université de Lille-Centre National de la Recherche Scientifique (CNRS)-Centrale Lille-Université de Lille-Centre National de la Recherche Scientifique (CNRS), Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 (CRIStAL), Centrale Lille-Université de Lille-Centre National de la Recherche Scientifique (CNRS), Université de Lille, Centrale Lille, Institut National de Recherche en Informatique et en Automatique (Inria), Inria Lille Nord Europe - Laboratoire CRIStAL - Université de Lille, Scool [Scool], Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 [CRIStAL]
Jazyk:	angličtina
Rok vydání:	2023
Předmět:	Reinforcement Leaning Supervised Learning Interpretability [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
Zdroj:	RR-9503, Inria Lille Nord Europe-Laboratoire CRIStAL-Université de Lille. 2023
Popis:	Interpretability of AI models allows for user safety checks to build trust in these models. In particular, decision trees (DTs) provide a global view on the learned model and clearly outlines the role of the features that are critical to classify a given data. However, interpretability is hindered if the DT is too large. To learn compact trees, a Reinforcement Learning (RL) framework has been recently proposed to explore the space of DTs. A given supervised classification task is modeled as a Markov decision problem (MDP) and then augmented with additional actions that gather information about the features, equivalent to building a DT. By appropriately penalizing these actions, the RL agent learns to optimally trade-off size and performance of a DT. However, to do so, this RL agent has to solve a partially observable MDP. The main contribution of this paper is to prove that it is sufficient to solve a fully observable problem to learn a DT optimizing the interpretability-performance trade-off. As such any planning or RL algorithm can be used. We demonstrate the effectiveness of this approach on a set of classical supervised classification datasets and compare our approach with other interpretability-performance optimizing methods.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::a4fe0986c0ed751874aa667fcc467838 https://hal.science/hal-04060986/document Zobrazit plný text záznamu