Exceptional Model Mining to support Multi-objective Optimization

Autor: Millot, Alexandre, Cazabet, Rémy, Boulicaut, Jean-François
Přispěvatelé: Data Mining and Machine Learning (DM2L), Laboratoire d'InfoRmatique en Image et Systèmes d'information (LIRIS), Université Lumière - Lyon 2 (UL2)-École Centrale de Lyon (ECL), Université de Lyon-Université de Lyon-Université Claude Bernard Lyon 1 (UCBL), Université de Lyon-Institut National des Sciences Appliquées de Lyon (INSA Lyon), Université de Lyon-Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Centre National de la Recherche Scientifique (CNRS)-Université Lumière - Lyon 2 (UL2)-École Centrale de Lyon (ECL), Université de Lyon-Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Centre National de la Recherche Scientifique (CNRS), Université Claude Bernard Lyon 1 (UCBL), Université de Lyon, Institut Rhône-Alpin des systèmes complexes (IXXI), École normale supérieure de Lyon (ENS de Lyon)-Université Lumière - Lyon 2 (UL2)-Université Jean Moulin - Lyon 3 (UJML), Université de Lyon-Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA), FUI DUF 4.0 French programme, LIRIS UMR 5205
Jazyk: angličtina
Rok vydání: 2022
Předmět:
Zdroj: LIRIS UMR 5205. 2022
Popis: Given numerical data about objects defined by descriptive and target attributes, we investigate the discovery of interesting conjunctions of descriptive attribute range of values that are associated to optimized target values. This can be useful across many application domains where one wants to perform Multi-Objective Optimization: the goal is to find the best compromise between the competing objectives that correspond to the different targets and one expects to learn about the optimal values of the descriptive attributes. For this purpose, we design Exceptional Model Mining instances: we look for subsets of objects-subgroups-whose models deviate significantly from the same models fitted on the whole dataset. A first method, called Exceptional Pareto Front Deviation Mining (EPFDM), characterizes the differences between the Pareto front computed on the original data and the Pareto front computed after removing a subgroup of objects. We discuss in detail the design of a generic quality measure for EPFDM and we provide comprehensive empirical results. We also develop an approach called Exceptional Pareto Front Approximation Mining (EPFAM), whose goal is the discovery of models that approximate exceptionally well the true Pareto front. Beside empirical studies that consider both the qualitative and quantitative aspects of EPFDM and EPFAM, we present a use-case on plant growth recipe optimization in controlled environments, a timely challenge for a smarter agriculture.
Databáze: OpenAIRE