Black-box Safety Analysis and Retraining of DNNs based on Feature Extraction and Clustering
Autor: | Mohammed Attaoui, Hazem Fahmy, Fabrizio Pastore, Lionel Briand |
---|---|
Přispěvatelé: | Fonds National de la Recherche - FnR [sponsor], European Commission - EC [sponsor], Interdisciplinary Centre for Security, Reliability and Trust (SnT) > Software Verification and Validation Lab (SVV Lab) [research center] |
Rok vydání: | 2022 |
Předmět: |
Computer science [C05] [Engineering
computing & technology] Software Engineering (cs.SE) FOS: Computer and information sciences Computer Science - Software Engineering Computer Science - Machine Learning Transfer Learning DNN Debugging DNN Functional Safety Analysis Sciences informatiques [C05] [Ingénierie informatique & technologie] Software Clustering DNN Explanation Machine Learning (cs.LG) |
Zdroj: | info:eu-repo/grantAgreement/EC/H2020/694277 |
DOI: | 10.48550/arxiv.2201.05077 |
Popis: | Deep neural networks (DNNs) have demonstrated superior performance over classical machine learning to support many features in safety-critical systems. Although DNNs are now widely used in such systems (e.g., self driving cars), there is limited progress regarding automated support for functional safety analysis in DNN-based systems. For example, the identification of root causes of errors, to enable both risk analysis and DNN retraining, remains an open problem. In this paper, we propose SAFE, a black-box approach to automatically characterize the root causes of DNN errors. SAFE relies on a transfer learning model pre-trained on ImageNet to extract the features from error-inducing images. It then applies a density-based clustering algorithm to detect arbitrary shaped clusters of images modeling plausible causes of error. Last, clusters are used to effectively retrain and improve the DNN. The black-box nature of SAFE is motivated by our objective not to require changes or even access to the DNN internals to facilitate adoption. Experimental results show the superior ability of SAFE in identifying different root causes of DNN errors based on case studies in the automotive domain. It also yields significant improvements in DNN accuracy after retraining, while saving significant execution time and memory when compared to alternatives. Comment: 41 pages, 12 figures, 15 tables |
Databáze: | OpenAIRE |
Externí odkaz: |