An Iterative Multilayer Unsupervised Learning Approach for Sensory Data Reliability Evaluation
Autor: | Johnathan Votion, Feng Tao, Yongcan Cao |
---|---|
Rok vydání: | 2019 |
Předmět: |
Polynomial regression
Reliability theory Computer science Iterative method Monte Carlo method Overfitting computer.software_genre Computer Science Applications Data modeling Control and Systems Engineering Unsupervised learning Data mining False alarm Sensitivity (control systems) Electrical and Electronic Engineering computer Reliability (statistics) Information Systems |
Zdroj: | IEEE Transactions on Industrial Informatics. 15:2199-2209 |
ISSN: | 1941-0050 1551-3203 |
Popis: | This paper investigates the problem of extracting actionable patterns/models from unlabeled and potentially erroneous datasets in an unsupervised way. To address the need for both model extraction and data reliability evaluation, we propose a novel iterative multilayer micro–macro (IM3) method that defines data reliability, learns micro–macro models, and iteratively refines learned models. The IM3 method includes a general data reliability definition to evaluate the reliability level of each sample, a micro–macro model complexity determination, and an iterative data reliability and model complexity update mechanism to overcome the underfitting and overfitting issue. In particular, we propose a consistency-index-based approach to address underfitting and overfitting in an unsupervised way. The refinement of the learned models is enabled via dropping the most unreliable data until the data reliability is above a given threshold. The sensitivity of the proposed IM3 method with respect to the reliability threshold selection is further quantified via false alarm and missdetection to facilitate the selection of an appropriate reliability threshold. Evaluation of the proposed method and quantitative analysis of its sensitivity are provided on a polynomial regression problem via Monte Carlo simulations. |
Databáze: | OpenAIRE |
Externí odkaz: |