A predictive noise correction methodology for manufacturing process datasets

Autor:	Omogbai Oleghe
Jazyk:	angličtina
Rok vydání:	2020
Předmět:	Machine learning Manufacturing process Noise correction Feature reduction Classification prediction Computer engineering. Computer hardware TK7885-7895 Information technology T58.5-58.64 Electronic computers. Computer science QA75.5-76.95
Zdroj:	Journal of Big Data, Vol 7, Iss 1, Pp 1-27 (2020)
Druh dokumentu:	article
ISSN:	2196-1115
DOI:	10.1186/s40537-020-00367-w
Popis:	Abstract In manufacturing processes, datasets intended for data driven decisions are majorly generated from time-sequenced sensor readings. Industrial sensor systems are prone to transmit inaccurate readings, which result in noisy datasets. Noisy datasets inhibit machine learning and knowledge discovery. Using a multi-stage, multi-output process dataset as an experimental case, this article reports a methodology for replacing erroneous sensor values with their predicted likely values. In the methodology, invalid values specified by process owners are first converted to missing values. Then, ReliefF algorithm is used to select the most relevant features to progress for prediction modelling, and also to boost the performance of the prediction model. A Random Forest classifier model is built to predict replacement values for the missing values. Finally, predicted values are inserted into the dataset to fill in the missing entries. With many attributes having a significant number of erroneous values, the invalid values replacement is done one attribute at a time. To do this systematically, the process flow direction and stages in the manufacturing process are exploited to partition the dataset into subsets for model building. The results indicate that the methodology is able to replace erroneous values with likely true values, to a very high degree of accuracy. There is a paucity of this type of methodology for dealing with invalid entries in process datasets. The methodology is useful for both missing and invalid value correction in process datasets. In the future, the plan is to inject the prediction models into streaming data to simultaneously enable erroneous value correction and predictive process monitoring in real-time.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/7ce0e5e900804c7f9ecf165372c1dadf Zobrazit plný text záznamu Full text from SpringerLink View record in DOAJ