A novel variable selection approach based on co-linearity index to discover optimal process settings by analysing mixed data

Autor: Cinzia Giannetti, Meghana R. Ransing, David T. Gethin, David Bould, Johann Sienz, Rajesh S. Ransing
Rok vydání: 2014
Předmět:
Zdroj: Computers & Industrial Engineering. 72:217-229
ISSN: 0360-8352
DOI: 10.1016/j.cie.2014.03.017
Popis: In the last two decades the application of statistical techniques to process control has gained popularity due to the widespread adoption of quality management systems such as ISO9001. Demonstration of continual process improvement by monitoring process effectiveness has become an integral part of satisfying the requirements of clause 8 of the ISO9001:2008 standard. The process effectiveness is measured in terms of one or more process responses. Data driven approaches are often used to associate the variability in process responses with one or more process variables. However, traditional techniques become unpractical in the presence of large number of variables and noisy data sets. This paper extends the co-linearity index and penalty matrix approach (Ransing et al., 2013) for discovering noise free correlations between heterogeneous process variables and responses. Noise is removed by reducing the dimensionality of the variable space and using robust data pre-treatment methods which are more suitable in the presence of outliers and skewed distributions for process variables. Scaling factors have been proposed to balance variance contributions from response variables, quantitative and categorical variables. The proposed method allows process variables with skewed distribution to contribute more to the variance than Gaussian distributed variables so that these variables can be investigated further, if necessary. Correlations are visualised in a single plot and can be used in real industrial settings to assist process engineers in manufacturing diagnosis and root cause analysis. The applicability and validity of this novel method has been demonstrated through two industrial case studies.
Databáze: OpenAIRE