A method for calculation of optimum data size and bin size of histogram features in fault diagnosis of mono-block centrifugal pump
Autor: | V. Sugumaran, V. Indira, R. Vasanthakumari, N. R. Sakthivel |
---|---|
Rok vydání: | 2011 |
Předmět: |
business.industry
Computer science Decision tree learning Feature extraction General Engineering Condition monitoring Pattern recognition Feature selection Centrifugal pump Computer Science Applications C4.5 algorithm Artificial Intelligence Histogram Artificial intelligence business Classifier (UML) |
Zdroj: | Expert Systems with Applications. 38:7708-7717 |
ISSN: | 0957-4174 |
DOI: | 10.1016/j.eswa.2010.12.140 |
Popis: | Research highlights? This paper illustrates a method to choose the number of bins and minimum number of samples required to train the classifier with statistical stability. ? Power analysis method was used to find the minimum number of samples required. ? J48 algorithm was used to validate the results of power analysis and to find the optimum number of bins. Mono-block centrifugal pump plays a key role in various applications. Any deviation in the functions of centrifugal pump would lead to a monetary loss. Thus, it becomes very essential to avoid the economic loss due to malfunctioning of centrifugal pump. It is clear that the fault diagnosis and condition monitoring of pumps are important issues that cannot be ignored. Over the past 25years, much research has been focused on vibration based techniques. Machine learning approach is one of the most widely used techniques using vibration signals in fault diagnosis. There are set of connected activities involved in machine learning approach namely, data acquisition, feature extraction, feature selection, and feature classification. Training and testing the classifier are the two important activities in the process of feature classification. When the histogram features are used as the representative of the vibration signals, a proper guideline has not been proposed so far to choose number of bins and number of samples required to train the classifier. This paper illustrates a systematic method to choose the number of bins and the minimum number of samples required to train the classifier with statistical stability so as to get best classification accuracy. In this study, power analysis method was employed to find the minimum number of samples required and a decision tree algorithm namely J48 was used to validate the results of power analysis and to find the optimum number of bins. |
Databáze: | OpenAIRE |
Externí odkaz: |