ENMFM Algorithm Applicable for the Multi-Source Heterogeneous Data Source Model of Power Grid Regulatory Cloud Platform
Authors: | Lan Haibo, Mu Yongzheng, Lou Tianyue, Cao Liangjing, Liu Huiyong, Xiao Linpeng |
---|---|
Year of publication: | 2020 |
Subject: | |
Source: | Journal of Physics: Conference Series. 1617:012005 |
ISSN: | 1742-6596 1742-6588 |
DOI: | 10.1088/1742-6596/1617/1/012005 |
Description: | With the rapid development of smart grid technology and the arrival of the big data era, applying computational methods to the power grid domain will be a key step in realizing the value of power grid data. Grid data records are not standardized, and a single device recorded under multiple names causes confusion. Data- and algorithm-based identification of power grid equipment names will gradually replace manual recognition. This paper presents the ENMFM algorithm, which works on raw grid data and can accurately identify the same equipment recorded under different names. Equipment name records are important data in power grid business scenarios. However, the same device is easily described differently across situations or by different staff, which causes data redundancy and equipment confusion through misidentified equipment names. Name recognition segments a device name string into structured element units that a computer can process, so that synonymous words and devices recorded under different names can be identified. We study the recognition algorithm, the construction of a domain-specific word library, word segmentation, and similarity calculation. The results show that the ENMFM error rate is only 3.13%; the segmentation results and the confusion matrix of device name similarity show that positive samples are predicted well. In addition, the experiment returns the Top N most similar items, which lets staff apply professional judgment and keeps the work rigorous. This computational approach to power equipment name recognition can greatly reduce the manual work of identifying equipment, improve efficiency in power business scenarios, and lay a solid foundation for a unified smart grid data environment. |
Database: | OpenAIRE |
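
The abstract outlines a pipeline of word library, segmentation, similarity calculation, and Top N ranking, but does not give the ENMFM details. The following is only a minimal sketch of that general idea, not the paper's algorithm: it assumes forward maximum matching for segmentation and Jaccard similarity over token sets, and the word library, synonym map, and function names (`WORD_LIBRARY`, `SYNONYMS`, `segment`, `similarity`, `top_n`) are all hypothetical.

```python
from typing import List, Set, Tuple

# Hypothetical domain word library; the paper's actual library is not given.
WORD_LIBRARY: Set[str] = {
    "220kv", "110kv", "no.1", "no.2", "#1", "main",
    "transformer", "xfmr", "line", "breaker",
}
# Hypothetical synonym map folding variant spellings into one canonical token.
SYNONYMS = {"#1": "no.1", "xfmr": "transformer"}


def segment(name: str, library: Set[str] = WORD_LIBRARY) -> List[str]:
    """Forward maximum matching: take the longest library word at each position."""
    text = name.lower().replace(" ", "")
    max_len = max(len(w) for w in library)
    tokens, i = [], 0
    while i < len(text):
        for j in range(min(len(text), i + max_len), i, -1):
            if text[i:j] in library:
                tokens.append(text[i:j])
                i = j
                break
        else:
            # No library word starts here: keep the character as its own token.
            tokens.append(text[i])
            i += 1
    return tokens


def similarity(a: str, b: str) -> float:
    """Jaccard similarity between the synonym-normalized token sets of two names."""
    sa = {SYNONYMS.get(t, t) for t in segment(a)}
    sb = {SYNONYMS.get(t, t) for t in segment(b)}
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


def top_n(query: str, candidates: List[str], n: int = 3) -> List[Tuple[str, float]]:
    """Return the n candidates most similar to the query, best first."""
    scored = [(c, similarity(query, c)) for c in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:n]


if __name__ == "__main__":
    records = ["220kV #1 main xfmr", "220kV No.2 Main Transformer", "110kV line breaker"]
    for name, score in top_n("220kV No.1 Main Transformer", records, n=2):
        print(f"{score:.2f}  {name}")
```

Returning a ranked shortlist instead of a single match mirrors the abstract's point that staff make the final call from the Top N candidates.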