Gradient boosting and Shapley additive explanations for fraud detection in electricity distribution grids.

Autor: Santos, Ricardo N., Yamouni, Sami, Albiero, Beatriz, Vicente, Renato, A. Silva, Juliano, F. B. Souza, Tales, C. M. Freitas Souza, Mario, Lei, Zhili
Předmět:
Zdroj: International Transactions on Electrical Energy Systems; Sep2021, Vol. 31 Issue 9, p1-13, 13p
Abstrakt: Summary: Fraud in electrical energy consumption represents a critical economic burden for utility companies around the world. Despite systematic efforts to mitigate electricity theft, this practice persists mostly in developing countries where companies rely on traditional detection methods. In Brazil it is estimated that around 7% of the total electrical energy available for consumption in 2016 was lost due to frauds. Here we describe an efficient and scalable system to predict fraudulent behavior and guide in loco inspections. We compared the performances of several machine learning algorithms using consumption and inspection data provided by CPFL Energia. We show that proper feature engineering and boosted classification trees trained with XGBoost are able to extract patterns related to fraud occurrence and to achieve predictive power of practical consequences. Moreover, we demonstrate how Shapley additive explanation (SHAP) values can be employed to build user friendly explanations. Together, the proposed model and its explainers contribute not only to reveal potentially fraudulent behavior but also to understand root causes, what can be used to devise robust mitigation strategies. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index