Quantifying Uncertainty in Machine Learning-Based Power Outage Prediction Model Training: A Tool for Sustainable Storm Restoration

Autor: Emmanouil N. Anagnostou, Feifei Yang, Abul Ehsan Bhuiyan, Diego Cerrai, D. W. Wanik
Jazyk: angličtina
Rok vydání: 2020
Předmět:
Computer science
0208 environmental biotechnology
Geography
Planning and Development

lcsh:TJ807-830
outage prediction model
0211 other engineering and technologies
lcsh:Renewable energy sources
02 engineering and technology
Management
Monitoring
Policy and Law

Machine learning
computer.software_genre
Representativeness heuristic
severe weather
cross entropy
uncertainty
lcsh:Environmental sciences
Event (probability theory)
lcsh:GE1-350
021110 strategic
defence & security studies

Severe weather
Renewable Energy
Sustainability and the Environment

business.industry
lcsh:Environmental effects of industries and plants
Training (meteorology)
Storm
sample size
020801 environmental engineering
Cross entropy
lcsh:TD194-195
event severity representativeness
machine learning
Sample size determination
Artificial intelligence
business
computer
Predictive modelling
Zdroj: Sustainability
Volume 12
Issue 4
Sustainability, Vol 12, Iss 4, p 1525 (2020)
ISSN: 2071-1050
DOI: 10.3390/su12041525
Popis: A growing number of electricity utilities use machine learning-based outage prediction models (OPMs) to predict the impact of storms on their networks for sustainable management. The accuracy of OPM predictions is sensitive to sample size and event severity representativeness in the training dataset, the extent of which has not yet been quantified. This study devised a randomized and out-of-sample validation experiment to quantify an OPM&rsquo
s prediction uncertainty to different training sample sizes and event severity representativeness. The study showed random error decreasing by more than 100% for sample sizes ranging from 10 to 80 extratropical events, and by 32% for sample sizes from 10 to 40 thunderstorms. 
This study quantified the minimum number of sample size for the OPM attaining an acceptable prediction performance. The results demonstrated that conditioning the training of the OPM to a subset of events representative of the predicted event&rsquo
s severity reduced the underestimation bias exhibited in high-impact events and the overestimation bias in low-impact ones. We used cross entropy (CE) to quantify the relatedness of weather variable distribution between the training dataset and the forecasted event.
Databáze: OpenAIRE