Forecasting Smart Meter Energy Usage Using Distributed Systems and Machine Learning
Author: | Chris Dong, Lingzhi Du, Feiran Ji, Zizhen Song, Yuedi Zheng, Alexander Howard, Paul Intrevado, Diane Myung-kyung Woodbridge |
---|---|
Year of publication: | 2018 |
Subject: |
Distributed database; Computer science; Smart meter; Energy; Distributed computing; Engineering and technology; Energy consumption; Machine learning; Random forest; Smart grid; Electrical engineering, electronic engineering, information engineering; Leverage (statistics); Artificial intelligence |
Source: | HPCC/SmartCity/DSS |
DOI: | 10.1109/hpcc/smartcity/dss.2018.00216 |
Description: | In this research, we explore the technical and computational merits of a machine learning algorithm on a large data set, employing distributed systems. Using 167 million (10 GB) energy consumption observations collected by smart meters from residential consumers in London, England, we predict future residential energy consumption using a Random Forest machine learning algorithm. Distributed systems such as AWS S3 and EMR, MongoDB and Apache Spark are used. Computational times and predictive accuracy are evaluated. We conclude that there are significant computational advantages to using distributed systems when applying machine learning algorithms on large-scale data. We also observe that distributed systems can be computationally burdensome when the amount of data being processed is below a threshold at which it can leverage the computational efficiencies provided by distributed systems. |
Database: | OpenAIRE |
External link: |
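
The description mentions predicting future consumption with a Random Forest. As a rough illustration of that idea only, the following single-machine sketch forecasts the next reading from the previous few readings using a small ensemble of regression trees (bootstrap samples plus a random feature subset at each split). It is not the authors' implementation, which the record says ran on distributed systems such as Apache Spark; the synthetic half-hourly series, the lag count, and every function name here are assumptions made for the example.

```python
import math
import random


def mean(values):
    return sum(values) / len(values)


def make_lagged(series, n_lags):
    """Build rows of the previous n_lags readings and the reading that follows."""
    X = [series[i - n_lags:i] for i in range(n_lags, len(series))]
    y = [series[i] for i in range(n_lags, len(series))]
    return X, y


def sse(values):
    """Sum of squared errors around the mean (0 for an empty list)."""
    if not values:
        return 0.0
    m = mean(values)
    return sum((v - m) ** 2 for v in values)


def build_tree(X, y, depth, max_features, rng):
    """Grow a regression tree; each split looks at a random subset of features."""
    if depth == 0 or len(y) < 4:
        return mean(y)
    best = None
    for f in rng.sample(range(len(X[0])), max_features):
        # Skip the smallest value so both sides of the split are non-empty.
        for t in sorted({row[f] for row in X})[1:]:
            left = [yi for row, yi in zip(X, y) if row[f] < t]
            right = [yi for row, yi in zip(X, y) if row[f] >= t]
            score = sse(left) + sse(right)
            if best is None or score < best[0]:
                best = (score, f, t)
    if best is None:  # all sampled features are constant
        return mean(y)
    _, f, t = best
    lo = [(r, v) for r, v in zip(X, y) if r[f] < t]
    hi = [(r, v) for r, v in zip(X, y) if r[f] >= t]
    return (f, t,
            build_tree([r for r, _ in lo], [v for _, v in lo], depth - 1, max_features, rng),
            build_tree([r for r, _ in hi], [v for _, v in hi], depth - 1, max_features, rng))


def predict_tree(node, row):
    """Walk from the root to a leaf; leaves are plain floats."""
    while isinstance(node, tuple):
        f, t, left, right = node
        node = left if row[f] < t else right
    return node


def fit_forest(X, y, n_trees=10, depth=3, max_features=2, seed=0):
    """Train each tree on a bootstrap sample of the rows."""
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(X)) for _ in range(len(X))]
        forest.append(build_tree([X[i] for i in idx], [y[i] for i in idx],
                                 depth, max_features, rng))
    return forest


def predict_forest(forest, row):
    """Average the per-tree predictions."""
    return mean([predict_tree(tree, row) for tree in forest])


# Synthetic stand-in for smart meter data: 3 days of half-hourly kWh readings.
noise = random.Random(42)
readings = [0.5 + 0.4 * math.sin(2 * math.pi * i / 48) + noise.uniform(-0.02, 0.02)
            for i in range(48 * 3)]

X, y = make_lagged(readings, n_lags=4)
split = 100  # earlier rows train the model, later rows evaluate it
forest = fit_forest(X[:split], y[:split])
predictions = [predict_forest(forest, row) for row in X[split:]]
mae = mean([abs(p - actual) for p, actual in zip(predictions, y[split:])])
print(f"Mean absolute error on held-out readings: {mae:.3f} kWh")
```

On a series this small a single process is enough. The description's closing observation concerns the opposite regime, where a distributed setup only pays off once the data exceeds some size threshold.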