Evaluating Energy Performance Certificate Data with Data Science
Autor: | Vitor Santos, Maria Anastasiadou, José Sales Dias |
---|---|
Přispěvatelé: | NOVA Information Management School (NOVA IMS), Information Management Research Center (MagIC) - NOVA Information Management School |
Rok vydání: | 2021 |
Předmět: |
Clustering algorithms
prediction of retrofitting measures Energy Engineering and Power Technology Forestry Data science Correlation Energy performance of buildings Energy measurement machine learning Systematics SDG 13 - Climate Action SDG 7 - Affordable and Clean Energy Buildings Electrical and Electronic Engineering Energy performance certification Computer Science(all) |
Zdroj: | 2021 International Conference on Electrical, Computer and Energy Technologies (ICECET). |
DOI: | 10.1109/icecet52533.2021.9698806 |
Popis: | Anastasiadou, M., Santos, V., & Dias, M. S. (2021). Evaluating Energy Performance Certificate Data with Data Science. In 2021 International Conference on Electrical, Computer and Energy Technologies (ICECET) (pp. 1-5). IEEE. https://doi.org/10.1109/ICECET52533.2021.9698806 The related problems of improving existing buildings' energy performance, reducing energy consumption, and improving indoor comfort and their many consequences are well known. Considering increasing urbanization and climate change, governments define strategies to enhance and measure buildings' energy performance and energy efficiency. This work aims to contribute to the improvement of buildings' characteristics by conducting a thorough systematic literature review and adopting a data science approach to these problems, presenting initial results with an open-access energy performance certificate dataset from the Lombardy Region, in Italy. We provide a pre-processing method to the data, applicable for future research, aiming to address challenges such as automatic classification of existing buildings' energy performance certification, and predicting energy-efficient retrofit measures, using machine learning techniques. The analysis of this dataset is challenging because of the high variability and dimensionality of this dataset. For this purpose, a robust iterative process was developed. First, the data dimensionality was reduced with Pearson Correlation to find the best set of variables against the non-renewable global energy performance index (EPgl, nren). Then, the outliers were handled by utilizing Box Plot and Isolation Forest algorithms. The main contribution is to inform private and public building sectors on dealing with high dimensional data to achieve enhanced energy performance and predict energy-efficient retrofit measures to improve their energy performance. authorsversion published |
Databáze: | OpenAIRE |
Externí odkaz: |