Autor: |
Garay-Martinez, Roberto, Siddique, Muhammad Talha, Lopez-Garde, Juan Manuel |
Předmět: |
|
Zdroj: |
Procedia Computer Science; 2024, Vol. 246, p2071-2079, 9p |
Abstrakt: |
Data-driven methods are increasingly popular for building energy performance assessment. For these to be useful, it is required that good quality data is created through the filtering of outliers and imputation of missing information. In the field of energy use in buildings, there is a clear sensitivity of heat load to outdoor climate, which needs to be considered when identifying outliers and developing imputation methods. We propose to use a well-known changepoint model to define the sensitivity of the data to climate, further segmented by time of the week. Then we use the residuals of the model to identify outliers, where those observation with residuals substantially out of the normality expectations are identified as outliers. Then missing data is repaired by means of linear imputation techniques, considering the patterns for same times of the week in the dataset. As a result of the full process, we were able to identify 5% of outliers, which resulted in the improvement of model metrics in the range of 20% mean absolute error (MAE) and slightly better R2 values. [ABSTRACT FROM AUTHOR] |
Databáze: |
Supplemental Index |
Externí odkaz: |
|