Powernet: A Smart Energy Forecasting Architecture Based On Neural Networks
Powernet: A Smart Energy Forecasting Architecture Based On Neural Networks
net/publication/345970274
CITATIONS READS
0 3
7 authors, including:
Partha Biswas
Nanyang Technological University
43 PUBLICATIONS 362 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Partha Biswas on 17 November 2020.
Abstract:
Electricity demand forecasting is a critical task for efficient, reliable and economical operation of the power grid, which is one of the
most essential building blocks of smart cities. Accurate forecasting allows grid operators to properly maintain the balance of supply
and demand as well as to optimize operational cost for generation and transmission. Several efforts have been made recently to
apply various machine learning techniques for this purpose. This article proposes a novel neural network architecture PowerNet
which can incorporate multiple heterogeneous features such as historical energy consumption data, weather data and calendar
information for the demand forecasting task. Using real-world smart meter dataset, we conduct an extensive evaluation to show
the advantages of PowerNet over recently-proposed machine learning methods such as Gradient Boosting Tree (GBT), Support
Vector Regression (SVR), Random Forest (RF) and Gated Recurrent Unit (GRU). PowerNet demonstrates notable performance in
reducing both the median and worst-case prediction errors when forecasting demands of individual residential households. We fur-
ther provide empirical results concerning the two operational considerations that are crucial when using PowerNet in practice: the
time horizon the model can predict with a decent accuracy and the frequency of training the model to retain its modeling capability.
Finally, we briefly discuss a multi-layer anomaly/electricity-theft detection approach based on PowerNet demand forecasting.
Power Demand
Power Consumption
Weather & Calendar Fusion Layer
Encoding Layer
demand via a Prediction Layer. In the following, we dissect each Specifically, we apply a stacked LSTM to every time step of the
component of PowerNet. power consumption time series data E,
RF
Energy Consumption(kW)
GRU 0.5 GRU
1.4
0.4
1.2 0.3
0.2
1.0 0.1
0 10 20 30 40 0 10 20 30 40
Time(hour)
Time(hour)
Fig. 4: Forecasting results of Apartment 91 (Summer)
Fig. 3: Forecasting results of Apartment 69 (Spring)
400 Original
1.8 PowerNet
350 SVR
1.6 GBT
300 Energy Consumption(kW)
RF
GRU
250 1.4
MAPE
200
1.2
150
100 1.0
50 0.8
0
PowerNet GBT SVR RF GRU 0.6
Prediction Method 0 10 20 30 40
Time(hour)
Fig. 5: Comparison in Distribution of MAPE Fig. 6: Forecasting results of Aggregated Consumption of 16 apart-
ments (Spring)
selected datasets by 8%, 34% and 14%, respectively, compared to 4.3 Forecasting Period of PowerNet
the second-best model.
We further conduct experiments with more apartment data to In general, the accuracy of power demand forecasting deteriorates as
compare the accuracy in terms of distribution of MAPE. We use the prediction horizon moves farther. Therefore, it is crucial for grid
summer season consumption data for randomly picked 50 apart- operators to know how much ahead in time the PowerNet can pre-
ments and perform the same experiment for each apartment to dict the demand without significant drop in accuracy. In this section,
calculate MAPE. The result is summarized in Fig. 5. As can be seen, we provide empirical results on forecasting accuracy against differ-
overall the MAPE of PowerNet is lower than the other competitors. ent forecasting periods using the real-world electricity consumption
In general, as predicting demand of individual household level is data. By doing so, grid operators can evaluate whether PowerNet
challenging, MAPE is often high. However, MAPE of PowerNet is is suitable for certain tasks that require different lengths of predic-
still bounded below 100% and the median is below 50%. tion period, such as bidding in the day-ahead electricity market and
Lastly, we conduct experiments with aggregated energy consump- day-ahead electricity scheduling which require the one day-ahead
tion data. We evaluate accuracy with 2 different aggregation levels, forecasting results [23].
16 apartments and 114 apartments (i.e., all apartments available in Some features for predicting the power demand in the far future
the dataset). For each case, we predict for 48 hours as done in may not available at the time of prediction. For example, the power
the experiments for individual apartments, and calculate MAPE and consumption of the previous one hour is an important feature to
MSE. As seen in Table 3, when the aggregation level is low, Power- predict the power demand for the next hour. If we predict beyond
Net has advantage over the others. The corresponding plot is found one hour at once, we would not know the actual consumption value
in Fig. 6. On the other hand, when consumption values of all apart- for every ‘previous’ hour. Therefore, the prediction in the far future
ments are aggregated, all prediction models except GRU perform relies on the predicted values prior to that. The fact has an inherent
reasonably well. Based on these results as well as the results of indi- risk of error accumulation.
vidual apartment experiments discussed earlier, PowerNet exhibits In this experiment, we predict the power demand for the future 30
competitive performance alongwith mostly better accuracy over the days at once based on current historical data. We train the model on
other models evaluated in our set-up. However, we admit that our the aggregated historical data in July and predict the power demand
results may have bias caused by the specific dataset we use in this for the following 30 days. The forecasting results are shown in Fig. 7
study and evaluation with other datasets for generality will be part in red. We can see that the red line follows the original peaks and val-
of our future work. leys well at the beginning. However, starting from a point around 550
on the x-axis, the red line totally loses track of the original values. In
manner.5It means that the more the user steals, the larger the devia-
400% 70% 0.165389 0.409455582
tion between
6 the predicted value Mp and the reported value Mr is.
200% 60% 0.318785 0.685253794
In other words, the more the user steals, the more obvious the devi-
7
50% 0.475108 0.991160088
ation is. A reasonable threshold that would trigger an alarm can be
8from the historical data as well as the tolerance of theft.
0% inferred
40% 0.656682 1.453382155 9
10% 20% 30% 40% 50% 60% 70% 80% 90%
30% 0.896347 2.293428141 Anomaly detection can be deployed in both substation layer and
10 consumer layer. We discuss how PowerNet can be utilized
Electricity theft percetage individual
20% 1.295557 4.10831409 11 such anomalies in both layers.
to detect
Fig. 9: The MAPE predictions10% 1.618826 theft scenar-
over different electricity 9.155661892
Anomaly detection in substation layer: At the substation level,
0%
ios characterized by the theft percentage 1.852523
from 10% nan
to 90%. there is a master meter which is a meter to measure the aggregated
consumption of the whole supply region.P The reading of master
Nc i
50.00% powernet gbt
meter is denoted as M s . So we have M s = i=1 Mu + T L, where
mse mape Nc is the number of consumers in the supply region and T L is the
40.00%
90% 0.037519 0.12976528 technical loss. The substation can observe Mri which is the reported
30.00%
MAPE