Received: 17 April 2021 - Revised: 16 May 2021 - Accepted: 18 May 2021 - IET Electrical Systems in TransportationDOI: 10.1049/els2.12028
CA S E S TUDY
Short‐term load forecasting at electric vehicle charging sites
using a multivariate multi‐step long short‐term memory: A case
study from Finland
Tim Unterluggauer1,2,3 | Kalle Rauma2 | Pertti Järventausta1 | Christian Rehtanz2
1Unit of Electrical Engineering, Tampere University, Abstract
Tampere, Finland This study assesses the performance of a multivariate multi‐step charging load prediction
2Institute of Energy Systems, Energy Efficiency and approach based on the long short‐term memory (LSTM) and commercial charging data.
Energy Economics, TU Dortmund University,
Dortmund, Germany The major contribution of this study is to provide a comparison of load prediction be-
3 tween various types of charging sites. Real charging data from shopping centres, resi-Department of Electrical Engineering, Technical
University of Denmark, Denmark dential, public, and workplace charging sites are gathered. Altogether, the data consists of
50,504 charging events measured at 37 different charging sites in Finland between January
Correspondence 2019 and January 2020. A forecast of the aggregated charging load is performed in 15‐
Kalle Rauma, Institute of Energy Systems, Energy min resolution for each type of charging site. The second contribution of the work is
Efficiency and Energy Economics, TU Dortmund the extended short‐term forecast horizon. A multi‐step prediction of either four (i.e., one
University, Emil‐Figge‐Str. 76, 44227 Dortmund,
Germany. hour) or 96 (i.e., 24 h) time steps is carried out, enabling a comparison of both horizons.
Email: kalle.rauma@tu-dortmund.de The findings reveal that all charging sites exhibit distinct charging characteristics, which
affects the forecasting accuracy and suggests a differentiated analysis of the different
Funding information charging categories. Furthermore, the results indicate that the forecasting accuracy
The German Federal Ministry of Transportation and strongly correlates with the forecast horizon. The 4‐time step prediction yields consid-
Digital Infrastructure
erably superior results compared with the 96‐time step forecast.
Open access funding enabled and organized by
Projekt DEAL.
1 | INTRODUCTION could facilitate the seamless integration of intermittent
renewable energy sources into the power grid [5–7]. A large
In 2017, the transport sector was responsible for 27% of all number of EVs could also benefit transmission system
greenhouse gas emissions in the European Union, with pas- operators by providing their flexibility in the form of a control
senger cars accounting for 44% of the transport emissions [1]. reserve to the energy market [8, 9], or assist distribution system
Electric vehicles (EVs) are one of the solutions to cut carbon operators in terms of voltage regulation, congestion manage-
emissions in the transport sector and achieve the climate ment, peak shaving, and valley filling measures [10–12].
protection goals. Other related concerns, such as urban air For both cases, an accurate short‐term EV charging load
pollution and its impact on health, have also encouraged pol- forecast is of utmost importance. For grid operators, the
iticians to promote the adoption of EVs [2]. aggregated charging load forecast is relevant to detect bottle-
The resulting anticipated large‐scale EV rollout represents necks in the distribution grid at an early stage and initiate
both a considerable challenge and an opportunity for the po- appropriate measures [13]. Aggregators, who aggregate a large
wer system. On the one hand, the simultaneous charging of a number of EVs, also rely on accurate capacity predictions to be
large number of EVs could lead to severe bottlenecks in the able to market EV flexibilities in the energy market for ancil-
distribution network, requiring costly grid upgrades [2–4]. On lary services [4, 14].
the other hand, EVs could also benefit the power system by With this motivation in mind, this work introduces a novel
providing ancillary services. Due to their fast‐response ability, charging load prediction application based on the long short‐
high degree of flexibility, and energy storage capacity, EVs term memory. A multivariate multi‐step forecasting approach
This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is
properly cited.
© 2021 The Authors. IET Electrical Systems in Transportation published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology.
IET Electr. Syst. Transp. 2021;1–15. wileyonlinelibrary.com/journal/els2 - 1
2 - UNTERLUGGAUER ET AL.
is presented, which considers important determinants of the predictions. Compared to their linear counterparts, non‐linear
load curve as multivariate inputs and allows the forecast of the approaches rely on non‐linear functions to capture more
whole prediction horizon in a single computation. complex time series behavior. Study [20] applies four different
The subsequent parts of this paper are organised as fol- algorithms, namely the RF, support vector regression (SVR),
lows. Section 2 presents the state of the art in recent studies. time weighted dot product based nearest neighbor
Section 3 deals with the basics of LSTMs and the origin of the (TWDPNN), and MPSF for 24‐h charging load forecasting.
data used in this work. Section 4 introduces the methodology The evaluation is based on two different datasets – charging and
of the charging load forecast. Section 5 presents the results, station measurements. The most accurate predictions based on
which are further discussed in Section 6. Conclusions and di- the charging measurement are achieved by using the TWDPNN
rections for future research are provided in Section 7. approach, whereas the MPSF provides the most precise results
for the station measurement dataset. An approved RF algo-
| rithm is applied to predict the EV charging load of single and2 RELATED RESEARCH grouped EV charging stations in [21]. The analysis reveals that
in contrast to an aggregated prediction, more precise results are
In this section, a literature review of the state of the art in the obtained while performing individual forecasts.
field of EV short‐term charging load forecasting is conducted Moreover, considerable attention has been paid to artificial
to identify the shortcomings in previous research and highlight neural networks (ANNs) lately. The predictive performance of
the contributions of this work. different neural networks (NN) is frequently addressed in the
literature [22–25]. Article [22] performs a 24‐h charging load
2.1 | Linear methods forecast based on historical driving patterns. Three different
ANNs – a simple ANN, a rough artificial neural network
Linear forecasting approaches use linear functions to model (R‐ANN), and a recurrent rough artificial neural network
time series behaviour. Popular methods include autoregressive, (RR‐ANN) – are deployed and compared to a Monte Carlo
moving average, autoregressive moving average, and autore- simulation (MCS). RR‐ANNs generate the most accurate pre-
gressive integrated moving average (ARIMA) processes. Con- dictions. In [23], a novel reinforcement learning technique is
cerning the prediction of the EV charging load, ARIMA introduced to forecast the EV charging load under three sce-
models in different versions are particularly popular among all narios – uncoordinated, coordinated, and smart charging. The
linear approaches. A plain ARIMA model is introduced in [15] proposed Q‐learning technique increases the prediction accu-
to forecast the day‐ahead EV charging demand in 15‐min racy compared to a simple ANN and a recurrent neural network
(min) resolution. The prediction error is lowered with larger (RNN). The work carried out in [24] studies the short‐term
aggregation. Another work [16] examines the applicability of charging load predictability of four different types of ANNs
an ARIMA model to predict the EV charging demand of the – a deep neural network (DNN), an RNN, an LSTM, and a
next 24 h (h). The proposed decoupled forecaster, which gated recurrent unit (GRU). GRUs with one hidden layer
independently predicts the charging demand and the conven- achieve the most accurate prediction results. Study [25] pro-
tional electrical load, significantly reduces the prediction error vides a further evaluation of different ANN approaches for
compared to an integrated forecaster. super short‐term charging load forecasting. The predictive
The works in [17–19] present a more sophisticated version performance of a simple ANN, an RNN, an LSTM, a bidi-
of the ARIMA model. While paper [17] introduces a fractional rectional LSTM, a GRU, and a stacked auto‐encoder (SAE) are
autoregressive integrated moving average (FARIMA) model, analyzed. The LSTM yields the best performance in both case
article [18] proposes several seasonal autoregressive integrated studies – the charging in public and at commercial buildings.
moving average (SARIMA) approaches for EV charging load Similar to the results in [25], articles [26–28] demonstrate the
forecasting. The FARIMA model outperforms various ARIMA superior performance of the LSTMs for EV load forecasting.
models for all forecast horizons up to 120 min. The SARIMA The study in [26] develops an LSTM for single‐step predictions.
models achieve superior results compared to a persistence The analysis demonstrates that the LSTM outperforms a simple
forecast and a modified pattern sequence‐based forecast ANN, both for a forecast horizon of 15 and 30 min and that the
(MPSF). Similar to the work in [18], study [19] introduces two prediction error diminishes with decreasing forecast horizon.
seasonal autoregressive integrated moving average models with Another LSTM forecasting approach is presented in [27]. The
exogenous variables (SARIMAX). Compared to a random work considers four types of EVs – private and commercial EVs,
forest (RF) and gradient boosted regression tree (GBRT) electric busses, and electric taxis. Based on the MCS, the LSTM
model, the SARIMAX models outperform both machine outperforms a backpropagation (BP) network and SVR. Paper
learning algorithms for different forecast horizons. [28] proposes an LSTM‐based approach with hybrid classifica-
tion to forecast EV travel behaviour and their electrical demand.
Compared to copula, quasi‐MCS, and MCS, the novel approach
2.2 | Non‐linear methods improves the forecast accuracy, thus, resulting in lower EV
aggregator costs. The research in [29] presents a hybrid model of
Contrary to the results in [19], several studies indicate a superior extreme gradient boosting and LSTM for domestic charging
performance of non‐linear methods for EV charging load station load forecasting. Multivariate inputs are used to predict
UNTERLUGGAUER ET AL. - 3
the EV consumption in the next period. Study [30] proposes a � Comparing and assessing the forecasting accuracy at
hybrid approach of a deep belief network (DBN) and the LSTM different categories of charging sites which exhibit different
network for short‐term EV load prediction. The LSTM‐DBN charging characteristics, and
achieves superior forecasting results compared to a single � Using a large amount of high‐quality charging data from
DBN and single LSTM. various commercial charging sites.
2.3 | Hybrid methods 3 | DESCRIPTION OF THE DATASET
AND LSTM
Similar to studies [29, 30], other hybrid approaches as com-
pounds of different models have gained great popularity in This section deals with both the characteristics of the real
recent studies. Paper [33] combines a deterministic bottom‐up charging data used in this work and the LSTM fundamentals.
approach with a probabilistic RF model for the short‐term
forecast of the aggregated load of 46 privately owned EVs in
Austin, Texas. The proposed model shows superior perfor- 3.1 | Introduction of the dataset
mance compared to a persistence forecast, but similar to a
GBRT model. The study in [31] combines a least squares This study is based on real charging data of 50,504 slow
support vector machine (LSSVM), fuzzy clustering (FC), and a charging events with a maximum capacity of 22 kW, measured
wolf pack algorithm (WPA) to predict the load of electric bus at 37 different charging sites in Finland between January 2019
charging stations. The FC‐WPA‐LSSVM model outperforms a and January 2020. While the smallest charging site contains less
WPA‐LSSVM, a regular LSSVM and a BP‐NN. Article [32] than 10, the largest one features nearly 300 charging stations.
proposes another hybrid approach, combining a lion algorithm An overview of the characteristics of the charging data is
with the niche immune (NILA) and a convolutional neural illustrated in Table 2.
network (CNN), for short‐term EV charging load forecasting. The charging events are clustered according to the four
Taking into account the multivariate inputs, the NILA‐CNN categories of charging sites selected for this work. The data for
shows superior performance to a single CNN, a lion algo- the first category, the charging at shopping centres (SCc), stems
rithm, CNN, and an SVM. The work in [34] introduces a from one charging site. The charging events of 21 sites are
hybrid model, based on wavelet decomposition (WT), con- assigned to the second category named residential charging
volutional neural networks, and probabilistic queuing, for day‐ (REc), representing the charging at home. The third category,
ahead charging load predictions. The WT‐CNN outperforms a public charging (PUc), covers the charging at car parks that can
BP‐NN, an SAE, an RNN, an SVR, a time‐delayed neural be used by all EV drivers. The last category, workplace
network (TDNN), and a growing DBN. charging (WOc), refers to EV charging at company premises
that can only be used by the staff working in the vicinity. Eight
| and seven sites are assigned to PUc and WOc, respectively.2.4 Research gaps and contributions of
this work
|
Table 1 outlines the main findings of the literature review. All in 3.2 Long short‐term memory
all, three major limitations can be summarised. The studies
either lack real charging data [16, 17, 22, 23, 27] or the data The charging load of EVs is typically subject to strong time
employed is outdated, making it difficult to transfer the results dependencies as the load corresponds to cyclical and seasonal
to today's strongly changed situation in terms of a larger patterns. The LSTM is able to overcome the vanishing or
number of EVs, different charging powers, and the altered exploding gradient problem of RNNs, and to learn both
utilization of EVs [15, 18–20, 33, 34]. Moreover, most of the long‐term and short‐term dependencies, thus making it an
works only perform single‐step predictions [16, 20, 21, 23–27, ideal choice for this work [35]. The ability to learn temporal
29, 31, 32] which limits the scope of practical applications due correlations stems from the unique structure of an LSTM
to the short forecast horizon. Lastly, although in Ref. [25] it has cell, illustrated in Figure 1. The LSTM at a single ts t consists
been indicated that the achievable prediction accuracy may be of three gate units, namely the forget f t, input i t, and output
dependent on the type of charging, no attention has been paid gate ot.
to the analysis and comparison of different charging sites. The functionality of the LSTM cell can be expressed
This work contributes to the identified research gaps in the mathematically by the following formulas [36]
following ways:
  �
� Introducing a novel multivariate, multi‐step LSTM applica- f t ¼ σ W f ;xxt þW f ;hht−1 þ bf ð1Þ
tion, which significantly extends the existing short‐term
forecast horizon by providing a 4‐time step (ts) or 96‐ts   �
prediction at once, i t ¼ σ W i;xxt þW i;hht−1 þ bi ð2Þ
4 - UNTERLUGGAUER ET AL.
TABLE 1 Literature review on studies addressing the issue of short‐term EV charging load forecasting
Baseline Superior Reso‐ Data Data origin
Year ref. Proposed model(s) model(s) model Forecast approach Lution characteristics [year]
2016 [16] ARIMA – Decoupled 24‐h (single‐step) 1‐h Travel behaviour USA
forecast
2016 [20] MPSF, SVR,RF, – MPSF, 24‐h (single‐step) 1‐h Charging &Station UCLA (USA)
TWDPNN TWDPNN records [Dec 2011 ‐
Feb 2014]
2016 [17] FARIMA ARIMA FARIMA 2‐h 1.2‐min Travel behaviour Seattle (USA)
2017 [18] SARIMA MPSF,Persistence SARIMA 1‐, 2‐, & 24‐h 1‐h Charging data California (USA)
(aggregated) [Jan 2011 ‐
Jan 2013]
2018 [31] FC‐WPA‐LSSVM BP‐NN, LSSVM, FC‐WPA‐ 24‐h (multivariate & 1‐h Electric bus Baoding (China)
WPA‐LSSVM LSSVM single‐step) charging data [2017]
2018 [32] NILA‐CNN CNN, SVM,Lion NILA‐CNN 24‐h (multivariate & 30‐min Public charging Beijing (China)
CNN single‐step) (aggregated) [Jun 2017 ‐
Nov 2017]
2018 [21] RF – – multivariate & single‐ 15‐min Public charging Shenzhen (China)
step (single & aggr.) [2016 ‐ 2018]
2019 [15] ARIMA – – 24‐h 15‐min Non‐residential California (USA)
charging (aggr.) [2013]
2019 [22] ANN, R‐ANN,RR‐ MCS RR‐ANN 24‐h 1‐h Travel behaviour USA [2017]
ANN
2019 [26] LSTM ANN LSTM 24‐h (Single‐step) 15‐ &30‐ Public charging Shenzhen (China)
min (aggregated) [Jul 2017 ‐ Jul
2018]
2019 [25] ANN, GRU, LSTM, – LSTM 1‐, 5‐, & 15‐min 1‐min Public & Shenzhen (China)
RNN, SAE, DNN, (single‐step) commercial [ Jun 2017 ‐
charging (aggr.) Jul 2018]
2019 [24] GRU, bidirectional – GRU 24‐h (multivariate & 1‐h Public charging Shenzhen (China)
LSTMLSTM, single‐step) (aggregated) [Apr 2017 ‐
RNN Jun 2018]
2019 [33] Bottom‐up RF GBRT,Persistence Bottom‐up RF 24‐h (multivariate) – Residential Austin (USA)
charging (aggr.) [2015]
2019 [19] GBRT, RF,SARIMAX Persistence SARIMAX 7‐, 14‐, & 28‐days 24‐h Charging data Netherlands
(aggregated) [ Jan 2012 ‐
Mar 2016]
2019 [27] LSTM BP, SVR LSTM Univariate & single‐ 1‐h Simulated charging –
step data
2020 [23] Q‐learning technique ANN, RNN Q‐learning 24‐h (single‐step) 1‐h Simulated charging –
data
2020 [34] WT‐CNN & BP‐NN, SVR, WT‐CNN 24‐h (multi‐step) 1‐h Traffic flow data England, U.K
probabilistic SAE, TDNN, [ Jan 2014 ‐
queuing model RNN, DBN Dec 2014]
2020 [30] LSTM‐DBN DBN, LSTM LSTM‐DBN Multivariate 15‐min Charging data Liaoning, China
(aggregated) [Jan 2018 ‐
Mar 2018]
2020 [28] LSTM Copula, MCS, LSTM Day‐ahead 1‐h Travel behaviour USA [2017]
quasi MCS
2021 [29] Extreme gradient – – Multivariate & single‐ 1‐h Charging data Jiangsu, China
boosting LSTM step (aggregated) [ Jan 2017 ‐
Dec 2018]
UNTERLUGGAUER ET AL. - 5
TA B LE 1 (Continued)
Baseline Superior Reso‐ Data Data origin
Year ref. Proposed model(s) model(s) model Forecast approach Lution characteristics [year]
2021 This Novel LSTM Comparison of different 1‐h & 24‐h 15‐min Charging data Finland [ Jan
paper implementation charging sites & forecast (multivariate & (aggregated) 2019 – Jan
with multivariate horizons multi‐step) 2020]
inputs and multi‐
step forecast
horizon
Abbreviations: ANN, Artificial neural network; ARIMA, autoregressive integrated moving average; CNN, Convolutional neural network; DBN, Deep belief network; DNN Deep neural
network EV, electric vehicle; FARIMA, Fractional autoregressive integrated moving average; FC, Fuzzy clustering; GRU, gated recurrent unit; LSTM, long short‐term memory; MCS,
Monte carlo simulation; MPSF, modified pattern sequence‐based forecast; NILA, Lion algorithm by niche immune; R‐ANN, rough artificial neural network; RR‐ANN, recurrent rough
artificial neural network; RF, Random forest; SAE, stacked auto‐encoder; SARIMAX, Seasonal autoregressive integrated moving average (with exogenous variables; TWDPNN, time
weighted dot product based nearest neighbour; SARIMA, Seasonal autoregressive integrated moving average; WPA, Wolf pack algorithm.
TABLE 2 Characteristics of the EV charging data
W f ;x, W f ;h, W i;x, W i;h, W o;x, W o;h, W ~c x, and W; ~c;h label
# Charging # Charging weight matrices, bf , bi, bo, and b are bias vectors. Operations
Type of charging sites events Measured period ~cþ and ⊙ denote the element‐wise addition and multiplication,
Shopping centre 1 9283 01.01.19 –31.12.19 and ϕ depicts the tanh activation function. The forget gate de-
Residential 21 10,920 21.01.19–20.01.20 termines which information from the previous cell state, c t−1, is
preserved. The cell state represents the long‐term memory of
Public 8 18,785 31.01.19–30.01.20 the LSTM. The input gate decides which information from the
Workplace 7 11,516 31.01.19–30.01.20 candidate cell state, c~t, is to be used to update the previous cell
state. The output gate controls which part of the new cell state,
Abbreviations: EV, electric vehicle.
c t, to output and pass as the hidden state, ht, the short‐term
memory, to the next LSTM cell. The current input, x t , and
the previous hidden state, ht−1, are multiplied with their
respective weights along the bias vectors form the inputs for all
three gates. The sigmoid function, σ, introduces non‐linear
characteristics to the gates and decides which signals pass the
gates. While a value of zero causes signals to disappear, a value of
one ensures that the signals pass the gate.
4 | METHODOLOGY
The methodology of the proposed multivariate multi‐step
LSTM is shown in Figure 2 and consists of three main
stages – data preprocessing, LSTM implementation and
F I GURE 1 Architecture of an long short‐term memory cell at a single training, and LSTM forecasting. EV, electric vehicle; LSTM,
time step t. long short‐term memory
4.1 | Data preprocessing
  �
c~t ¼ ϕ W c xx t þW c hht−1 þ bc ð3Þ~; ~; ~ The following paragraph details the data preprocessing.
c t ¼ f t ⊙ c t−1 þ i t ⊙ c~t ð4Þ 4.1.1 | Aggregated charging load time series
  � generation
ot ¼ σ W o;xx t þW o;hht−1 þ bo ð5Þ
In the first step, the event‐based charging data is converted
into a time series of aggregated charging load values in 15‐min
ht ¼ ot ⊙ ϕðc tÞ ð6Þ resolution under the assumption of a constant charging power.
6 - UNTERLUGGAUER ET AL.
F I GURE 2 Methodology of the LSTM EV
charging load forecast
The data contains both the plug‐in‐ and power‐to‐zero‐ energy
timestamps and the original amount of energy, energyor , load ¼
or
new ð8Þ
charged for each charging event. Consequently, initially, the tnew
original charging time, tor , in min is derived by measuring the
deviation between both timestamps. In case the power‐to‐
zero‐timestamp is missing due to measurement errors, a 4.1.2 | Multivariate input selection
nominal charging power of 1.8 kW is assumed as most of the
charging events indicate charging at low powers. The original To determine the important features to be fed to the LSTMas
charging time is calculated by dividing the original amount of multivariate inputs to support the detection of interrelation-
energy charged by the assumed charging power. Subsequently, ships, an analysis of the characteristics of the charging load is
since the charging times might change due to the analysis in- carried out. Figure 3 illustrates the development of the
terval of 15 min, the original charging time is adjusted subse- aggregated charging load over the course of a year at all four
quently. If the original charging time lasts less than 15 min, it is categories of charging sites and reveals two particularities. First,
rounded up to a full 15‐min interval. Otherwise, a modulo an increase of the aggregated load over the course of the year
operation, mod, is performed and the modified charging time, can be observed for all charging sites. The strong impact of
tnew, is calculated according to (7). holiday periods on REc, PUc, and WOc marks the second
conspicuity. The Finnish summer holidays in July and the
8 9 winter holidays in December noticeably reduce the load. Both
< 15 tor < 15min = findings support the selection of a month indicator as a feature.
tnew ¼ tor −modðtorÞ modðtorÞ < 7:5min ð7Þ
: ; However, since the data only covers one year, a feature monthtor −modðtorÞ þ 15 else would lead to unknown indicators during the data split and is,
thus, omitted.
Furthermore, the modified charging load, loadnew , of each Figure 4 provides a detailed analysis of the charging sites by
charging process is calculated according to (8) under the visualising the charging load for three different weeks of the
premise of a constant amount of energy charged. Lastly, by year. When comparing the load curve of the first week of
aggregating the calculated load of each charging event, the February (green) with that of the end of November (grey), the
charging load time series is generated, spanning one year with increase in charging load over the course of a year is evident.
96 load values for each day. Moreover, a clear impact of the type of day and time of the day
UNTERLUGGAUER ET AL. - 7
F I GURE 3 Aggregated electric vehicle charging load over the course of one year illustrated for each category of charging site
F I GURE 4 Aggregated charging at each
category of charging site exemplified for three
different weeks
can be seen for REc, PUc, and WOc. The charging load varies 4.1.3 | Scaling and encoding of data
significantly between weekdays and weekends, and the time of
the peak load follows a clear pattern. SCc is the only charging To avoid data leakage, the time series is split into training and
site that exhibits a highly random load pattern. When looking test sets with a 90% to 10% ratio prior to the scaling and
at the purple plot, the influence of public holidays on the load encoding. Furthermore, 20% of the training data is used to
is evident as well. For all charging sites, the load is considerably validate the model performance during training, resulting in a
reduced during the three Christmas holidays. Consequently, to 72% training, 18% validation, and 10% test split.
help the LSTM to detect interrelationships, each load value is To accelerate network convergence and allow the LSTM to
linked to a quarter hour indicator, type of day indicator, and process categorical features, the load is normalised, and the
public holiday indicator, collectively representing the multi- indicators are encoded. The commonly used min‐max scaling
variate inputs to the LSTM. method is used to normalise the load values [37]. The popular
8 - UNTERLUGGAUER ET AL.
one hot encoding technique is used to encode the binary In stateless mode 4, 8, 16, and 32 ts are tested as inputs for the
character of the public holiday indicator [38]. For the quarter 4‐ts forecast, 96, and 192 ts for the 96‐ts prediction. The equal
hour and type of day indicators, sine/cosine encoding is tested amount of input and output ts are applied in the stateful mode.
as well to allow a better representation of the cyclic nature of Subsequently, based on the minimal validation loss, the most
both indicators [39, 40]. suitable approach undergoes tuning.
4.1.4 | Supervised learning framing and reshaping 4.2.1 | Training and hyperparameter tuning
of data implementation
Finally, the data is converted into a supervised learning prob- The LSTM is implemented as illustrated in Figure 5, using the
lem and formatted such that the shape meets the specific Sequential Model in Keras. While indices n denote the number
LSTM requirements. In the first step, the normalised and of input features, indices tin depict the amount of input ts for
scaled data is concatenated into a single array for every 15‐min each sample. Indices tout define the number of predicted load
ts. Using one hot encoding, each ts possesses 106 dimensions values subsequent to the input sequence.
of feature space, this number is reduced to seven while using Table 3 provides an overview of the hp selected for initial
the sine/cosine encoding. training and hp tuning. The tuning, based on a random search,
Subsequently, the data is framed in a supervised learning is performed in hyperopt, a popular library for conducting hp
manner by separating the data into input and output sequences. optimisation in Python. A detailed description of the hyperopt
In this study, two approaches are examined – the stateless and is given in [40]. The number of evaluations is set to 50 and the
stateful mode. The sliding window approach is applied for the seeding is set to one to ensure the comparability between the
stateless mode. In the stateful mode, the data is prepared in different charging sites and reproducibility of the results.
such a way that the fixed input sequences adjoin each other
without overlapping. For both cases, the input data contains
the load value and all indicators for each ts, the output only 4.2.2 | LSTM training process
contains the load to be compared with the predicted load.
In the last step, the data is reshaped in the required 3‐ The training proceeds as follows. The training and validation
dimensional shape, defining the number of samples, the data is shown to the LSTM in the input layer. Both data sets
number of input ts, and the number of features for each ts. consist of multiple samples which are successively processed
by the hidden layer(s), each comprising the input and output
sequences. For each sample, the concatenated features of each
4.2 | Long short‐term memory ts form the current input for the LSTM cell. In the stateless
implementation and training mode, the final state of each processed batch is removed. In
the stateful mode, however, the final state is provided as the
The most suitable LSTM configuration is selected in two steps. initial state for each sample in the next batch. The state is
First, the LSTM is trained with initially selected hyper- manually reset after each epoch as each epoch contains the
parameters (hp) on different approaches: the two encoding same time series data. After the input data has been processed
techniques, the two modes, and a varying number of input ts. by either one or two hidden layers, the final state of the last cell
F I GURE 5 Architecture and training process
of the long short‐term memory. MSE, mean
squared error
UNTERLUGGAUER ET AL. - 9
TABLE 3 Initial hyperparameters and
Type of hp Initial Training Hp Tuning
search space
Optimiser Adam No tuning
Loss function MSE No tuning
Activation function Tanh & hard sigmoid (cell), ReLU (dense layer) No tuning
# Epochs 500 1000 (no tuning)
# Hiddenlayers 1 1, 2
# Units 2, 8, 32, 128 2, 4, 8, 16, 32, 64, 128
Learning rate 0.001 0.01, 0.001, 0.0001
Batch size 1 (stateful), 32 (stateless) 16, 32, 64, 96
Dropout [%] 0 0, 20, 50
Abbreviations: MSE, mean squared error; ReLU, Rectified Linear Unit activation function.
in the last hidden layer is passed to a dense layer which outputs scale‐dependent, which implies the need for additional error
the number of ts to be predicted for each input sequence. metrics to allow a better comparison of the different charging
Once a certain number of samples, defined by the batch size, is sites. Given its scale‐independence, the popular mean absolute
processed, the predicted values are compared with the desired percentage error (MAPE) would provide an easily interpreta-
real load values to calculate the training and validation loss. tive error metric. However, the charging load time series
Based on the mean squared error (MSE) during training, the exhibit a charging load of zero at numerous points in time. For
weights are then modified after each batch size by the Adam this reason, MAPE cannot be used for overall comparison, as
optimiser using backpropagation through time. This process is the calculation is based on the division of the error by the true
repeated until the maximum number of epochs is met or the load value at each ts. To overcome these difficulties, two var-
training is terminated prematurely by the implemented early iants of the normalised mean absolute error (NMAE) are used
stopping callback. as shown in Equations (10) and (11). The MAE is normalised
by the mean charging load and the difference between the
maximum loadmax and minimum load loadmin, respectively.
4.3 | LSTM forecasting
1 XN
After the completion of the random search, the LSTM with the MAE ¼ jloadt − loâdtj ð9ÞN
optimal hp configuration performs the load forecast. t¼1
, 1 XN
| NMAE1¼MAE loadt ð10Þ4.3.1 Implementation of the forecast N t¼1
The forecast is carried out with a batch size of one, to avoid
errors in the prediction process caused by the varying batch MAENMAE2¼ ð11Þ
sizes during training. For every input sequence of the test data, loadmax − loadmin
the model predicts the subsequent four or 96 ts. Afterwards,
both the normalised real and predicted load are converted to For the 96‐ts forecast, three additional metrics are used to
load values in kW. The inverse normalisation is performed with better assess how well the daily peak load can be predicted,
the initial parameters of the min‐max scaler that have been both in terms of temporal occurrence and magnitude of the
saved during the preprocessing stage for this purpose. peak load. The timing and magnitude of the peak load are
especially important for assessing possible congestions in the
distribution network. The average peak deviation, pdev, in
4.3.2 | Metrics kW between the actual daily peak load, loaddm, and predicted
daily peak load, loâddm, is calculated according to Equa-
Six different error metrics are employed to evaluate the model tion (12). While D specifies the number of predicted days, dm
performance. The mean absolute error (MAE) in kW, calcu- denotes the daily peak. Likewise, the MAPE in % is calcu-
lated according to (9), measures the precision of the forecast by lated according to Equation (13). Finally, Equation (14)
averaging the error between the predicted and real load, and is specifies the average time deviation, tdev, in min separating
selected due to its simple comprehensibility. While N depicts the real and predicted peak load. Variable slot corresponds to
the number of forecasts, loadt indicates the real charging load the respective 15‐min interval of the real and the predicted
at time t, and loâdt the predicted one. However, the MAE is peak load.
10 - UNTERLUGGAUER ET AL.
TABLE 4 Summary of the best approach after initial training
1 XD
pdev¼ jload − loâd j ð12Þ SCc REc PUc WOc
D dm dm Site
d¼1 horizon 4‐ts 96‐ts 4‐ts 96‐ts 4‐ts 96‐ts 4‐ts 96‐ts
Min. Val. lossa 5.29 8.94 2.83 8.35 2.02 9.79 1.85 6.95
1 XD ,
MAPE ¼ ðloaddm − loâddmÞ loaddm � 100 ð13Þ # Epochs 17 7 49 31 14 147 221 130D
d¼1 # Input ts 16 96 16 96 4 96 16 96
Modeb SL SL SL SL SL SL SL SL
1 XD Encodingc S/C OH S/C S/C S/C S/C S/C OH
tdev¼ jslotðloaddmÞ − slotðloâddmÞj � 15 ð14ÞD
d¼1 Abbreviations: PUc, public charging; REc, residential charging; SCc, shopping center
charging; WOc, workplace charging.
aAll MSE values (validation loss) are given in units of 10−3.
bSL = Stateless mode.
5 | RESULTS cS/C = Sine/Cosine encoding, OH = One hot encoding.
Having covered the methodology of this work, this chapter training, except for the 96‐ts REc forecast. Lastly, in most of
addresses the training, tuning, and prediction results. the cases, the learning rate of 0.001 yields the lowest loss.
5.1 | Initial training results 5.3 | Forecasting results
Table 4 illustrates the selected approach for each charging site In the following sections, the forecasting results are analysed
and forecast horizon based on the minimum validation loss both graphically and numerically.
obtained during initial training. In all cases the minimum loss is
recorded while training the LSTM with 128 units.
The small epoch number for most of the charging sites 5.3.1 | Graphical results
indicates that the LSTM suffers from early overfitting. More-
over, the different modes, encoding techniques, and input ts Figure 6 illustrates the forecasting results for the 36 days of
only yield small differences in the validation loss. However, in test data for each category of charging site for both the 4‐ts
all the cases the lowest MSE is obtained while using the (green) and 96‐ts (purple) prediction. While comparing the
stateless mode. The sine/cosine encoding shows a slightly outcomes with the real charging load (grey), it becomes
superior performance in most cases. Finally, using the same apparent that the 4‐ts prediction achieves superior results
amount of input ts yields the lowest MSE for the 96‐ts fore- compared to the 96‐ts forecast. The 4‐ts forecast yields a
cast. For the 4‐ts prediction, 16 input ts results in the lowest relatively precise picture of the real load curve in grey, with
validation loss for SCc, Rec, and WOc, whereas four input ts no major outliers to be seen.
achieve the best fit for PUc. Considering the 96‐ts prediction results, three short-
comings can be identified. First, the LSTM is not capable
of predicting the charging load on public holidays. The
5.2 | Hyperparameter tuning results projected load almost exclusively exceeds the real load by a
substantial margin and exhibits a course similar to that of
The maximum and minimum validation loss obtained during non‐holidays. Second, the LSTM is not able to accurately
tuning, and the chosen superior hp combination is given in predict the level of real peak load for most days, frequently
Table 5. The maximum and minimum validation loss differ exceeding the predicted peak load substantially. Finally, the
considerably. The highest MSE values are obtained for the 4‐ts LSTM struggles to anticipate the impact of holiday periods.
prediction while using a high dropout or learning rate. For the This is particularly evident for REc, where the forecast
96‐ts forecast, a small number of units lead to the highest significantly exceeds the reduced real charging load during
validation losses. While comparing the minimum loss between the winter holiday period.
random search and initial training, it is evident that the hp
tuning only yields improvements for half of the forecasts. The
lowest losses are obtained with a high number of units (64 or 5.3.2 | Numerical results
128). In most cases, an LSTM with only one hidden layer
seems sufficient. Only for the REc 96‐ts forecast and WOc 4‐ Table 6 summarises the numerical results. As previously evi-
ts prediction the minimum loss is recorded while using two denced during the graphical analysis, reducing the forecast
hidden layers. All batch sizes are applied for the different horizon from 96 ts to 4 ts considerably reduces the prediction
forecasts. Dropout generally does not have a positive effect on error. The MAE for REc, PUc, and WOc can be more than
UNTERLUGGAUER ET AL. - 11
TABLE 5 Summary of tuning results
and selected hyperparameters Site
SCc REc PUc WOc
horizon 4‐ts 96‐ts 4‐ts 96‐ts 4‐ts 96‐ts 4‐ts 96‐ts
Min. lossa 33.68 13.51 10.17 21.04 12.08 32.34 47.24 14.55
Max. lossa 5.33 8.78 2.83 8.00 2.03 9.05 1.59 7.25
Improvement No Yes No Yes No Yes Yes No
# Hidden layer 1 1 1 2 1 1 2 1
# Units 128 128 128 128 128 64 64 128
Batch size 32 64 32 16 32 96 96 32
Dropout [%] 0 0 0 20 0 0 0 0
Learning rate 0.001 0.01 0.001 0.001 0.001 0.0001 0.0001 0.001
Abbreviations: PUc, public charging; REc, residential charging; SCc, shopping center charging; WOc, workplace charging.
aAll MSE values are given in units of 10−3.
F I GURE 6 Forecasting results for the 36 days of test data illustrated for all categories of charging sites
halved in most of the cases. For SCc, the MAE is reduced in all Conversely, while using the NMAE2 as the evaluation crite-
cases by more than 2 kW. The lowest overall MAE for both the rion, varying results are visible depending on the type of day.
forecast horizons is recorded for REc. WOc generates the For weekdays the most accurate predictions are seen for WOc
second lowest MAE values, followed by PUc. The highest followed by PUc, Rec, and SCc. While REc yields the most
MAE values are seen for SCc. exact forecast for weekends, WOc, PUc, and SCc take the
A different picture emerges while looking at the NMAE1 second, third, and fourth place. For public holidays, REc ranks
and NMAE2 results. For the 4‐ts prediction, the lowest first again, followed by SCc, PUc, and WOc. Similar findings to
NMAE1 is obtained for PUc, followed by REc, WOc, and SCc. the NMAE1 results of the 4‐ts forecast can be seen for the
Conversely, using the NMAE2, the most precise results are NMAE1 results of the 96‐ts forecast. REc and PUc achieve the
seen for WOc, followed by PUc, Rec, and SCc. For the 96‐ts most precise scores in most of the cases. However, for public
forecast the lowest NMAE1 again is obtained for PUc and holidays, SCc outperforms PUc and comes in the second place,
REc, however, the least accurate prognosis is given for WOc behind REc. Looking at the NMAE2 results, the poorest
this time. In contrast, the lowest NMAE2 is seen for WOc, performance for weekdays can be observed for Rec, whereas
followed by SCc and PUc. REc ranks last. SCc yields the most accurate results. On weekends SCc ranks
While comparing the clustered NMAE1 results, it can be first again, and WOc ranks the last. For public holidays, WOc
seen that the 4‐ts forecast yields the most accurate forecast for performs particularly poor again. REc obtains the most
PUc and REc. SCc and WOc score the poorest results. favourable score.
12 - UNTERLUGGAUER ET AL.
Table 7 quantifies the LSTM’s difficulties in predicting absolute errors are seen for public holidays, although the time
peak loads with a forecast horizon of 96 ts in numerical terms. deviation is minimised. For PUc and WOc, according to the
The pdev, the MAPE, and the tdev are given for each charging MAPE, the most precise results are achieved during the week.
site. For all forecasts, the average deviation between true and For WOc predictions on weekends, the absolute error amounts
predicted peak load amounts to 29.16 kW for SCc, 6.37 kW for to 4.73 kW and the MAPE to 100% due to a constant pre-
REc, 14.17 kW for PUc, and 17.85 kW for WOc. The MAPE diction of 0 kW. Therefore, no deviation between the time of
amounts to 53.19% for SCc, 34.38% for REc, 37.16% for PUc, the true and predicted peak load can be calculated. The highest
and 128.97% for WOc. The average deviation between real and error scores between the real and predicted peak load are
predicted time of the peak load in minutes are 153, 88, 104, recorded on public holidays for both charging sites and the
and 285 for SCc, REc, PUc, and WOc, respectively. time deviation between true and predicted peak load amounts
Examining the scores grouped by weekdays, weekends, and to a multiple of the deviation during the week.
public holidays, several discrepancies between the charging
sites can be highlighted. For SCc, on weekends and public
holidays, the absolute and relative errors can be reduced rela- 6 | DISCUSSION
tive to weekdays. In addition, the time deviation between true
and predicted occurrence of the peak load is significantly In this section, the initial training, hp tuning, and forecasting
lowered as well. For REc, in contrast, the highest relative and results are discussed in more detail.
TABLE 6 Evaluation of forecasting results 6.1 | Initial training results
Charging site SCc REc PUc WOc Four key findings can be summarised. To begin with, the in-
horizon 4‐ts 96‐ts 4‐ts 96‐ts 4‐ts 96‐ts 4‐ts 96‐ts clusion of multivariate inputs shows a positive influence on the
training results in terms of validation loss. However, a detailed
MAEa (all) 4.35 6.78 1.53 3.37 2.7 5.84 1.85 5.35 analysis of the impact on the training results and forecasting
results was not part of the analysis. Since the training data used
NMAE1 (all) 0.352 0.549 0.178 0.391 0.166 0.359 0.204 0.59 in this work spans less than one year for each category of
NMAE2 (all) 0.041 0.065 0.038 0.083 0.032 0.068 0.021 0.06 charging site, the inclusion of multivariate inputs is selected to
MAEa (wd) 4.41 6.75 1.52 3.33 3.1 6.37 2.56 6.56 help the LSTM identify patterns in the data. With multiyear
data available, the importance of including multivariate inputs
NMAE1 (wd) 0.369 0.564 0.175 0.381 0.152 0.312 0.191 0.489 may decrease or become obsolete.
NMAE2 (wd) 0.042 0.064 0.038 0.082 0.036 0.075 0.029 0.073 Next, the early overfitting indicates that the complexity of
MAEa (we) 4.39 7.15 1.6 3.62 1.83 3.2 0.49 1.31 the model, relative to the size of the available data, is too high,
and due to changing load characteristics throughout the year,
NMAE1 (we) 0.329 0.536 0.189 0.428 0.208 0.363 0.377 1.00 the validation data is not fully representative for the entire
NMAE2 (we) 0.06 0.097 0.047 0.106 0.056 0.098 0.054 0.142 dataset. The access to charging data measured over a longer
MAEa (ph) 3.71 5.74 1.36 2.89 2.48 10.64 0.88 9.51 period is, thus, of great interest to enhance the performance of
the model in future.
NMAE1 (ph) 0.312 0.484 0.162 0.345 0.262 1.124 0.595 6.454 Furthermore, the different encoding techniques and input ts
NMAE2 (ph) 0.073 0.113 0.047 0.099 0.092 0.393 0.16 1.739 variants only cause negligible differences in the validation loss.
Thus, the extra time needed to implement and train the LSTM
Abbreviations: MAE, mean absolute error; NMAE, normalised mean absolute error;
ph = public holiday; PUc, public charging; REc, residential charging; SCc, shopping with various variants cannot be justified. Hence, the selection of
center charging; WOc, workplace charging; wd = weekday, we = weekend. the same number of input ts and encoding choice for all sites
aAll MAE values are given in kW
TABLE 7 Summary of the peak deviation, MAPE and time deviation results of the 96‐ts prediction
Charging site SCc REc PUc WOc
Pdev MAPE Tdev Pdev Tdev Pdev Tdev Pdev Tdev
metric (kW) (%) (min) (kW) MAPE (%) (min) (kW) MAPE (%) (min) (kW) MAPE (%) (min)
All days 29.16 53.19 153 6.37 34.38 88 14.17 37.16 104 17.85 128.97 285
Weekdays 30.96 53.65 187 6.24 34.71 86 13.75 23.50 52 20.54 35.50 35
Weekends 26.37 53.11 95 6.29 32.00 99 11.39 43.28 116 4.73 100.00 ‐
Public holidays 24.65 49.90 85 7.64 39.80 65 26.62 121.46 615 41.01 942.13 435
Abbreviations: MAPE, mean absolute percentage error; Pdev, average deviation between real and predicted peak load; PUc, public charging; REc, residential charging; SCc, shopping
center charging; Tdev, average time deviation of predicted peak load.
UNTERLUGGAUER ET AL. - 13
seems to be more rational. Whereas the sine/cosine encoding noticeably, but the entire load series shows strongly fluctu-
offers a viable encoding variant, 16 input ts for the 4‐ts and 96 ating load characteristics. Thus, the impact of public holidays
input ts for the 96‐ts prediction seem to be appropriate. remains small.
Lastly, the stateful mode does not provide benefits Similar findings can be seen while looking at the peak
compared to the stateless mode under the given conditions. deviation and MAPE results. While for SCc and Rec, the
However, when multi‐year data becomes available in the MAPE difference between weekdays and public holidays
future, the stateful mode may reveal its strengths, as seasonal amounts to only 3.75 and 5.09 percentage points, respec-
relationships can be identified by the LSTM. Consequently, tively, the figure for public and workplace charging increases
with the availability of data over several years, a further com- by 97.96 and 906.63 percentage points, respectively. Once
parison of the different modes is advisable. again, the discrepancies can be explained by the impact of
public holidays on the load pattern. Analysing the weekday
results, the significant discrepancies in the MAPE and time
6.2 | Hyperparameter tuning results deviation, likewise, indicate that the characteristics of the
load profile decisively impact the accuracy of the forecast.
Three relevant conclusions can be drawn. First, the significant The highly irregular load profile for SCc results in the
variation between the minimum and maximum loss during poorest accuracy for both, the predicted peak load level and
tuning reveals that the different hp combinations exert a major the time of the peak load. WOc, on the other hand, exhibits
impact on the generalisability of the LSTM and, ultimately, on the steadiest pattern concerning the time of the peak load,
its predictive power. Thus, the importance of hp tuning for which is why the average time deviation is the lowest at
selecting the most suitable LSTM model is outlined. 35 min.
Moreover, the varying combinations of hp exert a funda-
mentally different impact on the validation loss for each
charging site category and forecast horizon. Thus, it is essential 6.3.2 | Overall findings
to perform the hp tuning separately for each use case.
Finally, random search shows only minor or no enhance- The superior performance of the 4‐ts prediction compared to
ments compared with the initial training. This might be due to the 96‐ts forecast is attributable to two factors. Due to the
the limited amount of evaluations and search space. A higher shorter forecast horizon, the preceding load values are shown
number of executions, the inclusion of other hp in the search to the LSTM more frequently and assist the LSTM in mapping
space, or the use of a more subtle method, like the bayesian hp the height and course of the load more precisely. Moreover, the
tuning, could lead to further improvements. poor prediction results for the 96‐ts forecast might be traced
back to the strongly altered load profile of the test data, caused
by the increase of charging load throughout the year and the
6.3 | Forecasting results winter holiday period. Therefore, the training and validation
data might have not been fully representative of the test data,
The results of the forecast are discussed below, followed by limiting the LSTM’s predictive power.
implications for the practical application of the forecast.
6.3.3 | Implications for aggregators and network
6.3.1 | Differences between the different operators
categories of charging sites
There are two courses of action on how aggregators and
The comparison of the forecasting results of the different network operators can implement the LSTM to achieve opti-
charging sites revealed that the evaluation is highly dependent mised forecasts for their respective use case. First, it is bene-
on the metric. However, SCc almost solely yields the poorest ficial to limit the load forecast to weekdays. Weekends and
results for the 4‐ts prediction, indicating that the fluctuating public holidays are accompanied by changes in the user
load hinders the predictive power of the LSTM. behaviour that are difficult to predict and exhibit a much lower
Further, evident discrepancies involve the varying results aggregation potential and risk of bottlenecks due to the
for weekdays, weekends, and public holidays. While the reduced charging load. By focussing the forecast on weekdays,
lowest NMAE1 is obtained for SCc and REc on public a higher prediction accuracy can be obtained.
holidays, the public holiday load predictions for PUc and Second, the forecast should only be carried out for the
WOc are significantly less accurate than the predictions on periods of a day with the highest charging load, where the
weekdays and weekends. This finding is attributable to the aggregation potential is the highest, and bottlenecks in the
distinct load profile characteristics rather than to the different distribution network are most likely. By shortening the forecast
forecasting ability of the LSTM. While the load for PUc and period, the accuracy is improved as shown in this work. The
WOc shifts radically on public holidays, the change for REc forecast will be further enhanced by focussing the LSTM on a
is less pronounced. With SCc, the load profile also varies specific period of the day.
14 - UNTERLUGGAUER ET AL.
7 | CONCLUSIONS AND FUTURE Open access funding enabled and organized by Projekt
WORK DEAL.
This study proposes a novel multivariate long short‐term ORCID
memory approach for multi‐step EV charging load fore- Tim Unterluggauer https://orcid.org/0000-0002-6461-
casting with two different prediction horizons. The perfor- 7291
mance of the forecasting approach is evaluated and compared Kalle Rauma https://orcid.org/0000-0002-5553-8751
between four different categories of charging sites. The results
show that the distinct characteristics of the different charging REFERENCES
sites influence the predictability of the charging load, but that 1. Greenhouse Gas Emissions From Transport in Europe. https://www.
the evaluation is highly dependent on the chosen metric. It is eea.europa.eu/data‐and‐maps/indicators/transport‐emissions‐of‐greenho
also demonstrated that with an increasing forecast horizon, the use‐gases/transport‐emissions‐of‐greenhouse‐gases‐12. (2020). Accessed
accuracy diminishes as well. Reducing the forecast horizon 10 Dec 2020
from 96 to four time steps, the MAE is more than halved in 2. Diana, N., Sivakumar, A., Polak, J.W.: Modelling electric vehicles use: a
most cases, and amounts to 4.35 kW for shopping centre, survey on the methods. Renew. Sust. Energ. Rev. 68, 447–460 (2017)3. Wu, X., et al.: Stochastic optimal energy management of smart home with
1.53 kW for residential, 2.7 kW for public, and 1.85 kW for PEV energy storage. IEEE Trans. Smart Grid. 9(3), 2065–2075 (2018)
workplace charging. In general, the forecasting accuracy tends 4. Toquica, D., De Oliveira‐De Jesus, P.M., Cadena, A.I.: Power market
to be the best at residential and workplace charging sites. The equilibrium considering an EV storage aggregator exposed to marginal
weakest accuracy is found at shopping centres. prices ‐ a bilevel optimization approach. J. Energy Storage. 28, 101267
The findings of this work benefit mainly two stakeholders. (2020)5. Wang, D., et al.: Integrated energy exchange scheduling for multi-
Aggregators have great interest in a reliable load forecast to sell microgrid system with electric vehicles. IEEE Trans. Smart Grid. 7(4),
EV flexibilities to the energy market. Network operators, in 1762–1774 (2016)
contrast, are keen on forecasting the charging load to identify 6. Xia, S., et al.: An autonomous real‐time charging strategy for plug‐in
possible bottlenecks in the distribution network. To apply the electric vehicles to regulate frequency of distribution system with fluc-
proposed LSTM model most effectively for both use cases, tuating wind generation. IEEE Trans. Sustain. Energy. 9(2), 511–524(2018)
future studies should consider two aspects. 7. Appino, R.R., et al.: Reliable dispatch of renewable generation via
To address the outlined weaknesses of the 96‐time step charging of time‐varying PEV populations. IEEE Trans. Power Syst.
forecast, the LSTMwill be trained on data collected over a longer 34(2), 1558–1568 (2019)
time period to ensure that the training and validation data is truly 8. Janjic, A., et al.: Commercial electric vehicle fleet scheduling for sec-
representative of the test data. With data available over several ondary frequency control. Elec. Power Syst. Res. 147, 31–41 (2017)9. Knezovic, K., et al.: Enhancing the role of electric vehicles in the power
years, the LSTM will be extended by further indicators, such as grid: field validation of multiple ancillary services. IEEE Trans. Transp.
the month, the stateful mode will be investigated again, and time Electrific. 3(1), 201–209 (2017)
series cross validation will be performed to increase the 10. Zeraati, M., Hamedani Golshan, M.E., Guerrero, J.M.: A consensus‐
robustness of the model and overcome early overfitting. based cooperative control of PEV battery and PV active power curtail-
Furthermore, a future research objective involves the ment for voltage regulation in distribution networks. IEEE Trans. SmartGrid. 10(1), 670–680 (2019)
development of a customised LSTM, tailored to each charging 11. Asrari, A., et al.: A market framework for decentralised congestion
site. The forecast will be targeted to weekdays, since user management in smart distribution grids considering collaboration among
behaviour on weekends and holidays is often difficult to pre- electric vehicle aggregators. IEEE Trans. Smart Grid. 11(2), 1147–1158
dict and, in many cases, as with workplace charging, the ag- (2020)
gregation potential is not sufficient enough. Additionally, the 12. Liang, H., et al.: Dynamic economic/emission dispatch including PEVsfor peak shaving and valley filling. IEEE Trans. Ind. Electron. 66(4),
prediction will be narrowed down to the most suitable time 2880–2890 (2019)
periods of the day at each charging site when a sufficient 13. Duan, Z., Gutierrez, B., Wang, L.: Forecasting plug‐in electric vehicle
volume of EVs is available for pooling the EV flexibility and sales and the diurnal recharging load curve. IEEE Trans. Smart Grid.
bottlenecks caused by simultaneous charging, such as 4:00 PM 5(1), 527–535 (2014)
to 8:00 PM for residential charging, 8:00 AM to 12:00 PM for 14. Zhang, H., et al.: Evaluation of achievable vehicle‐to‐grid capacityusing aggregate PEV model. IEEE Trans. Power Syst. 32(1), 784–794
public charging, or 6:00 AM to 10:00 AM for workplace (2017)
charging. 15. Pertl, M., et al.: An equivalent time‐variant storage model to harness EV
flexibility: forecast and aggregation. IEEE Trans. Ind. Inf. 15(4),
ACKNOWLEDGMENTS 1899–1910 (2019)
The authors would like to thank IGL Technologies Oy and 16. Amini, M.H., Kargarian, A., Karabasoglu, O.: ARIMA‐based decoupledtime series forecasting of electric vehicle charging demand for stochastic
Parking Energy Ltd for providing the charging data for this power system operation. Elec. Power Syst. Res. 140, 378–390 (2016)
work. Kalle Rauma would like to thank the support of the 17. Korolko, N., Sahinoglu, Z., Nikovski, D.: Modelling and forecasting self‐
German Federal Ministry of Transport and Digital Infra- similar power load due to EV fast chargers. IEEE Trans. Smart Grid.
structure through the project PuLS – Parken und Laden in der 7(3), 1620–1629 (2016)
Stadt (03EMF0203 B). Tim Unterluggauer and Pertti Järven- 18. Louie, H.M.: Time‐series modelling of aggregated electric vehicle chargingstation load. Elec. Power Compon. Syst. 45(14), 1498–1511 (2017)
tausta would like to thank the support of the project 19. Buzna, L., et al.: Electric vehicle load forecasting: a comparison between
Energiaratkaisut. time series and machine learning approaches. In: 2019 1st International
UNTERLUGGAUER ET AL. - 15
Conference on Energy Transition in the Mediterranean Area (SyNERGY 32. Li, Y., Huang, Y., Zhang, M.: Short‐term load forecasting for electric
MED), pp. 1–5. Cagliary (2019). May vehicle charging station based on niche immunity lion algorithm and
20. Majidpour, M., et al.: Forecasting the EV charging load based on customer convolutional neural network. Energies. 11(5), 1–18 (2018)
profile or station measurement? Appl. Energy. 163, 134–141 (2016) 33. Gerossier, A., Girard, R., Kariniotakis, G.: Modelling and fore-
21. Lu, Y., et al.: The application of improved random forest algorithm casting electric vehicle consumption profiles. Energies. 12(7), 1–14
on the prediction of electric vehicle charging load. Energies. 11(11), (2019)
1–16 (2018) 34. Zhang, X., et al.: Deep‐ learning‐based probabilistic forecasting of
22. Jahangir, H., et al.: Charging demand of plug‐in electric vehicles: fore- electric vehicle charging load with a novel queuing model. IEEE Trans.
casting travel behaviour based on a novel rough artificial neural network Cybernetics. 51(6), 3157–3170 (2020)
approach. J. Clean. Prod. 229, 1029–1044 (2019) 35. Bianchi, F.M., et al.: Recurrent Neural Networks for Short‐Term Load
23. Dabbaghjamanesh, M., Moeini, A., Kavousi‐Fard, A.: Reinforcement Forecasting: An Overview and Comparative Analysis. Springer, Berlin
learning‐based load forecasting of electric vehicle charging station using (2017)
Q‐learning technique. IEEE Trans. Ind. Inf. 17(6), 4229–4237 (2021) 36. Qing, X., Niu, Y.: Hourly day‐ahead solar irradiance prediction using
24. Zhu, J., et al.: Short‐term load forecasting for electric vehicle weather forecasts by LSTM. Energy. 148, 461–468 (2018)
charging stations based on deep learning approaches. Appl. Sci. 9(9), 37. Kong, W., et al.: Short‐term residential load forecasting based on LSTM
1–12 (2019) recurrent neural network. IEEE Trans. Smart Grid. 10(1), 841–851
25. Zhu, J., et al.: Electric vehicle charging load forecasting: a comparative (2019)
study of deep learning approaches. Energies. 12(14) (2019) 38. Potdar, K., Pardawala, S.T., Pai, D.C.: A comparative study of categorical
26. Zhu, J., et al.: A novel LSTM based deep learning approach for multi‐ variable encoding techniques for neural network classifiers. Ijca. 175(4),
time scale electric vehicles charging load prediction, In: IEEE Innova- 7–9 (2017)
tive Smart Grid Technologies ‐ Asia (ISGT Asia), pp. 3531–3536. 39. Hernandez, L., et al.: Short‐term load forecasting for microgrids
Chengdu (2019) based on artificial neural networks. Energies. 6(3), 1385–1408
27. Gao, Q., et al.: Charging load forecasting of electric vehicle based on (2013)
Monte Carlo and deep Learning. IEEE Sustainable Power and Energy 40. Bergstra, J., Bengio, Y.: Random search for hyper‐parameter optimiza-
Conference (iSPEC), pp. 1309–1314, Beijing (2019) tion. J. Mach. Learn. Res. 13(1), 281–305 (2012)
28. Jahangir, H., et al.: Plug‐in electric vehicle behaviour modelling in energy
market: a novel deep learning‐based approach with clustering technique.
IEEE Trans. Smart Grid. 11(6), 4738–4748 (2020)
29. Xue, M., et al.: Research on load forecasting of charging station based on
XGBoost and LSTM model. J. Phys. Conf. 1757(1), 1–10 (2021) How to cite this article: Unterluggauer T., et al.: Short‐
30. Na, Z., et al.: Short‐term load forecasting algorithm based on LSTM‐ term load forecasting at electric vehicle charging sites
DBN considering the flexibility of electric vehicle. IOP Conf. Ser. using a multivariate multi‐step long short‐term memory:
Earth Environ. Sci. 546(4), 1–8 (2020) A case study from Finland. IET Electr. Syst. Transp.
31. Zhang, X.: Short‐term load forecasting for electric bus charging stations
based on fuzzy clustering and least squares support vector machine 1–15 (2021). https://doi.org/10.1049/els2.12028
optimised by wolf pack algorithm. Energies. 11(6), 1–18 (2018)