Water: Development of A Deep Learning-Based Prediction Model For Water Consumption at The Household Level
Water: Development of A Deep Learning-Based Prediction Model For Water Consumption at The Household Level
Water: Development of A Deep Learning-Based Prediction Model For Water Consumption at The Household Level
Article
Development of a Deep Learning-Based Prediction Model for
Water Consumption at the Household Level
Jongsung Kim 1 , Haneul Lee 1 , Myungjin Lee 1 , Heechan Han 2 , Donghyun Kim 3 and Hung Soo Kim 3, *
1 Institute of Water Resources System, Inha University, Incheon 22201, Korea; kjjs0308@naver.com (J.K.);
haneul6802@naver.com (H.L.); lmj3544@naver.com (M.L.)
2 Blackland Research and Extension Center, Texas A&M AgriLife, Temple, TX 76502, USA;
heechan.han@ag.tamu.edu
3 Department of Civil Engineering, Inha University, Incheon 22201, Korea; yesdktpdi@naver.com
* Correspondence: sookim@inha.ac.kr; Tel.: +82-10-3441-1038
Abstract: The importance of efficient water resource supply has been acknowledged, and it is essential
to predict short-term water consumption in the future. Recently, it has become possible to obtain data
on water consumption at the household level through smart water meters. The pattern of these data is
nonlinear due to various factors related to human activities, such as holidays and weather. However,
it is difficult to accurately predict household water consumption with a nonlinear pattern with the
autoregressive integrated moving average (ARIMA) model, a traditional time series prediction model.
Thus, this study used a deep learning-based long short-term memory (LSTM) approach to develop
a water consumption prediction model for each customer. The proposed model considers several
variables to learn nonlinear water consumption patterns. We developed an ARIMA model and an
LSTM model in the training dataset for customers with four different water-use types (detached
houses, apartment, restaurant, and elementary school). The performances of the two models were
evaluated using a test dataset that was not used for model learning. The LSTM model outperformed
the ARIMA model in all households (correlation coefficient: mean 89% and root mean square error:
mean 5.60 m3 ). Therefore, it is expected that the proposed model can predict customer-specific water
Citation: Kim, J.; Lee, H.; Lee, M.;
consumption at the household level depending on the type of use.
Han, H.; Kim, D.; Kim, H.S.
Development of a Deep
Keywords: ARIMA model; household-level; LSTM model; water consumption prediction
Learning-Based Prediction Model for
Water Consumption at the
Household Level. Water 2022, 14,
1512. https://doi.org/10.3390/
w14091512 1. Introduction
noted that accurate short-term water demand prediction is essential for water managers
to make efficient decisions. They proposed the harmony search optimization algorithm
based on the autoregressive integrated moving average (ARIMA) model to improve water
demand prediction performance. Salah et al. [10] proposed a method for removing noise
from water consumption data and used the noise-removed water consumption data and
an AR model to predict monthly water demand. Furthermore, Haiyan et al. [11] predicted
annual water demand for adequate water supply in Beijing, experiencing a water shortage
as a result of rapid urban growth. The uncertain autoregressive (UAR) model, which can
consider uncertainty in the AR model, was evaluated by comparing its water demand
prediction performance to that of the traditional AR model. Traditional time series analysis
models, such as AR and ARIMA models, have a limitation in that they cannot predict
nonlinear patterns as they are based on linear patterns of past data.
With the recent development of computing technology, machine learning and deep
learning are often used to predict nonlinear data in the hydrology field [12–14]. The
machine learning and deep learning models, such as random forest (RF) and long short-
term memory (LSTM), which can reflect the nonlinear time series characteristics of water
consumption, are being used for predicting water consumption [15–18]. For example,
Bougadis et al. [19] highlighted that it is necessary to predict future water demand and
expand water supply infrastructures to build an optimal water distribution facility based on
accurate water consumption prediction. They evaluated the artificial neural network (ANN)
and ARIMA models for predicting potential water demand and discovered that the ANN-
based prediction model outperformed the ARIMA model. Furthermore, Manuel et al. [20]
stated that it is crucial to predict the water consumption of consumers to provide an
efficient water supply, and they used ANN, RF, and support vector regression (SVR)
algorithms to predict the water consumption of cities in southeastern Spain. SVR had the
best predictive performance among the three machine learning algorithms. Mohammed
et al. [21] stated that governments must plan water distribution to plan for sustainable
development. Moreover, they used an ANN model to predict future water demand and
used various optimization techniques, such as Levenberg–Marquardt (LM) and genetic
algorithms (GA), to develop an optimal model. Li et al. [22] predicted daily water demand
in Hefei, China, by applying various data-driven models, such as LSTM, SVR, and RF
models. The SVR model had a mean absolute percentage error (MAPE) value of 7.66%,
the RF model had a MAPE value of 2.64%, and the LSTM model demonstrated the best
performance with a MAPE value of 1.36%. External factors must be considered to improve
water consumption prediction as water consumption is influenced by nonlinear patterns
such as holidays, weekdays, weather features, and human activities [23–26]. However,
most previous studies have limitations in that they only used past water consumption data
without considering external factors.
Bakker et al. [27] stated that predicting water consumption with only a single predic-
tor can produce accurate results, but prediction errors can occur significantly depending
on weather conditions. They used a multiple regression model that considers various
meteorological variables to predict water consumption in urban areas. They discovered
that prediction errors can be reduced by up to 11% when multiple meteorological vari-
ables are considered. Austin et al. [28] used a multiple regression model that considers
demographics and weather conditions to predict water consumption in Seattle, Wash-
ington. Adam et al. [29] used ANN and multiple linear regression (MLR) models that
consider historical water consumption data, humidity, and daily variables to predict daily
water consumption in Torun, Poland. They discovered that the MAPE for the MLR model
was 2.56%, while that for the ANN model was 2.28%, indicating that the ANN model
performed better. Previous studies have revealed that it is essential to consider external
factors such as weather conditions and previous water consumption data for accurate water
demand prediction.
Smart technology has recently been introduced in the water supply field as a result of
the advancement in information and communication technology and the fourth industrial
Water 2022, 14, 1512 3 of 17
revolution, and internet of things (IoT) devices such as smart water meters are being used
for efficient water management [30–33]. Since data for individual users are collected, water
leaks and errors can be easily assessed, and various services can be provided by analyzing
water consumption patterns for each user. In the United Kingdom, Asset Management
Plan 6 was launched to solve a serious leak problem, and the smart water meter project
was promoted for efficient water management [34]. Maria et al. [35] used household water
consumption data measured by a smart water meter to predict future water consumption
in the United Kingdom. In addition, the effect of weather conditions on household water
consumption was analyzed. Xenochristou et al. [36] predicted water consumption for one
to seven days based on household water consumption data collected at 30-min intervals in
the southwestern part of the United Kingdom. The characteristics of 600 households were
identified and divided into five groups, and a gradient boosting machine (GBM) model
was used to predict the water consumption of each group.
The smart water grid (SWG) project has been promoted in Incheon Metropolitan City,
South Korea, and smart water meters have been installed in urban areas to manage water
to solve the water shortage problem [37]. Choi and Kim [38] used a multiple regression
model, a multilayer perceptron model, and an LSTM model to predict future water con-
sumption based on hourly household and commercial water consumption data collected
through smart water meters. They discovered that the LSTM model has better predictive
performance than the other models. Various studies on smart water meters are being
expanded to establish an efficient water supply plan and provide various water-related
services. However, in South Korea, there are still insufficient studies on smart water meters
to predict consumer-level water consumption.
Therefore, this study aimed to develop a model for predicting consumer-level water
consumption using data from a smart water meter installed in Yeongjong Island, Incheon,
South Korea. In this study we used an LSTM-based model, to predict water consumption
prediction at the household level, that is suitable for time series analysis among deep
learning models and can consider external factors. We evaluated its predictive performance
by comparing it to the ARIMA model, which is a traditional stochastic model.
Figure1.1.The
Figure Thelocation
locationof
ofthe
thestudy
studyarea
areaand
andsmart
smartwater
watermeter.
meter.
2.2.
2.2.Data
DataDescription
Description
AAsmart
smartwater
watermeter
metercan
cantransmit
transmitand
andreceive
receivereal-time
real-timewater
water consumption
consumption and water
and wa-
quality data, which can be used to confirm water consumption fees, leakage,
ter quality data, which can be used to confirm water consumption fees, leakage, and ad- and additional
details
ditionalfordetails
each customer.
for each customer.
AA smartwater
smart watermeter
meterhas
hasbeen
beeninstalled
installedininthis
thisstudy
studyarea
areasince
sinceJanuary
January2017
2017asaspart
part
of
of the SWG project,
the SWG project,andandconsumer
consumerwater
water consumption
consumption data
data havehave
since since
beenbeen collected.
collected. This
This
study study
usedused a representative
a representative household
household for water-use
for each each water-use
type astypethe as the sensor,
target target sen-
and
sor,
water consumption data were collected from January 2017 to December 2019. Table 12019.
and water consumption data were collected from January 2017 to December sum-
Table
marizes1 summarizes the water consumption
the water consumption data for thedata forsensor
target the target
based sensor based ontype.
on water-use water-use
Rows
type. Rows 1 and 2 contain descriptions of the type of use, while rows 3–5
1 and 2 contain descriptions of the type of use, while rows 3–5 contain water consumptioncontain water
consumption
statistics. statistics.
Table 1. Water consumption data for each water-use type.
Table 1. Water consumption data for each water-use type.
Description Type A Type B Type C Type D
Description Type A Type B Type C Type D
Use type Residential Residential Commercial Public
Use type Residential Residential Commercial Public
Detached house Apartment
Detail information Detached house Apartment Restaurant Elementary
Elementary school
(1 household)
Detail information (366 households) Restaurant
(1 household) (366 households) school
water consumption range (m3 ) 0.5–2.8 23.8–214.8 0.78–23.7 0–55.9
water consumption
Mean (m3 ) 1.38 0.5–2.8
153.9 23.8–214.810.96 0.78–23.7 0–55.9
24.79
range (m3)
Standard deviation (m3 ) Mean 0.42
(m3) 1.3830.13 153.9 2.48 10.96 9.9524.79
Standard
Type A represents 0.42 use (detached
residential 30.13 2.48 consumption9.95
house), and water is low as
deviation (m3)
the number of users is lower than that for other types. Type B represents residential use
(apartment), with the largest number of users and the highest water consumption. Type C
represents commercial use (restaurant), and it has the second-lowest water consumption
level. Type D represents public use (elementary school) and has the second-highest water
consumption level.
Type A represents residential use (detached house), and water consumption is low
as the number of users is lower than that for other types. Type B represents residential use
(apartment), with the largest number of users and the highest water consumption. Type
Water 2022, 14, 1512 C represents commercial use (restaurant), and it has the second-lowest water consump- 5 of 17
tion level. Type D represents public use (elementary school) and has the second-highest
water consumption level.
Figure 2 shows the weekly water consumption pattern for each water-use type. Types
Figure 2 shows the weekly water consumption pattern for each water-use type. Types
A, B, and C had similar patterns, and their water consumption was high on weekends and
A, B, and C had similar patterns, and their water consumption was high on weekends
low on weekdays. However, the water consumption pattern of Type D was low on week-
and low on weekdays. However, the water consumption pattern of Type D was low on
ends and high on weekdays.
weekends and high on weekdays.
using past data and the periodicity of the data. However, the LSTM model was trained by
considering both the periodicity of the data and external variables such as weather and
weekend information. In the model evaluation phase, the prediction results of the two
Water 2022, 14, x FOR PEER REVIEWmodels were evaluated for observed data, and the best-performing model was chosen 6 of as
18
the final model. The root mean square error (RMSE) and correlation coefficient (CC) were
used as the evaluation metrics in this case.
Figure3.3.Monthly
Figure Monthlypatterns
patternsofofthree meteorological
three data
meteorological (rainfall,
data relative
(rainfall, humidity,
relative and temperature).
humidity, and tempera-
ture).
The moving average (MA) model predicts output variables using the error term (𝜀 )
of its previous values. This model assumes time series data as stationary, similar to the
AR model, and it is expressed as Equation (2). Here, q is the lag time of the MA model, 𝜇
Water 2022, 14, 1512 7 of 17
The moving average (MA) model predicts output variables using the error term (ε t ) of
its previous values. This model assumes time series data as stationary, similar to the AR
model, and it is expressed as Equation (2). Here, q is the lag time of the MA model, µ is the
mean of the series, and θ is the MA coefficient [42–44].
q
y t = µ + ε t + θ 1 ε t −1 + θ 2 ε t −2 + · · · + θ q ε t − q = µ + ∑ θ i ε t − i + ε t (2)
i =1
Box et al. [48] proposed an ARIMA model that can be used with nonstationary time
series data. The nonstationary time series is converted into a stationary time series using
the differencing. Here, the differencing is to make the average change in the time series
constant through the difference in continuous observations. The model is usually referred
to as the ARIMA (p, d, q) model and is expressed as Equation (4), where B is the backward
shift operator, ∆ is the differences, and d is the parameter of the differences.
𝐶 𝑓 ∙𝐶 𝑖 ∙𝐶 , (8)
𝑜 𝜎 𝑊 ∙ ℎ ,𝑥 𝑏 , (9)
Water 2022, 14, 1512 8 of 17
ℎ 𝑜 ∙ tanh 𝐶 . (10)
Figure
Figure 5.
5. Conceptual
Conceptual diagram
diagram of
of the
the LSTM.
LSTM.
2.4. Evaluation
2.4. Evaluation Metrics
Metrics
The RMSE
The RMSE andandCC CCwere
wereusedusedto to
evaluate thethe
evaluate performance
performanceof each model
of each in thisinstudy.
model this
The RMSE
study. is usedistoused
The RMSE indicate representative
to indicate errors in
representative predicted
errors and observed
in predicted values. val-
and observed The
lower the RMSE, the better the performance, and it can be expressed as Equation
ues. The lower the RMSE, the better the performance, and it can be expressed as Equation (11),
where
(11), n is the
where number
n is of data,ofyidata,
the number is the𝑦observed value, and
is the observed ŷi is the
value, andpredicted
𝑦 is thevalue.
predicted
value. r
1 n
RMSE =
n ∑i=i (yi − ŷi )2 . (11)
The CC measures the linear correlation between two datasets (predicted value and
observed value). It is a ratio between the covariance of two datasets, and it is essentially
a normalized measurement of the covariance, with the result always ranging between
−1 and 1. A CC closer to −1 indicates a negative correlation, and a CC closer to +1
indicates a positive correlation. It is expressed as Equation (12), where y is the mean of the
observed value and ŷ is the mean of the predicted value.
∑(yi − y) ŷi − ŷ
CC = q . (12)
2 2
∑ (yi − y) ∑ (ŷi − ŷ)
3. Results
3.1. Application of the ARIMA Model
An autocorrelation function (ACF) analysis was conducted before training the ARIMA
model (Figure 6). The autocorrelation decreased as the lag time increased, but it increased
again when the lag time was seven or eight days. This result indicates that the dataset had
a seven-day or eight-day cycle.
The ARIMA model consists of three parameters: an AR model parameter (p), an MA
model parameter (q), and a differential parameter (d). In this study, the sensitivity of the
parameters was analyzed to build the best ARIMA model for each type. The p and q
parameters were considered from zero to eight with reference to the ACF results, and the
d parameter was considered from zero to one. A total of 162 (8( p) × 8(q) × 2(d)) ARIMA
models were developed according to parameter combination, and the ARIMA model with
the lowest RMSE was selected. Table 2 summarizes the parameters of the optimal ARIMA
model for each type. For Types A and D, the p and q parameters were set at seven, while
the d parameter d was set at zero. For Types B and C, the p and q parameters were set at
eight, while the d parameter d was set at zero.
3. Results
3.1. Application of the ARIMA Model
An autocorrelation function (ACF) analysis was conducted before training the
ARIMA model (Figure 6). The autocorrelation decreased as the lag time increased, but it
Water 2022, 14, 1512 9 of 17
increased again when the lag time was seven or eight days. This result indicates that the
dataset had a seven-day or eight-day cycle.
Figure 7 shows the prediction results of the ARIMA model in the training dataset as a
time series. The solid blue lines represent the time series of the observed water consumption.
At the same time, the red dashed lines represent the time series of the predicted water
consumption by the ARIMA model. The solid blue lines and the red dashed lines exhibit
almost similar results. Type D had the best performance in terms of performance by type,
while Type B had the worst performance. This is due to Type D (school) having a relatively
constant pattern, while Type B (apartment) has a complex pattern (see Figure 7).
Figure 7 shows the prediction results of the ARIMA model in the training dataset as
a time series. The solid blue lines represent the time series of the observed water consump-
tion. At the same time, the red dashed lines represent the time series of the predicted water
consumption by the ARIMA model. The solid blue lines and the red dashed lines exhibit
almost similar results. Type D had the best performance in terms of performance by type,
Water 2022, 14, 1512 10 of 17
while Type B had the worst performance. This is due to Type D (school) having a rela-
tively constant pattern, while Type B (apartment) has a complex pattern (see Figure 7).
Figure 7.
Figure Time series
7. Time series of
of the
the predicted
predicted and
and observed
observed values
values by
by the
the ARIMA
ARIMA model (training dataset).
Overall, the performance of the ARIMA model in the training dataset was excellent,
indicating that it can make similar predictions for the observed water consumption.
A sensitivity analysis was also performed on the LSTM model to derive the optimal
parameters. The parameters of the LSTM model mainly deal with “units,” which refers
to the number of chains, “batch size”, which refers to the number of data extracted for
Water 2022, 14, 1512 11 of 17
learning, and “epoch”, which refers to repetitive learning [12]. Here, the epoch and batch
size are determined to repeat until they maximize learning performance according to
the number of data, and units are determined to accommodate the information to the
maximum according to the number of explanatory variables. In this study, the parameters
were considered 6, 12, 24, and 36 for the units; 12, 36, 72, and 144 for the batch size; and
20, 30, 50, and 100 for the epoch; and grid search was performed for this list. In addition,
activation function was considered as “Tanh”, and “dropout layer” was considered as 45%
to minimize overfitting problem. On the other hand, the other parameters such as learning
rate, momentum were set to their default values. The LSTM models were developed using
64 parameter combinations for each type, and the LSTM model with the best performance
was selected. Table 5 shows the parameter combinations for the top four models. The batch
size and epoch in the top four models are 12 and 100, respectively.
Table 5. Parameter combinations for the top four models.
Figure 8 shows the loss graph for each type using the top four models. Here, the red
line and points represent model 1, the blue line and points represent model 2, the yellow
line and points
line and points represent
represent model
model 3, and the
3, and the green
green line
line and
and points
points represent
represent model
model 4.
4. The
The
x-axis represents the epoch, and the y-axis represents the loss per epoch. When
x-axis represents the epoch, and the y-axis represents the loss per epoch. When the LSTM the LSTM
models
models were trained, the
were trained, the RMSE
RMSE was was used
used as the loss
as the loss metric,
metric, and
and the
the target
target variable
variable was
was
normalized. Thus, the
normalized. Thus, theloss
lossunit
unitinin Figure
Figure 8 differs
8 differs fromfrom
the the original
original consumption
consumption data data
unit.
unit.
In addition, we used 20% of the batch size as a validation to prevent overfitting, andand
In addition, we used 20% of the batch size as a validation to prevent overfitting, the
the
lossloss presented
presented in Figure
in Figure 8 is validation
8 is the the validation
loss.loss.
Figure 8. Loss graph for each type using the top four models.
Figure 8 shows that as the epoch increased, the loss graph for all types decreased and
converged, and the loss trend did not increase again, indicating that the overfitting prob-
lem did not occur. Thus, the loss was the smallest when Type A was model 3, Type B was
model 3, Type C was model 2, and Type D was model 4. The optimal LSTM model for
each type was determined through this result. Table 6 shows the performance of the opti-
Water 2022, 14, 1512 12 of 17
Figure 8 shows that as the epoch increased, the loss graph for all types decreased and
converged, and the loss trend did not increase again, indicating that the overfitting problem
did not occur. Thus, the loss was the smallest when Type A was model 3, Type B was model
3, Type C was model 2, and Type D was model 4. The optimal LSTM model for each type
was determined through this result. Table 6 shows the performance of the optimal LSTM
model in the training dataset, and Figure 9 shows the time series of the LSTM model’s
prediction results. The average correlation for the four types was 89%, and the average
RMSE was calculated to be 5.6 m3 . It can also be seen that Type D has the highest correlation
and Type B has the lowest correlation. All the LSTM models had a correlation of 80% or
higher in the training dataset. Thus, the LSTM models were well-trained well for observing
water consumption data.
Table 6. Performance of the optimal LSTM model (training dataset).
Figure 9.
Figure Time series
9. Time series of
of the
the predicted
predicted and
and observed
observed values
values of
of the
the LSTM model (training dataset).
blue line represents the observed water consumption time series, the red dashed line
represents the predicted time series of the LSTM model, and the solid green line represents
the predicted time series of the ARIMA model. The red dashed line tends to be similar to
the solid blue line, but the solid green line tends to be more underestimated than the solid
blue line. In addition, the red dashed line appears to be more similar to the solid blue line
than the green line. Table 7 summarizes the performance evaluation of each model in the
test dataset. The ARIMA model had an average correlation of 62% and an average RMSE
of 8.91 m3 . The correlation decreased by about 31%, and the RMSE increased by 4.48 m3 in
comparison to the results in the training dataset. Compared to that in the training dataset,
the performance of the ARIMA model in the test dataset decreased due to overfitting.
Water 2022, 14, x FOR PEER REVIEW However, the LSTM model had an average correlation of 89% and an average RMSE 14 of of 18
5.60 m3 . These results were similar to the performance in the training dataset. The LSTM
model had a better correlation than the ARIMA model by an average of 27%.
Figure 10.10.
Figure Time series
Time seriesofofthe
thepredicted andobserved
predicted and observed values
values in the
in the testtest dataset
dataset for each
for each model.
model.
ARIMA LSTM
Type Correlation RMSE Correlation RMSE
A 65.81% 0.36 92.70% 0.19
B 55.42% 26.37 82.96% 17.58
C 56.42% 2.21 89.15% 1.24
D 69.79% 6.71 91.29% 4.75
Author Contributions: Conceptualization, J.K. and H.L.; Data curation, H.L. and D.K.; Formal
analysis, J.K.; Methodology, J.K., M.L. and D.K.; Supervision, M.L., H.H. and H.S.K.; Writing—
original draft, J.K. and H.L.; Writing—review and editing, H.H. and H.S.K. All authors have read and
agreed to the published version of the manuscript.
Funding: INHA UNIVERSITY Research Grant.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: Not applicable.
Acknowledgments: This work was supported by INHA UNIVERSITY Research Grant.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Korea Environmental Industry & Technology Institute (KEITI). Water and Sewerage R&D Technology Trend Report; KEITI: Seoul,
Korea, 2020.
2. Ministry of Environmental (MOE). Water Supply Statistics; Ministry of Environmental: Sejong, Korea, 2019.
3. Adamowski, J.; Chan, H.F.; Prasher, S.O.; Ozga-Zielinski, B.; Sliusarieva, A. Comparison of multiple linear and nonlinear
regression, autoregressive integrated moving average, artificial neural network, and wavelet artificial neural network methods
for urban water demand forecasting in Montreal, Canada. Water Resour. Res. 2012, 48. [CrossRef]
4. Tripathi, A.; Kaur, S.; Sankaranarayanan, S.; Narayanan, L.K.; Tom, R.J. Water demand prediction for housing apartments using
time series analysis. Int. J. Intell. Inf. Technol. 2019, 15, 57–75. [CrossRef]
5. Razali, S.N.A.M.; Rusiman, M.S.; Zawawi, N.I.; Arbin, N. Forecasting of Water Consumptions Expenditure Using Holt-Winter’s
and ARIMA. J. Phys. Conf. Ser. 2018, 995, 012041. [CrossRef]
Water 2022, 14, 1512 16 of 17
6. Du, H.; Zhao, Z.; Xue, H. ARIMA-M: A new model for daily water consumption prediction based on the autoregressive inte-grated
moving average model and the markov chain error correction. Water 2020, 12, 760. [CrossRef]
7. Alvisi, S.; Franchini, M.; Marinelli, A. A short-term, pattern-based model for water-demand forecasting. J. Hydroinform. 2007, 9,
39–50. [CrossRef]
8. Atsalakis, G.; Minoudaki, C.; Markatos, N.; Stamou, A.; Beltrao, J.; Panagopoulos, T. Daily irrigation water demand prediction
using adaptive neuro-fuzzy inferences systems (anfis). In Proceedings of the 3rd IASME/WSEAS International Conference on
Energy, Environment, Ecosystems and Sustainable Development, Agios Nikolaos, Greece, 24–26 July 2007; pp. 369–374.
9. Oliveira, P.J.; Steffen, J.L.; Cheung, P. Parameter Estimation of Seasonal Arima Models for Water Demand Forecasting Using the
Harmony Search Algorithm. Procedia Eng. 2017, 186, 177–185. [CrossRef]
10. Zubaidi, S.L.; Al-Bugharbee, H.; Muhsen, Y.R.; Hashim, K.; Alkhaddar, R.M.; Hmeesh, W.H. The Prediction of Municipal Water
Demand in Iraq: A Case Study of Baghdad Governorate. In Proceedings of the 12th International Conference on Developments in
eSystems Engineering, Kazan, Russia, 7–10 October 2019; pp. 274–277. [CrossRef]
11. Li, H.; Wang, X.; Guo, H. Uncertain time series forecasting method for the water demand prediction in Beijing. Water Supply 2021,
22, 3254. [CrossRef]
12. Xiang, Z.; Yan, J.; Demir, I. A rainfall-runoff model with LSTM-based sequence-to-sequence learning. Water Resour. Res. 2020,
56, e2019WR025326. [CrossRef]
13. Kratzert, F.; Klotz, D.; Brenner, C.; Schulz, K.; Herrnegger, M. Rainfall–runoff modelling using Long Short-Term Memory (LSTM)
networks. Hydrol. Earth Syst. Sci. 2018, 22, 6005–6022. [CrossRef]
14. Kim, J.; Lee, M.; Han, H.; Kim, D.; Bae, Y.; Kim, H.S. Case Study: Development of the CNN Model Considering Teleconnection for
Spatial Downscaling of Precipitation in a Climate Change Scenario. Sustainability 2022, 14, 4719. [CrossRef]
15. Velasco, L.C.P.; Granados, A.R.B.; Ortega, J.M.A.; Pagtalunan, K.V.D. Performance analysis of artificial neural networks training
algorithms and transfer functions for medium-term water consumption forecasting. Int. J. Adv. Sci. Appl. 2018, 9, 109–116.
[CrossRef]
16. Gagliardi, F.; Alvisi, S.; Franchini, M.; Guidorzi, M. A comparison between pattern-based and neural network short-term water
demand forecasting models. Water Sci. Technol. Water Supply 2017, 17, 1426–1435. [CrossRef]
17. Boudhaouia, A.; Wira, P. A Real-Time Data Analysis Platform for Short-Term Water Consumption Forecasting with Machine
Learning. Forecasting 2021, 3, 682–694. [CrossRef]
18. Salloom, T.; Kaynak, O.; He, W. A novel deep neural network architecture for real-time water demand forecasting. J. Hydrol. 2021,
599, 126353. [CrossRef]
19. Bougadis, J.; Adamowski, K.; Diduch, R. Short-term municipal water demand forecasting. Hydrol. Processes Inter-Natl. J. 2005, 19,
137–148. [CrossRef]
20. Herrera, M.; Torgo, L.; Izquierdo, J.; Pérez-García, R. Predictive models for forecasting hourly urban water demand. J. Hydrol.
2010, 387, 141–150. [CrossRef]
21. Awad, M.; Zaid-Alkelani, M. Prediction of Water Demand Using Artificial Neural Networks Models and Statistical Model. Int. J.
Intell. Syst. Appl. 2019, 11, 40–55. [CrossRef]
22. Mu, L.; Zheng, F.; Tao, R.; Zhang, Q.; Kapelan, Z. Hourly and Daily Urban Water Demand Predictions Using a Long Short-Term
Memory Based Model. J. Water Resour. Plan. Manag. 2020, 146, 05020017. [CrossRef]
23. Firat, M.; Yurdusev, M.A.; Turan, M.E. Evaluation of Artificial Neural Network Techniques for Municipal Water Consumption
Modeling. Water Resour. Manag. 2009, 23, 617–632. [CrossRef]
24. Adamowski, J.; Karapataki, C. Comparison of multivariate regression and artificial neural networks for peak urban wa-ter-
demand forecasting: Evaluation of different ANN learning algorithms. J. Hydrol. Eng. 2010, 15, 729–743. [CrossRef]
25. Al-Zahrani, M.A.; Abo-Monasar, A. Urban Residential Water Demand Prediction Based on Artificial Neural Networks and Time
Series Models. Water Resour. Manag. 2015, 29, 3651–3662. [CrossRef]
26. Zubaidi, S.L.; Ortega-Martorell, S.; Kot, P.; Alkhaddar, R.M.; Abdellatif, M.; Gharghan, S.K.; Hashim, K. A method for pre-dicting
long-term municipal water demands under climate change. Water Resour. Manag. 2020, 34, 1265–1279. [CrossRef]
27. Bakker, M.; van Duist, H.; van Schagen, K.; Vreeburg, J.; Rietveld, L. Improving the Performance of Water Demand Forecasting
Models by Using Weather Input. Procedia Eng. 2014, 70, 93–102. [CrossRef]
28. Polebitski, A.S.; Palmer, R.N. Seasonal Residential Water Demand Forecasting for Census Tracts. J. Water Resour. Plan. Manag.
2010, 136, 27–36. [CrossRef]
29. Piasecki, A.; Jurasz, J.; Kaźmierczak, B. Forecasting Daily Water Consumption: A Case Study in Torun, Poland. Period. Polytech.
Civ. Eng. 2018, 62, 8241–8318. [CrossRef]
30. Bennett, C.; Stewart, R.; Beal, C. ANN-based residential water end-use demand forecasting model. Expert Syst. Appl. 2013, 40,
1014–1023. [CrossRef]
31. Candelieri, A.; Soldi, D.; Archetti, F. Short-term forecasting of hourly water consumption by using automatic metering readers
data. Procedia Eng. 2015, 119, 844–853. [CrossRef]
32. Vijai, P.; Sivakumar, P.B. Performance comparison of techniques for water demand forecasting. Procedia Comput. Sci. 2018, 143,
258–266. [CrossRef]
Water 2022, 14, 1512 17 of 17
33. Faiz, M.; Daniel, A.K. Wireless Sensor Network Based Distribution and Prediction of Water Consumption in Residential Houses
Using ANN. In Proceedings of the International Conference on Internet of Things and Connected Technologies, Wuhan, China,
22–25 April 2022; pp. 107–116. [CrossRef]
34. Water Services Regulation Authority. Business Retail Price Review 2016: Final Determinations; Water Services Regulation Authority:
Birmingham, UK, 2016.
35. Xenochristou, M.; Kapelan, Z.; Hutton, C. Using Smart Demand-Metering Data and Customer Characteristics to Investigate
Influence of Weather on Water Consumption in the UK. J. Water Resour. Plan. Manag. 2020, 146, 04019073. [CrossRef]
36. Xenochristou, M.; Hutton, C.; Hofman, J.; Kapelan, Z. Water Demand Forecasting Accuracy and Influencing Factors at Different
Spatial Scales Using a Gradient Boosting Machine. Water Resour. Res. 2020, 56, e2019WR026304. [CrossRef]
37. Korea Agency for Infrastructure Technology Advancement (KAIA). Water Grid Intelligence Research; KAIA: Anyang, Korea, 2017.
38. Choi, J.; Kim, J. Analysis of water consumption dart from smart water meter using machine learning and deep learning algorithms.
J. Inst. Electron. Inf. Eng. 2018, 55, 31–39.
39. Bhansali, R.J. Linear Prediction by Autoregressive Model Fitting in the Time Domain. Ann. Stat. 1978, 6, 224–231. [CrossRef]
40. Broersen, P.M. Autoregressive model orders for Durbin’s MA and ARMA estimators. IEEE Trans. Signal Process. 2000, 48,
2454–2457. [CrossRef]
41. Cai, Z.; Tiwari, R.C. Application of a local linear autoregressive model to BOD time series. Env. Off. J. Int. Env. Soc. 2000, 11,
341–350. [CrossRef]
42. Durbin, J. Efficient estimation of parameters in moving-average models. Biometrika 1959, 46, 306–316. [CrossRef]
43. Galbraith, J.W.; Ullah, A.; Zinde-Walsh, V. Estimation of the Vector Moving Average Model by Vector Autoregression. Econ. Rev.
2002, 21, 205–219. [CrossRef]
44. Akrami, S.A.; El-Shafie, A.; Naseri, M.; Santos, C.A.G. Rainfall data analyzing using moving average (MA) model and wavelet
multi-resolution intelligent model for noise evaluation to improve the forecasting accuracy. Neural Comput. Appl. 2014, 25,
1853–1861. [CrossRef]
45. Chen, J.-F.; Wang, W.-M.; Huang, C.-M. Analysis of an adaptive time-series autoregressive moving-average (ARMA) model for
short-term load forecasting. Electr. Power Syst. Res. 1995, 34, 187–196. [CrossRef]
46. Benjamin, M.A.; Rigby, R.A.; Stasinopoulos, D.M. Generalized autoregressive moving average models. J. Am. Stat. Assoc. 2003,
98, 214–223. [CrossRef]
47. Pappas, S.S.; Ekonomou, L.; Karamousantas, D.C.; Chatzarakis, G.E.; Katsikas, S.K.; Liatsis, P. Electricity demand loads modeling
using AutoRegressive Moving Average (ARMA) models. Energy 2008, 33, 1353–1360. [CrossRef]
48. Box, G.E.P.; Jenkins, G.M.; Reinsel, G.C.; Ljung, G.M. Time Series Analysis: Forecasting and Control, 5th ed.; John Wiley & Sons:
Hoboken, NJ, USA, 2015; pp. 23–88. ISBN 9781118674925.
49. Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [CrossRef] [PubMed]
50. Sagheer, A.; Kotb, M. Time series forecasting of petroleum production using deep LSTM recurrent networks. Neurocomputing
2018, 323, 203–213. [CrossRef]
51. Liu, P.; Wang, J.; Sangaiah, A.K.; Xie, Y.; Yin, X. Analysis and Prediction of Water Quality Using LSTM Deep Neural Networks in
IoT Environment. Sustainability 2019, 11, 2058. [CrossRef]
52. Anagnostis, A.; Papageorgiou, E.; Bochtis, D. Application of Artificial Neural Networks for Natural Gas Consumption Forecasting.
Sustainability 2020, 12, 6409. [CrossRef]
53. Taylor, K.E. Summarizing multiple aspects of model performance in a single diagram. J. Geophys. Res. Atmos. 2001, 106, 7183–7192.
[CrossRef]