1. Introduction
Water distribution systems (WDS) are intricate dynamic systems necessitating real-time decision making to ensure adequate service levels in terms of pressure, quantity, and quality. Typically, these decisions rely on a predictive control framework where data from network sensors are continuously fed into a mathematical model and the model identifies the optimal operation policy, which is subsequently delivered back to the system actuators. This process is referred to as model predictive control (MPC) or real-time control [
1]. While real-time control has many applications in WDS management (pressure management, chlorine injection, pump speed, etc.), its implementation presents many challenges due to several factors [
2]. Most of the optimization models are based on physical network modeling, which is mathematically complex. Another challenge is the uncertainty inherent in WDS. Unknown factors such as consumer demands, pipe roughness, and chlorine decay coefficients affect the optimality and feasibility of the solution. Recently a novel paradigm of data-enabled predictive control (DeePC) was introduced to cope with the challenge of real-time control [
3]. It is based on input-output measurements alone, where the physical system may be unknown. DeePC can be used to find optimal input signals such that corresponding output parameters will follow desired trajectories. DeePC does not depend on a parametric or physical model. Instead, it leverages historical system behavior to find an optimal policy based on combinations of past recorded input–output matches. The nature of this method holds promising potential for WDS control which requires solving highly complex optimization problems subject to a strict computational burden.
2. Methodology
The DeePC theory is herein described for the case of linear systems, and later, regularization terms are introduced for the more complex case of nonlinear uncertain systems. Consider a discrete-time linear system characterized by
m inputs and
p outputs. Let
and
represent the system input and output trajectories, respectively. The methodology employs a dataset of measured inputs and outputs spanning a length of
T time steps to derive an optimal control policy. For a signal
, we denote the Hankel matrix of order L to be a matrix of shifted signal entries, as defined in Equation (1):
A signal
is considered “persistently exciting of order L” if the above matrix is of full rank (its rank is equal to L). This means that the signal is rich and long enough to adequately capture the system’s dynamics [
1]. Let
be the size of an initial subset of records out of the set of
T measurements, and Let
N be the predictive horizon, such that
; the following Hankel matrices are constructed:
where the
p and
f subscripts represent past and future, respectively, and the
d superscript denotes the measured data, meaning that
are the measured input and output signals.
and
comprise the first
block of rows, and
and
comprise the last
N block of rows of the respective Hankel matrices. Based on this notation, the past data are used to estimate the system’s current state, and the future data are used to predict the system’s future state.
The key principle of the DeePC methodology lies in the following result, which states that a Hankel matrix of sufficient order spans the space of all system trajectories. According to [
4], an input–output set
is a solution to the control problem if and only if there exist
that satisfies the following:
Equation (3) signifies that a linear constraint can effectively substitute a physical model to describe the system dynamics. A control problem based on this approximation can be formulated:
where
u,
y, and
are the input, output, and reference trajectories, respectively, all with a length of the forecast horizon.
is a regularization term that mitigates overfitting, thus enhancing the robustness of the solution under nonlinear and uncertain conditions. Similar regularization terms can be introduced to regulate the input and output, as presented in [
1]. Equation (4) lays out a tractable control problem with a quadratic objective and linear constraints. Further constraints can be added to maintain
u and
y within a desired range. This control problem is solved within an MPC framework, that is, after solving problem (4), the optimal input trajectory
is applied to the system. After
s time steps (s < N), the problem is updated to include the latest
measurements and then solved again.
Case Study
The Fossolo network represents a single district meter area (DMA), as depicted in
Figure 1. The network has one inlet, regulated by a PRV that governs the inlet head. It comprises 36 demand nodes with a total demand fluctuating between 24 and 42 L/s. The objective is to control the PRV setting to ensure minimal sufficient pressure for consumers where excess pressure would exacerbate leakages. Accordingly, the PRV setting serves as the input signal (
u), while the output signal (
y) corresponds to the pressure measured in a representative sensor.
Since real measured data are not available, this study employed simulated data. The input trajectory is normally distributed, with a mean value of 45 and a standard deviation of 8, and comprises 600 data points. To test the method against real-world noised data, the output signal was adjusted by a random factor within the range of 0.95 to 1.05.
3. Results
The DMA pressures were optimized over one week with a reference trajectory of 30 m. Additionally, constraints were added to keep the pressure in the range of 25–35 m. The DeePC method was compared to two common control strategies, a constant setting, and a time-based strategy that employs higher PRV settings during the daytime to accommodate increased demands. The input–output trajectories of the three strategies are presented in
Figure 2.
Figure 2’s upper subplot demonstrates that the DeePC results in a lower-magnitude trajectory that can better track the 30 m target compared to the two alternatives. The input signal of the PRV setting is highly correlated with the demand pattern, allowing for higher pressures at the peak demand times and redundant pressure reduction for the off-peak times. To quantify the performance of the three approaches, the mean absolute error (MAE) was computed. Additionally, constraint violation rates were calculated to evaluate how often pressures exceeded the predefined range of 25–35 m. Furthermore, the violations of a broader range of 10 m deviations were also computed to evaluate the rate of dramatic violations. The results in
Table 1 show that DeePC is superior to the constant and time-based controls in all metrics. In terms of deviations from the reference trajectory, DeePC achieved a MAE which is 32% lower than the time-based control. DeePC violated the constraints of only 16.7% of the time steps compared to 64.3% and 35.1 of the constant and time-based controls, respectively, where none of the violations exceeds the 10 m range.