Professional Documents
Culture Documents
Abstract: The need to improve the accuracy of mathematical models describing hydraulic networks (e.g., water distribution systems) poses
numerous challenges regarding the problems of parameter calibration and optimal sampling. In this paper, a comparison is made between
different sensor placing strategies, including a novel, direct (iteration-free) algorithm. The approach used is based on maximizing both the
sensitivity and the so-called hydraulic distance of the sampling points without the need for optimization. Based on these sampled measure-
ments, the pipe roughness coefficient of each pipe is calibrated individually (without grouping) by solving the resulting underdetermined
linear system with singular value decomposition. The efficiency of the approach is demonstrated on case studies, including artificial
(e.g., Anytown and C-town) and real-life water distribution systems through detailed statistical comparison. DOI: 10.1061/(ASCE)
WR.1943-5452.0001150. © 2019 American Society of Civil Engineers.
Introduction as the weighted shortest path between two sampled nodes. This
approach ensures that the pressure loggers are located at nodes that
Water distribution systems (WDSs) are essential infrastructures are (a) sensitive and (b) “far” from each other, i.e., widespread over
providing sufficient quantity and quality of drinking water for both the network.
civilian and industrial customers. The complexity of these systems The calibration process adjusts the friction parameter of each
increases quickly with the number of demand (consumption) points pipe individually. The advantage of this approach is that no addi-
(that is, the size of the area served), resulting in the challenging task tional data (e.g., age or pipe material) is needed, which is usually
of instrumentation and supervision of such large systems. One of used for the grouping of the pipes and might be inaccurate or
the emerging challenges is that because the number of measure- even missing. However, the resulting problem is underdetermined,
ment devices—pressure loggers, flow meters—is far smaller than i.e., the number of parameters to be adjusted is (far) greater than
the number of potential measurement points (typically fire hydrants the available measurement data. Here, the use of singular value de-
for pressure), one must optimize (in some sense) the layout of the composition (SVD) is proposed to solve such problems, which re-
measurements. Another issue with the increasing complexity is that sults in a parameter vector closest to the initial guess (least-norm
the number of parameters to be calibrated in a WDS computer solution).
model escalates with system size and the calibration process re- This paper is organized as follows. The next section includes a
quires both high computational effort and highly sophisticated literature overview in which, even though the focus is on friction
algorithms. parameter calibration, a brief overview of demand calibration and
The focus of the present study is on hydraulic systems with sampling approaches is also provided. Then the mathematical tools
significant frictional pressure drop, and the aim is to calibrate that are used, such as the mathematical background of hydraulic
the pipe roughness values in such a way that the model output simulation, sensitivity analysis, SVD, and distance metrics on
(nodal pressures) is as close to the “real values” as possible. Such graphs, are summarized. Then the main results, i.e., a deterministic,
calibration requires two steps: (1) designating the sampling points, iterative calibration and a direct, iteration-free sensor placement
i.e., the nodes at which pressure values are measured and (2) a technique, are described. The next section includes a comparison
calibration algorithm that minimizes the discrepancy between of the proposed layout design with five standard approaches from
the measured and the computed pressure values by adjusting the the literature on six different networks of increasing size and com-
pipe friction coefficients. The novelties of the proposed techniques plexity, including the the Anytown, C-town, and KY2 networks, in
are as follows. addition to a small artificial network and two real-life networks. In
The sampling design is based on a quantity combining the sen- the case of the simple systems, a full enumeration of the possible
sitivity and the “hydraulic distance” of the sampling points, defined sampling layouts (that is, for a given number of sampling points, all
possible combinations were evaluated) is performed, while for the
1 larger systems a MATLAB version R2018a built-in optimization
Ph.D. Student, Dept. of Hydrodynamic Systems, Faculty of Mechan-
ical Engineering, Budapest Univ. of Technology and Economics, Budapest package is used. For each case, 100 calibration processes were
1111, Hungary (corresponding author). ORCID: https://orcid.org/0000 run with random initial parameter sets, and a detailed statistical
-0002-2556-3841. Email: rweber@hds.bme.hu comparison is provided. The final section concludes the study.
2
Associate Professor, Dept. of Hydrodynamic Systems, Faculty of
Mechanical Engineering, Budapest Univ. of Technology and Economics,
Budapest 1111, Hungary. ORCID: https://orcid.org/0000-0002-1930-515X.
Email: cshos@hds.bme.hu Literature Review
Note. This manuscript was submitted on November 8, 2018; approved
on May 22, 2019; published online on November 12, 2019. Discussion Hydraulic models of WDSs consist of a large number of nodal mass
period open until April 12, 2020; separate discussions must be submitted balances and pressure difference–flow rate relationships, with the
for individual papers. This paper is part of the Journal of Water Resources former being linear and the latter being nonlinear, due to the inher-
Planning and Management, © ASCE, ISSN 0733-9496. ently nonlinear dependence of frictional pressure losses on flow
set that provides the lowest value of the objective function. The metering cost was minimized using decision trees. Bush and Uber
first algorithmic method can be found in Ormsbee (1989), where (1998) introduced an idea in which the propagated measurement
nonlinear optimization was used. error should be minimized. This theory was also applied to build
Studies have been published on the calibration of the roughness a three-step method in Lansey et al. (2001), and further develop-
coefficient itself without the adjustment of consumption values. ment was carried out where the uncertainty of the model output was
Greco (1999) used the sensitivity matrix with linear and nonlinear considered. Kapelan et al. (2003) used this theory by taking into
programming to increase the accuracy of the model. To avoid the consideration the total sampling design cost and solved the lem
(potentially inaccurate) computation of the derivatives, Kapelan as multiobjective optimization. Single and multiobjective genetic
et al. (2007) presented a method called shuffled complex evolution algorithms were used in Kapelan et al. (2005), where the latter
metropolis that could also calculate uncertainties without the wide- was capable of determining a whole set of (Pareto) optimal solu-
spread first-order second method (FOSM). Koppel and Vassiljev tions. In Behzadian et al. (2009) the authors combined a multiob-
(2009) presented a technique in which the number of parameters jective genetic algorithm with an adaptive neural network to
was decreased by grouping the pipes based on their age, and then minimize uncertainties. However, completely different approaches
they used the Levenberg–Marquardt algorithm for optimization. were presented in de Schaetzen et al. (2000), where the methods
Khedr et al. (2015) showed that a sophisticated optimization algo- were based on the shortest path algorithm, sensitivity, and Shannon
rithm could increase the accuracy of the calibration compared to entropy. Morosini et al. (2014) performed sensitivity analysis to
the case of adjusting the roughness coefficients manually. Simi-
prioritize the influences of nodes and pipes, but it was not empha-
larly, Meirelles et al. (2017) also compared the standard calibration
sized that the most sensitive nodes and pipes were often in the
process to optimization using particle swarm optimization and
vicinity of each other or even adjacent ones. Moreover, if one
concluded that the latter results in higher accuracy. Todini and
wishes to place several sensors incrementally (which is typically
Pilati (1987) introduced the Global Gradient Algorithm for solving
the case), it is not clear how to locate the subsequent sampling
the basic hydraulic equations, which was further developed for
node. Klapcsik et al. (2017) demonstrated the performance of sev-
calibration in Giustolisi and Berardi (2011). This algorithm also
eral of the aforementioned techniques plus a novel one based on
inspired a direct inversion algorithm [published in Du et al. (2018)]
graph theory using real-life water distribution networks.
combining all unknowns in a system into a single global vector,
Another typical problem of optimal sampling involves early
including unmeasured pipe flows, nodal pressures, and rough-
warning systems for detecting contamination, accidental or inten-
ness coefficients, and then solving them simultaneously using a
Newton-based linear method. Ostfeld et al. (2012) summarize the tional. Numerous studies were carried out to address the issue of
competition known as the Battle of Water Calibration Networks how to minimize the effects of pollution by locating an optimal set
whose purpose was an objective comparison of different calibration of sensor layout. One of the first attempts was made by Lee and
techniques using the C-town network. Deininger (1992) based on maximizing the “covered nodes” in
A different approach to increasing the accuracy of the hydraulic the network using integer linear programming. The origin tracking
model is to calibrate nodal demands or demand patterns. For this algorithm was presented in Laird et al. (2005); it is capable of de-
purpose Kang and Lansey (2009) used a weighted least-squares termining contamination spread in a system without the need for
method that was also capable of predicting the chlorine concentra- discretizing pipelines. Later, Berry et al. (2006) introduced the min-
tion in the examined system. The SVD theory was applied to imization of the expected impact calculating numerous different
determine the search direction where the golden section search scenarios and optimized with mixed integer programming and heu-
was used to obtain the optimal nodal demands in Cheng and He ristic method. A competition, called the Battle of the Water Sensor
(2011). Kun et al. (2015) presented an efficient inversion model Networks, was organized in 2006 and its results were published
that avoided the repeated computation of a Jacobian matrix. Do later in Ostfeld et al. (2008). The main conclusions were that plac-
et al. (2016) developed a genetic algorithm for the calibration of ing sensors at water sources (e.g., pump stations, tanks) is not nec-
the demand multipliers where the number of measurements is lower essary and the clustering of sensors is inefficient. Xu et al. (2008)
than the number of unknowns, i.e., it is underdetermined. SVD was presented a method based on graph theory where no additional data
also applied in Sanz and Pérez (2015) to reduce the parameters by apart from the topology of the network were needed, which seems
grouping nodal demands into several patterns. to be uneconomical given that hydraulic data are already available
Another frequent idea of calibration is to take into consideration but not utilized. A technique can be found in Krause et al. (2008)
both the demands (or demand patterns) and roughness coefficients. that is capable of handling huge networks (21,000 nodes) while
A two-step weighted least-squares method was introduced for maintaining rigorous approximation and low CPU running time.
the adjustment of parameters in Kang and Lansey (2011). Dini Hart and Murray (2010) published a comprehensive review on
and Tabesh (2014) used the ant colony optimization method for the topic, also suggesting directions for further research, e.g., effi-
simultaneous calibration. In Xie et al. (2017) a Bayesian-based, ciently solving large-scale problems or comparing new methods
two-level Markov chain Monte Carlo–Particle Filter method was to existing ones. Later, Zheng et al. (2018) published a paper com-
used that can also handle the uncertainties of calibrated values. paring different sensor placement strategies, using the quantities
J i;j ¼ ∂Fi =∂xj . Note that this is a sparse system with mostly two is solved by SVD, the solution vector will have the smallest norm of
or three nonzero elements per row, so the package UMFPACK all possible solutions, that is, kyk → Min. However, the SVD is
(Davis 2004) is used to solve it efficiently and avoid an expensive also computationally favorable since the overall cost is Oðmn2 Þ,
inversion of the Jacobian. where n ≪ m, meaning, in the present case, it will grow linearly
with the size of the network.
Another useful feature of this technique is that it can determine
Pressure Sensitivity Analysis
the rank of a matrix, i.e., the number of linearly independent rows
It is useful to compute the variation of the solution (that is, pressure or columns. In theory, the singular values are always positive, and
or flow rate values) with respect to (wrt) a parameter, for example, the number of nonzero singular values gives the rank of the matrix.
the pipe friction coefficient λ. That is, the variation of the pressure However, in reality, in numerical cases the out-of-rank singular val-
at node i wrt to a small change in the friction coefficient of ues are never precisely zero (but, instead, some small finite num-
pipe j is ber). Therefore, a widely used approach is that 90% of the overall
energy (information) should be retrained. This means that once Σ
∂pi
Si;j ¼ ð6Þ (which was a diagonal matrix including the singular values) has
∂λj been computed, one deletes the smallest singular values in such
a way that the sum of the squares of the remaining (nondeleted)
To compute this quantity, Eq. (4) is differentiated wrt to the singular values should provide at least 90% of the sum of the
parameters [and keeping in mind that if the parameter is varied, squares of all singular values (Leskovec 2014).
so is the solution, i.e., pðλÞ], giving After determining the SVD of a matrix, its inverse (or pseudoin-
∂F ∂p ∂F verse) can also be easily found by Aþ ¼ VΣþ UT , where Σþ is a
þ ¼0 ð7Þ diagonal matrix with the reciprocal of the retained (nondeleted) sin-
∂p ∂λi ∂λi
gular values. Finally, the method is capable of evaluating the de-
where the first term (the Jacobian) was already computed during the terminant of a singular, nonsquared matrix; the determinant is
Newton iteration, the second term is one column from the sensi- simply the product of the retained singular values. For additional
tivity matrix S pursued to compute, and the third term is easily features on SVD, the reader may consult Leskovec (2014). These
evaluated from either Eq. (1) or (2). Eq. (7) is again a linear set computations were performed using the C++ library Eigen (Gael
of algebraic equations, which has to be solved as many times as and Benoi 2010), which ensures computational efficiency.
there are parameters. However, because the linear coefficient matrix
(the Jacobian) is constant, the use of lower–upper (LU) decompo- Graph Theory: Distance
sition makes this process efficient and inexpensive. It should be
noted that the same process could be employed to compute the sen- The final mathematical tool that is needed is borrowed from graph
sitivity of any output variable wrt to any parameter (for example, theory. A water distribution system can be interpreted as a graph
the sensitivity of the pipe flow rates wrt to demand); however, this where the pipelines, valves, pumps, and so forth are the edges
study focuses on the sensitivity of nodal pressures to the pipe fric- (links), and their intersections (nodes) are the vertices. In graph
tion parameters. theory, a path is defined as a route through links between two nodes
The roughness coefficients of every pipeline affect the pressure (possibly far away from each other) (e.g., Barabási 2016). Clearly,
at each node; the local or nodal sensitivity can be defined by taking in a looped network there could be several different paths between
the absolute value of every element and summing the rows of the two particular nodes and, in the simplest case, the shortest path or
matrix: distance is the one with the minimum number of edges. For exam-
ple, in Fig. 1 the shortest path between Nodes 1 and 4 is three (if
X
N edges just the number of edges defines the length of the path), as high-
Snodal;i ¼ jSfi;jg j ð8Þ lighted by the solid thick line. However, the distance or length can
j¼1 also be defined in the case of a weighted graph (Barabási 2016). In
this case, the shortest path is the one along which the sum of the
This equation forecasts the increase (or decrease) of a nodal weights is minimal. In this sense, in Fig. 1, the distance between
pressure in cases where the roughness of every pipe is (equally) Nodes 1 and 4 is 10 þ 10 þ 5 þ 10 ¼ 35 (dashed line). For com-
increased. puting this quantity, the C++ library igraph was used (Csardi and
Nepusz 2012).
Singular Value Decomposition
The calibration technique introduced in the next section results in a Proposed Calibration and Sensor Placement Method
nonsquare linear system; that is, there are fewer equations than un-
knowns. Such a linear system is underdetermined, and there are In the following sections, the model calibration and the sensor
infinitely many solutions of which the one with the smallest norm placement problem both focus on the accuracy of the model output,
system at Point 5 is far from trivial. On the one hand, the linear
system is underdetermined, i.e., the number of equations (number
of sampling points) is significantly lower than the number of var-
iables (number of pipes). On the other hand, it is possible that the
Fig. 1. Visualization of definition of distance. rows (the measurements) are not linearly independent, resulting in
almost identical rows in the sensitivity matrix. This can easily hap-
pen if the sampling points are close to each other (e.g., they are two
i.e., the purpose is to minimize the discrepancy between the “real” nodes of the same, short pipe). As presented earlier, both issues can
pressure distribution and the output of the calibrated model by ad- be addressed if SVD is used to compute the pseudoinverse of the
justing the roughness coefficients of the model. Note that the cal- reduced sensitivity matrix (Sþ r ). Because the system is underdeter-
ibration and sampling techniques are independent of each other. mined, there are usually multiple possible solutions for the prob-
First, the proposed calibration technique is explained using prede- lem, but the SVD finds the one with the least norm, that is, this
fined sampling locations, then an efficient sensor placing strategy is iteration gives the closest set of parameters to the initial estimation
described. at the first step and the closest to the previous one in the following
steps; i.e., it is a step-by-step least-norm solution. To speed up
the convergence of the subsequent parameter improvement steps,
Calibration Technique it is useful to apply relaxation, i.e., Δλ ¼ ωSþ r Δp, where
0.1 ≤ ω ≤ 0.8.
There are two important features of the proposed calibration tech-
In this paper, the method is used for only one hydraulic state
nique. First, it assigns an independent roughness coefficient to each
with nominal values, but the technique itself can be used straight-
pipeline i.e., here they are not grouped as in, e.g., Mallick et al.
forwardly for multiple operational points. Assume that there are
(2002). The advantage of grouping similar pipes (based on material
measurements available with N op different operational loads. In this
or age, for example), and then assigning a single friction coefficient
case, in Step 2, N op hydraulic simulation needs to be performed and
to all pipes within a group reduces the problem to a large extent, but
it might require more input data that might not be available or N op sensitivity matrices calculated ; constructing the reduced ma-
be uncertain. Second, it is iterative; thus, no heuristic optimization trix is similar, and it has N op × N sampled rows. The rest of the
(e.g., genetic algorithm, simulated annealing) is required. This method is unchanged.
method could use any friction model, e.g., Darcy-Weisbach, Another property of the method is that in the case of fully in-
Hazen-Williams: in the first case, the parameter to be adjusted dependent measurements (mathematically speaking, if the rank of
is the relative roughness, while in the latter case, it is the C-factor. the reduced sensitivity matrix is not lower than its smaller size), the
For the time being it is assumed that the sampled nodes have al- technique is capable of removing the difference fully between the
ready been chosen (the sensor placement techniques will be dis- measurements and the model output, i.e., Δp ¼ 0, see Eq. (9).
cussed subsequently). The calibration technique consists of the However, if there are linearly dependent measurements, the result-
following steps. ing linear system Sr Δλ ¼ Δp is ill-conditioned because this means
1. The sampling nodes, the measured values at these nodes, and the that the left-hand side of the linear equations are almost identical,
estimated parameter vector are input values. If additional data while the corresponding right-hand sides are different. This can
(e.g., age or material) are available for the pipelines, they should easily happen, e.g., if two sampled nodes are too close to each other
be used to increase the accuracy of the calibration. If such data and there is uncertainty in the consumption values or in the pressure
are unavailable, some “default” value needs to be assigned to measurement. In such cases, the proposed calibration technique can
each pipe. still be utilized and will converge robustly. However, the difference
2. Run a standard hydraulic simulation with the estimated rough- between the measurement and the model output would not
ness values and with the nominal (average) demand values. vanish, i.e., Δp ≠ 0.
Then calculate the sensitivity matrix.
3. Extract the reduced sensitivity matrix Sr including only the sen- Sensor Placement Based on Hydraulic Distance with
sitivity of the sampled pressures, i.e., delete the rows corre- Sensitivity
sponding to unsampled nodes. The original sensitivity matrix
is of size N nodes × N edges and, in the case of N sampled number Intuitively, a “good” sensor placement fulfills two requirements.
of pressure transducers, the size of the reduced sensitivity matrix First, sensitive nodes are used for sampling. As a counterexample,
Sr is N sampled × N edges . it is inefficient to deploy pressure loggers close to reservoirs, where
4. Compute the pressure difference between the hydraulic model the pressure is primarily defined by the reservoir water level. How-
output (obtained with the estimated parameters) and the mea- ever, sensitive nodes tend to accumulate at certain locations within
surements (“real” values): a WDS, so simply choosing the most sensitive nodes often results
in the accumulation of sampling points close to each other (de
Δp ¼ pmod − pmeas ð9Þ Schaetzen et al. 2000; Klapcsik et al. 2017). The problem with
Fig. 2. Layout of networks. Anytown, Grid, C-town, and KY2 are artificial networks, while the other two are different zones of Sopron (Hungarian
city). For basic data of the water distribution networks, see Table 2.
4. Find the maximum value and designate the corresponding node i¼1
as a sampled node, i.e., add it to the master nodes.
5. Repeat from Step 3 until a sufficient number of nodes has been FCov−D ¼ ðdetðCÞÞð1=2NÞ → Min! ð13Þ
selected.
Note that the computation of the hydraulics and the sensitivity is A similar method, based on a sensitivity matrix, can be found in
required only once, and only the distances are recalculated at each de Schaetzen et al. (2000). The idea is to maximize the sensitivity
step. This ensures a low computational time, even in the case of that the measured nodes cover, mathematically
large networks.
X
N edges
FSmax ¼ aj ; aj ¼ maxðSi;j Þ → Max! ð14Þ
i
j¼1
Comparison of Measurement Layouts
which means the sum of the columnwise maximum values, and this
This section presents a detailed comparison of different sensor basically maximizes the sensitivity of the measured nodes. In de
placement strategies, including the proposed technique and other Schaetzen et al. (2000) the entropy of the sensitivity is also used:
approaches available in the literature. The previously presented
calibration technique is used for every method during the compari- X
N edges
aj
son to ensure low CPU time. This ensures that once the sampling FShannon ¼ − bj lnðbj Þ; bj ¼ P → Max! ð15Þ
nodes are designated, the calibration process will be the same for all j¼1 j aj
methods. The analysis that follows uses six WDSs with increasing
complexity, whose main parameters are listed in Table 2 and the While in de Schaetzen et al. (2000) one objective function
layouts of the networks are depicted in Fig. 2. The steps of the was defined by taking the weighted sum of the two method, the
analysis are as follows. present analysis deals with these theories separately, similarly to
• Fix the number of pressure measurement points: N sampled . Klapcsik et al. (2017).
• Perform a hydraulic analysis with nominal friction values and Finally, in de Schaetzen et al. (2000) graph theory is used
average demands. Save all nodal pressure data. This simulates (similar to the proposed technique); the weights in the definition
the measurement, and the corresponding pressure values are de- of distance are the lengths of the pipelines. With this approach,
noted by pmeas. the next sensors are located as far away from each other and the
• By means of a sampling layout design technique, choose sources as possible (in a geographical sense). This approach adds
N sampled sampling nodes. the sensors one by one. Hence, there is no need for optimization as
• Perturb all nominal friction parameter values by a random in the case of the first four approaches. The mentioned techniques
number with normal distribution (0 expected value and 20% are summarized in Table 3.
standard deviation). Perform calibration using the perturbed
frictions as initial values and evaluate the error, that is, the norm
Test Networks
of the difference between the nominal and calibrated model
pressure at all nodes, mathematically In this section, six networks are used for comparison. The first net-
sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi work is entirely artificial, and it is a regular 4 × 4 grid network. The
PN nodes 2
i¼1 ðpmod;i − pmeas;i Þ
next two systems are well known from the literature: the Anytown
ep ¼ ð10Þ and C-town networks. While the fourth one, KY2, is still artificial,
N nodes
• Repeat this process 100 times, then compute the average and
standard deviation of the ep errors. For a given number of sam- Table 3. Sensor placement methods for roughness calibration in the
pling nodes, one layout is superior to another one if the average literature
of ep is smaller. It is expected that the calibration error will van- Name Short notation Optimization Reference
ish if all nodes are sampled.
Covariance Cov-A/Cov-D Yes Lansey
Note that in a real-life case, ep is not available since only the
(A-optimal/D-optimal) et al. (2001)
sampled pressures are known. However, the authors think that this Sensitivity Smax Yes de Schaetzen
quantity gives an objective way of comparing different sampling maximization et al. (2000)
strategies, at least for artificial test cases. Shannon entropy Shannon Yes de Schaetzen
et al. (2000)
Shortest path Spath No de Schaetzen
Standard Sampling Methods et al. (2000)
The standard sampling methods found in the literature are briefly Hydraulic distance HDS No Presented
with sensitivity in this paper
described; for a complete description, see the cited papers. The first
ep / max(ep ) [-]
0.6 0.6
0.4 0.4
Downloaded from ascelibrary.org by "Indian Institute of Science, Bangalore" on 07/12/21. Copyright ASCE. For personal use only; all rights reserved.
0.2
0.2
0
0 0 5 10 15 20
0 2 4 6 8 10 12 14 16
no. of sensors
no. of sensors
Fig. 4. Average error of calibration (with standard deviation of average
Fig. 3. Average error of calibration (with standard deviation of average
as error bars) as function of number of sensors for Anytown network.
as error bars) as function of number of sensors for 4 × 4 Grid network.
the final two networks are real-life WDSs near Sopron, Hungary. determining the objective function for every possible scenario (di-
The network layouts can be found in Fig. 2 while some of the es- rect enumeration) was impossible (the number of potential layouts,
sential characteristic data are provided in Table 2. e.g., for five sensors only, is approximately 8.2 × 1010 ). Thus,
As mentioned, the first system for the comparison is a simple, MATLAB’s built-in discrete genetic algorithm was used to find
4 × 4 grid network, for which all parameters (e.g., pipe length, no- a solution close to optimal for all techniques except the Spath
dal demand) are identical. The pressure source is located at the top and HDS methods, which do not require any optimization. The re-
left corner and a prescribed amount of water is fed into the system sults for C-town can be seen in Fig. 5, where Smax outperforms the
at the bottom right corner, which is equal to the total nominal con- other techniques, HDS performs second best, and Spath is slightly
sumption, i.e., there is no water entering at the prescribed pressure behind them. In the case of KY2 (Fig. 6) four techniques converge
point. The second system is the Anytown network described in similarly, but HDS is slightly ahead of Smax. In Fig. 9 the optimal
Walski et al. (1987). In the case of these networks (Grid and Any- measurement layout depicts the cases corresponding to sampling
town), due to the small number of nodes, it was possible to perform with less than 20% error. In the case of C-town, the Smax method
full enumeration, that is, for a given sensor number, evaluate all requires 16 sensors, while HDS requires 29, but the other tech-
possible combinations of the sampling nodes and compare the niques could not achieve this goal within 30. On the other hand,
objective function (FCov−A , FCov−D , FSmax , FShannon ) results and in the case of the KY2 network, HDS needs 15 sensors. Again,
choose the optimal layout (minimum or maximum) for each the sensors are well spread out across the network, and they are
method. mostly found close to the end of the networks or at pressure zone
The results for the Grid network can be seen in Fig. 3 and those borders.
for the Anytown system in Fig. 4. The average pressure error de-
fined by Eq. (10) (including the pressure error of the unsampled
nodes) is depicted against the number of sensors. In addition, C-town
the standard deviation of the average, based on 100 calibrations,
Cov-A
is depicted using error bars. For the Grid network, the Spath 1 Cov-D
and Shannon techniques provide slow convergence, and the Smax
Shannon
Cov-A technique gives slightly (but not significantly) slower con- Spath
0.8 HDS
vergence than the rest of the techniques. In the case of the Anytown
system, Smax and the HDS techniques provide the fastest conver-
ep / max(ep ) [-]
gence. The number of sensors needed to decrease the error of the 0.6
initial “uncalibrated” error to its 20% is also highlighted. These
sensor placement layouts are depicted in Fig. 9. It is easy to see
that these “optimal” layouts are far from the pressure sources (res- 0.4
ervoirs), but they seem to be located close to each other. However,
these sampling points are far from each other in the “hydraulic”
sense in that they are separated by pipes with large frictional pres- 0.2
sure drop due to the high velocity.
The next networks in this comparison are the C-town (Ostfeld
0
et al. 2012) and KY2 (Hwang and Lansey 2017) (see Fig. 2 and 0 5 10 15 20 25 30
Table 2 for basic data). These are still hypothetical networks, no. of sensors
but they are already an order of magnitude larger than the previous
Fig. 5. Average error of calibration (with standard deviation of average
ones and contain several pumps, pressure-reducing valves, and so
as error bars) as function of number of sensors for C-town network.
forth. Since even the C-town network includes over 400 nodes,
ep / max(ep ) [-]
0.6 0.6
0.4 0.4
Downloaded from ascelibrary.org by "Indian Institute of Science, Bangalore" on 07/12/21. Copyright ASCE. For personal use only; all rights reserved.
0.2 0.2
0 0
0 5 10 15 0 1 2 3 4 5 6 7 8
no. of sensors no. of sensors
Fig. 6. Average error of calibration (with standard deviation of average Fig. 8. Average error of calibration (with standard deviation of average
as error bars) as function of number of sensors for KY2 network. as error bars) as function of number of sensors for Network II from
Sopron Waterworks.
Real-Life WDSs
The real water distribution networks are derived from Sopron, a city are widely spread out and are mostly located around the edge of the
in Hungary where several tens of thousands of people live and the network.
examined systems are in different zones. Their technical data can be Finally, it is essential to highlight the CPU time that is required
found in Table 2, and the layouts are depicted in Fig. 2. Similarly to to find an optimal layout. Table 4 contains the runtime values in
the C-town case, for the evaluation of the optimization methods, seconds for both Sopron networks and C-town. It can be seen that
MATLAB’s built-in genetic algorithm function was used. The re- the covariance-based methods (Cov-A and Cov-D) require a mas-
sults can be seen in Figs. 7 and 8. At SN-I HDS, the Smax and sive amount of CPU time since they contain expensive matrix op-
Spath techniques are relatively close to each other, the latter slightly erations (e.g., pseudoinverse, determinant). In contrast, the HDS
outsmarts the others, while at SN-II Shannon is also among the best and Spath techniques are computationally inexpensive.
ones. It is interesting to note that these networks only require four
(SN-I) and two (SN-II) transducers to achieve a sufficiently cali- Conclusions
brated model. Moreover, the placement of additional sensors does
not increase the accuracy significantly. The optimal sensor place- This paper introduced an objective comparison of different sensor
ments are depicted in Fig. 9. Similarly to C-town, the transducers placement techniques for roughness calibration. A novel calibration
technique for hydraulic systems was introduced that is capable of
adjusting the roughness coefficients of each pipe individually with
low CPU time, creating the possibility of statistical comparison be-
Sopron Network I tween sampling strategies.
Cov-A The proposed calibration technique is based on the least-norm
1 Cov-D solution of underdetermined linear systems utilizing SVD and
Smax
Shannon pseudoinverse. This approach is deterministic in the sense that,
0.8
Spath even though the parameter vector is improved in several steps,
HDS
no heuristic optimization method is used. A novel iteration-free
pressure sensor placement strategy was also proposed based on
ep / max(ep ) [-]
Fig. 9. “Ideal” sensor placements for each network with minimum number of sensors required to decrease initial (uncalibrated) error to 20%.
systems, the Spath, Smax, and the presented HDS technique gives The main advantage of the proposed technique lies in its com-
the most accurate calibration for a given number of sensors. petitive performance and its computational efficiency. Apart from
However, the Spath technique struggled with the Grid layout the C-town problem, the HDS technique gave similar results to the
(consisting of many identical pipes). It was also interesting to Smax approach within a fraction of the CPU time (Table 4). The
see that the tree-like C-town system turned out to be a “challeng- authors speculate that this advantage becomes more pronounced
ing” problem for all approaches, clearly because of its lack of topo- with increases in network size. Furthermore, HDS is free from
logical connectivity. any unpredictability found in heuristic optimization techniques.
However, how uncertainties of other model parameters (e.g., de-
mand, pipe diameter) affected the presented HDS technique is a
Table 4. CPU time in seconds to calculate optimal measurement layouts question for further research.
for different methods Considering the “best layouts” for calibration (Fig. 9), it can be
concluded that the pressure transducers should be located far from
Network Cov-A Cov-D Smax Shannon Spath HDS
points of prescribed pressure; moreover, they were also found to be
C-town 6,750 3,200 40 20 4 <1 “hydraulically far away” from each other, which was observed
KY2 4,000 11,800 32 30 6 <1 regardless of the actual sensor deployment technique. However,
SN-I 400 100 10 10 3 <1 in the case of tree-like networks, it is important to cover as many
SN-II 8,750 2,066 20 14 10 <1
branches as possible.