You are on page 1of 15

Environmental Science and Pollution Research (2023) 30:2530–2543

https://doi.org/10.1007/s11356-022-22335-y

RESEARCH ARTICLE

An efficient supply management in water flow network using graph


spectral techniques
Tamilselvi Gopalsamy1   · Vasanthi Thankappan1 · Sundar Chandramohan2

Received: 7 February 2022 / Accepted: 28 July 2022 / Published online: 6 August 2022
© The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2022

Abstract
An efficient way of distributing water in urban cities is one of the major global challenges of our time. A Water Distri-
bution Network (WDN) is a vital infrastructure, intended to provide fresh water to households within a city or desig-
nated boundary. Given the WDN’s complexity, effective numerical techniques are required to aid in the development
of an ideal monitoring system. Flow meters and gate valves should be placed in low-connected areas of the WDN with
water flow reaching several regions of the network. This study proposes a general strategy for assisting water utilities
in making decisions for effective water supply. The aim of the research is to use weighted spectral clustering algo-
rithms to outline water districts while addressing hydraulic restrictions via weighted adjacency and Laplacian matrices
of the weighted network. This work aims to identify influenced nodes in the network based on Eigen centrality and
effectively distribute water across those nodes. This project also looks at how to measure the network’s connection
strength to avoid water leaks. The best clusters are found using Eigenvalues and Eigenvectors of weighted adjacency
matrices and Laplacian matrices of the water network in the proposed graph spectral framework. In order to establish
the optimum water network division approach, topological and graph metrics were used to compare multiple spectral
clustering techniques. The proposed graph spectral approach is tested using a genuine water distribution network serv-
ing an urban area of Coimbatore city in India and offering a method for partitioning complex networks that employ the
spectral graph partitioning algorithm.

Keywords  Weighted graph · Water network · Graph spectral · Spectral clustering · Water network partitioning · Laplacian
matrix

Abbreviations GST Graph Spectral Techniques


WDN Water Distribution Network EPA Environmental Protection Agency
DMA District Meter Area
WNC Water Network Clustering
Introduction
Responsible Editor: Arshian Sharif
Increasing population and urbanization in the past few
Vasanthi Thankappan and Sundar Chandramohan contributed decades has to lead an exponential increase in the use of
equally to this work.
water. This increase in the demand for water networks to be
* Tamilselvi Gopalsamy extremely efficient additionally highly measurable. District
gts.maths@psgtech.ac.in Metered Areas (sectorisation) is one among the foremost
Vasanthi Thankappan promising methods of improving the water supply quali-
tvi.maths@psgtech.ac.in tatively and quantitatively (Nardo et al. 2014). In normal,
Sundar Chandramohan water is supplied to a specific region from elevated stor-
csr.mca@psgtech.ac.in age tank. If one such region is broken down into multiple
smaller sub-regions where it is able to monitor water input
1
Department of Mathematics, PSG College of Technology, to and consumption in each of such sub-regions, these sub-
Coimbatore 641004, Tamil Nadu, India
regions are referred to as District Metered Areas. Kozelj
2
Department of Computer Applications, PSG College et al. (2017) developed weighted spectral partitioning for
of Technology, Coimbatore 641004, Tamil Nadu, India

13

Content courtesy of Springer Nature, terms of use apply. Rights reserved.


Environmental Science and Pollution Research (2023) 30:2530–2543 2531

DMA design, which additionally included hydraulic param- Spectral graph theory
eters in the partitioning process. Nardo et al. (2017) investi-
gated three distinct weighted spectral clustering algorithms. The eigenvalues and eigenvectors of matrices related
A graph partitioning-based technique is suggested for to graphs are studied in spectral graph theory, a field
detecting leaking nodes or pipes in water distribution net- of mathematics. It’s a term that’s commonly used to
works. The method integrates graph theory and cluster algo- describe the properties of complex networks and Inter-
rithms, and it was formulated as an optimization problem to net infrastructural facilities. The adjacency matrix
find the ideal centroid corresponding to a cluster(DMA) by and the Laplacian matrix are two frequent matrices in
Han and Liu (2017). Eigenvector centrality was used to find spectral graph theory. The graph’s eigenvalues are the
the supply point of each DMA, as well as the meter posi- eigenvalues of the adjacency matrix A of graph G. The
tion. Rajeswaran et al. (2016) suggested a graph partition- spectrum of graph G is made up of its eigenvalues. The
ing-based technique for detecting leaking nodes or pipelines benefit of spectral graph parameters is that they transmit
in water distribution systems. This algorithm entails solv- a great deal of information about the graph’s structure
ing a multi-objective optimization problem that simulates (Jovanovic et al. 2017).
hierarchical network partitioning as closely as possible. A Mathematical optimization, communication links, net-
spectral clustering method for developing DMA configura- work topology, information retrieval, machine learning,
tion is described in this paper. routing protocols, multi - processor interconnection net-
Converting a water distribution network into District works, anti-virus preservation, information sharing, data
Metered Areas(DMAs) is the first step toward reducing analysis databases, social media applications, quantum
water loss. A water flow network is a complicated net- computing, bioinformatics, coding theory, control theory,
work comprised of nodes that represent water sources and other disciplines have all influenced graph spectra the-
and consumption points, as well as connections that rep- ory. The eigenvalues (spectrum) of specific matrices pro-
resent pipes and valves. The flow of water from the source duced from the graph or information set are used in spectral
through the treatment plant and finally to the distribution clustering. Construct these matrices using the eigenvectors,
region. Free-flow conduits (gravity flow canals and tun- analyse their spectrum, and assign knowledge to clusters.
nels) or pressurized pipes (pumping system) or a combi-
nation of the two (combination of gravity flow and pump)
can be used. Graph adjacency matrix and its spectrum
Metropolitan cities water needs are addressed via a water
flow network that connects many diverse locations. Hydraulic Let G = (V, E) be an undirected graph with V being the
models are used to offer detailed information about the net- set of n vertices vi (the junctions and the reservoirs) and
work’s hydraulic operation, such as journey times and flow E being the set of m edges ei (the pipes connecting the
rates. A quantity can flow in a specific channel in a network; junctions ); a useful method to characterise a graph is to
such a flow network is a weighted graph, with the weight of construct its Adjacency matrix A = [aij ]  , with aij = 1 if
each arc referred to as the arc’s capacity. Matrix representa- nodes i and j have a link and and aij = 0 otherwise. The
tions such as the adjacency matrix and the Laplacian matrix Weighted Adjacency matrix W = [wij ] can be created when
can be used to examine the graph. Graph Spectral Techniques connection information is available; link weights represent
(GSTs) allow for the identification of the most influential nodes the capacity of links between nodes in terms of proximity
in networks based on their topological structure (Jovanovic and/or similarity. The non-negative weight is wij = wji > 0
et al. 2017). if i and j are linked; otherwise, wij = 0.
Ranking water distribution network(WDN) nodes might be The Spectral Radius or Index 𝜆1 , which is the larg-
beneficial for selecting ideal nodes where several devices such est eigenvalue of the Adjacency graph matrix A (or the
as regulatory valves, quality sensors, flow metres, and other Weighted Adjacency matrix W in the case of a weighted
devices are located. GSTs are used to do these crucial tasks graph) in this technique, can be determined; it is impor-
even when additional information, such as network topology tant in modeling the propagation of moving substances
and geometric information about pipes on the network, is in a network (Arsic et al. 2012). The Principal eigenvec-
unavailable. GSTs are further supported by the fact that linear tor v1 of the greatest Adjacency matrix’s eigenvalue (or
algebra methods can be used to efficiently approach them. the Weighted Adjacency matrix’s eigenvalue in the case
They have been used effectively in WDNs for a variety of of a weighted graph’s eigenvalue) of a connected graph
purposes, including robustness analysis and water network is thus related to 𝜆1 ; Its eigenvector centrality attribute
partitioning. allows one to rank network vertices by their coordinates

13

Content courtesy of Springer Nature, terms of use apply. Rights reserved.


2532 Environmental Science and Pollution Research (2023) 30:2530–2543

according to the number of walks. Nodes with link- metrics used constitute an ensemble of performance
ages to high-connected vertices are more “important” proxies for shortest routes, node-link combinatorics, and
for communication than equal connected nodes with spectral-based features (graph Laplacian normalizations).
less connected neighbours, taking into account not These network ideas can give a wealth of fresh insights
only direct neighbours of vertices but also neighbours for integrating traditional statistical models with
of neighbours. The use of major eigenvectors in web infrastructure performance prediction modelling. This
search engines is a common application (the most known water study uses connectivity metrics including “spectral
systems are Page Rank) utilized in Google (Jovanovic gap(Δ𝜆 )” and “algebraic connectivity(AlgC)” to evaluate
et al. 2017; Wan et al. 2011). The spectral gap is the gap the network’s well-connectedness and strength (Yazdani
between the adjacency matrix A’s first two eigenvalues. and Jeffrey 2011).
This is quite useful for determining the network’s con-
nection strength.
Methodology
Graph laplacian matrix and spectral clustering
Steps in using EPANET
The Laplacian Matrix L is another required matrix of the
weighted graph G = (V, E, W) , where V is a set of nodes, The United States Environmental Protection Agency’s
E is a set of edges, and W is a set of weights. The Lapla- (EPA) Water Supply and Water Resources Division pro-
cian Matrix L is a difference for an undirected graph, duced EPANET, a public domain water distribution sys-
L = D − W  , where D is the degree matrix and W is the tem modelling software programme. EPANET is a water
graph’s weight matrix. The second lowest eigenvalue distribution network modelling programme that simulates
of the Laplacian matrix L (Luxburg 2007; Nardo et al. hydraulic and water quality behaviour over a long period
2017; Kelmans and Yong 1999) is known as algebraic of time (Athulya and Anjali 2020). It can manage any size
connectedness. It calculates the network connectedness water network and conduct hydraulic analysis. It’s designed
strength, and a higher number makes it more difficult to to be a research tool that helps researchers better understand
split a graph into more independent components. Graph how drinking water constituents flow through distribution
clustering is an important issue, and spectral clustering systems and where they end up.It keeps track of the water
has become a widely used method for partitioning graphs flow in each pipe, the pressure at each node, and the water
in network theory. These methods for clustering networks level in each tank across the network.
use the eigenvalues and eigenvectors of the graph nor-
malised Laplacian matrix LN = Dk −1∕2 LDk −1∕2 to assign • Initially, the extracted map can be used to create a net-
nodes and links to clusters (Spielman 2007). The eigen- work representation of the distribution system.
vector corresponding to the lowest nonzero eigenvalue of • After that, change the characteristics of the system’s
the Laplacian matrix is an essential calculation in spectral objects.
graph partitioning. The Fiedler vector is an eigenvector in • Then each node’s and pipe’s input parameters must be
undirected graphs that is connected to the minimal cut. appropriately assigned. Describe how the system func-
The bipartition cluster for the nodes in the graph can be tions.
determined based on the signs of the components of the • After that, choose from a variety of analysis alternatives.
Fiedler vector (Rajeswaran et al. 2016; Miroslav 1973). • Last but not least, perform a hydraulic/water quality
examination.
Graph theory metrics • The final stage is to look at the analysis outcomes.

To investigate how topological properties relate to WDN Model input parameters


performance for simplified pipe networks, graph theory
metrics such as spectral gap ( Δ𝜆 ), algebraic connectivity The following input data files are required to study the water
(AlgC), average degree (AvgD), average shortest path distribution network using EPANET:
length (APL), number of edges (m), network density (q),
network diameter (Dm), network radius (R), Maximum Junction report
gap between eigenvalues (EigenGap), and single degree
nodes (DE) were used Torres et  al. (2016); Yazdani Junctions are sites in the network where links connect and
and Jeffrey (2010b). Within the network sciences, these water enters or exits the system. The following are the fun-
metrics reflect a small subset of a large and expanding damental input data for junctions:
array of graph theory characteristics. The graph theory

13

Content courtesy of Springer Nature, terms of use apply. Rights reserved.


Environmental Science and Pollution Research (2023) 30:2530–2543 2533

• Elevation over a point of reference (usually mean sea 8. Obtain k clusters from V using the k-means clustering
level) technique (Park et al. 2020).
• Demand for water (rate of withdrawal from the network) 9. Define the collection of edge-cuts (also known as bound-
• Water quality at the start ary pipes) Nec

The following are the output results generated for junctions Water network dividing
at all simulation time periods:
After collecting the set edge-cut (boundary pipes) Nec , the
• Hydraulic system (internal energy per unit weight of number of boundary pipes that must be closed with Nbv gate
fluid) valves or utilized for installation with the number of flow
• Pressure meters Nfm = (Nec − Nbv ) (Nardo et al. 2016) must be deter-
• The quality of the water mined. In other words, once spectral clustering has deter-
mined the feasible sites Nec (boundary pipes between clus-
ters) for flow meters and boundary valves, and the number
Pipe report
of flow meters Nfm to be installed in the network (Fig. 1).
Pipes are links that allow water to flow from one place in
the network to another. At all times, EPANET assumes
Study area
that all pipelines are filled. The flow is from the greater
hydraulic head (internal energy per weight of water) to the
EPANET Software (Athulya and Anjali 2020) was used
lower hydraulic head. The following are the most important
to create a small water distribution network in the study
hydraulic input parameters for pipes:
region of Coimbatore District in Tamil Nadu, India which
consists of 19 pipelines, 27 junctions, and one major res-
• Start and end nodes
ervoir (Fig. 2). The Hazen-William Approach is used to
• Diameter
calculate the pressure. All of the junctions pressures are
• Length
confirmed to be appropriate. The pipe used had a mini-
• Roughness coefficient (for determining Head-loss)
mum diameter of 400 mm. In the pressure head, there is
• Status (open, closed, or contains a check valve)
a variation. The pipe has a roughness coefficient of 100
throughout the network.
Water flow network clustering The following are some of the pipe output results (Table 1):

The clustering phase for the proposed water network parti- • Flow rate
tioning (Nardo et al. 2016, 2018b) entails the phases listed • Velocity
below: • Head-loss
• Darcy-Weisbach friction factor
1. The water supply network is represented as a graph G =
(V,E); The product of the pipe’s area and the liquid’s velocity will
2. Adjacency matrix A and weight matrix W ( WU without be used to compute the rate of liquid flow. The flow rate’s
weight, WF with flow rate, WV with velocity, WH with absolute value always indicates the flow rate via the pipe,
Unit head loss, and WY with friction factor) were to be while a negative sign indicates direction. The quantity of
defined. head loss is determined by a variety of elements, including
3. For all weight matrices W, computation of the spectrum pipe length, flow velocity, diameter, and pipe roughness,
of Laplacian matrices LF / LV / LH  / LY and normalised which influences the friction variable and Reynolds number
Laplacian matrices LNF/LNV / LNH  / LNY ; in the flow.
4. For each spectrum, compute the eigenvector v2 corre- One of three formulae can be used to calculate the
sponding to the second smallest eigenvalue xv2. hydraulic head loss caused by water moving in a pipe due to
5. For the graph bipartition, sort the vertices in ascending friction with the pipe walls:
order by their xv2 of eigenvector v2 components value,
and then divide them into two groups by the sign of the • Hazen-Williams formula
component. • Darcy-Weisbach formula
6. Verify the consistency of the obtained clusters. • Chezy-Manning formula
7. When k > 2 , recursive bipartition of the current sub-
graph;

13

Content courtesy of Springer Nature, terms of use apply. Rights reserved.


2534 Environmental Science and Pollution Research (2023) 30:2530–2543

Fig. 1  Water network Division

The most often used head loss formula is the Hazen-Wil- Results and discussions
liams formula, where the Friction factor is calculated by the
Reynolds number in the flow and the degree of roughness Several researchers (Nardo et al. 2018b; Yazdani and Jef-
of the pipe’s inner surface. frey 2010a; Torres et al. 2016; Giudicianni et al. 2018)
have investigated the statistical and topological aspects

Fig. 2  Distribution network
diagram of the study area

13

Content courtesy of Springer Nature, terms of use apply. Rights reserved.


Environmental Science and Pollution Research (2023) 30:2530–2543 2535

of various systems and infrastructures in recent years of WDNs, various metrics from complex network theory
in order to develop unique solutions and better under- can be used by Yazdani and Jeffrey (2010a) and Giudicianni
stand what kind of network is required to support and et al. (2018).
optimise the systems functionality. In this context, the De Arruda et al. (2014) found that multiple central-
topological properties of WDNs are investigated in this ity metrics can accurately determine a node’s spread-
work using real-world and synthetic systems. Accord- ing capability. Other researchers (Yoo et al. 2015) used
ing to Nardo et  al. (2018b), the WDNs of Parete and betweenness centrality to locate water quality sensors
C-Town are ranked based on the score assigned by the in water distribution networks, although their methods
associated coordinates to the Adjacency matrix’s first used hydraulic modelling data as input. The proposed
eigenvector, v1 . Nardo et al. (2018b) proposed using the topological solution, on the other hand, does not require
main eigenvector (Newman 2008) as a centrality measure any hydraulic modelling, has a very short calculation
and spectral clustering (Shi and Malik 2000) to define time, and is very straightforward to apply for water
the k subsets of nodes in their original technique. In con- utilities.
trast, betweenness centrality (Freeman 1977) is used to In order to improve the suggested methodology for
identify the positions of monitoring points in the WDN. water quality monitoring, the effectiveness of other met-
It was first used to examine the value of an individual in rics is considered here, including correct weights, for
a social network (Boccaletti et al. 2006). topological approaches. The proposed dynamic DMAs
Betweenness was chosen as the centrality metric in this technology is being tested on Coimbatore’s water utility
study because it evaluates the importance of a node not network in Tamil Nadu (India).
only in terms of nearby nodes but also in terms of the net- All of the measurements and algorithms based on
work as a whole (Freeman 1977). To assess the behaviour Graph Spectral Techniques listed above sections can

Table 1  Output results of Link Start End Length Diameter Flow Velocity Unit Headloss Friction factor
network Pipes
P1 R1 j1 10 400 5301 42.18 4017.05 0.018
P2 j1 j2 16 400 786.05 6.26 117.16 0.024
P3 j2 j3 50 400 676.05 5.38 88.62 0.024
P4 j4 j3 12 400 −476.05 3.79 46.29 0.025
P5 j5 j4 10 400 −376.05 2.99 29.88 0.026
P6 j5 j1 12 400 −1893.11 15.06 596.65 0.021
P7 j1 j9 12 400 2121.84 16.88 737 0.02
P8 j8 j9 12 400 −882.97 7.03 145.31 0.023
P9 j9 j13 12 400 988.86 7.87 179.24 0.023
P10 j13 j12 23 400 378.86 3.01 30.31 0.026
P11 j12 j10 12 400 −115.95 0.92 3.37 0.031
P12 j10 j8 10 400 −746.11 5.94 106.38 0.024
P13 j7 j8 12 400 −16.86 0.13 0.1 0.043
P14 j6 j7 10 400 773.43 6.15 113.7 0.024
P15 j5 j6 12 400 1023.43 8.14 191 0.023
P16 j7 j11 12 400 655.29 5.21 83.64 0.024
P17 j12 j11 12 400 −188.1 1.5 8.28 0.029
P18 j10 j17 12 400 189.26 1.51 8.38 0.029
P19 j17 j11 10 400 −267.82 2.13 15.95 0.028
P20 j14 j11 14 400 200.63 1.6 9.35 0.029
P21 j14 j16 12 400 −695.73 5.54 93.46 0.024
P22 j16 j5 16 400 −1045.73 8.32 198.76 0.023
P23 j12 j15 12 400 182.92 1.46 7.89 0.029
P24 j17 j15 10 400 117.08 0.93 3.45 0.031
P25 j10 j18 12 400 204.9 1.63 9.72 0.029
P26 j19 j18 13 400 195.1 1.55 8.88 0.029
P27 j14 j19 10 400 295.1 2.35 19.11 0.027

13

Content courtesy of Springer Nature, terms of use apply. Rights reserved.


2536 Environmental Science and Pollution Research (2023) 30:2530–2543

Table 2  Graph metrics on Network m N m0 Δ𝜆 AlgC 𝜆1 q APL Dm AvgD Eigen gap R


network of study area
Metrices 26 19 1 0.68 0.29 3.05 0.15 2.85 6 2.74 0.19 4

be thought of as a GST toolbox capable of dealing with breakdown of the original network into several sub-
crucial water distribution network management con- regions is simple.
cerns. GSTs are assessed in Coimbatore city on a real- The WDN’s suggested approach was put to the test in
world, small-scale water infrastructure. a real-world case study of Coimbatore’s water delivery
The major topological and energy features of the WDN are network, which is located in a heavily populated region.
summarized in the Table 2. The weights in the case of the WDNs The network has m = 26 links (except for the reservoir
might indicate prior knowledge of the pipe’s geometric and link) and n = 19 nodes, and it is a sparse network from a
hydraulic characteristics (flow, velocity, friction factor and oth- topological viewpoint, which is typical of most real sys-
ers) (Fig. 3). tems. It is fully connected, with m << n2 edges, and a link
Table  2 shows the water network eigenvalues for density value of 0.15.
the case study. The multiplicity of the zero eigenvalue The average node degree AvgD = 2.74 is low because
in the Adjacency matrix is m0 = 1 . This indicates that the number of edges that may be coupled to a single node
the Coimbatore WDN has only one linked component. in a spatial network is limited by physical space. The
It’s worth mentioning that even for complex network example study has a low average path length APL = 2.85 ,
models (made of multiple components), determining if indicating that the network is cohesive and resilient
any irregularity in the water supply is caused by the (Yazdani and Jeffrey 2011), as well as a graph dimen-
sion Dm = 6 , indicating that the nodes are mutually and

Fig. 3  Network diagram of the study area with unweighted, flow value, velocity, unit head loss, friction factor

13

Content courtesy of Springer Nature, terms of use apply. Rights reserved.


Environmental Science and Pollution Research (2023) 30:2530–2543 2537

easily approachable, and that the network is arranged For spectral clustering, the eigengap heuristic (Luxburg
decentrally (Yazdani and Jeffrey 2010b) (In the case of 2007) is used to all five graph Laplacian matrices, with
hydraulic networks). the number of clusters k chosen so that all eigenvalues 𝜆1
The “spectral gap” Δ𝜆 (the difference between the two ,...,𝜆k are small and similar, but 𝜆k+1 is larger (Nardo et al.
greatest eigenvalues of the adjacency matrix) is equal to 2018b). In comparison to the no-weight Laplacian matrix,
0.68, while the “algebraic connectivity” (AlgC) (Miroslav where the eigenvalues are shown, the most optimal number
1973) is equal to 0.29. These are smaller numbers, indicat- of clusters is four when employing the K Means Algorithm
ing that the graph layout may be split down into discrete based on normalized maximum eigen gap (Alshammari
portions (clusters or districts). There is no single degree et al. 2021; Park et al. 2020; Filippi et al. 2021), as illus-
node in the network, indicating that this network has no trated in Fig. 1 for the case study of the Coimbatore water
dead-end connections. This network system is not lower network (Fig. 5).
level, as shown by the radius R = 4. After fixing the number k = 4 of clusters into which the
Density indicates the relationship between the number network is partitioned, the suggested partitioning technique
of existing pipes and the maximum number of pipe con- gives spectral clustering of the Coimbatore water supply net-
nections that may be made. Sparse networks are those hav- work (Perelman et al. 2014; Alshammari et al. 2021).
ing a low density. Well-connected networks, on the whole, The process of rating WDN nodes and subsequently
result in fewer unit headlosses. For ‘m’ and ‘AlgC,’ the determining the most significant nodes is presented in
headloss trend is similar in that increases in related graph Graph Spectral Theory. The WDNs of Coimbatore are
metrics result in decreased maximum unit headloss. This ranked (Rahman et al. 2019) in Fig. 6 according to the
network’s Eigen gap is 0.19, which is small and implies score provided to the Adjacency matrix’s first eigenvec-
that there are more connections between cluster regions tor (Principal eigenvector), v1 , by the related Coordinates.
(Nardo et al. 2018b) (Fig. 4). The ability to rate WDN nodes is essential for identifying

Fig. 4  Adjacency Eigenvalues for the case study with unweighted, flow value, velocity, unit head loss, friction factor

13

Content courtesy of Springer Nature, terms of use apply. Rights reserved.


2538 Environmental Science and Pollution Research (2023) 30:2530–2543

Fig. 5  Laplacian Eigenvalues for the case study with unweighted, flow value, velocity, unit head loss, friction factor

the suitable nodes for locating devices (i.e., chlorine sta- the network. These divide the network nodes according on
tions, pressure regulation valves, quality sensors, flow the sign of the Fiedler eigenvector’s associated value (posi-
meters, etc.). When approaching the placement of a valve tive or negative). Because each node in a cluster is linked
or sensor in a water distribution network, it’s helpful to to at least another node in the same cluster, this strategy
rank the nodes relevance. The most “powerful” or sig- ensures that each designated cluster is maintained.
nificant nodes have also been identified in Fig. 7. The If the ideal number of clusters (Fig. 7) (determined by
most relevant nodes are also highlighted in Fig. 7 based the biggest eigengap) is more than two, the Fiedler eigen-
on the eigenvector centrality criterion. The top centrality vector v2 can be used as input for a recursive bisection
node per each DMA of the Coimbatore city partitioned technique (Fig. 9). That is, the Fiedler eigenvector v2 for
WDNs is shown in the results. Regarding the clustering of each cluster is being determined for the next clustering
WDNs, the procedure focuses on each Adjacency matrix until the required number of clusters is reached. Other
associated with water distribution sub-networks. From a recursive algorithms that need an initial random cluster
topological and connectivity perspective, the eigenvector selection can utilise this network bisection as a beginning
centrality offers the majority of the important nodes per layout (Santo 2010).
cluster or DMA. The most influential nodes can also be Based on the number of neighbours of neighbour-
utilized as a starting point for creating device localization ing nodes, the eigenvector centrality (Nardo et al. 2018a)
algorithms. might create the most “influential” nodes. Furthermore,
Using spectral clustering to utilize the properties of the the accepted centrality metric can be changed by weight-
Laplacian matrix spectrum is a valid and effective method. ing network pipes (Giovanni et al. 2021) with hydraulic or
Figure 8 depicts the Fiedler eigenvectors, v2 , for Coimbatore geometrical data, such as pipe diameter and length, or the
WDNs. As previously proven, the coordinates of the second number of users served by each node. The network central-
eigenvector, v2 , easily offer an optimum bipartition layout for ity idea is supported by the ability to detect which nodes are

13

Content courtesy of Springer Nature, terms of use apply. Rights reserved.


Environmental Science and Pollution Research (2023) 30:2530–2543 2539


visited by the most connections. As a result, central nodes Fig. 7  Optimal clustering with the influential node using eigen
are regarded as critical nodes for network connection clus- centrality layout of the case study with unweighted, flow value,
velocity, unit head loss, friction factor
tering, and they have control over broad network regions.
Using the eigenvector centrality approach, Fig. 7 illustrates
the most significant nodes. The highest centrality node of performance while also simplifying the water budget com-
the Coimbatore partitioned WDNs is shown by the findings putation and allowing for simpler identification of water
for each DMA. losses. In all cases, the number of gate valves is clearly
The technique next concentrates on each and every equal to the difference Nbv = Nec − Nfm . Another major
Adjacency matrix connected with water distribution finding of the article is that, even when the most efficient
sub-networks after the WDNs have been clustered. The location of quality detection points is determined using
eigenvector centrality finds the most influential nodes per an optimization approach, installing all of the flow meters
cluster or DMA (Giudicianni et al. 2020) from a topologi- required to discover all possible contamination scenarios.
cal and connectivity aspect. The water network clustering The major goal is to keep the number of flow metres Nfm
(WNC) is focused on the definition of the shape and the as low as feasible in order to make the water budget cal-
dimensions of each DMA; starting from the original layout culation as simple as possible (Nardo et al. 2017, 2018b).
of the network (Fig. 9) the clustering algorithm groups
nodes reducing the number of cuts (Fig. 7). The second
step of water network division (WND) entails splitting Conclusions
the network by deciding which pipes will carry the flow
metres and gate valves (Fig. 10). For all combinations, the This research descr ibed a unique approach for
number of flow metres has been set at Nfm = 5 , which is determining the ideal number of clusters in a WDN
the smallest number that ensures the network’s hydraulic using the eigenvalues of the Graph Laplacian and the

Fig. 6  Principal eigenvector v1 coordinates for the case study with unweighted, flow value, velocity, unit head loss, friction factor

13

Content courtesy of Springer Nature, terms of use apply. Rights reserved.


2540 Environmental Science and Pollution Research (2023) 30:2530–2543

13

Content courtesy of Springer Nature, terms of use apply. Rights reserved.


Environmental Science and Pollution Research (2023) 30:2530–2543 2541

Fig. 8  Fiedler eigenvector v2 coordinates for the case study with unweighted, flow value, velocity, unit head loss, friction factor

eigenvectors to get the actual cluster labels, which The optimum result was identified in this study using
were evaluated using several formulations. Water the spectral clustering technique and unweighted and
supply network management was the emphasis of the weighted matrices. This differs from prior research in the
partitioning of a WDN into DMA. In this regard, this literature, which used a variety of clustering approaches
research presents the results of a related application and found that weighted matrices produced the best
of weighted spectral clustering for water network results. In the future, other spectral graph partitioning
partitioning to a WDN. Unweighted, flow, velocity, methods such as K-way spectral graph partitioning,
unit head loss, and friction factor were chosen as ratio cut partitioning, bipartite isoperimetric graph
alternate weights for reducing edge-cuts and balancing partitioning and directed spectral graph partitioning will
the size of the clusters. Eigenvector centrality was also be implemented and tested on different complex WDNs.
used to identify influential nodes for each clustering. We will also improve the proposed DMA connection
When the data in this study did not fit the constraints method by taking into account the water mains. As a
of other typical techniques, spectral clustering proved result, more research is needed to establish a weighted
to be a very flexible strategy for discovering clusters. spectral clustering technique with broad validity that
These results are directly dependent on pipe capability uses diverse parameters to define DMAs. There is a
distribution and so cannot be relied on for general need for further evidence to establish the relationships
validity. This work accomplished the goal of creating between pipe network topology and WDS performance.
the quick graph partitioning technique for DMA design A variety of pipe network topologies, hydraulic property
and putting it to the test on a real-world WDN. The modifications, graph metrics, and statistical models will
ultimate solution was chosen, evaluated, and discussed. be investigated and evaluated.

13

Content courtesy of Springer Nature, terms of use apply. Rights reserved.


2542 Environmental Science and Pollution Research (2023) 30:2530–2543

Fig. 9  Recursive bisection process for the case study with unweight, flow value, velocity, unit head loss, friction factor

Fig. 10  Flow meter and


Gate valve for Unweighted and
Weighted Network

13

Content courtesy of Springer Nature, terms of use apply. Rights reserved.


Environmental Science and Pollution Research (2023) 30:2530–2543 2543

Author Contributions  Tamilselvi, G: writing original draft. Vasanthi, Luxburg UV (2007) A tutorial on spectral clustering. Stat Comput
T: review and supervision. Sundar, C: review and editing. 17(4):395–416
Miroslav F (1973) Algebraic connectivity of graphs. Czechoslov
Funding  This research received no external funding Math J 23:298
Nardo AD, Giudicianni C, Greco R, Herrera M (2018a) Sensor
placement in water distribution networks based on spectral
Availability of data and material  All data generated or analyzed during
algorithms. 13th International Conference on Hydroinformat-
this study are included in this article (and its supplementary informa-
ics HIC
tion files)
Nardo AD, Giudicianni C, Greco R, Manuel Herrera, Santonastaso
G (2018) Applications of graph spectral techniques to water
Code availability  Not applicable distribution network management. Water 10(1):45
Nardo AD, Giudicianni C, Greco R, Santonastaso GF (2016) Water
Declarations  supply network partitioning based on weighted spectral cluster-
ing. Complex Networks and Their Applications V
Nardo AD, Natale MD, Giudicianni C, Greco R, Nardo GFSD (2017)
Ethics approval  Not applicable
Weighted spectral clustering for water distribution network par-
titioning. Appl Netw Sci 2(1):1–16
Consent to participate  Not applicable
Nardo AD, Natale MD, Santonastaso GF, Tzatchkov VG, Alcocer-
Yamanaka VH (2014) Water network sectorization based on
Consent for publication  Not applicable
graphtheory and energy performance indices. Am Soc Civil Eng
140(5):620–629
Conflicts of interest  The authors declare no conflict of interest. Newman M (2008) Mathematics of Networks, pp 1–8
Park TJ, Han KJ, Kumar M, Shrikanth Narayanan (2020) Auto-tuning
spectral clustering for speaker diarization using normalized maxi-
mum eigengap. IEEE
Perelman L, Allen M, Preis A, MudasserIqbal, Whittle A J (2014)
References Automated sub-zoning of water distribution systems. Environ
Modell and Software
Alshammari M, Stavrakakis J, Takatsuka M (2021) Refining a k-nearest Rahman M, Akhter M, Meghanathan N (2019) Use of eigenvector cen-
neighbor graph for a computationally efficient spectral clustering. trality to rank the vertices in a disease-disease network. Advances
Pattern Recog 14:107869 in Intelligent Systems and Computing, 800
Arsic B, Cvetkovic D, Simic SK, Skaric M (2012) Graph spectral tech- Rajeswaran A, Narasimhan S, Narasimhan S (2016) A graph partition-
niques in computer sciences. Appl Anal Discret Math ing algorithm for leak detection in water distribution networks.
Athulya, Anjali K Ullas (2020) Design of water distribution network cs.DS
using epanet software. IRJET Santo F (2010) Community detection in graphs. Phys Rep 486:75–174
Boccaletti S, Latora V, Moreno Y, Chavez M, Hwang D (2006) Com- Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE
plex networks: structure and dynamics. Phys Rep 424:175–308 22:888–905
De Arruda G, Barbieri A, Rodríguez P, Rodrigues F, Moreno Y, Costa Spielman DA (2007) Spectral graph theory and its applications. 48th
L (2014) Role of centrality for the identification of influential Annual IEEE symposium on foundations of computer science
spreaders in complex networks. Phys Rev E - Stat Nonlinear Torres JM, Leonardo Duenas-Osorio M, Qilin Li M, Yazdani A (2016)
Filippi M, Rypina I, Hadjighasem A, Peacock T (2021) An optimized- Exploring topological effects on water distribution system per-
parameter spectral clustering approach to coherent structure detec- formance using graph theory and statistical models. American
tion in geophysical flows. Fluids 6:39 Society of Civil Engineers
Freeman L (1977) A set of measures of centrality based on between- Wan Y, Di Z, Fan Y (2011) Identifying and characterizing nodes
ness. Sociometry 40:35–41 important to community structure using the spectrum of the graph.
Giovanni FS, Armando DN, Enrico C, Dino M, Roberto G (2021) PLoS ONE 6(11):e27418
Comparison of topological, empirical and optimization-based Yazdani A, Jeffrey P (2010a) A complex network approach to robust-
approaches for locating quality detection points in water distribu- ness and vulnerability of spatially organized water distribution
tion networks. Environ Sci Pollut Res 28:33844–33853 networks
Giudicianni C, Herrera M, Nardo AD, Adeyeye K (2020) Automatic Yazdani A, Jeffrey P (2010b) Robustness and vulnerability analysis of
multiscale approach for water networks partitioning into dynamic water distribution networks using graph theoretic and complex
district metered areas. Water Resour Manag 34(2):835–848 network principles. Water Distribution Systems Analysis
Giudicianni C, Nardo AD, Natale MD, Greco R, Santonastaso GF, Yazdani A, Jeffrey P (2011) Complex network analysis of water distri-
Scala A (2018) Topological taxonomy of water distribution net- bution systems. Chaos Interdiscip J Nonlinear Sci 21(1):016111
works. Water 10(4):444 Yoo D, Chung G, Sadollah A, Kim J (2015) Applications of network
Han R, Liu J (2017) Spectral clustering and genetic algorithm for analysis and multi-objective genetic algorithm for selecting opti-
design of district metered areas in water distribution systems. mal water quality sensor locations in water distribution networks.
Procedia Eng 186:152–159 KSCE J Civ Eng 19:2333–2344
Jovanovic N, Jovanovic Z, Jevremovic A (2017) Complex networks
analysis by spectral graph theory. Sinteza Springer Nature or its licensor holds exclusive rights to this article under
Kelmans A, Yong X (1999) On the distributions of eigenvalues of a publishing agreement with the author(s) or other rightsholder(s);
graphs. Discrete Math 199:251–258 author self-archiving of the accepted manuscript version of this article
Kozelj D, Gorjup M, Kramar Fijavz M (2017) An application of is solely governed by the terms of such publishing agreement and
spectral graph partition for designing district metered areas in applicable law.
water supply networks. Acta Hydrotechnica 30(53):81–96

13

Content courtesy of Springer Nature, terms of use apply. Rights reserved.


Terms and Conditions
Springer Nature journal content, brought to you courtesy of Springer Nature Customer Service Center GmbH (“Springer Nature”).
Springer Nature supports a reasonable amount of sharing of research papers by authors, subscribers and authorised users (“Users”), for small-
scale personal, non-commercial use provided that all copyright, trade and service marks and other proprietary notices are maintained. By
accessing, sharing, receiving or otherwise using the Springer Nature journal content you agree to these terms of use (“Terms”). For these
purposes, Springer Nature considers academic use (by researchers and students) to be non-commercial.
These Terms are supplementary and will apply in addition to any applicable website terms and conditions, a relevant site licence or a personal
subscription. These Terms will prevail over any conflict or ambiguity with regards to the relevant terms, a site licence or a personal subscription
(to the extent of the conflict or ambiguity only). For Creative Commons-licensed articles, the terms of the Creative Commons license used will
apply.
We collect and use personal data to provide access to the Springer Nature journal content. We may also use these personal data internally within
ResearchGate and Springer Nature and as agreed share it, in an anonymised way, for purposes of tracking, analysis and reporting. We will not
otherwise disclose your personal data outside the ResearchGate or the Springer Nature group of companies unless we have your permission as
detailed in the Privacy Policy.
While Users may use the Springer Nature journal content for small scale, personal non-commercial use, it is important to note that Users may
not:

1. use such content for the purpose of providing other users with access on a regular or large scale basis or as a means to circumvent access
control;
2. use such content where to do so would be considered a criminal or statutory offence in any jurisdiction, or gives rise to civil liability, or is
otherwise unlawful;
3. falsely or misleadingly imply or suggest endorsement, approval , sponsorship, or association unless explicitly agreed to by Springer Nature in
writing;
4. use bots or other automated methods to access the content or redirect messages
5. override any security feature or exclusionary protocol; or
6. share the content in order to create substitute for Springer Nature products or services or a systematic database of Springer Nature journal
content.
In line with the restriction against commercial use, Springer Nature does not permit the creation of a product or service that creates revenue,
royalties, rent or income from our content or its inclusion as part of a paid for service or for other commercial gain. Springer Nature journal
content cannot be used for inter-library loans and librarians may not upload Springer Nature journal content on a large scale into their, or any
other, institutional repository.
These terms of use are reviewed regularly and may be amended at any time. Springer Nature is not obligated to publish any information or
content on this website and may remove it or features or functionality at our sole discretion, at any time with or without notice. Springer Nature
may revoke this licence to you at any time and remove access to any copies of the Springer Nature journal content which have been saved.
To the fullest extent permitted by law, Springer Nature makes no warranties, representations or guarantees to Users, either express or implied
with respect to the Springer nature journal content and all parties disclaim and waive any implied warranties or warranties imposed by law,
including merchantability or fitness for any particular purpose.
Please note that these rights do not automatically extend to content, data or other material published by Springer Nature that may be licensed
from third parties.
If you would like to use or distribute our Springer Nature journal content to a wider audience or on a regular basis or in any other manner not
expressly permitted by these Terms, please contact Springer Nature at

onlineservice@springernature.com

You might also like