Professional Documents
Culture Documents
AVANCERAD NIVÅ, 30 HP
STOCKHOLM, SVERIGE 2020
KRISTOFER ESPINOSA
TAM VU
KTH
SKOLAN FÖR INDUSTRIELL TEKNIK OCH MANAGEMENT
Abstract
1
Sammanfattning
Grafteori är ett matematiskt område där objekt och deras parvisa relationer,
även kända som noder respektive kanter, studeras. Grafteorins födsel anses ofta
ha ägt rum år 1736 när Leonhard Euler försökte lösa ett problem som
involverade sju broar i Königsberg i Preussen. På senare tid har grafer fått
uppmärksamhet från företag inom flera branscher på grund av dess kraft att
modellera och analysera stora nätverk.
Detta arbete undersöker användningen av grafteori inom energisektorn för ett
allmännyttigt företag, närmare bestämt Fortum, vars verksamhet består av, men
inte är begränsad till, produktion och distribution av el och värme. Arbetet
resulterar i en bred genomgång av grafteoretiska begrepp och deras tillämpningar
inom både allmänna tekniska sammanhang och i synnerhet energisektorn, samt
ett fallstudium där några begrepp sätts in i en djupare analys. Den valda
fallstudien inom ramen för arbetet är variabelselektering för
elprisprognostisering. Variabelselektering är en process för att minska antalet
ingångsvariabler, vilket vanligtvis genomförs innan en regressions- modell skapas
för att undvika överanpassning och öka modellens tydbarhet.
Fem grafbaserade metoder för variabelselektering med olika ståndpunkter
studeras. Experiment genomförs på realistiska datamängder med många
ingångsvariabler för att verifiera metodernas giltighet. En av datamängderna ägs
av Fortum och används för att prognostisera elpriset, bland andra viktiga
kvantiteter. De erhållna resultaten ser lovande ut enligt flera utvärderingsmått
och kan användas av Fortum som ett stödverktyg för att utveckla
prediktionsmodeller. I allmänhet kan ett energiföretag sannolikt dra fördel av
grafteori på många sätt och skapa värde i sin affär med hjälp av berikad
matematisk kunskap.
Nyckelord: grafteori, variabelselektering, energiindustri
2
Declaration
This study is conducted by students Kristofer Espinosa and Tam Vu, and
commissioned by Fortum Sverige AB.
3
Acknowledgements
We want to thank our supervisors from Fortum, Alexandra Bådenlid and Linda
Marklund Ramstedt, as well as our manager and mentor, Hans Bjerhag, for their
constant support and engagement. As well, we are thankful for the help we have
received from our respective supervisors at KTH, Elena Malakhatka and
Xiaoming Hu.
4
Contents
1 Introduction 1
1.1 Purpose and research question . . . . . . . . . . . . . . . . . . . . . 2
1.2 Research contribution . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3 Limitations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.4 Delimitations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.5 Disposition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
3 Methodology 15
3.1 Research design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
3.2 Research approach . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
3.3 Research layout . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
3.3.1 The preparation phase . . . . . . . . . . . . . . . . . . . . . 17
3.3.2 The idea generation and suggestion phases . . . . . . . . . . 17
3.3.3 The evaluation phase . . . . . . . . . . . . . . . . . . . . . . 21
3.3.4 The implementation phase . . . . . . . . . . . . . . . . . . . 24
3.4 Literature review . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
5
CONTENTS
Summary 49
II Selection of use-case 50
6 Idea generation 51
6.1 Inventory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
6.1.1 Clustering . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
6.1.2 Heat map . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
6
CONTENTS
7
CONTENTS
12 Preliminaries 118
12.1 Laplacian matrix . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118
12.2 Nearest neighbour graph . . . . . . . . . . . . . . . . . . . . . . . . 119
12.3 Graph clustering . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120
12.4 Comparison of two clusterings . . . . . . . . . . . . . . . . . . . . . 120
12.4.1 Clustering accuracy . . . . . . . . . . . . . . . . . . . . . . . 121
12.4.2 Normalised mutual information . . . . . . . . . . . . . . . . 121
12.4.3 Adjusted mutual information . . . . . . . . . . . . . . . . . 122
12.5 Similarity comparison of two sets . . . . . . . . . . . . . . . . . . . 122
8
CONTENTS
15 Discussion 144
15.1 Convergence speed . . . . . . . . . . . . . . . . . . . . . . . . . . . 144
15.2 Parameter sensitivity . . . . . . . . . . . . . . . . . . . . . . . . . . 145
15.3 Jaccard index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 145
15.4 Conceivable challenges . . . . . . . . . . . . . . . . . . . . . . . . . 146
15.5 Implications for forecasting activities . . . . . . . . . . . . . . . . . 146
15.6 Scalability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147
Bibliography 150
Appendices 176
9
CONTENTS
C Glossary
10
List of Figures
6.1 Heat map of past use-cases per cluster and per graph-related concept. 53
11
LIST OF FIGURES
12
List of Tables
14.1 Data sets with their numbers of samples (m) and features (n). . . . 140
13
LIST OF TABLES
14
CHAPTER 1
Introduction
1
1.1. PURPOSE AND RESEARCH QUESTION
2
1.2. RESEARCH CONTRIBUTION
1.3 Limitations
In this study, a trade-off is necessary between the number of applications
evaluated and the amount of details provided in each. Each application covered
is in fact subject for a study in itself. Consequently, the assessments are to be
considered at face-value, as a guideline for practitioners to where attention is to
be directed.
1.4 Delimitations
This study was commissioned by Fortum. Hence, the verticals and geographies of
the energy sector considered for the applications are aligned with the operations
of Fortum, with the exception of some of the more corporate activities, including
strategic and financial decision-making as well as regulatory and compliance
activities. The reason for this is to contain the research mainly to energy
industry-specific challenges.
1.5 Disposition
This paper comprises three parts.
Part I contains a wide overview with elementary concepts of graph theory and
selected applications with support of recent researches. This part aims to provide
3
1.5. DISPOSITION
4
CHAPTER 2
Market trends for utility
companies
5
2.2. A MORE COMPLEX PORTFOLIO
Despite this recent trend, the electrification of the residential and transportation
sector is adding uncertainty to the future electricity demand projections,
particularly in terms of peak demand, potentially driving up the need for the
total power system capacity, yet without necessarily increasing the demand [13].
6
2.3. EVOLVING TECHNOLOGY
The intermittent nature of certain DERs such as wind and solar causes their
operation to be different than conventional power sources. The uncertainty of the
weather conditions adds complexity to the day-ahead bidding strategy of
utilities, notably in terms of residual demand of the energy system. Not only is
their own power output more unpredictable, the output of competitors is too,
causing higher margins of error induced by inaccurate weather modelling [14].
As wind power plants and solar farms are more distributed, complexity is also
added on the maintenance-side. The portfolio of the utilities now encompasses
higher distributions in terms of asset location, ages and life cycles. Even though
OEMs oftentimes provide warranties for the wind and solar power plants,
utilities are incentivised to create their own intelligence on these matters to
attain a competitive edge.
7
2.5. CUSTOMER TRENDS
power generation, combined with the reduction in the system inertia provided by
heavier rotating equipment. Thus, increasing attention is channelled towards
providing ancillary services for the grid operator on the intra-day market as new
revenue sources for asset utilities [2].
On top of this, the energy commodities market, particularly coal, oil and gas, is
becoming all the more volatile and financialised, subject to recurring global
political and economic crises. In particular, natural gas is increasingly decoupled
from oil prices and subject to more fundamental forces. This puts pressure on
higher forecasting needs, whether or not a utility uses coal or gas as fuel,
impacting both their input price for electricity generation and their electricity
price projections [8].
8
2.6. DIGITALISATION OF UTILITY COMPANIES
9
2.7. THE EMERGENCE OF GRAPH ANALYTICS
10
2.7. THE EMERGENCE OF GRAPH ANALYTICS
11
2.7. THE EMERGENCE OF GRAPH ANALYTICS
SQL has remained a consistently popular choice for database administrators over
the years primarily due to its ease of use and the highly effective manner in which
it queries, manipulates, aggregates data and performs a wide range of other
functions to turn massive collections of structured data into usable information.
Benefits and drawbacks
Tabular databases have however two inherent drawbacks. The first one is that
relationships can only be described by foreign keys and as such are
descriptionless. This entails that the relationship cannot be described in full.
Another drawback of tabular databases is the complexity of traversing the
database. If a connection needs to be found between tables which are indirectly
linked, the user first needs to join all tables based on the foreign keys.
A basic example can help illustrate this. To find students in a given geographical
continent, the database manager would have to join the tables Student,
University, City, Country, Continent on their respective foreign keys. The query
would look like below:
SELECT * FROM students
JOIN universities ON universities.id = students.university_id
JOIN cities ON cities.id = universities.city_id
JOIN countries ON countries.id = cities.country_id
JOIN continents ON continents.id = countries.continent_id
WHERE continents.name = "Europe"
This query has two implications:
◦ The complexity of the queries can quickly increase as the distance between
A and B increases.
◦ The complexity of the calculation for the server can also increase
dramatically as the size of the database or the distance between A and B
increases, slowing down the response time and hence the productivity of
the organisation.
SQL is not the best choice for all database applications. For one thing, while
SQL had been effective at data scales up through the 1990s and beyond, it
started to falter at the hyperscale levels at the turn of the century. Some users
also complain of its sharding limitations hampering the ability to break large
databases into smaller, more manageable ones.
12
2.7. THE EMERGENCE OF GRAPH ANALYTICS
These drawbacks are what led to the creation of NoSQL and the more recent
NewSQL, which attempt to enhance the traditional SQL’s scalability without
sacrificing its inherent atomicity, consistency, isolation and durability (ACID),
critical components of stable databases.
13
2.7. THE EMERGENCE OF GRAPH ANALYTICS
14
CHAPTER 3
Methodology
15
3.2. RESEARCH APPROACH
and new insights can alter the direction of the study [17]. In general, the research
starts with a broad transversal perspective and narrows down as insights are
gained. Regarding information gathering, common approaches include literature
review, interviews with subject experts and the conducting of focus groups.
16
3.3. RESEARCH LAYOUT
This research focuses on the first steps of this process, that is, from the
generation of new ideas to the development of a proof-of-concept for a selected
idea. Therefore, the more narrow field of research of idea management was taken
inspiration from for the research layout. Idea Management (IM) is a sub-process
of innovation management, aiming at structuring and streamlining the idea
generation, evaluation and selection processes [32]. It is also referred to as the
“front-end” of innovation, as the ideas are generally generated by the employees
themselves to address their specific problems and needs. The systematic
approach adopted in IM is being implemented by many large firms to cope with
an otherwise “fuzzy” nature of the front-end of innovation management, bearing
a high level of informality and uncertainty [23, 24].
The research layout is explained in more details below. It followed the main
steps proposed by Gerlach and Brem in their guidelines for IM practitioners [25],
namely the preparation phase, the idea generation and suggestion phases, the
evaluation phase and the implementation phase.
17
3.3. RESEARCH LAYOUT
possibilities, this research proposes a data-driven approach for the idea generation
phase. The interviews conducted helped identifying critical problems in each
sector of activity of the firm, seeding the search for past use-cases of graph-based
solutions and contributing to assess the relevance of these to the organisation.
A vast inventory of use-cases was compiled across the core activities at the utility
company, spanning all possible types of graph-theoretic solutions, both from
academia and the industry. In parallel, interviews were conducted to understand
the challenges and opportunities in the various sectors Fortum operates in,
providing deeper insights in the relevancy of the inventoried use-cases and their
implications. The use-cases were clustered into areas of applications and filtered
by relevancy to the company’s value chain as part of the “idea improvement”
process suggested by several authors in the IM literature [23, 28, 29].
The approach to generate application ideas is inspired from the concepts of
“market pull” and “technology push” [38]. In the “market pull”, the source of the
innovation is an unmet need of the customer (in this case, it could be both the
end-customer and the end-user of the solution). This results in new demands for
problem-solving (‘invent-to-order’ a product for a certain need). The impulse
comes from individuals or groups who (are willing to) articulate their subjective
demands.
In the “technology push”, the stimulus for new products and processes comes
from (internal or external) research; the goal is to make commercial use of new
know-how. The impulse is caused by the application push of a technical
capability. Therefore, it does not matter if a certain demand already exists or
not.
In this research, the pull approach consists in identifying the needs in the
respective sectors, whereas the push approach considers the benefits and
drawbacks of the technology itself and the solution it has permitted in academia
and various industries.
Technology push
The push approach consists in identifying classic, verified use-cases of the
technology in order to understand the inherent benefits of the technology in
practice. This inventory of use-case domains provides an important benchmark
for tying the utility company’s inventory of areas of improvements to graph
analytics. The two sources of benchmark use-cases are academia and the
18
3.3. RESEARCH LAYOUT
industry.
Academia
Graph theory is a mature and vastly studied mathematical area. A Scopus
search for “graph theory” yields about 100,000 results. As a comparison, a
“blockchain”-search results in 15,000 hits. In addition, their high degree of
adaptability across engineering tasks, thus potentially across a utility company’s
operations.
Therefore, a systematic, programmatic search for research articles relating graph
theory and energy activities was performed in order to guarantee a complete
scanning for potential applications. The first phase consisted in searching a
combination of keywords representing graph theory: “graph theory”,
“graph-based”, “network theory”, in combination with each energy sector the
company is operating in (e.g. hydropower, nuclear power, wind power, etc.).
The second phase consisted in listing graph-related concepts and algorithms
(about 70) and identifying their presence in academia across the energy sectors.
Due to the higher dimensionality of this search (70 × 15 entries), the search was
done programmatically with the Scopus API (Application Programming
Interface), on TITLE-ABSTRACT-KEYWORDS. In total, about 3,000 articles
were found, stored, curated first automatically according to various filters and
then with the help of the abstracts.
Some interesting articles could be found thanks to the higher specificity of the
graph-theoretic concepts used in the search, graph theory being a rather wide
area in mathematics, sometimes making it too vague to be present in the
abstract or in the keywords on Scopus.
Industry
As described in the background, graph-based solutions have had an increasing
interest from the industry, particularly in highly digitalised companies.
Benchmark use cases from industry were mainly extracted from classical
use-cases published online, as well as publicised case studies from graph-database
providers (Neo4j, Expero). The industry benchmark use-cases having a lower a
priori relevance level to the specific energy industry and are less detailed in their
implementation than are the examples from academia but give a complementary
view of what is being worked on in a more practical manner.
Demand pull
19
3.3. RESEARCH LAYOUT
20
3.3. RESEARCH LAYOUT
21
3.3. RESEARCH LAYOUT
The evaluation tool and set of criteria were chosen with the supervision of our
supervisors at KTH and experts from Fortum, with the following objectives in
mind:
◦ It needs to span a wide space of decision-making dimensions, otherwise the
evaluation would risk being misleading.
◦ Redundancy is minimised, meaning that factors where all clusters would
score equally are omitted (for example, the presence of a database,
potential presence of big data, etc.)
◦ The size of each evaluation block should be balanced, in order to avoid a
bias arising from a dimension including more criteria than others.
◦ A trade-off is needed between the simplicity and comprehensiveness of the
evaluation, to make it both user-friendly and complete.
◦ The evaluation needs to comprise both industry-specific and
company-specific criteria, to be both reproducible and relevant for the
company.
The difficulty of evaluating an idea can be seen in the lack of background
information of the ideator or examiner when describing or assessing the
potentials or limitations of an idea [34]. For mastering an information extensive
evaluation process, suitable evaluation criteria are crucial to ensure reliability.
The criteria represent guiding factors that can enable transparency,
comparability and repeatability. To ensure this, formal definitions of each grade
in the scoring system were made and attached in the appendix.
As was proposed by Edeland et al. in their assessment of the potential
22
3.3. RESEARCH LAYOUT
blockchain technology in the energy sector, an alignment of each cluster with the
company’s strategic objectives was made. This allowed for a more contextualised
prioritisation in accordance with the corporate objectives. In fact, strategy
alignment is often cited as one of the most important dimensions in IM literature.
Strategic alignment
How well does the use-case cluster align with the strategic objectives expressed by
the company?
In this section, all four main strategy aspects contain the same score definition,
as follows:
◦ 0: No impact on strategy point.
◦ 1: Indirect impact in the time being. Potential impact in the long run
subject to changes endogenous or exogenous to the model.
◦ 2: Direct and immediate impact.
The four strategy points in Fortum’s CEO’s Business Review are outlined below
[40]:
◦ Pursue operational excellence and increased flexibility
Fortum attaches high priority to extract value from its current business
portfolio. Increased flexibility refers to the flexible generation assets and
demand response of large customers in order to balance the volatility
caused by renewable generation. Operational excellence refers to
minimising operational costs, either through productivity increase or asset
management distinction.
◦ Ensure value creation from investments and portfolio
optimisation
Fortum aims to consolidate its sizeable investments from the recent years
to further improve its financial performance. This includes the investment
in Uniper, a German generator with a portfolio mainly comprising flexible
power plants (gas, coal, hydro, oil, nuclear). In addition, a continuous
review of Fortum’s portfolio of power plants is made, with an emphasis on
CO2 -free assets, flexibility and low operating costs.
◦ Drive focused growth in the power value chain
Fortum aims to grow its CO2 -free power generation portfolio in an
23
3.3. RESEARCH LAYOUT
24
3.3. RESEARCH LAYOUT
25
3.4. LITERATURE REVIEW
chosen.
◦ Model improvements and extensions and key implications for practitioners
were proposed.
26
3.5. DATA COLLECTION
In the second phase of the research, another literature review was conducted.
The scope of sources was narrowed to academic papers related to the selected
use-case to gain insight in the challenges and opportunities of possible methods
and find an interesting angle to tackle the problem.
27
Part I
28
CHAPTER 4
Elementary terminology
This chapter conveys a number of selected concepts in graph theory which are
typically presented in literature about graph theory on an introductory level.
The definitions and properties are intentionally written in running texts to create
coherent paragraphs and a relaxing reading. If no specific reference is mentioned,
the mathematical definition, theorem, property, example or statement in
question is common and can be found in such textbooks as [41], [42] and [43].
29
4.1. GRAPH AND SUBGRAPH
finite set and E is a set of pairs of elements in V . The set V consists of vertices
and E of edges in G. A graph H is called a subgraph of G if each vertex and
each edge in H is also in G. In Figure 4.1 (a), the graph G = (V, E) has 7 vertices
and 9 edges, where V = {a, b, c, d, e, f, g} and
E = {{a, c}, {a, e}, {b, c}, {b, d}, {c, e}, {c, f }, {d, e}, {e, g}, {f, g}}.
a
g 0 0 1 0 1 0 0
b 0 0 1 1 0 0 0
1 1 0 0 1 1 0
0 1 0 0 1 0 0
c f 1 0 1 1 0 0 1
0 0 1 0 0 0 1
d e
0 0 0 0 1 1 0
(a) Graph G = (V, E) (b) Adjacency matrix of G
a a
b g b g
c f c f
d e d e
30
4.2. GRAPH TRAVERSAL
31
4.3. TREES AND CONNECTIVITY
a d a d a d
b e b e b e
c f c f c f
(a) Graph G (b) A trail in G (c) A path in G
a d a d
b e b e
c f c f
(d) A circuit in G (e) A cycle in G
Figure 4.2: Various kinds of walks in a graph. Note that the circuit in (d) is not a
cycle since it passes the vertex b twice. The cycle in (e) is, however, also a circuit.
32
4.3. TREES AND CONNECTIVITY
Figure 4.3: The graph G is connected but is not a tree since it has a cycle. The
graph H has no cycles but is not a tree either since it is non-connected.
Figure 4.4: Differently configured trees with 5 vertices and 4 edges each.
a a a
b c b c b c
d d d
e f e f e f
g g g
33
4.4. MATCHING
4.4 Matching
Consider a group of people where some of them are mutually in love with each
other. If two people, who are already in love with each other, get married to each
other, this can be considered a matching. Typically, each person can take part in
at most one marriage (matching). In graph theory, a matching M in a graph
G = (V, E) is defined as a subset of E, where the edges in M are pairwise
disjoint, meaning that any two edges in M do not share a vertex.
If a matching contains as many vertices in V as possible, it is called a
maximum matching. If a matching contains all vertices in V , it is called a
perfect matching. Some graphs do, however, not have a perfect matching,
which aligns with the reality where not all people in the end can get married to
their dream partners.
4.5 Colouring
Given a graph G = (V, E) drawn 2 on a surface, one may sometimes want to put
a colour on each vertex in V such that no two adjacent vertices get the same
colour. More formally speaking, a proper vertex colouring for G is a function
c : V → N such that if {x, y} ∈ E, then c(x) 6= c(y). If nothing else is stated, a
vertex colouring should always be interpreted as a proper vertex colouring.
The least number of colours required to vertex colour a graph G is call the
chromatic number (synonym: vertex colouring number) of G, denoted χ(G) or
simply χ if there is no ambiguity. The Greek letter χ (khi, chi) has to do with
the fact that the Greek word for “colour” is “qr¸ma” (khróma, chróma).
2
A more mathematical term for “drawn” in this case is “embedded”.
34
4.6. DIRECTED GRAPH
b d b d
a f a f
c e c e
(a) Graph G (b) Graph G with
plainly drawn a matching M1
b d b d
a f a f
c e c e
(c) Graph G with (d) Graph G with
another matching M2 another matching M3
35
4.7. WEIGHTED GRAPH
b d b d
a f a f
c e c e
(a) Graph G with (b) Graph H with
χ(G) = 3 χ(H) = 4
36
4.7. WEIGHTED GRAPH
16
f
c 26
29
e
14
11
7 42
4
l a
33
23 47
28 5
j 9 i 17 h
67
Figure 4.8: A weighted directed graph showing the number of people (red edge
labels) travelling between some cities (round vertices) at some point of time, in a
small fictitious kingdom far, far away.
37
CHAPTER 5
Selected applications of graphs
The applications described in this chapter are selected from a wide range of
areas, where graph theory has proven to be an effective tool to model and solve
problems. Each application begins with a couple of keywords, followed by an
example which can be adapted into the activities of Fortum as a utility company
and ends with some suggestions for adaptation possibilities.
38
5.2. STAFF ASSIGNMENT
39
5.3. COST-EFFECTIVE RAILWAY BUILDING
energy flows among other things, its data are highly vulnerable, making the
power system exposed to security risks [48]. It is interesting to note that graph
matching is not the only approach to detect anomalies in smart grid problems.
For instance, normative subgraphs can also be considered [49].
40
5.4. LOGISTIC NETWORK AND OPTIMAL ROUTING
numerous switches, power flow paths can be easier monitored using minimum
spanning trees combined with Kruskal’s algorithm to identify these [50]. When
studying financial markets, such as stock markets [51] and energy commodity
markets [52], a minimum spanning tree can be a simple, yet powerful, tool to
uncover the characteristics and dynamics of financial instruments and
institutions.
41
5.5. PLANAR EMBEDDING OF GRAPHS
a b c
E G W
Figure 5.1: The complete bipartite graph K3,3 is not planar. A real-life
interpretation is as follows: Given three households V1 = {a, b, c} and three
utility sources V2 = {E, G, W } (electricity, gas, water) on a plane ground, draw a
line connecting each household with each utility source (nine lines in total).
Anyhow one places the households or the utility sources and bends the lines,
there will always be at least two lines crossing each other.
There are theorems in graph theory which reveal in advance whether a graph is
42
5.6. WINTER ROAD MAINTENANCE
43
5.7. SOCIAL NETWORK
the snowplough needs to visit some edges more than once if all the streets need
to be traversed. It turns out that it is possible to come up with a shortest trail
which passes each edge exactly once or at most twice [63].
Adaptation: The related graph walk Hamiltonian cycle can be useful in
monitoring of smart grids, or distribution networks in general. State estimation
is significant to sustain distribution management systems. In [64], a calculation
scheme based on Hamiltonian cycle is proposed to quickly obtain the network
states. From a wider perspective, traversing in graphs can be a powerful
technique for searching in graph databases where data is represented and stored
as a graph structure [65], if Fortum wishes to model their data sets this way. To
give a concrete example, graph traversals can be used to perform a recovery
sequence in automated fault management for medium voltage direct current
shipboard power systems [66].
dist(x, y) denotes the shortest distance between two people x and y, if one
assigns each edge in the graph with a weight which indicates the relationship
distance between two people, meaning how well they know each other.
In general, it is often desirable to detect which vertices that have the most
important role in a graph, or in other words the most central vertex. There are
some different ways to mathematically define what “most central” means,
depending on the real-life interpretations. Beside degree centrality and closeness
44
5.7. SOCIAL NETWORK
centrality, some other useful types of centrality are betweenness centrality and
eigenvector centrality. A famous variant of eigenvector centrality is PageRank
centrality 1 used by Google for ranking web pages in their search engine [67]. A
small but illustrative example is given on the next page. Note that the most
central vertices are different depending on what centrality measures are used.
Adaptation: Developers working with strategies involving customers can use
graph-based social networks, with centrality among other concepts, as a powerful
decision support tool. A company can gain more customers and profits by for
example identifying influential customers and creating suitable marketing
campaigns [68]. Some interesting ideas for analysing consumer behaviours and
purchasing patterns are proposed by [69]. A graph-based approach for valuing
the explicit financial worth of paying customers as well as the implicit social
value of non-paying customers is suggested by [70].
An application of centrality measures worth mentioning for a utility company is
management of power systems. A plan for efficient placement of security sensors
in a large network can be achieved with the notion of graph centrality [71].
Hydraulic centrality metrics can be derived from the conventional centrality
metrics to extract useful information of a hydropower plant or a water
distribution network [72].
c
b
d
a
e
g
f
1
The word “PageRank”, spelled with a capital P, a capital R and with no space, was named
after Larry Page, one of the founders of Google.
45
5.8. TOURNAMENT RANKING SYSTEM
most central
Centrality a b c d e f g vertices
y
Degree 2 3 1 1 3 2 2 b, e
Closeness 0.10 0.09 0.06 0.06 0.09 0.07 0.07 a
Betweenness 9 9 0 0 8 0 0 a, b
PageRank 0.14 0.23 0.09 0.09 0.19 0.13 0.13 b
46
5.9. IMAGE SEGMENTATION
47
5.9. IMAGE SEGMENTATION
48
Summary
49
Part II
Selection of use-case
50
CHAPTER 6
Idea generation
6.1 Inventory
While the search for use-cases from academia resulted in 15,000 papers, the one
on industrial use-cases was less prolific, as about 100 use-cases were gathered. An
inventory of use-cases was built after the filtering processes, with information on
their authors, energy-cluster and graph-cluster belongings. The inventory was
continuously updated along the research, as more keywords were found useful
and relevant. A curated set of use-cases can be found in A.2 and the full
inventory can be found online 1 .
As earlier mentioned, several types of academic use-cases were not saved in the
inventory, for the various reasons listed below.
From academia:
◦ Solar power: too few use-cases found.
◦ Waste management: too distant from the classic digitalisation challenges a
classic utility company intends to solve.
◦ District heating: the district heating operations in Sweden are now part of
Stockholm Exergi.
1
Inventory of uses cases of graph theory in the energy sector
51
6.1. INVENTORY
6.1.1 Clustering
The table below summarises the proposed clusters and the number of use-cases
in each.
Use-case cluster Number of past use-cases
Hydropower: optimisation 13
Hydropower: operation and maintenance 12
Nuclear power: operation and maintenance 47
Wind power: design, operation and maintenance 65
Electric vehicle applications 42
Energy trading 110
Master data management 16
Energy storage 21
Knowledge graphs 20
52
6.1. INVENTORY
master data management, as they could not be tied to specific energy sectors. It
provides insights not only on which clusters have most use-cases, but also on
which graph-related concepts (graph-type, problem-type or algorithm) are most
recurrent in literature.
53
CHAPTER 7
Assessment of use-case clusters
7.1 Preliminaries
7.1.1 Presentation of the use-case clusters
In this section, a face-value assessment will be made of each of the selected
clusters. The structure of the assessment will look like follows:
◦ Background: the main challenges and objectives in the clusters are
covered.
◦ Case study: a specific case study from the inventory was selected and
explained in further details to give practitioners some more details of the
methods applied and their results.
◦ Expert opinion: for most clusters, feedback from industry experts on the
challenges and opportunities in this cluster, notably with graphs, was
gathered.
◦ Assessment: the cluster is graded on each of the dimensions detailed in
the methodology. The overall grade and the grades in each dimension are
given in brackets next to the dimension.
◦ Further reading: a few more articles are recommended for the reader to
have a wider perspective of the potential other solutions of graphs in the
cluster.
54
7.1. PRELIMINARIES
55
7.1. PRELIMINARIES
Legend:
C1: Hydropower: optimisation
C2: Hydropower: operation and maintenance
C3: Nuclear power: operation and maintenance
C4: Wind power: design, operation and maintenance
C5: Electric vehicle applications
C6: Energy trading
C7: Energy storage solutions
C8: Master Data Management
C9: Knowledge graphs
56
7.2. OPTIMISATION OF HYDROPOWER OPERATIONS
57
7.2. OPTIMISATION OF HYDROPOWER OPERATIONS
58
7.2. OPTIMISATION OF HYDROPOWER OPERATIONS
59
7.3. HYDROPOWER OPERATION AND MAINTENANCE
Further reading
Arc-based constrained ant colony optimisation algorithms for the optimal solution
of hydropower reservoir operation problems [89]
Maximizing power production in path and tree riverine networks [90]
60
7.3. HYDROPOWER OPERATION AND MAINTENANCE
61
7.3. HYDROPOWER OPERATION AND MAINTENANCE
62
7.4. OPERATION AND MAINTENANCE FOR NUCLEAR POWER
63
7.4. OPERATION AND MAINTENANCE FOR NUCLEAR POWER
Dabrowski et al. have used graphs to improve the Discrete Time Markov Chains
(DTMCs), which are useful in analysing scenarios in which a system’s operations
go from normal to failure [104]. Wu et al. use Signed Directed Graphs (SDG) to
reveal a fault propagation path [105]. SDGs are a commonly used for fault
detection notably in the process industry, as they show the complex relationship
between parameters in a model-free, flexible manner. Others have used graphs to
model Pressurised Water Reactors (PWR), a very common type of NPP
[106, 107], for plant piping design [108].
Case study Fault diagnosis and severity estimation in nuclear power plants [105]
SDG models do not require neither precise mathematical descriptions nor
complete operational data and can be constructed with only partial information
and from the experience of operators. According to the authors of this study,
SDGs reveal the latent dangers and the propagation rules in a simple and
effective way, making it particularly suitable for fault diagnosis in nuclear power
plants.
The authors constructed an SDG model with nodes representing parameters of
measure or fault root cause (e.g. pressure points, temperatures, flows) and
labelled the directed edges “+” for a positive causal impact between one node
and another and “−” in case of a negative impact. They converted this graph
into a fault diagnosis decision with a set of rules to more easily detect the root of
reoccurring faults.
This SDG is only a sub-component in their fault diagnosis tool, as it is preceded
by a feature extraction method for faster computation and followed by a neural
network assessing the severity of a fault (e.g. the size of a break) with Back
Propagation (BP), an algorithm aiming at fine-tuning the weights of a neural net
based on the error rates [109]. The SDG in this case is contained to identifying
the fault type. They verified their model on sub-systems such as loss of coolant
accidents and the steam generator tube rupture, successfully detecting all the
considered faults and determining the fault propagation chains.
Assessment (5/10)
Graph applicability (7/8)
As for the case of operation and maintenance for hydropower stations,
graph-based solutions can be beneficial to nuclear power plants operators.
Although the two systems express some similarities in what should be analysed
(equipment health monitoring, decision support tools), nuclear power stations
64
7.4. OPERATION AND MAINTENANCE FOR NUCLEAR POWER
Further reading
65
7.5. DESIGN AND MAINTENANCE FOR WIND POWER
66
7.5. DESIGN AND MAINTENANCE FOR WIND POWER
Inspired from the search pattern known as the Lévy flight (a random walk
process) of cuckoo species, CS is a relatively new optimisation algorithm and has
successfully been applied to various problems, such as scheduling problems, the
travelling salesman problem.
Their test results indicate that the infrastructure cost has a significant effect on
the optimum wind farm solution. Compared with the genetic algorithm (GA), a
commonly used meta-heuristic algorithm in wind farm optimisation, CS was
slower, but found a better solution.
Expert opinion
As WFLO is a central aspect of wind power projects, there are already a number
of tools developed by OEM:s performing well, which also include load calculation
(i.e. the maximum load supported by the turbines) and life-time assessment.
However, due to particularly diverse constraints and bureaucracy of wind power
projects, there is a certain level of ad hoc, manual processing, making an
automated tool less powerful.
A widely researched topic in wind power nowadays is a good estimation of the
remaining life-time for the turbines and how to operate them in an optimal way
given this operational cost. Although a more automated scheduling of
maintenance is indeed an interesting topic, it is subject to internal organisational
capacities and should be in accordance to the optimal timing with regards to low
(or even negative) prices.
Finally, even if wind power projects are partnerships and much of the
computation work is made by sub-contractors, there is still an advantage of
analysing this system in-house as a complement for better negotiations [123].
Assessment (4.1/10)
Graph applicability (7/8)
As shown by the availability of the use-cases, graph theory is indeed applicable
to the problems of designing and monitoring wind power plants, on different
scales: at farm-level, auxiliary equipment-level, turbine-level. In all these system
levels, significant information is to be gained in the relationships between
components and edges can represent cost minimisation, clustering, fault
propagation. This leads to a good graph-applicability score.
Technical feasibility (1/8)
The setup of the model is considered complex, given the wide set of constraints
67
7.5. DESIGN AND MAINTENANCE FOR WIND POWER
of different nature. Coordination is needed from the fields finance and project
development, wind power and as well with the partners for the WFLO, as well as
in mechanical engineering and data science for the equipment monitoring. The
granularity of the time-series is also currently a barrier for proper life-time
estimation of the turbines.
Economic potential (4/8)
The global wind power market size is approximately US$ 70 to US$ 100 billion
[124, 125] and is expected to grow at a CAGR of 7.20 % until 2023. The turbine
operations and maintenance market size was valued at US$ 12 billion in 2018,
with an expected CAGR of 8.54 % from until 2025, amounting to US$ 21 billion
[126]. However, due to the high competitiveness of existing software solutions of
both WFLO and condition monitoring as well as the relatively low
generalisability of WFLO due to the many local constraints, the economic
potential was considered low.
Workability (1/8)
Potential graph-based solutions have low relevance to the current workflow as
Fortum is working with sub-contractors for wind power projects. Similarly, the
integrability and maintainability of the tools is considered too burdensome and
resource-costly for the workforce.
Further reading
Identification of critical components of wind turbines using FTA over the time
[127]
68
7.6. ELECTRIC VEHICLE APPLICATIONS
Criticality analysis of wind turbine energy system using fuzzy digraph models and
matrix method [118]
69
7.6. ELECTRIC VEHICLE APPLICATIONS
aspect is not taken into consideration but could be subject to future work.
Expert opinion
Placement of charging stations
Although the charging station placement and routing problems are interesting in
theory, in practice the nature of the needs is quite different. Investment in
charging point stations is capital intensive and not particularly profitable.
Outside Norway, the EV charging recurrence is considered too low for a roll-out
of infrastructure. The most important challenge is to increase return on
investment by maximising usage of the charging stations. To obtain this, a
deeper understanding of the user’s behaviours is needed to answer to the
question “What makes people charge at a particular station?”. For example,
there are areas where the Charging Point Operator’s (CPO) coverage overlaps
the users’ addresses and have a low utilisation, whereas other CPOs where user
density is lower experience higher utilisation rates [133, 134].
Graphs could potentially help better understand the usage rates of a charging
station by identifying critical success factors. One hypothesis which can be tried
is the “neighbourhood” factor, i.e. identify the points of interest around the CPO
and find a pattern. Charging stations and other landmarks could then be used as
nodes and the edges between them is the physical distance. Valuable attributes
could be given to the charging stations, such as the usage rate and the average
cost of charging, the time of the day the charges were completed, etc.). Stations
could then be clustered based on their usage rate and neighbourhood to detect
whether or not a pattern can be identified.
Routing
As well, charging price may not necessarily be the most important constraint in
the routing problem, as it is neither the prime concern of EV-owners, who could
otherwise charge at home, nor of business customers, who need availability.
Assessment (6.3/10)
Graph applicability (8/8)
EV has the most instances of case studies of all clusters. Many algorithms have
been tried and implemented, among others, different forms of shortest path
(Djikstra, A*, Kruskal), breadth-first-search, matching, etc. The graph
applicability in this case study is thus maximal.
Technical feasibility (7/8)
70
7.6. ELECTRIC VEHICLE APPLICATIONS
Generally speaking, the routing problem is a classic graph problem and thus
setting up a simple model does not require expertise from other domains.
However, the problem can quickly expand to large and complex systems with
mixed data sources as we add more constraints to the optimisation to make it
realistic. The commercialisation possibility of the ongoing research is debatable,
as most studies focus on a small set of constraints. Moreover, most studies do
not take into account the actual behaviour of the users, which is still a big
question mark in this industry.
Economic potential (3/8)
The electric vehicle charger market size was valued at US$ 3.8 billion in 2019 and
is projected to reach US$ 25.5 billion by 2027, registering a CAGR of 26.8 %
from 2020 to 2027 [135]. There is significant opportunity for utilities: Boston
Consulting Group estimates that the rise of EVs could create US$ 3 billion to
US$ 10 billion of new value for the average utility [136]. This has to be balanced
by the yet low maturity of EV, the currently low charging station utilisation rate
and thus their return on investment.
Workability (2/8)
For the moment, the problems that graphs solve are not the highest priority for
practitioners. The usage is so low that there is little need for automation of the
services, notably for the charging stations. Moreover, the data availability is low,
given the low interaction level between the user and the station. Therefore, it has
a low workability currently.
71
7.7. MARKET INTELLIGENCE FOR ENERGY TRADING
Further reading
On calendar-based scheduling for user-friendly charging of plug-in electric
vehicles [137].
A location model for electric vehicle (EV) public charging stations based on
drivers’ existing activities [128].
72
7.7. MARKET INTELLIGENCE FOR ENERGY TRADING
73
7.7. MARKET INTELLIGENCE FOR ENERGY TRADING
relationships between the energy commodities [159], smart-grid data [158]. Fewer
traditional graph-algorithms were found within this cluster (e.g. shortest path,
MST, etc.). However, many of the problems needed to be solved can be done
with a graph-approach, such as clustering of electricity loads [160] and feature
selection for electricity price forecasting [161].
Technical feasibility (5/8)
As opposed to most other applications considered in this research, market
intelligence is not a cyber-physical model, which facilitates the model setup.
There is a certain homogeneity in the expertise necessary, namely data science,
mathematics and trading, as the fields are merging. The computational
constraint of forecasting is high, since trading decisions need to be made fast. As
well, even though the physical risk of model failures is low, considerable financial
risks come into play.
Economic potential (5/8)
According to Zareipour et al., there are significant gains to be made by higher
forecasting accuracy: a 1 % improvement in the mean absolute percentage error
(MAPE) would result in about 0.10 % – 0.35 % cost reductions from short-term
electricity price forecasting [162]. In dollar terms, this would translate into
savings of circa US$ 1.5 million per year for a typical medium-size utility with a
5 GW peak load [163].
Workability (6/8)
Deeper market insights are highly relevant for Fortum’s trading operations. Most
of the data needed in this area are made readily available by third parties, either
in the public domain (e.g. weather, commodity prices, power grid operator data)
or through third-party vendors. The integration of a graph-based solution is
considered less of a challenge than other areas given the proximity of graph
analytics and machine learning models.
74
7.8. STORAGE SOLUTIONS FOR THE DISTRIBUTION GRID
Further reading
Evolution of the world crude oil market integration - A graph theory analysis
[164].
Grid topology identification using electricity prices [165].
75
7.8. STORAGE SOLUTIONS FOR THE DISTRIBUTION GRID
study.
Case study Optimisation of virtual power plant topology with distributed
generation sources [167]
This case study proposes a hierarchical control strategy to coordinate battery
energy storage devices based on a multi-agent system. A multi-agent system can
cope with complex problems effectively in a power system and improve the
robustness, reliability and flexibility of the system [168]. Therefore, many papers
bring a multi-agent method into the controller of an energy storage system.
However, as the authors point out, in most existing studies, the geographical
feature was not considered. As such, they design a distributed step-by-step
consensus algorithm based on droop controller (grid frequency stabilisation),
considering geographical location, aiming to improve the cooperative control of
the energy storage. According to different geographical distributions, Battery
Energy Storage Systems (BESSs) can be divided into several clusters with the
Kruskal algorithm, which is a version of the MST algorithm.
Graph Laplacians, or Laplacian matrices and their spectral properties are
important graph-related matrices that play a vital role in convergence analysis of
consensus and alignment algorithms. In particular, the stability properties of the
distributed consensus algorithms for networked multi-agent systems are
completely determined by the location of the Laplacian eigenvalues of the
network. Considering the location feature of each agent, an improved Laplacian
matrix with the weight corresponding to each edge is adopted. The edge set
describes the relative distance between agents in the communication topology,
implying their communication intensity.
This paper proposes a step-by-step consensus algorithm with several active
leaders, or a virtual leader. In fact, active leaders are internal agents while the
virtual leader is an external command. All active leaders can reach consensus by
exchanging information with each other and other agents can reach consensus
according to their leader. The virtual leader can force all agents to follow the
leader.
Assessment (5.6/10)
Graph applicability (5/8)
The application of graphs for energy storage solutions in the previous case
studies relies to a large extent to the topological analysis of the distribution
76
7.8. STORAGE SOLUTIONS FOR THE DISTRIBUTION GRID
network. In this context, graphs are deemed fit as topological analyses of power
grids are a classic use-case for graphs, where edges are the power lines and nodes
are power sources, load sources and interconnections between the lines. In fact,
there have been various studies modelling the power grid via graph databases,
finding promising results [169, 170].
Technical feasibility (3/8)
The cases studied were mostly based on graph theory, which simplifies the model
setup and reduces the diversity of expertise needed. As opposed to the optimal
siting of BESSs, an operational optimisation would require online models and
fast response times. Given the smaller scale of a BESS, the risk of failure is
considered low, relative to the larger-scale systems considered in this study.
Economic potential (4/8)
Adding the renewable energy technology, batteries and other components, the
virtual power plant market could grow from US$ 1.5 billion in annual revenue in
2016 to a US$ 5.3 billion market by 2023. Europe would stand for US$ 1.3 billion
(Gallucci, 2016).
Thus, grid services with energy storage is a niche market and still to a large
degree in the research phase. This leads to a lower competitive barrier and gives
a certain window of opportunity for software building.
Workability (6/8)
Graphs have been used for VPP-siting, which is a critical component in the
battery energy storage systems. Fortum is in fact already providing services to
the DSO through their Smart City Solutions (Fortum, 2020), which indicates a
certain alignment in terms of human capital. However, more information is
needed regarding the data and the internal software being used in order to assess
the data and maintainability alignments.
Figure 12: Evaluation of storage solutions for the grid
Further reading
A stochastic shortest path framework for quantifying the value and lifetime of
battery energy storage under dynamic pricing [171]
Consensus design for heterogeneous battery energy storage systems with droop
control considering geographical factor [172]
77
7.9. MASTER DATA MANAGEMENT
78
7.9. MASTER DATA MANAGEMENT
79
7.9. MASTER DATA MANAGEMENT
number of previous use-cases, although not from the energy sector specifically.
However, there are numerous previous cases of graph-based MDMs in commercial
settings, so the graph applicability is considered maximal.
Technical feasibility (7/8)
The technical feasibility is high, as the main task is about merging the data.
Creating the architecture to integrate data from different sources and keeping the
model updated is the highest source of technical complexity. Data-cleaning can
however become a liability, as the data sources can have different structures and
be incomplete. The risk associated with MDM is mainly a waste of resources: an
inconclusive model (e.g. because of incomplete or poor data) or a low adoption
rate by the users within the company. However, these risks are common to all
applications. Another risk is a breach of the privacy policy of customers, but it is
seen rather as a limitation for increasing the richness of the graph.
Economic potential (4/8)
The global MDM industry is projected to grow to US$ 8.6 billion by 2022
(Persistence Market Research, 2017). Given the generality of this use-case, many
graph database-vendors offer this service where minimal customisation is
required from the company (Neo4j, TigerGraph, AWS, etc.). There are several
industrial use-cases having reported economic benefits from their implementation
(Cisco, Pitney Bowes, the German Center for Diabetes Research). However, a
graph-based solution is not the only alternative, as there is a myriad of
non-graph-based MDM (e.g. IBM).
Workability (2/8)
Although the problem can be of high overall relevance for companies with large
customer bases and a wide product offering with data spread out across the
organisation, the perceived relevance for the end-users of this application. This
can in turn affect the integration and maintenance of the application over-time
(i.e., keeping the data updated). Moreover, MDM is a query-driven graph,
responding to specific questions such as “What is the churn rate of customers?”,
generally provided from the sales or the marketing department, making
engagement from the end-users particularly critical. Another challenge is the
data alignment, which is logical given that data integration is the very purpose of
this application.
Further reading
Graph-based customer entity resolution [178].
80
7.10. KNOWLEDGE GRAPHS
81
7.10. KNOWLEDGE GRAPHS
82
7.11. RESULTS OF USE-CASE CLUSTER ASSESSMENT
Further reading
Hydro-graph: A knowledge graph for hydrogen research trends and relations [183].
A Potential Solution for Intelligent Energy Management—Knowledge [184].
83
7.11. RESULTS OF USE-CASE CLUSTER ASSESSMENT
The graph-score is plotted on the vertical axis and the strategy-score on the
horizontal axis. To better discern the clusters, both axes have been rescaled
according to the following rule: the minimum of each axis is set to the value of
the cluster with the minimum score along the axis minus one. Similarly, the
maximum of each axes is set to the maximum score plus one. A more
fundamental reason for this is that in order to select a cluster, it is more
appropriate to evaluate the clusters in relative terms than in absolute terms.
As such, four quadrants can be identified by drawing a boundary set at the mean
84
7.11. RESULTS OF USE-CASE CLUSTER ASSESSMENT
of each axis.
As explained in the methodology, the use-case clusters were categorised in three
groups:
◦ The first group is comprised of the use-case clusters located in the
top-right-hand quadrant and represents the accepted clusters: Trading -
Market Intelligence, Knowledge Graph, Master Data Management and
Hydropower O & M.
◦ The second group is comprised of the use-case clusters located in the
top-left-hand and bottom-right-hand quadrants and represents the deferred
clusters: EV Applications, Storage solutions for the grid, Hydropower
optimisation and Nuclear power O & M.
◦ The third group consists of the use-case clusters in the bottom-left-hand
quadrant and represents the rejected clusters: Wind power design and O &
M.
The selected cluster for a proof-of-concept implementation was that of Energy
trading: Market intelligence, as it obtained the highest score on both dimensions.
85
7.11. RESULTS OF USE-CASE CLUSTER ASSESSMENT
86
7.12. DISCUSSION ON THE SELECTION OF USE-CASE CLUSTER
87
CHAPTER 8
Assessment of use-cases in Energy
trading
In this section, three use-cases within the Trading cluster are presented in a
similar fashion as the use-case clusters were presented in the previous section.
The use-cases were chosen so as to best meet the needs of a utility company like
Fortum, thus relate to the domains of natural gas, short-term load forecasting
and electricity price forecasting.
The difference in evaluating specific use-cases as opposed to use-case clusters is
that since a specific application is sought, a higher level of detail imposes itself.
Each proposal is inspired from the use-case from the inventory which best
highlights the benefits of graph theory within the aforementioned selected
domains.
Table 8.1 summarises the scoring of each use-case. The same scoring system on
each dimension is applied as for the use-case clusters, detailed in 7.1.2. Note that
the strategy-score was not computed, as they each pertain to the same cluster
and thus score similarly.
88
Table 8.1: Scoring of use-cases in Energy trading
Legend:
UC1: Natural gas: visibility graph
UC2: Short-term load forecasting: clustering
UC3: Electricity price forecasting: feature selection
89
8.1. NATURAL GAS MARKET ANALYSIS WITH VISIBILITY
GRAPHS
90
8.1. NATURAL GAS MARKET ANALYSIS WITH VISIBILITY
GRAPHS
into a regular graph, whereas random series convert into exponential random
graphs. In addition, the temporal characteristics and the internal evolution
mechanisms of time series can also be explained by the visibility graph algorithm.
This allowed the authors to model the natural gas price fluctuations by observing
the evolution characteristics of its corresponding graph. Namely, three attributes
of the visibility graphs were discussed: the degree and degree distribution, the
average shortest path length and the community structure.
The main contributions of this paper are as follows:
◦ It provides a new method to study the spot price fluctuations of natural
gas in North America, which also can be related to energy prices, such as
coal, oil, solar, etc.
◦ Combined with the degree distribution characteristics of NGP-VGN,
selecting appropriate window size, the overall natural gas price series is
divided into five 6-year time windows, which makes the research more
detailed and targeted.
◦ Degree and degree distribution, small-world characteristics and community
structure of the network.
Proposal: Construction of a visibility graph for the European natural gas
market.
Assessment (7.2/10)
Graph applicability (8/8)
Graph theory is a tool of growing interest in the signal processing community for
data representation and analysis. Their structure offers a new perspective, often
unveiling non trivial properties on the data they represent. In particular, time
series analysis has greatly benefited from graph representations as they provide a
mapping able to deal with non-linearities and scaling issues present in multiple
applications. In the case studied, classical graph-theoretic algorithms were used
(e.g. degree distribution, shortest path).
Technical feasibility (6/8)
The model uses univariate time series of prices and the set of rules for the graph
construction are considered simple and the graph gives a lot of freedom for
analyses. This can also turn out to be the model’s very drawback as well, as it
requires a considerable intuition and understanding of networks to know what to
91
8.2. SMART METER CLUSTERING FOR SHORT-TERM LOAD
FORECASTING
Figure 8.1: Assessment of Natural gas market analysis with visibility graphs
92
8.2. SMART METER CLUSTERING FOR SHORT-TERM LOAD
FORECASTING
smart meters, can measure and record each consumer’s energy usage, every 15,
30 or 60 minutes. The general method for improving the forecast using smart
meter data is based on clustering.
Assuming that the next day electricity demand of a city with a number of
consumers is to be predicted for the next 24 hours. A model can be trained to
generate a forecast for the next day, using inputs like temperature, load at the
same hour on the current day, load at the same hour on the same day in the last
week, etc. Any of the methods mentioned above can be used for creating such a
model.
The research about using smart data for improving load prediction is still in its
early stages. As a result, there are not many papers addressing this issue. The
existing works can be categorised according to their feature set selection
methods, clustering methods and the applied prediction methods. Alzate and
Sinn use wavelet analysis for feature extraction, a method called kernel spectral
clustering for clustering and a prediction method called PARX as their forecast
method. They report a more than 20 % improvement in forecast accuracy using
these techniques, which the subject of the case study below [191]. Gajowniczek et
al. simulate clustering approaches for residential electricity demand profiles [192].
Case study Improved electricity load forecasting via kernel spectral clustering of
smart meters [191]
The authors of this study aim to improve forecasts of aggregated demand with
smart meter data, by exploring the possibilities from kernel spectral clustering.
Their goal is to cluster the smart meter data in order to build a forecasting
model for each cluster separately instead of building a unified model for fitted to
the total aggregate of all meters.
The purpose of spectral clustering is to identify communities of nodes based on
the edges connecting them. It uses information from the eigenvalues (spectrum)
of adjacency matrices, whether built from a graph or a data set. As opposed to
k-means clustering, which results in convex sets, spectral clustering can solve
problems, such as intertwined spirals, because it does not make assumptions on
the form of the cluster. This property comes from the mapping of the original
space to an eigenspace. Given a sparse similarity graph, spectral clustering can
be implemented efficiently even for large data sets [193]. Kernel Spectral
Clustering (KSC) is an extension of the model described above. It allows for
out-of-sample extensions and model selection in a learning setting with training,
93
8.2. SMART METER CLUSTERING FOR SHORT-TERM LOAD
FORECASTING
94
8.3. FEATURE SELECTION FOR ELECTRICITY PRICE
FORECASTING
rather low. Data from smart meters is also be tricky to obtain, especially for
larger geographical areas, for which the load forecasting is necessary.
95
8.3. FEATURE SELECTION FOR ELECTRICITY PRICE
FORECASTING
Almost all models share a common, critical phase: selecting the right
combination of parameters. It is important to note that though there have been
many studies focusing on feature selection, the development of an objective
method of selecting a minimum set of the most effective input variables would be
very valuable [200].
Case study A hybrid short-term electricity price forecasting framework: Cuckoo
search-based feature selection with singular spectrum analysis and SVM [194]
This study proposes a hybrid forecasting framework for short-term electricity
price forecasting to cope with non-linear dynamics of the electricity price.
The first step in their model is a singular spectrum analysis (SSA), to exploit the
important information hidden in the electricity price signal, such as trend,
periodicity and noise components. SSA can be an aid in the decomposition of
time-series into a sum of components, each having a meaningful interpretation.
Standard SSA algorithm consists of four stages: embedding, Singular Value
Decomposition (SVD), grouping and diagonal averaging. A great advantage of
this methodology is the fact that it is non-parametric, or model-free, meaning
that it can adapt itself to the underlying data set, dismissing the necessity of a
priori models. They decompose the price into a trend, a seasonal and a residual
component. This step is then followed by a cuckoo-search (CS) algorithm to
facilitate the development of feature selection and finally a support vector
machine (SVM) as a forecasting engine.
Benchmarking their model against a set of commonly used forecasting
techniques, among others LASSO, SVM and ARIMA, they find a superior model
performance. They attribute this to the HFS, which can generate the optimal lag
order for alternative electricity price series and then the hybrid of CS, SSA and
SVM can more effectively capture the relationship between inputs and output,
resulting in the desired forecasting.
The authors point out that a combination of models can obtain improved
robustness and be more generalisable than single methods, because they
integrate the strengths of a set of individual methods.
Proposal: A graph-based feature selection for EPF.
Expert opinion
There is reported interest in an automated feature selection process to improve
the forecasting results, computation efficiency and reduce manual input.
96
8.3. FEATURE SELECTION FOR ELECTRICITY PRICE
FORECASTING
97
8.4. RESULTS OF THE USE-CASE ASSESSMENT
98
8.4. RESULTS OF THE USE-CASE ASSESSMENT
99
8.5. DISCUSSION ON THE SELECTION OF USE-CASE
100
CHAPTER 9
Conclusion and discussion on
PART II
9.1 Conclusion
As was covered in Part I of this research, graph theory has a wide array of
applications across engineering fields. Modelling physical or digital systems into
nodes and edges enable the usage of many graph-related algorithms and tools of
analysis.
Part II aimed at illustrating where in the energy sector graphs can bring
additional value. A systematic search through academic and industrial use-cases
was made to obtain a more formalised manner of identifying where graphs have
previously been implemented and more particularly within the domains of
activity relevant for a utility companies.
Inspired from established Idea Management theory and guidelines, an idea
selection process was designed. Firstly, a filtering and clustering was made. The
use-cases were clustered in two dimensions, namely by domain of activity and by
graph-related concept to more easily identify which graph-based methods are
most prevalent in the respective domains. Then, an evaluation framework was
elaborated in coordination with organisational and academic supervisors. It
consists of two sequential evaluations, one of use-case clusters and then of specific
use-cases within the chosen cluster. This way, utilities can have a wider choice of
101
9.2. DISCUSSION
9.2 Discussion
The contributions of Part II of this research are the following:
◦ A data-driven idea generation approach for technological scouting within
the innovation processes of electric utility companies. It can be extended to
more domains of applications or replicated for other technologies.
◦ A framework for selecting a use-case of graph theory within the energy
sector.
◦ An index of possible use-cases within the energy sector, which can be used
as basis for further, more detailed studies in this context.
As the assessment of the use-cases is made at face-value, it aims at providing
guidelines and indications as to the potential value of graph theory in a subset of
domains of activity within the energy sector. This research contains certain
inherent limitations in order to objectively examine each use-case clusters,
namely in terms of time, scope, expertise and data. A suggestion for a more
objective evaluation would be to distribute the assessment process across the
organisation’s relevant experts and selected by a voting system. However, this
could introduce potential subjective biases.
Another possible direction would be to develop the idea generation tool further
to index all documents and do natural language processing for creating
102
9.2. DISCUSSION
knowledge from the information, in line with the concept of knowledge graphs.
This has been made for nuclear power operation effectivisation, for enriching
smart grid information and to asses the status of hydrogen research. Specific
elements of the previous case studies could be detected and compared
systematically and objectively. But this process could become time-consuming
and there will still remain some sources of uncertainty, be they organisational,
technical or economic. Taking into account the value of an agile approach,
consisting in incremental and iterative development and factoring in compounded
learnings throughout the process, rapid prototyping of a proof-of-concept of a
use-case might contribute to a more realistic assessment of an application.
As this research is exploratory, another literature review will be conducted to
gather a more profound understanding of the challenges with electricity price
forecasting and the various graph-based approaches which can be adopted. Given
the versatility of graph theory, some flexibility from the original use-case is
reserved for the proof-of-concept proposal.
103
Part III
104
CHAPTER 10
Graph theory in electricity price
forecasting
105
10.1. BACKGROUND ON THE ELECTRICITY MARKET
106
10.1. BACKGROUND ON THE ELECTRICITY MARKET
logical that there is some margin of error and the actual values will look
different. These variations are remedied in various ways in the balancing market.
It is worthwhile mentioning that the biggest portion of power sales is still
generated by bilateral contracts with large customers, also called futures.
Suppliers and buyers agree on a future delivery of electricity at a pre-determined
rate, hedging against the risk caused by the daily price volatility.
107
10.2. ELECTRICITY PRICE FORECASTING
generators produce exactly the quantity from their bid in their day-ahead market.
The TSO incurs a cost for this imbalance, either an upward-regulation if the
system needs more electricity or a downward-regulation in the opposite scenario.
Secondly, market participants can choose to trade on this market instead of the
day-ahead market, if they predict that it would be more beneficial to them.
As soon as buy- and sell-orders are matched, the trade is executed. Electricity
can be traded up to 5 minutes before delivery and through hourly, half-hourly or
quarter-hourly contracts. As this allows for a high level of flexibility, market
participants can also make last minute adjustments to balance their positions
closer to real time to reduce their imbalance [201].
108
10.2. ELECTRICITY PRICE FORECASTING
109
10.2. ELECTRICITY PRICE FORECASTING
models can be a serious limitation and few attempts are being made, it performs
reasonably well to capture the volatility or price spike forecasts [200].
Statistical models build upon the seasonality of electricity prices and capture
their auto-regression at the same time as exogenous factors can be considered.
They have been exhaustively applied to EPF thanks to their simplicity and
satisfying accuracy. Traditionally used models include autoregression, transfer
functions, autoregressive integrated moving average (ARIMA) and Generalised
Autoregressive Conditional Heteroskedasticity (GARCH) [194]. However, they
fail at capturing price spikes because of their non-linearity, so hybrid models are
often used, for example with various combinations of ARIMA and GARCH
models [210, 211].
Computational intelligence (CI) models aim at overcoming the
aforementioned weakness of statistical models as they are effective for capturing
the non-linear behaviour of electricity prices [212]. Artificial neural networks
[213] Support Vector Machines [214], Fuzzy inference systems [215] have often
been suggested, but a vast set of models is being tried (e.g., genetic algorithms,
regression trees). Their ability to handle complexity and non-linearity make
them a promising approach for short-term predictions and authors have reported
excellent performance. They can furthermore accept much larger amounts of
data than traditional statistical regression models. However, this can also be
their very drawback, as a risk is to overfit the model to the training set and
incorrectly forecast the point forecasts [200]. The accuracy of the models is
dependent on the calibration of the hyperparameters, the initial conditions,
which is to a large extent made by computationally intensive exploratory means
and through trial and error, which reduces their replicability. Also, they can
suffer from a low level of explainability as they generally adopt a black-box
model, i.e. what happens in the neural networks is not fully understood.
EPF models are strongly dependent on the quality of features provided as input.
To a large extent, methods have been leveraging on expert knowledge to guide
the search of the right variables. Specific time-lags, oftentimes with time-varying
windows. And brute-force exhaustive search for variables is computationally
expensive. Still, there is interest in automated procedures for precise
dimensionality reduction [145].
Graphs are widely adopted in combination with neural networks, e.g. through
graph neural networks, graph convolutional networks or graph embeddings for
110
10.2. ELECTRICITY PRICE FORECASTING
111
10.2. ELECTRICITY PRICE FORECASTING
112
10.2. ELECTRICITY PRICE FORECASTING
approaches have been proposed to tackle this, such as mutual information (MI)
[144, 222] and the relief algorithm [223] which are well-known filters. But
according to Zhang et al. they fail to evaluate the importance of alternative
variables due to the independence of the filter methods from the forecasting
model [194].
According to Yang et al., hybrid models have caught the attention of researchers
in recent years, for their ability to aggregate the benefits from each method [224].
Abedinia et al. propose a filter-wrapper feature selection for load and price
forecasts. The filter model is able to select a minimum subset of features with
regards to relevancy, redundancy and synergy, which is the interaction between
the input parameters. Subsequently, they applied a wrapper to fine-tune the
results from the first step [225].
113
10.2. ELECTRICITY PRICE FORECASTING
forecasts reduces their own forecasting errors by more than 2 %. Their inclusion
of exogenous variables in addition to their baseline model with lagged versions of
intra-day and day-ahead prices is motivated by the results of [163], who stressed
that fundamental variables had an explanatory value when forecasting day-ahead
electricity prices. It contradicted the observations made by Monteiro et al.
regarding the impact of market fundamentals.
Feature selection is a particularly important challenge to solve in the case of
intra-day trading because of the large amounts of data available and the
currently lower level of understanding of the price movements. Prices are
continuously updated for all hours of the day along with the trading decisions of
generators and loads and the updated weather forecasts. Compared with the
DA-models where the dispatch is optimised by the TSO, line congestions can
further increase the volatility of prices.
10.2.4 Summary
To summarise the discussion above, feature selection is key to increase the
accuracy and speed of any electricity price forecasting models and more
specifically in the case computational intelligence models. To approach this,
researchers have tried filter, wrapper and embedded methods, each having their
advantages and drawbacks. Whereas filter methods such as mutual information
are model-free, scalable and can be apply non-linear dimensionality reduction,
they have a lower accuracy level. Embedded and wrapper methods, on the other
hand, can contribute to more accurate predictions, but can become too
computationally intensive. To cope with large dimensions of data, sparse
regularisation methods have been employed successfully, particularly LASSO and
elastic nets. However, they can still lack in performance for non-linear mappings.
Hybrid models are receiving increasing attention for their ability to aggregate the
inherent benefits of each method, for instance filter-wrapper methods.
Feature selection is particularly important in the context of intra-day trading,
given the large amounts of data available, the higher volatility of prices and the
impact of fundamental factors such as line congestions. As trade volumes are
being shifted to the intra-day market, research on the mechanisms in this market
are still lagging and the varying conclusions inferred by the models proposed in
literature suggest that there is a need to better understand the price dynamics.
Graphs have a reported value for enriching computational intelligence models,
114
10.3. MOTIVATIONS ON THE PROPOSED METHODOLOGY
115
CHAPTER 11
Introduction to feature selection
11.1 Background
Large-scale data with many variables, also called features, come with both
wealthy information and challenges in engineering modelling problems. Not only
do high-dimensional data cause high computational time and memory errors for
computing systems, they also lead to a problem called overfitting. This means
that the mathematical model is too closely fitted to many variables, where some
of them are noisy, inaccurate or irrelevant data. A consequence is that the model
is too complicated to analyse and generalise. It is therefore suitable to reduce the
number of features before the modelling phase, for example by removing the
redundant features and keeping the most representative ones.
Although feature selection can be done by empirical expert knowledge in the
respective fields, this approach can be loaded with personal biases, unclear
evaluation measures, time-consuming manual work and most of all uncertainty
when discriminating the importance of different features. Various methods for
automatic feature selection have been proposed to solve this challenge. Some
typical methods in traditional regression analysis are stepwise regression, Akaike
information criterion and principal component analysis [237]. In information
theory, features can be selected using informative fragments [238], conditional
infomax learning [239] and fast correlation-based filter [240], among many other
methods.
116
11.2. PURPOSE AND OUTLINE
117
CHAPTER 12
Preliminaries
118
12.2. NEAREST NEIGHBOUR GRAPH
119
12.3. GRAPH CLUSTERING
120
12.4. COMPARISON OF TWO CLUSTERINGS
˜ 1 Xm
ACC(C, C) = δ ỹi , Map(yi )
m i=1
where Map is the permutation mapping function which maps each cluster label
yi to an equivalent label in ỹ and the Kronecker delta function δ is defined as
1
if a = b
δ(a, b) =
0
if a 6= b
x∈ΩX
121
12.5. SIMILARITY COMPARISON OF TWO SETS
˜ = 2I(y, ỹ)
NMI(C, C)
H(y) + H(ỹ)
where I(y, ỹ) is the mutual information between y and ỹ and H(y) the
information entropy of y.
where E I(y, ỹ) denotes the expectation of the mutual information between C
˜
and C.
or in other words the ratio between the number of elements in common and the
number of different elements in total. This index is named after the French
professor of botany Paul Jaccard (1868–1944).
122
CHAPTER 13
Feature selection methods
Five graph-based methods for feature selection with different points of view are
presented in this chapter. Given a data set with m samples and n features, the
aim is to select the best k n features according to certain criteria. In
chronological order, the five methods are as follows:
1. Laplacian score (LS), proposed 2005 [249]: This method favours the
features which have large variances and preserve the local manifold
structure of the data.
2. Multi-cluster feature selection (MCFS), proposed 2010 [251]: This method
favours the features which preserve the multiple cluster structure of the
data. The optimisation problem involves spectral regression with `1 -norm
regularisation.
3. Non-negative discriminative feature selection (NDFS), proposed 2012 [252]:
This method favours the most discriminative features. The optimisation
problem involves non-negative spectral analysis and regression with
`2,1 -norm regularisation.
4. Feature selection via non-negative spectral analysis and redundancy control
(NSCR), proposed 2015 [253]: This method is an extension of NDFS which
even controls the redundancy between features. The optimisation problem
involves non-negative spectral analysis and regression with `2,q -norm
regularisation. Abbreviation: “NS” for “non-negative spectrum” and “CR”
123
for “constrained redundancy”.
5. Feature selection via adaptive similarity learning and subspace clustering
(SCFS), proposed 2019 [250]: This method favours the most discriminative
features which also preserve the similarity structure of the data. The
optimisation problem involves regression with `2,1 -norm regularisation.
Abbreviation: “SC” for “subspace clustering”.
Of all five methods, Laplacian score is the only one which contains neither
iterations, regression, regularisation nor clustering of data.
For simplicity, some notations will be used consistently in the descriptions of all
methods according to Table 13.1.
where the trace tr AT A is the sum of the main diagonal elements of AT A.
124
13.1. LAPLACIAN SCORE (LS)
13.1.1 Preparation
Create the p-nearest neighbour graph G = (V, E, W) with m vertices
corresponding to the m sample vectors x1 , x2 , · · · , xm ∈ Rn , for some own choice
of p. There are two ways to define the edge weight matrix W [245]:
1. Let Wij = 1 if the vertices vi and vj are adjacent and Wij = 0 otherwise.
2. Let
2
e−ωkxi −xj k
if vi and vj are adjacent
Wij = ψ(xi , xj ) =
0 otherwise
125
13.1. LAPLACIAN SCORE (LS)
2
shall be minimised. The weight Wij in the expression (fr )i − (fr )j Wij can be
interpreted as a kind of penalty to ensure the locality preserving ability of a
feature. To be more concrete, the closer xi and xj are to each other, meaning the
2
larger Wij , then the smaller (fr )i − (fr )j should be. With Var(fr ), denoting
the weighted variance of the feature fr , in the denominator, LS(fr ) is minimised
by maximising Var(fr ). This also allows the most representative features to be
selected.
Algebraic simplification: To simplify (?), use the re-writings
m 2 m h i2 h i2
(fr )i − (fr )j Wij = (fr )i + (fr )j − 2 (fr )i (fr )j Wij
X X
i,j=1 i,j=1
m h i2 m
=2 (fr )i Wij − 2 (fr )i Wij (fr )j = 2frT Dfr − 2frT Wfr = 2frT Lfr
X X
i,j=1 i,j=1
and #2
n h n
f T D1
"
i2
Var(fr ) ≈ (fr )i − µr Dii = (fr )i − rT
X X
Dii
i=1 i=1 1 D1
h iT
shown in [245], where 1 = 1 1 · · · 1 , D = diag(W1) and L = D − W is
the Laplacian matrix of the graph G.
Further improvement: As pointed out in [249], there is a risk that the vector
fr can be a non-zero constant vector such as 1. This leads to
frT Lfr = 1T L1 = 0
2
(fr )i − (fr )j
Pm
i,j=1 Wij 2frT Lfr
⇒ (?) : = =0
Var(fr ) Var(fr )
which gives no information about the feature fr . This problem can be avoided by
introducing
f T D1
f̃r = fr − rT 1
1 D1
which gives
f T Lf = f̃rT Lf̃r
r r
Var(fr )
≈ f̃rT Df̃r
126
13.2. MULTI-CLUSTER FEATURE SELECTION (MCFS)
f̃rT Lf̃r
LS(fr ) =
f̃rT Df̃r
where LS(fr ) stands for the Laplacian score of the feature fr . At this point, one
can note that a good feature fr has a low Laplacian score. An algorithm for
ranking the features according to their Laplacian scores is now ready to be
formulated.
frT D1
f̃r = fr − 1
1T D1
h iT
where 1 = 1 1 · · · 1 ∈ Rm and D = diag(W1).
4. Compute the Laplacian matrix L = D − W of the graph G and the
Laplacian score LS of each feature fr as
f̃rT Lf̃r
LS(fr ) =
f̃rT Df̃r
127
13.2. MULTI-CLUSTER FEATURE SELECTION (MCFS)
13.2.1 Preparation
A graph here is constructed in the same way as described in 13.1.1 for the
Laplacian score method. After obtaining a p-nearest neighbour graph and the
Laplacian matrix L = D − W, solve the generalised eigenvalue problem
Lu = λDu
to obtain a flat embedding for the data points. According to [245], the equation
Lu = λDu is derived from an optimisation problem of constructing a
representation for data lying on a low-dimensional manifold embedded in a
high-dimensional space, where local neighbourhood information is preserved.
Let U = [u1 , · · · , uκ ] be a matrix containing the eigenvectors of the above
mentioned eigenvalue problem corresponding to the κ smallest eigenvalues. Here,
κ can be interpreted as the insintric dimensionality of the data and each
eigenvector reflects how the data is distributed along the corresponding
dimension, or in other word the corresponding cluster [251]. If the number of
clusters of the data set is known, κ can be chosen to be this number.
where X is the data matrix, |ai | denotes the sum nr=1 (ai )r and β is some
P
128
13.3. NON-NEGATIVE DISCRIMINATIVE FEATURE SELECTION
(NDFS)
The k of n features which should be selected according to this approach are now
those with highest multi-cluster feature selection scores.
Lu = λDu
129
13.3. NON-NEGATIVE DISCRIMINATIVE FEATURE SELECTION
(NDFS)
13.3.1 Preparation
A p-nearest neighbour graph G is created as described in 13.1. In this proposed
method, instead of the Laplacian matrix L = D − W, the corresponding
normalised Laplacian matrix
L̂ = D−1/2 (D − W)D−1/2
is considered. This particular chosen version of the Laplacian matrix for the task
of feature selection is not motivated in [252]. However, it is pointed out in [243]
that L̂ in general has its theoretical advantages where many results can be
generalised to all graphs and not only regular graphs.
Define even a cluster indicator matrix Y ∈ {0, 1}m×κ based on a clustering of the
vertices in G, where κ is the number of clusters. Then, the corresponding scaled
cluster indicator matrix F ∈ Rm×κ is defined as
−1/2
F = Y YT Y
−1/2 −1/2
= YT Y YT Y YT Y
=I
over Y and Z ∈ Rn×κ for some parameters α and β, where the scaled cluster
label matrix F and the feature selection matrix Z are optimised simultaneously.
130
13.3. NON-NEGATIVE DISCRIMINATIVE FEATURE SELECTION
(NDFS)
The first term tr(FT L̂F) is about modelling the local structure of the data
manifold, in a similar manner to what is done in LS. This structure is important
for clustering the data points. The second term αkXZ − Yk2F is about
minimising the fitting error when the original data X is embedded on a
low-dimensional subspace with a transformation matrix Z. In the third term, the
`2,1 -norm ensures the row sparsity of Z to filter out noisy features. If the feature
fj is not highly correlated to the pseudo cluster labels described by F, it is likely
that the row zj will be shrunk to the zero vector if β is large enough.
Since F is a cluster indicator matrix, it is necessary that F ≥ 0. This gives now
the optimisation problem
s. t. FT F = I and F ≥ 0
+ γkFT F − Ik2F
s. t. F≥0
131
13.3. NON-NEGATIVE DISCRIMINATIVE FEATURE SELECTION
(NDFS)
s. t. F ≥ 0
132
13.3. NON-NEGATIVE DISCRIMINATIVE FEATURE SELECTION
(NDFS)
(2γFt )ij
ij = Fij
Ft+1 t
Mt Ft + 2γFt (Ft )T Ft
ij
−1
Zt+1 = Et XT Ft+1
1
2kzt1 k
..
G t+1
=
.
1
2kztn k
4. Select the k features with the highest values kzi k, where i = 1, · · · , n and zi
denotes the i-th row of Z.
133
13.4. FEATURE SELECTION VIA NON-NEGATIVE SPECTRAL
ANALYSIS AND REDUNDANCY CONTROL (NSCR)
i,j=1
s. t. F≥0
for some 0 < q ≤ 1. Even though this choice may seem strange and complicated,
the `2,q -norm for regularisation leads to a better sparse matrix than the usual
134
13.4. FEATURE SELECTION VIA NON-NEGATIVE SPECTRAL
ANALYSIS AND REDUNDANCY CONTROL (NSCR)
Because of the matrices C and H in this method, the elements in M may have
mixed signs and affect the sign of the elements in F, which in turn may violate
the constraint F ≥ 0. An intervention can be made [253] by introducing the
auxiliary matrices M+ and M− where
Mij + Mij Mij − Mij
(M+ )ij = and (M− )ij =
2 2
These matrices satisfy M = M+ − M− and the updating rule can be written as
(M− F + 2γF)ij
Fij ← Fij
(M+ F + 2γFFT F)ij
The whole process can now be summed up in an algorithm.
135
13.4. FEATURE SELECTION VIA NON-NEGATIVE SPECTRAL
ANALYSIS AND REDUNDANCY CONTROL (NSCR)
Mt + Mt
Mt+ =
2
M − Mt
t
Mt− =
2
Mt− Ft + 2γFt
ij
ij = Fij
Ft+1 t
Mt+ Ft + 2γFt (F ) Ft
t T
ij
−1
Zt+1 = Et XT Ft+1
q
2−q
2kzt+1
1 k
..
G t+1
= .
q
2−q
2kzt+1
n k
P
j (zt+1 )j C1j
1 k
2kzt+1
..
Ht+1 = .
(zt+1 )j
P
j
Cnj
2kzt+1
n k
5. Select the k features with the highest values kzi k, where i = 1, · · · , n and zi
136
13.5. FEATURE SELECTION VIA ADAPTIVE SIMILARITY
LEARNING AND SUBSPACE CLUSTERING (SCFS)
+ γkFFT 1 − 1k2F
s. t. F≥0
(2N + αXZ)ij
Fij ← Fij
(NFT F + FFT N + αF)ij
where N = XXT + mγ1 F.
137
13.5. FEATURE SELECTION VIA ADAPTIVE SIMILARITY
LEARNING AND SUBSPACE CLUSTERING (SCFS)
and update
1
2kzt+1
1 k
..
G t+1
=
.
1
2kzt+1
n k
3. Select the k features with the highest values kzi k, where i = 1, · · · , n and zi
denotes the i-th row of Z.
138
CHAPTER 14
Experiments and results
139
14.2. DATA SETS
Data set m n
ALLAML 72 7129
BASEHOCK 1993 4862
GLIOMA 50 4434
LSOLET 2600 500
LYMPHOMA 96 4026
NCI9 60 9712
PIE10P 210 2420
PROSTATE 102 5966
CELEBI 941 442 576
Table 14.1: Data sets with their numbers of samples (m) and features (n).
140
14.4. EXPERIMENT: EIGHT PUBLIC DATA SETS
Since the k-means clustering algorithm returns a different clustering each session
due to the inherent random choice of start value, 50 clusterings C˜ are generated
for computing the three clustering quality measures. After that, the mean and
the standard deviation of each measure are computed.
Because the optimal number of selected features k is normally unknown in
practice, an empirical grid search is performed with the value set {50, 100,
150, 200, 250, 300} for k in case of public data sets and the value set
{25, 50, 75, 100, 125, 150, 175, 200} in case of CELEBI as CELEBI has much fewer
features than seven of the public data sets. After that, the k-values
corresponding to the largest ACC, NMI and AMI respectively are reported.
141
14.5. EXPERIMENT: CELEBI OF FORTUM
14.4.3 Stability
The result is shown in Figure B.1. Except for LSOLET, none of the other seven
data sets have a Jaccard index close to 80 %. This is not surprising [?]
considering the fact that the other seven data sets have high amounts of features
(thousands) and six of them have relatively low amounts of samples (below 250).
A low Jaccard index does not necessarily mean that a feature selection algorithm
is not stable. The number of samples and features also have a role. This
experiment shows empirically that a high Jaccard index can be obtained if a data
set has many samples (over a thousand) and relatively few (less than a thousand)
samples.
142
14.5. EXPERIMENT: CELEBI OF FORTUM
Just as in the case with the eight public data sets, the values of ACC, NMI and
AMI for all 48 subsets increase when fewer features are used. Figure 14.1
suggests that the most influential features for the electricity price vary the most
during the months January, February and March and the least during the
months April, May and June.
This experiment shows that the SCFS algorithm can prove to be a good
candidate for feature selection of large intra-day data sets with both endogenous
and exogenous parameters. The variation in the selected features suggests that
shorter or longer time intervals for selecting features are preferable depending on
the period of the year.
143
CHAPTER 15
Discussion
φt − φt−1
relative change = <θ
φt
for the three methods NDFS, NSCR and SCFS affects the number of necessary
iterations. An experiment is done where all three methods are executed for 15
iterations on the eight public data sets and how the relative change varies is
observed. The result is shown in Figure B.2.
As observed from the plots, the relative change in SCFS has a tendency to
decrease linearly after four iterations and seems to be able get as low as 10−10 .
Meanwhile, the relative changes in NDFS and NSCR do not seem to be able to
come down to 10−8 . It is therefore important to keep in mind that a too low
value on θ may cause some iterative methods to go on endlessly. This problem
can be easily solved by setting a limit on the number of iterations, such as 15.
All three methods seem to converge quickly, where the relative change can get
down to about 10−4 within 10 iterations.
144
15.2. PARAMETER SENSITIVITY
145
15.4. CONCEIVABLE CHALLENGES
feature selections are essentially different or that the feature selection algorithm
is unstable. A more suitable measure than the Jaccard index can be symmetrical
uncertainty [?] which accounts for the information similarity of the features and
not only the indices of these.
146
15.6. SCALABILITY
15.6 Scalability
An interesting point to raise is the potential scalability of this model, both
within the trading sector and even in others. As already discussed at the end of
Part I, an inclusion of other parameters from energy markets and from the grid,
is conceivable. In the use-case gathering, feature selection was included in
maintenance for hydropower, maintenance for nuclear power, maintenance for
wind-power, as it is technically suitable on any time series data or for almost any
classification purposes. As filter-based feature selection has the advantage of
being model-free and faster than the alternatives, it can technically be
generalisable to more domain areas as the pre-processing phase of computational
or statistical models in presence of large scale or chaotic data. A proper
evaluation of the actual scalability of graph-based feature selection methods
compared with industry standards would be necessary to confirm the hypothesis.
147
CHAPTER 16
Research conclusion
148
possible bias in decision making. Graph-based feature selection methods having
the ability to reduce the dimensionality of data sets by preserving the geometric
structure have been chosen in order to tackle the challenges brought about by
the sometimes non-linear impact of variables on the electricity prices as well as
the high dimensionality of the data sets. They were applied after obtaining the
nearest neighbour graph from the observations in the intra-day time series.
The presented methods LS, MCFS, NDFS, NSCR and SCFS work fast with data
sets of high feature dimension and low sample dimension. However, it takes
considerably more time with data sets of high sample dimension despite low
feature dimension. Some metrics such as clustering accuracy, normalised mutual
information, adjusted mutual information and the Jaccard index can be used to
evaluate and compare the selected features. Because of its high performance in
experiments on benchmark, open-source data sets SCFS was applied to intra-day
data consisting in over 500 features. Its higher scores than the Baseline (original)
data set on the all three metrics suggest that the reduced data set could improve
forecasting accuracy.
Prototypical regression models can be built after having the number of original
features reduced. From this, the quality of the models can be assessed with
respect to prediction errors, which may even give a hint about whether potential
sets of already selected features need some adjustments. The independence of the
feature selection and its evaluation from the forecasting model allows to isolate
the performance of the graph-based method. Thus, this approach can be
generalised to more applications benefiting from feature selection.
For effectiveness purposes, one should consider splitting the data, utilising
graphics processing units or looking into other computing methods suitable for
large-scale data as a complement. Another direction is to improve graph-based
feature selections by customising some functions and measures depending on the
characteristics of the data set and what the problem solver wishes to achieve. For
example, alternative ways for computing the similarity matrix, for clustering the
data points and for comparing selected features in case of split data are
interesting subjects.
149
Bibliography
150
BIBLIOGRAPHY
[8] Tsafos, Nikos (2020). How will natural gas fare in the eneergy transition.
Center for strategic and international studies (CSIS). https://csis-prod.
s3.amazonaws.com/s3fs-public/publication/200114_Tsafos_How_Will_
Natural_Gas_Fare.pdf?_oyuJJTiixKn0sDY1y7ZeQ3c3C5ijhnG
[9] Burke, Paul J., Abayasekera, Ashani (2017). The price elasticity of electricity
demand in the United States: A three-dimensional analysis. Center for
Applied Macroeconomic Analysis (CAMA), Crawford School of Public Policy.
https:
//cama.crawford.anu.edu.au/sites/default/files/publication/cama_
crawford_anu_edu_au/2017-08/50_2017_burke_abayasekara_0.pdf
[10] Wagner, Marcus (2011). To explore or to exploit? An empirical investigation
of acquisitions by large incumbents. Research Policy, Volume 40, Issue 9,
November 2011, Pages 1217-1225. doi.org/10.1016/j.respol.2011.07.006
[11] Albadi, M.H. ; El-Saadany, E.F. (2007). Demand Response in Electricity
Markets: An Overview.
[12] Poullikas, Andreas (2013). A comparative assessment of net metering and
feed in tariff schemes for residential PV systems. Sustainable Energy
Technologies and Assessments 3:1–8. DOI: 10.1016/j.seta.2013.04.001.
[13] International Energy Agency (IEA) (2019). World Energy Outlook 2019.
[14] Staffell, Iain ; Pfenninger, Stefan (2017). The increasing impact of weather
on electricity supply and deemand. Energy, Volume 145, 15 February 2018,
Pages 65-78. doi.org/10.1016/j.energy.2017.12.051.
[15] International Energy Agency (2017). Digitalization & Energy.
http://www.iea.org/publications/freepublications/publication/
DigitalizationandEnergy3.pdf.
[16] Bloomberg, J. (2018) Digitization, Digitalization, And Digital
Transformation: Confuse Them At Your Peril. Forbes Magazine
https://www.forbes.com/sites/jasonbloomberg/2018/04/29/
digitization-digitalization-and-digital-transformation-confuse-them-at-your-per
#7b166c3d2f2c.
[17] Saunders, M. N. K., Lewis, P., & Thornhill, A. (2009). Research methods for
business students (5th ed). Prentice Hall.
151
BIBLIOGRAPHY
152
BIBLIOGRAPHY
153
BIBLIOGRAPHY
[38] Brem, A. ; Voigt, K.-I. (2007). Integration of Market Pull and Technology
Push in the Corporate Front End and Innovation Management - Insights
from the German Software Industry. Technovation 29(5):351-367. DOI:
10.1016/j.technovation.2008.06.003
[39] Dean, D.L. ; Hender, J.M. ; Rodgers, T.L. ; Santanen, E.L. (2006).
Identifying quality, novel, and creative ideas: Constructs and scales for idea
evaluation. Journal of the Association for Information Systems, 7 (10) (2006),
pp. 646-698.
[40] Fortum (2019). CEO’s Business Review.
[41] Diestel, R. (2016). Graph Theory, 5th edition.
[42] Bondy, J. & Murty, U. (1982). Graph Theory with Applications, 5th edition.
[43] Wilson, R. (1998). Introduction to Graph Theory, 4th edition.
[44] Nandhini, V. (2019). A study on course timetable scheduling and exam
timetable scheduling using graph coloring approach. International Journal for
Research in Applied Science and Engineering Technology, 7(3), 1999–2006.
doi: 10.22214/ijraset.2019.3368
[45] Imhof, K., & Arias, C. (1990). On the use of colours for presenting power
system connectivity information. Conference Papers Power Industry
Computer Application Conference. doi: 10.1109/pica.1989.38989
[46] Yu, H., Han, X., Ma, Y. & Xing, X. (2016). Transmission line maintenance
scheduling based on graph coloring. Proceedings of the 2016 International
Conference on Education, Management, Computer and Society. doi:
10.2991/emcs-16.2016.37
[47] Anwar, A., & Mahmood, A. N. (2016). Anomaly detection in electric
network database of smart grid: Graph matching approach. Electric Power
Systems Research, 133, 51–62. doi: 10.1016/j.epsr.2015.12.006
[48] Elmrabet, Z., Elghazi, H., Kaabouch, N. & Elghazi, H. (2018).
Cyber-security in smart grid: Survey and challenges. Computer and Electrical
Engineering, 67. doi: 10.1016/j.compeleceng.2018.01.015
154
BIBLIOGRAPHY
[49] Mookiah, L., Dean, C. & Eberle, W. (2017). Graph-based anomaly detection
on smart grid data. flairs conference. Proceedings of the Thirtieth
International Florida Artificial Intelligence Research Society Conference.
[50] Mohamad, H., Faezy Wan Zalnidzham, W., Ashida Salim, N., Shahbudin,
S., & Mat Yasin, Z. (2019). Power system restoration in distribution network
using minimum spanning tree - Kruskal’s algorithm. Indonesian Journal of
Electrical Engineering and Computer Science, 16 (1). doi:
10.11591/ijeecs.v16.i1.pp1-8
[51] Zhang, Y. (2009). Stock market network topology analysis based on a
minimum spanning tree approach. Bowling Green State University.
[52] Lautier, D., Ling, J., & Raynaud, F. (2014). Systemic risk in commodity
markets: What do trees tell us about crises? SSRN Electronic Journal. doi:
10.2139/ssrn.2430167
[53] Erickson, J. (2019). Algorithms. Retrieved: 2020.05.29.
https://jeffe.cs.illinois.edu/teaching/algorithms
[54] Takaoka, T. (1998). Shortest path algorithms for nearly acyclic directed
graphs. Theoretical Computer Science, 203 (1), 143–150. doi:
10.1016/s0304-3975(97)00292-2
[55] Fortum. (2020). Laddinfrastruktur för elfordon med Fortum Charge &
Drive. Retrieved: 2020.05.29.
https://www.fortum.se/foretag/ladda-elbil/om-fortum-charge-drive
[56] Baum, M., Dibbelt, J., Gemsa, A., Wagner, D., & Zündorf, T. (2019).
Shortest feasible paths with charging stops for battery electric vehicles.
Transportation Science, 53 (6), 1627–1655. doi: 10.1287/trsc.2018.0889
[57] Parastvand, H., Moghaddam, V., Bass, O., Masoum, M. A. S., Chapman,
A., & Lachowicz, S. (2020). A graph automorphic approach for placement
and sizing of charging stations in EV network considering traffic. IEEE
Transactions on Smart Grid, 1–1. doi: 10.1109/tsg.2020.2984037
[58] Jia, L., Hu, Z., Song, Y., & Luo, Z. (2012). Optimal siting and sizing of
electric vehicle charging stations. 2012 IEEE International Electric Vehicle
Conference. doi: 10.1109/ievc.2012.6183283
155
BIBLIOGRAPHY
[59] Chauhan, V., Gutfraind, A., & Safro, I. (2019). Multiscale planar graph
generation. Applied Network Science, 4 (1). doi: 10.1007/s41109-019-0142-3
[60] Chauhan, V. (2018). Planar graph generation with application to water
distribution networks. Clemson University. doi: 10.13140/RG.2.2.18915.81445
[61] Ansari, M. H., Vakili, V. T., Bahrak, B., & Tavassoli, P. (2018). Graph
theoretical defense mechanisms against false data injection attacks in smart
grids. Journal of Modern Power Systems and Clean Energy, 6 (5), 860–871.
doi: 10.1007/s40565-018-0432-2
[62] Nussbaum, Y. (2014). Network flow problems in planar graphs/ Yahav,
Nussbaum. Tel-Aviv: Universitat Tel-Aviv.
[63] Fujie, F., & Zhang, P. (2014). Covering walks in graphs. SpringerBriefs in
Mathematics. doi: 10.1007/978-1-4939-0305-4
[64] Leite, J. B., & Mantovani, J. R. S. (2016). Distribution system state
estimation using the Hamiltonian cycle theory. 2016 IEEE Power and Energy
Society General Meeting (PESGM). doi: 10.1109/pesgm.2016.7741360
[65] Robinson, I., Webber, J. & Eifrem, E. (2015). Graph databases. Sebastopol,
CA: O’Reilly & Associates.
[66] Diendorfer, C., Haslwanter, J. D. H., Stanovich, M., Schoder, K.,
Sloderbeck, M., Ravindra, H., & Steurer, M. (2017). Graph traversal-based
automation of fault detection, location, and recovery on MVDC shipboard
power systems. 2017 IEEE Second International Conference on DC
Microgrids (ICDCM). doi: 10.1109/icdcm.2017.8001032
[67] Liao, H., Mariani, M., Medo, M., Zhang, Y. & Zhou, M. (2017). Ranking in
evolving complex networks. Physics Reports, 689. doi:
10.1016/j.physrep.2017.05.001
[68] Kiss, C., & Bichler, M. (2008). Identification of influencers - Measuring
influence in customer networks. Decision Support Systems, 46 (1), 233–253.
doi: 10.1016/j.dss.2008.06.007
[69] Yada, K., Motoda, H., Washio, T., & Miyawaki, A. (2006). Consumer
behavior analysis by graph mining technique. New Mathematics and Natural
Computation, 02 (01), 59–68. doi: 10.1142/s1793005706000294
156
BIBLIOGRAPHY
[70] Green, T. & Hartley, N. (2015). Using graph theory to value paying and
nonpaying customers in a social network: Linking customer lifetime value to
word-of-mouth social value. Journal of Relationship Marketing, 14 (4). doi:
10.1080/15332667.2015.1095008
[71] Sami, M. (2018). Effective placement of sensors for efficient early warning
system in water distribution network. Chalmers University of Technology.
[72] Simone, A., Ridolfi, L., Laucelli, D., Berardi, L., & Giustolisi, O. (2018).
Centrality metrics for water distribution networks. doi: 10.29007/7lxd
[73] Hochbaum, D. S. (2006). Ranking sports teams and the inverse equal paths
problem. Lecture Notes in Computer Science Internet and Network
Economics, 307–318. doi: 10.1007/11944874_28
[74] Vigna, S. (2019). Spectral ranking. Retrieved: 2020.05.29.
https://arxiv.org/pdf/0912.0238.pdf
[75] Darvish, M., Yasaei, M., & Saeedi, A. (2009). Application of the graph
theory and matrix methods to contractor ranking. International Journal of
Project Management, 27 (6), 610–619. doi: 10.1016/j.ijproman.2008.10.004
[76] Hakimi-Asl, A., Amalnick, M. S., & Hakimi-Asl, M. (2016). Proposing a
graph ranking method for manufacturing system selection in high-tech
industries. Neural Computing and Applications, 29 (1), 133–142. doi:
10.1007/s00521-016-2420-7
[77] Nian, K., Zhang, H., Tayal, A., Coleman, T. & Li, Y. (2016). Unsupervised
spectral ranking for anomaly and application to auto insurance fraud
detection. The Journal of Finance and Data Science, 2 (1). doi:
10.1016/j.jfds.2016.03.001
[78] Lempitsky, V., Blake, A., & Rother, C. (2012). Branch-and-mincut: Global
optimization for image segmentation with high-level priors. Journal of
Mathematical Imaging and Vision, 44 (3), 315–329. doi:
10.1007/s10851-012-0328-0
[79] Sen, A., Ghosh, P., Vittal, V., & Yang, B. (2009). A new min-cut problem
with application to electric power network partitioning. European
Transactions on Electrical Power, 19(6), 778–797. doi: 10.1002/etep.255
157
BIBLIOGRAPHY
[80] Sun, K., Hou, Y., Sun, W. & Qi, J. (2019). Power system control under
cascading failures: Understanding, mitigation, and system restoration.
Wiley-IEEE Press.
[81] Sou, K. C., Sandberg, H., & Johansson, K. H. (2011). Electric power
network security analysis via minimum cut relaxation. IEEE Conference on
Decision and Control and European Control Conference. doi:
10.1109/cdc.2011.6160456
[82] Salem, R., Moneim, W. & Hassan, M. (2019). Graph mining techniques for
graph clustering: Starting point. Journal of Theoretical and Applied
Information Technology, 97 (15).
[83] Ehteram, M., Binti Koting, S., Afan, H. A., Mohd, N. S., Malek, M. A.,
Ahmed, A. N., El-shafie, A. H., Onn, C. C., Lai, S. H., & El-Shafie, A.
(2019). New Evolutionary Algorithm for Optimizing Hydropower Generation
Considering Multireservoir Systems. Applied Sciences, 9(11), 2280.
https://doi.org/10.3390/app9112280.
[84] Stoll, B., Andrade, J., Cohen, S., Brinkman, G., & Brancucci
Martinez-Anido, C. (2017). Hydropower Modeling Challenges
(NREL/TP–5D00-68231, 1353003; p. NREL/TP–5D00-68231, 1353003).
https://doi.org/10.2172/1353003.
[85] Liu, P., Nguyen, T.-D., Cai, X., & Jiang, X. (2012). Finding Multiple
Optimal Solutions to Optimal Load Distribution Problem in Hydropower
Plant. Energies, 5(5), 1413–1432. https://doi.org/10.3390/en5051413.
[86] Fortum Interviewee M (2020).
[87] International Hydropower Association (2020). 2020 Hydropower Status
Report. https://www.hydropower.org/country-profiles/region-europe
[88] Fortum (2019). Financials 2019.
[89] Moeini, R., & Afshar, M. H. (2011). Arc-based constrained ant colony
optimisation algorithms for the optimal solution of hydropower reservoir
operation problems. 38, 14.
[90] Johnson, M. P., Rybalov, L., Zhao, L., & Bald, S. (2019). Maximizing power
production in path and tree riverine networks. Sustainable Computing:
158
BIBLIOGRAPHY
159
BIBLIOGRAPHY
[100] Selak, L., Butala, P., Sluga, A. (2014). Condition monitoring and fault
diagnostics for hydropower plants. Computers in Industry 65 (2014), pp.
924–936.
[101] Åstrand, S., (2008). Environmental Effects of Turbine Oil Spills from
Hydro Power Plants to Rivers, Stockholm: The Royal Institute of Technology.
[102] Ma, J., & Jiang, J. (2009). Applications of Fault Diagnosis in Nuclear
Power Plants: An Introductory Survey. IFAC Proceedings Volumes, 42(8),
1150–1161. https://doi.org/10.3182/20090630-4-ES-2003.00189.
[103] HINES, J. W., & DAVISH, E. (n.d.). LESSONS LEARNED FROM THE
U.S. NUCLEAR POWER PLANT ON-LINE MONITORING PROGRAMS.
14.
[104] Dabrowski, C., Hunt F., (2011). Identifying failure scenarios in complex
systems by perturbing markov chain models. Proceedings of the 2011
Pressure Vessels & Piping Division (PVPD) Conference July 7-11, 2011,
Baltimore, MD, USA.
[105] Wu, M., Liu, Y., Wen, Z., Peng, M., Xia, H., Li, W., Wang, H., Abiodun,
A., & Yu, W. (2017). Fault diagnosis and severity estimation in nuclear power
plants.
[106] Chen, J., Zhou, T., & Ran, K. (2010). The Energy-Saving Diagnosis of
PWR Nuclear Power Station Based on the Thermo-Economic Analysis
Model. 18th International Conference on Nuclear Engineering: Volume 1,
137–144. https://doi.org/10.1115/ICONE18-29454.
[107] Ashaari, A., Ahmad, T., Shamsuddin, M., Mohamad, W. M. W., & Omar,
N. (2015). Graph Representation for Secondary System of Pressurized Water
Reactor with Autocatalytic Set Approach. Journal of Mathematics and
Statistics, 11(4), 107–112. https://doi.org/10.3844/jmssp.2015.107.112.
[108] Yamada, Y., & Teraoka, Y. (1998). An optimal design of piping route in a
CAD system for power plant. Computers & Mathematics with Applications,
35(6), 137–149. https://doi.org/10.1016/S0898-1221(98)00025-X.
[109] Al-Masri, A. (2019). How Does Back-Propagation in Artificial Neural
Networks Work? Towards Data Science (Medium). shorturl.at/lwxyI.
160
BIBLIOGRAPHY
[110] Baraldi, P., Di Maio, F., Rigamonti, M., Zio, E., & Seraoui, R. (2015).
Unsupervised clustering of vibration signals for identifying anomalous
conditions in a nuclear turbine. Journal of Intelligent & Fuzzy Systems, 28(4),
1723–1731. https://doi.org/10.3233/IFS-141459.
[111] Chun-Ling Dong, Zhang, Q., & Yue Zhao. (2013). An adaptive decision
method using structure feature analysis on dynamic fault propagation model.
2013 International Conference on Machine Learning and Cybernetics,
664–669. https://doi.org/10.1109/ICMLC.2013.6890373.
[112] Dutta, S., & Overbye, T. J. (2012). Optimal Wind Farm Collector System
Topology Design Considering Total Trenching Length. IEEE Transactions on
Sustainable Energy, 3(3), 339–348.
https://doi.org/10.1109/TSTE.2012.2185817.
[113] Pouyaei, A., Golgouneh, A., & Kholghi, B. (2019). A New Local Searching
Approach for Wind Farm Layout Optimization Problem. 2019 IEEE 2nd
International Conference on Renewable Energy and Power Engineering
(REPE), 114–119. https://doi.org/10.1109/REPE48501.2019.9025115.
[114] Esfahanian, V.; Pour, A.S.; Harsini, I.; Haghani, A.; Pasandeh, R.;
Shahbazi, A.; Ahmadi, G. Numerical analysis of flow field around NREL
Phase II wind turbine by a hybrid CFD/BEM method. J. Wind Eng. Ind.
Aerod. 2013, 120, 29–36.
[115] Xu, C., Chen, D., Han, X., Pan, H., & Shen, W. (2016). The Collection of
The Main Issues for Wind Farm Optimisation in Complex Terrain. Journal of
Physics: Conference Series, 753, 032066.
https://doi.org/10.1088/1742-6596/753/3/032066.
[116] Li, J., Hu, W., Wu, X., Huang, Q., Liu, Z., Chen, Z., & Blaabjerg, F.
(2019). A Hybrid Cable Connection Structure for Wind Farms With
Reliability Consideration. IEEE Access, 7, 144398–144407.
https://doi.org/10.1109/ACCESS.2019.2944888.
[117] Dahmani O, Bourguet S, Machmoum M et al (2015) Optimization of the
connection topology of an offshore wind farm network. IEEE Syst J
9(4):1519–1528.
[118] Loganathan, M.K., Besbaurah, I., Gandhi, O.P., Borah, R.C. (2018).
Criticality analysis of wind turbine energy system using fuzzy digraph models
161
BIBLIOGRAPHY
162
BIBLIOGRAPHY
[129] Baum, M., Dibbelt, J., Pajor, T., Sauer, J., Wagner, D., & Zündorf, T.
(2020). Energy-Optimal Routes for Battery Electric Vehicles. Algorithmica,
82(5), 1490–1546. https://doi.org/10.1007/s00453-019-00655-9.
[130] Campana, M., & Inga, E. (2019). Optimal Allocation of Public Charging
Stations based on Traffic Density in Smart Cities. 2019 IEEE Colombian
Conference on Applications in Computational Intelligence (ColCACI), 1–6.
https://doi.org/10.1109/ColCACI.2019.8781986.
[131] Phonrattanasak, P., & Leeprechanon, N. (n.d.). Optimal placement of EV
fast charging stations considering the impact on electrical distribution and
traffic condition.
[132] Raagapriya, S., Ain, A., & Dasgupta, P. (2017). Route optimization for an
electric vehicle with priority destinations. 2017 International Conference On
Smart Technologies For Smart Nation (SmartTechCon), 1244–1249.
https://doi.org/10.1109/SmartTechCon.2017.8358565.
[133] Fortum Interviewee G (2020).
[134] Fortum Interviewee J (2020).
[135] Allied Market Research (2020). Electric Vehicle Charger Market by Vehicle
Type (Battery Electric Vehicle (BEV), Plug-in Hybrid Electric Vehicle
(PHEV), and Hybrid Electric Vehicle (HEV)), Charging Type (On-board
Chargers, and Off-board Chargers), and End User (Residential and
Commercial): Global Opportunity Analysis and Industry Forecast, 2020-2027.
[136] Baker, T., Aibino, S., Belsito, E., Aubert, G., Sahoo, A. (2019). Electric
Vehicles Are a Multibillion-Dollar Opportunity for Utilities. Published by
Boston Consulting Group (BCG).
https://www.bcg.com/publications/2019/
electric-vehicles-multibillion-dollar-opportunity-utilities.
aspx.
[137] Schwenk, K., Faix, M., Mikut, R., Hagenmeyer, V., & Appino, R. R.
(2019). On Calendar-Based Scheduling for User-Friendly Charging of Plug-In
Electric Vehicles. 2019 IEEE 2nd Connected and Automated Vehicles
Symposium (CAVS), 1–5. https://doi.org/10.1109/CAVS.2019.8887782.
163
BIBLIOGRAPHY
164
BIBLIOGRAPHY
[148] Ding, T., Sun, H., Sun, K., Li, F., & Zhang, X. (2015). Graph theory based
splitting strategies for power system islanding operation. 2015 IEEE Power
Energy Society General Meeting, 1–5.
https://doi.org/10.1109/PESGM.2015.7286392.
[149] Hines, P., & Blumsack, S. (2008). A Centrality Measure for Electrical
Networks. Proceedings of the 41st Annual Hawaii International Conference on
System Sciences (HICSS 2008), 185–185.
https://doi.org/10.1109/HICSS.2008.5.
[150] H., P., L., W., Georgiadis, G., Papatriantafilou, M., A., L., & Bertling, L.
(2012). Application of the Graph Theory in Managing Power Flows in Future
Electric Networks. In Y. Zhang (Ed.), New Frontiers in Graph Theory.
InTech. https://doi.org/10.5772/35527.
[151] Abrams, J. (2017). Analysis of Equity Markets: A Graph Theory
Approach. SIAM Undergraduate Research Online, 10.
https://doi.org/10.1137/16S015632.
[152] Lautier, D., & Raynaud, F. (2012). Systemic risk in energy derivative
markets: A graph theory analysis.
[153] Liu, S., Huang, S., Chi, Y., Feng, S., Li, Y., & Sun, Q. (2020). Three-level
network analysis of the North American natural gas price: A multiscale
perspective. International Review of Financial Analysis, 67, 101420.
https://doi.org/10.1016/j.irfa.2019.101420.
[154] Park, H., Mjelde, J. W., & Bessler, D. A. (2006). Price dynamics among
U.S. electricity spot markets. Energy Economics, 28(1), 81–101.
https://doi.org/10.1016/j.eneco.2005.09.009.
[155] Shimizu, S., Hoyer, P. O., Hyvarinen, A., & Kerminen, A. (n.d.). A Linear
Non-Gaussian Acyclic Model for Causal Discovery.
[156] Fortum Interviewee L (2020).
[157] Ackerman, E., Fox, J., Pach, J., & Suk, A. (2014). On grids in topological
graphs. Computational Geometry, 47(7), 710–723.
https://doi.org/10.1016/j.comgeo.2014.02.003.
[158] Cuadra, L., Pino, M., Nieto-Borge, J., & Salcedo-Sanz, S. (2017).
Optimizing the Structure of Distribution Smart Grids with Renewable
165
BIBLIOGRAPHY
166
BIBLIOGRAPHY
167
BIBLIOGRAPHY
168
BIBLIOGRAPHY
[188] Yu, L. (2013). Visibility graph network analysis of gold price time series,
Physica A 392 (16) (2013) 3374–3384.
[189] Zhang, B., Wang, J., Fang, W. (2015). Volatility behavior of visibility
graph EMD financial time series from Ising interacting system, Physica A 432
(2015) 301–314.
[190] Yang, Y., Wang, J., Yang, H., Mang, J. (2009). Visibility graph approach
to exchange rate series, PhysicaA 388 (20) (2009) 4431–4437.
[191] Alzate, C., & Sinn, M. (2013). Improved Electricity Load Forecasting via
Kernel Spectral Clustering of Smart Meters. 2013 IEEE 13th International
Conference on Data Mining, 943–948.
https://doi.org/10.1109/ICDM.2013.144.
[192] Gajowniczek, K., & Ząbkowski, T. (2018). Simulation Study on Clustering
Approaches for Short-Term Electricity Forecasting. Complexity, 2018, 1–21.
https://doi.org/10.1155/2018/3683969.
[193] Verma, D., & Meila, M. (2003). A comparison of spectral clustering
algorithms. University of Washington, uw-cse-03-05-01.
[194] Zhang, X., Wang, J., & Gao, Y. (2019). A hybrid short-term electricity
price forecasting framework: Cuckoo search-based feature selection with
singular spectrum analysis and SVM. Energy Economics, 81, 899–913.
https://doi.org/10.1016/j.eneco.2019.05.026.
[195] Fortum Interviewee D (2020).
[196] Casper, W. R., & Nadiga, B. (2017). A New Spectral Clustering
Algorithm. ArXiv:1710.02756 [Physics]. http://arxiv.org/abs/1710.02756.
[197] Bolón-Canedo, V., Rego-Fernández, D., Peteiro-Barral, D.,
Alonso-Betanzos, A., Guijarro-Berdiñas, B., & Sánchez-Maroño, N. (2018).
On the scalability of feature selection methods on high-dimensional data.
Knowledge and Information Systems, 56(2), 395–442.
https://doi.org/10.1007/s10115-017-1140-3.
[198] Amjday, N., Hemmati, M. (2006). Energy price forecasting problems and
proposals for such predictions. IEEE Power & energy magazine, March/April
2006.
169
BIBLIOGRAPHY
170
BIBLIOGRAPHY
[209] Bengio, Y., Courville, A., & Vincent, P. (2014). Representation Learning:
A Review and New Perspectives. ArXiv:1206.5538 [Cs].
http://arxiv.org/abs/1206.5538.
[210] Conejo, A.J.; Contreras, J.; Espínola, R.; Plazas, M.A. Forecasting
electricity prices for a day-ahead pool-based electric energy market. Int. J.
Forecast. 2005, 21, 435–462.
[211] Tan, Z., Zhang, J., Wang, J., Xu, J., 2010. Day-ahead electricity price
forecasting using wave- let transform combined with ARIMA and GARCH
models. Appl. Energy 87 (11), 3606–3610.
[212] Lago, J., Ridder, F.D., Schutter, B.D., 2018a. Forecasting spot electricity
prices: deep learn- ing approaches and empirical comparison of traditional
algorithms. Appl. Energy 221, 386–405.
[213] Singh, N., Mohanty, S.R., Shukla, R.D., 2017. Short term electricity price
forecast based on environmentally adapted generalized neuron. Energy 125,
127–139.
[214] Che, J., Wang, J., 2010. Short-term electricity prices forecasting based on
support vector regression and auto-regressive integrated moving average
modeling. Energy Conversion & Management 51 (10), 1911–1917.
[215] Catalao, J.P.S., Pousinho, H.M.I., Mendes, V.M.F. (2011). Hybrid
wavelet-PSO-ANFIS approach for short-term electricity prices forecasting.
IEEE Trans. Power Syst. 26 (1), 137–144.
[216] Stevenson M. (2001). Filtering and forecasting spot electricity prices in the
increasingly deregulated Australian electricity market. QFRC research paper
series, no. 63. Quantitative Finance Research Centre, University of
Technology, Sydney; 2001.
[217] Cruz, A., Muñoz, A., Zamora, J., Espínola, R. (2011). The effect of wind
generation and week-day on Spanish electricity spot price forecasting. Electr.
Power Syst. Res. 81 (10), 1924–1935.
[218] Rodriguez, C.P., Anders, G.J., 2004a. Energy price forecasting in the
Ontario competitive power system market. IEEE Trans. Power Syst. 19,
366–374.
171
BIBLIOGRAPHY
[219] Hong, Y., Wu, C., 2012. Day-ahead electricity price forecasting using a
hybrid principal component analysis network. Energies 5 (11), 4711–4725.
[220] Ludwig, N., Feuerriegel, S., & Neumann, D. (2015). Putting Big Data
analytics to work: Feature selection for forecasting electricity prices using the
LASSO and random forests. Journal of Decision Systems, 24(1), 19–36.
https://doi.org/10.1080/12460125.2015.994290.
[221] Ziel, F. (2016). Forecasting Electricity Spot Prices Using Lasso: On
Capturing the Autoregressive Intraday Structure. IEEE Transactions on
Power Systems, 31(6), 4977–4987.
https://doi.org/10.1109/TPWRS.2016.2521545.
[222] Shao, Z., Yang, S., Gao, F., Zhou, K., & Lin, P. (2017). A new electricity
price prediction strategy using mutual information-based SVM-RFE
classification. Renewable and Sustainable Energy Reviews, 70, 330–341.
https://doi.org/10.1016/j.rser.2016.11.155.
[223] Amjady, N., & Daraeepour, A. (2008). Day-ahead electricity price
forecasting using the relief algorithm and neural networks. 2008 5th
International Conference on the European Electricity Market, 1–7.
https://doi.org/10.1109/EEM.2008.4579109.
[224] Yang, W., Wang, J., Niu, T., & Du, P. (2019). A hybrid forecasting system
based on a dual decomposition strategy and multi-objective optimization for
electricity price forecasting. Applied Energy, 235, 1205–1225.
https://doi.org/10.1016/j.apenergy.2018.11.034.
[225] Abedinia, O., Amjady, N., & Zareipour, H. (2017). A New Feature
Selection Technique for Load and Price Forecast of Electrical Power Systems.
IEEE Transactions on Power Systems, 32(1), 62–74.
https://doi.org/10.1109/TPWRS.2016.2556620.
[226] Urro, C., Niciejewska, K. (2018). EPEX SPOT 2017/2018. EEX Group.
https://www.eex.com/blob/83774/5b3b05a55934b0970baba79d2d872deb/
20180628-workshop-4-epex-spot-2017-2018-data.pdf.
[227] Uniejewski, B., Marcjasz, G., & Weron, R. (2019). Understanding intraday
electricity markets: Variable selection and very short-term price forecasting
using LASSO. International Journal of Forecasting, 35(4), 1533–1547.
https://doi.org/10.1016/j.ijforecast.2019.02.001.
172
BIBLIOGRAPHY
[228] Pape, C. (2017). The impact of intraday markets on the market value of
flexibility– decomposing effects on profile and the imbalance costs.
[229] Kulakov, S., & Ziel, F. (2019). The impact of renewable energy forecasts on
intraday electricity prices.
[230] Monteiro, C., Ramirez-Rosado, I., Fernandez-Jimenez, L., & Conde, P.
(2016). Short-Term Price Forecasting Models Based on Artificial Neural
Networks for Intraday Sessions in the Iberian Electricity Market. Energies,
9(9), 721. https://doi.org/10.3390/en9090721.
[231] Andrade, J., Filipe, J., Reis, M., & Bessa, R. (2017). Probabilistic Price
Forecasting for Day-Ahead and Intraday Markets: Beyond the Statistical
Model. Sustainability, 9(11), 1990. https://doi.org/10.3390/su9111990.
[232] Marcjasz, G., Uniejewski, B., & Weron, R. (2020). Beating the
Naïve—Combining LASSO with Naïve Intraday Electricity Price Forecasts.
Energies, 13(7), 1667. https://doi.org/10.3390/en13071667.
Maciejowska, K.; Nitka, W.; Weron, T. Day-ahead vs. Intraday—Forecasting
the price spread to maximize economic benefits. Energies 2019, 12, 631.
[233] N. Satish, N. Sundaram, M. M. A. Patwary, J. Seo, J. Park, M. A. Hassaan,
S. Sengupta, Z. Yin, and P. Dubey, “Navigating the maze of graph analytics
frameworks using massive graph datasets,” in SIGMOD, 2014, pp. 979–990.
[234] Needham, M., & Hodler, A. E. (2019). Graph Algorithms Practical
Examples in Apache Spark and Neo4j. O’Reilly.
[235] Cai, H., Zheng, V. W., & Chang, K. C.-C. (2018). A Comprehensive
Survey of Graph Embedding: Problems, Techniques and Applications.
ArXiv:1709.07604 [Cs]. http://arxiv.org/abs/1709.07604.
[236] Meier, J.-H., Schneider, S., Le, C., & Schmidt, I. (2020). Short-Term
Electricity Price Forecasting: Deep ANN vs GAM. In V. Ermolayev, F.
Mallet, V. Yakovyna, H. C. Mayr, & A. Spivakovsky (Eds.), Information and
Communication Technologies in Education, Research, and Industrial
Applications (Vol. 1175, pp. 257–276). Springer International Publishing.
https://doi.org/10.1007/978-3-030-39459-2_12.
[237] Montgomery, D., Peck, E. & Vining, G. (2012). Introduction to linear
regression analysis. Oxford: Wiley-Blackwell.
173
BIBLIOGRAPHY
174
BIBLIOGRAPHY
175
Appendices
176
APPENDIX A
Part I & II: Figures and tables
Richness of relationships
How much information is stored in the relationships between the elements of the
system?
0: The system components have no concrete nor theoretical influence on each
other.
1: System components a priori impact each other but in a way that it is abstract,
or not yet perceivable.
2: There is a high degree of connection between the system components, essential
to capture to solve the problems.
Model setup How resource-light can the setup of a proof of concept be, in terms
of time, money and competence?
0: A proof of concept cannot easily be created, given the diversity of the
competences and additional tools needed.
1: A proof of concept is possible to build within a short time frame but will need
some level of support from experts.
2: A proof of concept can be quickly set up, without need for additional support.
Computational constraints
How critical is it for the application to yield results quickly?
0: High computational constraints (seconds to minutes), typically online
applications.
1: Medium computational constraint (minutes to hours)
2: Low computational constraint (days)
Risk
How impactful would an error in the graph-based model be in terms of social,
technical or economical aspects?
0: The model’s output would potentially be central in the decision-making
process and small errors would have great consequences with low to no
risk-mitigation possibilities, such as human verification (usually the case with
prescriptive models).
1: Errors in the model would have a moderately to large impact, which can
however be mitigated to some extent.
2: Errors in the model would have relatively low impact.
A.3. EVALUATION TOOL
Sector size
What is the economic size of the sector in the markets within which the company
is operating?
0: Small
1: Medium
2: Big
Sector growth
How fast is the sector from the respective clusters growing in the markets the
company is operating in?
0: Slow
1: Medium
2: Fast
Substitute solutions
Is there a gap in available technology to solve the problem at hand?
0: High maturity and consolidation of industry with established tools with a
proven performance
1: Good amount of available applications, but no dominant design established.
2: Low level of maturity, low availability of commercial applications and in-house
development necessary.
Scalability
To what extent can a pilot be generalised to more applications, either within the
same vertical or in other domains?
0: “One-shot” application, very specific and contained to the cluster.
1: A pilot could be replicated to other applications but would require significant
adaptations.
2: High level of modularity and deployability of graph application within the
cluster or in other clusters with minimal adaptation.
A.3. EVALUATION TOOL
A.3.4 Workability
How well do the clusters’ solutions align with the perceived problems and
available resources?
Relevance
How central is the problem at hand in the sector’s operations, either from an IT
or a mathematical perspective?
0: The potential application would solve a need only of secondary nature for the
business.
1: The potential application is directly in line with the activities within the
sector.
2: A solution to the pain-point is of critical importance and could bring a high
competitive advantage for the company in this sector.
Data alignment
How well aligned is the data with a graph-based solution, in terms of quantity,
quality and availability?
0: Data too sparse or not ready to be extracted for analyses.
1: Smaller datasets potentially available and could be analysed with little
restrictions.
2: Large datasets readily available.
Human alignment
How well aligned is the competences with the potential graph-based application, in
terms of quantity, quality and availability?
0: There is little or no relevant expertise available in this domain in-house and
making it available would be subject to managerial decisions.
1: A certain level of competence potentially available in-house, requiring minor
organisational changes.
2: Competence in this sector is readily available and can potentially support the
establishment of the model.
work.
1: A graph-based application could be plugged in the technological stack.
2: A graph-based application would empower the technological stack of the
company within the use-case cluster or could be implemented as a stand-alone.
APPENDIX B
Part III: Figures and tables
Table B.1: Clustering accuracy (ACC) [%] corresponding to different data sets
and feature selection methods.
B.1. EXPERIMENTS AND RESULTS
Table B.4: Clustering quality measures for feature selection of CELEBI. The first
column shows the indices of the subsets.
B.2. DISCUSSION
B.2 Discussion
Figure B.2: Relative change of the objective value in iterative methods. A mark
ζ on the vertical axis means that the relative change is 10ζ .
B.2. DISCUSSION
(a) ALLAML
(b) BASEHOCK
(c) GLIOMA
(d) LSOLET
(e) LYMPHOMA
(f) NCI9
(g) PIE10P
(h) PROSTATE
Figure B.3: Clustering quality for different α and β, obtained with the features
selected using SCFS. A mark ζ on the axes for α and β corresponds to a value of
10ζ .
APPENDIX C
Glossary
www.kth.se