Professional Documents
Culture Documents
Wang 2023 - Elsevier
Wang 2023 - Elsevier
Safety Science
journal homepage: www.elsevier.com/locate/safety
A R T I C L E I N F O A B S T R A C T
Keywords: With the continuous development and construction of railway transportation, the railway accidents occur
Hazards correlation analysis frequently which greatly threatens the life safety of passengers and the further development of railway industry.
Knowledge graph Discussing and summarizing experience from past accidents is benefit to improve the safety of railway. This
Railway accidents
paper proposes a modeling method for the correlation analysis of hazards in railway accidents based on the
Safety management
knowledge graph theory. By describing the association between accidents and hazards in the knowledge graph
network, the potential law of accident occurrence is revealed. The innovation of this study is that it considers the
correlations between hazards. In addition, the hazards are further refined, and new topology indexes that adapt
to the heterogeneous structure characteristics of knowledge graph are presented. Based on actual railway ac
cident data in the UK, a number of key hazards have been identified using the methods proposed in this paper.
The experimental results show that by controlling key hazards one by one, the harmful consequence caused by
hazards to accidents is also continuously decreasing. Finally, based on the experimental exploration of the
correlation between key hazards, corresponding preventive measures were developed. The method based on
knowledge graph is expected to be applied to explore the relationship between hazards in railway accidents and
provide additional decision-making information for the prevention of railway accidents.
1. Introduction important link in maintaining the safety of national and people’s lives
and property.
Railway transportation is a high-speed, massive, and high-frequency Under the foregoing facts, railway accidents have received extensive
mode of transportation. It involves the safety of numerous personnel, attention from scholars. For example, Evans (2021) and others conduct
goods and materials. If an accident occurs during railway transportation, relevant research on fatal train accidents in European Railways up to
it will not only cause serious casualties and property losses, but also have 2019. There are many hazards causing railway accidents. In order to
a significant impact on social and economic development. In the railway prevent the occurrence of railway accidents, it is necessary to explore
transportation system, train derailment, runaway and collision are and study the key hazard factors in railway accidents. In recent years,
common railway accidents. Especially in terms of traffic accidents, many scholars have analyzed the hazard factors of railway accidents,
major railway traffic accidents have happened in various countries. Chang et al. (2009) examines the causality among driving performance,
Most of the accidents have caused serious consequences such as heavy traffic actors and intersection accidents using path analysis. Stanton and
casualties. For example, on April 28, 2008, the T195 train from Beijing Walker (2011) discuss the psychological factors involved in the Lad
to Qingdao was off-line when it arrived at the Jiaoji railway, and broke Grove rail accident. Based on the ordered probit model and data
collided with the 5034 train from Yantai to Xuzhou, causing 71 deaths analysis, Hao et al. (2016) find that driver, environment and weather
and 416 injuries. On February 22, 2012, in Buenos Aires, Argentina, a characteristics have a strong impact on the severity of injuries in acci
train hit a buffer barrier at the end of the rail and derailed, killing 49 dents at highway-rail grade crossings. Haleem (2016) investigates haz
people and injuring more than 600. Therefore, railway transportation ard factors of traffic casualties at private highway-railroad grade
safety is very necessary, and ensuring railway transportation safety is an crossings in the United States, including temporal crash characteristics,
* Corresponding authors.
E-mail addresses: 11111047@bjtu.edu.cn (X. Yang), cjh@szpt.edu.cn (J. Chen).
https://doi.org/10.1016/j.ssci.2023.106238
Received 20 September 2022; Received in revised form 20 April 2023; Accepted 16 June 2023
Available online 25 June 2023
0925-7535/© 2023 Elsevier Ltd. All rights reserved.
N. Wang et al. Safety Science 166 (2023) 106238
geometry, railroad, traffic, vehicle, and environment. Li et al. (2019) network theory has also been widely used. Klockner and Toft (2018)
integrate theoretic accident model and the process (STAMP) and human propose a safety net method that can be used to understand the
factors analysis and classification System (HFACS), and reveal a number complexity of the relationship between system factors in railway safety
of prominent accident reasons caused by human factors. However, most accidents. Liu et al. (2019) build an iterative causal network model for
railway accidents are not caused by a single factor. The hazards that lead railway operation accidents and develope indicators to analyze the
to railway accidents can be divided into four categories: personnel fac causes of railway accidents. Based on Pearson correlation coefficient,
tors, equipment factors, environment factors and management factors. Hua and Zheng (2019) analyze the correlation between railway cause
Scholars also discuss the connections between the multiple hazards factors, establish a railway accident cause network model, and analyze
factors. Mirabadi and Sharifian (2010) use association rule data mining the overall structure of the network by using the basic topology index of
technology to analyze the data of past accidents in Iran’s railway, the complex network. Lam and Tai (2020) propose a hazard analysis
consider human factors, vehicles, tracks, signal systems and other haz framework for railway accidents, and consider the correlation of railway
ard factors, and identify the potential relations between hazard factors. accident data from three perspectives of local, global and scenario. Li
Evans and Hughes (2019) study the relations between travelers, delays and Wang (2018) propose a hazard monitoring model based on complex
and failures to road users at railway level crossings in the UK. Exploring network to identify and analyze the correlation between accident cause
the relations between different hazards and obtaining valuable knowl factors.
edge from them will contribute to the development of railway safety. However, most of the railway accident networks proposed by the
In the research of security hazard analysis in many fields, network above scholars are one-dimensional complex networks, which can only
models are commonly used to reveal the characteristics and hidden provide limited information. In general, the process of railway accidents
knowledge of complex systems in this field (Wu et al., 2021; Zeng et al., is relatively complex and affected by various factors. In order to explore
2021). At present, the commonly used network models include Bayesian the correlation between hazard factors and obtain more useful infor
network, Complex network and the emerging knowledge graph network mation about railway accidents, the multi-dimensional network model
in recent years. provides a new perspective. In one-dimensional complex networks, de
Bayesian networks are often used to infer uncertain knowledge in gree, clustering coefficient, centrality and other network topology
railway transportation. Logical structure diagram and probability cal analysis indexes are widely used. By comparison, in multi-dimensional
culations are used for reasoning to explore useful information in railway complex networks, the corresponding network topology analysis in
transportation. Wang et al. (2017a, 2017b) build a Bayesian network dexes are relatively few. Although Liu et al. (2021) expand the multi-
fault prediction model with fault causes as model variables to evaluate dimensional indexes for railway accidents, it only contains qualitative
the impact of weather on railway switches. Dindar et al. (2018) use causal relationships and hazard types without considering quantitative
Bayesian network to study the impact of weather on railway turnout causality, hazard occurrence time and other valuable information.
systems and analyze the probability of train derailment. Huang et al. Therefore, in order to further explore the correlation between hazards
(2020) used interpretative structural modeling combined with Bayesian and provide effective information for the prevention of railway acci
networks to quantitatively analyze the relationship between various dents, it is necessary to study and innovate the multi-dimensional
factors in the railway dangerous goods transportation system. However, network of railway accidents and its topological analysis indexes.
in the process of using Bayesian networks to explore railway accident As a new network, the knowledge graph was proposed by Google in
information, most scholars describe the relationship between nodes as a 2012. It is a typical multilateral relationship graph, which is composed
causal relationship of hazards, other types of nodes including time, ac of nodes (entities) and edges (relationships between entities) (Zhang
cident hazard consequences, and so on. and edges that can describe the et al., 2022). Knowledge graph describes related domain knowledge
relationship between them. At the same time, edges that can describe the entities into connected network nodes, which can describe various re
relationships between various types of nodes are also missing. On the lations between knowledge entities. The greatest advantage of this
other hand, although Bayesian network is a directed network, it cannot knowledge graph network compared to complex network is that it can
form the representation of ring graph. However, in the complex system contain various types of nodes and the association relationships between
of real hazard of railway accidents, the factors of hazard can influence various types of nodes. In addition to the causal relationship, explore the
each other and form a directed ring graph. Therefore, Bayesian network relationship between other nodes. At the same time, the knowledge
also has some limitations for the correlation analysis of hazard of graph can break through the limitation of Bayesian network, build a
complex systems. directed ring knowledge graph network, and explore the relationship
Complex network theory is a new theory which has been rising in between hazard in complex systems. At present, knowledge graph has
recent years. It is an abstract concept for a large number of real complex been applied to knowledge modeling in various fields, Ma (2022) sum
systems. It can describe various interactions or relations within complex marizes the construction and application of knowledge graph in Geo
systems, such as a causal relationship (Liu et al., 2018). The model based science. At the same time, it has also been explored by many scholars in
on complex network theory can form a directed ring graph, explore the the field of food science and industry (Min et al., 2022), medical (Gong
correlation between a large number of hazards, and break through the et al., 2021), drug discovery (Bonner et al., 2022) and social networks
limitations of Bayesian network. Therefore, it is widely used in the (Agouti, 2022), etc. In recent years, scholars have also made many re
research of exploring the relations between knowledge. It provides a searches in the field of transportation using knowledge graph theory. In
new way to understand the complex interactions between railway haz the field of urban transportation, Tan et al. (2021) explore the relations
ards. In terms of subway safety research, Li et al. (2017) establish a between transportation entities by using the knowledge graph theory.
subway hazard network using the complex network theory and discuss Tian et al. (2022) propose a tracking method for people exposed to
the correlation between hazards. In the subway construction safety epidemic situation in public transport based on knowledge graph theory.
research, in order to analyze and understand the complexity of the ac Liu et al. (2021) propose a method to build railway operational accident
cident network, Zhou et al. (2014) establish a complex network model of knowledge graph model by using the theory of knowledge graph, and
subway construction accidents, which finally proves that it can control formulate new heterogeneous topology indexes to analyze the topology
the original accident and prevent the occurrence of secondary accidents characteristics of the network. However, it overlooks the correlation
and derivative accidents. Zhou et al. (2019) combin the complex strength between accidents and hazards as well as the fact that the
network with the association rule mining algorithm, and propose an number of hazards is small, which can’t fully reveal the correlation
improved Apriori algorithm to explore the type of anomaly monitoring, between accident hazards.
revealing the association rules between the security hazard monitoring This study proposes a modeling method of railway accidents based
type and the hazard coupling. In the aspect of railway security, complex on knowledge graph, and analyzes the correlation between hazards, so
2
N. Wang et al. Safety Science 166 (2023) 106238
Table 1
Description of railway accident.
Railway Description of Accident occurrence All the hazards and accidents Relations Casualties
accident time
report
R072011 At 00:25 hrs on Tuesday 4 May 2010 five Night Management hazard:There are defects in There are defects in the inspection, No casualties
wagons ran away and two of them the inspection, maintenance and maintenance and management of all
derailed close to Ashburys station in management of all components of the components of the train → Brake system
Manchester. The runaway was caused by train;Equipment hazard:Brake system problem → Runaway accident →
ineffective handbrakes on the wagons. problem;Types of accidents:Runaway Derailment accident
The investigation found deficiencies in accident;Derailment accident.
the maintenance plan for the wagons.
Nobody was injured in the derailment.
R142013 Around 07:06 on 28 June 2012, when Daytime Management hazard:Insufficient Insufficient preparation plans for severe No casualties
the train was moving along a straight preparation plans for severe weather weather conditions such as floods and
track, it hit a washed out embankment conditions such as floods and storms; storms → The flood caused by heavy
near Cckmore, Northern Ireland. The Environmental hazards:The flood caused rain → Embankments eroded by the
investigation found that on June by heavy rain;Embankments eroded by the current → Collision accident
27,2012, heavy rainfall in the catchment current;Types of accident:Collision
area of the nearby brokerage River, and accident.
the culvert system on the river could not
cope with the flow from heavy rainfall,
causing water to flow back behind the
railway embankment and causing local
flooding. Weather preparation
procedures do not include planning for
flood or rainstorm events and fail to
respond appropriately to rainfall events.
There were no injuries.
R062018 At around 18:05 on July 28, 2017, a Daytime Management hazard:Insufficient Insufficient inspection and maintenance Three people
train heading north entered Abergavini inspection and maintenance of cables and of cables and wires→Cable and wire were slightly
Station, and a power cable hanging from wires;Equipment hazard:Cable and wire fracture defect fault;→Electrical injured
the station pedestrian bridge was stuck fracture defect fault;Types of accident: accidents → Collision accidents.
on the roof. The train dragged the cable Electrical accidents;Collision accidents.
out of the remaining fixtures until one
end of the cable fell off the connected
distribution cabinet. The free end of the
cable quickly spun and collided with a
group of passengers on the pedestrian
bridge stairs, three of whom suffered
minor injuries. The investigation found
that the cable did not undergo regular
inspections in accordance with the
electrical installation requirements.
Table 2
Mortality and weighted injury quantification.
Injury degree Weighting
Table 3
Railway accidents and average consequences.
Accident Description of accident Average consequence (DWI/Year)
3
N. Wang et al. Safety Science 166 (2023) 106238
4
N. Wang et al. Safety Science 166 (2023) 106238
The third matrix is the result matrix (RM), and RMij is defined by Eq.
(3). The RMij is determined by the knowledge triples including the
keyword’Value − Is’. In the knowledge triples, i represents an accident, j
represents the consequence of accident, δ represents a specific value, and
KTs represents all knowledge triples identified in step 2. RM can map the
knowledge triples describing the result value into the graph, in other
words, if the value of RMij > 0, there is an edge marked’Value − Isδ’ from
entity i to entity j.
{
δ < i, Value − Isδ, j >∈ KTs
RM ij = (3)
0 < i, Value − Isδ, j >∕
∈ KTs
The fourth matrix is the time matrix (TIM), which is a matrix defined
by Eq. (4). The TIMij is determined by the knowledge triples including
the keyword ‘Happened − When’. In the knowledge triples, i represents a
hazard, j represents the occurrence time of hazard, δ represents a spe
cific value, and KTs represents all knowledge triples identified in step 2.
Fig. 3. Causal strength matrix.
This means that through TIM, the knowledge triples describing the time
relationship can be mapped into a graph. Because there is a one-way
causal relationship between knowledge entities, the edge of the corre
lation between hazards and time is described as a directed edge in this
study. The higher the frequency of the correlation is, the higher the
5
N. Wang et al. Safety Science 166 (2023) 106238
intensity of the correlation is. Therefore, weighted edges are used to weight relationship of causality is not be considered here, and the cau
describe the intensity of the correlation between hazards and time. That sality is described as RWCSM matrix.
is, when TIMij = δ, there are δ time relationship edges from knowledge {
1CSM ij > 0
entity i to knowledge entity j. RWCSM ij = (5)
0CSM ij = 0
{
δ < i, Happened − Whenδ, j >∈ KTs
TIM ij = (4) The shortest path matrix (SPM) is constructed based on the RWCSM,
0 < i, Happened − Whenδ, j >∕∈ KTs
and is defined as SPMij by Eq. (6). Where SPMij represents the shortest
Based on the first matrix, the remove weighted causal matrix path length from hazard i to hazard j or accident j and the shortest path
(RWCSM) is constructed, and RWCSMij is defined by Eq. (5). In order to length from accident i to accident j, u and v represent two entities, N
facilitate the subsequent construction of the shortest path matrix, the represents all entities on the shortest path. SPM can capture the length
6
N. Wang et al. Safety Science 166 (2023) 106238
7
N. Wang et al. Safety Science 166 (2023) 106238
accessibility between types of hazards and time DtiaTIW indicates the de
including direct harmful consequence of hazard and intermediate
gree of direct correlation between W-type hazards and TI time, which harmful consequence of hazard. Direct harmful consequence reveals the
can be calculated by Eq. (14). Similarly, the indirect accessibility be severity of harmful consequence of hazards from the perspective of each
tween types of hazards and time is intended to measure the strength of hazard, and intermediate harmful consequence reveals the severity of
the indirect correlation between different types of hazards and different harmful consequence of hazards from the whole knowledge graph
times. That is, the indirect accessibility between types of hazards and network model. These two indicators can provide decision-making basis
time ItiaTI for investment in accident prevention.
W indicates the degree of indirect correlation between W-type
hazards and TI time, which can be calculated by Eq. (15). The direct harmful consequence of hazards is intended to measure
∑ the severity of injury to personnel when a specific hazard occurs in TI
DtiaTI
W = (RWCSM ij × TYM iW × TIM TI
j ) (14) time. The direct harmful consequence of generated by hazard h in TI
i,j∈HN
time can be calculated by Eq. (16). TIM matrix records the intensity of
∑ hazard occurrence time, that is, the number of hazards occurring in TI
ItiaTI (CAM ij × TYM iW × TIM TI (15) time in N years, so TIMTIh /N represents the likelihood of occurrence of
W = j )
i,j∈HN
the hazard h in TI time; CAMhAi is an entity of the causal reachability
Harmful consequence indexes of hazards measure the severity of the matrix. RMAiCon is an entity of the result matrix, representing the average
hazards caused by a given hazard to personnel in railway accidents, consequence of accident Ai.
8
N. Wang et al. Safety Science 166 (2023) 106238
9
N. Wang et al. Safety Science 166 (2023) 106238
10
N. Wang et al. Safety Science 166 (2023) 106238
3.2. HCAKG construction this model, the accident consequences are measured by the severity of
casualties, which is quantified by the death and weighted injury (DWI)
First of all, railway accident, railway accident consequences, haz shown in Table 2 (RSSB. 2014.).
ards, types of hazards, hazard occurrence time are identified as knowl If only accidents numbered R072011、R142013 and R062018 have
edge entities from the 176 railway accident reports collected above. In occurred within 3 years, take the accident reports R072011、R142013
11
N. Wang et al. Safety Science 166 (2023) 106238
and R062018 in Table 1 as examples to calculate the harm consequences The links between knowledge entities are described by knowledge
of the accident. According to Tables 1 and Table 2, it can be seen that in triples with keywords ‘Result − In’, ‘Type − Is’, ‘Value − Is’ and
the accident reports numbered R072011、R142013 and R062018, there ‘Happened − When’.
are no casualties in R072011 and R142013, so the degree of the damage The railway accidents numbered R072011, R142013 and R062018
is not estimated, and the Weighting is 0. But in R062018, there are three are used as examples to identify knowledge entities and the Knowledge
people were slightly injured, so the Weighting is 0.04 according to triples, the results shown in Table 5 below.
Table 2. In these there reports, the average consequence of collision Taking the knowledge triples identified in Table 5 as an example,
accidents in 3 years was 0 + 0.04 / 3 = 0.0133 DWI/Year, the average Neo4j is used to build the knowledge graph shown in Fig. 2 can be
consequence of Electrical accidents in 3 years was 0.04 / 3 = 0.0133 constructed.
DWI/Year. The average consequence of derailment accidents and The required matrix in the model constructed from the knowledge
runaway accidents within three years are both 0 / 3 = 0 DWI / Year. graph in Fig. 2. Where the CSM matrix is shown in Fig. 3, the TYM
According to the above calculation method, the average consequences of matrix, the RM matrix, and the TIM matrix are shown in Fig. 4 (a), 4 (b),
different accident types during 10 years can be calculated, as shown in and 4 (c). And the RWCSM matrix and the CAM matrix are shown in
Table 3. Fig. 5 (a) and 5 (b), respectively. The subsequent index calculation can
The railway accidents numbered A01 ~ A08 are identified as acci be performed according to the constructed matrix.
dent knowledge entities that are shown in Table 3. And their conse Through the data processing method of the above three accident
quence is identified as accident consequence knowledge entities reports, other accident reports in the past 10 years can be analyzed and
numbered Con. The 89 hazards numbered H01 ~ H34, EI01 ~ EI21, E01 all knowledge triplets can be obtained. By recording and analyzing other
~ E15 and M01 ~ M19 and the four types(H-Type、EI-Type、E-Type and accident reports in the past 10 years, all of the knowledge triples can be
M-Type) of the 89 hazards are identified that are shown in Table 4. Due obtained. Due to space limitations, only some knowledge triples are
to space limitations, see the appendix for details. Meanwhile, in this listed here that are shown in Table 6.
study, Daytime and Night are identified as time knowledge entities. Based on the identified all knowledge entities and knowledge triples,
12
N. Wang et al. Safety Science 166 (2023) 106238
13
N. Wang et al. Safety Science 166 (2023) 106238
time period, which requires special attention. H33 is 0 in daytime, indicating that they cannot be directly affected by
Fig. 7(d) shows the active causal correlation intimacy and passive other hazards in daytime.
causal correlation intimacy of M-Type hazards in daytime and night. On The active type distribution proportion and the passive type distri
the whole, the passive causal correlation intimacy of hazards is generally bution proportion of EI-Type hazards in daytime and night are shown in
lower than their active causal correlation intimacy. The passive causal Fig. 9 respectively. Through analysis, it can be found that in daytime and
correlation intimacy of most hazards is 0, indicating that it cannot be night, most EI-Type hazards are more likely to lead to EI-Type hazards.
caused by other hazards, while only M04 and M05 in daytime and M03, In daytime and night, EI-Type hazards are mainly caused by M-Type and
M04 and M05 in night have a passive causal correlation intimacy greater EI-Type hazards.
than 0, indicating that it can be caused by other hazards in this time The active type distribution proportion and the passive type distri
period. Among them, M04, M07 and M12 in daytime have a high degree bution proportion of E-Type hazards in daytime and night are shown in
of active causal correlation, indicating that these hazards are more likely Fig. 10 respectively. Through analysis, it can be found that in daytime
to lead to other hazards in this period of time, which requires special and night, E-Type hazards are more likely to lead to E-Type and EI-Type
attention. hazards, indicating that the occurrence of general natural disasters will
Obtain the correlation between different hazards in different time lead to the occurrence of other secondary natural disasters, as well as
periods, which is conducive to develop prevention strategies for hazards. mechanical equipment and other failures. In daytime and night, most of
the E-Type hazards are caused by M-Type hazards and E-Type hazards,
3.3.2. Type distribution proportion of hazards which also indicates that improper management will also lead to the
The active type distribution proportion of hazards and the passive occurrence of environmental hazards.
type distribution proportion of hazards can be calculated by. (10) and The active type distribution proportion and the passive type distri
Eq. (11). bution proportion of M-Type hazards in daytime and night are shown in
The active type distribution proportion and the passive hazards type Fig. 11 respectively. Through analysis, it can be found that all M-Type
distribution proportion of H-Type hazards in daytime and night are hazards can lead to the occurrence of other types of hazards in daytime
shown in Fig. 8 respectively. Through analysis, it can be found that in and night, among which H-Type hazards and EI-Type hazards account
daytime and night, most H-Type hazards can cause H-Type hazards, a for a large proportion. In daytime and night, only M03, M04 and M05
small number of hazards can cause the generation of other types of are M-Type hazards caused by other types of hazards, and most of them
hazards, and a small number of hazards such as H07, H10, H11, H28, are M-Type hazards, with a small number of H-Type hazards.
H30, whose active type distribution proportion is 0, indicating that they Exploring the type distribution proportion index is conducive to the
can’t cause the generation of other hazard types, which will directly lead subsequent development of targeted prevention strategies for each
to accidents. In daytime and night, most H-Type hazards are caused by hazard and the prevention of railway accidents.
H-type and M-Type, indicating that there is a strong correlation not only
between H-type hazards and M-type hazards, but also between H-type 3.3.3. Accessibility between different types of hazards
hazards. The passive type distribution proportion of H09, H15, H16 and The direct accessibility between different types of hazards and the
14
N. Wang et al. Safety Science 166 (2023) 106238
indirect accessibility between different types of hazards can be calcu a local perspective. The indirect accessibility between H-Type hazards
lated by Eqs. (12) and (13). Fig. 12 shows the statistical results of direct and H-Type hazards, M-Type hazards and H-Type hazards, M-Type
type accessibility index and indirect type accessibility index between hazards and EI-Type hazards are all high, indicating that from a global
different types of hazards in daytime. Fig. 12 shows that during the daily perspective, the relationships between these hazards are relatively close.
time period, the direct accessibility between H-Type hazards and H-Type Fig. 13 shows the statistical results of direct accessibility between
hazards, EI-Type hazards and EI-Type hazards, M-Type hazards and H- different types of hazards and indirect accessibility between different
Type hazards, M-Type hazards and EI-Type hazards are all high, indi types of hazards in night. As seen in Fig. 13, in the night time period, the
cating that the relations between these hazards are relatively close from indirect accessibility between H-Type hazards and H-Type hazards, M-
15
N. Wang et al. Safety Science 166 (2023) 106238
16
N. Wang et al. Safety Science 166 (2023) 106238
17
N. Wang et al. Safety Science 166 (2023) 106238
18
N. Wang et al. Safety Science 166 (2023) 106238
each hazard can bring great harm to the participants of railway acci as an example are choosed to introduce the prevention strategies for
dents. However, in general, direct harmful consequences of H-Type each hazard. H11, EI07, E08 and M04 have heavy intermediate harmful
hazards and M-Type hazards are generally larger than those of other consequences in the two time periods of daytime and night. Therefore,
types of hazards in daytime. At night, the direct harmful consequences of these four hazards are selected for the formulation of specific prevention
some M-Type hazards are larger than other types of hazards. In daytime strategies, as shown below.
and night, the intermediate direct harmful consequences of H-Type H07 (Train driver accelerates, overspeed or fails to decelerate in
hazards are generally larger than those of other types of hazards. time): It can be seen from Fig. 7 (a) that in daytime and night, the active
Because some hazards play an intermediate role in the causal path of causal correlation intimacy of H07 is 0, indicating that it is caused by
accidents, if the hazards with high intermediate harmful consequences other hazards, cannot lead to other hazards, and will directly lead to
are eliminated or controlled, the harmful consequences of a large accidents. The passive causal correlation intimacy of H07 in daytime is
number of hazards will be reduced, so as to prevent and control the greater than that of H07 in night, indicating that H07 in daytime is more
injury of accidents. likely to be caused by other hazards. In this knowledge graph model, the
hazards that directly lead to H07 are mainly H01, H03, H04, H06, EI10,
4. Discussion E10 and M03. Therefore, some targeted measures can be formulated for
these hazards, “Improving the job requirements of train drivers,
The results of the case study show that the method based on ensuring that they have sufficient ability and experience to work again”,
knowledge map is useful for exploring railway accidents. The proposed for example. More measures can be seen in Table 5. At the same time,
topological index can adapt to the multidimensional structural charac when carrying out daily safety hazard protection, management depart
teristics of railway accidents, and provide useful information for un ment can choose to appropriately increase the daytime inspection
derstanding the potential laws of railway accidents and the correlation scheduling plan. Eliminate H07 by blocking the occurrence of these
between hazards. hazards in time, so as to block the relationship with other hazards and
prevent accidents.
4.1. Comparison of harmful consequences related to hazards EI07 (Vehicle with broken train components): as can be seen from
Fig. 7 (b), the active causal correlation intimacy of EI07 is 0 in daytime
According to the intermediate harmful consequence index of haz and night, indicating that it is caused by other hazards, cannot lead to
ards, the hazards with high intermediate harmful consequence index can other hazards, and will directly lead to accidents. The passive causal
be identified. The hazards with higher intermediate harmful conse correlation intimacy of EI07 in daytime and night are not 0 and equal,
quence are controlled one by one in the two time periods of daytime and indicating that EI07 can be caused by other hazards with the same in
night, and it is found that the direct harmful consequence of the whole tensity. In this knowledge graph model, the hazards that directly lead to
accident is also gradually reducing, as shown in Figs. 18 and 19. Among EI07 are EI04, EI12, EI15 and E01. Therefore, some targeted measures
them, the harmful consequences in daytime are relatively heavy. By can be formulated for these hazards, such as “Replacing faulty wheels in
eliminating the hazard, we can see that the degree of influence on its time, checking and ensuring the normal use status of wheels on time”,
control is also large. Therefore, from the perspective of the whole more measures can be seen in Table 7. At the same time, during the daily
knowledge graph, eliminating the hazards with higher intermediate protection of safety hazards, the daytime and night shall be arranged for
harmful consequence can greatly reduce the direct harmful consequence undifferentiated inspection, so as to ensure that the inspection and
in accidents. scheduling plans during the day and night are generally consistent. EI07
As is shown in the Fig. 18 and Figs. 19, 17 hazards were removed in can be eliminated by blocking the occurrence of these hazards in time, so
turn, accounting for 19.1% of all hazards. In the daytime, by removing as to block the relationship with other hazards and prevent accidents.
19.1% of the hazards, 21.9% of the direct harmful consequences of all E08 (Blocking lines of instruments and equipment): as can be seen
hazards and 57.5% of the intermediate harmful consequences of all from Fig. 7 (c), the active causal correlation intimacy of E08 is 0 in
hazards can be reduced. In the night, 20% of the direct harmful conse daytime and night, indicating that it is caused by other hazards, which
quences of all hazards and 56.9% of the intermediate harmful conse cannot lead to other hazards, and will directly lead to accidents. In both
quences of all hazards can be reduced by removing 19.1% of all hazards. daytime and night, the passive causal correlation intimacy of E08 is not
It can be found that the proposed indexes can effectively reduce the 0, and the passive causal correlation intimacy of E08 in daytime is high,
direct harmful consequences of all hazards and intermediate harmful indicating that it is easier to be caused by other hazards in daytime. In
consequences of all hazards, and the impact on intermediate harmful this knowledge graph model, the hazards that directly lead to E08 are
consequences of all hazards is greater. H34, EI17, E04 and M12. Therefore, some targeted measures can be
formulated for these hazards, for example, “Strengthening the man
4.2. Developing preventive measures agement training for the operators of mechanical equipment to ensure
the operation specification and accuracy”, more measures can be seen in
The third step is to consider the influence of different time factors Table 5. At the same time, when carrying out daily safety hazard pro
according to the correlation between the key hazard and other hazards. tection, management department can choose to appropriately increase
Through the topological analysis indexes, the correlation between the daytime inspection scheduling plan. Eliminate E08 by blocking the
hazards can be got. According to these relations, relevant strategies can occurrence of these hazards in time, so as to block the relationship with
be formulated to provide reference for the prevention of railway acci other hazards and prevent accidents.
dents. The prevention strategy shall be formulated in accordance with M04 (Not implemented or unsuitable safety work system imple
the following three steps, as shown in Fig. 20. The first step is to identify mented): as can be seen from Fig. 7 (d), M04 has a high degree of active
the key hazards according to the intermediate harmful consequence causal correlation intimacy in daytime and a low degree of active causal
index of the hazards in different time. The second step is to analyze the correlation intimacy in night, indicating that it is more likely to lead to
relationship between key hazards and other hazards by using casual the occurrence of other hazards in daytime, which requires special
correlation intimacy. The third step is to analyze other hazards that attention. The low degree of passive causal correlation intimacy in both
cause the key hazards, and formulate specific prevention strategies daytime and night indicates that it is not easy to lead to other hazards in
considering the influence of different time. this time period. In this knowledge graph model, the hazards that lead to
Because there are a large number of intermediate hazards with more M04 are H16, H22, H32, M16, M19. Therefore, some targeted measures
heavy intermediate harmful consequences identified in the two time can be formulated for these hazards, for instance “Reasonably planning
periods of daytime and night, four hazards from various types of hazards the work arrangement of railway staff, ensuring moderate workload and
19
N. Wang et al. Safety Science 166 (2023) 106238
prohibiting long-term fatigue operation”, more measures can be seen in Declaration of Competing Interest
Table 7. At the same time, during the daily protection of safety hazards,
the daytime inspection scheduling should be appropriately increased. The authors declare that they have no known competing financial
M04 can be eliminated by timely blocking the occurrence of these interests or personal relationships that could have appeared to influence
hazards, so as to block the relationship with other hazards and prevent the work reported in this paper.
accidents.
It is noteworthy that most of the identified hazards with more heavy Acknowledgements
intermediate harmful consequences are H-type hazards. That is,
personnel factors greatly affect the occurrence and severity of accidents. This work is supported by the National Natural Science Foundation
By identifying the relationship between these hazards and other hazards of China (Nos. 72071015, 72288101, 71890972/71890970), and the
and taking specific prevention and management measures, the spread 111 Project (No. B20071).
and impact of hazards can be greatly controlled. Based on the above
analysis, corresponding preventive measures and safety inspection Appendix A
schemes have been determined and formulated as shown in Table 7
below. See Table A1
References
4.3. Further application prospects
Agouti, T., 2022. Graph-based modeling using association rule mining to detect
The correlation analysis method based on knowledge graph has high influential users in social networks. Expert Syst. Appl. 202, 117436.
Bonner, S., Barrett, I.P., Ye, C., Swiers, R., Engkvist, O., Hoyt, C.T., Hamilton, W.L., 2022.
application value in the field of safety. This paper discusses the appli Understanding the Performance of Knowledge Graph Embeddings in Drug Discovery.
cation of this method in the correlation analysis of railway accident Artif. Intell. Life Sci. 2, 100036.
hazrad. The correlation between various hazards in railway accidents is Chang, S., Lin, C., Hsu, C., Fung, C., Hwang, J., 2009. The effect of a collision warning
system on the driving performance of young drivers at intersections. Transport. Res.
obtained. The key hazards of railway accidents have been explored. F: Traffic Psychol. Behav. 12, 371–380.
Preventing and controlling key hazards is conducive to the prevention of Dindar, S., Kaewunruen, S., An, M., Sussman, J.M., 2018. Bayesian network-based
railway accidents. The method proposed in this paper is not only limited probability analysis of train derailments caused by various extreme weather patterns
on railway turnouts. Saf. Sci. 110, 20–30.
to the field of railway safety, but also can be applied in other safety fields
Evans, A.W., 2021. Fatal train accidents on Europe’s railways: An update to 2019. Accid.
such as maritime safety, highway safety, and subway safety. Anal. Prev. 158, 106182.
The real benefits of this method for the development of the safety Evans, A.W., Hughes, P., 2019. Traverses, delays and fatalities at railway level crossings
field is that it can identify the knowledge entities and the relations be in Great Britain. Accid. Anal. Prev. 129, 66–75.
Gong, F., Wang, M., Wang, H., Wang, S., Liu, M., 2021. SMR: Medical Knowledge Graph
tween the knowledge entities in the corresponding fields based on the Embedding for Safe Medicine Recommendation. Big Data Res. 23, 100174.
accident and disaster data in different fields. Thereby establishing a Haleem, K., 2016. Investigating risk factors of traffic casualties at private highway-
knowledge graph model that conforms to a specific accident. By railroad grade crossings in the United States. Accid. Anal. Prev. 95, 274–283.
Hao, W., Kamga, C., Yang, X., Ma, J., Thorson, E., Zhong, M., Wu, C., 2016. Driver injury
analyzing the constructed model using topological analysis indexes that severity study for truck involved accidents at highway-rail grade crossings in the
conform to the characteristics of the model, it can explore the relations United States. Transport. Res. F: Traffic Psychol. Behav. 43, 379–386.
between hazards in different fields. Obtaining key hazards and elabo Hua, L., Zheng, W., 2019. Research on causation of railway accidents based on complex
network theory. Chin. J. Saf. Sci. 29, 114–119.
rately preventing and controlling them is conducive to the development Huang, W., Zhang, Y., Kou, X., Yin, D., Li, L., 2020. Railway dangerous goods
of the security field. transportation system risk analysis: an interpretive structural modeling and bayesian
network combining approach. Reliab. Eng. Syst. Saf. 204, 107220.
Klockner, K., Toft, Y., 2018. Railway accidents and incidents: Complex socio-technical
5. Conclusion system accident modelling comes of age. Saf. Sci. 110, 59–66.
Lam, C.Y., Tai, K., 2020. Network topological approach to modeling accident causations
In this paper, a new correlation analysis model of railway accident and characteristics: analysis of railway incidents in Japan. Reliab. Eng. Syst. Saf.
193, 106626.
hazards based on knowledge graph is proposed. In order to find the Li, Q., Song, L., List, G.F., Deng, Y., Zhou, Z., Liu, P., 2017. A new approach to
deeper correlation between hazards and prevent railway accidents, new understand metro operation safety by exploring metro operation hazard network
topological analysis indexes are formulated. This method has been (MOHN). Saf. Sci. 93, 50–61.
Li, C., Tang, T., Chatzimichailidou, M.M., Jun, G.T., Waterson, P., 2019. A hybrid human
applied to the actual railway accidents in the UK. The results show that and organisational analysis method for railway accidents based on STAMP-HFACS
this method can effectively find out the relations between accidents and and human information processing. Appl. Ergon. 79, 122–142.
hazards. Through further analysis of the revealed relations, it provides Li, K., Wang, S., 2018. A network accident causation model for monitoring railway
safety. Saf. Sci. 109, 398–402.
an effective decision-making basis for the prevention of railway acci Liu, J., Li, K., Zheng, W., Zhu, J., 2018. An importance order analysis method for causes
dents. The method based on knowledge graph is expected to be applied of railway signaling system hazards based on complex networks. Risk Reliab. 233
to explore other types of railway accidents and provide additional (4), 567–579.
Liu, J., Schmid, F., Zheng, W., Zhu, J., 2019. Understanding railway operational
decision-making information for railway accident prevention.
accidents using network theory. Reliab. Eng. Syst. Saf. 189, 218–231.
However, this research still has some inadequate, requires further Liu, J., Schmid, F., Zheng, W., 2021. A knowledge graph-based approach for exploring
study and improvement in the future work. More abundant data can be railway operational accidents. Reliab. Eng. Syst. Saf. 207, 107352.
Ma, X., 2022. Knowledge graph construction and application in geosciences: A review.
considered and more considerations can be added in model building.
Comput. Geosci. 161, 105082.
Moreover, more knowledge entities and their relations should be Min, W., Liu, C., Xu, L., Jiang, S., 2022. Applications of knowledge graphs for food
included and a knowledge model with more information should be science and industry. Patterns (N Y). 3 (5), 100484.
established so as to further explore the relations between hazards. Mirabadi, A., Sharifian, S., 2010. Application of association rules in Iranian Railways
(RAI) accident data analysis. Saf. Sci. 48 (10), 1427–1435.
Rssb, 2014. Guidance on hazard identification and classification (GE/GN8642). Rail
CRediT authorship contribution statement Safety And Standards Board, London.
Stanton, N.A., Walker, G.H., 2011. Exploring the psychological factors involved in the
Ladbroke Grove rail accident. Accid. Anal. Prev. 43 (3), 1117–1127.
Ning Wang: Writing – original draft, Methodology, Data curation. Tan, J., Qiu, Q., Guo, W., Li, T., 2021. Research on the Construction of a Knowledge
Xin Yang: Writing – original draft, Methodology, Conceptualization. Graph and Knowledge Reasoning Model in the Field of Urban Traffic. Sustainability.
13 (6), 3191.
Jianhua Chen: Writing – review & editing, Methodology. Hongwei
Tian, C., Zhang, Y., Qian, X., Li, J., 2022. A knowledge graph-based method for epidemic
Wang: Writing – review & editing, Conceptualization. Jianjun Wu: contact tracing in public transportation. Transp. Res. C 137, 103587.
Writing – review & editing, Methodology.
20
N. Wang et al. Safety Science 166 (2023) 106238
Wang, Q., Mao, Z., Wang, B., Guo, L., 2017a. Knowledge Graph Embedding: A Survey of Zhang, J., Zhang, X., Wu, C., Zhao, Z., 2022. Survey of knowledge graph construction
Approaches and Applications. IEEE Trans. Knowl. Data Eng. 29 (12), 2724–2743. techniques. Comput. Eng. 48 (3), 23–37.
Wang, G., Xu, T., Tang, T., Yuan, T., Wang, H., 2017b. A bayesian network model for Zhou, Z., Irizarry, J., Li, Q., 2014. Using network theory to explore the complexity of
prediction of weather-related failures in railway turnout systems. Expert Syst. Appl. subway construction accident network (SCAN) for promoting safety management.
69, 247–256. Saf. Sci. 64, 127–136.
Wu, J., Li, D., Si, S., Gao, Z., 2021. Special issue: Reliability management of complex Zhou, Y., Li, C., Ding, L., Sekula, P., Love, P., Zhou, C., 2019. Combining Association
system. Front. Eng. Manag. 8 (4), 477–479. Rules Mining with Complex Networks to Monitor Coupled Risks. Reliab. Eng. Syst.
Zeng, G., Sun, Z., Liu, S., Chen, X., Li, D., Wu, J., Gao, Z., 2021. Percolation-based health Saf. 186, 194–208.
management of complex traffic systems. Front. Eng. Manag. 8 (4), 557–571.
21