Professional Documents
Culture Documents
article info a b s t r a c t
Article history: With the increasing usage of information technology on the criminal side, the digital forensic analysis,
Received 6 May 2020 especially multimedia forensics, becomes an emerging technique for cybercrime investigators to
Received in revised form 12 March 2021 improve examination efficiency. The study focuses on the digital triage problem for evidence location
Accepted 17 March 2021
during the automatic forensic process. After defining the multi-scale knowledge base for storing
Available online 21 April 2021
digital forensic investigators’ prior knowledge, a variable scale case-based reasoning method (VSCBR)
Keywords: is proposed to support investigators predicting evidential areas. The variable-scale clustering algorithm
Digital forensics based on the scale transformation strategy (VSC-STS) is also put forward, which could identify highly
Scale transformation similar past cases containing candidate evidence in the case reuse and revise phase. A case study is
Variable-scale clustering established using a real 15.9 GB bidding case dataset, which contains both text bidding documents and
Case-based reasoning image technical drawings. Numerical experimental results show that the validation of the proposed
VSC-STS is significantly improved compared with the traditional single-scale clustering algorithm, and
it is insensitive to the initial parameter threshold. Moreover, the proposed method VSCBR is able to
help investigators locate suspicious rule-violating evidences in practice.
© 2021 Elsevier B.V. All rights reserved.
https://doi.org/10.1016/j.future.2021.03.019
0167-739X/© 2021 Elsevier B.V. All rights reserved.
A. Wang and X. Gao Future Generation Computer Systems 122 (2021) 209–219
Table 1
Research summary of the scale transformation theory.
# Research Problem [31] [8] [9] [32] [33] [34] [35] The proposed method
√ √ √ √
1 Concept space model of categorical variables √ √ √
2 Concept space model of binary variables √ √
3 Concept space model of numerical variables √ √ √ √ √ √ √
4 Scale depth transformation mode √ √ √
5 Scale breadth transformation mode √ √ √ √ √ √ √
6 Single scale up transformation direction √ √
7 Single scale down transformation direction √
8 Scale transformation feedback mechanism √
9 Optimistic scale transformation strategy √
10 Pessimistic scale transformation strategy
with six cases U = {xk |k ∈ [1, 6]}, two attributes A = A , A S all attributes under all scale transformation directions is also
defined. Then, according to the scale transform space, calculate
with three observation scales respectively (that is A10 ⪯ A11 ⪯ A12 ,
the scale transform rate of each attribute. As for the multi-scale
A20 ⪯ A21 ⪯ A22 ), and six solutions d = {dk |k ∈ [1, 6]}.
data analysis adopting the OSTS, attribute with the maximum
Taking the basic single-scale case dataset {(that all } the at- scale transformation value is selected as the target attribute of
tributes stand at the lowest hierarchy) D0 = (U , A10 , A20 , d) as an each scale transformation. Finally, apply the new observation
example, it can be seen that U /A10 = {{x1 } , {x2 } , {x3 } , {x4 } , {x5 } , scale to represent the target attribute and update dataset.
{x6 }}, U /A11 = {{x1 , x2 , x3 } , {x4 } , {x5 , x6 }}, U /A12 = {{x1 , x2 , x3 , Strategy 2. (Pessimistic scale transformation strategy, PSTS)
x4 } , {x5 , x6 }}. During once scale transformation, always being prior to selecting
If attribute A1 is implemented a one-step scale up transforma- attribute with the minimum scale transformation value (attribute
tion ST (A1 , 1), the effect of ST (A1 , 1) is STR(A10 , A11 ) = |U4| = 0.17;
|{x }|
with the largest scale transformation rate).
1
If attribute A is implemented a two-step scale up transformation Fig. 3 also shows the scale transformation process under the
ST (A1 , 2), the effect of ST (A1 , 2) is STR(A10 , A12 ) = |U | = 0. PSTS. Similarly , in the beginning, establish the scale transfor-
|∅|
Similarly, taking the highest single-scale case dataset{ (that all mation space of the initial single-scale case dataset through the
the attributes stand at the highest hierarchy) D2 = (U , A12 , A22 ,
} multi-scale knowledge base, and the candidate observation scale
combination of all attributes under all scale transformation di-
d) as an example, if attribute A2 is implemented a one-step
rections is also defined. Then, according to the scale transform
scale down transformation ST (A2 , 1), the effect of ST (A2 , 1) is
space, calculate the scale transform rate of each attribute. While
STR(A22 , A21 )
for the multi-scale data analysis adopting the PSTS, attribute
= |{x5|U,x|6 }| = 0.33; If attribute A2 is implemented a two-step with the minimum scale transformation value is selected as the
scale down transformation ST (A2 , 2), the effect of ST (A2 , 2) is target attribute of the scale transformation. Finally, apply the new
|{x ,x }|
STR(A22 , A20 ) = 5|U |6 = 0.33. observation scale to represent the target attribute and update
From the above numerical examples, it can be seen that both dataset.
the scale up and down transformation rate STR ∈ [0, 1]. The In conclusion, the scale transformation performance of multi-
smaller STR represents the more significant changes of case par- scale knowledge base could be heavily affected by several factors,
titions during once scale transformation; On the contrary, the such as the scale transformation steps (i.e., single step transfor-
larger STR represents the less significant changes of case parti- mation versus multiple steps transformation), the scale transfor-
tions during once scale transformation. mation strategies (i.e., optimistic scale transformation strategy
versus pessimistic scale transformation strategy), and the scale
transformation directions (i.e., scale up transformation direction
3.2. Variable-scale clustering method based on the scale transforma-
versus scale down transformation direction).
tion strategy Therefore, in order to better control the scale transformation
process of multi-scale knowledge base, especially the variable-
In Section 3.1, the scale transformation rate (STR) has been scale clustering process, three assumptions are put forward fol-
able to quantitatively evaluate the case partition change of once lowing the traditional VSC method in Section 2.3:
scale transformation process of each attribute, that is attribute ⃝1 During once iteration process of variable-scale clustering,
scale transformation value. The smaller (STR) represents the more only the observation scales that have direct partial order re-
significant scale transformation value; On the contrary, the larger lation with target scale could become the candidates for scale
(STR) represents the less significant scale transformation value. transformation.
212
A. Wang and X. Gao Future Generation Computer Systems 122 (2021) 209–219
⃝2 During once iteration process of variable-scale clustering, Algorithm 1: Variable-Scale Clustering Algorithm Based on the
only one scale transformation strategy could be applied for scale Scale Transformation Strategy (VSC-STS)
transformation.
3 During once iteration process of variable-scale clustering,
Input: Multi-scale knowledge base DS with clear hierarchy
⃝
structure (CS), Number of clusters k, Scale transformation
only one attribute in the whole multi-scale knowledge base could
range [Sstart , Send ]
be applied for scale transformation.
Output: Scale hierarchy (Scale transformation route), Satisfied
Moreover, in order to evaluate the clustering validity of multi-
clusters with scale characteristics.
scale knowledge base DS = (U , AS , d) the traditional granular
Step 1. Start the initial clustering analysis on the basic scale
deviation (GrD) could be expressed as:
∑ ∑ combination of DS and evaluate the satisfaction degree
n rj=1 ni=1 δ (xij , xIj )
of clustering results via GrD (see Eq.(3)).
GrD(XI , Aλ ) = √ (3) Step 2. Identify satisfied clusters and take the largest satisfied
λ 2
∑t
k=1 |Uk | granular deviation value as R0 .
Where XI ⊆ U is any case cluster obtained by once iteration pro- Step 3. Output all satisfied clusters with scale characteristics
cess of variable-scale clustering, and XI = {x1 , x{2 , . . . , xn }; Aλ (λ }= and delete all the objects of satisfied clusters from DS .
1, 2, . . . , r) is the target attribute, and U /Aλ = U1λ , U2λ , . . . , Utλ ; Step 4. Start scale transformation following CS.
xI is the cluster center of XI , and for all xij ∈ XI , if xij = xIj ,
• If OSTS is adopt, scale up one attribute with the
δ (xij , xIj ) = 0; otherwise, δ (xij , xIj ) = 1. minimum STR under the range condition
Fig. 4 shows the variable-scale clustering process under scale
[Sstart , Send ], and update Ḋ.
transformation strategies. Firstly, initial clustering analysis is per-
• If PSTS is adopt, scale up one attribute with the
formed on the single-scale case dataset D1st at the basic hierarchy
maximum STR under the range condition
level, and the satisfaction threshold R0 is determined via the GrD.
[Sstart , Send ], and update Ḋ.
Retain the clusters that satisfies R0 with scale characteristics.
Then, delete all the objects of satisfied clusters from the original Step 5. Start clustering analysis on Ḋ via R0 . If the GrD of all
multi-scale dataset, and select one attribute for scale transforma- clusters exceeds R0 , reset R0 as the minimum granular
tion following one of the scale transformation strategies (OSTS deviation value.
or PSTS), in order to obtain the new single-scale case data set Step 6. Output all satisfied clusters with scale characteristics
D2nd with higher hierarchy and fewer objects. After that, start and delete all the objects of satisfied clusters from Ḋ.
clustering analysis on D2nd via R0 , and retain all the satisfied Step 7. If all objects are partitioned into a satisfied cluster
clusters with scale characteristics. Similarly, only the cases in the (i.e.Ḋ = ∅), output the scale transformation route;
remaining non-satisfaction clusters are continued to carry out the Otherwise, go to Step 4.
next scale transformation process. Finally, the VSC-STS stops until
all objects have been partitioned into a satisfied cluster.
According to the optimistic and pessimistic scale transforma-
tion strategy, combined with the scale up transformation mech-
For instance, let D1st = (U , A10 , A20 , d) represents the initial
{ }
anism, a variable scale clustering algorithm based on the scale
transformation strategy (VSC-STS) is proposed. The algorithm single-scale case dataset of the multi-scale knowledge base DS
steps is shown in Algorithm 1. in Table 2. According to the first step of {the VSC-STS, the scale
The time complexity of VSC-STS is O(t ϕ ), where ϕ = min(m, up transformation space is STS(D1st ) = A11 , A21 . If the PSTS is
}
nr ), r is the number of attributes, n is the maximum number of adopted, A2 becomes the target attribute for the second itera-
scale levels in one attribute, m is the number of cases, and t is tion due to STR(A10 , A11 ) < STR(A20 , A21 ), and the single-scale}case
dataset for the second iteration process is D2nd = (U , A10 , A21 , d).
{
the time consuming of the meta clustering process.
213
A. Wang and X. Gao Future Generation Computer Systems 122 (2021) 209–219
Table 3
Investigation content for digital forensics in the bidding context.
# Laws and regulations Number of articles
1 Law of the People’s Republic of China on Tenders and Bids 67
2 Regulations for Implementations on the Law of the People’s Republic of China on Tenders and Bids 82
3 Government Procurement Law of the People’s Republic of China 86
4 Regulations for Implementations on the Government Procurement Law of the People’s Republic of China 78
5 The Administrative Rules on Bidding Invitation and Procurement of the Shenhua Group (the 2016 version) 137
6 Measures for Bidding Invitation and Procurement of Goods Needed in Construction Projects 64
7 Measures for Tenders and Bids for Investigation and Design of Engineering Construction Projects 59
8 Measures for Tenders and Bids for Construction of Engineering Construction Projects 91
9 Handling Measures for Dealing with Complaints in Bidding Invitation and Bidding in Construction Projects 30
10 Provisional Rules on Bidding Evaluation Committee and on Bidding Evaluation Methods 61
11 Provisional Measures for Administration of Bidding Evaluation Experts and Bidding Evaluation Tank of Experts 18
12 Provisional Rules for Announcing Bidding Results 21
13 Measures for Invitation for Online Bidding 66
14 The Administrative Rules on Bidding Invitation and Procurement of the Shenhua Group (the 2017 version) 109
Table 4
Summary of variables.
Attribute Scale Description
A10 Supply risk rating during the latest three months observation scale, that is [2017-09-01,2017-12-31]
Supply A11 Supply risk rating during the latest six months observation scale, that is [2017-06-01,2017-12-31]
A12 Supply risk rating during the latest one year observation scale, that is [2017-01-01,2017-12-31]
A20 Product value rating during the latest three months observation scale, that is [2017-09-01,2017-12-31]
Product A21 Product value rating during the latest six months observation scale, that is [2017-06-01,2017-12-31]
A22 Product value rating during the latest one year observation scale, that is [2017-01-01,2017-12-31]
A30 Demand risk rating during the latest three months observation scale, that is [2017-09-01,2017-12-31]
Demand A31 Demand risk rating during the latest six months observation scale, that is [2017-06-01,2017-12-31]
A32 Demand risk rating during the latest one year observation scale, that is [2017-01-01,2017-12-31]
A40 Location of the project under city observation scale
Project
A41 Location of the project under province observation scale
4. Experimental analysis decreasing during every scale transformation process, the obser-
vation scale of the final variable-scale clustering result is incon-
4.1. Experiment design sistent with other comparison algorithms. Thus, only the exter-
nal validation evaluation approach is adopted for comparative
A real 15.9 GB bidding knowledge base from the Shenhua experiments.
Group is utilized for the experiments, which includes both text (2) Numerical experiment on the performance of the VSCBR
bidding documents and image technical drawings. Table 3 shows The experiment objective of practical value verification is to
the 969 articles from fourteen laws and regulations that need test whether the VSCBR is able to predict suspicious evidential ar-
to be investigated. And all the 85 evidence location of bidding eas for digital forensics investigators. Since the VSC-STS is utilized
cases are encoded as four digits (see Fig. 6). It can be seen that to support key steps of digital triage in the VSCBR (see Algorithm
compared to the multi-scale attribute set, the solution set (evi- 2) , the practical value of the VSC-STS is also be verified through
dence location) of bidding knowledge base also has a hierarchy this numerical experiment.
structure.
4.2. Data preparation
Three numerical experiments are designed to test the effec-
tiveness of the VSC-STS and VSCBR theoretically and experimen-
After field investigations of eight bidding managers from the
tally.
Shenhua Group, multi-scale attributes (dimensions) of the bid-
(1) Numerical experiments on the performance of the VSC-STS
ding knowledge base is established (see Table 4) jointly with
The experiment objective of theoretical value verification is
managers and data analysts. According to managers’ experience,
to test whether the VSC-STS is able to improve the accuracy of
there are totally four dimensions that greatly affects the forensic
traditional clustering algorithm under various evaluation metrics. investigation in the practical bidding process, i.e., the supply risk
⃝1 Validation analysis of the clustering results
dimension, product value dimension, demand risk dimension and
According to the algorithm steps of VSC-STS (see Algorithm 1), project value dimension.
select k-modes as meta cluster algorithm. Moreover, we repeat Moreover, since the bidding managers make regular report of
numerical experiment fifty times due to the random feature of their responsible districts quarterly, three temporary observation
k-modes, and compare the variable scale clustering results with scales (i.e., the latest three months, the latest six months and the
the entire scale space clustering results. latest one year) and two spatial observation scales (i.e., the city
⃝2 Sensitive analysis of the parameter threshold level and province level) are determined for the bidding knowl-
According to the algorithm steps of VSC-STS, set the scale edge base. Therefore, all bidding raw data relevant to the first
transformation range as [3,6] and variation step as 0.1. Similarly, three dimensions is extracted, transformed, loaded respectively
we repeat numerical experiment fifty times due to the random under three different temporal scales to obtain the first nine
feature of k-modes, and compare the average clustering accuracy variables. All bidding raw data relevant to the fourth dimension
under different initial parameter value. is extracted, transformed, loaded respectively under two different
There are two widely utilized evaluation approaches of clus- spatial scales to obtain the last two variables.
tering algorithms, that is internal validation evaluation and ex- In order to clearly show the bidding experimental data, a sam-
ternal effective evaluation. Since the case set (domain) keeps ple of the bidding knowledge base with twenty five cases, four
215
A. Wang and X. Gao Future Generation Computer Systems 122 (2021) 209–219
observation attributes (A1 , A2 , A3 , A4 ) and one solution attribute ⃝1 During the fifty experiments, the average performance of
(d) is provided (see Table 5). Taking the lowest observation scales the VSC-STS method in all evaluation indexes are better than the
as the basic scale combination (A10 , A20 , A30 ), it can be seen that the average effectiveness results obtained by the traditional single-
entire scale space could generate 54 different single-scale case scale clustering method k-modes at the basic scale level. And
dataset (scale combinations). the accuracy improvement rate exceeds 10%, which verifies that
Moreover, experiments utilize the classic external evaluation the VSC-STS could meet the validity requirement of traditional
metric (such as Fmeasure, NMI and RI) to analyze the validation single-scale clustering algorithms.
of VSC-STS in the first two experiments. All the experiments are ⃝2 During the fifty experiments, the deviation between the
performed in OS X (10.14.4) environment on a machine with 8GB average performance of the VSC-STS method and the average
RAM and 2.9 GHz Intel Core i5 CPU. All methods are coded in effectiveness results obtained by the traditional single-scale clus-
Matlab R2017a. tering analysis method k-modes under the optimal scale level is
less than 5%. That proves that the VSC-STS could improve the
4.3. Experiment results and discussion
efficiency of traditional single-scale clustering algorithms.
(2) Numerical experiment on the parameter sensitivity of the
In this section, the VSC-STS results of the first two experiments
VSC-STS
and the VSCBR (combined with the VSC-STS) results of the third
experiment are discussed respectively. Fig. 8 shows the sensitivity experiment results of the initial
(1) Numerical experiment on the validation of the VSC-STS algorithm parameter of the VSC-STS. The broken line represents
Fig. 7 further shows the validation experiment results of the the standardized average clustering validation evaluated by ex-
VSC-STS. The black broken line represents evaluation results of ternal validation metrics (i.e., the Fmeasure, NMI and RI), within
each external validation metrics, i.e., the Fmeasure, NMI and the scale transformation range of initial algorithm parameter.
RI [36,37]. The blue dashed line represents the average clustering It can be seen that all evaluation indexes stay at a relatively
accuracy of the k-modes under the basic scale combination; while stable level with the increase of algorithm parameters. Further
the red dashed line represents the average clustering accuracy discussion is as follows.
of the k-modes under the optimal (best) scale combination (see ⃝1 Although the validation evaluation results of three indexes
Table 6). fluctuates by changing the initial satisfaction (degree) threshold,
It is found that the VSC-STS could completely meet the re- the maximum fluctuation range is less than 1%, which indicates
quirements of traditional single-scale clustering methods on the that the validation of the VSC-STS is insensitive to the initial
result validation. Further discussion is as follows. algorithm parameter.
216
A. Wang and X. Gao Future Generation Computer Systems 122 (2021) 209–219
Table 5
Example: Bidding knowledge base.
U A10 A11 A12 A20 A21 A22 A30 A31 A32 A40 A41 d
Case1 1 E Y 1 L N 1 T Y 1 Y 1101
Case2 1 E Y 1 L N 1 T Y 1 Y 1101
Case3 1 E Y 2 L N 2 U Y 1 Y 1101
Case4 2 E Y 2 L N 3 R O 1 Y 1101
Case5 2 E Y 2 L N 2 U Y 2 N 1201
Case6 2 E Y 3 L N 3 R O 1 Y 2002
Case7 3 G Y 3 L N 3 R O 2 N 2002
Case8 3 G Y 4 L N 2 U Y 2 N 1201
Case9 3 G Y 4 L N 3 R O 1 Y 2002
Case10 3 G Y 5 I N 2 U Y 1 Y 3001
Case11 4 G Y 5 I N 3 R O 2 N 2002
Fig. 8. The experiment results of algorithm parameter sensitivity.
Case12 4 G Y 5 I N 4 S O 1 Y 3001
Case13 4 G Y 5 I N 1 T Y 1 Y 3001
Case14 5 F N 5 I N 4 S O 2 N 5001
Case15 5 F N 6 I N 2 U Y 2 N 5001 the accurate evidence area automatically, which is able to reduce
Case16 5 F N 6 I N 3 R O 1 Y 5001
Case17 5 F N 6 I N 4 S O 1 Y 5001
the digital triage workload of forensic investigators. It can be
Case18 6 B N 6 I N 1 T Y 2 N 5001 seen that time-consuming curve of both two algorithms presents
Case19 6 B N 6 I N 5 Q N 1 Y 3001 an overall stable but local fluctuations tendency, and decreases
Case20 6 B N 7 M Y 5 Q N 2 N 4001 obviously among the initial stage. That is mainly because the
Case21 7 B N 6 I N 4 S O 3 N 4001 heavy initial case clustering work of the bidding knowledge base,
Case22 7 B N 7 M Y 5 Q N 3 N 4001 which has been improved rapidly proven by the sharp decrease of
Case23 7 B N 8 M Y 5 Q N 4 N 4001
Case24 7 B N 8 M Y 6 Q N 4 N 4001
calculation time. The experimental results demonstrate that the
Case25 7 B N 8 M Y 6 Q N 4 N 4001 historical experience contributes to the solution of new problems
(cases) significantly. In addition, the time consuming of the VSCBR
* Valuables A10 , A11 and A12 are encoded as [1-11], [A-G] and [Y,N] respectively is always lower than the traditional CBR at any cases, which
according to the level of supply risk under each temporal scale; Valuables A20 , A21
indicates that the proposed VSCBR has high efficiency in practical
and A22 are encoded as [1-9], [I-M] and [Y,N] respectively according to the level
forensic tasks.
of product value under each temporal scale; Valuables A30 , A31 and A32 are encoded
Fig. 9(b) shows the comparison results of the traditional CBR
as [1-7], [Q-U] and [Y,O,N] respectively according to the level of demand risk
under each temporal scale; Valuables A40 and A41 are encoded as [1-5] and [Y,N] and the proposed VSCBR method in the case revise phase. Ten
respectively according to the level of project value under each spatial scale. new cases are utilized to test whether the VSCBR could locate
all the suspicious evidential areas automatically. It can be seen
that time-consuming curve of both two algorithms also presents
an overall stable but local fluctuations tendency, and decreases
⃝2 The overall tendency of three validation evaluation results
obviously among the initial stage. And the time consuming of
increase slightly with the increase of the initial satisfaction pa-
the VSCBR is always lower than the traditional CBR at any cases,
rameter, which indicates that the over strict initial satisfaction
which further indicates that the proposed VSCBR has high effi-
constraint is not conducive to the VSC-STS obtaining the optimal ciency in practical forensic tasks.
solution.
(3) Numerical experiment on the performance of the VSCBR 5. Conclusions
Fig. 9(a) shows the comparison results of the traditional CBR
and the proposed VSCBR method in the case reuse phase. Fifteen As for the increasingly criminal usage of information technol-
new cases are utilized to test whether the VSCBR could locate ogy, the digital forensic analysis (like the digital triage methods)
217
A. Wang and X. Gao Future Generation Computer Systems 122 (2021) 209–219
Table 6
Evaluation results of the VSC-STS.
Objectives Initial observation scale combination (A10 A20 A30 A40 ) Optimal observation scale combination (A11 A21 A31 A40 )
Average Maximum Minimum Standard deviation Average Maximum Minimum Standard deviation
Fmeasure 0.6269 0.7863 0.4777 0.0692 0.6940 0.8436 0.5543 0.0615
NMI 0.5791 0.7610 0.3830 0.0742 0.6548 0.8537 0.4786 0.0669
RI 0.8079 0.8900 0.7300 0.0369 0.8409 0.9167 0.7733 0.0288
Fig. 9. The comparison results of the traditional CBR and the proposed VSCBR.
218
A. Wang and X. Gao Future Generation Computer Systems 122 (2021) 209–219
[19] M.L. Reid, S.M. Emery, Scale-dependent effects of gypsophila paniculata in- [34] A. Wang, X. Gao, Intelligent computing: Knowledge acquisition method
vasion and management on plant and soil nematode community diversity based on the management scale transformation, Comput. J. 64 (3) (2021)
and heterogeneity, Biol. Cons. 224 (2018) 153–161. 314–324, http://dx.doi.org/10.1093/comjnl/bxaa077.
[20] C.D.D. Sohoulande, K. Stone, V.P. Singh, Quantifying the probabilistic [35] A. Wang, X. Gao, M. Tang, Computer supported data-driven decisions for
divergences related to time-space scales for inferences in water resource service personalization: A variable-scale clustering method, Stud. Inform.
management, Agricult. Water Manag. 217 (2019) 282–291. Control 29 (1) (2020) 55–65.
[21] H. Jiawei, K. Micheline, Data mining: Concepts and techniques, Data Mining [36] S. Liang, D. Han, Y. Yang, Cluster validity index for irregular clustering
Concepts Models Methods & Algorithms, vol. 5(4), second ed, 2006, pp. results, Appl. Soft Comput. 95 (2020) 106583.
1–18. [37] J.M. Luna-Romera, M. Martínez-Ballesteros, J. García-Gutiérrez, J.C.
[22] N. Takashina, M.L. Marissa, L. Baskett, Exploring the effect of the spatial Riquelme, External clustering validity index based on chi-squared
scale of fishery management, J. Theoret. Biol. 390 (2016) 14–22. statistical test, Inform. Sci. 487 (2019) 1–17.
[23] G. Mariscal, O. Marban, C. Fernandez, A survey of data mining and [38] L. Longo, R. Goebel, F. Lecue, P. Kieseberg, Explainable Artificial Intelli-
knowledge discovery process models and methodologies, Knowl. Eng. Rev. gence: Concepts, Applications, Research Challenges and Visions, Machine
25 (2) (2010) 137–166. Learning and Knowledge Extraction, in: Lecture Notes, LNCS 12279, 2020,
[24] S. Wu, X. Gao, M. Bastien, Data Warehousing and Data Mining, http://dx.doi.org/10.1007/978-3-030-57321-8-1.
Metallurgical Industry Press, 2003.
[25] T. Gocken, M. Yaktubay, Comparison of different clustering algorithms via
genetic algorithm for vrptw, Int. J. Simul. Modell. 18 (4) (2019) 574–585.
[26] R.C. Hrosik, E. Tuba, E. Dolicanin, R. Jovanovic, M. Tuba, Brain image seg- Ai Wang received her Ph.D degree in 2021 in Man-
mentation based on firefly algorithm combined with k-means clustering, agement Science and Engineering and she is currently
Stud. Inform. Control 28 (2) (2019) 167–176. doing her postdoctoral research in the School of Hu-
[27] L. Wang, Z. Hao, X. Han, R. Zhou, Gravity theory-based affinity propaga- manities and Social Science, University of Science and
tion clustering algorithm and its applications, Teh. Vjesn. 25 (4) (2018) Technology Beijing, China. She has published papers
1125–1135. in respected journals like IEEE Access, Studies in In-
[28] Z.Z. Wang, Y. Xiong, R. Wang, C.H. Zhong, Numerical investigation of the formatics and Control. Her research interests focus on
scale effect of hydrodynamic performance of the hybrid crp pod propulsion data mining and decision making, as well as emergency
system, Appl. Ocean Res. 54 (2016) 26–38. management.
[29] G. Tavakoli Mehrjardi, R. Behrad, S.N. Moghaddas Tafreshi, Scale effect on
the behavior of geocell-reinforced soil, Geotext. Geomembr. 47 (2) (2019)
154–163.
[30] G. Feng, D. Ming, M. Wang, J. Yang, Connotations of pixel-based scale Xuedong Gao received his Bachelor degree from
effect in remote sensing and the modified fractal-based analysis method, Nankai University, China in 1983, and the Ph.D. de-
Comput. Geosci. 103 (C) (2017) 183–190. gree from Belarusian State University in 1993. He is
[31] X. Gao, A. Wang, Variable-scale clustering, in: Proceeding of 8th In- currently the professor in the Department of Manage-
ternational Conference on Logistics, Informatics and Service Sciences, ment Science and Engineering, School of Economics and
2018. Management at University of Science and Technology
[32] A. Wang, X. Gao, Hybrid variable-scale clustering method for social media Beijing, China. His research interests include man-
marketing on user generated instant music video, Teh. Vjesn. 26 (3) (2019) agement process optimization, data mining, decision
771–777. making.
[33] A. Wang, X. Gao, M. Yang, Variable-scale clustering based on the numerical
concept space, in: Proceeding of 9th International Conference on Logistics,
Informatics and Service Sciences, 2019.
219