net/publication/331245903
All content following this page was uploaded by Abdullah Nahid on 02 March 2019.
Abstract— Epileptic seizure is a neurological disorder characterized by abnormal synchronous discharge of neuronal activity in the brain structures. These abnormal electrical activities can be recorded as multi-channel electroencephalography (EEG) signals via electrodes placed on the scalp. Usually, these recordings are interpreted by a neurologist, which requires the neurologist's availability and is very time-consuming, especially for long-duration signals. This study presents a fully automatic system for distinguishing seizure from non-seizure signals. First, it pre-processes the raw EEG signals to remove noise and artefacts and then extracts features from the time, spectral and wavelet domains. In addition, connectivity- and entropy-based features are extracted. Prominent features are then selected from this large feature set by a multi-objective evolutionary algorithm, and finally a Support Vector Machine (SVM) classifier is used for classification. A Bayesian optimization algorithm is used to optimize the hyper-parameters of the SVM. Linear Discriminant Analysis (LDA) and Quadratic Linear Discriminant Analysis (QLDA) are also used for comparison. The proposed system is tested on the publicly available CHB-MIT database, and the results show the significance of the proposed system. The classification accuracy is 76.41%, 80.79% and 97.05% for LDA, QLDA and SVM, respectively.

Keywords— Bayesian optimization, Classifier, Electroencephalogram (EEG), Feature selection, LDA, QLDA, SVM, Seizure.

I. INTRODUCTION

Epilepsy is a neurological disorder of the brain and is often observed in the ictal and inter-ictal periods. It is a common neurological disorder that affects people of all ages [1, 2]. It is one of the most common neural disorders, affecting about 1% of the world population, and about 0.2% of the affected individuals lose their lives. A report reveals that approximately 50 million people worldwide have epilepsy and approximately 1.38 million people are suffering from epilepsy in our country. About 1 in every 200 newborn babies faces some form of seizure. Different diagnostic tools are available, such as magnetic resonance imaging (MRI), computed tomography (CT), positron emission tomography (PET), ultrasound and electroencephalogram (EEG). MRI, CT and ultrasound are costly and cannot be used for long-term evaluation. On the other hand, EEG is a low-cost, non-invasive tool that can be used for long-term evaluation. Therefore, EEG is the most useful tool for the diagnosis of epilepsy. In this test, electrodes are attached to the scalp with adhesive gel. If a person has epilepsy, his/her EEG pattern will differ from the normal pattern. A neurologist reads the EEG recording and makes a decision based on the pattern. Therefore, epileptic seizure is connected with the primary parameter provided by the EEG signal [1]. Epileptic activity associated with the actual detector can be integrated into a cellular feedback program for daily utilization [2-4]. Among its consequences, epileptic seizure can shorten lifespan and cause social and neuropsychological disability and excessive bodily injury. The detection performance of existing algorithms may be affected by such differences. So, EEG is more conventional than other diagnostic systems [5].

In the literature, different features have been extracted using the discrete wavelet transform (DWT) and artificial neural networks (ANN). For example, standard deviation, mean of absolute value, average power and ratio of absolute mean values of the DWT were applied in [6], relative energy was applied in [7], line length of the DWT was studied in [8] and energy of the DWT was used in [9]. On the other hand, SVM and DWT features were used in [10], [11]. Sample entropy (SampEn) and distribution entropy (DistEn) with SVM were applied in [12]. As can be seen, different studies use different types of features. Therefore, it is important to explore and extract a large set of features and to use a feature selection algorithm in order to obtain a prominent feature subset. Motivated by this, we have extracted different features from different domains. The major contributions and key topics covered by this study are as follows:
• We have collected a publicly available bipolar EEG database and selected channels to reduce computational complexity (Section II.A).
• A prominent feature subset has been selected using a multi-objective feature selection algorithm that
QLDA is similar to LDA except for the covariance matrix: a separate covariance matrix $\Sigma_k$ is estimated for each class $k = 1, 2, \ldots, K$. The quadratic discriminant function is:

\[ \delta_k(x) = -\frac{1}{2}\log\left|\Sigma_k\right| - \frac{1}{2}(x - \mu_k)^T \Sigma_k^{-1} (x - \mu_k) + \log \pi_k \tag{6} \]

C. Support Vector Machines (SVM)

SVM tries to build an optimal hyperplane from a given training set. This optimal model is then used for the classification of new examples. Given a training set of instance-label pairs $(x_i, y_i)$, $i = 1, 2, \ldots, l$, where $x_i \in R^n$ and $y \in \{1, -1\}^l$, the support vector machine (SVM) [21] requires the solution of the following optimization problem:

\[ \min_{w, b, \varepsilon} \; \frac{1}{2} w^T w + C \sum_{i=1}^{l} \varepsilon_i \tag{7} \]

\[ \text{subject to } y_i \left( w^T \phi(x_i) + b \right) \ge 1 - \varepsilon_i, \quad \varepsilon_i \ge 0 \]

Precision or positive predictive value ($PPV$)

\[ PPV = \frac{TP}{TP + FP} \tag{11} \]

Accuracy ($ACC$)

\[ ACC = \frac{TP + TN}{TP + FP + FN + TN} \tag{12} \]

Matthews correlation coefficient ($MCC$)

\[ MCC = \frac{TP \times TN - FP \times FN}{\sqrt{(TP + FP)(TP + FN)(TN + FP)(TN + FN)}} \tag{13} \]

$F1$ score is the harmonic mean of precision and sensitivity

\[ F1 = \frac{2TP}{2TP + FP + FN} \tag{14} \]

Here, $TP$, $FP$, $TN$ and $FN$ represent true positive, false positive, true negative and false negative, respectively.

IV. RESULTS

In this study, EEG-based seizure prediction results are presented. To this end, we collected data from the CHB-MIT database. The raw EEG data were pre-processed to remove major artefacts.
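As a minimal sketch, the metrics in (11)-(14) can be computed directly from the four confusion-matrix counts. The function name `confusion_metrics` and the example counts are illustrative assumptions and are not taken from the experiments reported here.

```python
import math


def confusion_metrics(tp, fp, tn, fn):
    """Return PPV, ACC, MCC and F1 following equations (11)-(14)."""
    ppv = tp / (tp + fp)
    acc = (tp + tn) / (tp + fp + fn + tn)
    mcc_den = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Returning 0 when the denominator vanishes is a common convention.
    mcc = (tp * tn - fp * fn) / mcc_den if mcc_den else 0.0
    f1 = 2 * tp / (2 * tp + fp + fn)
    return {"PPV": ppv, "ACC": acc, "MCC": mcc, "F1": f1}


# Hypothetical counts, chosen only to illustrate the arithmetic.
metrics = confusion_metrics(tp=90, fp=5, tn=95, fn=10)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

The function assumes at least one positive prediction, since otherwise the precision in (11) is undefined.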
TABLE I. COMPARISON AMONG LDA, QLDA AND SVM

| Classifier | Accuracy | Error | Sensitivity | Specificity | Precision | False Positive | F1_Score | Matthews Correlation Coefficient | AUC | Kappa |
|---|---|---|---|---|---|---|---|---|---|---|
| LDA | 76.41 | 23.59 | 76.09 | 76.73 | 76.61 | 23.27 | 76.35 | 52.82 | 83.05 | 52.82 |
| QLDA | 80.79 | 19.21 | 73.40 | 88.20 | 86.17 | 11.80 | 79.27 | 62.28 | 89.98 | 61.59 |
| SVM | 97.05 | 2.95 | 98.99 | 95.12 | 95.29 | 4.88 | 97.11 | 94.17 | 99.54 | 94.10 |
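As a quick consistency check on Table I, three of its columns can be recomputed from the others: Error = 100 − Accuracy, False Positive = 100 − Specificity, and F1 as the harmonic mean of precision and sensitivity, as in (14). The sketch below assumes that the values in each row are, in order, Accuracy, Error, Sensitivity, Specificity, Precision, False Positive and F1_Score, all in percent. This column order is inferred from the values themselves, and the names `table_i` and `recompute` are illustrative.

```python
# Rows of Table I: (accuracy, error, sensitivity, specificity, precision,
# false positive, F1), all in percent. The column order is an assumption.
table_i = {
    "LDA": (76.41, 23.59, 76.09, 76.73, 76.61, 23.27, 76.35),
    "QLDA": (80.79, 19.21, 73.40, 88.20, 86.17, 11.80, 79.27),
    "SVM": (97.05, 2.95, 98.99, 95.12, 95.29, 4.88, 97.11),
}


def recompute(acc, sens, spec, prec):
    """Derive error, false positive rate and F1 (in percent) from the other columns."""
    error = 100.0 - acc
    false_positive = 100.0 - spec
    f1 = 2.0 * prec * sens / (prec + sens)  # harmonic mean of precision and sensitivity
    return error, false_positive, f1


for name, (acc, err, sens, spec, prec, fpr, f1) in table_i.items():
    err_c, fpr_c, f1_c = recompute(acc, sens, spec, prec)
    print(f"{name}: error {err_c:.2f} (table {err}), "
          f"false positive {fpr_c:.2f} (table {fpr}), "
          f"F1 {f1_c:.2f} (table {f1})")
```

Under these assumptions the recomputed values agree with the tabulated ones to within about 0.01, which is consistent with rounding of the inputs.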
After that, the data were band-pass filtered to retain only the 0.5-30 Hz EEG band, since most of the relevant EEG activity lies within this frequency band. Different time-domain features, frequency-domain features, time-frequency (wavelet) features as well as entropy-based features were extracted, for a total of 145 features. The dataset was imbalanced, i.e., the number of non-seizure samples was greater than the number of seizure samples. The imbalanced dataset was balanced using the ADASYN algorithm. A multi-objective evolutionary feature selection algorithm based on ENORA (Evolutionary Non-dominated Radial slots based Algorithm) was then used [20]. The calling parameters are: Generation=100, Population size=200, Report frequency=10, Seed=1. The selected features using this algorithm are given

whose objective function is the minimum one. In this figure, the lowest point represents the lowest classifier error value. The lowest error corresponds to the highest accuracy for this objective function.
TABLE II.

| Feature domain | Accuracy | Error | Sensitivity | Specificity | Precision | False Positive | F1_Score | MCC | AUC | Kappa |
|---|---|---|---|---|---|---|---|---|---|---|
| Time | 67.14 | 32.86 | 73.74 | 60.54 | 65.18 | 39.46 | 69.19 | 34.58 | 72.23 | 34.28 |
| Frequency | 64.87 | 35.13 | 71.89 | 57.84 | 63.07 | 42.16 | 67.19 | 30.03 | 70.45 | 29.73 |
| Connectivity | 79.76 | 20.24 | 83.64 | 75.89 | 77.62 | 24.11 | 80.52 | 59.71 | 88.22 | 59.53 |
| Entropy | 57.76 | 42.24 | 75.38 | 40.13 | 55.74 | 59.83 | 64.09 | 16.58 | 58.70 | 15.51 |
| Without feature selection | 91.15 | 8.85 | 82.29 | 100 | 100 | 0 | 90.29 | 83.61 | 99.62 | 82.29 |
| SVM with feature selection (Proposed) | 97.05 | 2.95 | 98.99 | 95.12 | 95.29 | 4.88 | 97.11 | 94.17 | 99.54 | 94.10 |
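The 0.5-30 Hz band-pass step described in this section can be illustrated with a small sketch. The filter design used in this study is not specified here, so the example substitutes a simple FFT-mask band-pass written with NumPy. The function name `bandpass_fft`, the 256 Hz sampling rate and the synthetic test signal are all assumptions made for illustration.

```python
import numpy as np


def bandpass_fft(signal, fs, low_hz=0.5, high_hz=30.0):
    """Band-pass a 1-D signal by zeroing FFT bins outside [low_hz, high_hz]."""
    signal = np.asarray(signal, dtype=float)
    spectrum = np.fft.rfft(signal)
    freqs = np.fft.rfftfreq(signal.size, d=1.0 / fs)
    keep = (freqs >= low_hz) & (freqs <= high_hz)
    return np.fft.irfft(spectrum * keep, n=signal.size)


fs = 256                     # assumed sampling rate in Hz
t = np.arange(0, 4, 1 / fs)  # 4 s of synthetic data
in_band = np.sin(2 * np.pi * 10 * t)            # 10 Hz, inside the pass band
out_of_band = 0.5 * np.sin(2 * np.pi * 60 * t)  # 60 Hz, outside the pass band
filtered = bandpass_fft(in_band + out_of_band, fs)

# The 60 Hz component should be removed, leaving the 10 Hz component.
print("max deviation from 10 Hz component:", np.max(np.abs(filtered - in_band)))
```

A hard FFT mask is adequate for a short demonstration on a periodic test signal. On real recordings a designed filter, for example a Butterworth filter applied forward and backward, is usually preferred because it reduces ringing at segment edges.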
REFERENCES
[8] L. Guo, D. Rivero, J. Dorado, J. R. Rabunal, and A. Pazos, "Automatic epileptic seizure detection in EEGs based on line length feature and artificial neural networks," Journal of Neuroscience Methods, vol. 191 (1), pp. 101-109, 2010.
[9] I. Omerhodzic, S. Avdakovic, A. Nuhanovic, and K. Dizdarevic, "Energy distribution of EEG signals: EEG signal wavelet-neural network classifier," arXiv preprint arXiv:1307.7897, 2013.
[10] C. A. Lima, A. L. Coelho, and M. Eisencraft, "Tackling EEG signal classification with least squares support vector machines: A sensitivity analysis study," Computers in Biology and Medicine, vol. 40 (8), pp. 705-714, 2010.
[11] D. Chen, S. Wan, and F. S. Bao, "Epileptic focus localization using EEG based on discrete wavelet transform through full-
[18] M. A. Awal, M. M. Lai, G. Azemi, B. Boashash, and P. B. Colditz, "EEG background features that predict outcome in term neonates with hypoxic ischaemic encephalopathy: A structured review," Clinical Neurophysiology, vol. 127 (1), pp. 285-296, 2016.
[19] H. He, Y. Bai, E. A. Garcia, and S. Li, "ADASYN: Adaptive synthetic sampling approach for imbalanced learning," in IEEE International Joint Conference on Neural Networks (IJCNN), 2008, pp. 1322-1328.
[20] F. Jiménez, G. Sánchez, J. M. García, G. Sciavicco, and L. Miralles, "Multi-objective evolutionary feature selection for online sales forecasting," Neurocomputing, vol. 234, pp. 75-92, 2017.
[21] C. Cortes and V. Vapnik, "Support-vector networks," Machine Learning, vol. 20 (3), pp. 273-297, 1995.